Kun Zhou

103 papers · 2013–2026 · 15 conferences · across top CS/AI conferences

Achievements


🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (16) 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (6) 🐣 Hot Topic Early Bird 🌟 Keyword Trendsetter Combo (3) 🏠 Conference Loyalist (25) 🏆 Grand Slam 🔬 Deep Specialist (14) 🧬 Topic Evolution 🤝 Dynamic Duo (30) 👥 Mega-Team (25) 🗃️ Keyword Collector (474) 💎 Century Club (93) 📈 Trend Setter 🚀 Conference Pioneer ⚡ Prolific Year (5) 🔥 Unstoppable (7) ❓ The Questioner

Conferences

CVPR (25) ACL (18) AAAI (15) EMNLP (11) ICCV (5) IJCAI (5) INTERSPEECH (5) NIPS (5) COLING (3) ECCV (3) ICLR (3) IJCNLP (2) EACL (1) ICML (1) NAACL (1)

Top co-authors

Ji-Rong Wen (30) Wayne Xin Zhao (20) Tianjia Shao (16) Xin Zhao (15) Yin Yang (10) Jiangbo Lu (10) Wenbo Li (8) Hongzhi Wu (8) Jinhao Jiang (7) He Wang (7)

Research topics

Architectures (1)

Keywords

large language model (18) 3d reconstruction (10) image restoration (6) generative adversarial network (5) graph neural network (5) neural network (5) vision-language model (4) unsupervised learning (4) image super-resolution (4) image generation (4) generative model (4) reinforcement learning (4) diffusion model (3) language model (3) novel view synthesis (3) instruction tuning (3) gaussian splatting (3) instance segmentation (3) model compression (3) facial animation (3)

Papers

LR-AdaInSeg: Adaptive Instance Segmentation of Incomplete 3D Scenes Driven by Low-Rank Networks AAAI 2026
ElastoGen: 4D Generative Elastodynamics AAAI 2026
3DTeethSAM: Taming SAM2 for 3D Teeth Segmentation AAAI 2026
Analyzing and Mitigating Object Hallucination: A Training Bias Perspective AAAI 2026
ODUTQA-MDC: A Task for Open-Domain Underspecified Tabular QA with Multi-turn Dialogue-based Clarification ACL 2026
Vision-G1: Towards General Reasoning Vision-Language Models via Reinforcement Learning AAAI 2026
C-World: A Computer Use Agent Environment Creator ACL 2026
Decentralized Arena: Towards Democratic and Scalable Automatic Evaluation of Language Models ACL 2026
Beyond the Last Frame: Process-aware Evaluation for Generative Video Reasoning ACL 2026
Deriving Character Logic from Storyline as Codified Decision Trees ACL 2026
AttentionDrag: Exploiting Latent Correlation Knowledge in Pre-trained Diffusion Models for Image Editing IJCAI 2025
Low-Light Video Enhancement via Spatial-Temporal Consistent Decomposition IJCAI 2025
Multi-band Frequency Reconstruction for Neural Psychoacoustic Coding ICML 2025
Exploring the Design Space of Visual Context Representation in Video MLLMs ICLR 2025
OpenSubstance: A High-quality Measured Dataset of Multi-View and -Lighting Images and Shapes ICCV 2025
YuLan-Mini: Pushing the Limits of Open Data-efficient Language Model ACL 2025
Towards Effective and Efficient Continual Pre-training of Large Language Models ACL 2025
KG-Agent: An Efficient Autonomous Agent Framework for Complex Reasoning over Knowledge Graph ACL 2025
ROVI: A VLM-LLM Re-Captioned Dataset for Open-Vocabulary Instance-Grounded Text-to-Image Generation ICCV 2025
ViFT: Towards Visual Instruction-Free Fine-tuning for Large Vision-Language Models EMNLP 2025
What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Instruction Tuning COLING 2025
Extracting and Combining Abilities For Building Multi-lingual Ability-enhanced Large Language Models EMNLP 2025
Enhancing Chain-of-Thought Reasoning via Neuron Activation Differential Analysis EMNLP 2025
ARM: Appearance Reconstruction Model for Relightable 3D Generation CVPR 2025
Real-time High-fidelity Gaussian Human Avatars with Position-based Interpolation of Spatially Distributed MLPs CVPR 2025
High-fidelity 3D Object Generation from Single Image with RGBN-Volume Gaussian Reconstruction Model CVPR 2025
TSP-Mamba: The Travelling Salesman Problem Meets Mamba for Image Super-resolution and Beyond CVPR 2025
EnliveningGS: Active Locomotion of 3DGS CVPR 2025
RGBAvatar: Reduced Gaussian Blendshapes for Online Modeling of Head Avatars CVPR 2025
FlexUOD: The Answer to Real-world Unsupervised Image Outlier Detection CVPR 2025
Gaussian Splashing: Unified Particles for Versatile Motion Synthesis and Rendering CVPR 2025
Enhancing Identity-Deformation Disentanglement in StyleGAN for One-Shot Face Video Re-Enactment AAAI 2025
GenesisTex2: Stable, Consistent and High-Quality Text-to-Texture Generation AAAI 2025
RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate Sector AAAI 2025
DATA-CUBE: Data Curriculum for Instruction-based Sentence Representation Learning ACL 2024
JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models NIPS 2024
UPS: Unified Projection Sharing for Lightweight Single-Image Super-resolution and Beyond NIPS 2024
Parrot: Enhancing Multi-Turn Instruction Following for Large Language Models ACL 2024
Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs ACL 2024
LLMBox: A Comprehensive Library for Large Language Models ACL 2024
Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint ACL 2024
MonoHair: High-Fidelity Hair Modeling from a Monocular Video CVPR 2024
Real-time Acquisition and Reconstruction of Dynamic Volumes with Neural Structured Illumination CVPR 2024
Text-Guided 3D Face Synthesis - From Generation to Editing CVPR 2024
Diffusion-NAT: Self-Prompting Discrete Diffusion for Non-Autoregressive Text Generation EACL 2024
Unveiling Advanced Frequency Disentanglement Paradigm for Low-Light Image Enhancement ECCV 2024
Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking Multimodal Large Language Models ECCV 2024
KeypointDETR: An End-to-End 3D Keypoint Detector ECCV 2024
Not Everything is All You Need: Toward Low-Redundant Optimization for Large Language Model Alignment EMNLP 2024
Image Inpainting via Iteratively Decoupled Probabilistic Modeling ICLR 2024
Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis INTERSPEECH 2024
Evaluating Object Hallucination in Large Vision-Language Models EMNLP 2023
ReasoningLM: Enabling Structural Subgraph Reasoning in Pre-trained Language Models for Question Answering over Knowledge Graph EMNLP 2023
StructGPT: A General Framework for Large Language Model to Reason over Structured Data EMNLP 2023
A Unified Spatial-Angular Structured Light for Single-View Acquisition of Shape and Reflectance CVPR 2023
Diffusion Models for Non-autoregressive Text Generation: A Survey IJCAI 2023
NeRFLix: High-Quality Neural View Synthesis by Learning a Degradation-Driven Inter-Viewpoint MiXer CVPR 2023
Small Pre-trained Language Models Can be Fine-tuned as Large Models via Over-Parameterization ACL 2023
Visually-augmented pretrained language models for NLP tasks without images ACL 2023
Evaluating and Improving Tool-Augmented Computation-Intensive Math Reasoning NIPS 2023
UniKGQA: Unified Retrieval and Reasoning for Solving Multi-hop Question Answering Over Knowledge Graph ICLR 2023
ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models EMNLP 2023
Exploring Motion Ambiguity and Alignment for High-Quality Video Frame Interpolation CVPR 2023
Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion INTERSPEECH 2022
Best-Buddy GANs for Highly Detailed Image Super-resolution AAAI 2022
SimANS: Simple Ambiguous Negatives Sampling for Dense Text Retrieval EMNLP 2022
Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields IJCAI 2022
Debiased Contrastive Learning of Unsupervised Sentence Representations ACL 2022
Continual Pre-training of Language Models for Math Problem Understanding with Syntax-Aware Memory Network ACL 2022
Pre-Trained Model Reusability Evaluation for Small-Data Transfer Learning NIPS 2022
Great Truths are Always Simple: A Rather Simple Knowledge Encoder for Enhancing the Commonsense Reasoning Capacity of Pre-Trained Models NAACL 2022
NeuralHDHair: Automatic High-Fidelity Hair Modeling From a Single Image Using Implicit Neural Representations CVPR 2022
HoD-Net: High-Order Differentiable Deep Neural Networks and Applications AAAI 2022
Pose Guided Image Generation from Misaligned Sources via Residual Flow Based Correction AAAI 2022
Revisiting Temporal Alignment for Video Restoration CVPR 2022
MAT: Mask-Aware Transformer for Large Hole Image Inpainting CVPR 2022
Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-Stage Sequence-to-Sequence Training INTERSPEECH 2021
BASAR: Black-Box Attack on Skeletal Action Recognition CVPR 2021
One-shot Face Reenactment Using Appearance Adaptive Normalization AAAI 2021
Neural Sentence Ordering Based on Constraint Graphs AAAI 2021
EmbedMask: Embedding Coupling for Instance Segmentation IJCAI 2021
Understanding the Robustness of Skeleton-Based Action Recognition Under Adversarial Attack CVPR 2021
Learning Efficient Photometric Feature Transform for Multi-View Stereo ICCV 2021
Unsupervised Image Generation With Infinite Generative Adversarial Networks ICCV 2021
In-game Residential Home Planning via Visual Context-aware Global Relation Learning AAAI 2021
Virtual Data Augmentation: A Robust and General Framework for Fine-tuning Pre-trained Models EMNLP 2021
CRSLab: An Open-Source Toolkit for Building Conversational Recommender System IJCNLP 2021
CRSLab: An Open-Source Toolkit for Building Conversational Recommender System ACL 2021
Structure-aware Person Image Generation with Pose Decomposition and Semantic Correlation AAAI 2021
LAPAR: Linearly-Assembled Pixel-Adaptive Regression Network for Single Image Super-resolution and Beyond NIPS 2020
Converting Anyone’s Emotion: Towards Speaker-Independent Emotional Voice Conversion INTERSPEECH 2020
Towards High-Fidelity 3D Face Reconstruction From In-the-Wild Images Using Graph Convolutional Networks CVPR 2020
Towards Topic-Guided Conversational Recommender System COLING 2020
Learn with Noisy Data via Unsupervised Loss Correction for Weakly Supervised Reading Comprehension COLING 2020
Unsupervised Context Rewriting for Open Domain Conversation IJCNLP 2019
Unsupervised Context Rewriting for Open Domain Conversation EMNLP 2019
HEMlets Pose: Learning Part-Centric Heatmap Triplets for Accurate 3D Human Pose Estimation ICCV 2019
Large-Scale Speaker Diarization of Radio Broadcast Archives INTERSPEECH 2019
Radiometric Calibration From Faces in Images CVPR 2017
Specular Highlight Removal in Facial Images CVPR 2017
A Geodesic-Preserving Method for Image Warping CVPR 2015
Simulating Makeup Through Physics-Based Manipulation of Intrinsic Image Layers CVPR 2015
Bayesian Depth-from-Defocus with Shading Constraints CVPR 2013