Kun Zhou
103 papers · 2013–2026 · 15 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+17 more ↓ Show less ↑
π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (16) π Interdisciplinary Bridge π Renaissance Researcher (6) π£ Hot Topic Early Bird
π
Interdisciplinary Bridge
π£
Hot Topic Early Bird
πΊοΈ
Taxonomy Completionist
(16)
π
Keyword Trendsetter Combo
(3)
π
Conference Loyalist
(25)
π
Grand Slam
π¬
Deep Specialist
(14)
π§¬
Topic Evolution
π€
Dynamic Duo
(30)
π₯
Mega-Team
(25)
ποΈ
Keyword Collector
(474)
π
Century Club
(93)
π
Trend Setter
π
Conference Pioneer
β‘
Prolific Year
(5)
π₯
Unstoppable
(7)
β
The Questioner
Conferences
CVPR (25)
ACL (18)
AAAI (15)
EMNLP (11)
ICCV (5)
IJCAI (5)
INTERSPEECH (5)
NIPS (5)
COLING (3)
ECCV (3)
ICLR (3)
IJCNLP (2)
EACL (1)
ICML (1)
NAACL (1)
Top co-authors
Research topics
Keywords
large language model
(18)
3d reconstruction
(10)
image restoration
(6)
generative adversarial network
(5)
graph neural network
(5)
neural network
(5)
vision-language model
(4)
unsupervised learning
(4)
image super-resolution
(4)
image generation
(4)
generative model
(4)
reinforcement learning
(4)
diffusion model
(3)
language model
(3)
novel view synthesis
(3)
instruction tuning
(3)
gaussian splatting
(3)
instance segmentation
(3)
model compression
(3)
facial animation
(3)
Papers
LR-AdaInSeg:Adaptive Instance Segmentation of Incomplete 3D Scenes Driven by Low-Rank Networks
AAAI 2026
ElastoGen: 4D Generative Elastodynamics
AAAI 2026
3DTeethSAM: Taming SAM2 for 3D Teeth Segmentation
AAAI 2026
Analyzing and Mitigating Object Hallucination: A Training Bias Perspective
AAAI 2026
ODUTQA-MDC: A Task for Open-Domain Underspecified Tabular QA with Multi-turn Dialogue-based Clarification
ACL 2026
Vision-G1: Towards General Reasoning Vision-Language Models via Reinforcement Learning
AAAI 2026
C-World: A Computer Use Agent Environment Creator
ACL 2026
Decentralized Arena: Towards Democratic and Scalable Automatic Evaluation of Language Models
ACL 2026
Beyond the Last Frame: Process-aware Evaluation for Generative Video Reasoning
ACL 2026
Deriving Character Logic from Storyline as Codified Decision Trees
ACL 2026
AttentionDrag: Exploiting Latent Correlation Knowledge in Pre-trained Diffusion Models for Image Editing
IJCAI 2025
Low-Light Video Enhancement via Spatial-Temporal Consistent Decomposition
IJCAI 2025
Multi-band Frequency Reconstruction for Neural Psychoacoustic Coding
ICML 2025
Exploring the Design Space of Visual Context Representation in Video MLLMs
ICLR 2025
OpenSubstance: A High-quality Measured Dataset of Multi-View and -Lighting Images and Shapes
ICCV 2025
YuLan-Mini: Pushing the Limits of Open Data-efficient Language Model
ACL 2025
Towards Effective and Efficient Continual Pre-training of Large Language Models
ACL 2025
KG-Agent: An Efficient Autonomous Agent Framework for Complex Reasoning over Knowledge Graph
ACL 2025
ROVI: A VLM-LLM Re-Captioned Dataset for Open-Vocabulary Instance-Grounded Text-to-Image Generation
ICCV 2025
ViFT: Towards Visual Instruction-Free Fine-tuning for Large Vision-Language Models
EMNLP 2025
What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Instruction Tuning
COLING 2025
Extracting and Combining Abilities For Building Multi-lingual Ability-enhanced Large Language Models
EMNLP 2025
Enhancing Chain-of-Thought Reasoning via Neuron Activation Differential Analysis
EMNLP 2025
ARM: Appearance Reconstruction Model for Relightable 3D Generation
CVPR 2025
Real-time High-fidelity Gaussian Human Avatars with Position-based Interpolation of Spatially Distributed MLPs
CVPR 2025
High-fidelity 3D Object Generation from Single Image with RGBN-Volume Gaussian Reconstruction Model
CVPR 2025
TSP-Mamba: The Travelling Salesman Problem Meets Mamba for Image Super-resolution and Beyond
CVPR 2025
EnliveningGS: Active Locomotion of 3DGS
CVPR 2025
RGBAvatar: Reduced Gaussian Blendshapes for Online Modeling of Head Avatars
CVPR 2025
FlexUOD: The Answer to Real-world Unsupervised Image Outlier Detection
CVPR 2025
Gaussian Splashing: Unified Particles for Versatile Motion Synthesis and Rendering
CVPR 2025
Enhancing Identity-Deformation Disentanglement in StyleGAN for One-Shot Face Video Re-Enactment
AAAI 2025
GenesisTex2: Stable, Consistent and High-Quality Text-to-Texture Generation
AAAI 2025
RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate Sector
AAAI 2025
DATA-CUBE: Data Curriculum for Instruction-based Sentence Representation Learning
ACL 2024
JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models
NIPS 2024
UPS: Unified Projection Sharing for Lightweight Single-Image Super-resolution and Beyond
NIPS 2024
Parrot: Enhancing Multi-Turn Instruction Following for Large Language Models
ACL 2024
Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs
ACL 2024
LLMBox: A Comprehensive Library for Large Language Models
ACL 2024
Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint
ACL 2024
MonoHair: High-Fidelity Hair Modeling from a Monocular Video
CVPR 2024
Real-time Acquisition and Reconstruction of Dynamic Volumes with Neural Structured Illumination
CVPR 2024
Text-Guided 3D Face Synthesis - From Generation to Editing
CVPR 2024
Diffusion-NAT: Self-Prompting Discrete Diffusion for Non-Autoregressive Text Generation
EACL 2024
Unveiling Advanced Frequency Disentanglement Paradigm for Low-Light Image Enhancement
ECCV 2024
Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking Multimodal Large Language Models
ECCV 2024
KeypointDETR: An End-to-End 3D Keypoint Detector
ECCV 2024
Not Everything is All You Need: Toward Low-Redundant Optimization for Large Language Model Alignment
EMNLP 2024
Image Inpainting via Iteratively Decoupled Probabilistic Modeling
ICLR 2024
Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis
INTERSPEECH 2024
Evaluating Object Hallucination in Large Vision-Language Models
EMNLP 2023
ReasoningLM: Enabling Structural Subgraph Reasoning in Pre-trained Language Models for Question Answering over Knowledge Graph
EMNLP 2023
StructGPT: A General Framework for Large Language Model to Reason over Structured Data
EMNLP 2023
A Unified Spatial-Angular Structured Light for Single-View Acquisition of Shape and Reflectance
CVPR 2023
Diffusion Models for Non-autoregressive Text Generation: A Survey
IJCAI 2023
NeRFLix: High-Quality Neural View Synthesis by Learning a Degradation-Driven Inter-Viewpoint MiXer
CVPR 2023
Small Pre-trained Language Models Can be Fine-tuned as Large Models via Over-Parameterization
ACL 2023
Visually-augmented pretrained language models for NLP tasks without images
ACL 2023
Evaluating and Improving Tool-Augmented Computation-Intensive Math Reasoning
NIPS 2023
UniKGQA: Unified Retrieval and Reasoning for Solving Multi-hop Question Answering Over Knowledge Graph
ICLR 2023
ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models
EMNLP 2023
Exploring Motion Ambiguity and Alignment for High-Quality Video Frame Interpolation
CVPR 2023
Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion
INTERSPEECH 2022
Best-Buddy GANs for Highly Detailed Image Super-resolution
AAAI 2022
SimANS: Simple Ambiguous Negatives Sampling for Dense Text Retrieval
EMNLP 2022
Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields
IJCAI 2022
Debiased Contrastive Learning of Unsupervised Sentence Representations
ACL 2022
Continual Pre-training of Language Models for Math Problem Understanding with Syntax-Aware Memory Network
ACL 2022
Pre-Trained Model Reusability Evaluation for Small-Data Transfer Learning
NIPS 2022
Great~Truths~are ~Always ~Simple: A Rather Simple Knowledge Encoder for Enhancing the Commonsense Reasoning Capacity of Pre-Trained Models
NAACL 2022
NeuralHDHair: Automatic High-Fidelity Hair Modeling From a Single Image Using Implicit Neural Representations
CVPR 2022
HoD-Net: High-Order Differentiable Deep Neural Networks and Applications
AAAI 2022
Pose Guided Image Generation from Misaligned Sources via Residual Flow Based Correction
AAAI 2022
Revisiting Temporal Alignment for Video Restoration
CVPR 2022
MAT: Mask-Aware Transformer for Large Hole Image Inpainting
CVPR 2022
Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-Stage Sequence-to-Sequence Training
INTERSPEECH 2021
BASAR:Black-Box Attack on Skeletal Action Recognition
CVPR 2021
One-shot Face Reenactment Using Appearance Adaptive Normalization
AAAI 2021
Neural Sentence Ordering Based on Constraint Graphs
AAAI 2021
EmbedMask: Embedding Coupling for Instance Segmentation
IJCAI 2021
Understanding the Robustness of Skeleton-Based Action Recognition Under Adversarial Attack
CVPR 2021
Learning Efficient Photometric Feature Transform for Multi-View Stereo
ICCV 2021
Unsupervised Image Generation With Infinite Generative Adversarial Networks
ICCV 2021
In-game Residential Home Planning via Visual Context-aware Global Relation Learning
AAAI 2021
Virtual Data Augmentation: A Robust and General Framework for Fine-tuning Pre-trained Models
EMNLP 2021
CRSLab: An Open-Source Toolkit for Building Conversational Recommender System
IJCNLP 2021
CRSLab: An Open-Source Toolkit for Building Conversational Recommender System
ACL 2021
Structure-aware Person Image Generation with Pose Decomposition and Semantic Correlation
AAAI 2021
LAPAR: Linearly-Assembled Pixel-Adaptive Regression Network for Single Image Super-resolution and Beyond
NIPS 2020
Converting Anyoneβs Emotion: Towards Speaker-Independent Emotional Voice Conversion
INTERSPEECH 2020
Towards High-Fidelity 3D Face Reconstruction From In-the-Wild Images Using Graph Convolutional Networks
CVPR 2020
Towards Topic-Guided Conversational Recommender System
COLING 2020
Learn with Noisy Data via Unsupervised Loss Correction for Weakly Supervised Reading Comprehension
COLING 2020
Unsupervised Context Rewriting for Open Domain Conversation
IJCNLP 2019
Unsupervised Context Rewriting for Open Domain Conversation
EMNLP 2019
HEMlets Pose: Learning Part-Centric Heatmap Triplets for Accurate 3D Human Pose Estimation
ICCV 2019
Large-Scale Speaker Diarization of Radio Broadcast Archives
INTERSPEECH 2019
Radiometric Calibration From Faces in Images
CVPR 2017
Specular Highlight Removal in Facial Images
CVPR 2017
A Geodesic-Preserving Method for Image Warping
CVPR 2015
Simulating Makeup Through Physics-Based Manipulation of Intrinsic Image Layers
CVPR 2015
Bayesian Depth-from-Defocus with Shading Constraints
CVPR 2013