Houqiang Li

143 papers · 2014–2026 · 14 conferences · across top CS/AI conferences

Achievements

+16 more ↓

🗺️ Taxonomy Completionist (10) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🌍 Conference Polyglot (14)

🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (10) 🏠 Conference Loyalist (23) 🔬 Deep Specialist (22) 🏆 Grand Slam 🏆 Keyword Champion (6) 🤝 Dynamic Duo (86) 👑 Triple Crown 🗃️ Keyword Collector (582) 💎 Century Club (140) ❓ The Questioner 🚀 Conference Pioneer ⚡ Prolific Year (10) 🔥 Unstoppable (12) 📈 Trend Setter

Conferences

CVPR (41) AAAI (25) ICCV (17) NIPS (12) ICLR (9) ACL (8) ECCV (8) IJCAI (7) ICML (6) EMNLP (5) COLING (2) ACML (1) IJCNLP (1) UAI (1)

Top co-authors

Wengang Zhou (89) Jiajun Deng (14) Jianmin Bao (11) Min Wang (11) Qi Tian (10) Dong Chen (8) Hezhen Hu (8) Bin Li (8) Jie Wang (7) Li Li (7)

Research topics

Core AI (1)

Keywords

representation learning (16) contrastive learning (8) semantic segmentation (7) video understanding (7) transfer learning (7) feature representation (6) sign language recognition (6) knowledge distillation (6) reinforcement learning (5) domain adaptation (5) weakly supervised learning (5) object detection (5) semi-supervised learning (5) zero-shot learning (5) self-supervised learning (5) metric learning (5) multi-agent reinforcement learning (5) image generation (5) model compression (5) unsupervised learning (5)

Papers

DocR1: Evidence Page-Guided GRPO for Multi-Page Document Understanding AAAI 2026 Rethinking Long-tailed Dataset Distillation: A Uni-Level Framework with Unbiased Recovery and Relabeling AAAI 2026 Bias Fitting to Mitigate Length Bias of Reward Model in RLHF ACL 2026 Towards Practical Real-Time Neural Video Compression CVPR 2025 S3R-GS: Streamlining the Pipeline for Large-Scale Street Scene Reconstruction ICCV 2025 Incremental Transformer: Efficient Encoder for Incremented Text Over MRC and Conversation Tasks COLING 2025 Active Perception Meets Rule-Guided RL: A Two-Phase Approach for Precise Object Navigation in Complex Environments ICCV 2025 Controllable Style Arithmetic with Language Models ACL 2025 DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models CVPR 2025 Visual Evidence Prompting Mitigates Hallucinations in Large Vision-Language Models ACL 2025 Self-Classification Enhancement and Correction for Weakly Supervised Object Detection IJCAI 2025 OPTICAL: Leveraging Optimal Transport for Contribution Allocation in Dataset Distillation CVPR 2025 Interpret and Improve In-Context Learning via the Lens of Input-Label Mappings ACL 2025 Robust Multimodal Large Language Models Against Modality Conflict ICML 2025 RaCFormer: Towards High-Quality 3D Object Detection via Query-based Radar-Camera Fusion CVPR 2025 Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters CVPR 2025 EG4D: Explicit Generation of 4D Object without Score Distillation ICLR 2025 SmartEraser: Remove Anything from Images using Masked-Region Guidance CVPR 2025 Uni-Sign: Toward Unified Sign Language Understanding at Scale ICLR 2025 Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models AAAI 2025 TinySAM: Pushing the Envelope for Efficient Segment Anything Model AAAI 2025 Instance-aware Exploration-Verification-Exploitation for Instance ImageGoal Navigation CVPR 2024 TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy NIPS 2024 Learning Label Dependencies for Visual Information Extraction IJCAI 2024 Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning ICML 2024 From Yes-Men to Truth-Tellers: Addressing Sycophancy in Large Language Models with Pinpoint Tuning ICML 2024 Image2Sentence based Asymmetrical Zero-shot Composed Image Retrieval ICLR 2024 Revisiting Open-Set Panoptic Segmentation AAAI 2024 KGDM: A Diffusion Model to Capture Multiple Relation Semantics for Knowledge Graph Embedding AAAI 2024 SUF: Stabilized Unconstrained Fine-Tuning for Offline-to-Online Reinforcement Learning AAAI 2024 Semi-Supervised Spoken Language Glossification ACL 2024 Sinkhorn Distance Minimization for Knowledge Distillation COLING 2024 BoolQuestions: Does Dense Retrieval Understand Boolean Logic in Language? EMNLP 2024 Interpretable Composition Attribution Enhancement for Visio-linguistic Compositional Understanding EMNLP 2024 Long-term Temporal Context Gathering for Neural Video Compression ECCV 2024 Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis ECCV 2024 Generative Latent Coding for Ultra-Low Bitrate Image Compression CVPR 2024 InstructDiffusion: A Generalist Modeling Interface for Vision Tasks CVPR 2024 State Sequences Prediction via Fourier Transform for Representation Learning NIPS 2023 MA2CL:Masked Attentive Contrastive Learning for Multi-Agent Reinforcement Learning IJCAI 2023 Learning robust representation for reinforcement learning with distractions by reward sequence prediction UAI 2023 DIFFER:Decomposing Individual Reward for Fair Experience Replay in Multi-Agent Reinforcement Learning NIPS 2023 AltFreezing for More General Video Face Forgery Detection CVPR 2023 Hierarchical Multi-Agent Skill Discovery NIPS 2023 Stare at What You See: Masked Image Modeling Without Reconstruction CVPR 2023 Human Pose As Compositional Tokens CVPR 2023 Cyclic-Bootstrap Labeling for Weakly Supervised Object Detection ICCV 2023 Masked Motion Predictors are Strong 3D Action Representation Learners ICCV 2023 Focus on Your Target: A Dual Teacher-Student Framework for Domain-Adaptive Semantic Segmentation ICCV 2023 SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning ICCV 2023 DIRE for Diffusion-Generated Image Detection ICCV 2023 Sign Language Translation with Iterative Prototype ICCV 2023 CLIP4HOI: Towards Adapting CLIP for Practical Zero-Shot HOI Detection NIPS 2023 Multi-Agent First Order Constrained Optimization in Policy Space NIPS 2023 $\mathcal{O}$-GNN: incorporating ring priors into molecular modeling ICLR 2023 Low-Light Video Enhancement with Synthetic Event Guidance AAAI 2023 BEST: BERT Pre-training for Sign Language Recognition with Coupling Tokenization AAAI 2023 HandNeRF: Neural Radiance Fields for Animatable Interacting Hands CVPR 2023 Motion Information Propagation for Neural Video Compression CVPR 2023 Asymmetric Feature Fusion for Image Retrieval CVPR 2023 CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Alignment ICLR 2023 A General Rank Preserving Framework for Asymmetric Image Retrieval ICLR 2023 Making Better Decision by Directly Planning in Continuous Control ICLR 2023 NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation ACL 2023 Hybrid and Collaborative Passage Reranking ACL 2023 TAPE: Task-Agnostic Prior Embedding for Image Restoration ECCV 2022 LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning NIPS 2022 Hand-Object Interaction Image Generation NIPS 2022 Learning Token-Based Representation for Image Retrieval AAAI 2022 Learning Robust Policy against Disturbance in Transition Dynamics via State-Conservative Policy Optimization AAAI 2022 Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic AAAI 2022 Uformer: A General U-Shaped Transformer for Image Restoration CVPR 2022 Contextual Similarity Distillation for Asymmetric Image Retrieval CVPR 2022 Large-Scale Pre-Training for Person Re-Identification With Noisy Labels CVPR 2022 Domain-Agnostic Prior for Transfer Semantic Segmentation CVPR 2022 CMD: Self-Supervised 3D Action Representation Learning with Cross-Modal Mutual Distillation ECCV 2022 CMT: Context-Matching-Guided Transformer for 3D Tracking in Point Clouds ECCV 2022 MVP: Multimodality-Guided Visual Pre-training ECCV 2022 Geometric Representation Learning for Document Image Rectification ECCV 2022 Neural-based Mixture Probabilistic Query Embedding for Answering FOL queries on Knowledge Graphs EMNLP 2022 Supervised Off-Policy Ranking ICML 2022 Equivalence Analysis between Counterfactual Regret Minimization and Online Mirror Descent ICML 2022 Improving Sign Language Translation With Monolingual Data by Sign Back-Translation CVPR 2021 ProphetNet-X: Large-Scale Pre-training Models for English, Chinese, Multi-lingual, Dialog, and Code Generation IJCNLP 2021 Fine-grained Semantic Alignment Network for Weakly Supervised Temporal Language Grounding EMNLP 2021 Discovering Representation Sprachbund For Multilingual Pre-Training EMNLP 2021 Probing Inter-modality: Visual Parsing with Self-Attention for Vision-and-Language Pre-training NIPS 2021 Joint Inductive and Transductive Learning for Video Object Segmentation ICCV 2021 SignBERT: Pre-Training of Hand-Model-Aware Representation for Sign Language Recognition ICCV 2021 Conditional DETR for Fast Training Convergence ICCV 2021 Instance-Wise Hard Negative Example Generation for Contrastive Learning in Unpaired Image-to-Image Translation ICCV 2021 3D Local Convolutional Neural Networks for Gait Recognition ICCV 2021 Learning Deep Local Features With Multiple Dynamic Attentions for Large-Scale Image Retrieval ICCV 2021 TransVG: End-to-End Visual Grounding With Transformers ICCV 2021 IOT: Instance-wise Layer Reordering for Transformer Structures ICLR 2021 ProphetNet-X: Large-Scale Pre-training Models for English, Chinese, Multi-lingual, Dialog, and Code Generation ACL 2021 Contrastive Transformation for Self-supervised Correspondence Learning AAAI 2021 Auto-Encoding Transformations in Reparameterized Lie Groups for Unsupervised Learning AAAI 2021 Instance Mining with Class Feature Banks for Weakly Supervised Object Detection AAAI 2021 Task-Independent Knowledge Makes for Transferable Representations for Generalized Zero-Shot Learning AAAI 2021 Hand-Model-Aware Sign Language Recognition AAAI 2021 Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection AAAI 2021 BANG: Bridging Autoregressive and Non-autoregressive Generation with Large Scale Pretraining ICML 2021 Contextual Similarity Aggregation with Self-attention for Visual Re-ranking NIPS 2021 Dual Progressive Prototype Network for Generalized Zero-Shot Learning NIPS 2021 Revisiting Knowledge Distillation: An Inheritance and Exploration Framework CVPR 2021 Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAE CVPR 2021 Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking CVPR 2021 ATSO: Asynchronous Teacher-Student Optimization for Semi-Supervised Image Segmentation CVPR 2021 Model-Aware Gesture-to-Gesture Translation CVPR 2021 Unsupervised Pre-Training for Person Re-Identification CVPR 2021 Representing Videos As Discriminative Sub-Graphs for Action Recognition CVPR 2021 Transformation GAN for Unsupervised Image Synthesis and Representation Learning CVPR 2020 M-LVC: Multiple Frames Prediction for Learned Video Compression CVPR 2020 Multi-Question Learning for Visual Question Answering AAAI 2020 Incorporating BERT into Neural Machine Translation ICLR 2020 Relation-Guided Spatial Attention and Temporal Refinement for Video-Based Person Re-Identification AAAI 2020 POST: POlicy-Based Switch Tracking AAAI 2020 Deep Model-Based Reinforcement Learning via Estimated Uncertainty and Conservative Policy Optimization AAAI 2020 Promoting Stochasticity for Expressive Policies via a Simple and Efficient Regularization Method NIPS 2020 Attentive Experience Replay AAAI 2020 Spatial-Temporal Multi-Cue Network for Continuous Sign Language Recognition AAAI 2020 Unsupervised Deep Tracking CVPR 2019 Quantization Networks CVPR 2019 Spatial and Temporal Mutual Promotion for Video-Based Person Re-Identification AAAI 2019 Iterative Alignment Network for Continuous Sign Language Recognition CVPR 2019 Relation Distillation Networks for Video Object Detection ICCV 2019 Densely Supervised Hierarchical Policy-Value Network for Image Paragraph Generation IJCAI 2019 Improving Deep Neural Network Sparsity through Decorrelation Regularization IJCAI 2018 Multi-Cue Correlation Filters for Robust Visual Tracking CVPR 2018 CCNet: Cluster-Coordinated Net for Learning Multi-agent Communication Protocols with Reinforcement Learning ACML 2018 Towards Open-Set Identity Preserving Face Synthesis CVPR 2018 Affinity Derivation and Graph Merge for Instance Segmentation ECCV 2018 Dilated Convolutional Network with Iterative Optimization for Continuous Sign Language Recognition IJCAI 2018 Feature Selective Networks for Object Detection CVPR 2018 CVAE-GAN: Fine-Grained Image Generation Through Asymmetric Training ICCV 2017 Video Captioning With Transferred Semantic Attributes CVPR 2017 Comparative Deep Learning of Hybrid Representations for Image Recommendations CVPR 2016 Learning Deep Intrinsic Video Representation by Exploring Temporal Coherence and Graph Structure IJCAI 2016 Jointly Modeling Embedding and Translation to Bridge Video and Language CVPR 2016 Semi-Supervised Domain Adaptation With Subspace Learning for Visual Recognition CVPR 2015 SOM: Semantic Obviousness Metric for Image Quality Assessment CVPR 2015 Separable Kernel for Image Deblurring CVPR 2014