Houqiang Li
143 papers · 2014–2026 · 14 top CS/AI conferences
Achievements
Taxonomy Completionist (10)
Keyword Pioneer
Interdisciplinary Bridge
Renaissance Researcher (5)
Conference Polyglot (14)
Conference Loyalist (23)
Deep Specialist (22)
Grand Slam
Keyword Champion (6)
Dynamic Duo (86)
Triple Crown
Keyword Collector (582)
Century Club (140)
The Questioner
Conference Pioneer
Prolific Year (10)
Unstoppable (12)
Trend Setter
Conferences
CVPR (41)
AAAI (25)
ICCV (17)
NIPS (12)
ICLR (9)
ACL (8)
ECCV (8)
IJCAI (7)
ICML (6)
EMNLP (5)
COLING (2)
ACML (1)
IJCNLP (1)
UAI (1)
Keywords
representation learning (16)
contrastive learning (8)
semantic segmentation (7)
video understanding (7)
transfer learning (7)
feature representation (6)
sign language recognition (6)
knowledge distillation (6)
reinforcement learning (5)
domain adaptation (5)
weakly supervised learning (5)
object detection (5)
semi-supervised learning (5)
zero-shot learning (5)
self-supervised learning (5)
metric learning (5)
multi-agent reinforcement learning (5)
image generation (5)
model compression (5)
unsupervised learning (5)
Papers
DocR1: Evidence Page-Guided GRPO for Multi-Page Document Understanding
AAAI 2026
Rethinking Long-tailed Dataset Distillation: A Uni-Level Framework with Unbiased Recovery and Relabeling
AAAI 2026
Bias Fitting to Mitigate Length Bias of Reward Model in RLHF
ACL 2026
Towards Practical Real-Time Neural Video Compression
CVPR 2025
S3R-GS: Streamlining the Pipeline for Large-Scale Street Scene Reconstruction
ICCV 2025
Incremental Transformer: Efficient Encoder for Incremented Text Over MRC and Conversation Tasks
COLING 2025
Active Perception Meets Rule-Guided RL: A Two-Phase Approach for Precise Object Navigation in Complex Environments
ICCV 2025
Controllable Style Arithmetic with Language Models
ACL 2025
DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models
CVPR 2025
Visual Evidence Prompting Mitigates Hallucinations in Large Vision-Language Models
ACL 2025
Self-Classification Enhancement and Correction for Weakly Supervised Object Detection
IJCAI 2025
OPTICAL: Leveraging Optimal Transport for Contribution Allocation in Dataset Distillation
CVPR 2025
Interpret and Improve In-Context Learning via the Lens of Input-Label Mappings
ACL 2025
Robust Multimodal Large Language Models Against Modality Conflict
ICML 2025
RaCFormer: Towards High-Quality 3D Object Detection via Query-based Radar-Camera Fusion
CVPR 2025
Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters
CVPR 2025
EG4D: Explicit Generation of 4D Object without Score Distillation
ICLR 2025
SmartEraser: Remove Anything from Images using Masked-Region Guidance
CVPR 2025
Uni-Sign: Toward Unified Sign Language Understanding at Scale
ICLR 2025
Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models
AAAI 2025
TinySAM: Pushing the Envelope for Efficient Segment Anything Model
AAAI 2025
Instance-aware Exploration-Verification-Exploitation for Instance ImageGoal Navigation
CVPR 2024
TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy
NIPS 2024
Learning Label Dependencies for Visual Information Extraction
IJCAI 2024
Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning
ICML 2024
From Yes-Men to Truth-Tellers: Addressing Sycophancy in Large Language Models with Pinpoint Tuning
ICML 2024
Image2Sentence based Asymmetrical Zero-shot Composed Image Retrieval
ICLR 2024
Revisiting Open-Set Panoptic Segmentation
AAAI 2024
KGDM: A Diffusion Model to Capture Multiple Relation Semantics for Knowledge Graph Embedding
AAAI 2024
SUF: Stabilized Unconstrained Fine-Tuning for Offline-to-Online Reinforcement Learning
AAAI 2024
Semi-Supervised Spoken Language Glossification
ACL 2024
Sinkhorn Distance Minimization for Knowledge Distillation
COLING 2024
BoolQuestions: Does Dense Retrieval Understand Boolean Logic in Language?
EMNLP 2024
Interpretable Composition Attribution Enhancement for Visio-linguistic Compositional Understanding
EMNLP 2024
Long-term Temporal Context Gathering for Neural Video Compression
ECCV 2024
Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis
ECCV 2024
Generative Latent Coding for Ultra-Low Bitrate Image Compression
CVPR 2024
InstructDiffusion: A Generalist Modeling Interface for Vision Tasks
CVPR 2024
State Sequences Prediction via Fourier Transform for Representation Learning
NIPS 2023
MA2CL: Masked Attentive Contrastive Learning for Multi-Agent Reinforcement Learning
IJCAI 2023
Learning robust representation for reinforcement learning with distractions by reward sequence prediction
UAI 2023
DIFFER: Decomposing Individual Reward for Fair Experience Replay in Multi-Agent Reinforcement Learning
NIPS 2023
AltFreezing for More General Video Face Forgery Detection
CVPR 2023
Hierarchical Multi-Agent Skill Discovery
NIPS 2023
Stare at What You See: Masked Image Modeling Without Reconstruction
CVPR 2023
Human Pose As Compositional Tokens
CVPR 2023
Cyclic-Bootstrap Labeling for Weakly Supervised Object Detection
ICCV 2023
Masked Motion Predictors are Strong 3D Action Representation Learners
ICCV 2023
Focus on Your Target: A Dual Teacher-Student Framework for Domain-Adaptive Semantic Segmentation
ICCV 2023
SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning
ICCV 2023
DIRE for Diffusion-Generated Image Detection
ICCV 2023
Sign Language Translation with Iterative Prototype
ICCV 2023
CLIP4HOI: Towards Adapting CLIP for Practical Zero-Shot HOI Detection
NIPS 2023
Multi-Agent First Order Constrained Optimization in Policy Space
NIPS 2023
$\mathcal{O}$-GNN: incorporating ring priors into molecular modeling
ICLR 2023
Low-Light Video Enhancement with Synthetic Event Guidance
AAAI 2023
BEST: BERT Pre-training for Sign Language Recognition with Coupling Tokenization
AAAI 2023
HandNeRF: Neural Radiance Fields for Animatable Interacting Hands
CVPR 2023
Motion Information Propagation for Neural Video Compression
CVPR 2023
Asymmetric Feature Fusion for Image Retrieval
CVPR 2023
CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Alignment
ICLR 2023
A General Rank Preserving Framework for Asymmetric Image Retrieval
ICLR 2023
Making Better Decision by Directly Planning in Continuous Control
ICLR 2023
NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation
ACL 2023
Hybrid and Collaborative Passage Reranking
ACL 2023
TAPE: Task-Agnostic Prior Embedding for Image Restoration
ECCV 2022
LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning
NIPS 2022
Hand-Object Interaction Image Generation
NIPS 2022
Learning Token-Based Representation for Image Retrieval
AAAI 2022
Learning Robust Policy against Disturbance in Transition Dynamics via State-Conservative Policy Optimization
AAAI 2022
Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic
AAAI 2022
Uformer: A General U-Shaped Transformer for Image Restoration
CVPR 2022
Contextual Similarity Distillation for Asymmetric Image Retrieval
CVPR 2022
Large-Scale Pre-Training for Person Re-Identification With Noisy Labels
CVPR 2022
Domain-Agnostic Prior for Transfer Semantic Segmentation
CVPR 2022
CMD: Self-Supervised 3D Action Representation Learning with Cross-Modal Mutual Distillation
ECCV 2022
CMT: Context-Matching-Guided Transformer for 3D Tracking in Point Clouds
ECCV 2022
MVP: Multimodality-Guided Visual Pre-training
ECCV 2022
Geometric Representation Learning for Document Image Rectification
ECCV 2022
Neural-based Mixture Probabilistic Query Embedding for Answering FOL queries on Knowledge Graphs
EMNLP 2022
Supervised Off-Policy Ranking
ICML 2022
Equivalence Analysis between Counterfactual Regret Minimization and Online Mirror Descent
ICML 2022
Improving Sign Language Translation With Monolingual Data by Sign Back-Translation
CVPR 2021
ProphetNet-X: Large-Scale Pre-training Models for English, Chinese, Multi-lingual, Dialog, and Code Generation
IJCNLP 2021
Fine-grained Semantic Alignment Network for Weakly Supervised Temporal Language Grounding
EMNLP 2021
Discovering Representation Sprachbund For Multilingual Pre-Training
EMNLP 2021
Probing Inter-modality: Visual Parsing with Self-Attention for Vision-and-Language Pre-training
NIPS 2021
Joint Inductive and Transductive Learning for Video Object Segmentation
ICCV 2021
SignBERT: Pre-Training of Hand-Model-Aware Representation for Sign Language Recognition
ICCV 2021
Conditional DETR for Fast Training Convergence
ICCV 2021
Instance-Wise Hard Negative Example Generation for Contrastive Learning in Unpaired Image-to-Image Translation
ICCV 2021
3D Local Convolutional Neural Networks for Gait Recognition
ICCV 2021
Learning Deep Local Features With Multiple Dynamic Attentions for Large-Scale Image Retrieval
ICCV 2021
TransVG: End-to-End Visual Grounding With Transformers
ICCV 2021
IOT: Instance-wise Layer Reordering for Transformer Structures
ICLR 2021
ProphetNet-X: Large-Scale Pre-training Models for English, Chinese, Multi-lingual, Dialog, and Code Generation
ACL 2021
Contrastive Transformation for Self-supervised Correspondence Learning
AAAI 2021
Auto-Encoding Transformations in Reparameterized Lie Groups for Unsupervised Learning
AAAI 2021
Instance Mining with Class Feature Banks for Weakly Supervised Object Detection
AAAI 2021
Task-Independent Knowledge Makes for Transferable Representations for Generalized Zero-Shot Learning
AAAI 2021
Hand-Model-Aware Sign Language Recognition
AAAI 2021
Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection
AAAI 2021
BANG: Bridging Autoregressive and Non-autoregressive Generation with Large Scale Pretraining
ICML 2021
Contextual Similarity Aggregation with Self-attention for Visual Re-ranking
NIPS 2021
Dual Progressive Prototype Network for Generalized Zero-Shot Learning
NIPS 2021
Revisiting Knowledge Distillation: An Inheritance and Exploration Framework
CVPR 2021
Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAE
CVPR 2021
Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking
CVPR 2021
ATSO: Asynchronous Teacher-Student Optimization for Semi-Supervised Image Segmentation
CVPR 2021
Model-Aware Gesture-to-Gesture Translation
CVPR 2021
Unsupervised Pre-Training for Person Re-Identification
CVPR 2021
Representing Videos As Discriminative Sub-Graphs for Action Recognition
CVPR 2021
Transformation GAN for Unsupervised Image Synthesis and Representation Learning
CVPR 2020
M-LVC: Multiple Frames Prediction for Learned Video Compression
CVPR 2020
Multi-Question Learning for Visual Question Answering
AAAI 2020
Incorporating BERT into Neural Machine Translation
ICLR 2020
Relation-Guided Spatial Attention and Temporal Refinement for Video-Based Person Re-Identification
AAAI 2020
POST: POlicy-Based Switch Tracking
AAAI 2020
Deep Model-Based Reinforcement Learning via Estimated Uncertainty and Conservative Policy Optimization
AAAI 2020
Promoting Stochasticity for Expressive Policies via a Simple and Efficient Regularization Method
NIPS 2020
Attentive Experience Replay
AAAI 2020
Spatial-Temporal Multi-Cue Network for Continuous Sign Language Recognition
AAAI 2020
Unsupervised Deep Tracking
CVPR 2019
Quantization Networks
CVPR 2019
Spatial and Temporal Mutual Promotion for Video-Based Person Re-Identification
AAAI 2019
Iterative Alignment Network for Continuous Sign Language Recognition
CVPR 2019
Relation Distillation Networks for Video Object Detection
ICCV 2019
Densely Supervised Hierarchical Policy-Value Network for Image Paragraph Generation
IJCAI 2019
Improving Deep Neural Network Sparsity through Decorrelation Regularization
IJCAI 2018
Multi-Cue Correlation Filters for Robust Visual Tracking
CVPR 2018
CCNet: Cluster-Coordinated Net for Learning Multi-agent Communication Protocols with Reinforcement Learning
ACML 2018
Towards Open-Set Identity Preserving Face Synthesis
CVPR 2018
Affinity Derivation and Graph Merge for Instance Segmentation
ECCV 2018
Dilated Convolutional Network with Iterative Optimization for Continuous Sign Language Recognition
IJCAI 2018
Feature Selective Networks for Object Detection
CVPR 2018
CVAE-GAN: Fine-Grained Image Generation Through Asymmetric Training
ICCV 2017
Video Captioning With Transferred Semantic Attributes
CVPR 2017
Comparative Deep Learning of Hybrid Representations for Image Recommendations
CVPR 2016
Learning Deep Intrinsic Video Representation by Exploring Temporal Coherence and Graph Structure
IJCAI 2016
Jointly Modeling Embedding and Translation to Bridge Video and Language
CVPR 2016
Semi-Supervised Domain Adaptation With Subspace Learning for Visual Recognition
CVPR 2015
SOM: Semantic Obviousness Metric for Image Quality Assessment
CVPR 2015
Separable Kernel for Image Deblurring
CVPR 2014