Jing Liu
163 papers · 2011–2026 · 19 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (20) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (6) π£ Hot Topic Early Bird
π
Interdisciplinary Bridge
π
Conference Polyglot
(19)
π
Cross-Pollinator
(12)
π
Conference Loyalist
(23)
π€
Dynamic Duo
(18)
π
Triple Crown
π§¬
Topic Evolution
π
Grand Slam
π¬
Deep Specialist
(20)
π
Keyword Champion
π
Conference Pioneer
π₯
Unstoppable
(13)
β‘
Prolific Year
(9)
π
Century Club
(153)
ποΈ
Keyword Collector
(646)
π
Trend Setter
Conferences
AAAI (25)
CVPR (23)
ACL (19)
EMNLP (16)
ICCV (14)
NIPS (12)
IJCAI (9)
ICLR (8)
ECCV (7)
COLING (5)
WACV (4)
INTERSPEECH (4)
IJCNLP (4)
ICML (4)
NAACL (3)
MIDL (2)
OSDI (2)
MICCAI (1)
CONLL (1)
Top co-authors
Keywords
model compression
(13)
large language model
(12)
attention mechanism
(8)
machine reading comprehension
(8)
diffusion model
(6)
semantic segmentation
(6)
question answering
(6)
multimodal learning
(6)
representation learning
(5)
model robustness
(5)
video understanding
(5)
vision-language model
(5)
transfer learning
(5)
object detection
(5)
visual question answering
(5)
knowledge distillation
(4)
retrieval-augmented generation
(4)
multi-modal learning
(4)
information retrieval
(4)
vision transformer
(4)
Papers
GeoX-Bench: Benchmarking Cross-View Geo-Localization and Pose Estimation Capabilities of Large Multimodal Models
AAAI 2026
OmniSparse: Training-Aware Fine-Grained Sparse Attention for Long-Video MLLMs
AAAI 2026
Textual Self-Attention Network: Test-Time Preference Optimization Through Textual Gradient-Based Attention
AAAI 2026
Reinforced Informativeness Optimization for Long-Form Retrieval-Augmented Generation
ACL 2026
SFGA: Similarity-Constrained Fusion Learning for Unsupervised Anomaly Detection in Multiplex Graphs
AAAI 2026
LatentLLM: Activation-Aware Transform to Multi-Head Latent Attention
AAAI 2026
UrbanNav: Learning Language-Guided Embodied Urban Navigation from Web-Scale Human Trajectories
AAAI 2026
BEE-RAG: Balanced Entropy Engineering for Retrieval-Augmented Generation
AAAI 2026
SimpleDiffusion: A Lightweight and Efficient Conditional Diffusion Model for Multi-Modal Salient Object Detection
AAAI 2026
M3-VQA: A Benchmark for Multimodal, Multi-Entity, Multi-Hop Visual Question Answering
ACL 2026
Context-aware Dynamic Pruning for Speech Foundation Models
ICLR 2025
ViPE: Visual Perception in Parameter Space for Efficient Video-Language Understanding
EMNLP 2025
VRoPE: Rotary Position Embedding for Video Large Language Models
EMNLP 2025
M2OST: Many-to-one Regression for Predicting Spatial Transcriptomics from Digital Pathology Images
AAAI 2025
DiMSOD: A Diffusion-Based Framework for Multi-Modal Salient Object Detection
AAAI 2025
TRAIL: Trust-Aware Client Scheduling for Semi-Decentralized Federated Learning
AAAI 2025
FedCross: Intertemporal Federated Learning Under Evolutionary Games
AAAI 2025
AutoSGNN: Automatic Propagation Mechanism Discovery for Spectral Graph Neural Networks
AAAI 2025
Forget to Flourish: Leveraging Machine-Unlearning on Pretrained Language Models for Privacy Leakage
AAAI 2025
Numerical Pruning for Efficient Autoregressive Models
AAAI 2025
Graph Contrastive Learning with Joint Spectral Augmentation of Attribute and Topology
AAAI 2025
Channel Merging: Preserving Specialization for Merged Experts
AAAI 2025
AR-Diffusion: Asynchronous Video Generation with Auto-Regressive Diffusion
CVPR 2025
QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge
CVPR 2025
COAP: Memory-Efficient Training with Correlation-Aware Gradient Projection
CVPR 2025
Efficient Motion-Aware Video MLLM
CVPR 2025
ID-Patch: Robust ID Association for Group Photo Personalization
CVPR 2025
Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation
COLING 2025
Breaking the Encoder Barrier for Seamless Video-Language Understanding
ICCV 2025
SplatFace: Gaussian Splat Face Reconstruction Leveraging an Optimizable Surface
WACV 2025
GroundingMate: Aiding Object Grounding for Goal-Oriented Vision-and-Language Navigation
WACV 2025
Scaling Omni-modal Pretraining with Multimodal Context: Advancing Universal Representation Learning Across Modalities
ICCV 2025
ZipVL: Accelerating Vision-Language Models through Dynamic Token Sparsity
ICCV 2025
MotionCtrl: A Real-time Controllable Vision-Language-Motion Model
ICCV 2025
COSMO: Combination of Selective Memorization for Low-cost Vision-and-Language Navigation
ICCV 2025
An Empirical Study of LLM-as-a-Judge for LLM Evaluation: Fine-tuned Judge Model is not a General Substitute for GPT-4
ACL 2025
Winning Big with Small Models: Knowledge Distillation vs. Self-Training for Reducing Hallucination in QA Agents
ACL 2025
Exploring the Frontiers of Animation Video Generation in the Sora Era: Method, Dataset and Benchmark
IJCAI 2025
Learning Beyond Still Frames: Scaling Vision-Language Models with Video
ICCV 2025
ECC: Synergizing Emotion, Cause and Commonsense for Empathetic Dialogue Generation
COLING 2025
RoleBreak: Character Hallucination as a Jailbreak Attack in Role-Playing Systems
COLING 2025
Few-Shot Learner Generalizes Across AI-Generated Image Detection
ICML 2025
HarmoniCa: Harmonizing Training and Inference for Better Feature Caching in Diffusion Transformer Acceleration
ICML 2025
Findings of the Third BabyLM Challenge: Accelerating Language Modeling Research with Cognitively Plausible Data
EMNLP 2025
Gap Preserving Distillation by Building Bidirectional Mappings with A Dynamic Teacher
ICLR 2025
Diffusion Feedback Helps CLIP See Better
ICLR 2025
Ada-K Routing: Boosting the Efficiency of MoE-based LLMs
ICLR 2025
Needle In A Video Haystack: A Scalable Synthetic Evaluator for Video MLLMs
ICLR 2025
LLM as Copilot for Coarse-grained Vision-and-Language Navigation
ECCV 2024
Self-Evaluation of Large Language Model based on Glass-box Features
EMNLP 2024
Self-Bootstrapped Visual-Language Model for Knowledge Selection and Question Answering
EMNLP 2024
REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering
EMNLP 2024
BASES: Large-scale Web Search User Simulation with Large Language Model based Agents
EMNLP 2024
Temporal Adaptive RGBT Tracking with Modality Prompt
AAAI 2024
Signed Graph Neural Ordinary Differential Equation for Modeling Continuous-Time Dynamics
AAAI 2024
Graph Disentangled Contrastive Learning with Personalized Transfer for Cross-Domain Recommendation
AAAI 2024
FG-Net: Facial Action Unit Detection With Generalizable Pyramidal Features
WACV 2024
Med-DANet V2: A Flexible Dynamic Architecture for Efficient Medical Volumetric Segmentation
WACV 2024
The Promises and Pitfalls of Using Language Models to Measure Instruction Quality in Education
NAACL 2024
Med-Tuning: A New Parameter-Efficient Tuning Framework for Medical Volumetric Segmentation
MIDL 2024
SegNeuron: 3D Neuron Instance Segmentation in Any EM Volume with a Generalist Model
MICCAI 2024
Interleaved Audio/Audiovisual Transfer Learning for AV-ASR in Low-Resourced Languages
INTERSPEECH 2024
Soft Knowledge Prompt: Help External Knowledge Become a Better Teacher to Instruct LLM in Knowledge-based VQA
ACL 2024
Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with Human Intentions
ACL 2024
ConEC: Earnings Call Dataset with Real-world Contexts for Benchmarking Contextual Speech Recognition
COLING 2024
Automated Loss function Search for Class-imbalanced Node Classification
ICML 2024
Pretrained Optimization Model for Zero-Shot Black Box Optimization
NIPS 2024
CA-SSLR: Condition-Aware Self-Supervised Learning Representation for Generalized Speech Processing
NIPS 2024
ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification
NIPS 2024
MiniCache: KV Cache Compression in Depth Dimension for Large Language Models
NIPS 2024
Rapid Plug-in Defenders
NIPS 2024
Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation
CVPR 2024
Text Prompt with Normality Guidance for Weakly Supervised Video Anomaly Detection
CVPR 2024
Open-Vocabulary Video Anomaly Detection
CVPR 2024
TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models
CVPR 2024
SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models
CVPR 2024
Efficient Stitchable Task Adaptation
CVPR 2024
QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models
ICLR 2024
EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models
ICLR 2024
COSA: Concatenated Sample Pretrained Vision-Language Foundation Model
ICLR 2024
Stitched ViTs are Flexible Vision Backbones
ECCV 2024
Spatio-Temporal Domain Awareness for Multi-Agent Collaborative Perception
ICCV 2023
PTQD: Accurate Post-Training Quantization for Diffusion Models
NIPS 2023
How2comm: Communication-Efficient and Collaboration-Pragmatic Multi-Agent Perception
NIPS 2023
VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
NIPS 2023
GLOBER: Coherent Non-autoregressive Video Generation via GLOBal Guided Video DecodER
NIPS 2023
SwiftAvatar: Efficient Auto-Creation of Parameterized Stylized Character on Arbitrary Avatar Engines
AAAI 2023
TOME: A Two-stage Approach for Model-based Retrieval
ACL 2023
Video Event Restoration Based on Keyframes for Video Anomaly Detection
CVPR 2023
Boosting Verified Training for Robust Image Classifications via Abstraction
CVPR 2023
Dynamic Focus-Aware Positional Queries for Semantic Segmentation
CVPR 2023
MOSO: Decomposing MOtion, Scene and Object for Video Prediction
CVPR 2023
OmniAvatar: Geometry-Guided Controllable 3D Head Synthesis
CVPR 2023
A Thorough Examination on Zero-shot Dense Retrieval
EMNLP 2023
AIDE: A Vision-Driven Multi-View, Multi-Modal, Multi-Tasking Dataset for Assistive Driving Perception
ICCV 2023
BiViT: Extremely Compressed Binary Vision Transformers
ICCV 2023
March in Chat: Interactive Prompting for Remote Embodied Referring Expression
ICCV 2023
LoTE-Animal: A Long Time-span Dataset for Endangered Animal Behavior Understanding
ICCV 2023
Less Learn Shortcut: Analyzing and Mitigating Learning of Spurious Feature-Label Correlation
IJCAI 2023
A Survey on Efficient Training of Transformers
IJCAI 2023
Model-Internal Slot-triggered Biasing for Domain Expansion in Neural Transducer ASR Models
INTERSPEECH 2023
Intra- and Inter-Cellular Awareness for 3D Neuron Tracking and Segmentation in Large-Scale Connectomics
MIDL 2023
EcoFormer: Energy-Saving Attention with Linear Complexity
NIPS 2022
Computationally Identifying Funneling and Focusing Questions in Classroom Discourse
NAACL 2022
Dynamic Local Aggregation Network with Adaptive Clusterer for Anomaly Detection
ECCV 2022
DuReader-Retrieval: A Large-scale Chinese Benchmark for Passage Retrieval from Web Search Engine
EMNLP 2022
Less Is More: Pay Less Attention in Vision Transformers
AAAI 2022
DuQM: A Chinese Dataset of Linguistically Perturbed Natural Questions for Evaluating the Robustness of Question Matching Models
EMNLP 2022
CoPur: Certifiably Robust Collaborative Inference via Feature Purification
NIPS 2022
DuReadervis: A Chinese Dataset for Open-domain Document Visual Question Answering
ACL 2022
Consistent-Separable Feature Representation for Semantic Segmentation
AAAI 2021
Measuring Conversational Uptake: A Case Study on Student-Teacher Interactions
IJCNLP 2021
DuReader_robust: A Chinese Dataset Towards Evaluating Robustness and Generalization of Machine Reading Comprehension in Real-World Applications
ACL 2021
RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain Question Answering
NAACL 2021
DuReader_robust: A Chinese Dataset Towards Evaluating Robustness and Generalization of Machine Reading Comprehension in Real-World Applications
IJCNLP 2021
Scalable Vision Transformers With Hierarchical Pooling
ICCV 2021
HAIR: Hierarchical Visual-Semantic Relational Reasoning for Video Question Answering
ICCV 2021
PAIR: Leveraging Passage-Centric Similarity Relation for Improving Dense Passage Retrieval
IJCNLP 2021
A Novel Method to Solve Neural Knapsack Problems
ICML 2021
RocketQAv2: A Joint Training Method for Dense Passage Retrieval and Passage Re-ranking
EMNLP 2021
Measuring Conversational Uptake: A Case Study on Student-Teacher Interactions
ACL 2021
Phonetically Induced Subwords for End-to-End Speech Recognition
INTERSPEECH 2021
PAIR: Leveraging Passage-Centric Similarity Relation for Improving Dense Passage Retrieval
ACL 2021
AQD: Towards Accurate Quantized Object Detection
CVPR 2021
A Cluster-Weighted Kernel K-Means Method for Multi-View Clustering
AAAI 2020
Learning Progressive Joint Propagation for Human Motion Prediction
ECCV 2020
Generative Low-bitwidth Data Free Quantization
ECCV 2020
Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision
ECCV 2020
Deep Transferring Quantization
ECCV 2020
Normalized and Geometry-Aware Self-Attention Network for Image Captioning
CVPR 2020
Latent Regularized Generative Dual Adversarial Network For Abnormal Detection
IJCAI 2020
Non-Autoregressive Image Captioning with Counterfactuals-Critical Multi-Agent Learning
IJCAI 2020
A Robust Adversarial Training Approach to Machine Reading Comprehension
AAAI 2020
D-NET: A Pre-Training and Fine-Tuning Framework for Improving the Generalization of Machine Reading Comprehension
EMNLP 2019
Deep Incremental Hashing Network for Efficient Image Retrieval
CVPR 2019
MSCap: Multi-Style Image Captioning With Unpaired Stylized Text
CVPR 2019
Dual Attention Network for Scene Segmentation
CVPR 2019
Densely Connected Attention Flow for Visual Question Answering
IJCAI 2019
FakeTables: Using GANs to Generate Functional Dependency Preserving Tables with Bounded Real Data
IJCAI 2019
VEST: A System for Vulnerability Exploit Scoring & Timing
IJCAI 2019
Enhancing Pre-Trained Language Representations with Rich Knowledge for Machine Reading Comprehension
ACL 2019
Adaptive Context Network for Scene Parsing
ICCV 2019
Discrimination-aware Channel Pruning for Deep Neural Networks
NIPS 2018
Answer-focused and Position-aware Neural Question Generation
EMNLP 2018
Aggregated Semantic Matching for Short Text Entity Linking
CONLL 2018
Neural Math Word Problem Solver with Reinforcement Learning
COLING 2018
Principled Schedulability Analysis for Distributed Storage Systems using Thread Architecture Models
OSDI 2018
Fault-Tolerance, Fast and Slow: Exploiting Failure Asynchrony in Distributed Systems
OSDI 2018
Multi-Passage Machine Reading Comprehension with Cross-Passage Answer Verification
ACL 2018
Adaptations of ROUGE and BLEU to Better Evaluate Machine Reading Comprehension Task
ACL 2018
DuReader: a Chinese Machine Reading Comprehension Dataset from Real-world Applications
ACL 2018
A-Lamp: Adaptive Layout-Aware Multi-Patch Deep Convolutional Neural Network for Photo Aesthetic Assessment
CVPR 2017
A Statistical Framework for Product Description Generation
IJCNLP 2017
Knowledge Base Completion via Coupled Path Ranking
ACL 2016
Understanding Periodically Interrupted Mandarin Speech
INTERSPEECH 2016
RBPB: Regularization-Based Pattern Balancing Method for Event Extraction
ACL 2016
News Citation Recommendation with Implicit and Explicit Semantics
ACL 2016
Weakly Supervised RBM for Semantic Segmentation
IJCAI 2015
A Regularized Competition Model for Question Difficulty Estimation in Community Question Answering Services
EMNLP 2014
Question Difficulty Estimation in Community Question Answering Services
EMNLP 2013
A Hierarchical Entity-Based Approach to Structuralize User Generated Content in Social Media: A Case of Yahoo! Answers
EMNLP 2013
Weakly-Supervised Dual Clustering for Image Semantic Segmentation
CVPR 2013
Nonlinear Evidence Fusion and Propagation for Hyponymy Relation Mining
ACL 2011