conftrace_

Jing Zhang

235 papers · 2003–2026 · 21 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓
+16 more ↓ πŸ—ΊοΈ Taxonomy Completionist (29) 🧭 Keyword Pioneer πŸŒ‰ Interdisciplinary Bridge 🌈 Renaissance Researcher (7) 🌍 Conference Polyglot (21)
🌈 Renaissance Researcher (7) πŸŒ‰ Interdisciplinary Bridge πŸ—ΊοΈ Taxonomy Completionist (29) 🏠 Conference Loyalist (41) 🀝 Dynamic Duo (45) πŸ‘‘ Triple Crown πŸ† Keyword Champion πŸ† Grand Slam πŸ”¬ Deep Specialist (30) πŸ—ƒοΈ Keyword Collector (60) πŸš€ Conference Pioneer πŸ”₯ Unstoppable (10) ⚑ Prolific Year (35) ❓ The Questioner (3) πŸ’Ž Century Club (225) πŸ“ˆ Trend Setter

Conferences

AAAI (48) CVPR (42) ACL (21) NIPS (21) ICCV (18) ECCV (17) IJCAI (14) EMNLP (12) ICML (6) MICCAI (6) ICLR (5) COLING (5) NAACL (4) SEMEVAL (4) WACV (3) INTERSPEECH (2) JMLR (2) ACML (2) IJCNLP (1) MIDL (1) UAI (1)

Papers

Learning Spatial Decay for Vision Transformers AAAI 2026 DCMM-Transformer: Degree-Corrected Mixed-Membership Attention for Medical Imaging AAAI 2026 LLM-SLM Collaborative Framework of Idiomatic Expression Generation ACL 2026 Prune4Web: DOM Tree Pruning Programming for Web Agent AAAI 2026 SteerMusic: Enhanced Musical Consistency for Zero-shot Text-Guided and Personalized Music Editing AAAI 2026 Unlocking Vision-Language Models for Video Anomaly Detection via Fine-Grained Prompting WACV 2026 S5: Scalable Semi-Supervised Semantic Segmentation in Remote Sensing AAAI 2026 Sparse-RL: Breaking the Memory Wall in LLM Reinforcement Learning via Stable Sparse Rollouts ACL 2026 Omni-I2C: A Holistic Benchmark for High-Fidelity Image-to-Code Generation ACL 2026 Attribution Analysis-based Concept Alignment: A Human-in-the-loop Data Debugging Framework AAAI 2026 SAQ-SAM: Semantically-Aligned Quantization for Segment Anything Model AAAI 2026 SAM Decoding: Speculative Decoding via Suffix Automaton ACL 2025 MapNav: A Novel Memory Representation via Annotated Semantic Maps for VLM-based Vision-and-Language Navigation ACL 2025 Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL ACL 2025 TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios ACL 2025 Probability Density Geodesics in Image Diffusion Latent Space CVPR 2025 SafeMap: Robust HD Map Construction from Incomplete Observations ICML 2025 CSTrack: Enhancing RGB-X Tracking via Compact Spatiotemporal Features ICML 2025 Consistency Rating of Semantic Transparency: an Evaluation Method for Metaphor Competence in Idiom Understanding Tasks COLING 2025 CARE Transformer: Mobile-Friendly Linear Visual Transformer via Decoupled Dual Interaction CVPR 2025 Brain-Inspired Spiking Neural Networks for Energy-Efficient Object Detection CVPR 2025 FlightPatchNet: Multi-Scale Patch Network with Differential Coding for Short-Term Flight Trajectory Prediction UAI 2025 Black Sheep in the Herd: Playing with Spuriously Correlated Attributes for Vision-Language Recognition ICLR 2025 Streamlining Redundant Layers to Compress Large Language Models ICLR 2025 Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning ICLR 2025 GARF: Learning Generalizable 3D Reassembly for Real-World Fractures ICCV 2025 ATCTrack: Aligning Target-Context Cues with Dynamic Target States for Robust Vision-Language Tracking ICCV 2025 Rethink Sparse Signals for Pose-guided Text-to-image Generation ICCV 2025 Synergistic Prompting for Robust Visual Recognition with Missing Modalities ICCV 2025 Harnessing Massive Satellite Imagery with Efficient Masked Image Modeling ICCV 2025 What Makes for Text to 360-degree Panorama Generation with Stable Diffusion? ICCV 2025 Adversarial Exploitation of Data Diversity Improves Visual Localization ICCV 2025 Oblique Genomics Mixture of Experts: Prediction of Brain Disorder With Aging-Related Changes of Brain’s Structural Connectivity Under Genomic Influences MICCAI 2025 Rethink Rumor Detection in the Era of LLMs: A Review EMNLP 2025 L-Diffusion: Laplace Diffusion for Efficient Pathology Image Segmentation ICML 2025 Domain-Adaptive Diagnosis of Lewy Body Disease with Transferability Aware Transformer MICCAI 2025 A Unified Continuous Staging Framework for Alzheimer’s Disease and Lewy Body Dementia via Hierarchical Anatomical Features MICCAI 2025 FedClean: A General Robust Label Noise Correction for Federated Learning ICML 2025 DDPA-3DVG: Vision-Language Dual-Decoupling and Progressive Alignment for 3D Visual Grounding IJCAI 2025 Self-calibration Enhanced Whole Slide Pathology Image Analysis IJCAI 2025 BEVTrack: A Simple and Strong Baseline for 3D Single Object Tracking in Bird's-Eye View IJCAI 2025 Open-Vocabulary Fine-Grained Hand Action Detection IJCAI 2025 Human-Imperceptible, Machine-Recognizable Images IJCAI 2025 Core-Periphery Principle Guided State Space Model for Functional Connectome Classification MICCAI 2025 MOL-Mamba: Enhancing Molecular Representation with Structural & Electronic Insights AAAI 2025 Patch-level Sounding Object Tracking for Audio-Visual Question Answering AAAI 2025 Multi-axis Prompt and Multi-dimension Fusion Network for All-in-one Weather-degraded Image Restoration AAAI 2025 UAWTrack: Universal 3D Single Object Tracking in Adverse Weather AAAI 2025 Semi-supervised Infrared Small Target Detection with Thermodynamic-Inspired Uneven Perturbation and Confidence Adaptation AAAI 2025 MOCID: Motion Context and Displacement Information Learning for Moving Infrared Small Target Detection AAAI 2025 UICOMPASS: UI Map Guided Mobile Task Automation via Adaptive Action Generation EMNLP 2025 Highly Imperceptible Black-Box Graph Injection Attacks with Reinforcement Learning AAAI 2025 FacLens: Transferable Probe for Foreseeing Non-Factuality in Fact-Seeking Question Answering of Large Language Models EMNLP 2025 Identifying and Mitigating Position Bias of Multi-image Vision-Language Models CVPR 2025 CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos CVPR 2025 Empowering LLMs to Understand and Generate Complex Vector Graphics CVPR 2025 SAIST: Segment Any Infrared Small Target Model Guided by Contrastive Language-Image Pretraining CVPR 2025 XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery? CVPR 2025 P2 Law: Scaling Law for Post-Training After Model Pruning ACL 2025 CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis ACL 2025 Dynamic Scaling of Unit Tests for Code Reward Modeling ACL 2025 Dynamic Parallel Tree Search for Efficient LLM Reasoning ACL 2025 IRSAM: Advancing Segment Anything Model for Infrared Small Target Detection ECCV 2024 LeMeViT: Efficient Vision Transformer with Learnable Meta Tokens for Remote Sensing Image Interpretation IJCAI 2024 Disentangling Domain and General Representations for Time Series Classification IJCAI 2024 GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching NIPS 2024 HybridVC: Efficient Voice Style Conversion with Text and Audio Prompts INTERSPEECH 2024 CP-CLIP: Core-Periphery Feature Alignment CLIP for Zero-Shot Medical Image Analysis MICCAI 2024 Gyri vs. Sulci: Core-Periphery Organization in Functional Brain Networks MICCAI 2024 Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model NIPS 2024 SpreadsheetBench: Towards Challenging Real World Spreadsheet Manipulation NIPS 2024 PowerPM: Foundation Model for Power Systems NIPS 2024 DreamSteerer: Enhancing Source Image Conditioned Editability using Personalized Diffusion Models NIPS 2024 LaViP: Language-Grounded Visual Prompting AAAI 2024 Multi-Modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation AAAI 2024 Object-Aware Adaptive-Positivity Learning for Audio-Visual Question Answering AAAI 2024 Data-Free Generalized Zero-Shot Learning AAAI 2024 Decomposing Semantic Shifts for Composed Image Retrieval AAAI 2024 SurgicalSAM: Efficient Class Promptable Surgical Instrument Segmentation AAAI 2024 Cross-Modal Feature Distribution Calibration for Few-Shot Visual Question Answering AAAI 2024 IRPruneDet: Efficient Infrared Small Target Detection via Wavelet Structure-Regularized Soft Channel Pruning AAAI 2024 SimDistill: Simulated Multi-Modal Distillation for BEV 3D Object Detection AAAI 2024 Adversarial Purification with the Manifold Hypothesis AAAI 2024 Quantum-Inspired Neural Network with Runge-Kutta Method AAAI 2024 Question Calibration and Multi-Hop Modeling for Temporal Question Answering AAAI 2024 AlignBench: Benchmarking Chinese Alignment of Large Language Models ACL 2024 Transferable and Efficient Non-Factual Content Detection via Probe Training with Offline Consistency Checking ACL 2024 SP3: Enhancing Structured Pruning via PCA Projection ACL 2024 Understanding Transcriptional Regulatory Redundancy by Learnable Global Subset Perturbations ACML 2024 A Cause-Effect Look at Alleviating Hallucination of Knowledge-grounded Dialogue Generation COLING 2024 Distilling Causal Effect of Data in Continual Few-shot Relation Learning COLING 2024 Diversifying Question Generation over Knowledge Base via External Natural Questions COLING 2024 LA-UCL: LLM-Augmented Unsupervised Contrastive Learning Framework for Few-Shot Text Classification COLING 2024 Samsung Research China-Beijing at SemEval-2024 Task 3: A multi-stage framework for Emotion-Cause Pair Extraction in Conversations SEMEVAL 2024 Samsung Research China-Beijing at SemEval-2024 Task 3: A multi-stage framework for Emotion-Cause Pair Extraction in Conversations NAACL 2024 LUWA Dataset: Learning Lithic Use-Wear Analysis on Microscopic Images CVPR 2024 A Semi-supervised Nighttime Dehazing Baseline with Spatial-Frequency Aware and Realistic Brightness Constraint CVPR 2024 UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather CVPR 2024 SVGDreamer: Text Guided SVG Generation with Diffusion Model CVPR 2024 ArGue: Attribute-Guided Prompt Tuning for Vision-Language Models CVPR 2024 SGSH: Stimulate Large Language Models with Skeleton Heuristics for Knowledge Base Question Generation NAACL 2024 MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation ECCV 2024 RaFE: Generative Radiance Fields Restoration ECCV 2024 Is Your HD Map Constructor Reliable under Sensor Corruptions? NIPS 2024 Training A Small Emotional Vision Language Model for Visual Art Comprehension ECCV 2024 Deciphering Rumors: A Multi-Task Learning Approach with Intent-aware Hierarchical Contrastive Learning EMNLP 2024 PCQPR: Proactive Conversational Question Planning with Reflection EMNLP 2024 IMPUS: Image Morphing with Perceptually-Uniform Sampling Using Diffusion Models ICLR 2024 OxyGenerator: Reconstructing Global Ocean Deoxygenation Over a Century with Deep Learning ICML 2024 Latent Optimal Paths by Gumbel Propagation for Variational Bayesian Dynamic Programming ICML 2024 Beyond Accuracy: Tracking more like Human via Visual Search NIPS 2024 MemVLT: Vision-Language Tracking with Adaptive Memory-based Prompts NIPS 2024 Learning to Learn Better for Video Object Segmentation AAAI 2023 DeepSolo: Let Transformer Decoder With Explicit Points Solo for Text Spotting CVPR 2023 Decoupling Learning and Remembering: A Bilevel Memory Framework With Knowledge Projection for Task-Incremental Learning CVPR 2023 Referring Image Matting CVPR 2023 Hyper-Label-Graph: Modeling Branch-Level Dependencies of Labels for Hierarchical Multi-Label Text Classification ACML 2023 SRCB at SemEval-2023 Task 1: Prompt Based and Cross-Modal Retrieval Enhanced Visual Word Sense Disambiguation ACL 2023 Chain of Thought Prompting Elicits Knowledge Augmentation ACL 2023 FC-KBQA: A Fine-to-Coarse Composition Framework for Knowledge Base Question Answering ACL 2023 OSP2B: One-Stage Point-to-Box Network for 3D Siamese Tracking IJCAI 2023 Constrained Policy Optimization with Explicit Behavior Density For Offline Reinforcement Learning NIPS 2023 A Generation-based Deductive Method for Math Word Problems EMNLP 2023 FFAEval: Evaluating Dialogue System via Free-For-All Ranking EMNLP 2023 SPLIT: Stance and Persuasion Prediction with Multi-modal on Image and Textual Information EMNLP 2023 Label Distribution Changing Learning with Sample Space Expanding JMLR 2023 Feature Decomposition for Reducing Negative Transfer: A Novel Multi-Task Learning Method for Recommender System (Student Abstract) AAAI 2023 MPMQA: Multimodal Question Answering on Product Manuals AAAI 2023 SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model NIPS 2023 DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models NIPS 2023 LPFF: A Portrait Dataset for Face Generators Across Large Poses ICCV 2023 Domain Specified Optimization for Deployment Authorization ICCV 2023 Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning ICCV 2023 RPEFlow: Multimodal Fusion of RGB-PointCloud-Event for Joint Optical Flow and Scene Flow Estimation ICCV 2023 Multimodal Variational Auto-encoder based Audio-Visual Segmentation ICCV 2023 ESSAformer: Efficient Transformer for Hyperspectral Image Super-resolution ICCV 2023 Model Calibration in Dense Classification with Adaptive Label Perturbation ICCV 2023 P2C: Self-Supervised Point Cloud Completion from Single Partial Clouds ICCV 2023 RESDSQL: Decoupling Schema Linking and Skeleton Parsing for Text-to-SQL AAAI 2023 DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer AAAI 2023 GLT-T: Global-Local Transformer Voting for 3D Single Object Tracking in Point Clouds AAAI 2023 CLAMP: Prompt-Based Contrastive Learning for Connecting Language and Animal Pose CVPR 2023 SRCB at SemEval-2023 Task 1: Prompt Based and Cross-Modal Retrieval Enhanced Visual Word Sense Disambiguation SEMEVAL 2023 Dynamic Focus-Aware Positional Queries for Semantic Segmentation CVPR 2023 Explicit Boundary Guided Semi-Push-Pull Contrastive Learning for Supervised Anomaly Detection CVPR 2023 Leverage Interactive Affinity for Affordance Learning CVPR 2023 Modeling the Distributional Uncertainty for Salient Object Detection Models CVPR 2023 ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation NIPS 2022 DSM: Question Generation over Knowledge Base via Modeling Diverse Subgraphs with Meta-learner EMNLP 2022 Knowledge-augmented Self-training of A Question Rewriter for Conversational Knowledge Base Question Answering EMNLP 2022 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds CVPR 2022 DearKD: Data-Efficient Early Knowledge Distillation for Vision Transformers CVPR 2022 GMFlow: Learning Optical Flow via Global Matching CVPR 2022 Watermarking for Out-of-distribution Detection NIPS 2022 Recurrent Glimpse-Based Decoder for Detection With Transformer CVPR 2022 Long-range Sequence Modeling with Predictable Sparse Attention ACL 2022 Exploring Figure-Ground Assignment Mechanism in Perceptual Organization NIPS 2022 Learning Affordance Grounding From Exocentric Images CVPR 2022 ISNet: Shape Matters for Infrared Small Target Detection CVPR 2022 RU-Net: Regularized Unrolling Network for Scene Graph Generation CVPR 2022 CODE: Contrastive Pre-training with Adversarial Fine-Tuning for Zero-Shot Expert Linking AAAI 2022 Energy-Based Generative Cooperative Saliency Prediction AAAI 2022 FP-DETR: Detection Transformer Advanced by Fully Pre-training ICLR 2022 Incorporating Dual-Aware with Hierarchical Interactive Memory Networks for Task-Oriented Dialogue INTERSPEECH 2022 Transmission-Guided Bayesian Generative Model for Smoke Segmentation AAAI 2022 Inferring the Class Conditional Response Map for Weakly Supervised Semantic Segmentation WACV 2022 Modeling Aleatoric Uncertainty for Camouflaged Object Detection WACV 2022 FIBA: Frequency-Injection Based Backdoor Attack in Medical Image Analysis CVPR 2022 SRCB at SemEval-2022 Task 5: Pretraining Based Image to Text Late Sequential Fusion System for Multimodal Misogynous Meme Identification SEMEVAL 2022 Crowdsourcing with Meta-Knowledge Transfer (Student Abstract) AAAI 2022 APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking NIPS 2022 HOSMEL: A Hot-Swappable Modularized Entity Linking Toolkit for Chinese ACL 2022 Subgraph Retrieval Enhanced Model for Multi-hop Knowledge Base Question Answering ACL 2022 Siamese Network with Interactive Transformer for Video Object Segmentation AAAI 2022 MeshMAE: Masked Autoencoders for 3D Mesh Data Analysis ECCV 2022 Towards Data-Efficient Detection Transformers ECCV 2022 ReAct: Temporal Action Detection with Relational Queries ECCV 2022 FakeCLR: Exploring Contrastive Learning for Solving Latent Discontinuity in Data-Efficient GANs ECCV 2022 VSA: Learning Varied-Size Window Attention in Vision Transformers ECCV 2022 PolyphonicFormer: Unified Query Learning for Depth-Aware Video Panoptic Segmentation ECCV 2022 Improving RGB-D Point Cloud Registration by Learning Multi-Scale Local Linear Transformation ECCV 2022 RegionCL: Exploring Contrastive Region Pairs for Self-Supervised Representation Learning ECCV 2022 BMD: A General Class-Balanced Multicentric Dynamic Prototype Strategy for Source-Free Domain Adaptation ECCV 2022 Audioβ€”Visual Segmentation ECCV 2022 "Towards Scale-Aware, Robust, and Generalizable Unsupervised Monocular Depth Estimation by Integrating IMU Motion Dynamics" ECCV 2022 "JPerceiver: Joint Perception Network for Depth, Pose and Layout Estimation in Driving Scenes" ECCV 2022 Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition AAAI 2022 SCL-WC: Cross-Slide Contrastive Learning for Weakly-Supervised Whole-Slide Image Classification NIPS 2022 SAR-to-Optical Image Translation via Neural Partial Differential Equations IJCAI 2022 SASA: Semantics-Augmented Set Abstraction for Point-Based 3D Object Detection AAAI 2022 SRCB at SemEval-2022 Task 5: Pretraining Based Image to Text Late Sequential Fusion System for Multimodal Misogynous Meme Identification NAACL 2022 P-INT: A Path-based Interaction Model for Few-shot Knowledge Graph Completion EMNLP 2021 Learning Generative Vision Transformer with Energy-Based Latent Space for Saliency Prediction NIPS 2021 ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias NIPS 2021 Progressive One-shot Human Parsing AAAI 2021 Memory-Gated Recurrent Networks AAAI 2021 Continuous Self-Attention Models with Neural ODE Networks AAAI 2021 TA-MAMC at SemEval-2021 Task 4: Task-adaptive Pretraining and Multi-head Attention for Abstract Meaning Reading Comprehension ACL 2021 Simultaneously Localize, Segment and Rank the Camouflaged Objects CVPR 2021 Weakly Supervised Video Salient Object Detection CVPR 2021 Uncertainty-Aware Joint Salient Object and Camouflaged Object Detection CVPR 2021 A Pretraining Numerical Reasoning Model for Ordinal Constrained Question Answering on Knowledge Base EMNLP 2021 Out-of-Boundary View Synthesis Towards Full-Frame Video Stabilization ICCV 2021 RGB-D Saliency Detection via Cascaded Mutual Information Minimization ICCV 2021 Deep Automatic Natural Image Matting IJCAI 2021 One-Shot Affordance Detection IJCAI 2021 A Comprehensive Survey on Image Dehazing Based on Deep Learning IJCAI 2021 TA-MAMC at SemEval-2021 Task 4: Task-adaptive Pretraining and Multi-head Attention for Abstract Meaning Reading Comprehension IJCNLP 2021 TA-MAMC at SemEval-2021 Task 4: Task-adaptive Pretraining and Multi-head Attention for Abstract Meaning Reading Comprehension SEMEVAL 2021 Deep Degradation Prior for Low-Quality Image Classification CVPR 2020 UC-Net: Uncertainty Inspired RGB-D Saliency Detection via Conditional Variational Autoencoders CVPR 2020 Interactive Learning with Proactive Cognition Enhancement for Crowd Workers AAAI 2020 Auto Learning Attention NIPS 2020 BERT-INT:A BERT-based Interaction Model For Knowledge Graph Alignment IJCAI 2020 Grapy-ML: Graph Pyramid Mutual Learning for Cross-Dataset Human Parsing AAAI 2020 Learning Noise-Aware Encoder-Decoder from Noisy Labels by Alternating Back-Propagation for Saliency Detection ECCV 2020 Weakly-Supervised Salient Object Detection via Scribble Annotations CVPR 2020 Direct estimation of fetal head circumference from ultrasound images based on regression CNN MIDL 2020 MirrorGAN: Learning Text-To-Image Generation by Redescription CVPR 2019 Deep Multiple-Attribute-Perceived Network for Real-World Texture Recognition ICCV 2019 Category Anchor-Guided Unsupervised Domain Adaptation for Semantic Segmentation NIPS 2019 Learn, Imagine and Create: Text-to-Image Generation from Prior Knowledge NIPS 2019 ShieldNets: Defending Against Adversarial Attacks Using Probabilistic Adversarial Robustness CVPR 2019 Few-Shot Learning via Saliency-Guided Hallucination of Samples CVPR 2019 Machine Learning with Crowdsourcing: A Brief Summary of the Past Research and Future Directions AAAI 2019 A Framework to Coordinate Segmentation and Recognition AAAI 2019 Hierarchical Reinforcement Learning for Course Recommendation in MOOCs AAAI 2019 Multi-Level Deep Cascade Trees for Conversion Rate Prediction in Recommendation System AAAI 2019 Importance Weighted Adversarial Nets for Partial Domain Adaptation CVPR 2018 Deep Unsupervised Saliency Detection: A Multiple Noisy Labeling Perspective CVPR 2018 Fast Haze Removal for Nighttime Image Using Maximum Reflectance Prior CVPR 2017 Joint Geometrical and Statistical Alignment for Visual Domain Adaptation CVPR 2017 CEKA: A Tool for Mining the Wisdom of Crowds JMLR 2015 Social Influence Locality for Modeling Retweeting Behaviors IJCAI 2013 Speechalator: Two-Way Speech-to-Speech Translation in Your Hand NAACL 2003