Jiajun Deng
41 papers · 2019–2025 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+11 more ↓ Show less ↑
π Conference Polyglot (8) π Interdisciplinary Bridge π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (12) π Academic Marathon (6)
π
Academic Marathon
(6)
π
Cross-Pollinator
(5)
π
Renaissance Researcher
(6)
π§¬
Topic Evolution
π€
Dynamic Duo
(14)
π¬
Deep Specialist
(11)
π
Trend Setter
ποΈ
Keyword Collector
(168)
π
Century Club
(41)
β‘
Prolific Year
(14)
π₯
Unstoppable
(7)
Conferences
INTERSPEECH (13)
ICCV (11)
ECCV (6)
AAAI (4)
CVPR (3)
NIPS (2)
ICML (1)
IJCAI (1)
Top co-authors
Keywords
automatic speech recognition
(6)
speaker adaptation
(5)
3d object detection
(5)
domain adaptation
(4)
autonomous driving
(4)
weakly supervised learning
(3)
point cloud
(3)
object detection
(3)
speech recognition
(3)
multimodal learning
(3)
weakly supervised object detection
(2)
model quantization
(2)
vision-language model
(2)
semi-supervised learning
(2)
visual grounding
(2)
multi-modal learning
(2)
neural architecture search
(2)
depth estimation
(2)
bird's eye view
(2)
conformer model
(2)
Papers
GraspCoT: Integrating Physical Property Reasoning for 6-DoF Grasping under Flexible Language Instructions
ICCV 2025
3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer
CVPR 2025
SpatialSplat: Efficient Semantic 3D from Sparse Unposed Images
ICCV 2025
S3R-GS: Streamlining the Pipeline for Large-Scale Street Scene Reconstruction
ICCV 2025
RaCFormer: Towards High-Quality 3D Object Detection via Query-based Radar-Camera Fusion
CVPR 2025
Self-Classification Enhancement and Correction for Weakly Supervised Object Detection
IJCAI 2025
Hierarchical Masked Autoregressive Models with Low-Resolution Token Pivots
ICML 2025
SynTag: Enhancing the Geometric Robustness of Inversion-based Generative Image Watermarking
ICCV 2025
Hierarchical Temporal Context Learning for Camera-based Semantic Scene Completion
ECCV 2024
Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition
INTERSPEECH 2024
Cycle-Consistency Learning for Captioning and Grounding
AAAI 2024
Revisiting Open-Set Panoptic Segmentation
AAAI 2024
One-pass Multiple Conformer and Foundation Speech Systems Compression and Quantization Using An All-in-one Neural Model
INTERSPEECH 2024
Towards Effective and Efficient Non-autoregressive Decoding Using Block-based Attention Mask
INTERSPEECH 2024
Agent3D-Zero: An Agent for Zero-shot 3D Understanding
ECCV 2024
End-to-End Rate-Distortion Optimized 3D Gaussian Representation
ECCV 2024
Use of Speech Impairment Severity for Dysarthric Speech Recognition
INTERSPEECH 2023
CLIP4HOI: Towards Adapting CLIP for Practical Zero-Shot HOI Detection
NIPS 2023
Bi-LRFusion: Bi-Directional LiDAR-Radar Fusion for 3D Dynamic Object Detection
CVPR 2023
3DPPE: 3D Point Positional Encoding for Transformer-based Multi-Camera 3D Object Detection
ICCV 2023
SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning
ICCV 2023
Invariant Training 2D-3D Joint Hard Samples for Few-Shot Point Cloud Recognition
ICCV 2023
Masked Motion Predictors are Strong 3D Action Representation Learners
ICCV 2023
Cyclic-Bootstrap Labeling for Weakly Supervised Object Detection
ICCV 2023
Hyper-parameter Adaptation of Conformer ASR Systems for Elderly and Dysarthric Speech Recognition
INTERSPEECH 2023
Towards Effective and Compact Contextual Representation for Conformer Transducer Speech Recognition Systems
INTERSPEECH 2023
Exploiting Cross-Domain And Cross-Lingual Ultrasound Tongue Imaging Features For Elderly And Dysarthric Speech Recognition
INTERSPEECH 2023
CluB: Cluster Meets BEV for LiDAR-Based 3D Object Detection
NIPS 2023
Lossless 4-bit Quantization of Architecture Compressed Conformer ASR Systems on the 300-hr Switchboard Corpus
INTERSPEECH 2023
Factorised Speaker-environment Adaptive Training of Conformer Speech Recognition Systems
INTERSPEECH 2023
Two-pass Decoding and Cross-adaptation Based System Combination of End-to-end Conformer and Hybrid TDNN ASR Systems
INTERSPEECH 2022
Conformer Based Elderly Speech Recognition System for Alzheimerβs Disease Detection
INTERSPEECH 2022
Geometric Representation Learning for Document Image Rectification
ECCV 2022
CMD: Self-Supervised 3D Action Representation Learning with Cross-Modal Mutual Distillation
ECCV 2022
Confidence Score Based Conformer Speaker Adaptation for Speech Recognition
INTERSPEECH 2022
TransVG: End-to-End Visual Grounding With Transformers
ICCV 2021
Bayesian Parametric and Architectural Domain Adaptation of LF-MMI Trained TDNNs for Elderly and Dysarthric Speech Recognition
INTERSPEECH 2021
Instance Mining with Class Feature Banks for Weakly Supervised Object Detection
AAAI 2021
Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection
AAAI 2021
Adaptive Offline Quintuplet Loss for Image-Text Matching
ECCV 2020
Relation Distillation Networks for Video Object Detection
ICCV 2019