Sheng Jin

45 papers · 2018–2026 · 7 conferences · across top CS/AI conferences

Achievements

+9 more ↓

🏃 Academic Marathon (7) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (7) 🐝 Cross-Pollinator (7)

🌍 Conference Polyglot (7) 🏃 Academic Marathon (7) 🌈 Renaissance Researcher (7) 🤝 Dynamic Duo (26) 🔥 Unstoppable (8) 📈 Trend Setter 💎 Century Club (44) 🗃️ Keyword Collector (172) ⚡ Prolific Year (12)

Conferences

AAAI (9) CVPR (9) ECCV (9) NIPS (6) ICCV (5) ICLR (5) EMNLP (2)

Top co-authors

Wentao Liu (26) Chen Qian (21) Ping Luo (17) Lumin Xu (13) Wanli Ouyang (10) Wang Zeng (7) Size Wu (6) Wenwei Zhang (5) Chen Change Loy (5) Chao Chen (4)

Keywords

human pose estimation (5) large language model (5) knowledge distillation (4) image segmentation (3) unsupervised learning (3) vision-language model (2) unsupervised domain adaptation (2) monocular 3d detection (2) adversarial learning (2) domain adaptation (2) neural architecture search (2) depth estimation (2) feature representation (2) out-of-distribution detection (2) zero-shot learning (2) data augmentation (2) object detection (2) transfer learning (2) multimodal learning (2) video understanding (2)

Papers

EduGuardBench: A Holistic Benchmark for Evaluating the Pedagogical Fidelity and Adversarial Safety of LLMs as Simulated Teachers AAAI 2026 Ultra-High Resolution Segmentation via Boundary-Enhanced Patch-Merging Transformer AAAI 2025 AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks AAAI 2025 Harmonizing Visual Representations for Unified Multimodal Understanding and Generation ICCV 2025 EMNLP: Educator-role Moral and Normative Large Language Models Profiling EMNLP 2025 Evolution in Simulation: AI-Agent School with Dual Memory for High-Fidelity Educational Dynamics EMNLP 2025 F-LMM: Grounding Frozen Large Multimodal Models CVPR 2025 NADER: Neural Architecture Design via Multi-Agent Collaboration CVPR 2025 Frame-Voyager: Learning to Query Frames for Video Large Language Models ICLR 2025 Unsupervised Continual Domain Shift Learning with Multi-Prototype Modeling CVPR 2025 Weakly Supervised Monocular 3D Detection with a Single-View Image CVPR 2024 MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders NIPS 2024 KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension NIPS 2024 GKGNet: Group K-Nearest Neighbor based Graph Convolutional Network for Multi-Label Image Recognition ECCV 2024 You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception ECCV 2024 UniFS: Universal Few-shot Instance Perception with Point Representations ECCV 2024 When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset ECCV 2024 CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction ICLR 2024 LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors ICLR 2024 PROGRAM: PROtotype GRAph Model based Pseudo-Label Learning for Test-Time Adaptation ICLR 2024 CLIM: Contrastive Language-Image Mosaic for Region Representation AAAI 2024 Rethinking Out-of-Distribution Detection on Imbalanced Data Distribution NIPS 2024 Category-Extensible Out-of-Distribution Detection via Hierarchical Context Descriptions NIPS 2023 Domain Generalization via Balancing Training Difficulty and Model Capability ICCV 2023 Uncertainty-aware Unsupervised Multi-Object Tracking ICCV 2023 Aligning Bag of Regions for Open-Vocabulary Object Detection CVPR 2023 Pseudo-Labeled Auto-Curriculum Learning for Semi-Supervised Keypoint Localization ICLR 2022 Temporal Action Proposal Generation with Background Constraint AAAI 2022 Not All Tokens Are Equal: Human-Centric Visual Analysis via Token Clustering Transformer CVPR 2022 PoseTrans: A Simple yet Effective Pose Transformation Augmentation for Human Pose Estimation ECCV 2022 3D Interacting Hand Pose Estimation by Hand De-Occlusion and Removal ECCV 2022 Pose for Everything: Towards Category-Agnostic Pose Estimation ECCV 2022 ViPNAS: Efficient Video Pose Estimation via Neural Architecture Search CVPR 2021 When Human Pose Estimation Meets Robustness: Adversarial Algorithms and Benchmarks CVPR 2021 Asynchronous Teacher Guided Bit-wise Hard Mining for Online Hashing AAAI 2021 Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images ICCV 2021 Whole-Body Human Pose Estimation in the Wild ECCV 2020 Differentiable Hierarchical Graph Grouping for Multi-Person Pose Estimation ECCV 2020 HoMM: Higher-Order Moment Matching for Unsupervised Domain Adaptation AAAI 2020 RL-Duet: Online Music Accompaniment Generation Using Deep Reinforcement Learning AAAI 2020 When Counterpoint Meets Chinese Folk Melodies NIPS 2020 SSAH: Semi-Supervised Adversarial Deep Hashing with Self-Paced Hard Sample Generation AAAI 2020 Multi-Person Articulated Tracking With Spatial and Temporal Embeddings CVPR 2019 TRB: A Novel Triplet Representation for Understanding 2D Human Body ICCV 2019 Connectionist Temporal Classification with Maximum Entropy Regularization NIPS 2018