Pengfei Hu
20 papers · 2019–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+9 more ↓ Show less ↑
π Renaissance Researcher (5) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (10) π§ Keyword Pioneer π Conference Polyglot (9)
π
Conference Polyglot
(9)
π
Academic Marathon
(6)
π
Cross-Pollinator
(10)
π§¬
Topic Evolution
π
Keyword Champion
(2)
π₯
Unstoppable
(5)
β‘
Prolific Year
(6)
π
Century Club
(19)
ποΈ
Keyword Collector
(123)
Conferences
INTERSPEECH (7)
AAAI (5)
EMNLP (2)
ACL (1)
CVPR (1)
ICCV (1)
ICLR (1)
IJCAI (1)
NIPS (1)
Top co-authors
Keywords
document analysis
(4)
large language model
(3)
video generation
(2)
end-to-end speech recognition
(2)
acoustic model
(2)
speech recognition
(2)
hierarchical structure
(2)
automatic speech recognition
(2)
table structure recognition
(2)
phonetic reduction
(2)
document parsing
(2)
attention mechanism
(2)
electronic health record
(2)
language model adaptation
(1)
data augmentation
(1)
machine translation
(1)
object detection
(1)
transfer learning
(1)
knowledge editing
(1)
document understanding
(1)
Papers
Video SimpleQA: Towards Factuality Evaluation in Large Video Language Models
AAAI 2026
RFL: Simplifying Chemical Structure Recognition with Ring-Free Language
AAAI 2025
Joint Knowledge Editing for Information Enrichment and Probability Promotion
AAAI 2025
DocMamba: Efficient Document Pre-training with State Space Model
AAAI 2025
MedPlan: A Two-Stage RAG-Based System for Personalized Medical Plan Generation
ACL 2025
DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking head Video Generation
ICLR 2025
No Black Boxes: Interpretable and Interactable Predictive Healthcare with Knowledge-Enhanced Agentic Causal Discovery
EMNLP 2025
SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding
NIPS 2024
SEMv3: A Fast and Robust Approach to Table Separation Line Detection
IJCAI 2024
UniTabNet: Bridging Vision and Language Models for Enhanced Table Structure Recognition
EMNLP 2024
A Method of Audio-Visual Person Verification by Mining Connections between Time Series
INTERSPEECH 2023
HRDoc: Dataset and Baseline Method toward Hierarchical Reconstruction of Document Structures
AAAI 2023
Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video
ICCV 2023
Defensive Patches for Robust Recognition in the Physical World
CVPR 2022
MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic Speaker Verification
INTERSPEECH 2022
PM-MMUT: Boosted Phone-mask Data Augmentation using Multi-Modeling Unit Training for Phonetic-Reduction-Robust E2E Speech Recognition
INTERSPEECH 2022
Linguistic-Acoustic Similarity Based Accent Shift for Accent Recognition
INTERSPEECH 2022
The TNT Team System Descriptions of Cantonese and Mongolian for IARPA OpenASR20
INTERSPEECH 2021
Leveraging Phone Mask Training for Phonetic-Reduction-Robust E2E Uyghur Speech Recognition
INTERSPEECH 2021
Multimedia Simultaneous Translation System for Minority Language Communication with Mandarin
INTERSPEECH 2019