Shi Yu
19 papers · 2017–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
π Academic Marathon (8) π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (6) π Cross-Pollinator (11)
π
Conference Polyglot
(6)
π
Academic Marathon
(8)
π
Renaissance Researcher
(6)
π€
Dynamic Duo
(10)
π§¬
Topic Evolution
π
Trend Setter
β‘
Prolific Year
(9)
π
Century Club
(17)
ποΈ
Keyword Collector
(80)
π₯
Unstoppable
(5)
Conferences
ACL (7)
EMNLP (4)
COLING (2)
ICLR (2)
INTERSPEECH (2)
EACL (1)
IJCNLP (1)
Top co-authors
Keywords
retrieval-augmented generation
(4)
vowel epenthesis
(2)
language model pretraining
(2)
dense retrieval
(2)
information retrieval
(2)
zero-shot learning
(2)
semantic embedding
(2)
self-supervised learning
(1)
text annotation
(1)
direct preference optimization
(1)
named entity recognition
(1)
part-of-speech tagging
(1)
model adaptation
(1)
natural language processing
(1)
dynamic time warping
(1)
factual accuracy
(1)
query expansion
(1)
exemplar models
(1)
knowledge refinement
(1)
document ranking
(1)
Papers
ThinkNote: Enhancing Knowledge Integration and Utilization of Large Language Models via Constructivist Cognition Modeling
EACL 2026
CheckRLM: Effective KnowledgeβThought Coherence Checking in Retrieval-Augmented Reasoning
ACL 2026
RankCoT: Refining Knowledge for Retrieval-Augmented Generation through Ranking Chain-of-Thoughts
ACL 2025
Craw4LLM: Efficient Web Crawling for LLM Pretraining
ACL 2025
ExpandR: Teaching Dense Retrievers Beyond Queries with LLM Guidance
EMNLP 2025
RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework
ACL 2025
Multi-Modal Multi-Granularity Tokenizer for Chu Bamboo Slips
COLING 2025
KBAlign: Efficient Self Adaptation on Specific Textual Knowledge Bases
EMNLP 2025
DeepNote: Note-Centric Deep Retrieval-Augmented Generation
EMNLP 2025
VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents
ICLR 2025
RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards
ICLR 2025
Fusion-in-T5: Unifying Variant Signals for Simple and Effective Document Ranking with Attention Fusion
COLING 2024
Structure-Aware Language Model Pretraining Improves Dense Retrieval on Structured Data
ACL 2023
Augmentation-Adapted Retriever Improves Generalization of Language Models as Generic Plug-In
ACL 2023
MIC: A Multi-task Interactive Curation Tool
EMNLP 2022
Speech Perception and Loanword Adaptations: The Case of Copy-Vowel Epenthesis
INTERSPEECH 2021
Named Entity Recognition through Deep Representation Learning and Weak Supervision
IJCNLP 2021
Named Entity Recognition through Deep Representation Learning and Weak Supervision
ACL 2021
Predicting Epenthetic Vowel Quality from Acoustics
INTERSPEECH 2017