Feiyang Kang
7 papers · 2023–2025 · 4 top CS/AI conferences
Achievements
Conference Polyglot (4)
Renaissance Researcher (5)
Interdisciplinary Bridge
Taxonomy Completionist (18)
Keyword Pioneer
Cross-Pollinator (14)
Conferences
ICLR (3)
EMNLP (2)
CVPR (1)
NIPS (1)
Keywords
large language model (2)
optimal transport (1)
information retrieval (1)
empirical study (1)
model interpretability (1)
gradient-based method (1)
gradient computation (1)
synthetic data (1)
data selection (1)
scaling law (1)
training data attribution (1)
influence function (1)
model collapse (1)
data attribution (1)
data mixture (1)
training data (1)
knowledge tracing (1)
influence estimation (1)
natural web data (1)
data influence estimation (1)
Papers
Demystifying Synthetic Data in LLM Pre-training: A Systematic Study of Scaling Laws, Benefits, and Pitfalls
EMNLP 2025
The Mirrored Influence Hypothesis: Efficient Data Influence Estimation by Harnessing Forward Passes
CVPR 2024
FASTTRACK: Reliable Fact Tracing via Clustering and LLM-Powered Evidence Validation
EMNLP 2024
Get more for less: Principled Data Selection for Warming Up Fine-Tuning in LLMs
ICLR 2024
Performance Scaling via Optimal Transport: Enabling Data Selection from Partially Revealed Sources
NIPS 2023
Towards Robustness Certification Against Universal Perturbations
ICLR 2023
LAVA: Data Valuation without Pre-Specified Learning Algorithms
ICLR 2023