Ivan P. Yamshchikov
25 papers · 2018–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
π Cross-Pollinator (9) π Academic Marathon (7) π§ Keyword Pioneer π Conference Polyglot (7) π Renaissance Researcher (9)
π
Renaissance Researcher
(9)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(53)
π
Keyword Champion
(2)
π§¬
Topic Evolution
π€
Dynamic Duo
(12)
ποΈ
Keyword Collector
(124)
π
Century Club
(23)
π
Trend Setter
π₯
Unstoppable
(5)
β
The Questioner
(2)
β‘
Prolific Year
(8)
Conferences
EMNLP (10)
ACL (5)
COLING (3)
EACL (3)
AAAI (2)
IJCNLP (1)
NAACL (1)
Top co-authors
Keywords
text generation
(5)
language model
(4)
style transfer
(4)
text classification
(4)
large language model
(4)
low-resource language
(2)
benchmark evaluation
(2)
natural language processing
(2)
byte pair encoding
(2)
sentiment transfer
(2)
multimodal learning
(2)
text style transfer
(2)
semantic preservation
(2)
echo chamber
(2)
representation learning
(2)
authorship attribution
(2)
text analysis
(1)
knowledge representation
(1)
neural machine translation
(1)
dataset creation
(1)
Papers
Teaching Old Tokenizers New Words: Efficient Tokenizer Adaptation for Pretrained Models
EACL 2026
From Where Words Come: Efficient Regularization of Code Tokenizers Through Source Attribution
ACL 2026
Smotrom tvoja pΓ₯ ander drogoj verden! Resurrecting Dead Pidgin with Generative Models: Russenorsk Case Study
ACL 2025
Surface Fairness, Deep Bias: A Comparative Study of Bias in Language Models
ACL 2025
ComicScene154: A Scene Dataset for Comic Analysis
EMNLP 2025
Neural Machine Translation for Malayalam Paraphrase Generation
EACL 2024
LLMs Simulate Big5 Personality Traits: Further Evidence
EACL 2024
BPE Gets Picky: Efficient Vocabulary Refinement During Tokenizer Training
EMNLP 2024
Individuation in Neural Models with and without Visual Grounding
EMNLP 2024
Vygotsky Distance: Measure for Benchmark Task Similarity
COLING 2024
Knowledge Graph Representation for Political Information Sources
COLING 2024
Echo-chambers and Idea Labs: Communication Styles on Twitter
COLING 2024
Sui Generis: Large Language Models for Authorship Attribution and Verification in Latin
EMNLP 2024
Rehabilitating Homeless: Dataset and Key Insights
AAAI 2023
Post Turing: Mapping the landscape of LLM Evaluation
EMNLP 2023
What is Wrong with Language Models that Can Not Tell a Story?
ACL 2023
Do Data-based Curricula Work?
ACL 2022
BERT in Plutarchβs Shadows
EMNLP 2022
Style-transfer and Paraphrase: Looking for a Sensible Semantic Similarity Metric
AAAI 2021
StoryDB: Broad Multi-language Narrative Dataset
EMNLP 2021
Decomposing Textual Information For Style Transfer
EMNLP 2019
Style Transfer for Texts: Retrain, Report Errors, Compare with Rewrites
IJCNLP 2019
Dyr Bul Shchyl. Proxying Sound Symbolism With Word Embeddings
NAACL 2019
Style Transfer for Texts: Retrain, Report Errors, Compare with Rewrites
EMNLP 2019
Sounds Wilde. Phonetically Extended Embeddings for Author-Stylized Poetry Generation
EMNLP 2018