Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
Text Representation
2246 directly classified papers
Papers per year
2006: 2
2007: 4
2008: 1
2009: 6
2010: 2
2011: 3
2012: 3
2013: 7
2014: 7
2015: 4
2016: 30
2017: 126
2018: 177
2019: 231
2020: 245
2021: 296
2022: 240
2023: 210
2024: 292
2025: 297
2026: 63
Papers
Unsupervised Sentence Representation Learning with Syntactically Aligned Negative Samples
NAACL 2025
Principal Parts Detection for Computational Morphology: Task, Models and Benchmark
CoNLL 2025
TeCoFeS: Text Column Featurization using Semantic Analysis
NAACL 2025
Word2Vec4Kids: Interactive Challenges to Introduce Middle School Students to Word Embeddings
AAAI 2025
Zero-Shot Cross-Sentential Scientific Relation Extraction via Entity-Guided Summarization
IJCNLP 2025
A Large and Balanced Corpus for Fine-grained Arabic Readability Assessment
ACL 2025
Beyond Film Subtitles: Is YouTube the Best Approximation of Spoken Vocabulary?
COLING 2025
CLEME2.0: Towards Interpretable Evaluation by Disentangling Edits for Grammatical Error Correction
ACL 2025
Adapting General-Purpose Embedding Models to Private Datasets Using Keyword-based Retrieval
ACL 2025
Factuality Beyond Coherence: Evaluating LLM Watermarking Methods for Medical Texts
EMNLP 2025
Cost-Effective Discourse Annotation in the Prague Czech–English Dependency Treebank
COLING 2024
Construction of Paired Knowledge Graph - Text Datasets Informed by Cyclic Evaluation
COLING 2024
Creating Terminological Resources in the Digital Age for Less-resourced Languages
COLING 2024
CLAUSE-ATLAS: A Corpus of Narrative Information to Scale up Computational Literary Analysis
COLING 2024
Tokenisation in Machine Translation Does Matter: The impact of different tokenisation approaches for Maltese
ACL 2024
Computational Modelling of Plurality and Definiteness in Chinese Noun Phrases
COLING 2024
Creation and Analysis of an International Corpus of Privacy Laws
COLING 2024
AMenDeD: Modelling Concepts by Aligning Mentions, Definitions and Decontextualised Embeddings
COLING 2024
A New Massive Multilingual Dataset for High-Performance Language Technologies
COLING 2024
Monotonic Representation of Numeric Attributes in Language Models
ACL 2024
Where is the signal in tokenization space?
EMNLP 2024
A Persona-Based Corpus in the Diabetes Self-Care Domain - Applying a Human-Centered Approach to a Low-Resource Context
COLING 2024
Demonstration Retrieval-Augmented Generative Event Argument Extraction
COLING 2024
Towards a Transformer-Based Reverse Dictionary Model for Quality Estimation of Definitions (Student Abstract)
AAAI 2024
A Construction Grammar Corpus of Varying Schematicity: A Dataset for the Evaluation of Abstractions in Language Models
COLING 2024