Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Resources & Methods
Natural Language Processing
›
Resources & Methods
›
Text Representation
2246 directly classified papers
Papers per year
2006: 2
2007: 4
2008: 1
2009: 6
2010: 2
2011: 3
2012: 3
2013: 7
2014: 7
2015: 4
2016: 30
2017: 126
2018: 177
2019: 231
2020: 245
2021: 296
2022: 240
2023: 210
2024: 292
2025: 297
2026: 63
Papers
Topics as Entity Clusters: Entity-based Topics from Large Language Models and Graph Neural Networks
COLING 2024
Understanding How Positional Encodings Work in Transformer Model
COLING 2024
Willkommens-Merkel, Chaos-Johnson, and Tore-Klose: Modeling the Evaluative Meaning of German Personal Name Compounds
COLING 2024
Locally Measuring Cross-lingual Lexical Alignment: A Domain and Word Level Perspective
EMNLP 2024
From Reading to Compressing: Exploring the Multi-document Reader for Prompt Compression
EMNLP 2024
Teanga Data Model for Linked Corpora
COLING 2024
Unsupervised Authorship Attribution for Medieval Latin Using Transformer-Based Embeddings
COLING 2024
Information Parity: Measuring and Predicting the Multilingual Capabilities of Language Models
EMNLP 2024
AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data Filters
ACL 2024
Are ELECTRA’s Sentence Embeddings Beyond Repair? The Case of Semantic Textual Similarity
EMNLP 2024
Can we teach language models to gloss endangered languages?
EMNLP 2024
Towards Automatic Composition of ASP Programs from Natural Language Specifications
IJCAI 2024
Toeing the Party Line: Election Manifestos as a Key to Understand Political Discourse on Twitter
EMNLP 2024
Automatic Reconstruction of Ancient Chinese Pronunciations
EMNLP 2024
Linguistically Conditioned Semantic Textual Similarity
ACL 2024
LongWanjuan: Towards Systematic Measurement for Long Text Quality
EMNLP 2024
Learning Semantic Structure through First-Order-Logic Translation
EMNLP 2024
Tokenization Falling Short: On Subword Robustness in Large Language Models
EMNLP 2024
Making Sentence Embeddings Robust to User-Generated Content
COLING 2024
Unitxt: Flexible, Shareable and Reusable Data Preparation and Evaluation for Generative AI
NAACL 2024
RAAMove: A Corpus for Analyzing Moves in Research Article Abstracts
COLING 2024
SynTQA: Synergistic Table-based Question Answering via Mixture of Text-to-SQL and E2E TQA
EMNLP 2024
Axis Tour: Word Tour Determines the Order of Axes in ICA-transformed Embeddings
EMNLP 2024
AMenDeD: Modelling Concepts by Aligning Mentions, Definitions and Decontextualised Embeddings
COLING 2024
SpeciaLex: A Benchmark for In-Context Specialized Lexicon Learning
EMNLP 2024
<
1
…
19
20
21
…
90
>