← Resources & Methods

Natural Language Processing › Resources & Methods ›

Text Representation

2246 directly classified papers

Papers per year

Papers

Topics as Entity Clusters: Entity-based Topics from Large Language Models and Graph Neural Networks COLING 2024

Understanding How Positional Encodings Work in Transformer Model COLING 2024

Willkommens-Merkel, Chaos-Johnson, and Tore-Klose: Modeling the Evaluative Meaning of German Personal Name Compounds COLING 2024

Locally Measuring Cross-lingual Lexical Alignment: A Domain and Word Level Perspective EMNLP 2024

From Reading to Compressing: Exploring the Multi-document Reader for Prompt Compression EMNLP 2024

Teanga Data Model for Linked Corpora COLING 2024

Unsupervised Authorship Attribution for Medieval Latin Using Transformer-Based Embeddings COLING 2024

Information Parity: Measuring and Predicting the Multilingual Capabilities of Language Models EMNLP 2024

AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data Filters ACL 2024

Are ELECTRA’s Sentence Embeddings Beyond Repair? The Case of Semantic Textual Similarity EMNLP 2024

Can we teach language models to gloss endangered languages? EMNLP 2024

Towards Automatic Composition of ASP Programs from Natural Language Specifications IJCAI 2024

Toeing the Party Line: Election Manifestos as a Key to Understand Political Discourse on Twitter EMNLP 2024

Automatic Reconstruction of Ancient Chinese Pronunciations EMNLP 2024

Linguistically Conditioned Semantic Textual Similarity ACL 2024

LongWanjuan: Towards Systematic Measurement for Long Text Quality EMNLP 2024

Learning Semantic Structure through First-Order-Logic Translation EMNLP 2024

Tokenization Falling Short: On Subword Robustness in Large Language Models EMNLP 2024

Making Sentence Embeddings Robust to User-Generated Content COLING 2024

Unitxt: Flexible, Shareable and Reusable Data Preparation and Evaluation for Generative AI NAACL 2024

RAAMove: A Corpus for Analyzing Moves in Research Article Abstracts COLING 2024

SynTQA: Synergistic Table-based Question Answering via Mixture of Text-to-SQL and E2E TQA EMNLP 2024

Axis Tour: Word Tour Determines the Order of Axes in ICA-transformed Embeddings EMNLP 2024

AMenDeD: Modelling Concepts by Aligning Mentions, Definitions and Decontextualised Embeddings COLING 2024

SpeciaLex: A Benchmark for In-Context Specialized Lexicon Learning EMNLP 2024