Weicheng Ma
37 papers · 2019–2025 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
๐ Academic Marathon (6) ๐ Conference Polyglot (8) ๐ Interdisciplinary Bridge ๐งญ Keyword Pioneer ๐ฃ Hot Topic Early Bird
๐บ๏ธ
Taxonomy Completionist
(66)
๐
Interdisciplinary Bridge
๐
Conference Polyglot
(8)
๐งฌ
Topic Evolution
๐ฌ
Deep Specialist
(12)
๐
Keyword Champion
(2)
๐ค
Dynamic Duo
(35)
๐๏ธ
Keyword Collector
(162)
โ
The Questioner
(3)
โก
Prolific Year
(6)
๐
Century Club
(37)
๐ฅ
Unstoppable
(7)
Conferences
EMNLP (12)
ACL (8)
IJCNLP (5)
NAACL (5)
SEMEVAL (4)
AAAI (1)
AACL (1)
COLING (1)
Top co-authors
Keywords
text classification
(12)
large language model
(7)
data augmentation
(5)
transformer model
(5)
natural language understanding
(4)
ensemble model
(3)
lexical complexity
(3)
attention head
(3)
sequence labeling
(3)
endangered language
(3)
sentiment analysis
(3)
multilingual model
(3)
feature engineering
(3)
attention map
(3)
hate speech detection
(3)
pairwise comparison
(2)
synthetic data generation
(2)
model pruning
(2)
transformer network
(2)
cross-lingual transfer
(2)
Papers
What is it? Towards a Generalizable Native American Language Identification System
NAACL 2025
Judging the Judges: A Systematic Study of Position Bias in LLM-as-a-Judge
AACL 2025
NรผshuRescue: Reviving the Endangered Nรผshu Language with AI
COLING 2025
Enhancing LLM-Based Persuasion Simulations with Cultural and Speaker-Specific Information
EMNLP 2025
Scalable and Culturally Specific Stereotype Dataset Construction via Human-LLM Collaboration
EMNLP 2025
Data to Defense: The Role of Curation in Aligning Large Language Models Against Safety Compromise
EMNLP 2025
Is It Navajo? Accurate Language Detection for Endangered Athabaskan Languages
NAACL 2025
Communication Makes Perfect: Persuasion Dataset Construction via Multi-LLM Communication
NAACL 2025
Judging the Judges: A Systematic Study of Position Bias in LLM-as-a-Judge
IJCNLP 2025
A Generalizable Rhetorical Strategy Annotation Model Using LLM-based Debate Simulation and Labelling
EMNLP 2025
Simulated Misinformation Susceptibility (SMISTS): Enhancing Misinformation Research with Large Language Model Simulations
ACL 2024
Is GPT-4V (ision) All You Need for Automating Academic Data Visualization? Exploring Vision-Language Modelsโ Capability in Reproducing Academic Charts
EMNLP 2024
Deciphering Stereotypes in Pre-Trained Language Models
EMNLP 2023
Intersectional Stereotypes in Large Language Models: Dataset and Analysis
EMNLP 2023
Improving Syntactic Probing Correctness and Robustness with Control Tasks
ACL 2023
DartmouthCS at SemEval-2022 Task 8: Predicting Multilingual News Article Similarity with Meta-Information and Translation
SEMEVAL 2022
EnCBP: A New Benchmark Dataset for Finer-Grained Cultural Background Prediction in English
ACL 2022
Capturing Topic Framing via Masked Language Modeling
EMNLP 2022
Dartmouth at SemEval-2022 Task 6: Detection of Sarcasm
NAACL 2022
DartmouthCS at SemEval-2022 Task 8: Predicting Multilingual News Article Similarity with Meta-Information and Translation
NAACL 2022
Dartmouth at SemEval-2022 Task 6: Detection of Sarcasm
SEMEVAL 2022
BigGreen at SemEval-2021 Task 1: Lexical Complexity Prediction with Assembly Models
SEMEVAL 2021
Improvements and Extensions on Metaphor Detection
ACL 2021
Contributions of Transformer Attention Heads in Multi- and Cross-lingual Tasks
IJCNLP 2021
BigGreen at SemEval-2021 Task 1: Lexical Complexity Prediction with Assembly Models
IJCNLP 2021
Lone Pine at SemEval-2021 Task 5: Fine-Grained Detection of Hate Speech Using BERToxic
IJCNLP 2021
Improvements and Extensions on Metaphor Detection
IJCNLP 2021
Lone Pine at SemEval-2021 Task 5: Fine-Grained Detection of Hate Speech Using BERToxic
ACL 2021
Lone Pine at SemEval-2021 Task 5: Fine-Grained Detection of Hate Speech Using BERToxic
SEMEVAL 2021
Embedding Heterogeneous Networks into Hyperbolic Space Without Meta-path
AAAI 2021
BigGreen at SemEval-2021 Task 1: Lexical Complexity Prediction with Assembly Models
ACL 2021
Contributions of Transformer Attention Heads in Multi- and Cross-lingual Tasks
ACL 2021
GradTS: A Gradient-Based Automatic Auxiliary Task Selection Method Based on Transformer Networks
EMNLP 2021
Data Boost: Text Data Augmentation Through Reinforcement Learning Guided Conditional Generation
EMNLP 2020
Multi-resolution Annotations for Emoji Prediction
EMNLP 2020
An Empirical Survey of Unsupervised Text Representation Methods on Twitter Data
EMNLP 2020
ChiMed: A Chinese Medical Corpus for Question Answering
ACL 2019