Weicheng Ma

37 papers · 2019–2025 · 8 conferences · across top CS/AI conferences

Achievements

+12 more ↓

🏃 Academic Marathon (6) 🌍 Conference Polyglot (8) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird

🗺️ Taxonomy Completionist (66) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (8) 🧬 Topic Evolution 🔬 Deep Specialist (12) 🏆 Keyword Champion (2) 🤝 Dynamic Duo (35) 🗃️ Keyword Collector (162) ❓ The Questioner (3) ⚡ Prolific Year (6) 💎 Century Club (37) 🔥 Unstoppable (7)

Conferences

EMNLP (12) ACL (8) IJCNLP (5) NAACL (5) SEMEVAL (4) AAAI (1) AACL (1) COLING (1)

Top co-authors

Soroush Vosoughi (35) Lili Wang (14) Ruibo Liu (6) Ivory Yang (5) Hefan Zhang (4) Saeed Hassanpour (4) Farnoosh Hashemi (3) Renze Lou (3) Joice Chen (3) Yakoob Khan (3)

Keywords

text classification (12) large language model (7) data augmentation (5) transformer model (5) natural language understanding (4) ensemble model (3) lexical complexity (3) attention head (3) sequence labeling (3) endangered language (3) sentiment analysis (3) multilingual model (3) feature engineering (3) attention map (3) hate speech detection (3) pairwise comparison (2) synthetic data generation (2) model pruning (2) transformer network (2) cross-lingual transfer (2)

Papers

What is it? Towards a Generalizable Native American Language Identification System NAACL 2025 Judging the Judges: A Systematic Study of Position Bias in LLM-as-a-Judge AACL 2025 NüshuRescue: Reviving the Endangered Nüshu Language with AI COLING 2025 Enhancing LLM-Based Persuasion Simulations with Cultural and Speaker-Specific Information EMNLP 2025 Scalable and Culturally Specific Stereotype Dataset Construction via Human-LLM Collaboration EMNLP 2025 Data to Defense: The Role of Curation in Aligning Large Language Models Against Safety Compromise EMNLP 2025 Is It Navajo? Accurate Language Detection for Endangered Athabaskan Languages NAACL 2025 Communication Makes Perfect: Persuasion Dataset Construction via Multi-LLM Communication NAACL 2025 Judging the Judges: A Systematic Study of Position Bias in LLM-as-a-Judge IJCNLP 2025 A Generalizable Rhetorical Strategy Annotation Model Using LLM-based Debate Simulation and Labelling EMNLP 2025 Simulated Misinformation Susceptibility (SMISTS): Enhancing Misinformation Research with Large Language Model Simulations ACL 2024 Is GPT-4V (ision) All You Need for Automating Academic Data Visualization? Exploring Vision-Language Models’ Capability in Reproducing Academic Charts EMNLP 2024 Deciphering Stereotypes in Pre-Trained Language Models EMNLP 2023 Intersectional Stereotypes in Large Language Models: Dataset and Analysis EMNLP 2023 Improving Syntactic Probing Correctness and Robustness with Control Tasks ACL 2023 DartmouthCS at SemEval-2022 Task 8: Predicting Multilingual News Article Similarity with Meta-Information and Translation SEMEVAL 2022 EnCBP: A New Benchmark Dataset for Finer-Grained Cultural Background Prediction in English ACL 2022 Capturing Topic Framing via Masked Language Modeling EMNLP 2022 Dartmouth at SemEval-2022 Task 6: Detection of Sarcasm NAACL 2022 DartmouthCS at SemEval-2022 Task 8: Predicting Multilingual News Article Similarity with Meta-Information and Translation NAACL 2022 Dartmouth at SemEval-2022 Task 6: Detection of Sarcasm SEMEVAL 2022 BigGreen at SemEval-2021 Task 1: Lexical Complexity Prediction with Assembly Models SEMEVAL 2021 Improvements and Extensions on Metaphor Detection ACL 2021 Contributions of Transformer Attention Heads in Multi- and Cross-lingual Tasks IJCNLP 2021 BigGreen at SemEval-2021 Task 1: Lexical Complexity Prediction with Assembly Models IJCNLP 2021 Lone Pine at SemEval-2021 Task 5: Fine-Grained Detection of Hate Speech Using BERToxic IJCNLP 2021 Improvements and Extensions on Metaphor Detection IJCNLP 2021 Lone Pine at SemEval-2021 Task 5: Fine-Grained Detection of Hate Speech Using BERToxic ACL 2021 Lone Pine at SemEval-2021 Task 5: Fine-Grained Detection of Hate Speech Using BERToxic SEMEVAL 2021 Embedding Heterogeneous Networks into Hyperbolic Space Without Meta-path AAAI 2021 BigGreen at SemEval-2021 Task 1: Lexical Complexity Prediction with Assembly Models ACL 2021 Contributions of Transformer Attention Heads in Multi- and Cross-lingual Tasks ACL 2021 GradTS: A Gradient-Based Automatic Auxiliary Task Selection Method Based on Transformer Networks EMNLP 2021 Data Boost: Text Data Augmentation Through Reinforcement Learning Guided Conditional Generation EMNLP 2020 Multi-resolution Annotations for Emoji Prediction EMNLP 2020 An Empirical Survey of Unsupervised Text Representation Methods on Twitter Data EMNLP 2020 ChiMed: A Chinese Medical Corpus for Question Answering ACL 2019