Jen-tse Huang

30 papers · 2022–2026 · 7 conferences · across top CS/AI conferences

Achievements

+9 more ↓

🌍 Conference Polyglot (7) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (10) 🐝 Cross-Pollinator (12)

🐝 Cross-Pollinator (12) 🌈 Renaissance Researcher (8) 🤝 Dynamic Duo (21) 🔬 Deep Specialist (12) 🏆 Keyword Champion (2) 💎 Century Club (26) 🗃️ Keyword Collector (126) ❓ The Questioner ⚡ Prolific Year (9)

Conferences

ACL (11) EMNLP (10) ICLR (3) CVPR (2) ICML (2) NAACL (1) NIPS (1)

Top co-authors

Wenxuan Wang (23) Wenxiang Jiao (14) Youliang Yuan (13) Zhaopeng Tu (12) Michael Lyu (7) Michael R. Lyu (6) Pinjia He (6) Xiaoyuan Liu (4) Man Ho Lam (4) Eric John Li (4)

Keywords

large language model (14) benchmark evaluation (4) adversarial attack (3) prompt engineering (3) multi-agent system (3) role-playing agent (2) psychological evaluation (2) data augmentation (2) psychological scale (2) personality assessment (2) fairness evaluation (2) instruction following (2) fairness benchmark (2) multimodal large language model (2) commonsense knowledge (1) dialogue generation (1) visual question answering (1) neural machine translation (1) multilingual translation (1) logical reasoning (1)

Papers

FAIRGAMER: Evaluating Social Biases in LLM-Based Video Game NPCs ACL 2026 HumanLLM: Benchmarking and Improving LLM Anthropomorphism via Human Cognitive Patterns ACL 2026 JARVIS or Ultron? A Survey on the Safety and Security Threats of Computer-Using Agents ACL 2026 Curing Miracle Steps in LLM Mathematical Reasoning with Rubric Rewards ACL 2026 Chain-of-Jailbreak Attack for Image Generation Models via Step by Step Editing ACL 2025 VisBias: Measuring Explicit and Implicit Social Biases in Vision Language Models EMNLP 2025 Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training ACL 2025 Can’t See the Forest for the Trees: Benchmarking Multimodal Safety Awareness for Multimodal LLMs ACL 2025 Insight Over Sight: Exploring the Vision-Knowledge Conflicts in Multimodal LLMs ACL 2025 AI Sees Your Location—But With A Bias Toward The Wealthy World EMNLP 2025 UniDebugger: Hierarchical Multi-Agent Framework for Unified Software Debugging EMNLP 2025 Learning to Ask: When LLM Agents Meet Unclear Instruction EMNLP 2025 Where Fact Ends and Fairness Begins: Redefining AI Bias Evaluation through Cognitive Biases EMNLP 2025 Competing Large Language Models in Multi-Agent Gaming Environments ICLR 2025 On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty Agents ICML 2025 CoSER: Coordinating LLM-Based Persona Simulation of Established Roles ICML 2025 SOTOPIA-S4: a user-friendly system for flexible, customizable, and large-scale social simulation NAACL 2025 Apathetic or Empathetic? Evaluating LLMs' Emotional Alignments with Humans NIPS 2024 All Languages Matter: On the Multilingual Safety of LLMs ACL 2024 InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews ACL 2024 On the Humanity of Conversational AI: Evaluating the Psychological Portrayal of LLMs ICLR 2024 GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher ICLR 2024 On the Reliability of Psychological Scales on Large Language Models EMNLP 2024 InterIntent: Investigating Social Intelligence of LLMs via Intention Understanding in an Interactive Game Context EMNLP 2024 Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models ACL 2024 LogicAsker: Evaluating and Improving the Logical Reasoning Ability of Large Language Models EMNLP 2024 Improving the Transferability of Adversarial Samples by Path-Augmented Method CVPR 2023 ParroT: Translating during Chat using Large Language Models tuned with Human Translation and Feedback EMNLP 2023 Improving Adversarial Transferability via Neuron Attribution-Based Attacks CVPR 2022 Tencent’s Multilingual Machine Translation System for WMT22 Large-Scale African Languages EMNLP 2022