Jen-tse Huang
30 papers · 2022–2026 · 7 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (7)
Keyword Pioneer
Interdisciplinary Bridge
Taxonomy Completionist (10)
Cross-Pollinator (12)
Renaissance Researcher (8)
Dynamic Duo (21)
Deep Specialist (12)
Keyword Champion (2)
Century Club (26)
Keyword Collector (126)
The Questioner
Prolific Year (9)
Conferences
ACL (11)
EMNLP (10)
ICLR (3)
CVPR (2)
ICML (2)
NAACL (1)
NIPS (1)
Top co-authors
Keywords
large language model (14)
benchmark evaluation (4)
adversarial attack (3)
prompt engineering (3)
multi-agent system (3)
role-playing agent (2)
psychological evaluation (2)
data augmentation (2)
psychological scale (2)
personality assessment (2)
fairness evaluation (2)
instruction following (2)
fairness benchmark (2)
multimodal large language model (2)
commonsense knowledge (1)
dialogue generation (1)
visual question answering (1)
neural machine translation (1)
multilingual translation (1)
logical reasoning (1)
Papers
FAIRGAMER: Evaluating Social Biases in LLM-Based Video Game NPCs
ACL 2026
HumanLLM: Benchmarking and Improving LLM Anthropomorphism via Human Cognitive Patterns
ACL 2026
JARVIS or Ultron? A Survey on the Safety and Security Threats of Computer-Using Agents
ACL 2026
Curing Miracle Steps in LLM Mathematical Reasoning with Rubric Rewards
ACL 2026
Chain-of-Jailbreak Attack for Image Generation Models via Step by Step Editing
ACL 2025
VisBias: Measuring Explicit and Implicit Social Biases in Vision Language Models
EMNLP 2025
Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training
ACL 2025
Can't See the Forest for the Trees: Benchmarking Multimodal Safety Awareness for Multimodal LLMs
ACL 2025
Insight Over Sight: Exploring the Vision-Knowledge Conflicts in Multimodal LLMs
ACL 2025
AI Sees Your Location, But With A Bias Toward The Wealthy World
EMNLP 2025
UniDebugger: Hierarchical Multi-Agent Framework for Unified Software Debugging
EMNLP 2025
Learning to Ask: When LLM Agents Meet Unclear Instruction
EMNLP 2025
Where Fact Ends and Fairness Begins: Redefining AI Bias Evaluation through Cognitive Biases
EMNLP 2025
Competing Large Language Models in Multi-Agent Gaming Environments
ICLR 2025
On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty Agents
ICML 2025
CoSER: Coordinating LLM-Based Persona Simulation of Established Roles
ICML 2025
SOTOPIA-S4: a user-friendly system for flexible, customizable, and large-scale social simulation
NAACL 2025
Apathetic or Empathetic? Evaluating LLMs' Emotional Alignments with Humans
NIPS 2024
All Languages Matter: On the Multilingual Safety of LLMs
ACL 2024
InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews
ACL 2024
On the Humanity of Conversational AI: Evaluating the Psychological Portrayal of LLMs
ICLR 2024
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
ICLR 2024
On the Reliability of Psychological Scales on Large Language Models
EMNLP 2024
InterIntent: Investigating Social Intelligence of LLMs via Intention Understanding in an Interactive Game Context
EMNLP 2024
Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models
ACL 2024
LogicAsker: Evaluating and Improving the Logical Reasoning Ability of Large Language Models
EMNLP 2024
Improving the Transferability of Adversarial Samples by Path-Augmented Method
CVPR 2023
ParroT: Translating during Chat using Large Language Models tuned with Human Translation and Feedback
EMNLP 2023
Improving Adversarial Transferability via Neuron Attribution-Based Attacks
CVPR 2022
Tencent's Multilingual Machine Translation System for WMT22 Large-Scale African Languages
EMNLP 2022