Tat-Seng Chua
221 papers · 2001–2026 · 16 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+19 more ↓ Show less ↑
π§ Keyword Pioneer π£ Hot Topic Early Bird πΊοΈ Taxonomy Completionist (28) π Interdisciplinary Bridge π Conference Polyglot (16)
π
Interdisciplinary Bridge
π£
Hot Topic Early Bird
π§
Keyword Pioneer
π
Keyword Trendsetter Combo
(4)
π
Conference Loyalist
(25)
π§¬
Topic Evolution
π€
Dynamic Duo
(26)
π
Grand Slam
π₯
Mega-Team
(32)
π
Triple Crown
π¬
Deep Specialist
(33)
π
Keyword Champion
β‘
Prolific Year
(12)
ποΈ
Keyword Collector
(82)
π
Trend Setter
π
Century Club
(219)
π₯
Unstoppable
(11)
β
The Questioner
(7)
π
Conference Pioneer
Conferences
ACL (50)
EMNLP (30)
AAAI (27)
CVPR (23)
ICLR (16)
IJCAI (15)
NIPS (13)
ICML (11)
COLING (9)
IJCNLP (8)
ECCV (6)
ICCV (6)
NAACL (3)
CONLL (2)
AACL (1)
EACL (1)
Top co-authors
Research topics
Keywords
large language model
(27)
multimodal learning
(19)
graph neural network
(11)
representation learning
(11)
recommender system
(9)
video understanding
(9)
transfer learning
(8)
vision-language model
(8)
zero-shot learning
(8)
neural network
(7)
attention mechanism
(7)
scene graph
(7)
video question answering
(7)
multimodal large language model
(7)
knowledge graph
(6)
contrastive learning
(6)
text generation
(6)
direct preference optimization
(6)
diffusion model
(5)
dialogue system
(5)
Papers
LLaVA-UHD v2: Exploiting Hierarchical Vision Granularity in MLLMs via Inverse Semantic Pyramid
AAAI 2026
Logic Unseen: Revealing the Logical Blindspots of Vision-Language Models
AAAI 2026
DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization
ICML 2025
Long-Term TalkingFace Generation via Motion-Prior Conditional Diffusion Model
ICML 2025
Causal-Entity Reflected Egocentric Traffic Accident Video Synthesis
ICCV 2025
Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program
ICCV 2025
Uncertainty-Driven Expert Control: Enhancing the Reliability of Medical Vision-Language Models
ICCV 2025
Measuring What Makes You Unique: Difference-Aware User Modeling for Enhancing LLM Personalization
ACL 2025
G2S: A General-to-Specific Learning Framework for Temporal Knowledge Graph Forecasting with Large Language Models
ACL 2025
Self-Improvement Towards Pareto Optimality: Mitigating Preference Conflicts in Multi-Objective Alignment
ACL 2025
Personalized Generation In Large Model Era: A Survey
ACL 2025
MPO: Multilingual Safety Alignment via Reward Gap Optimization
ACL 2025
Length Controlled Generation for Black-box LLMs
ACL 2025
Beware of Your Po! Measuring and Mitigating AI Safety Risks in Role-Play Fine-Tuning of LLMs
ACL 2025
Personalized Text Generation with Contrastive Activation Steering
ACL 2025
Knowledge Boundary of Large Language Models: A Survey
ACL 2025
Cracking the Code of Hallucination in LVLMs with Vision-aware Head Divergence
ACL 2025
Boosting Virtual Agent Learning and Reasoning: A Step-Wise, Multi-Dimensional, and Generalist Reward Model with Benchmark
ICML 2025
FOCoOp: Enhancing Out-of-Distribution Robustness in Federated Prompt Learning for Vision-Language Models
ICML 2025
AnyEdit: Edit Any Knowledge Encoded in Language Models
ICML 2025
On Path to Multimodal Generalist: General-Level and General-Bench
ICML 2025
Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene
CVPR 2025
Universal Scene Graph Generation
CVPR 2025
Zero-1-to-A: Zero-Shot One Image to Animatable Head Avatars Using Video Diffusion
CVPR 2025
Media Source Matters More Than Content: Unveiling Political Bias in LLM-Generated Citations
EMNLP 2025
FACT-AUDIT: An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models
ACL 2025
How to Enable Effective Cooperation Between Humans and NLP Models: A Survey of Principles, Formalizations, and Beyond
ACL 2025
AdaSteer: Your Aligned LLM is Inherently an Adaptive Jailbreak Defender
EMNLP 2025
Decoding in Latent Spaces for Efficient Inference in LLM-based Recommendation
EMNLP 2025
RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness
CVPR 2025
SILMM: Self-Improving Large Multimodal Models for Compositional Text-to-Image Generation
CVPR 2025
Attend and Enrich: Enhanced Visual Prompt for Zero-Shot Learning
AAAI 2025
Combating Multimodal LLM Hallucination via Bottom-Up Holistic Reasoning
AAAI 2025
Optimize Incompatible Parameters Through Compatibility-aware Knowledge Integration
AAAI 2025
LightPROF: A Lightweight Reasoning Framework for Large Language Model on Knowledge Graph
AAAI 2025
Aligning Large Language Models for Faithful Integrity Against Opposing Argument
AAAI 2025
A Federated Framework for LLM-based Recommendation
NAACL 2025
Hello Again! LLM-powered Personalized Agent for Long-term Dialogue
NAACL 2025
Fine-Grained Verifiers: Preference Modeling as Next-token Prediction in Vision-Language Alignment
ICLR 2025
TIGeR: Unifying Text-to-Image Generation and Retrieval with Large Multimodal Models
ICLR 2025
Towards Semantic Equivalence of Tokenization in Multimodal LLM
ICLR 2025
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models
ICLR 2025
Language Representations Can be What Recommenders Need: Findings and Potentials
ICLR 2025
Bridging Jensen Gap for Max-Min Group Fairness Optimization in Recommendation
ICLR 2025
NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation
ICLR 2025
Efficient Inference for Large Language Model-based Generative Recommendation
ICLR 2025
Neural Causal Graph for Interpretable and Intervenable Classification
ICLR 2025
EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering
CVPR 2025
Preference Diffusion for Recommendation
ICLR 2025
STEP: Enhancing Video-LLMs' Compositional Reasoning by Spatio-Temporal Graph-guided Self-Training
CVPR 2025
Temporally and Distributionally Robust Optimization for Cold-Start Recommendation
AAAI 2024
Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching
ECCV 2024
Disentangling Masked Autoencoders for Unsupervised Domain Generalization
ECCV 2024
LLaVA-UHD: an LMM Perceiving any Aspect Ratio and High-Resolution Images
ECCV 2024
On Softmax Direct Preference Optimization for Recommendation
NIPS 2024
Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
NIPS 2024
ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation
NIPS 2024
Towards Neuron Attributions in Multi-Modal Large Language Models
NIPS 2024
A Survey on Neural Question Generation: Methods, Applications, and Prospects
IJCAI 2024
NExT-Chat: An LMM for Chat, Detection and Segmentation
ICML 2024
NExT-GPT: Any-to-Any Multimodal LLM
ICML 2024
Improving Expressive Power of Spectral Graph Neural Networks with Eigenvalue Correction
AAAI 2024
GOODAT: Towards Test-Time Graph Out-of-Distribution Detection
AAAI 2024
Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning
ICML 2024
Auto-Encoding Morph-Tokens for Multimodal LLM
ICML 2024
Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions
ICLR 2024
Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization
ICLR 2024
Towards 3D Molecule-Text Interpretation in Language Models
ICLR 2024
Plug-and-Play Policy Planner for Large Language Model Powered Dialogue Agents
ICLR 2024
Analyzing Temporal Complex Events with Large Language Models? A Benchmark towards Temporal, Long Context Understanding
ACL 2024
ProtT3: Protein-to-Text Generation for Text-based Protein Understanding
ACL 2024
Chain-of-Exemplar: Enhancing Distractor Generation for Multimodal Educational Question Generation
ACL 2024
On the Multi-turn Instruction Following for Conversational Web Agents
ACL 2024
CLAMBER: A Benchmark of Identifying and Clarifying Ambiguous Information Needs in Large Language Models
ACL 2024
Generative Cross-Modal Retrieval: Memorizing Images in Multimodal Language Models for Retrieval and Beyond
ACL 2024
XNLP: An Interactive Demonstration System for Universal Structured NLP
ACL 2024
ReactXT: Understanding Molecular βReaction-shipβ via Reaction-Contextualized Molecule-Text Pretraining
ACL 2024
STYLE: Improving Domain Transferability of Asking Clarification Questions in Large Language Model Powered Conversational Agents
ACL 2024
Distillation Enhanced Generative Retrieval
ACL 2024
Doc2SoarGraph: Discrete Reasoning over Visually-Rich Table-Text Documents via Semantic-Oriented Hierarchical Graphs
COLING 2024
From Multimodal LLM to Human-level AI: Modality, Instruction, Reasoning, Efficiency and beyond
COLING 2024
Think Twice Before Trusting: Self-Detection for Large Language Models through Comprehensive Answer Reflection
EMNLP 2024
Ask-before-Plan: Proactive Language Agents for Real-World Planning
EMNLP 2024
A Study of Implicit Ranking Unfairness in Large Language Models
EMNLP 2024
Beyond Persuasion: Towards Conversational Recommender System with Credible Explanations
EMNLP 2024
Donβt Just Say βI donβt knowβ! Self-aligning Large Language Models for Responding to Unknown Questions with Explanations
EMNLP 2024
Strength Lies in Differences! Improving Strategy Planning for Non-collaborative Dialogues via Diversified User Simulation
EMNLP 2024
Dysen-VDM: Empowering Dynamics-aware Text-to-Video Diffusion with LLMs
CVPR 2024
RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback
CVPR 2024
Can I Trust Your Answer? Visually Grounded Video Question Answering
CVPR 2024
Discriminative Probing and Tuning for Text-to-Image Generation
CVPR 2024
LASO: Language-guided Affordance Segmentation on 3D Object
CVPR 2024
Abductive Ego-View Accident Video Understanding for Safe Driving Perception
CVPR 2024
Discovering Spatio-Temporal Rationales for Video Question Answering
ICCV 2023
LLMDet: A Third Party Large Language Models Generated Text Detection Tool
EMNLP 2023
Video-Audio Domain Generalization via Confounder Disentanglement
AAAI 2023
Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-Based Active Learning
CVPR 2023
A Survey on Proactive Dialogue Systems: Problems, Methods, and Prospects
IJCAI 2023
FakeSV: A Multimodal Benchmark with Rich Social Context for Fake News Detection on Short Video Platforms
AAAI 2023
Robust Prompt Optimization for Large Language Models Against Distribution Shifts
EMNLP 2023
Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators
EMNLP 2023
Imagine That! Abstract-to-Intricate Text-to-Image Synthesis with Scene Graph Hallucination Diffusion
NIPS 2023
Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules
NIPS 2023
MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter
EMNLP 2023
Visually Grounded Commonsense Knowledge Acquisition
AAAI 2023
Prompting and Evaluating Large Language Models for Proactive Dialogues: Clarification, Target-guided, and Non-collaboration
EMNLP 2023
A Comprehensive Evaluation of Large Language Models on Legal Judgment Prediction
EMNLP 2023
MacLaSa: Multi-Aspect Controllable Text Generation via Efficient Sampling from Compact Latent Space
EMNLP 2023
VPGTrans: Transfer Visual Prompt Generator across LLMs
NIPS 2023
DiaASQ: A Benchmark of Conversational Aspect-based Sentiment Quadruple Analysis
ACL 2023
Two Heads Are Better Than One: Improving Fake News Video Detection by Correlating with Neighbors
ACL 2023
Constructing Code-mixed Universal Dependency Forest for Unbiased Cross-lingual Relation Extraction
ACL 2023
Improving Named Entity Recognition via Bridge-based Domain Adaptation
ACL 2023
Hypothetical Training for Robust Machine Reading Comprehension of Tabular Context
ACL 2023
Goal Awareness for Conversational AI: Proactivity, Non-collaborativity, and Beyond
ACL 2023
Reasoning Implicit Sentiment with Chain-of-Thought Prompting
ACL 2023
Information Screening whilst Exploiting! Multimodal Relation Extraction with Feature Denoising and Multimodal Topic Modeling
ACL 2023
Generating Visual Spatial Description via Holistic 3D Scene Understanding
ACL 2023
Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination
ACL 2023
Cross2StrA: Unpaired Cross-lingual Image Captioning with Cross-lingual Cross-modal Structure-pivoted Alignment
ACL 2023
Empowering Collaborative Filtering with Principled Adversarial Contrastive Loss
NIPS 2023
Boosting Causal Discovery via Adaptive Sample Reweighting
ICLR 2023
Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models
ICCV 2023
Discovering Invariant Rationales for Graph Neural Networks
ICLR 2022
Incorporating Bias-aware Margins into Contrastive Loss for Collaborative Filtering
NIPS 2022
LasUIE: Unifying Information Extraction with Latent Adaptive Structure-aware Generative Language Model
NIPS 2022
Rethinking the Two-Stage Framework for Grounded Situation Recognition
AAAI 2022
Video as Conditional Graph Hierarchy for Multi-Granular Question Answering
AAAI 2022
Learning to Imagine: Integrating Counterfactual Thinking in Neural Discrete Reasoning
ACL 2022
Invariant Grounding for Video Question Answering
CVPR 2022
Fine-Grained Scene Graph Generation with Data Transfer
ECCV 2022
Video Graph Transformer for Video Question Answering
ECCV 2022
ConReader: Exploring Implicit Relations in Contracts for Contract Clause Extraction
EMNLP 2022
Video Question Answering: Datasets, Algorithms and Challenges
EMNLP 2022
PACIFIC: Towards Proactive Conversational Question Answering over Tabular and Textual Data in Finance
EMNLP 2022
PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models
EMNLP 2022
Semi-supervised New Slot Discovery with Incremental Clustering
EMNLP 2022
Let Invariant Rationale Discovery Inspire Graph Contrastive Learning
ICML 2022
Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction
NAACL 2021
How Knowledge Graph and Attention Help? A Qualitative Analysis into Bag-level Relation Extraction
IJCNLP 2021
How Knowledge Graph and Attention Help? A Qualitative Analysis into Bag-level Relation Extraction
ACL 2021
Empowering Language Understanding with Counterfactual Reasoning
ACL 2021
TAT-QA: A Question Answering Benchmark on a Hybrid of Tabular and Textual Content in Finance
IJCNLP 2021
Towards Multi-Grained Explainability for Graph Neural Networks
NIPS 2021
Have We Solved The Hard Problem? Itβs Not Easy! Contextual Lexical Contrast as a Means to Probe Neural Coherence
AAAI 2021
Conceptualized and Contextualized Gaussian Embedding
AAAI 2021
Empowering Language Understanding with Counterfactual Reasoning
IJCNLP 2021
TAT-QA: A Question Answering Benchmark on a Hybrid of Tabular and Textual Content in Finance
ACL 2021
Few-Shot 3D Point Cloud Semantic Segmentation
CVPR 2021
NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions
CVPR 2021
Re-examining the Role of Schema Linking in Text-to-SQL
EMNLP 2020
Visual Relation Grounding in Videos
ECCV 2020
SESS: Self-Ensembling Semi-Supervised 3D Object Detection
CVPR 2020
Hyperbolic Visual Embedding Learning for Zero-Shot Recognition
CVPR 2020
Semantic Graphs for Generating Deep Questions
ACL 2020
Expertise Style Transfer: A New Task Towards Better Communication between Experts and Laymen
ACL 2020
Neural Sparse Voxel Fields
NIPS 2020
Learning Goal-oriented Dialogue Policy with opposite Agent Awareness
AACL 2020
Heuristic Black-Box Adversarial Attacks on Video Recognition Models
AAAI 2020
Zero-Shot Ingredient Recognition by Multi-Relational Graph Convolutional Network
AAAI 2020
Image Enhanced Event Detection in News Articles
AAAI 2020
Solving Sequential Text Classification as Board-Game Playing
AAAI 2020
Multi-Source Domain Adaptation for Visual Sentiment Classification
AAAI 2020
Mining Unfollow Behavior in Large-Scale Online Social Networks via Spatial-Temporal Interaction
AAAI 2020
PEIA: Personality and Emotion Integrated Attentive Model for Music Recommendation on Social Media Platforms
AAAI 2020
Exploring and Evaluating Attributes, Values, and Structures for Entity Alignment
EMNLP 2020
Semi-supervised Entity Alignment via Joint Knowledge Embedding Model and Cross-graph Model
IJCNLP 2019
TransNFCM: Translation-Based Neural Fashion Compatibility Modeling
AAAI 2019
Meta-Transfer Learning for Few-Shot Learning
CVPR 2019
A Whole New Ball Game: Harvesting Game Data for Player Profiling
AAAI 2019
Semi-supervised Entity Alignment via Joint Knowledge Embedding Model and Cross-graph Model
EMNLP 2019
Enhancing Stock Movement Prediction with Adversarial Training
IJCAI 2019
Multi-Channel Graph Neural Network for Entity Alignment
ACL 2019
Graph Neural Networks with Generated Parameters for Relation Extraction
ACL 2019
Learning to Self-Train for Semi-Supervised Few-Shot Classification
NIPS 2019
Low-Resource Name Tagging Learned with Weakly Labeled Data
EMNLP 2019
Low-Resource Name Tagging Learned with Weakly Labeled Data
IJCNLP 2019
Explainable Reasoning over Knowledge Graphs for Recommendation
AAAI 2019
Cross-Domain Depression Detection via Harvesting Social Media
IJCAI 2018
Temporally Grounding Natural Sentence in Video
EMNLP 2018
Affective Image Content Analysis: A Comprehensive Survey
IJCAI 2018
Quality Matters: Assessing cQA Pair Quality via Transductive Multi-View Learning
IJCAI 2018
Improving Implicit Recommender Systems with View Data
IJCAI 2018
Outer Product-based Neural Collaborative Filtering
IJCAI 2018
Depression Detection via Harvesting Social Media: A Multimodal Dictionary Learning Solution
IJCAI 2017
SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning
CVPR 2017
Representativeness-aware Aspect Analysis for Brand Monitoring in Social Media
IJCAI 2017
Attentional Factorization Machines: Learning the Weight of Feature Interactions via Attention Networks
IJCAI 2017
Visual Translation Embedding Network for Visual Relation Detection
CVPR 2017
Online Collaborative Learning for Open-Vocabulary Visual Classifiers
CVPR 2016
Generative Topic Embedding: a Continuous Representation of Documents
ACL 2016
DrMAD: Distilling Reverse-Mode Automatic Differentiation for Optimizing Hyperparameters of Deep Neural Networks
IJCAI 2016
What Does Social Media Say about Your Stress?
IJCAI 2016
Interest Inference via Structure-Constrained Multi-Source Multi-Task Learning
IJCAI 2015
Learning Image and User Features for Recommendation in Social Networks
ICCV 2015
Catch the Black Sheep: Unified Framework for Shilling Attack Detection Based on Fraudulent Action Propagation
IJCAI 2015
Answering Opinion Questions on Products by Exploiting Hierarchical Organization of Consumer Reviews
EMNLP 2012
Answering Opinion Questions on Products by Exploiting Hierarchical Organization of Consumer Reviews
CONLL 2012
SSHLDA: A Semi-Supervised Hierarchical Topic Model
CONLL 2012
A Semi-Supervised Bayesian Network Model for Microblog Topic Classification
COLING 2012
The Use of Dependency Relation Graph to Enhance the Term Weighting in Question Retrieval
COLING 2012
SSHLDA: A Semi-Supervised Hierarchical Topic Model
EMNLP 2012
Community Answer Summarization for Multi-Sentence Question with Group L1 Regularization
ACL 2012
Domain-Assisted Product Aspect Hierarchy Generation: Towards Hierarchical Organization of Unstructured Consumer Reviews
EMNLP 2011
Aspect Ranking: Identifying Important Product Aspects from Online Consumer Reviews
ACL 2011
Exploiting Salient Patterns for Question Detection and Question Retrieval in Community-based Question Answering
COLING 2010
Query Segmentation Based on Eigenspace Similarity
IJCNLP 2009
Summarizing Definition from Wikipedia
ACL 2009
Query Segmentation Based on Eigenspace Similarity
ACL 2009
Summarizing Definition from Wikipedia
IJCNLP 2009
Modeling Context in Scenario Template Creation
IJCNLP 2008
A Multi-resolution Framework for Information Extraction from Free Text
ACL 2007
ARE: Instance Splitting Strategies for Dependency Relation-Based Information Extraction
ACL 2006
ARE: Instance Splitting Strategies for Dependency Relation-Based Information Extraction
COLING 2006
Paraphrase Recognition via Dissimilarity Significance Classification
EMNLP 2006
Cascading Use of Soft and Hard Matching Pattern Rules for Weakly Supervised Information Extraction
COLING 2004
Web-based List Question Answering
COLING 2004
QUALIFIER: Question Answering by Lexical Fabric and External Resources
EACL 2003
Extracting Key Semantic Terms from Chinese Speech Query for Web Searches
ACL 2003
An Agent-based Approach to Chinese Named Entity Recognition
COLING 2002
Building Semantic Perceptron Net for Topic Spotting
ACL 2001