Jianfeng Gao

268 papers · 2000–2026 · 16 conferences · across top CS/AI conferences

Achievements

+18 more ↓

🗺️ Taxonomy Completionist (22) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🐣 Hot Topic Early Bird

🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌟 Keyword Trendsetter Combo (12) 🏠 Conference Loyalist (24) 🏆 Keyword Champion (2) 🤝 Dynamic Duo (60) 🏆 Grand Slam 👑 Triple Crown 👥 Mega-Team (71) 🔬 Deep Specialist (29) 📈 Trend Setter 🔥 Unstoppable (24) 🚀 Conference Pioneer 💎 Century Club (267) 🗃️ Keyword Collector (674) ❓ The Questioner (3) ⚡ Prolific Year (23)

Conferences

ACL (53) EMNLP (51) NAACL (26) ICLR (25) NIPS (24) IJCNLP (23) CVPR (21) AAAI (11) ICML (9) COLING (7) ECCV (5) CONLL (3) EACL (3) ICCV (3) INTERSPEECH (3) IJCAI (1)

Top co-authors

Xiaodong Liu (60) Michel Galley (34) Chunyuan Li (33) Jianwei Yang (29) Baolin Peng (29) Hao Cheng (29) Pengcheng He (21) Xiaodong He (21) Xiujun Li (21) Bill Dolan (20)

Keywords

dialogue system (17) large language model (15) reinforcement learning (15) transfer learning (14) few-shot learning (13) zero-shot learning (13) language model (12) neural network (12) question answering (11) pre-trained language model (11) text generation (10) object detection (10) natural language understanding (10) image captioning (9) multi-task learning (9) vision-language model (8) response generation (8) dialogue policy (8) multimodal learning (7) visual question answering (7)

Papers

SynthAgent: Adapting Web Agents with Synthetic Supervision ACL 2026 Latent Action Pretraining from Videos ICLR 2025 Vector-ICL: In-context Learning with Continuous Vector Representations ICLR 2025 TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies ICLR 2025 MMInference: Accelerating Pre-filling for Long-Context Visual Language Models via Modality-Aware Permutation Sparse Attention ICML 2025 CollabLLM: From Passive Responders to Active Collaborators ICML 2025 Simplifying DINO via Coding Rate Regularization ICML 2025 Matryoshka Multimodal Models ICLR 2025 SimulatorArena: Are User Simulators Reliable Proxies for Multi-Turn Evaluation of AI Assistants? EMNLP 2025 SITE: towards Spatial Intelligence Thorough Evaluation ICCV 2025 Towards Consistent Natural-Language Explanations via Explanation-Consistency Finetuning COLING 2025 Magma: A Foundation Model for Multimodal AI Agents CVPR 2025 Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion CVPR 2025 Iterative Self-Tuning LLMs for Enhanced Jailbreaking Capabilities NAACL 2025 Diversifying the Expert Knowledge for Task-Agnostic Pruning in Sparse Mixture-of-Experts ACL 2025 Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward Pass ICLR 2025 ExACT: Teaching AI Agents to Explore with Reflective-MCTS and Exploratory Learning ICLR 2025 SCBench: A KV Cache-Centric Analysis of Long-Context Methods ICLR 2025 SeCom: On Memory Construction and Retrieval for Personalized Conversational Agents ICLR 2025 DataGen: Unified Synthetic Dataset Generation via Large Language Models ICLR 2025 GUI-World: A Video Benchmark and Dataset for Multimodal GUI-oriented Understanding ICLR 2025 DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs NIPS 2024 Compositional Generalization Across Distributional Shifts with Sparse Tree Operations NIPS 2024 Crafting Interpretable Embeddings for Language Neuroscience by Asking LLMs Questions NIPS 2024 Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models NAACL 2024 Teaching Language Models to Self-Improve through Interactive Demonstrations NAACL 2024 Position: TrustLLM: Trustworthiness in Large Language Models ICML 2024 Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs ICLR 2024 MindAgent: Emergent Gaming Interaction NAACL 2024 Visual In-Context Prompting CVPR 2024 Toward Compositional Behavior in Neural Models: A Survey of Current Views EMNLP 2024 SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading EMNLP 2024 ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial Attacks NAACL 2024 Fast-ELECTRA for Efficient Pre-training ICLR 2024 MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts ICLR 2024 Pix2Gif: Motion-Guided Diffusion for GIF Generation ECCV 2024 Segment and Recognize Anything at Any Granularity ECCV 2024 LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents ECCV 2024 LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models ECCV 2024 Language Models as Inductive Reasoners EACL 2024 Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs ICLR 2024 Is Self-Repair a Silver Bullet for Code Generation? ICLR 2024 Tree Prompting: Efficient Task Adaptation without Fine-Tuning EMNLP 2023 Localized Symbolic Knowledge Distillation for Visual Commonsense Models NIPS 2023 Bridging Discrete and Backpropagation: Straight-Through and Beyond NIPS 2023 Segment Everything Everywhere All at Once NIPS 2023 LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day NIPS 2023 Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models NIPS 2023 Guiding Large Language Models via Directional Stimulus Prompting NIPS 2023 Augmenting Language Models with Long-Term Memory NIPS 2023 Differentiable Tree Operations Promote Compositional Generalization ICML 2023 Understand and Modularize Generator Optimization in ELECTRA-style Pretraining ICML 2023 DIONYSUS: A Pre-trained Model for Low-Resource Dialogue Summarization ACL 2023 Chain-of-Skills: A Configurable Model for Open-Domain Question Answering ACL 2023 Z-Code++: A Pre-trained Language Model Optimized for Abstractive Summarization ACL 2023 Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers ACL 2023 Task-Aware Specialization for Efficient and Robust Dense Retrieval for Open-Domain Question Answering ACL 2023 Logical Transformers: Infusing Logical Structures into Pre-Trained Language Models ACL 2023 AutoMoE: Heterogeneous Mixture-of-Experts with Adaptive Computation for Efficient Neural Machine Translation ACL 2023 Learning Math Reasoning from Self-Sampled Correct and Partially-Correct Solutions ICLR 2023 DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing ICLR 2023 Visually-Augmented Language Modeling ICLR 2023 Optimizing Bi-Encoder for Named Entity Recognition via Contrastive Learning ICLR 2023 Learning Customized Visual Models With Retrieval-Augmented Knowledge CVPR 2023 GLIGEN: Open-Set Grounded Text-to-Image Generation CVPR 2023 Generalized Decoding for Pixel, Image, and Language CVPR 2023 Explaining Data Patterns in Natural Language with Language Models EMNLP 2023 Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding EMNLP 2023 ConvLab-3: A Flexible Dialogue System Toolkit Based on a Unified Data Format EMNLP 2023 Interactive Text Generation EMNLP 2023 RetGen: A Joint Framework for Retrieval and Grounded Text Generation Modeling AAAI 2022 ValueNet: A New Dataset for Human Value Driven Dialogue System AAAI 2022 Knowledge-Rich Self-Supervision for Biomedical Entity Linking EMNLP 2022 K-LITE: Learning Transferable Visual Models with External Knowledge NIPS 2022 CodeExp: Explanatory Code Document Generation EMNLP 2022 Grounded Language-Image Pre-Training CVPR 2022 Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention IJCAI 2022 Open-domain Question Answering via Chain of Reasoning over Heterogeneous Knowledge EMNLP 2022 Grounded Keys-to-Text Generation: Towards Factual Open-Ended Generation EMNLP 2022 Fault-Aware Neural Code Rankers NIPS 2022 RegionCLIP: Region-Based Language-Image Pretraining CVPR 2022 WebQA: Multihop and Multimodal QA CVPR 2022 GLIPv2: Unifying Localization and Vision-Language Understanding NIPS 2022 AdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning EMNLP 2022 Few-shot Task-agnostic Neural Architecture Search for Distilling Large Language Models NIPS 2022 Taming Sparsely Activated Transformer with Stochastic Experts ICLR 2022 No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models ICLR 2022 Efficient Self-supervised Vision Transformers for Representation Learning ICLR 2022 ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models NIPS 2022 Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone NIPS 2022 Knowledge-Grounded Dialogue Generation with a Unified Knowledge Representation NAACL 2022 KAT: A Knowledge Augmented Transformer for Vision-and-Language NAACL 2022 LiST: Lite Prompted Self-training Makes Parameter-efficient Few-shot Learners NAACL 2022 Open Domain Question Answering with A Unified Knowledge Interface ACL 2022 Unified Contrastive Learning in Image-Text-Label Space CVPR 2022 Focal Modulation Networks NIPS 2022 Data Augmentation for Spoken Language Understanding via Pretrained Language Models INTERSPEECH 2021 Posterior Differential Regularization with f-divergence for Improving Model Robustness NAACL 2021 Enriching Transformers with Structured Tensor-Product Representations for Abstractive Summarization NAACL 2021 Text Editing by Command NAACL 2021 Targeted Adversarial Training for Natural Language Understanding NAACL 2021 Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer NIPS 2021 Focal Attention for Long-Range Interactions in Vision Transformers NIPS 2021 DEBERTA: DECODING-ENHANCED BERT WITH DISENTANGLED ATTENTION ICLR 2021 UnitedQA: A Hybrid Approach for Open Domain Question Answering ACL 2021 Generation-Augmented Retrieval for Open-Domain Question Answering ACL 2021 RADDLE: An Evaluation Benchmark and Analysis Platform for Robust Task-oriented Dialog Systems ACL 2021 EmailSum: Abstractive Email Thread Summarization ACL 2021 Reader-Guided Passage Reranking for Open-Domain Question Answering ACL 2021 GO FIGURE: A Meta Evaluation of Factuality in Summarization ACL 2021 Token-wise Curriculum Learning for Neural Machine Translation EMNLP 2021 ARCH: Efficient Adversarial Regularized Training with Caching EMNLP 2021 NICE: Neural Image Commenting with Empathy EMNLP 2021 A Controllable Model of Grounded Response Generation AAAI 2021 Data Augmentation for Abstractive Query-Focused Multi-Document Summarization AAAI 2021 VIVO: Visual Vocabulary Pre-Training for Novel Object Captioning AAAI 2021 Contrastive Multi-document Question Generation EACL 2021 TACo: Token-Aware Cascade Contrastive Learning for Video-Text Alignment ICCV 2021 Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding ICCV 2021 Adversarial Regularization as Stackelberg Game: An Unrolled Optimization Approach EMNLP 2021 HittER: Hierarchical Transformers for Knowledge Graph Embeddings EMNLP 2021 Few-Shot Named Entity Recognition: An Empirical Baseline Study EMNLP 2021 VinVL: Revisiting Visual Representations in Vision-Language Models CVPR 2021 UnitedQA: A Hybrid Approach for Open Domain Question Answering IJCNLP 2021 Generation-Augmented Retrieval for Open-Domain Question Answering IJCNLP 2021 RADDLE: An Evaluation Benchmark and Analysis Platform for Robust Task-oriented Dialog Systems IJCNLP 2021 EmailSum: Abstractive Email Thread Summarization IJCNLP 2021 Reader-Guided Passage Reranking for Open-Domain Question Answering IJCNLP 2021 GO FIGURE: A Meta Evaluation of Factuality in Summarization IJCNLP 2021 SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization ACL 2020 PIQA: Reasoning about Physical Commonsense in Natural Language AAAI 2020 What Makes A Good Story? Designing Composite Rewards for Visual Storytelling AAAI 2020 Complementary Auxiliary Classifiers for Label-Conditional Text Generation AAAI 2020 Unified Vision-Language Pre-Training for Image Captioning and VQA AAAI 2020 MIND: A Large-scale Dataset for News Recommendation ACL 2020 The Microsoft Toolkit of Multi-Task Deep Neural Networks for Natural Language Understanding ACL 2020 ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems ACL 2020 DIALOGPT : Large-Scale Generative Pre-training for Conversational Response Generation ACL 2020 Conversation Learner - A Machine Teaching Tool for Building Dialog Managers for Task-Oriented Dialog Systems ACL 2020 Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-Training CVPR 2020 Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks ECCV 2020 PlotMachines: Outline-Conditioned Generation with Dynamic Plot State Tracking EMNLP 2020 Optimus: Organizing Sentences via Pre-trained Modeling of a Latent Space EMNLP 2020 Understanding the Difficulty of Training Transformers EMNLP 2020 Few-shot Natural Language Generation for Task-Oriented Dialog EMNLP 2020 RMM: A Recursive Mental Model for Dialogue Navigation EMNLP 2020 Guided Dialogue Policy Learning without Adversarial Learning in the Loop EMNLP 2020 RaCT: Toward Amortized Ranking-Critical Training For Collaborative Filtering ICLR 2020 On the Variance of the Adaptive Learning Rate and Beyond ICLR 2020 UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training ICML 2020 Mapping natural-language problems to formal-language solutions using structured neural representations ICML 2020 Feature Quantization Improves GAN Training ICML 2020 Structuring Latent Spaces for Stylized Response Generation EMNLP 2019 Robust Navigation with Language Pretraining and Stochastic Sampling EMNLP 2019 REO-Relevance, Extraness, Omission: A Fine-grained Evaluation for Image Captioning EMNLP 2019 Microsoft Icecaps: An Open-Source Toolkit for Conversation Modeling ACL 2019 DoubleTransfer at MEDIQA 2019: Multi-Source Transfer Learning for Natural Language Understanding in the Medical Domain ACL 2019 Towards Generating Long and Coherent Text with Multi-Level Latent Variable Models ACL 2019 Budgeted Policy Learning for Task-Oriented Dialogue Systems ACL 2019 Multi-Task Deep Neural Networks for Natural Language Understanding ACL 2019 Conversing by Reading: Contentful Neural Conversation with On-demand Machine Reading ACL 2019 Multi-step Reasoning via Recurrent Dual Attention for Visual Dialog ACL 2019 ConvLab: Multi-Domain End-to-End Dialog System Platform ACL 2019 Towards Coherent and Cohesive Long-form Text Generation NAACL 2019 Multi-task Learning with Sample Re-weighting for Machine Reading Comprehension NAACL 2019 Jointly Optimizing Diversity and Relevance in Neural Response Generation NAACL 2019 Unsupervised Deep Structured Semantic Models for Commonsense Reasoning NAACL 2019 Cyclical Annealing Schedule: A Simple Approach to Mitigating KL Vanishing NAACL 2019 Adversarial Domain Adaptation for Machine Reading Comprehension EMNLP 2019 Implicit Deep Latent Variable Models for Text Generation IJCNLP 2019 Adversarial Domain Adaptation for Machine Reading Comprehension IJCNLP 2019 TIGEr: Text-to-Image Grounding for Image Caption Evaluation IJCNLP 2019 Structuring Latent Spaces for Stylized Response Generation IJCNLP 2019 Robust Navigation with Language Pretraining and Stochastic Sampling IJCNLP 2019 Object-Driven Text-To-Image Synthesis via Adversarial Training CVPR 2019 Tactical Rewind: Self-Correction via Backtracking in Vision-And-Language Navigation CVPR 2019 Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation CVPR 2019 StoryGAN: A Sequential Conditional GAN for Story Visualization CVPR 2019 TIGEr: Text-to-Image Grounding for Image Caption Evaluation EMNLP 2019 A Hybrid Neural Network Model for Commonsense Reasoning EMNLP 2019 Interactive Semantic Parsing for If-Then Recipes via Hierarchical Reinforcement Learning AAAI 2019 Unified Language Model Pre-training for Natural Language Understanding and Generation NIPS 2019 Implicit Deep Latent Variable Models for Text Generation EMNLP 2019 REO-Relevance, Extraness, Omission: A Fine-grained Evaluation for Image Captioning IJCNLP 2019 Switch-Based Active Deep Dyna-Q: Efficient Adaptive Planning for Task-Completion Dialogue Policy Learning AAAI 2019 Neural Approaches to Conversational AI ACL 2018 Discriminative Deep Dyna-Q: Robust Planning for Dialogue Policy Learning EMNLP 2018 Subgoal Discovery for Hierarchical Dialogue Policy Learning EMNLP 2018 Discourse-Aware Neural Rewards for Coherent Text Generation NAACL 2018 Language-Based Image Editing With Recurrent Attentive Models CVPR 2018 Stochastic Answer Networks for Machine Reading Comprehension ACL 2018 Generating Informative and Diverse Conversational Responses via Adversarial Information Maximization NIPS 2018 Navigating with Graph Representations for Fast and Scalable Decoding of Neural Language Models NIPS 2018 M-Walk: Learning to Walk over Graphs using Monte Carlo Tree Search NIPS 2018 Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning ACL 2018 An Empirical Analysis of Multiple-Turn Reasoning Strategies in Reading Comprehension Tasks IJCNLP 2017 Semantic Compositional Networks for Visual Captioning CVPR 2017 StyleNet: Generating Attractive Visual Captions With Styles CVPR 2017 A Nested Attention Neural Hybrid Model for Grammatical Error Correction ACL 2017 Towards End-to-End Reinforcement Learning of Dialogue Agents for Information Access ACL 2017 Image-Grounded Conversations: Multimodal Context for Natural Question and Response Generation IJCNLP 2017 Multi-Task Learning for Speaker-Role Adaptation in Neural Conversation Models IJCNLP 2017 End-to-End Task-Completion Neural Dialogue Systems IJCNLP 2017 Open-Domain Neural Dialogue Systems IJCNLP 2017 Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep Reinforcement Learning EMNLP 2017 Multi-Domain Joint Semantic Frame Parsing Using Bi-Directional RNN-LSTM INTERSPEECH 2016 Deep Reinforcement Learning for Dialogue Generation EMNLP 2016 Deep Reinforcement Learning with a Natural Language Action Space ACL 2016 A Persona-Based Neural Conversation Model ACL 2016 Bi-directional Attention with Agreement for Dependency Parsing EMNLP 2016 Deep Reinforcement Learning with a Combinatorial Action Space for Predicting Popular Reddit Threads EMNLP 2016 A Diversity-Promoting Objective Function for Neural Conversation Models NAACL 2016 Stacked Attention Networks for Image Question Answering CVPR 2016 End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Language Understanding INTERSPEECH 2016 deltaBLEU: A Discriminative Metric for Generation Tasks with Intrinsically Diverse Targets ACL 2015 End-to-end Learning of LDA by Mirror-Descent Back Propagation over a Deep Architecture NIPS 2015 Semantic Parsing via Staged Query Graph Generation: Question Answering with Knowledge Base IJCNLP 2015 deltaBLEU: A Discriminative Metric for Generation Tasks with Intrinsically Diverse Targets IJCNLP 2015 Deep Learning and Continuous Representations for Natural Language Processing NAACL 2015 Representation Learning Using Multi-Task Deep Neural Networks for Semantic Classification and Information Retrieval NAACL 2015 From Captions to Visual Concepts and Back CVPR 2015 A Neural Network Approach to Context-Sensitive Generation of Conversational Responses NAACL 2015 Semantic Parsing via Staged Query Graph Generation: Question Answering with Knowledge Base ACL 2015 Large-scale Expected BLEU Training of Phrase-based Reordering Models EMNLP 2014 Learning Continuous Phrase Representations for Translation Modeling ACL 2014 Modeling Interestingness with Deep Neural Networks EMNLP 2014 Minimum Translation Modeling with Recurrent Neural Networks EACL 2014 Decoder Integration and Expected BLEU Training for Recurrent Neural Network Language Models ACL 2014 Training MRF-Based Phrase Translation Models using Gradient Ascent NAACL 2013 Beyond Left-to-Right: Multiple Decomposition Structures for SMT NAACL 2013 A Unified Approach to Transliteration-based Text Input with Online Spelling Correction EMNLP 2012 A Unified Approach to Transliteration-based Text Input with Online Spelling Correction CONLL 2012 Learning Lexicon Models from Search Logs for Query Expansion EMNLP 2012 Learning Lexicon Models from Search Logs for Query Expansion CONLL 2012 MSR SPLAT, a language analysis toolkit NAACL 2012 Domain Adaptation via Pseudo In-Domain Data Selection EMNLP 2011 Learning Phrase-Based Spelling Error Models from Clickthrough Data ACL 2010 A comparison of unsupervised methods for Part-of-Speech Tagging in Chinese COLING 2010 A Large Scale Ranker-Based System for Search Query Spelling Correction COLING 2010 Discovery of Term Variation in Japanese Web Search Queries EMNLP 2009 Model Adaptation via Model Interpolation and Boosting for Web Search Ranking EMNLP 2009 A Web-based English Proofing System for English as a Second Language Users IJCNLP 2008 Bayesian Semi-Supervised Chinese Word Segmentation for Statistical Machine Translation COLING 2008 Indirect-HMM-based Hypothesis Alignment for Combining Outputs from Machine Translation Systems EMNLP 2008 A comparison of Bayesian estimators for unsupervised Hidden Markov Model POS taggers EMNLP 2008 Using Contextual Speller Techniques and Language Modeling for ESL Error Correction IJCNLP 2008 A Comparative Study of Parameter Estimation Methods for Statistical Natural Language Processing ACL 2007 Compressing Trigram Language Models With Golomb Coding EMNLP 2007 Compressing Trigram Language Models With Golomb Coding CONLL 2007 Approximation Lasso Methods for Language Modeling ACL 2006 A DOM Tree Alignment Model for Mining Parallel Data from the Web ACL 2006 An Information-Theoretic Approach to Automatic Evaluation of Summaries NAACL 2006 Approximation Lasso Methods for Language Modeling COLING 2006 A DOM Tree Alignment Model for Mining Parallel Data from the Web COLING 2006 Transformation Based Chinese Entity Detection and Tracking IJCNLP 2005 A Comparative Study on Language Model Adaptation Techniques Using New Evaluation Metrics EMNLP 2005 Minimum Sample Risk Methods for Language Modeling EMNLP 2005 An Empirical Study on Language Model Adaptation Using a Metric of Domain Similarity IJCNLP 2005 Adaptive Chinese Word Segmentation ACL 2004 Unsupervised Learning of Dependency Structure for Language Modeling ACL 2003 Improved Source-Channel Models for Chinese Word Segmentation ACL 2003 Improving Language Model Size Reduction using Better Pruning Criteria ACL 2002 Chinese Named Entity Identification Using Class-based Language Model COLING 2002 Exploring Asymmetric Clustering for Statistical Language Modeling ACL 2002 Exploiting Headword Dependency and Predictive Clustering for Language Modeling EMNLP 2002 Extraction of Chinese Compound Words - An Experimental Study on a Very Large Corpus ACL 2000 PENS: A Machine-aided English Writing System for Chinese Users ACL 2000 Distribution-Based Pruning of Backoff Language Models ACL 2000