Jianfeng Gao
268 papers · 2000–2026 · 16 conferences · across top CS/AI conferences
Achievements
Taxonomy Completionist (22)
Keyword Pioneer
Interdisciplinary Bridge
Renaissance Researcher (5)
Hot Topic Early Bird
Keyword Trendsetter Combo (12)
Conference Loyalist (24)
Keyword Champion (2)
Dynamic Duo (60)
Grand Slam
Triple Crown
Mega-Team (71)
Deep Specialist (29)
Trend Setter
Unstoppable (24)
Conference Pioneer
Century Club (267)
Keyword Collector (674)
The Questioner (3)
Prolific Year (23)
Conferences
ACL (53)
EMNLP (51)
NAACL (26)
ICLR (25)
NIPS (24)
IJCNLP (23)
CVPR (21)
AAAI (11)
ICML (9)
COLING (7)
ECCV (5)
CONLL (3)
EACL (3)
ICCV (3)
INTERSPEECH (3)
IJCAI (1)
Keywords
dialogue system (17)
large language model (15)
reinforcement learning (15)
transfer learning (14)
few-shot learning (13)
zero-shot learning (13)
language model (12)
neural network (12)
question answering (11)
pre-trained language model (11)
text generation (10)
object detection (10)
natural language understanding (10)
image captioning (9)
multi-task learning (9)
vision-language model (8)
response generation (8)
dialogue policy (8)
multimodal learning (7)
visual question answering (7)
Papers
SynthAgent: Adapting Web Agents with Synthetic Supervision
ACL 2026
Latent Action Pretraining from Videos
ICLR 2025
Vector-ICL: In-context Learning with Continuous Vector Representations
ICLR 2025
TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies
ICLR 2025
MMInference: Accelerating Pre-filling for Long-Context Visual Language Models via Modality-Aware Permutation Sparse Attention
ICML 2025
CollabLLM: From Passive Responders to Active Collaborators
ICML 2025
Simplifying DINO via Coding Rate Regularization
ICML 2025
Matryoshka Multimodal Models
ICLR 2025
SimulatorArena: Are User Simulators Reliable Proxies for Multi-Turn Evaluation of AI Assistants?
EMNLP 2025
SITE: towards Spatial Intelligence Thorough Evaluation
ICCV 2025
Towards Consistent Natural-Language Explanations via Explanation-Consistency Finetuning
COLING 2025
Magma: A Foundation Model for Multimodal AI Agents
CVPR 2025
Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion
CVPR 2025
Iterative Self-Tuning LLMs for Enhanced Jailbreaking Capabilities
NAACL 2025
Diversifying the Expert Knowledge for Task-Agnostic Pruning in Sparse Mixture-of-Experts
ACL 2025
Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward Pass
ICLR 2025
ExACT: Teaching AI Agents to Explore with Reflective-MCTS and Exploratory Learning
ICLR 2025
SCBench: A KV Cache-Centric Analysis of Long-Context Methods
ICLR 2025
SeCom: On Memory Construction and Retrieval for Personalized Conversational Agents
ICLR 2025
DataGen: Unified Synthetic Dataset Generation via Large Language Models
ICLR 2025
GUI-World: A Video Benchmark and Dataset for Multimodal GUI-oriented Understanding
ICLR 2025
DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs
NIPS 2024
Compositional Generalization Across Distributional Shifts with Sparse Tree Operations
NIPS 2024
Crafting Interpretable Embeddings for Language Neuroscience by Asking LLMs Questions
NIPS 2024
Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models
NAACL 2024
Teaching Language Models to Self-Improve through Interactive Demonstrations
NAACL 2024
Position: TrustLLM: Trustworthiness in Large Language Models
ICML 2024
Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs
ICLR 2024
MindAgent: Emergent Gaming Interaction
NAACL 2024
Visual In-Context Prompting
CVPR 2024
Toward Compositional Behavior in Neural Models: A Survey of Current Views
EMNLP 2024
SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading
EMNLP 2024
ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial Attacks
NAACL 2024
Fast-ELECTRA for Efficient Pre-training
ICLR 2024
MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts
ICLR 2024
Pix2Gif: Motion-Guided Diffusion for GIF Generation
ECCV 2024
Segment and Recognize Anything at Any Granularity
ECCV 2024
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
ECCV 2024
LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models
ECCV 2024
Language Models as Inductive Reasoners
EACL 2024
Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs
ICLR 2024
Is Self-Repair a Silver Bullet for Code Generation?
ICLR 2024
Tree Prompting: Efficient Task Adaptation without Fine-Tuning
EMNLP 2023
Localized Symbolic Knowledge Distillation for Visual Commonsense Models
NIPS 2023
Bridging Discrete and Backpropagation: Straight-Through and Beyond
NIPS 2023
Segment Everything Everywhere All at Once
NIPS 2023
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day
NIPS 2023
Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models
NIPS 2023
Guiding Large Language Models via Directional Stimulus Prompting
NIPS 2023
Augmenting Language Models with Long-Term Memory
NIPS 2023
Differentiable Tree Operations Promote Compositional Generalization
ICML 2023
Understand and Modularize Generator Optimization in ELECTRA-style Pretraining
ICML 2023
DIONYSUS: A Pre-trained Model for Low-Resource Dialogue Summarization
ACL 2023
Chain-of-Skills: A Configurable Model for Open-Domain Question Answering
ACL 2023
Z-Code++: A Pre-trained Language Model Optimized for Abstractive Summarization
ACL 2023
Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers
ACL 2023
Task-Aware Specialization for Efficient and Robust Dense Retrieval for Open-Domain Question Answering
ACL 2023
Logical Transformers: Infusing Logical Structures into Pre-Trained Language Models
ACL 2023
AutoMoE: Heterogeneous Mixture-of-Experts with Adaptive Computation for Efficient Neural Machine Translation
ACL 2023
Learning Math Reasoning from Self-Sampled Correct and Partially-Correct Solutions
ICLR 2023
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing
ICLR 2023
Visually-Augmented Language Modeling
ICLR 2023
Optimizing Bi-Encoder for Named Entity Recognition via Contrastive Learning
ICLR 2023
Learning Customized Visual Models With Retrieval-Augmented Knowledge
CVPR 2023
GLIGEN: Open-Set Grounded Text-to-Image Generation
CVPR 2023
Generalized Decoding for Pixel, Image, and Language
CVPR 2023
Explaining Data Patterns in Natural Language with Language Models
EMNLP 2023
Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding
EMNLP 2023
ConvLab-3: A Flexible Dialogue System Toolkit Based on a Unified Data Format
EMNLP 2023
Interactive Text Generation
EMNLP 2023
RetGen: A Joint Framework for Retrieval and Grounded Text Generation Modeling
AAAI 2022
ValueNet: A New Dataset for Human Value Driven Dialogue System
AAAI 2022
Knowledge-Rich Self-Supervision for Biomedical Entity Linking
EMNLP 2022
K-LITE: Learning Transferable Visual Models with External Knowledge
NIPS 2022
CodeExp: Explanatory Code Document Generation
EMNLP 2022
Grounded Language-Image Pre-Training
CVPR 2022
Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention
IJCAI 2022
Open-domain Question Answering via Chain of Reasoning over Heterogeneous Knowledge
EMNLP 2022
Grounded Keys-to-Text Generation: Towards Factual Open-Ended Generation
EMNLP 2022
Fault-Aware Neural Code Rankers
NIPS 2022
RegionCLIP: Region-Based Language-Image Pretraining
CVPR 2022
WebQA: Multihop and Multimodal QA
CVPR 2022
GLIPv2: Unifying Localization and Vision-Language Understanding
NIPS 2022
AdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning
EMNLP 2022
Few-shot Task-agnostic Neural Architecture Search for Distilling Large Language Models
NIPS 2022
Taming Sparsely Activated Transformer with Stochastic Experts
ICLR 2022
No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models
ICLR 2022
Efficient Self-supervised Vision Transformers for Representation Learning
ICLR 2022
ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models
NIPS 2022
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
NIPS 2022
Knowledge-Grounded Dialogue Generation with a Unified Knowledge Representation
NAACL 2022
KAT: A Knowledge Augmented Transformer for Vision-and-Language
NAACL 2022
LiST: Lite Prompted Self-training Makes Parameter-efficient Few-shot Learners
NAACL 2022
Open Domain Question Answering with A Unified Knowledge Interface
ACL 2022
Unified Contrastive Learning in Image-Text-Label Space
CVPR 2022
Focal Modulation Networks
NIPS 2022
Data Augmentation for Spoken Language Understanding via Pretrained Language Models
INTERSPEECH 2021
Posterior Differential Regularization with f-divergence for Improving Model Robustness
NAACL 2021
Enriching Transformers with Structured Tensor-Product Representations for Abstractive Summarization
NAACL 2021
Text Editing by Command
NAACL 2021
Targeted Adversarial Training for Natural Language Understanding
NAACL 2021
Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
NIPS 2021
Focal Attention for Long-Range Interactions in Vision Transformers
NIPS 2021
DeBERTa: Decoding-enhanced BERT with Disentangled Attention
ICLR 2021
UnitedQA: A Hybrid Approach for Open Domain Question Answering
ACL 2021
Generation-Augmented Retrieval for Open-Domain Question Answering
ACL 2021
RADDLE: An Evaluation Benchmark and Analysis Platform for Robust Task-oriented Dialog Systems
ACL 2021
EmailSum: Abstractive Email Thread Summarization
ACL 2021
Reader-Guided Passage Reranking for Open-Domain Question Answering
ACL 2021
GO FIGURE: A Meta Evaluation of Factuality in Summarization
ACL 2021
Token-wise Curriculum Learning for Neural Machine Translation
EMNLP 2021
ARCH: Efficient Adversarial Regularized Training with Caching
EMNLP 2021
NICE: Neural Image Commenting with Empathy
EMNLP 2021
A Controllable Model of Grounded Response Generation
AAAI 2021
Data Augmentation for Abstractive Query-Focused Multi-Document Summarization
AAAI 2021
VIVO: Visual Vocabulary Pre-Training for Novel Object Captioning
AAAI 2021
Contrastive Multi-document Question Generation
EACL 2021
TACo: Token-Aware Cascade Contrastive Learning for Video-Text Alignment
ICCV 2021
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding
ICCV 2021
Adversarial Regularization as Stackelberg Game: An Unrolled Optimization Approach
EMNLP 2021
HittER: Hierarchical Transformers for Knowledge Graph Embeddings
EMNLP 2021
Few-Shot Named Entity Recognition: An Empirical Baseline Study
EMNLP 2021
VinVL: Revisiting Visual Representations in Vision-Language Models
CVPR 2021
UnitedQA: A Hybrid Approach for Open Domain Question Answering
IJCNLP 2021
Generation-Augmented Retrieval for Open-Domain Question Answering
IJCNLP 2021
RADDLE: An Evaluation Benchmark and Analysis Platform for Robust Task-oriented Dialog Systems
IJCNLP 2021
EmailSum: Abstractive Email Thread Summarization
IJCNLP 2021
Reader-Guided Passage Reranking for Open-Domain Question Answering
IJCNLP 2021
GO FIGURE: A Meta Evaluation of Factuality in Summarization
IJCNLP 2021
SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization
ACL 2020
PIQA: Reasoning about Physical Commonsense in Natural Language
AAAI 2020
What Makes A Good Story? Designing Composite Rewards for Visual Storytelling
AAAI 2020
Complementary Auxiliary Classifiers for Label-Conditional Text Generation
AAAI 2020
Unified Vision-Language Pre-Training for Image Captioning and VQA
AAAI 2020
MIND: A Large-scale Dataset for News Recommendation
ACL 2020
The Microsoft Toolkit of Multi-Task Deep Neural Networks for Natural Language Understanding
ACL 2020
ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems
ACL 2020
DIALOGPT: Large-Scale Generative Pre-training for Conversational Response Generation
ACL 2020
Conversation Learner - A Machine Teaching Tool for Building Dialog Managers for Task-Oriented Dialog Systems
ACL 2020
Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-Training
CVPR 2020
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
ECCV 2020
PlotMachines: Outline-Conditioned Generation with Dynamic Plot State Tracking
EMNLP 2020
Optimus: Organizing Sentences via Pre-trained Modeling of a Latent Space
EMNLP 2020
Understanding the Difficulty of Training Transformers
EMNLP 2020
Few-shot Natural Language Generation for Task-Oriented Dialog
EMNLP 2020
RMM: A Recursive Mental Model for Dialogue Navigation
EMNLP 2020
Guided Dialogue Policy Learning without Adversarial Learning in the Loop
EMNLP 2020
RaCT: Toward Amortized Ranking-Critical Training For Collaborative Filtering
ICLR 2020
On the Variance of the Adaptive Learning Rate and Beyond
ICLR 2020
UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training
ICML 2020
Mapping natural-language problems to formal-language solutions using structured neural representations
ICML 2020
Feature Quantization Improves GAN Training
ICML 2020
Structuring Latent Spaces for Stylized Response Generation
EMNLP 2019
Robust Navigation with Language Pretraining and Stochastic Sampling
EMNLP 2019
REO-Relevance, Extraness, Omission: A Fine-grained Evaluation for Image Captioning
EMNLP 2019
Microsoft Icecaps: An Open-Source Toolkit for Conversation Modeling
ACL 2019
DoubleTransfer at MEDIQA 2019: Multi-Source Transfer Learning for Natural Language Understanding in the Medical Domain
ACL 2019
Towards Generating Long and Coherent Text with Multi-Level Latent Variable Models
ACL 2019
Budgeted Policy Learning for Task-Oriented Dialogue Systems
ACL 2019
Multi-Task Deep Neural Networks for Natural Language Understanding
ACL 2019
Conversing by Reading: Contentful Neural Conversation with On-demand Machine Reading
ACL 2019
Multi-step Reasoning via Recurrent Dual Attention for Visual Dialog
ACL 2019
ConvLab: Multi-Domain End-to-End Dialog System Platform
ACL 2019
Towards Coherent and Cohesive Long-form Text Generation
NAACL 2019
Multi-task Learning with Sample Re-weighting for Machine Reading Comprehension
NAACL 2019
Jointly Optimizing Diversity and Relevance in Neural Response Generation
NAACL 2019
Unsupervised Deep Structured Semantic Models for Commonsense Reasoning
NAACL 2019
Cyclical Annealing Schedule: A Simple Approach to Mitigating KL Vanishing
NAACL 2019
Adversarial Domain Adaptation for Machine Reading Comprehension
EMNLP 2019
Implicit Deep Latent Variable Models for Text Generation
IJCNLP 2019
Adversarial Domain Adaptation for Machine Reading Comprehension
IJCNLP 2019
TIGEr: Text-to-Image Grounding for Image Caption Evaluation
IJCNLP 2019
Structuring Latent Spaces for Stylized Response Generation
IJCNLP 2019
Robust Navigation with Language Pretraining and Stochastic Sampling
IJCNLP 2019
Object-Driven Text-To-Image Synthesis via Adversarial Training
CVPR 2019
Tactical Rewind: Self-Correction via Backtracking in Vision-And-Language Navigation
CVPR 2019
Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation
CVPR 2019
StoryGAN: A Sequential Conditional GAN for Story Visualization
CVPR 2019
TIGEr: Text-to-Image Grounding for Image Caption Evaluation
EMNLP 2019
A Hybrid Neural Network Model for Commonsense Reasoning
EMNLP 2019
Interactive Semantic Parsing for If-Then Recipes via Hierarchical Reinforcement Learning
AAAI 2019
Unified Language Model Pre-training for Natural Language Understanding and Generation
NIPS 2019
Implicit Deep Latent Variable Models for Text Generation
EMNLP 2019
REO-Relevance, Extraness, Omission: A Fine-grained Evaluation for Image Captioning
IJCNLP 2019
Switch-Based Active Deep Dyna-Q: Efficient Adaptive Planning for Task-Completion Dialogue Policy Learning
AAAI 2019
Neural Approaches to Conversational AI
ACL 2018
Discriminative Deep Dyna-Q: Robust Planning for Dialogue Policy Learning
EMNLP 2018
Subgoal Discovery for Hierarchical Dialogue Policy Learning
EMNLP 2018
Discourse-Aware Neural Rewards for Coherent Text Generation
NAACL 2018
Language-Based Image Editing With Recurrent Attentive Models
CVPR 2018
Stochastic Answer Networks for Machine Reading Comprehension
ACL 2018
Generating Informative and Diverse Conversational Responses via Adversarial Information Maximization
NIPS 2018
Navigating with Graph Representations for Fast and Scalable Decoding of Neural Language Models
NIPS 2018
M-Walk: Learning to Walk over Graphs using Monte Carlo Tree Search
NIPS 2018
Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning
ACL 2018
An Empirical Analysis of Multiple-Turn Reasoning Strategies in Reading Comprehension Tasks
IJCNLP 2017
Semantic Compositional Networks for Visual Captioning
CVPR 2017
StyleNet: Generating Attractive Visual Captions With Styles
CVPR 2017
A Nested Attention Neural Hybrid Model for Grammatical Error Correction
ACL 2017
Towards End-to-End Reinforcement Learning of Dialogue Agents for Information Access
ACL 2017
Image-Grounded Conversations: Multimodal Context for Natural Question and Response Generation
IJCNLP 2017
Multi-Task Learning for Speaker-Role Adaptation in Neural Conversation Models
IJCNLP 2017
End-to-End Task-Completion Neural Dialogue Systems
IJCNLP 2017
Open-Domain Neural Dialogue Systems
IJCNLP 2017
Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep Reinforcement Learning
EMNLP 2017
Multi-Domain Joint Semantic Frame Parsing Using Bi-Directional RNN-LSTM
INTERSPEECH 2016
Deep Reinforcement Learning for Dialogue Generation
EMNLP 2016
Deep Reinforcement Learning with a Natural Language Action Space
ACL 2016
A Persona-Based Neural Conversation Model
ACL 2016
Bi-directional Attention with Agreement for Dependency Parsing
EMNLP 2016
Deep Reinforcement Learning with a Combinatorial Action Space for Predicting Popular Reddit Threads
EMNLP 2016
A Diversity-Promoting Objective Function for Neural Conversation Models
NAACL 2016
Stacked Attention Networks for Image Question Answering
CVPR 2016
End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Language Understanding
INTERSPEECH 2016
deltaBLEU: A Discriminative Metric for Generation Tasks with Intrinsically Diverse Targets
ACL 2015
End-to-end Learning of LDA by Mirror-Descent Back Propagation over a Deep Architecture
NIPS 2015
Semantic Parsing via Staged Query Graph Generation: Question Answering with Knowledge Base
IJCNLP 2015
deltaBLEU: A Discriminative Metric for Generation Tasks with Intrinsically Diverse Targets
IJCNLP 2015
Deep Learning and Continuous Representations for Natural Language Processing
NAACL 2015
Representation Learning Using Multi-Task Deep Neural Networks for Semantic Classification and Information Retrieval
NAACL 2015
From Captions to Visual Concepts and Back
CVPR 2015
A Neural Network Approach to Context-Sensitive Generation of Conversational Responses
NAACL 2015
Semantic Parsing via Staged Query Graph Generation: Question Answering with Knowledge Base
ACL 2015
Large-scale Expected BLEU Training of Phrase-based Reordering Models
EMNLP 2014
Learning Continuous Phrase Representations for Translation Modeling
ACL 2014
Modeling Interestingness with Deep Neural Networks
EMNLP 2014
Minimum Translation Modeling with Recurrent Neural Networks
EACL 2014
Decoder Integration and Expected BLEU Training for Recurrent Neural Network Language Models
ACL 2014
Training MRF-Based Phrase Translation Models using Gradient Ascent
NAACL 2013
Beyond Left-to-Right: Multiple Decomposition Structures for SMT
NAACL 2013
A Unified Approach to Transliteration-based Text Input with Online Spelling Correction
EMNLP 2012
A Unified Approach to Transliteration-based Text Input with Online Spelling Correction
CONLL 2012
Learning Lexicon Models from Search Logs for Query Expansion
EMNLP 2012
Learning Lexicon Models from Search Logs for Query Expansion
CONLL 2012
MSR SPLAT, a language analysis toolkit
NAACL 2012
Domain Adaptation via Pseudo In-Domain Data Selection
EMNLP 2011
Learning Phrase-Based Spelling Error Models from Clickthrough Data
ACL 2010
A comparison of unsupervised methods for Part-of-Speech Tagging in Chinese
COLING 2010
A Large Scale Ranker-Based System for Search Query Spelling Correction
COLING 2010
Discovery of Term Variation in Japanese Web Search Queries
EMNLP 2009
Model Adaptation via Model Interpolation and Boosting for Web Search Ranking
EMNLP 2009
A Web-based English Proofing System for English as a Second Language Users
IJCNLP 2008
Bayesian Semi-Supervised Chinese Word Segmentation for Statistical Machine Translation
COLING 2008
Indirect-HMM-based Hypothesis Alignment for Combining Outputs from Machine Translation Systems
EMNLP 2008
A comparison of Bayesian estimators for unsupervised Hidden Markov Model POS taggers
EMNLP 2008
Using Contextual Speller Techniques and Language Modeling for ESL Error Correction
IJCNLP 2008
A Comparative Study of Parameter Estimation Methods for Statistical Natural Language Processing
ACL 2007
Compressing Trigram Language Models With Golomb Coding
EMNLP 2007
Compressing Trigram Language Models With Golomb Coding
CONLL 2007
Approximation Lasso Methods for Language Modeling
ACL 2006
A DOM Tree Alignment Model for Mining Parallel Data from the Web
ACL 2006
An Information-Theoretic Approach to Automatic Evaluation of Summaries
NAACL 2006
Approximation Lasso Methods for Language Modeling
COLING 2006
A DOM Tree Alignment Model for Mining Parallel Data from the Web
COLING 2006
Transformation Based Chinese Entity Detection and Tracking
IJCNLP 2005
A Comparative Study on Language Model Adaptation Techniques Using New Evaluation Metrics
EMNLP 2005
Minimum Sample Risk Methods for Language Modeling
EMNLP 2005
An Empirical Study on Language Model Adaptation Using a Metric of Domain Similarity
IJCNLP 2005
Adaptive Chinese Word Segmentation
ACL 2004
Unsupervised Learning of Dependency Structure for Language Modeling
ACL 2003
Improved Source-Channel Models for Chinese Word Segmentation
ACL 2003
Improving Language Model Size Reduction using Better Pruning Criteria
ACL 2002
Chinese Named Entity Identification Using Class-based Language Model
COLING 2002
Exploring Asymmetric Clustering for Statistical Language Modeling
ACL 2002
Exploiting Headword Dependency and Predictive Clustering for Language Modeling
EMNLP 2002
Extraction of Chinese Compound Words - An Experimental Study on a Very Large Corpus
ACL 2000
PENS: A Machine-aided English Writing System for Chinese Users
ACL 2000
Distribution-Based Pruning of Backoff Language Models
ACL 2000