Mark Johnson
114 papers · 2000–2025 · 12 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+17 more ↓ Show less ↑
๐งญ Keyword Pioneer ๐ Interdisciplinary Bridge ๐ Renaissance Researcher (7) ๐บ๏ธ Taxonomy Completionist (14) ๐ฃ Hot Topic Early Bird
๐
Renaissance Researcher
(7)
๐
Interdisciplinary Bridge
๐งญ
Keyword Pioneer
๐
Keyword Trendsetter Combo
(6)
๐
Conference Loyalist
(21)
๐บ
Lone Wolf
(7)
๐ค
Dynamic Duo
(14)
๐งฌ
Topic Evolution
๐
Keyword Champion
๐ฑ
Topic Pioneer
โก
Prolific Year
(8)
๐ฅ
Unstoppable
(22)
โ
The Questioner
(3)
๐
Trend Setter
๐
Century Club
(114)
๐๏ธ
Keyword Collector
(172)
๐
Conference Pioneer
Conferences
ACL (36)
EMNLP (21)
NAACL (21)
COLING (12)
CONLL (8)
NIPS (5)
IJCNLP (4)
CVPR (2)
JMLR (2)
AACL (1)
EACL (1)
ICCV (1)
Top co-authors
Research topics
Keywords
disfluency detection
(5)
image captioning
(4)
entailment graph
(4)
knowledge graph
(3)
dependency parsing
(3)
object detection
(3)
relation prediction
(3)
copy mechanism
(3)
probabilistic context-free grammar
(3)
natural language understanding
(2)
semi-supervised learning
(2)
word segmentation
(2)
reinforcement learning
(2)
part-of-speech tagging
(2)
transfer learning
(2)
syntactic parsing
(2)
semantic parsing
(2)
speech recognition
(2)
link prediction
(2)
natural language inference
(2)
Papers
Mastering the Craft of Data Synthesis for CodeLLMs
NAACL 2025
Sources of Hallucination by Large Language Models on Inference Tasks
EMNLP 2023
Smoothing Entailment Graphs with Language Models
AACL 2023
Smoothing Entailment Graphs with Language Models
IJCNLP 2023
Integrating Lexical Information into Entity Neighbourhood Representations for Relation Prediction
NAACL 2021
Mention Flags (MF): Constraining Transformer-based Text Generators
IJCNLP 2021
ECOL-R: Encouraging Copying in Novel Object Captioning with Reinforcement Learning
EACL 2021
Open-Domain Contextual Link Prediction and its Complementarity with Entailment Graphs
EMNLP 2021
Blindness to Modality Helps Entailment Graph Mining
EMNLP 2021
Mention Flags (MF): Constraining Transformer-based Text Generators
ACL 2021
Neural Rule-Execution Tracking Machine For Transformer-Based Text Generation
NIPS 2021
Multivalent Entailment Graphs for Question Answering
EMNLP 2021
Improving Disfluency Detection by Self-Training a Self-Attentive Model
ACL 2020
End-to-End Speech Recognition and Disfluency Removal
EMNLP 2020
Incorporating Temporal Information in Entailment Graph Mining
COLING 2020
How to Best Use Syntax in Semantic Role Labelling
ACL 2019
Neural Constituency Parsing of Speech Transcripts
NAACL 2019
nocaps: novel object captioning at scale
ICCV 2019
Duality of Link Prediction and Entailment Graph Induction
ACL 2019
An adaptable task-oriented dialog system for stand-alone embedded devices
ACL 2019
Active learning for deep semantic parsing
ACL 2018
VnCoreNLP: A Vietnamese Natural Language Processing Toolkit
NAACL 2018
Partially-Supervised Image Captioning
NIPS 2018
AMR dependency parsing with a typed semantic algebra
ACL 2018
Vision-and-Language Navigation: Interpreting Visually-Grounded Navigation Instructions in Real Environments
CVPR 2018
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
CVPR 2018
Predicting accuracy on large datasets from smaller pilot data
ACL 2018
Disfluency Detection using Auto-Correlational Neural Networks
EMNLP 2018
Guided Open Vocabulary Image Captioning with Constrained Beam Search
EMNLP 2017
A Novel Neural Network Model for Joint POS Tagging and Graph-based Dependency Parsing
CONLL 2017
Multilingual Semantic Parsing And Code-Switching
CONLL 2017
Idea density for predicting Alzheimerโs disease from transcribed speech
CONLL 2017
Unsupervised Text Segmentation Based on Native Language Characteristics
ACL 2017
Disfluency Detection using a Noisy Channel Model and a Deep Neural Language Model
ACL 2017
Efficient techniques for parsing with tree automata
ACL 2016
Grammar induction from (lots of) words alone
COLING 2016
STransE: a novel embedding model of entities and relationships in knowledge bases
NAACL 2016
Neighborhood Mixture Model for Knowledge Base Completion
CONLL 2016
Using Left-corner Parsing to Encode Universal Structural Constraints in Grammar Induction
EMNLP 2016
An Improved Non-monotonic Transition System for Dependency Parsing
EMNLP 2015
Sign constraints on feature weights improve a joint model of word segmentation and phonology
NAACL 2015
An Incremental Algorithm for Transition-based CCG Parsing
NAACL 2015
A Computationally Efficient Algorithm for Learning Topical Collocation Models
ACL 2015
A Computationally Efficient Algorithm for Learning Topical Collocation Models
IJCNLP 2015
Modelling function words improves unsupervised word segmentation
ACL 2014
Unsupervised Word Segmentation in Context
COLING 2014
Syllable weight encodes mostly the same information for English word segmentation as dictionary stress
EMNLP 2014
A Non-Monotonic Arc-Eager Transition System for Dependency Parsing
CONLL 2013
Topic Segmentation with a Structured Topic Model
NAACL 2013
A joint model of word segmentation and phonological variation for English word-final /t/-deletion
ACL 2013
The effect of non-tightness on Bayesian estimation of PCFGs
ACL 2013
Semantic Parsing with Bayesian Tree Transducers
ACL 2012
Exploring Adaptor Grammars for Native Language Identification
CONLL 2012
Using Rejuvenation to Improve Particle Filtering for Bayesian Word Segmentation
ACL 2012
Exploring Adaptor Grammars for Native Language Identification
EMNLP 2012
Exploiting Social Information in Grounded Language Learning via Grammatical Reduction
ACL 2012
Studying the Effect of Input Size for Bayesian Word Segmentation on the Providence Corpus
COLING 2012
Improving Combinatory Categorial Grammar Parse Reranking with Dependency Grammar Features
COLING 2012
Introduction to the Special Topic on Grammar Induction, Representation of Language and Language Learning
JMLR 2011
The impact of language models and loss functions on repair disfluency detection
ACL 2011
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing
EMNLP 2011
Reducing Grounded Learning Tasks To Grammatical Inference
EMNLP 2011
Producing Power-Law Distributions and Damping Word Frequencies with Two-Stage Language Models
JMLR 2011
SVD and Clustering for Unsupervised POS Tagging
ACL 2010
Unsupervised phonemic Chinese word segmentation using Adaptor Grammars
COLING 2010
Using Universal Linguistic Knowledge to Guide Grammar Induction
EMNLP 2010
Detecting Speech Repairs Incrementally Using a Noisy Channel Approach
COLING 2010
Synergies in learning words and their referents
NIPS 2010
PCFGs, Topic Models, Adaptor Grammars and Learning Topical Collocations and the Structure of Proper Names
ACL 2010
Automatic Domain Adaptation for Parsing
NAACL 2010
Learning Words and Their Meanings from Unsegmented Child-directed Speech
NAACL 2010
Reranking the Berkeley and Brown Parsers
NAACL 2010
A Note on the Implementation of Hierarchical Dirichlet Processes
ACL 2009
Improving Unsupervised Dependency Parsing with Richer Contexts and Smoothing
NAACL 2009
Structured Generative Models for Unsupervised Named-Entity Clustering
NAACL 2009
Improving nonparameteric Bayesian inference: experiments on unsupervised word segmentation with adaptor grammars
NAACL 2009
A Note on the Implementation of Hierarchical Dirichlet Processes
IJCNLP 2009
A comparison of Bayesian estimators for unsupervised Hidden Markov Model POS taggers
EMNLP 2008
Using Adaptor Grammars to Identify Synergies in the Unsupervised Acquisition of Linguistic Structure
ACL 2008
When is Self-Training Effective for Parsing?
COLING 2008
Bayesian Inference for PCFGs via Markov Chain Monte Carlo
NAACL 2007
Why Doesnโt EM Find Good HMM POS-Taggers?
CONLL 2007
Why Doesnโt EM Find Good HMM POS-Taggers?
EMNLP 2007
A Bayesian LDA-based model for semi-supervised part-of-speech tagging
NIPS 2007
A Comparative Study of Parameter Estimation Methods for Statistical Natural Language Processing
ACL 2007
Transforming Projective Bilexical Dependency Grammars into efficiently-parsable CFGs with Unfold-Fold
ACL 2007
Multilevel Coarse-to-Fine PCFG Parsing
NAACL 2006
Learning Phrasal Categories
EMNLP 2006
Contextual Dependencies in Unsupervised Word Segmentation
COLING 2006
Reranking and Self-Training for Parser Adaptation
COLING 2006
Effective Self-Training for Parsing
NAACL 2006
Adaptor Grammars: A Framework for Specifying Compositional Nonparametric Bayesian Models
NIPS 2006
Early Deletion of Fillers In Processing Conversational Speech
NAACL 2006
Contextual Dependencies in Unsupervised Word Segmentation
ACL 2006
Reranking and Self-Training for Parser Adaptation
ACL 2006
Effective Use of Prosody in Parsing Conversational Speech
EMNLP 2005
Representational Bias in Unsupervised Learning of Syllable Structure
CONLL 2005
Coarse-to-Fine n-Best Parsing and MaxEnt Discriminative Reranking
ACL 2005
Sentence-Internal Prosody Does not Help Parsing the Way Punctuation Does
NAACL 2004
Discriminative Language Modeling with Conditional Random Fields and the Perceptron Algorithm
ACL 2004
Attention Shifting for Parsing Speech
ACL 2004
A TAG-based noisy-channel model of speech repairs
ACL 2004
Supersense Tagging of Unknown Nouns in WordNet
EMNLP 2003
Investigating Loss Functions and Optimization Methods for Discriminative Learning of Label Sequences
EMNLP 2003
Parsing and Disfluency Placement
EMNLP 2002
Dynamic programming for parsing and estimation of stochastic unification-based grammars
ACL 2002
A Simple Pattern-matching Algorithm for Recovering Empty Nodes and their Antecedents
ACL 2002
Parsing the Wall Street Journal using a Lexical-Functional Grammar and Discriminative Estimation Techniques
ACL 2002
Edit Detection and Parsing for Transcribed Speech
NAACL 2001
Joint and Conditional Estimation of Tagging and Parsing Models
ACL 2001
Compact non-left-recursive grammars using the selective left-corner transform and factoring
COLING 2000
Lexicalized Stochastic Modeling of Constraint-Based Grammars using Log-Linear Measures and EM Training
ACL 2000
Exploiting auxiliary distributions in stochastic unification-based grammars
NAACL 2000
Explaining away ambiguity: Learning verb selectional preference with Bayesian networks
COLING 2000