Byoung-tak Zhang
54 papers · 2000–2026 · 16 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
π Academic Marathon (26) π Conference Polyglot (16) π Interdisciplinary Bridge π§ Keyword Pioneer π£ Hot Topic Early Bird
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π
Conference Polyglot
(16)
π
Keyword Trendsetter Combo
(3)
π€
Dynamic Duo
(11)
π±
Topic Pioneer
π¬
Deep Specialist
(15)
π§¬
Topic Evolution
π
Keyword Champion
π₯
Unstoppable
(11)
β‘
Prolific Year
(7)
π
Century Club
(51)
π
Trend Setter
ποΈ
Keyword Collector
(218)
π
Conference Pioneer
Conferences
AAAI (8)
ACL (8)
NIPS (8)
EMNLP (6)
CVPR (5)
IJCAI (4)
ECCV (2)
ICCV (2)
ICML (2)
IJCNLP (2)
WACV (2)
CLEAR (1)
COLING (1)
NAACL (1)
RSS (1)
UAI (1)
Top co-authors
Keywords
video understanding
(6)
multimodal learning
(6)
visual dialog
(5)
visual question answering
(5)
self-supervised learning
(5)
reinforcement learning
(4)
neural network
(4)
video question answering
(4)
question answering
(4)
visual grounding
(3)
contrastive learning
(3)
attention mechanism
(3)
graph neural network
(3)
image generation
(2)
semi-supervised learning
(2)
visual reasoning
(2)
scene graph
(2)
metric learning
(2)
domain adaptation
(2)
information theory
(2)
Papers
Hybrid State Representation for Video Procedure Planning
WACV 2026
PeriUn: Enhancing Unlearning by Selectively Forgetting Peripheral Samples
AAAI 2026
Neural Collapse-Informed Initialization with Perturbation Injection in Classification-based Metric Learning
AAAI 2026
Towards Spatially Consistent Image Generation: On Incorporating Intrinsic Scene Properties into Diffusion Models
AAAI 2026
On the Consistency of Video Large Language Models in Temporal Comprehension
CVPR 2025
Background-Aware Moment Detection for Video Moment Retrieval
WACV 2025
CLIP-RT: Learning Language-Conditioned Robotic Policies from Natural Language Supervision
RSS 2025
OCK: Unsupervised Dynamic Video Prediction with Object-Centric Kinematics
ICCV 2025
Truncated Gaussian Policy for Debiased Continuous Control
AAAI 2025
Confidence-guided Refinement Reasoning for Zero-shot Question Answering
EMNLP 2025
Unveiling the Significance of Toddler-Inspired Reward Transition in Goal-Oriented Reinforcement Learning
AAAI 2024
Efficient Monte Carlo Tree Search via On-the-Fly State-Conditioned Action Abstraction
UAI 2024
DUEL: Duplicate Elimination on Active Memory for Self-Supervised Class-Imbalanced Learning
AAAI 2024
Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning
ICML 2024
Continuous SO(3) Equivariant Convolution for 3D Point Cloud Analysis
ECCV 2024
On Discovery of Local Independence over Continuous Variables via Neural Contextual Decomposition
CLEAR 2023
Learning Geometry-Aware Representations by Sketching
CVPR 2023
The Dialog Must Go On: Improving Visual Dialog via Generative Self-Training
CVPR 2023
Neural Collage Transfer: Artistic Reconstruction via Material Manipulation
ICCV 2023
Modal-specific Pseudo Query Generation for Video Corpus Moment Retrieval
EMNLP 2022
Language-agnostic Semantic Consistent Text-to-Image Generation
ACL 2022
Hypergraph Transformer: Weakly-Supervised Multi-hop Reasoning for Knowledge-based Visual Question Answering
ACL 2022
SelecMix: Debiased Learning by Contradicting-pair Sampling
NIPS 2022
Robust Imitation via Mirror Descent Inverse Reinforcement Learning
NIPS 2022
Smooth-Swap: A Simple Enhancement for Face-Swapping With Smoothness
CVPR 2022
Scene Graph Parsing via Abstract Meaning Representation in Pre-trained Language Models
NAACL 2022
PlaceNet: Neural Spatial Representation Learning with Multimodal Attention
IJCAI 2022
Attend What You Need: Motion-Appearance Synergistic Networks for Video Question Answering
ACL 2021
Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer
EMNLP 2021
Devilβs Advocate: Novel Boosting Ensemble Method from Psychological Findings for Text Classification
EMNLP 2021
DramaQA: Character-Centered Video Story Understanding with Hierarchical QA
AAAI 2021
Message Passing Adaptive Resonance Theory for Online Active Semi-supervised Learning
ICML 2021
Attend What You Need: Motion-Appearance Synergistic Networks for Video Question Answering
IJCNLP 2021
Goal-Aware Cross-Entropy for Multi-Target Reinforcement Learning
NIPS 2021
Toward General Scene Graph: Integration of Visual Semantic Knowledge with Entity Synset Alignment
ACL 2020
Hypergraph Attention Networks for Multimodal Learning
CVPR 2020
Cut-Based Graph Learning Networks to Discover Compositional Structure of Sequential Video Data
AAAI 2020
Dual Attention Networks for Visual Reference Resolution in Visual Dialog
IJCNLP 2019
Dual Attention Networks for Visual Reference Resolution in Visual Dialog
EMNLP 2019
CoDraw: Collaborative Drawing as a Testbed for Grounded Goal-driven Communication
ACL 2019
Multimodal Dual Attention Memory for Video Story Question Answering
ECCV 2018
Bilinear Attention Networks
NIPS 2018
Answerer in Questioner's Mind: Information Theoretic Approach to Goal-Oriented Visual Dialog
NIPS 2018
Overcoming Catastrophic Forgetting by Incremental Moment Matching
NIPS 2017
DeepStory: Video Story QA by Deep Embedded Memory Networks
IJCAI 2017
Dual-Memory Deep Learning Architectures for Lifelong Learning of Everyday Human Behaviors
IJCAI 2016
DeepSchema: Automatic Schema Acquisition from Wearable Sensor Data in Restaurant Situations
IJCAI 2016
Multimodal Residual Learning for Visual QA
NIPS 2016
Generative Local Metric Learning for Nearest Neighbor Classification
NIPS 2010
Text Chunking by Combining Hand-Crafted Rules and Memory-Based Learning
ACL 2003
A Comparative Evaluation of Data-driven Models in Translation Selection of Machine Translation
COLING 2002
Reducing Parsing Complexity by Intra-Sentence Segmentation based on Maximum Entropy Model
ACL 2000
Reducing Parsing Complexity by Intra-Sentence Segmentation based on Maximum Entropy Model
EMNLP 2000
Word Sense Disambiguation by Learning from Unlabeled Data
ACL 2000