Bin Wang
210 papers · 2003–2026 · 21 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+18 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (30) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (8) π£ Hot Topic Early Bird
π
Renaissance Researcher
(8)
π
Interdisciplinary Bridge
π§
Keyword Pioneer
π
Conference Loyalist
(21)
π
Keyword Champion
(2)
π
Triple Crown
π
Grand Slam
π₯
Mega-Team
(92)
π¬
Deep Specialist
(28)
π§¬
Topic Evolution
π€
Dynamic Duo
(21)
β‘
Prolific Year
(22)
π₯
Unstoppable
(12)
ποΈ
Keyword Collector
(73)
π
Century Club
(198)
π
Trend Setter
β
The Questioner
(3)
π
Conference Pioneer
Conferences
ACL (38)
AAAI (30)
EMNLP (23)
IJCAI (22)
COLING (12)
ICCV (12)
CVPR (11)
NIPS (10)
NAACL (8)
ICLR (7)
ECCV (7)
ICML (6)
IJCNLP (6)
INTERSPEECH (6)
SEMEVAL (4)
AACL (2)
WACV (2)
CORL (1)
MICCAI (1)
ACML (1)
OSDI (1)
Top co-authors
Research topics
Keywords
large language model
(26)
multimodal learning
(13)
knowledge distillation
(11)
instruction tuning
(8)
self-supervised learning
(8)
dialogue system
(7)
diffusion model
(7)
representation learning
(7)
neural network
(7)
graph neural network
(7)
vision-language model
(7)
contrastive learning
(6)
data augmentation
(6)
model compression
(6)
knowledge graph
(5)
neural machine translation
(5)
domain adaptation
(5)
benchmark evaluation
(5)
attention mechanism
(5)
link prediction
(5)
Papers
Learning Structurally Stabilized Representations for Lossless DNA Storage
AAAI 2026
Joint Knowledge Base Completion and Question Answering by Combining Large Language Models and Small Language Models
ACL 2026
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing
ACL 2026
DiffBench Meets DiffAgent: End-to-End LLM-Driven Diffusion Acceleration Code Generation
AAAI 2026
TrajAgg: Dual-Scale Feature Aggregation with Hybrid Training for Trajectory Similarity Computation in Free Space
AAAI 2026
PACE: Predictive Adaptive Context Extraction for Long-Horizon LLM Agents
ACL 2026
RICo: Refined In-Context Contribution for Automatic Instruction-Tuning Data Selection
AAAI 2026
Large Language Models Struggle with Unreasonability in Math Problems
AAAI 2026
SEFEL: A Simple Yet Effective Framework for Fast Event Linking
AAAI 2026
SACodec: Asymmetric Quantization with Semantic Anchoring for Low-Bitrate High-Fidelity Neural Speech Codecs
AAAI 2026
Self-Improving Sparse Retrieval Through Heuristic Representation Refinement and Representation-Focused Learning
AAAI 2026
Dynamic Gaussian Scene Reconstruction from Unsynchronized Videos
AAAI 2026
ETRQA: A Comprehensive Benchmark for Evaluating Event Temporal Reasoning Abilities of Large Language Models
ACL 2025
Beyond Classification: Towards Speech Emotion Reasoning with Multitask AudioLLMs
IJCNLP 2025
Theoretical Insights into Fine-Tuning Attention Mechanism: Generalization and Optimization
IJCAI 2025
DGraFormer: Dynamic Graph Learning Guided Multi-Scale Transformer for Multivariate Time Series Forecasting
IJCAI 2025
Non-collective Calibrating Strategy for Time Series Forecasting
IJCAI 2025
Accelerating Diffusion-based Super-Resolution with Dynamic Time-Spatial Sampling
IJCAI 2025
Stability and Generalization of Zeroth-Order Decentralized Stochastic Gradient Descent with Changing Topology
AAAI 2025
Function-to-Style Guidance of LLMs for Code Translation
ICML 2025
FG-CLIP: Fine-Grained Visual and Textual Alignment
ICML 2025
IAA: Inner-Adaptor Architecture Empowers Frozen Large Language Model with Multimodal Capabilities
AAAI 2025
Beyond Classification: Towards Speech Emotion Reasoning with Multitask AudioLLMs
AACL 2025
IFEval-Audio: Benchmarking Instruction-Following Capability in Audio-based Large Language Models
AACL 2025
PMSS: Pretrained Matrices Skeleton Selection for LLM Fine-tuning
COLING 2025
CrossIn: An Efficient Instruction Tuning Approach for Cross-Lingual Knowledge Alignment
COLING 2025
Order-aware Interactive Segmentation
ICLR 2025
ToolACE: Winning the Points of LLM Function Calling
ICLR 2025
GeoX: Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training
ICLR 2025
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
ICLR 2025
LEGION: Learning to Ground and Explain for Synthetic Image Detection
ICCV 2025
Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study
NAACL 2025
Shift the Lens: Environment-Aware Unsupervised Camouflaged Object Detection
CVPR 2025
Seq2Time: Sequential Knowledge Transfer for Video LLM Temporal Grounding
CVPR 2025
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations
CVPR 2025
Image Over Text: Transforming Formula Recognition Evaluation with Character Detection Matching
CVPR 2025
ReachAgent: Enhancing Mobile Agent via Page Reaching and Operation
NAACL 2025
SSMLoRA: Enhancing Low-Rank Adaptation with State Space Model
NAACL 2025
AudioBench: A Universal Benchmark for Audio Large Language Models
NAACL 2025
Beyond Single Images: Retrieval Self-Augmented Unsupervised Camouflaged Object Detection
ICCV 2025
Chimera: Improving Generalist Model with Domain-Specific Experts
ICCV 2025
IFEval-Audio: Benchmarking Instruction-Following Capability in Audio-based Large Language Models
IJCNLP 2025
Hybrid Layout Control for Diffusion Transformer: Fewer Annotations, Superior Aesthetics
ICCV 2025
Spatiotemporal-aware Trend-Seasonality Decomposition Network for Traffic Flow Forecasting
AAAI 2025
Towards Ship License Plate Recognition in the Wild: A Large Benchmark and Strong Baseline
AAAI 2025
Reverse Distribution Based Video Moment Retrieval for Effective Bias Elimination
AAAI 2025
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation
ICCV 2025
LLM4RSR: Large Language Models as Data Correctors for Robust Sequential Recommendation
AAAI 2025
Walk Wisely on Graph: Knowledge Graph Reasoning with Dual Agents via Efficient Guidance-Exploration
AAAI 2025
SURVEYFORGE : On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automated Survey Writing
ACL 2025
Global Eye: Breaking the βFixed Thinking Patternβ during the Instruction Expansion Process
ACL 2025
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia
ACL 2025
HoPE: A Novel Positional Encoding Without Long-Term Decay for Enhanced Context Awareness and Extrapolation
ACL 2025
MERaLiON-AudioLLM: Advancing Speech and Language Understanding for Singapore
ACL 2025
CoinMath: Harnessing the Power of Coding Instruction for Math LLM
ACL 2025
TailorKV: A Hybrid Framework for Long-Context Inference via Tailored KV Cache Optimization
ACL 2025
A Comprehensive Evaluation of Quantization Strategies for Large Language Models
ACL 2024
Segmentation-guided Layer-wise Image Vectorization with Gradient Fills
ECCV 2024
A New Dataset and Framework for Real-World Blurred Images Super-Resolution
ECCV 2024
Parrot Captions Teach CLIP to Spot Text
ECCV 2024
Resilience of Large Language Models for Noisy Instructions
EMNLP 2024
In2Core: Leveraging Influence Functions for Coreset Selection in Instruction Finetuning of Large Language Models
EMNLP 2024
MobileVLM: A Vision-Language Model for Better Intra- and Inter-UI Understanding
EMNLP 2024
Mixture of Diverse Size Experts
EMNLP 2024
ToolPlanner: A Tool Augmented LLM for Multi Granularity Instructions with Path Planning and Feedback
EMNLP 2024
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
EMNLP 2024
PhyRecon: Physically Plausible Neural Scene Reconstruction
NIPS 2024
VIGC: Visual Instruction Generation and Correction
AAAI 2024
W2P: Switching from Weak Supervision to Partial Supervision for Semantic Segmentation
AAAI 2024
Domain Generalization With Correlated Style Uncertainty
WACV 2024
GazeGNN: A Gaze-Guided Graph Neural Network for Chest X-Ray Classification
WACV 2024
SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning
NAACL 2024
Gaze-directed Vision GNN for Mitigating Shortcut Learning in Medical Image
MICCAI 2024
Bridging Language Gaps in Audio-Text Retrieval
INTERSPEECH 2024
Streaming Audio Transformers for Online Audio Tagging
INTERSPEECH 2024
Enhancing Automated Audio Captioning via Large Language Models with Optimized Audio Encoding
INTERSPEECH 2024
Scaling up masked audio encoder learning for general audio classification
INTERSPEECH 2024
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
NIPS 2024
DiP-GO: A Diffusion Pruner via Few-step Gradient Optimization
NIPS 2024
Distribution-Aware Data Expansion with Diffusion Models
NIPS 2024
Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents
ACL 2024
DetermLR: Augmenting LLM-based Logical Reasoning from Indeterminacy to Determinacy
ACL 2024
EmpathyEar: An Open-source Avatar Multimodal Empathetic Chatbot
ACL 2024
Pruning Large Language Models to Intra-module Low-rank Architecture with Transitional Activations
ACL 2024
Recognizing Everything from All Modalities at Once: Grounded Multimodal Universal Information Extraction
ACL 2024
CRAFT: Extracting and Tuning Cultural Instructions from the Wild
ACL 2024
Multi-Relational Graph Attention Network for Social Relationship Inference from Human Mobility Data
IJCAI 2024
Distributed Bilevel Optimization with Communication Compression
ICML 2024
HyperMR: Hyperbolic Hypergraph Multi-hop Reasoning for Knowledge-based Visual Question Answering
COLING 2024
ToolRerank: Adaptive and Hierarchy-Aware Reranking for Tool Retrieval
COLING 2024
OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation
CVPR 2024
Generate Subgoal Images before Act: Unlocking the Chain-of-Thought Reasoning in Diffusion Model for Robot Manipulation with Multimodal Prompts
CVPR 2024
CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoor Object Detection from Multi-view Images
CVPR 2024
Towards Faithful XAI Evaluation via Generalization-Limited Backdoor Watermark
ICLR 2024
Tree-Planner: Efficient Close-loop Task Planning with Large Language Models
ICLR 2024
The Xiaomi AI Labβs Speech Translation Systems for IWSLT 2023 Offline Task, Simultaneous Task and Speech-to-Speech Task
ACL 2023
Theoretically Guaranteed Bidirectional Data Rectification for Robust Sequential Recommendation
NIPS 2023
Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization
NIPS 2023
EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought
NIPS 2023
Poisoning with Cerberus: Stealthy and Colluded Backdoor Attack against Federated Learning
AAAI 2023
BERT-ERC: Fine-Tuning BERT Is Enough for Emotion Recognition in Conversation
AAAI 2023
Dialogue Rewriting via Skeleton-Guided Generation
AAAI 2023
MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Moral Discussions
ACL 2023
Exploring Better Text Image Translation with Multimodal Codebook
ACL 2023
Compounding Geometric Operations for Knowledge Graph Completion
ACL 2023
GreenKGC: A Lightweight Knowledge Graph Completion Method
ACL 2023
Pay More Attention to Relation Exploration for Knowledge Base Question Answering
ACL 2023
Universal Information Extraction with Meta-Pretrained Self-Retrieval
ACL 2023
Relational Sentence Embedding for Flexible Semantic Matching
ACL 2023
Exploring All-In-One Knowledge Distillation Framework for Neural Machine Translation
EMNLP 2023
Instructive Dialogue Summarization with Query Aggregations
EMNLP 2023
In-Image Neural Machine Translation with Segmented Pixel Sequence-to-Sequence Model
EMNLP 2023
LEA2: A Lightweight Ensemble Adversarial Attack via Non-overlapping Vulnerable Frequency Regions
ICCV 2023
Few-Shot Physically-Aware Articulated Mesh Generation via Hierarchical Deformation
ICCV 2023
Reconstructed Convolution Module Based Look-Up Tables for Efficient Image Super-Resolution
ICCV 2023
Fan-Beam Binarization Difference Projection (FB-BDP): A Novel Local Object Descriptor for Fine-Grained Leaf Image Retrieval
ICCV 2023
V3Det: Vast Vocabulary Visual Detection Dataset
ICCV 2023
Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection
ICLR 2023
ChiPFormer: Transferable Chip Placement via Offline Decision Transformer
ICML 2023
MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL
ICML 2023
Low-Confidence Samples Mining for Semi-supervised Object Detection
IJCAI 2023
Efficient Multi-View Inverse Rendering Using a Hybrid Differentiable Rendering Method
IJCAI 2023
Socially-Attentive Policy Optimization in Multi-Agent Self-Driving System
CORL 2022
Towards Robust Neural Machine Translation with Iterative Scheduled Data-Switch Training
COLING 2022
Perceiving Stroke-Semantic Context: Hierarchical Contrastive Learning for Robust Scene Text Recognition
AAAI 2022
Offline-to-Online Co-Evolutional User Simulator and Dialogue System
EMNLP 2022
Analyzing and Evaluating Faithfulness in Dialogue Summarization
EMNLP 2022
BIT-Xiaomiβs System for AutoSimTrans 2022
NAACL 2022
Towards Generalized Open Information Extraction
EMNLP 2022
Filter Pruning via Feature Discrimination in Deep Neural Networks
ECCV 2022
MISC: A Mixed Strategy-Aware Model integrating COMET for Emotional Support Conversation
ACL 2022
Just Rank: Rethinking Evaluation with Word and Sentence Similarities
ACL 2022
C3KG: A Chinese Commonsense Conversation Knowledge Graph
ACL 2022
The Xiaomi Text-to-Text Simultaneous Speech Translation System for IWSLT 2022
ACL 2022
GeoAug: Data Augmentation for Few-Shot NeRF with Geometry Constraints
ECCV 2022
Self-supervised Learning and Adaptation for Single Image Dehazing
IJCAI 2022
Robust Network Architecture Search via Feature Distortion Restraining
ECCV 2022
Bi-CMR: Bidirectional Reinforcement Guided Hashing for Effective Cross-Modal Retrieval
AAAI 2022
BCOT: A Markerless High-Precision 3D Object Tracking Benchmark
CVPR 2022
DOMINO: Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning
NIPS 2022
Semi-supervised Object Detection with Adaptive Class-Rebalancing Self-Training
AAAI 2022
Generate, Discriminate and Contrast: A Semi-Supervised Sentence Representation Learning Framework
EMNLP 2022
A Trend-Driven Fashion Design System for Rapid Response Marketing in E-commerce
AAAI 2022
Structure-Unified M-Tree Coding Solver for Math Word Problem
EMNLP 2022
Self-Supervised Video Representation Learning by Context and Motion Decoupling
CVPR 2021
Few-Shot Event Detection with Prototypical Amortized Conditional Random Field
IJCNLP 2021
Reasoning in Dialog: Improving Response Generation by Context Reading Comprehension
AAAI 2021
Train a One-Million-Way Instance Classifier for Unsupervised Visual Representation Learning
AAAI 2021
Does Every Data Instance Matter? Enhancing Sequential Recommendation by Eliminating Unreliable Data
IJCAI 2021
Model-Based Reinforcement Learning via Imagination with Derived Memory
NIPS 2021
Maximal Clique Based Non-Autoregressive Open Information Extraction
EMNLP 2021
Polyphone Disambiguation in Mandarin Chinese with Semi-Supervised Learning
INTERSPEECH 2021
Unsupervised Co-part Segmentation through Assembly
ICML 2021
Improving Tree-Structured Decoder Training for Code Generation via Mutual Learning
AAAI 2021
Pre-training with Meta Learning for Chinese Word Segmentation
NAACL 2021
Few-Shot Event Detection with Prototypical Amortized Conditional Random Field
ACL 2021
Modeling Discourse Structure for Document-level Neural Machine Translation
ACL 2020
GSSNN: Graph Smoothing Splines Neural Networks
AAAI 2020
A Novel Line Integral Transform for 2D Affine-Invariant Shape Retrieval
ECCV 2020
Graph Structured Network for Image-Text Matching
CVPR 2020
Lee at SemEval-2020 Task 5: ALBERT Model Based on the Maximum Ensemble Strategy and Different Data Sampling Methods for Detecting Counterfactual Statements
COLING 2020
Lijunyi at SemEval-2020 Task 4: An ALBERT Model Based Maximum Ensemble with Different Training Sizes and Depths for Commonsense Validation and Explanation
COLING 2020
Overcoming Language Priors with Self-supervised Learning for Visual Question Answering
IJCAI 2020
Triple-GAIL: A Multi-Modal Imitation Learning Framework with Generative Adversarial Nets
IJCAI 2020
An Iterative Multi-Source Mutual Knowledge Transfer Framework for Machine Reading Comprehension
IJCAI 2020
Learning to Prune Dependency Trees with Rethinking for Neural Relation Extraction
COLING 2020
Porous Lattice Transformer Encoder for Chinese NER
COLING 2020
Infusing Sequential Information into Conditional Masked Translation Model with Self-Review Mechanism
COLING 2020
Graph Geometry Interaction Learning
NIPS 2020
LAIX Corpus of Chinese Learner English: Towards a Benchmark for L2 English ASR
INTERSPEECH 2020
Xiaomiβs Submissions for IWSLT 2020 Open Domain Translation Task
ACL 2020
HiveD: Sharing a GPU Cluster for Deep Learning with Guarantees
OSDI 2020
Lijunyi at SemEval-2020 Task 4: An ALBERT Model Based Maximum Ensemble with Different Training Sizes and Depths for Commonsense Validation and Explanation
SEMEVAL 2020
Lee at SemEval-2020 Task 5: ALBERT Model Based on the Maximum Ensemble Strategy and Different Data Sampling Methods for Detecting Counterfactual Statements
SEMEVAL 2020
Focus-Constrained Attention Mechanism for CVAE-based Response Generation
EMNLP 2020
Coarse-to-Fine Pre-training for Named Entity Recognition
EMNLP 2020
The Kelly Growth Optimal Portfolio with Ensemble Learning
AAAI 2019
Finding Justifications by Approximating Core for Large-scale Ontologies
IJCAI 2019
MAT-Net: Medial Axis Transform Network for 3D Object Recognition
IJCAI 2019
Adaptive Convolution for Multi-Relational Learning
NAACL 2019
Beyond Word Attention: Using Segment Attention in Neural Relation Extraction
IJCAI 2019
Neural Collective Entity Linking Based on Recurrent Random Walk Network Learning
IJCAI 2019
YNU NLP at SemEval-2019 Task 5: Attention and Capsule Ensemble for Identifying Hate Speech
SEMEVAL 2019
YNUWB at SemEval-2019 Task 6: K-max pooling CNN with average meta-embedding for identifying offensive language
SEMEVAL 2019
YNU-junyi in BioNLP-OST 2019: Using CNN-LSTM Model with Embeddings for SeeDev Binary Event Extraction
EMNLP 2019
Boundary Perception Guidance: A Scribble-Supervised Semantic Segmentation Approach
IJCAI 2019
Low Shot Box Correction for Weakly Supervised Object Detection
IJCAI 2019
An Adaptive Hierarchical Compositional Model for Phrase Embedding
IJCAI 2018
Where to Prune: Using LSTM to Guide End-to-end Pruning
IJCAI 2018
Implicit Non-linear Similarity Scoring for Recognizing Unseen Classes
IJCAI 2018
Improving Knowledge Graph Embedding Using Simple Constraints
ACL 2018
Multi-Stage Multi-Recursive-Input Fully Convolutional Networks for Neuronal Boundary Detection
ICCV 2017
Can Walking and Measuring Along Chord Bunches Better Describe Leaf Shapes?
CVPR 2017
Attentive Path Combination for Knowledge Graph Completion
ACML 2017
Knowledge Base Completion via Coupled Path Ranking
ACL 2016
Relation Extraction with Multi-instance Multi-label Convolutional Neural Networks
COLING 2016
Jointly Embedding Knowledge Graphs and Logical Rules
EMNLP 2016
Multi-Granularity Chinese Word Embedding
EMNLP 2016
Context-Dependent Knowledge Graph Embedding
EMNLP 2015
Trans-dimensional Random Fields for Language Modeling
ACL 2015
Automatic Thumbnail Generation Based on Visual Representativeness and Foreground Recognizability
ICCV 2015
Semantically Smooth Knowledge Graph Embedding
ACL 2015
Knowledge Base Completion Using Embeddings and Rules
IJCAI 2015
Semantically Smooth Knowledge Graph Embedding
IJCNLP 2015
Trans-dimensional Random Fields for Language Modeling
IJCNLP 2015
A Regularized Competition Model for Question Difficulty Estimation in Community Question Answering Services
EMNLP 2014
Using Clustering to Improve Retrieval Evaluation without Relevance Judgments
COLING 2010
Information Retrieval Oriented Word Segmentation based on Character Association Strength Ranking
EMNLP 2008
A Study on Effectiveness of Syntactic Relationship in Dependence Retrieval Model
IJCNLP 2008
Exploiting Parallel Texts for Word Sense Disambiguation: An Empirical Study
ACL 2003