Wei Zhang
325 papers · 2006–2026 · 25 conferences · across top CS/AI conferences
Achievements
Taxonomy Completionist (27)
Keyword Pioneer
Interdisciplinary Bridge
Renaissance Researcher (7)
Hot Topic Early Bird
Academic Marathon (19)
Keyword Trendsetter Combo (5)
Conference Loyalist (40)
Dynamic Duo (33)
Triple Crown
Grand Slam
Mega-Team (30)
Deep Specialist (30)
Topic Evolution
Keyword Champion (7)
Unstoppable (16)
The Questioner (4)
Century Club (307)
Keyword Collector (78)
Prolific Year (20)
Trend Setter
Conference Pioneer
Conferences
AAAI (54)
CVPR (51)
ICCV (32)
ACL (28)
EMNLP (28)
IJCAI (25)
NIPS (22)
ECCV (18)
ICLR (11)
COLING (10)
ICML (8)
IJCNLP (6)
INTERSPEECH (5)
CORL (4)
RSS (4)
MICCAI (4)
ACML (3)
OSDI (3)
WACV (2)
NSDI (2)
NAACL (1)
JMLR (1)
SEMEVAL (1)
UAI (1)
AISTATS (1)
Keywords
large language model (28)
model compression (18)
contrastive learning (18)
semantic segmentation (15)
object detection (14)
knowledge distillation (14)
representation learning (14)
reinforcement learning (13)
neural network (11)
convolutional neural network (11)
multimodal learning (9)
graph neural network (9)
attention mechanism (8)
knowledge graph (8)
video understanding (8)
multimodal large language model (8)
self-supervised learning (8)
model quantization (7)
adversarial learning (7)
deep neural network (7)
Papers
Divide-and-Conquer Decoupled Network for Cross-Domain Few-Shot Segmentation
AAAI 2026
BulletTime4D: Towards High Spatio-Temporal Resolution Dynamic Scene Rendering via Spike-Guided Stereo Vision
AAAI 2026
MAUGen: A Unified Diffusion Approach for Multi-Identity Facial Expression and AU Label Generation
AAAI 2026
Learning Personalised Human Internal Cognition from External Expressive Behaviours for Real Personality Recognition
AAAI 2026
LLM-as-Scheduler: Agentic Workflow Dynamic Scheduling
ACL 2026
MirageBackdoor: A Stealthy Attack that Induces Think-Well-Answer-Wrong Reasoning
ACL 2026
HI-SLAM2: Geometry-Aware Gaussian SLAM for Fast Monocular Scene Reconstruction (Abstract Reprint)
AAAI 2026
SERL: Self-Examining Reinforcement Learning on Open-Domain
AAAI 2026
PHPFND: Detecting Fake News via Post-Hoc Processing of LLMs Hallucination
AAAI 2026
TokenPowerBench: Benchmarking the Power Consumption of LLM Inference
AAAI 2026
Seeing Is Believing: Rich-Context Hallucination Detection for MLLMs via Backward Visual Grounding
AAAI 2026
Sample-specific Modality Diagnosis and Cross-modal Enhancement for Incomplete Multimodal Representations
AAAI 2026
Natural-Language Policies to Executable Decisions: An Interpretable Large Language Model Framework
ACL 2026
A Survey of Inductive Reasoning for Large Language Models
ACL 2026
Evidence-aware Integration and Domain Identification of Spatial Transcriptomics Data
AAAI 2026
UniFit: Towards Universal Virtual Try-on with MLLM-Guided Semantic Alignment
AAAI 2026
Dual-Path Knowledge-Augmented Contrastive Alignment Network for Spatially Resolved Transcriptomics
AAAI 2026
Exploring Surround-View Fisheye Camera 3D Object Detection
AAAI 2026
AdaDrive: Self-Adaptive Slow-Fast System for Language-Grounded Autonomous Driving
ICCV 2025
SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models
ICCV 2025
VLDrive: Vision-Augmented Lightweight MLLMs for Efficient Language-grounded Autonomous Driving
ICCV 2025
Efficient Event Camera Data Pretraining with Adaptive Prompt Fusion
ICCV 2025
LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation
ICCV 2025
VisiPruner: Decoding Discontinuous Cross-Modal Dynamics for Efficient Multimodal LLMs
EMNLP 2025
Beyond Content Relevance: Evaluating Instruction Following in Retrieval Models
ICLR 2025
As Simple as Fine-tuning: LLM Alignment via Bidirectional Negative Feedback Loss
ICLR 2025
FreqPrior: Improving Video Diffusion Models with Frequency Filtering Gaussian Noise
ICLR 2025
SleepSMC: Ubiquitous Sleep Staging via Supervised Multimodal Coordination
ICLR 2025
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
CVPR 2025
EasyCraft: A Robust and Efficient Framework for Automatic Avatar Crafting
CVPR 2025
HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models
CVPR 2025
GROVE: A Generalized Reward for Learning Open-Vocabulary Physical Skill
CVPR 2025
Less Attention is More: Prompt Transformer for Generalized Category Discovery
CVPR 2025
Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark Dataset
CVPR 2025
Decoupled Motion Expression Video Segmentation
CVPR 2025
Matrix Completion with Incomplete Side Information via Orthogonal Complement Projection
ICML 2025
HetSSNet: Spatial-Spectral Heterogeneous Graph Learning Network for Panchromatic and Multispectral Images Fusion
ICML 2025
Reinforcement Learning for Infinite-Dimensional Systems
JMLR 2025
LVPNet: A Latent-variable-based Prediction-driven End-to-end Framework for Lossless Compression of Medical Images
MICCAI 2025
MS-IQA: A Multi-Scale Feature Fusion Network for PET/CT Image Quality Assessment
MICCAI 2025
Towards Boosting LLMs-driven Relevance Modeling with Progressive Retrieved Behavior-augmented Prompting
COLING 2025
Debiasing 6-DOF IMU via Hierarchical Learning of Continuous Bias Dynamics
RSS 2025
E2LLM: Encoder Elongated Large Language Models for Long-Context Understanding and Reasoning
EMNLP 2025
CIKT: A Collaborative and Iterative Knowledge Tracing Framework with Large Language Models
EMNLP 2025
VisFinEval: A Scenario-Driven Chinese Multimodal Benchmark for Holistic Financial Understanding
EMNLP 2025
R-CHAR: A Metacognition-Driven Framework for Role-Playing in Large Language Models
EMNLP 2025
PromptSculptor: Multi-Agent Based Text-to-Image Prompt Optimization
EMNLP 2025
Turning the Tide: Repository-based Code Reflection
EMNLP 2025
PerReactor: Offline Personalised Multiple Appropriate Facial Reaction Generation
AAAI 2025
In2NeCT: Inter-class and Intra-class Neural Collapse Tuning for Semantic Segmentation of Imbalanced Remote Sensing Images
AAAI 2025
Coherency Improved Explainable Recommendation via Large Language Model
AAAI 2025
STAIR: Manipulating Collaborative and Multimodal Information for E-Commerce Recommendation
AAAI 2025
MOOSS: Mask-Enhanced Temporal Contrastive Learning for Smooth State Evolution in Visual Reinforcement Learning
WACV 2025
SaCa: A Highly Compatible Reinforcing Framework for Knowledge Graph Embedding via Structural Pattern Contrast
EMNLP 2025
Discrete-Time Hybrid Automata Learning: Legged Locomotion Meets Skateboarding
RSS 2025
SCOPE: Compress Mathematical Reasoning Steps for Efficient Automated Process Annotation
ACL 2025
Full-Step-DPO: Self-Supervised Preference Optimization with Step-wise Rewards for Mathematical Reasoning
ACL 2025
PBCAT: Patch-Based Composite Adversarial Training against Physically Realizable Attacks on Object Detection
ICCV 2025
Crabs: Consuming Resource via Auto-generation for LLM-DoS Attack under Black-box Settings
ACL 2025
General Compression Framework for Efficient Transformer Object Tracking
ICCV 2025
Learning Implicit Features with Flow-Infused Transformations for Realistic Virtual Try-On
ICCV 2025
Qwen2.5-xCoder: Multi-Agent Collaboration for Multilingual Code Instruction Tuning
ACL 2025
LIFBench: Evaluating the Instruction Following Performance and Stability of Large Language Models in Long-Context Scenarios
ACL 2025
Mis-prompt: Benchmarking Large Language Models for Proactive Error Handling
ACL 2025
Pretraining Context Compressor for Large Language Models with Embedding-Based Memory
ACL 2025
Structure-aware Domain Knowledge Injection for Large Language Models
ACL 2025
IW-Bench: Evaluating Large Multimodal Models for Converting Image-to-Web
ACL 2025
CNNSum: Exploring Long-Context Summarization with Large Language Models in Chinese Novels
ACL 2025
Flow2Code: Evaluating Large Language Models for Flowchart-based Code Generation Capability
ACL 2025
CodeArena: Evaluating and Aligning CodeLLMs on Human Preference
EMNLP 2025
PricingLogic: Evaluating LLMs Reasoning on Complex Tourism Pricing Tasks
EMNLP 2025
Context Guided Transformer Entropy Modeling for Video Compression
ICCV 2025
ILLUME: Illuminating Your LLMs to See, Draw, and Self-Enhance
ICCV 2025
MMReason: An Open-Ended Multi-Modal Multi-Step Reasoning Benchmark for MLLMs Toward AGI
ICCV 2025
Multi-Loco: Unifying Multi-Embodiment Legged Locomotion via Reinforcement Learning Augmented Diffusion
CORL 2025
Generative Visual Foresight Meets Task-Agnostic Pose Estimation in Robotic Table-top Manipulation
CORL 2025
JointDreamer: Ensuring Geometry Consistency and Text Congruence in Text-to-3D Generation via Joint Score Distillation
ECCV 2024
Latent Space Editing in Transformer-Based Flow Matching
AAAI 2024
LaneGraph2Seq: Lane Topology Extraction with Language Model via Vertex-Edge Encoding and Connectivity Enhancement
AAAI 2024
CGMGM: A Cross-Gaussian Mixture Generative Model for Few-Shot Semantic Segmentation
AAAI 2024
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation
AAAI 2024
Symbolic Cognitive Diagnosis via Hybrid Optimization for Intelligent Education Systems
AAAI 2024
Gaussian Process Neural Additive Models
AAAI 2024
Dynamic Token-Pass Transformers for Semantic Segmentation
WACV 2024
Aligning Large Language Models for Controllable Recommendations
ACL 2024
D2LLM: Decomposed and Distilled Large Language Models for Semantic Search
ACL 2024
Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation
ACL 2024
Multi-Task Network Guided Multimodal Fusion for Fake News Detection
ACML 2024
Restricted Isometry Property of Rank-One Measurements with Random Unit-Modulus Vectors
AISTATS 2024
VLMPC: Vision-Language Model Predictive Control for Robotic Manipulation
RSS 2024
Language-Augmented Symbolic Planner for Open-World Task Planning
RSS 2024
Netcastle: Network Infrastructure Testing At Scale
NSDI 2024
TRELM: Towards Robust and Efficient Pre-training for Knowledge-Enhanced Language Models
COLING 2024
Typos Correction Training against Misspellings from Text-to-Text Transformers
COLING 2024
When Diffusion MRI Meets Diffusion Model: A Novel Deep Generative Model for Diffusion MRI Generation
MICCAI 2024
High-resolution Medical Image Translation via Patch Alignment-Based Bidirectional Contrastive Learning
MICCAI 2024
Interpreting and Improving Large Language Models in Arithmetic Calculation
ICML 2024
Locally Private and Robust Multi-Armed Bandits
NIPS 2024
Graph-enhanced Optimizers for Structure-aware Recommendation Embedding Evolution
NIPS 2024
ESNet: Evolution and Succession Network for High-Resolution Salient Object Detection
ICML 2024
Real-time Stereo-based 3D Object Detection for Streaming Perception
NIPS 2024
Enhancing LLM's Cognition via Structurization
NIPS 2024
Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models
CVPR 2024
GeoReF: Geometric Alignment Across Shape Variation for Category-level Object Pose Refinement
CVPR 2024
Event-based Visible and Infrared Fusion via Multi-task Collaboration
CVPR 2024
Decoupled Pseudo-labeling for Semi-Supervised Monocular 3D Object Detection
CVPR 2024
Enhanced Motion-Text Alignment for Image-to-Video Transfer Learning
CVPR 2024
BIVDiff: A Training-Free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models
CVPR 2024
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection
CVPR 2024
Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation
CVPR 2024
EVS-assisted Joint Deblurring Rolling-Shutter Correction and Video Frame Interpolation through Sensor Inverse Modeling
CVPR 2024
Language-Driven Anchors for Zero-Shot Adversarial Robustness
CVPR 2024
SFOD: Spiking Fusion Object Detector
CVPR 2024
Ins-DetCLIP: Aligning Detection Model to Follow Human-Language Instruction
ICLR 2024
PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation
ECCV 2024
Interactive 3D Object Detection with Prompts
ECCV 2024
PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion
ECCV 2024
Norface: Improving Facial Expression Analysis by Identity Normalization
ECCV 2024
OneVOS: Unifying Video Object Segmentation with All-in-One Transformer Framework
ECCV 2024
LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model
ECCV 2024
Alignment-Enhanced Decoding: Defending Jailbreaks via Token-Level Adaptive Refining of Probability Distributions
EMNLP 2024
The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Language Models
EMNLP 2024
Assessing "Implicit" Retrieval Robustness of Large Language Models
EMNLP 2024
Improving Knowledge Graph Completion with Structure-Aware Supervised Contrastive Learning
EMNLP 2024
Don't Forget Your Reward Values: Language Model Alignment via Value-based Calibration
EMNLP 2024
mABC: Multi-Agent Blockchain-inspired Collaboration for Root Cause Analysis in Micro-Services Architecture
EMNLP 2024
Erratum to: 3D-TOGO: Towards Text-Guided Cross-Category 3D Object Generation
AAAI 2023
Translating Images to Road Network: A Non-Autoregressive Sequence-to-Sequence Approach
ICCV 2023
DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability
ICCV 2023
AdaEmbed: Adaptive Embedding for Large-Scale Recommendation Models
OSDI 2023
Gradient-based Sampling for Class Imbalanced Semi-supervised Object Detection
ICCV 2023
CFCG: Semi-Supervised Semantic Segmentation via Cross-Fusion and Contour Guidance Supervision
ICCV 2023
Data-free Knowledge Distillation for Fine-grained Visual Categorization
ICCV 2023
WaterMask: Instance Segmentation for Underwater Imagery
ICCV 2023
Interpretable Debiasing of Vectorized Language Representations with Iterative Orthogonalization
ICLR 2023
Multi-Level Confidence Learning for Trustworthy Multimodal Classification
AAAI 2023
OMPQ: Orthogonal Mixed Precision Quantization
AAAI 2023
SpatialFormer: Semantic and Target Aware Attentions for Few-Shot Learning
AAAI 2023
E2E-LOAD: End-to-End Long-form Online Action Detection
ICCV 2023
BM2CP: Efficient Collaborative Perception with LiDAR-Camera Modalities
CORL 2023
Adaptive Low-Precision Training for Embeddings in Click-Through Rate Prediction
AAAI 2023
Learning to Binarize Continuous Features for Neuro-Rule Networks
IJCAI 2023
Self-Decoupling and Ensemble Distillation for Efficient Segmentation
AAAI 2023
RSPT: Reconstruct Surroundings and Predict Trajectory for Generalizable Active Object Tracking
AAAI 2023
FlowFace: Semantic Flow-Guided Shape-Aware Face Swapping
AAAI 2023
3D-TOGO: Towards Text-Guided Cross-Category 3D Object Generation
AAAI 2023
DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-Training via Word-Region Alignment
CVPR 2023
Semi-DETR: Semi-Supervised Object Detection With Detection Transformers
CVPR 2023
CapDet: Unifying Dense Captioning and Open-World Detection Pretraining
CVPR 2023
HS-Pose: Hybrid Scope Feature Extraction for Category-Level Object Pose Estimation
CVPR 2023
Reading Relevant Feature from Global Representation Memory for Visual Object Tracking
NIPS 2023
OpenLane-V2: A Topology Reasoning Benchmark for Unified 3D HD Mapping
NIPS 2023
MolXPT: Wrapping Molecules with Text for Generative Pre-training
ACL 2023
Explainable Recommendation with Personalized Review Retrieval and Aspect Learning
ACL 2023
GrowCLIP: Data-Aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-Training
ICCV 2023
BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations
EMNLP 2023
LVOS: A Benchmark for Long-term Video Object Segmentation
ICCV 2023
From Complex to Simple: Unraveling the Cognitive Tree for Reasoning with Small Language Models
EMNLP 2023
MRN: Multiplexed Routing Network for Incremental Multilingual Text Recognition
ICCV 2023
Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images
ICCV 2023
Ambiguity-Resistant Semi-Supervised Learning for Dense Object Detection
CVPR 2023
Point2Seq: Detecting 3D Objects As Sequences
CVPR 2022
Directional Self-Supervised Learning for Heavy Image Augmentations
CVPR 2022
A Large-Scale Comprehensive Dataset and Copy-Overlap Aware Evaluation Protocol for Segment-Level Video Copy Detection
CVPR 2022
FERV39k: A Large-Scale Multi-Scene Dataset for Facial Expression Recognition in Videos
CVPR 2022
Class-Aware Contrastive Semi-Supervised Learning
CVPR 2022
PointCLIP: Point Cloud Understanding by CLIP
CVPR 2022
Normalization of Language Embeddings for Cross-Lingual Alignment
ICLR 2022
DIRL: Domain-Invariant Representation Learning for Generalizable Semantic Segmentation
AAAI 2022
Multi-Knowledge Aggregation and Transfer for Semantic Segmentation
AAAI 2022
Weakly-Supervised Salient Object Detection Using Point Supervision
AAAI 2022
LCTR: On Awakening the Local Continuity of Transformer for Weakly Supervised Object Localization
AAAI 2022
Towards Generalizeable Semantic Product Search by Text Similarity Pre-training on Search Click Logs
ACL 2022
Deterministic policy gradient: Convergence analysis
UAI 2022
Compression of Generative Pre-trained Language Models via Quantization
ACL 2022
Explore Inter-contrast between Videos via Composition for Weakly Supervised Temporal Sentence Grounding
AAAI 2022
Comprehensive Regularization in a Bi-directional Predictive Network for Video Anomaly Detection
AAAI 2022
Visual Consensus Modeling for Video-Text Retrieval
AAAI 2022
zIO: Accelerating IO-Intensive Applications with Transparent Zero-Copy IO
OSDI 2022
NetHint: White-Box Networking for Multi-Tenant Data Centers
NSDI 2022
Towards Online 3D Bin Packing: Learning Synergies between Packing and Unpacking via DRL
CORL 2022
Bayesian Optimization with Clustering and Rollback for CNN Auto Pruning
ECCV 2022
Diverse Learner: Exploring Diverse Supervision for Semi-Supervised Object Detection
ECCV 2022
Data-Efficient Backdoor Attacks
IJCAI 2022
Tip-Adapter: Training-Free Adaption of CLIP for Few-Shot Classification
ECCV 2022
DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection
NIPS 2022
Robustness to Unbounded Smoothness of Generalized SignSGD
NIPS 2022
Leveraging the Hints: Adaptive Bidding in Repeated First-Price Auctions
NIPS 2022
Wukong: A 100 Million Large-scale Chinese Cross-modal Pre-training Benchmark
NIPS 2022
Responsive Listening Head Generation: A Benchmark Dataset and Baseline
ECCV 2022
CODA: A Real-World Road Corner Case Dataset for Object Detection in Autonomous Driving
ECCV 2022
G-MAP: General Memory-Augmented Pre-trained Language Model for Domain Tasks
EMNLP 2022
ISDNet: Integrating Shallow and Deep Networks for Efficient Ultra-High Resolution Segmentation
CVPR 2022
LPSNet: A Lightweight Solution for Fast Panoptic Segmentation
CVPR 2021
Post-Training Quantization for Vision Transformer
NIPS 2021
Scalable Rule-Based Representation Learning for Interpretable Classification
NIPS 2021
Exploiting Relationship for Complex-scene Image Generation
AAAI 2021
Unsupervised Active Learning via Subspace Learning
AAAI 2021
Non-asymptotic Convergence of Adam-type Reinforcement Learning Algorithms under Markovian Sampling
AAAI 2021
Multi-modal Multi-label Emotion Recognition with Heterogeneous Hierarchical Message Passing
AAAI 2021
Graph-Based Tri-Attention Network for Answer Ranking in CQA
AAAI 2021
Circles are like Ellipses, or Ellipses are like Circles? Measuring the Degree of Asymmetry of Static and Contextual Word Embeddings and the Implications to Representation Learning
AAAI 2021
BinaryBERT: Pushing the Limit of BERT Quantization
ACL 2021
On Sample Based Explanation Methods for NLP: Faithfulness, Efficiency and Semantic Evaluation
ACL 2021
Don't Miss the Labels: Label-semantic Augmented Meta-Learner for Few-Shot Text Classification
ACL 2021
Points As Queries: Weakly Semi-Supervised Object Detection by Points
CVPR 2021
Source-Free Domain Adaptation for Semantic Segmentation
CVPR 2021
Mesh Saliency: An Independent Perceptual Measure or a Derivative of Image Saliency?
CVPR 2021
Focus on Local: Detecting Lane Marker From Bottom Up via Key Point
CVPR 2021
Zero-Shot Adversarial Quantization
CVPR 2021
HourNAS: Extremely Fast Neural Architecture Search Through an Hourglass Lens
CVPR 2021
UAV-Human: A Large Benchmark for Human Behavior Understanding With Unmanned Aerial Vehicles
CVPR 2021
Discrimination-Aware Mechanism for Fine-Grained Representation Learning
CVPR 2021
Learning a Facial Expression Embedding Disentangled From Identity
CVPR 2021
Joint-DetNAS: Upgrade Your Detector With NAS, Pruning and Dynamic Distillation
CVPR 2021
G-DetKD: Towards General Distillation Framework for Object Detectors via Contrastive and Semantic-Guided Feature Imitation
ICCV 2021
Box-Aware Feature Enhancement for Single Object Tracking on Point Clouds
ICCV 2021
Exploring Geometry-Aware Contrast and Clustering Harmonization for Self-Supervised 3D Object Detection
ICCV 2021
C3-SemiSeg: Contrastive Semi-Supervised Segmentation via Cross-Set Learning and Dynamic Class-Balancing
ICCV 2021
SCC: an efficient deep reinforcement learning agent mastering the game of StarCraft II
ICML 2021
Weakly-Supervised Spatio-Temporal Anomaly Detection in Surveillance Video
IJCAI 2021
Drop Redundant, Shrink Irrelevant: Selective Knowledge Injection for Language Pretraining
IJCAI 2021
Mental Models of AI Agents in a Cooperative Game Setting (Extended Abstract)
IJCAI 2021
BinaryBERT: Pushing the Limit of BERT Quantization
IJCNLP 2021
On Sample Based Explanation Methods for NLP: Faithfulness, Efficiency and Semantic Evaluation
IJCNLP 2021
Don't Miss the Labels: Label-semantic Augmented Meta-Learner for Few-Shot Text Classification
IJCNLP 2021
4-Bit Quantization of LSTM-Based Speech Recognition Models
INTERSPEECH 2021
Look-Into-Object: Self-Supervised Structure Modeling for Object Recognition
CVPR 2020
Knowledge Association with Hyperbolic Knowledge Graph Embeddings
EMNLP 2020
Dynamic Anticipation and Completion for Multi-Hop Reasoning over Sparse Knowledge Graph
EMNLP 2020
TernaryBERT: Distillation-aware Ultra-low Bit BERT
EMNLP 2020
Towards Stabilizing Batch Statistics in Backward Propagation of Batch Normalization
ICLR 2020
Towards Better Understanding of Adaptive Gradient Algorithms in Generative Adversarial Nets
ICLR 2020
Unsupervised Multi-View CNN for Salient View Selection of 3D Objects and Scenes
ECCV 2020
GeoLayout: Geometry Driven Room Layout Estimation Based on Depth Maps of Planes
ECCV 2020
CurveLane-NAS: Unifying Lane-Sensitive Architecture Search and Adaptive Point Blending
ECCV 2020
Classes Matter: A Fine-grained Adversarial Approach to Cross-domain Semantic Segmentation
ECCV 2020
HARD-Net: Hardness-AwaRe Discrimination Network for 3D Early Activity Prediction
ECCV 2020
Segment as Points for Efficient Online Multi-Object Tracking and Segmentation
ECCV 2020
History-Gradient Aided Batch Size Adaptation for Variance Reduced Algorithms
ICML 2020
CAUSE: Learning Granger Causality from Event Sequences using Attribution Methods
ICML 2020
Finite-Time Analysis for Double Q-learning
NIPS 2020
Adaptive Fractional Dilated Convolution Network for Image Aesthetics Assessment
CVPR 2020
SP-NAS: Serial-to-Parallel Backbone Search for Object Detection
CVPR 2020
OpenUE: An Open Toolkit of Universal Extraction from Text
EMNLP 2020
Analysis of Q-learning with Adaptation and Momentum Restart for Gradient Descent
IJCAI 2020
Global Structure and Local Semantics-Preserved Embeddings for Entity Alignment
IJCAI 2020
ScaleCom: Scalable Sparsified Gradient Compression for Communication-Efficient Distributed Training
NIPS 2020
Kernel Based Progressive Distillation for Adder Neural Networks
NIPS 2020
Online Decision Based Visual Tracking via Reinforcement Learning
NIPS 2020
A Decentralized Parallel Algorithm for Training Generative Adversarial Nets
NIPS 2020
Residual Distillation: Towards Portable Deep Neural Networks without Shortcuts
NIPS 2020
Towards Accurate and Consistent Evaluation: A Dataset for Distantly-Supervised Relation Extraction
COLING 2020
Bridging Text and Knowledge with Multi-Prototype Embedding for Few-Shot Relational Triple Extraction
COLING 2020
NUT-RC: Noisy User-generated Text-oriented Reading Comprehension
COLING 2020
Improving Relation Extraction with Relational Paraphrase Sentences
COLING 2020
SM-NAS: Structural-to-Modular Neural Architecture Search for Object Detection
AAAI 2020
ZoomNet: Part-Aware Adaptive Zooming Neural Network for 3D Object Detection
AAAI 2020
Learning Long- and Short-Term User Literal-Preference with Multimodal Hierarchical Transformer Network for Personalized Image Caption
AAAI 2020
Improving Domain-Adapted Sentiment Classification by Deep Adversarial Mutual Learning
AAAI 2020
Improving Neural Relation Extraction with Positive and Unlabeled Learning
AAAI 2020
Transparent Classification with Multilayer Logical Perceptrons and Random Binarization
AAAI 2020
Knowledge Graph Alignment Network with Gated Multi-Hop Neighborhood Aggregation
AAAI 2020
Model Rubik's Cube: Twisting Resolution, Depth and Width for TinyNets
NIPS 2020
How Can Self-Attention Networks Recognize Dyck-n Languages?
EMNLP 2020
Summarizing Chinese Medical Answer with Graph Convolution Networks and Question-focused Dual Attention
EMNLP 2020
Software Component Prediction for Bug Reports
ACML 2019
Exploring the Task Cooperation in Multi-goal Visual Navigation
IJCAI 2019
MSR: Multi-Scale Shape Regression for Scene Text Detection
IJCAI 2019
End-to-End Multi-Perspective Matching for Entity Resolution
IJCAI 2019
Multi-Granular Text Encoding for Self-Explaining Categorization
ACL 2019
Hierarchical Photo-Scene Encoder for Album Storytelling
AAAI 2019
Embedding Complementary Deep Networks for Image Classification
CVPR 2019
Destruction and Construction Learning for Fine-Grained Image Recognition
CVPR 2019
Meta Relational Learning for Few-Shot Link Prediction in Knowledge Graphs
IJCNLP 2019
VrR-VG: Refocusing Visually-Relevant Relationships
ICCV 2019
Hybrid 8-bit Floating Point (HFP8) Training and Inference for Deep Neural Networks
NIPS 2019
Large-Scale Mixed-Bandwidth Deep Neural Network Acoustic Modeling for Automatic Speech Recognition
INTERSPEECH 2019
A Highly Efficient Distributed Deep Learning System for Automatic Speech Recognition
INTERSPEECH 2019
Unsupervised Person Image Generation With Semantic Parsing Transformation
CVPR 2019
Meta Relational Learning for Few-Shot Link Prediction in Knowledge Graphs
EMNLP 2019
NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning
ICLR 2019
Long-tail Relation Extraction via Knowledge Graph Embeddings and Graph Convolution Networks
NAACL 2019
Sampling Wisely: Deep Image Embedding by Top-K Precision Optimization
ICCV 2019
Auto-FPN: Automatic Network Architecture Adaptation for Object Detection Beyond Classification
ICCV 2019
Controllable Video Captioning With POS Sequence Guidance Based on Gated Fusion Network
ICCV 2019
Attention-Based Capsule Networks with Dynamic Routing for Relation Extraction
EMNLP 2018
Evolutionary Stochastic Gradient Descent for Optimization of Deep Neural Networks
NIPS 2018
Improving Entity Recommendation with Search Log and Multi-Task Learning
IJCAI 2018
Evidence Aggregation for Answer Re-Ranking in Open-Domain Question Answering
ICLR 2018
Master-Slave Curriculum Design for Reinforcement Learning
IJCAI 2018
Image-level to Pixel-wise Labeling: From Theory to Practice
IJCAI 2018
Asynchronous Decentralized Parallel Stochastic Gradient Descent
ICML 2018
Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition
CVPR 2018
A Preliminary Study on Tonal Coarticulation in Continuous Speech
INTERSPEECH 2018
Reconstruction Network for Video Captioning
CVPR 2018
Convolutional Memory Blocks for Depth Data Representation Learning
IJCAI 2018
Learning Sequential Correlation for User Generated Textual Content Popularity Prediction
IJCAI 2018
Label-Free Distant Supervision for Relation Extraction via Knowledge Graph Embedding
EMNLP 2018
Adaptive Sampling Scheme for Learning in Severely Imbalanced Large Scale Data
ACML 2017
Model Accuracy and Runtime Tradeoff in Distributed Deep Learning: A Systematic Study
IJCAI 2017
Learning to Explain Entity Relationships by Pairwise Ranking with Convolutional Neural Networks
IJCAI 2017
Binarized Mode Seeking for Scalable Visual Pattern Discovery
CVPR 2017
Can Decentralized Algorithms Outperform Centralized Algorithms? A Case Study for Decentralized Parallel Stochastic Gradient Descent
NIPS 2017
Reanalyze Fundamental Frequency Peak Delay in Mandarin
INTERSPEECH 2017
Model-Based Deep Hand Pose Estimation
IJCAI 2016
Staleness-Aware Async-SGD for Distributed Deep Learning
IJCAI 2016
Collaborative Multi-Level Embedding Learning from Reviews for Rating Prediction
IJCAI 2016
Multiple Granularity Descriptors for Fine-Grained Categorization
ICCV 2015
Prior-Based Dual Additive Latent Dirichlet Allocation for User-Item Connected Documents
IJCAI 2015
A Spatio-Temporal Appearance Representation for Video-Based Pedestrian Re-Identification
ICCV 2015
Weakly Supervised Semantic Segmentation for Social Images
CVPR 2015
Omni-word Feature and Soft Constraint for Chinese Relation Extraction
ACL 2014
Multi-View Embedding Learning for Incompletely Labeled Data
IJCAI 2013
Integrating Semantic Relatedness and Words' Intrinsic Features for Keyword Extraction
IJCAI 2013
Dimensionality Reduction with Generalized Linear Models
IJCAI 2013
Learning a Replacement Model for Query Segmentation with Consistency in Search Logs
IJCNLP 2013
Sparse Reconstruction for Weakly Supervised Semantic Segmentation
IJCAI 2013
Automated Concurrency-Bug Fixing
OSDI 2012
A Lazy Learning Model for Entity Linking using Query-Specific Information
COLING 2012
A Wikipedia-LDA Model for Entity Linking with Batch Size Changing Instance Selection
IJCNLP 2011
Entity Linking Leveraging Automatically Generated Annotation
COLING 2010
HIT-IR-WSD: A WSD System for English Lexical Sample Task
SEMEVAL 2007
An Iterative Implicit Feedback Approach to Personalized Search
ACL 2006
An Iterative Implicit Feedback Approach to Personalized Search
COLING 2006