Tong Zhang
333 papers · 2001–2026 · 22 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+19 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (62) π§ Keyword Pioneer π Renaissance Researcher (9) π Interdisciplinary Bridge π£ Hot Topic Early Bird
π§
Keyword Pioneer
π
Renaissance Researcher
(9)
πΊοΈ
Taxonomy Completionist
(62)
π
Keyword Trendsetter Combo
(10)
π
Conference Loyalist
(59)
πΊ
Lone Wolf
(6)
π¬
Deep Specialist
(29)
π±
Topic Pioneer
π
Grand Slam
π
Triple Crown
π€
Dynamic Duo
(21)
π
Keyword Champion
(3)
π₯
Unstoppable
(25)
β‘
Prolific Year
(25)
π
Trend Setter
ποΈ
Keyword Collector
(316)
π
Conference Pioneer
π
Century Club
(326)
β
The Questioner
(2)
Conferences
NIPS (59)
ICML (49)
ACL (35)
CVPR (31)
JMLR (30)
EMNLP (29)
ICLR (20)
AAAI (15)
ECCV (15)
COLT (9)
NAACL (8)
IJCAI (6)
ICCV (6)
CORL (5)
COLING (5)
IJCNLP (3)
CONLL (3)
EACL (1)
MICCAI (1)
AISTATS (1)
NSDI (1)
WACV (1)
Top co-authors
Keywords
large language model
(18)
convex optimization
(14)
representation learning
(13)
stochastic optimization
(12)
regret bound
(12)
gradient descent
(11)
neural machine translation
(10)
stochastic gradient descent
(10)
greedy algorithm
(10)
function approximation
(9)
convolutional neural network
(9)
reinforcement learning
(9)
variance reduction
(8)
reinforcement learning from human feedback
(8)
graph neural network
(8)
domain adaptation
(8)
distributed learning
(7)
transfer learning
(7)
nonconvex optimization
(7)
semi-supervised learning
(7)
Papers
Cross-Domain Few-Shot Learning via Multi-View Collaborative Optimization with Vision-Language Models
AAAI 2026
Monte Carlo Diffusion for Generalizable Learning-Based RANSAC
AAAI 2026
PsyPARSE: Retrieval-Augmented Slow Thinking for Personalized Empathetic Counseling
AAAI 2026
Tackling Distractor Documents in Multi-Hop QA with Reinforcement and Curriculum Learning
EACL 2026
GUIDE: Towards Scalable Advising for Research Ideas
ACL 2026
SciFlow-Bench: Evaluating Structure-Aware Scientific Diagram Generation via Inverse Parsing
ACL 2026
Contextual Relevance and Adaptive Sampling for LLM-Based Document Reranking
ACL 2026
Building Math Agents with Multi-Turn Iterative Preference Learning
ICLR 2025
Personalized Visual Instruction Tuning
ICLR 2025
MA-LoT: Model-Collaboration Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem Proving
ICML 2025
Optimal Sample Selection Through Uncertainty Estimation and Its Application in Deep Learning
JMLR 2025
CANDY: Benchmarking LLMsβ Limitations and Assistive Potential in Chinese Misinformation Fact-Checking
EMNLP 2025
ALRPHFS: Adversarially Learned Risk Patterns with Hierarchical Fast & Slow Reasoning for Robust Agent Defense
EMNLP 2025
HuB: Learning Extreme Humanoid Balance
CORL 2025
Catoni Contextual Bandits are Robust to Heavy-tailed Rewards
ICML 2025
EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents
ICML 2025
Letβs Reason Formally: Natural-Formal Hybrid Reasoning Enhances LLMβs Math Capability
EMNLP 2025
TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data
NAACL 2025
Pre-training CLIP against Data Poisoning with Optimal Transport-based Matching and Alignment
EMNLP 2025
Demystifying Singular Defects in Large Language Models
ICML 2025
Scene Graph-Grounded Image Generation
AAAI 2025
FANS: Formal Answer Selection for LLM Natural Language Math Reasoning Using Lean4
EMNLP 2025
Self-Ensembling Gaussian Splatting for Few-Shot Novel View Synthesis
ICCV 2025
Refining CLIP's Spatial Awareness: A Visual-Centric Perspective
ICLR 2025
FDS: Frequency-Aware Denoising Score for Text-Guided Latent Diffusion Image Editing
CVPR 2025
Scaling Mesh Generation via Compressive Tokenization
CVPR 2025
Generating Multimodal Driving Scenes via Next-Scene Prediction
CVPR 2025
Distribution Prototype Diffusion Learning for Open-set Supervised Anomaly Detection
CVPR 2025
DFMU: Distribution-based Framework for Modeling Aleatoric Uncertainty in Multimodal Sentiment Analysis
IJCAI 2025
Going Beyond Consistency: Target-oriented Multi-view Graph Neural Network
IJCAI 2025
MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning
EMNLP 2025
FPE2M2: Approaching Lossless and Efficient Quantization with Native Floating Point
ACL 2025
Logarithmic Regret for Online KL-Regularized Reinforcement Learning
ICML 2025
Bridge-Coder: Transferring Model Capabilities from High-Resource to Low-Resource Programming Language
ACL 2025
ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting
ACL 2025
A Parameter-Efficient and Fine-Grained Prompt Learning for Vision-Language Models
ACL 2025
From Lists to Emojis: How Format Bias Affects Model Alignment
ACL 2025
One QuantLLM for ALL: Fine-tuning Quantized LLMs Once for Efficient Deployments
ACL 2025
AlignDistil: Token-Level Language Model Alignment as Adaptive Policy Distillation
ACL 2025
Incongruity-aware Tension Field Network for Multi-modal Sarcasm Detection
ACL 2025
TWIST: Text-encoder Weight-editing for Inserting Secret Trojans in Text-to-Image Models
ACL 2025
SEOE: A Scalable and Reliable Semantic Evaluation Framework for Open Domain Event Detection
ACL 2025
Understanding Overadaptation in Supervised Fine-Tuning: The Role of Ensemble Methods
ICML 2025
ELABORATION: A Comprehensive Benchmark on Human-LLM Competitive Programming
ACL 2025
MatchDiffusion: Training-free Generation of Match-Cuts
ICCV 2025
TimeBooth: Disentangled Facial Invariant Representation for Diverse and Personalized Face Aging
ICCV 2025
An Orthogonal High-Rank Adaptation for Large Language Models
EMNLP 2025
AdaGrad under Anisotropic Smoothness
ICLR 2025
Rotated Runtime Smooth: Training-Free Activation Smoother for accurate INT4 inference
ICLR 2025
PivotMesh: Generic 3D Mesh Generation via Pivot Vertices Guidance
ICLR 2025
Faster Sampling without Isoperimetry via Diffusion-based Monte Carlo
COLT 2024
3D-Aware Hypothesis & Verification for Generalizable Relative Object Pose Estimation
ICLR 2024
Reverse Diffusion Monte Carlo
ICLR 2024
A unique M-pattern for micro-expression spotting in long videos
ICLR 2024
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption
ICLR 2024
Accelerated Convergence of Stochastic Heavy Ball Method under Anisotropic Gradient Noise
ICLR 2024
Spurious Feature Diversification Improves Out-of-distribution Generalization
ICLR 2024
Mind Your Augmentation: The Key to Decoupling Dense Self-Supervised Learning
ICLR 2024
Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithms
NIPS 2024
AdanCA: Neural Cellular Automata As Adaptors For More Robust Vision Transformer
NIPS 2024
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning
NIPS 2024
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs
NIPS 2024
Online Iterative Reinforcement Learning from Human Feedback with General Preference Model
NIPS 2024
Reverse Transition Kernel: A Flexible Framework to Accelerate Diffusion Inference
NIPS 2024
Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions
NIPS 2024
A Sober Look at the Robustness of CLIPs to Spurious Features
NIPS 2024
Reinforcement Learning with Foundation Priors: Let Embodied Agent Efficiently Learn on Its Own
CORL 2024
General Flow as Foundation Affordance for Scalable Robot Learning
CORL 2024
Leveraging Locality to Boost Sample Efficiency in Robotic Manipulation
CORL 2024
CVTHead: One-Shot Controllable Head Avatar With Vertex-Feature Transformer
WACV 2024
PipeNet: Question Answering with Semantic Pruning over Knowledge Graphs
NAACL 2024
TagFog: Textual Anchor Guidance and Fake Outlier Generation for Visual Out-of-Distribution Detection
AAAI 2024
Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning
ICML 2024
Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption
ICML 2024
Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-constraint
ICML 2024
Faster Sampling via Stochastic Gradient Proximal Sampler
ICML 2024
Towards Better Generalization in Open-Domain Question Answering by Mitigating Context Memorization
NAACL 2024
Active Prompting with Chain-of-Thought for Large Language Models
ACL 2024
Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards
ACL 2024
Enhancing Dialogue State Tracking Models through LLM-backed User-Agents Simulation
ACL 2024
CLAMBER: A Benchmark of Identifying and Clarifying Ambiguous Information Needs in Large Language Models
ACL 2024
RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models
ACL 2024
VeraCT Scan: Retrieval-Augmented Fake News Detection with Justifiable Reasoning
ACL 2024
Plum: Prompt Learning using Metaheuristics
ACL 2024
The Non-linear $F$-Design and Applications to Interactive Learning
ICML 2024
Submodular-based In-context Example Selection for LLMs-based Machine Translation
COLING 2024
LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models
NAACL 2024
R-Tuning: Instructing Large Language Models to Say βI Donβt Knowβ
NAACL 2024
Multi-Scale Prompt Memory-Augmented Model for Black-Box Scenarios
NAACL 2024
SiFT: A Serial Framework with Textual Guidance for Federated Learning
MICCAI 2024
Desigen: A Pipeline for Controllable Design Template Generation
CVPR 2024
Mitigating Object Dependencies: Improving Point Cloud Self-Supervised Learning through Object Exchange
CVPR 2024
DVMNet: Computing Relative Pose for Unseen Objects Beyond Hypotheses
CVPR 2024
PerceptionGPT: Effectively Fusing Visual Perception into LLM
CVPR 2024
InNeRF360: Text-Guided 3D-Consistent Object Inpainting on 360-degree Neural Radiance Fields
CVPR 2024
On the Impact of Hard Adversarial Instances on Overfitting in Adversarial Training
JMLR 2024
SINDER: Repairing the Singular Defects of DINOv2
ECCV 2024
An Incremental Unified Framework for Small Defect Inspection
ECCV 2024
Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization
ECCV 2024
Data Augmentation via Latent Diffusion for Saliency Prediction
ECCV 2024
PAPAL: A Provable PArticle-based Primal-Dual ALgorithm for Mixed Nash Equilibrium
JMLR 2024
Fast Rates in Pool-Based Batch Active Learning
JMLR 2024
Strength Lies in Differences! Improving Strategy Planning for Non-collaborative Dialogues via Diversified User Simulation
EMNLP 2024
Mitigating the Alignment Tax of RLHF
EMNLP 2024
TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts
EMNLP 2024
MLLM-Protector: Ensuring MLLMβs Safety without Hurting Performance
EMNLP 2024
The Instinctive Bias: Spurious Images lead to Illusion in MLLMs
EMNLP 2024
TensorOpera Router: A Multi-Model Router for Efficient LLM Inference
EMNLP 2024
Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts
EMNLP 2024
On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference Optimization
EMNLP 2024
Learn and Sample Together: Collaborative Generation for Graphic Design Layout
IJCAI 2023
Learning in POMDPs is Sample-Efficient with Hindsight Observability
ICML 2023
TempSAL - Uncovering Temporal Information for Deep Saliency Prediction
CVPR 2023
Spatiotemporal Self-Supervised Learning for Point Clouds in the Wild
CVPR 2023
VolRecon: Volume Rendering of Signed Ray Distance Functions for Generalizable Multi-View Reconstruction
CVPR 2023
DyNCA: Real-Time Dynamic Texture Synthesis Using Neural Cellular Automata
CVPR 2023
Doolittle: Benchmarks and Corpora for Academic Writing Formalization
EMNLP 2023
NEMTO: Neural Environment Matting for Novel View and Relighting Synthesis of Transparent Objects
ICCV 2023
Beyond Uniform Lipschitz Condition in Differentially Private Optimization
ICML 2023
On the Convergence of Federated Averaging with Cyclic Client Participation
ICML 2023
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?
ICML 2023
Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game
ICLR 2023
Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to Pre-trained Language Modelsβ Memories
ACL 2023
A Theoretical Analysis of Optimistic Proximal Policy Optimization in Linear Markov Decision Processes
NIPS 2023
A Universal Semantic-Geometric Representation for Robotic Manipulation
CORL 2023
Generalized Polyak Step Size for First Order Optimization with Momentum
ICML 2023
Catalyst Acceleration of Error Compensated Methods Leads to Better Communication Complexity
AISTATS 2023
Deep Graph Structural Infomax
AAAI 2023
Covariate-Shift Generalization via Random Sample Weighting
AAAI 2023
Particle-based Variational Inference with Preconditioned Functional Gradient Flow
ICLR 2023
Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes
ICML 2023
Inconsistency, Instability, and Generalization Gap of Deep Neural Network Training
NIPS 2023
Multi-Consensus Decentralized Accelerated Gradient Descent
JMLR 2023
Posterior Sampling for Competitive RL: Function Approximation and Partial Observation
NIPS 2023
Corruption-Robust Offline Reinforcement Learning with General Function Approximation
NIPS 2023
Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage
NIPS 2023
Double Randomized Underdamped Langevin with Dimension-Independent Convergence Guarantee
NIPS 2023
Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data
EMNLP 2023
Towards Effective Automatic Debt Collection with Persona Awareness
EMNLP 2023
DetGPT: Detect What You Need via Reasoning
EMNLP 2023
VO$Q$L: Towards Optimal Regret in Model-free RL with Nonlinear Function Approximation
COLT 2023
Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency
COLT 2023
Weakly Supervised Disentangled Generative Causal Representation Learning
JMLR 2022
Multilingual Word Sense Disambiguation with Unified Sense Representation
COLING 2022
Speeding up Transformer Decoding via an Attention Refinement Network
COLING 2022
Minimax Regret Optimization for Robust Machine Learning under Distribution Shift
COLT 2022
Non-Linear Reinforcement Learning in Large Action Spaces: Structural Conditions and Sample-efficiency of Posterior Sampling
COLT 2022
Probabilistic Bilevel Coreset Selection
ICML 2022
Sparse Invariant Risk Minimization
ICML 2022
HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning
ICLR 2022
Eigencurve: Optimal Learning Rate Schedule for SGD on Quadratic Objectives with Skewed Hessian Spectrums
ICLR 2022
Exploiting Hybrid Semantics of Relation Paths for Multi-hop Question Answering over Knowledge Graphs
COLING 2022
Bayesian Invariant Risk Minimization
CVPR 2022
Exploring Geometric Consistency for Monocular 3D Object Detection
CVPR 2022
MulT: An End-to-End Multitask Learning Transformer
CVPR 2022
Leverage Your Local and Global Representations: A New Self-Supervised Learning Strategy
CVPR 2022
Increasing Visual Awareness in Multimodal Neural Machine Translation from an Information Theoretic Perspective
EMNLP 2022
MICO: A Multi-alternative Contrastive Learning Framework for Commonsense Knowledge Representation
EMNLP 2022
History-Aware Hierarchical Transformer for Multi-session Open-domain Dialogue System
EMNLP 2022
Model Agnostic Sample Reweighting for Out-of-Distribution Learning
ICML 2022
Frequency-Aware Contrastive Learning for Neural Machine Translation
AAAI 2022
Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets
ICML 2022
A Theoretical Analysis on Independence-driven Importance Weighting for Covariate-shift Generalization
ICML 2022
A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games
ICML 2022
Benefits of Overparameterized Convolutional Residual Networks: Function Approximation under Smoothness Constraint
ICML 2022
Achieving Minimax Rates in Pool-Based Batch Active Learning
ICML 2022
When is the Convergence Time of Langevin Algorithms Dimension Independent? A Composite Optimization Viewpoint
JMLR 2022
Rare and Zero-shot Word Sense Disambiguation using Z-Reweighting
ACL 2022
Toward Knowledge-Enriched Conversational Recommendation Systems
ACL 2022
Semi-Supervised Monocular 3D Object Detection by Multi-View Consistency
ECCV 2022
RC-MVSNet: Unsupervised Multi-View Stereo with Neural Rendering
ECCV 2022
Model-based RL with Optimistic Posterior Sampling: Structural Conditions and Sample Complexity
NIPS 2022
Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial Corruptions
NIPS 2022
TILGAN: Transformer-based Implicit Latent GAN for Diverse and Coherent Text Generation
ACL 2021
Error Compensated Distributed SGD Can Be Accelerated
NIPS 2021
Efficient Neural Network Training via Forward and Backward Propagation Sparsification
NIPS 2021
Point, Disambiguate and Copy: Incorporating Bilingual Dictionaries for Neural Machine Translation
IJCNLP 2021
Taming Pre-trained Language Models with N-gram Representations for Low-Resource Domain Adaptation
IJCNLP 2021
G-DetKD: Towards General Distillation Framework for Object Detectors via Contrastive and Semantic-Guided Feature Imitation
ICCV 2021
A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning
NIPS 2021
Graph Deformer Network
IJCAI 2021
Joint-DetNAS: Upgrade Your Detector With NAS, Pruning and Dynamic Distillation
CVPR 2021
Effective Sparsification of Neural Networks With Global Sparsity Constraint
CVPR 2021
Uncertainty-Aware Joint Salient Object and Camouflaged Object Detection
CVPR 2021
Few-Shot Human Motion Transfer by Personalized Geometry and Texture Modeling
CVPR 2021
Involution: Inverting the Inherence of Convolution for Visual Recognition
CVPR 2021
Deep Wasserstein Graph Discriminant Learning for Graph Classification
AAAI 2021
Graph Game Embedding
AAAI 2021
DeEPCA: Decentralized Exact PCA with Linear Convergence Rate
JMLR 2021
TILGAN: Transformer-based Implicit Latent GAN for Diverse and Coherent Text Generation
IJCNLP 2021
Reinforced Attention for Few-Shot Learning and Beyond
CVPR 2021
TransNAS-Bench-101: Improving Transferability and Generalizability of Cross-Task Neural Architecture Search
CVPR 2021
Multi-Hop Transformer for Document-Level Machine Translation
NAACL 2021
Modeling from Features: a Mean-field Framework for Over-parameterized Deep Neural Networks
COLT 2021
Wasserstein Coupled Graph Learning for Cross-Modal Retrieval
ICCV 2021
Taming Pre-trained Language Models with N-gram Representations for Low-Resource Domain Adaptation
ACL 2021
Point, Disambiguate and Copy: Incorporating Bilingual Dictionaries for Neural Machine Translation
ACL 2021
Improving Chinese Word Segmentation with Wordhood Memory Networks
ACL 2020
Bridging the Gap between Sample-based and One-shot Neural Architecture Search with BONAS
NIPS 2020
How to Characterize The Landscape of Overparameterized Convolutional Neural Networks
NIPS 2020
Residual Distillation: Towards Portable Deep Neural Networks without Shortcuts
NIPS 2020
A Generalized Neural Tangent Kernel Analysis for Two-layer Neural Networks
NIPS 2020
Decentralized Accelerated Proximal Gradient Descent
NIPS 2020
Model Rubikβs Cube: Twisting Resolution, Depth and Width for TinyNets
NIPS 2020
Stochastic Recursive Gradient Descent Ascent for Stochastic Nonconvex-Strongly-Concave Minimax Problems
NIPS 2020
Variational Pathway Reasoning for EEG Emotion Recognition
AAAI 2020
Stable Learning via Sample Reweighting
AAAI 2020
Optimal Feature Transport for Cross-View Image Geo-Localization
AAAI 2020
Joint Chinese Word Segmentation and Part-of-speech Tagging via Two-way Attentions of Auto-analyzed Knowledge
ACL 2020
Pattern-Structure Diffusion for Multi-Task Learning
CVPR 2020
Cross-Modal Pattern-Propagation for RGB-T Tracking
CVPR 2020
UC-Net: Uncertainty Inspired RGB-D Saliency Detection via Conditional Variational Autoencoders
CVPR 2020
Synthetic Learning: Learn From Distributed Asynchronized Discriminator GAN Without Sharing Medical Image Data
CVPR 2020
MiLeNAS: Efficient Neural Architecture Search via Mixed-Level Reformulation
CVPR 2020
CATCH: Context-based Meta Reinforcement Learning for Transferrable Architecture Search
ECCV 2020
Graph Wasserstein Correlation Analysis for Movie Retrieval
ECCV 2020
Improving Constituency Parsing with Span Attention
EMNLP 2020
ZEN: Pre-training Chinese Text Encoder Enhanced by N-gram Representations
EMNLP 2020
Black-Box Adversarial Attack with Transferable Model-based Embedding
ICLR 2020
Graph inference learning for semi-supervised classification
ICLR 2020
Guided Learning of Nonconvex Models through Successive Functional Gradient Optimization
ICML 2020
Re-architecting Congestion Management in Lossless Ethernet
NSDI 2020
Neural Collaborative Subspace Clustering
ICML 2019
DHER: Hindsight Experience Replay for Dynamic Goals
ICLR 2019
Layer-Wise Learning Strategy for Nonparametric Tensor Product Smoothing Spline Regression and Graphical Models
JMLR 2019
Robust Frequent Directions with Application in Online Learning
JMLR 2019
Picasso: A Sparse Learning Library for High Dimensional Data Analysis in R and Python
JMLR 2019
Utilizing Second Order Information in Minibatch Stochastic Variance Reduced Proximal Iterations
JMLR 2019
Divergence-Augmented Policy Optimization
NIPS 2019
Dynamic Layer Aggregation for Neural Machine Translation with Routing-by-Agreement
AAAI 2019
Neural Machine Translation with Adequacy-Oriented Learning
AAAI 2019
Grid-Wise Control for Multi-Agent Reinforcement Learning in Video Game AI
ICML 2019
NATTACK: Learning the Distributions of Adversarial Examples for an Improved Black-Box Attack on Deep Neural Networks
ICML 2019
DoubleSqueeze: Parallel Stochastic Gradient Descent with Double-pass Error-Compensated Compression
ICML 2019
Reinforced Training Data Selection for Domain Adaptation
ACL 2019
Sharp Analysis for Nonconvex SGD Escaping from Saddle Points
COLT 2019
Efficient Decision-Based Black-Box Adversarial Attacks on Face Recognition
CVPR 2019
Exploiting Deep Representations for Neural Machine Translation
EMNLP 2018
QuaSE: Sequence Editing under Quantifiable Guidance
EMNLP 2018
Multi-Head Attention with Disagreement Regularization
EMNLP 2018
Super-Identity Convolutional Neural Network for Face Hallucination
ECCV 2018
Candidates vs. Noises Estimation for Large Multi-Class Classification Problem
ICML 2018
Composite Functional Gradient Learning of Generative Adversarial Models
ICML 2018
End-to-end Active Object Tracking via Reinforcement Learning
ICML 2018
An Algorithmic Framework of Variable Metric Over-Relaxed Hybrid Proximal Extra-Gradient Method
ICML 2018
Graphical Nonconvex Optimization via an Adaptive Convex Relaxation
ICML 2018
Error Compensated Quantized SGD and its Applications to Large-scale Distributed Optimization
ICML 2018
Safe Element Screening for Submodular Function Minimization
ICML 2018
Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents
ICML 2018
Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial Networks
ECCV 2018
Video Re-localization
ECCV 2018
Neural Stereoscopic Image Style Transfer
ECCV 2018
Orthogonal Deep Features Decomposition for Age-Invariant Face Recognition
ECCV 2018
Recurrent Fusion Network for Image captioning
ECCV 2018
Modeling Varying Camera-IMU Time Offset in Optimization-Based Visual-Inertial Odometry
ECCV 2018
Deep Unsupervised Saliency Detection: A Multiple Noisy Labeling Perspective
CVPR 2018
A Novel Neural Network Model based on Cerebral Hemispheric Asymmetry for EEG Emotion Recognition
IJCAI 2018
Stochastic Expectation Maximization with Variance Reduction
NIPS 2018
Exponentially Weighted Imitation Learning for Batched Historical Data
NIPS 2018
Communication Compression for Decentralized Training
NIPS 2018
Gradient Sparsification for Communication-Efficient Distributed Optimization
NIPS 2018
SPIDER: Near-Optimal Non-Convex Optimization via Stochastic Path-Integrated Differential Estimator
NIPS 2018
Stochastic Primal-Dual Method for Empirical Risk Minimization with O(1) Per-Iteration Complexity
NIPS 2018
Adaptive Sampling Towards Fast Graph Representation Learning
NIPS 2018
Gradient Hard Thresholding Pursuit
JMLR 2018
Modeling Localness for Self-Attention Networks
EMNLP 2018
Efficient Optimization for Linear Dynamical Systems with Applications to Clustering and Sparse Coding
NIPS 2017
Deep Subspace Clustering Networks
NIPS 2017
A General Distributed Dual Coordinate Optimization Framework for Regularized Loss Minimization
JMLR 2017
Efficient Distributed Learning with Sparsity
ICML 2017
Projection-free Distributed Online Learning in Networks
ICML 2017
Deep Pyramid Convolutional Neural Networks for Text Categorization
ACL 2017
Diffusion Approximations for Online Principal Component Estimation and Global Convergence
NIPS 2017
On Quadratic Convergence of DC Proximal Newton Algorithm in Nonconvex Sparse Learning
NIPS 2017
Towards More Efficient SPSD Matrix Approximation and CUR Matrix Decomposition
JMLR 2016
Sparse Nonlinear Regression: Parameter Estimation under Nonconvexity
ICML 2016
Learning Additive Exponential Family Graphical Models via $\ell_{2,1}$-norm Regularized M-Estimation
NIPS 2016
Exact Recovery of Hard Thresholding Pursuit
NIPS 2016
Supervised and Semi-Supervised Text Categorization using LSTM for Region Embeddings
ICML 2016
Learning Sparse Low-Threshold Linear Classifiers
JMLR 2015
Adaptive Stochastic Alternating Direction Method of Multipliers
ICML 2015
Stochastic Optimization with Importance Sampling for Regularized Loss Minimization
ICML 2015
Matrix Factorization with Scale-Invariant Parameters
IJCAI 2015
Semi-supervised Convolutional Neural Networks for Text Categorization via Region Embedding
NIPS 2015
Local Smoothness in Variance Reduced Optimization
NIPS 2015
Quartz: Randomized Dual Coordinate Ascent with Arbitrary Sampling
NIPS 2015
Effective Use of Word Order for Text Categorization with Convolutional Neural Networks
NAACL 2015
Accelerated Proximal Stochastic Dual Coordinate Ascent for Regularized Loss Minimization
ICML 2014
Gradient Hard Thresholding Pursuit for Sparsity-Constrained Optimization
ICML 2014
Communication-Efficient Distributed Optimization using an Approximate Newton-type Method
ICML 2014
A Convergence Rate Analysis for LogitBoost, MART and Their Variant
ICML 2014
Compressed Counting Meets Compressed Sensing
COLT 2014
Truncated Power Method for Sparse Eigenvalue Problems
JMLR 2013
Accelerating Stochastic Gradient Descent using Predictive Variance Reduction
NIPS 2013
Stochastic Gradient Descent for Non-smooth Optimization: Convergence Results and Optimal Averaging Schemes
ICML 2013
Accelerated Mini-Batch Stochastic Dual Coordinate Ascent
NIPS 2013
Stochastic Dual Coordinate Ascent Methods for Regularized Loss Minimization
JMLR 2013
Random Design Analysis of Ridge Regression
COLT 2012
Selective Labeling via Error Bound Minimization
NIPS 2012
Learning with Structured Sparsity
JMLR 2011
Learning to Search Efficiently in High Dimensions
NIPS 2011
Greedy Model Averaging
NIPS 2011
Spectral Methods for Learning Multivariate Latent Tree Structure
NIPS 2011
Analysis of Multi-stage Convex Relaxation for Sparse Regularization
JMLR 2010
Deep Coding Network
NIPS 2010
Agnostic Active Learning Without Constraints
NIPS 2010
Sparse Online Learning via Truncated Gradient
JMLR 2009
On the Consistency of Feature Selection using Greedy Least Squares Regression
JMLR 2009
Nonlinear Learning using Local Coordinate Coding
NIPS 2009
Multi-Label Prediction via Compressed Sensing
NIPS 2009
Multi-stage Convex Relaxation for Learning with Sparse Regularization
NIPS 2008
Adaptive Forward-Backward Greedy Algorithm for Sparse Learning with Linear Models
NIPS 2008
Sparse Online Learning via Truncated Gradient
NIPS 2008
On the Effectiveness of Laplacian Normalization for Graph Semi-supervised Learning
JMLR 2007
The Epoch-Greedy Algorithm for Multi-armed Bandits with Side Information
NIPS 2007
A General Boosting Method and its Application to Learning Ranking Functions for Web Search
NIPS 2007
Learning on Graph with Laplacian Regularization
NIPS 2006
A Discriminative Global Training Algorithm for Statistical MT
ACL 2006
A Discriminative Global Training Algorithm for Statistical MT
COLING 2006
A Localized Prediction Model for Statistical Machine Translation
ACL 2005
A Framework for Learning Predictive Structures from Multiple Tasks and Unlabeled Data
JMLR 2005
A High-Performance Semi-Supervised Learning Method for Text Chunking
ACL 2005
Statistical Analysis of Some Multi-Category Large Margin Classification Methods
JMLR 2004
Greedy Algorithms for Classification -- Consistency, Convergence Rates, and Adaptivity
JMLR 2003
HowtogetaChineseName(Entity): Segmentation and Combination Issues
EMNLP 2003
Generalization Error Bounds for Bayesian Mixture Algorithms
JMLR 2003
A Robust Risk Minimization based Named Entity Recognition System
CONLL 2003
Named Entity Recognition through Classifier Combination
CONLL 2003
Updating an NLP system to fit new domains: an empirical study on the sentence segmentation problem
CONLL 2003
Text Chunking based on a Generalization of Winnow
JMLR 2002
Recommender Systems Using Linear Classifiers
JMLR 2002
Covering Number Bounds of Certain Regularized Linear Function Classes
JMLR 2002
Text Chunking using Regularized Winnow
ACL 2001