conftrace_

Tong Zhang

333 papers · 2001–2026 · 22 top CS/AI conferences

Achievements

🗺️ Taxonomy Completionist (62) 🧭 Keyword Pioneer 🌈 Renaissance Researcher (9) 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird

🧭 Keyword Pioneer 🌈 Renaissance Researcher (9) 🗺️ Taxonomy Completionist (62) 🌟 Keyword Trendsetter Combo (10) 🏠 Conference Loyalist (59) 🐺 Lone Wolf (6) 🔬 Deep Specialist (29) 🌱 Topic Pioneer 🏆 Grand Slam 👑 Triple Crown 🤝 Dynamic Duo (21) 🏆 Keyword Champion (3) 🔥 Unstoppable (25) ⚡ Prolific Year (25) 📈 Trend Setter 🗃️ Keyword Collector (316) 🚀 Conference Pioneer 💎 Century Club (326) ❓ The Questioner (2)

Conferences

NIPS (59) ICML (49) ACL (35) CVPR (31) JMLR (30) EMNLP (29) ICLR (20) AAAI (15) ECCV (15) COLT (9) NAACL (8) IJCAI (6) ICCV (6) CORL (5) COLING (5) IJCNLP (3) CONLL (3) EACL (1) MICCAI (1) AISTATS (1) NSDI (1) WACV (1)

Top co-authors

Rui Pan (22) Shizhe Diao (21) Renjie Pi (20) Jipeng Zhang (18) Hanze Dong (16) Sabine Süsstrunk (15) Wei Xiong (14) Mathieu Salzmann (14) Zhen Cui (14) Yong Lin (13)

Keywords

large language model (18) convex optimization (14) representation learning (13) stochastic optimization (12) regret bound (12) gradient descent (11) neural machine translation (10) stochastic gradient descent (10) greedy algorithm (10) function approximation (9) convolutional neural network (9) reinforcement learning (9) variance reduction (8) reinforcement learning from human feedback (8) graph neural network (8) domain adaptation (8) distributed learning (7) transfer learning (7) nonconvex optimization (7) semi-supervised learning (7)

Papers

Cross-Domain Few-Shot Learning via Multi-View Collaborative Optimization with Vision-Language Models AAAI 2026 Monte Carlo Diffusion for Generalizable Learning-Based RANSAC AAAI 2026 PsyPARSE: Retrieval-Augmented Slow Thinking for Personalized Empathetic Counseling AAAI 2026 Tackling Distractor Documents in Multi-Hop QA with Reinforcement and Curriculum Learning EACL 2026 GUIDE: Towards Scalable Advising for Research Ideas ACL 2026 SciFlow-Bench: Evaluating Structure-Aware Scientific Diagram Generation via Inverse Parsing ACL 2026 Contextual Relevance and Adaptive Sampling for LLM-Based Document Reranking ACL 2026 Building Math Agents with Multi-Turn Iterative Preference Learning ICLR 2025 Personalized Visual Instruction Tuning ICLR 2025 MA-LoT: Model-Collaboration Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem Proving ICML 2025 Optimal Sample Selection Through Uncertainty Estimation and Its Application in Deep Learning JMLR 2025 CANDY: Benchmarking LLMs’ Limitations and Assistive Potential in Chinese Misinformation Fact-Checking EMNLP 2025 ALRPHFS: Adversarially Learned Risk Patterns with Hierarchical Fast & Slow Reasoning for Robust Agent Defense EMNLP 2025 HuB: Learning Extreme Humanoid Balance CORL 2025 Catoni Contextual Bandits are Robust to Heavy-tailed Rewards ICML 2025 EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents ICML 2025 Let’s Reason Formally: Natural-Formal Hybrid Reasoning Enhances LLM’s Math Capability EMNLP 2025 TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data NAACL 2025 Pre-training CLIP against Data Poisoning with Optimal Transport-based Matching and Alignment EMNLP 2025 Demystifying Singular Defects in Large Language Models ICML 2025 Scene Graph-Grounded Image Generation AAAI 2025 FANS: Formal Answer Selection for LLM Natural Language Math Reasoning Using Lean4 EMNLP 2025 Self-Ensembling Gaussian Splatting for Few-Shot Novel 
View Synthesis ICCV 2025 Refining CLIP's Spatial Awareness: A Visual-Centric Perspective ICLR 2025 FDS: Frequency-Aware Denoising Score for Text-Guided Latent Diffusion Image Editing CVPR 2025 Scaling Mesh Generation via Compressive Tokenization CVPR 2025 Generating Multimodal Driving Scenes via Next-Scene Prediction CVPR 2025 Distribution Prototype Diffusion Learning for Open-set Supervised Anomaly Detection CVPR 2025 DFMU: Distribution-based Framework for Modeling Aleatoric Uncertainty in Multimodal Sentiment Analysis IJCAI 2025 Going Beyond Consistency: Target-oriented Multi-view Graph Neural Network IJCAI 2025 MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning EMNLP 2025 FPE2M2: Approaching Lossless and Efficient Quantization with Native Floating Point ACL 2025 Logarithmic Regret for Online KL-Regularized Reinforcement Learning ICML 2025 Bridge-Coder: Transferring Model Capabilities from High-Resource to Low-Resource Programming Language ACL 2025 ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting ACL 2025 A Parameter-Efficient and Fine-Grained Prompt Learning for Vision-Language Models ACL 2025 From Lists to Emojis: How Format Bias Affects Model Alignment ACL 2025 One QuantLLM for ALL: Fine-tuning Quantized LLMs Once for Efficient Deployments ACL 2025 AlignDistil: Token-Level Language Model Alignment as Adaptive Policy Distillation ACL 2025 Incongruity-aware Tension Field Network for Multi-modal Sarcasm Detection ACL 2025 TWIST: Text-encoder Weight-editing for Inserting Secret Trojans in Text-to-Image Models ACL 2025 SEOE: A Scalable and Reliable Semantic Evaluation Framework for Open Domain Event Detection ACL 2025 Understanding Overadaptation in Supervised Fine-Tuning: The Role of Ensemble Methods ICML 2025 ELABORATION: A Comprehensive Benchmark on Human-LLM Competitive Programming ACL 2025 MatchDiffusion: Training-free Generation of Match-Cuts ICCV 2025 TimeBooth: Disentangled Facial Invariant Representation 
for Diverse and Personalized Face Aging ICCV 2025 An Orthogonal High-Rank Adaptation for Large Language Models EMNLP 2025 AdaGrad under Anisotropic Smoothness ICLR 2025 Rotated Runtime Smooth: Training-Free Activation Smoother for accurate INT4 inference ICLR 2025 PivotMesh: Generic 3D Mesh Generation via Pivot Vertices Guidance ICLR 2025 Faster Sampling without Isoperimetry via Diffusion-based Monte Carlo COLT 2024 3D-Aware Hypothesis & Verification for Generalizable Relative Object Pose Estimation ICLR 2024 Reverse Diffusion Monte Carlo ICLR 2024 A unique M-pattern for micro-expression spotting in long videos ICLR 2024 Towards Robust Offline Reinforcement Learning under Diverse Data Corruption ICLR 2024 Accelerated Convergence of Stochastic Heavy Ball Method under Anisotropic Gradient Noise ICLR 2024 Spurious Feature Diversification Improves Out-of-distribution Generalization ICLR 2024 Mind Your Augmentation: The Key to Decoupling Dense Self-Supervised Learning ICLR 2024 Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithms NIPS 2024 AdanCA: Neural Cellular Automata As Adaptors For More Robust Vision Transformer NIPS 2024 LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning NIPS 2024 Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs NIPS 2024 Online Iterative Reinforcement Learning from Human Feedback with General Preference Model NIPS 2024 Reverse Transition Kernel: A Flexible Framework to Accelerate Diffusion Inference NIPS 2024 Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions NIPS 2024 A Sober Look at the Robustness of CLIPs to Spurious Features NIPS 2024 Reinforcement Learning with Foundation Priors: Let Embodied Agent Efficiently Learn on Its Own CORL 2024 General Flow as Foundation Affordance for Scalable Robot Learning CORL 2024 Leveraging Locality to Boost Sample 
Efficiency in Robotic Manipulation CORL 2024 CVTHead: One-Shot Controllable Head Avatar With Vertex-Feature Transformer WACV 2024 PipeNet: Question Answering with Semantic Pruning over Knowledge Graphs NAACL 2024 TagFog: Textual Anchor Guidance and Fake Outlier Generation for Visual Out-of-Distribution Detection AAAI 2024 Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning ICML 2024 Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption ICML 2024 Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-constraint ICML 2024 Faster Sampling via Stochastic Gradient Proximal Sampler ICML 2024 Towards Better Generalization in Open-Domain Question Answering by Mitigating Context Memorization NAACL 2024 Active Prompting with Chain-of-Thought for Large Language Models ACL 2024 Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards ACL 2024 Enhancing Dialogue State Tracking Models through LLM-backed User-Agents Simulation ACL 2024 CLAMBER: A Benchmark of Identifying and Clarifying Ambiguous Information Needs in Large Language Models ACL 2024 RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models ACL 2024 VeraCT Scan: Retrieval-Augmented Fake News Detection with Justifiable Reasoning ACL 2024 Plum: Prompt Learning using Metaheuristics ACL 2024 The Non-linear $F$-Design and Applications to Interactive Learning ICML 2024 Submodular-based In-context Example Selection for LLMs-based Machine Translation COLING 2024 LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models NAACL 2024 R-Tuning: Instructing Large Language Models to Say ‘I Don’t Know’ NAACL 2024 Multi-Scale Prompt Memory-Augmented Model for Black-Box Scenarios NAACL 2024 SiFT: A Serial Framework with Textual Guidance for Federated Learning MICCAI 2024 Desigen: A Pipeline for Controllable Design Template 
Generation CVPR 2024 Mitigating Object Dependencies: Improving Point Cloud Self-Supervised Learning through Object Exchange CVPR 2024 DVMNet: Computing Relative Pose for Unseen Objects Beyond Hypotheses CVPR 2024 PerceptionGPT: Effectively Fusing Visual Perception into LLM CVPR 2024 InNeRF360: Text-Guided 3D-Consistent Object Inpainting on 360-degree Neural Radiance Fields CVPR 2024 On the Impact of Hard Adversarial Instances on Overfitting in Adversarial Training JMLR 2024 SINDER: Repairing the Singular Defects of DINOv2 ECCV 2024 An Incremental Unified Framework for Small Defect Inspection ECCV 2024 Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization ECCV 2024 Data Augmentation via Latent Diffusion for Saliency Prediction ECCV 2024 PAPAL: A Provable PArticle-based Primal-Dual ALgorithm for Mixed Nash Equilibrium JMLR 2024 Fast Rates in Pool-Based Batch Active Learning JMLR 2024 Strength Lies in Differences! Improving Strategy Planning for Non-collaborative Dialogues via Diversified User Simulation EMNLP 2024 Mitigating the Alignment Tax of RLHF EMNLP 2024 TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts EMNLP 2024 MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance EMNLP 2024 The Instinctive Bias: Spurious Images lead to Illusion in MLLMs EMNLP 2024 TensorOpera Router: A Multi-Model Router for Efficient LLM Inference EMNLP 2024 Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts EMNLP 2024 On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference Optimization EMNLP 2024 Learn and Sample Together: Collaborative Generation for Graphic Design Layout IJCAI 2023 Learning in POMDPs is Sample-Efficient with Hindsight Observability ICML 2023 TempSAL - Uncovering Temporal Information for Deep Saliency Prediction CVPR 2023 Spatiotemporal Self-Supervised Learning for Point Clouds in the Wild CVPR 2023 VolRecon: Volume Rendering of 
Signed Ray Distance Functions for Generalizable Multi-View Reconstruction CVPR 2023 DyNCA: Real-Time Dynamic Texture Synthesis Using Neural Cellular Automata CVPR 2023 Doolittle: Benchmarks and Corpora for Academic Writing Formalization EMNLP 2023 NEMTO: Neural Environment Matting for Novel View and Relighting Synthesis of Transparent Objects ICCV 2023 Beyond Uniform Lipschitz Condition in Differentially Private Optimization ICML 2023 On the Convergence of Federated Averaging with Cyclic Client Participation ICML 2023 What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL? ICML 2023 Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game ICLR 2023 Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to Pre-trained Language Models’ Memories ACL 2023 A Theoretical Analysis of Optimistic Proximal Policy Optimization in Linear Markov Decision Processes NIPS 2023 A Universal Semantic-Geometric Representation for Robotic Manipulation CORL 2023 Generalized Polyak Step Size for First Order Optimization with Momentum ICML 2023 Catalyst Acceleration of Error Compensated Methods Leads to Better Communication Complexity AISTATS 2023 Deep Graph Structural Infomax AAAI 2023 Covariate-Shift Generalization via Random Sample Weighting AAAI 2023 Particle-based Variational Inference with Preconditioned Functional Gradient Flow ICLR 2023 Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes ICML 2023 Inconsistency, Instability, and Generalization Gap of Deep Neural Network Training NIPS 2023 Multi-Consensus Decentralized Accelerated Gradient Descent JMLR 2023 Posterior Sampling for Competitive RL: Function Approximation and Partial Observation NIPS 2023 Corruption-Robust Offline Reinforcement Learning with General Function Approximation NIPS 2023 Double Pessimism is Provably Efficient for Distributionally Robust 
Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage NIPS 2023 Double Randomized Underdamped Langevin with Dimension-Independent Convergence Guarantee NIPS 2023 Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data EMNLP 2023 Towards Effective Automatic Debt Collection with Persona Awareness EMNLP 2023 DetGPT: Detect What You Need via Reasoning EMNLP 2023 VO$Q$L: Towards Optimal Regret in Model-free RL with Nonlinear Function Approximation COLT 2023 Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency COLT 2023 Weakly Supervised Disentangled Generative Causal Representation Learning JMLR 2022 Multilingual Word Sense Disambiguation with Unified Sense Representation COLING 2022 Speeding up Transformer Decoding via an Attention Refinement Network COLING 2022 Minimax Regret Optimization for Robust Machine Learning under Distribution Shift COLT 2022 Non-Linear Reinforcement Learning in Large Action Spaces: Structural Conditions and Sample-efficiency of Posterior Sampling COLT 2022 Probabilistic Bilevel Coreset Selection ICML 2022 Sparse Invariant Risk Minimization ICML 2022 HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning ICLR 2022 Eigencurve: Optimal Learning Rate Schedule for SGD on Quadratic Objectives with Skewed Hessian Spectrums ICLR 2022 Exploiting Hybrid Semantics of Relation Paths for Multi-hop Question Answering over Knowledge Graphs COLING 2022 Bayesian Invariant Risk Minimization CVPR 2022 Exploring Geometric Consistency for Monocular 3D Object Detection CVPR 2022 MulT: An End-to-End Multitask Learning Transformer CVPR 2022 Leverage Your Local and Global Representations: A New Self-Supervised Learning Strategy CVPR 2022 Increasing Visual Awareness in Multimodal Neural Machine Translation from an Information Theoretic Perspective EMNLP 2022 MICO: A Multi-alternative Contrastive Learning Framework for Commonsense 
Knowledge Representation EMNLP 2022 History-Aware Hierarchical Transformer for Multi-session Open-domain Dialogue System EMNLP 2022 Model Agnostic Sample Reweighting for Out-of-Distribution Learning ICML 2022 Frequency-Aware Contrastive Learning for Neural Machine Translation AAAI 2022 Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets ICML 2022 A Theoretical Analysis on Independence-driven Importance Weighting for Covariate-shift Generalization ICML 2022 A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games ICML 2022 Benefits of Overparameterized Convolutional Residual Networks: Function Approximation under Smoothness Constraint ICML 2022 Achieving Minimax Rates in Pool-Based Batch Active Learning ICML 2022 When is the Convergence Time of Langevin Algorithms Dimension Independent? A Composite Optimization Viewpoint JMLR 2022 Rare and Zero-shot Word Sense Disambiguation using Z-Reweighting ACL 2022 Toward Knowledge-Enriched Conversational Recommendation Systems ACL 2022 Semi-Supervised Monocular 3D Object Detection by Multi-View Consistency ECCV 2022 RC-MVSNet: Unsupervised Multi-View Stereo with Neural Rendering ECCV 2022 Model-based RL with Optimistic Posterior Sampling: Structural Conditions and Sample Complexity NIPS 2022 Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial Corruptions NIPS 2022 TILGAN: Transformer-based Implicit Latent GAN for Diverse and Coherent Text Generation ACL 2021 Error Compensated Distributed SGD Can Be Accelerated NIPS 2021 Efficient Neural Network Training via Forward and Backward Propagation Sparsification NIPS 2021 Point, Disambiguate and Copy: Incorporating Bilingual Dictionaries for Neural Machine Translation IJCNLP 2021 Taming Pre-trained Language Models with N-gram Representations for Low-Resource Domain Adaptation IJCNLP 2021 G-DetKD: Towards General Distillation Framework for Object Detectors via Contrastive and Semantic-Guided Feature Imitation 
ICCV 2021 A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning NIPS 2021 Graph Deformer Network IJCAI 2021 Joint-DetNAS: Upgrade Your Detector With NAS, Pruning and Dynamic Distillation CVPR 2021 Effective Sparsification of Neural Networks With Global Sparsity Constraint CVPR 2021 Uncertainty-Aware Joint Salient Object and Camouflaged Object Detection CVPR 2021 Few-Shot Human Motion Transfer by Personalized Geometry and Texture Modeling CVPR 2021 Involution: Inverting the Inherence of Convolution for Visual Recognition CVPR 2021 Deep Wasserstein Graph Discriminant Learning for Graph Classification AAAI 2021 Graph Game Embedding AAAI 2021 DeEPCA: Decentralized Exact PCA with Linear Convergence Rate JMLR 2021 TILGAN: Transformer-based Implicit Latent GAN for Diverse and Coherent Text Generation IJCNLP 2021 Reinforced Attention for Few-Shot Learning and Beyond CVPR 2021 TransNAS-Bench-101: Improving Transferability and Generalizability of Cross-Task Neural Architecture Search CVPR 2021 Multi-Hop Transformer for Document-Level Machine Translation NAACL 2021 Modeling from Features: a Mean-field Framework for Over-parameterized Deep Neural Networks COLT 2021 Wasserstein Coupled Graph Learning for Cross-Modal Retrieval ICCV 2021 Taming Pre-trained Language Models with N-gram Representations for Low-Resource Domain Adaptation ACL 2021 Point, Disambiguate and Copy: Incorporating Bilingual Dictionaries for Neural Machine Translation ACL 2021 Improving Chinese Word Segmentation with Wordhood Memory Networks ACL 2020 Bridging the Gap between Sample-based and One-shot Neural Architecture Search with BONAS NIPS 2020 How to Characterize The Landscape of Overparameterized Convolutional Neural Networks NIPS 2020 Residual Distillation: Towards Portable Deep Neural Networks without Shortcuts NIPS 2020 A Generalized Neural Tangent Kernel Analysis for Two-layer Neural Networks NIPS 2020 Decentralized Accelerated Proximal Gradient Descent NIPS 
2020 Model Rubik’s Cube: Twisting Resolution, Depth and Width for TinyNets NIPS 2020 Stochastic Recursive Gradient Descent Ascent for Stochastic Nonconvex-Strongly-Concave Minimax Problems NIPS 2020 Variational Pathway Reasoning for EEG Emotion Recognition AAAI 2020 Stable Learning via Sample Reweighting AAAI 2020 Optimal Feature Transport for Cross-View Image Geo-Localization AAAI 2020 Joint Chinese Word Segmentation and Part-of-speech Tagging via Two-way Attentions of Auto-analyzed Knowledge ACL 2020 Pattern-Structure Diffusion for Multi-Task Learning CVPR 2020 Cross-Modal Pattern-Propagation for RGB-T Tracking CVPR 2020 UC-Net: Uncertainty Inspired RGB-D Saliency Detection via Conditional Variational Autoencoders CVPR 2020 Synthetic Learning: Learn From Distributed Asynchronized Discriminator GAN Without Sharing Medical Image Data CVPR 2020 MiLeNAS: Efficient Neural Architecture Search via Mixed-Level Reformulation CVPR 2020 CATCH: Context-based Meta Reinforcement Learning for Transferrable Architecture Search ECCV 2020 Graph Wasserstein Correlation Analysis for Movie Retrieval ECCV 2020 Improving Constituency Parsing with Span Attention EMNLP 2020 ZEN: Pre-training Chinese Text Encoder Enhanced by N-gram Representations EMNLP 2020 Black-Box Adversarial Attack with Transferable Model-based Embedding ICLR 2020 Graph inference learning for semi-supervised classification ICLR 2020 Guided Learning of Nonconvex Models through Successive Functional Gradient Optimization ICML 2020 Re-architecting Congestion Management in Lossless Ethernet NSDI 2020 Neural Collaborative Subspace Clustering ICML 2019 DHER: Hindsight Experience Replay for Dynamic Goals ICLR 2019 Layer-Wise Learning Strategy for Nonparametric Tensor Product Smoothing Spline Regression and Graphical Models JMLR 2019 Robust Frequent Directions with Application in Online Learning JMLR 2019 Picasso: A Sparse Learning Library for High Dimensional Data Analysis in R and Python JMLR 2019 Utilizing Second Order 
Information in Minibatch Stochastic Variance Reduced Proximal Iterations JMLR 2019 Divergence-Augmented Policy Optimization NIPS 2019 Dynamic Layer Aggregation for Neural Machine Translation with Routing-by-Agreement AAAI 2019 Neural Machine Translation with Adequacy-Oriented Learning AAAI 2019 Grid-Wise Control for Multi-Agent Reinforcement Learning in Video Game AI ICML 2019 NATTACK: Learning the Distributions of Adversarial Examples for an Improved Black-Box Attack on Deep Neural Networks ICML 2019 DoubleSqueeze: Parallel Stochastic Gradient Descent with Double-pass Error-Compensated Compression ICML 2019 Reinforced Training Data Selection for Domain Adaptation ACL 2019 Sharp Analysis for Nonconvex SGD Escaping from Saddle Points COLT 2019 Efficient Decision-Based Black-Box Adversarial Attacks on Face Recognition CVPR 2019 Exploiting Deep Representations for Neural Machine Translation EMNLP 2018 QuaSE: Sequence Editing under Quantifiable Guidance EMNLP 2018 Multi-Head Attention with Disagreement Regularization EMNLP 2018 Super-Identity Convolutional Neural Network for Face Hallucination ECCV 2018 Candidates vs. 
Noises Estimation for Large Multi-Class Classification Problem ICML 2018 Composite Functional Gradient Learning of Generative Adversarial Models ICML 2018 End-to-end Active Object Tracking via Reinforcement Learning ICML 2018 An Algorithmic Framework of Variable Metric Over-Relaxed Hybrid Proximal Extra-Gradient Method ICML 2018 Graphical Nonconvex Optimization via an Adaptive Convex Relaxation ICML 2018 Error Compensated Quantized SGD and its Applications to Large-scale Distributed Optimization ICML 2018 Safe Element Screening for Submodular Function Minimization ICML 2018 Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents ICML 2018 Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial Networks ECCV 2018 Video Re-localization ECCV 2018 Neural Stereoscopic Image Style Transfer ECCV 2018 Orthogonal Deep Features Decomposition for Age-Invariant Face Recognition ECCV 2018 Recurrent Fusion Network for Image captioning ECCV 2018 Modeling Varying Camera-IMU Time Offset in Optimization-Based Visual-Inertial Odometry ECCV 2018 Deep Unsupervised Saliency Detection: A Multiple Noisy Labeling Perspective CVPR 2018 A Novel Neural Network Model based on Cerebral Hemispheric Asymmetry for EEG Emotion Recognition IJCAI 2018 Stochastic Expectation Maximization with Variance Reduction NIPS 2018 Exponentially Weighted Imitation Learning for Batched Historical Data NIPS 2018 Communication Compression for Decentralized Training NIPS 2018 Gradient Sparsification for Communication-Efficient Distributed Optimization NIPS 2018 SPIDER: Near-Optimal Non-Convex Optimization via Stochastic Path-Integrated Differential Estimator NIPS 2018 Stochastic Primal-Dual Method for Empirical Risk Minimization with O(1) Per-Iteration Complexity NIPS 2018 Adaptive Sampling Towards Fast Graph Representation Learning NIPS 2018 Gradient Hard Thresholding Pursuit JMLR 2018 Modeling Localness for Self-Attention Networks EMNLP 2018 Efficient Optimization for 
Linear Dynamical Systems with Applications to Clustering and Sparse Coding NIPS 2017 Deep Subspace Clustering Networks NIPS 2017 A General Distributed Dual Coordinate Optimization Framework for Regularized Loss Minimization JMLR 2017 Efficient Distributed Learning with Sparsity ICML 2017 Projection-free Distributed Online Learning in Networks ICML 2017 Deep Pyramid Convolutional Neural Networks for Text Categorization ACL 2017 Diffusion Approximations for Online Principal Component Estimation and Global Convergence NIPS 2017 On Quadratic Convergence of DC Proximal Newton Algorithm in Nonconvex Sparse Learning NIPS 2017 Towards More Efficient SPSD Matrix Approximation and CUR Matrix Decomposition JMLR 2016 Sparse Nonlinear Regression: Parameter Estimation under Nonconvexity ICML 2016 Learning Additive Exponential Family Graphical Models via $\ell_{2,1}$-norm Regularized M-Estimation NIPS 2016 Exact Recovery of Hard Thresholding Pursuit NIPS 2016 Supervised and Semi-Supervised Text Categorization using LSTM for Region Embeddings ICML 2016 Learning Sparse Low-Threshold Linear Classifiers JMLR 2015 Adaptive Stochastic Alternating Direction Method of Multipliers ICML 2015 Stochastic Optimization with Importance Sampling for Regularized Loss Minimization ICML 2015 Matrix Factorization with Scale-Invariant Parameters IJCAI 2015 Semi-supervised Convolutional Neural Networks for Text Categorization via Region Embedding NIPS 2015 Local Smoothness in Variance Reduced Optimization NIPS 2015 Quartz: Randomized Dual Coordinate Ascent with Arbitrary Sampling NIPS 2015 Effective Use of Word Order for Text Categorization with Convolutional Neural Networks NAACL 2015 Accelerated Proximal Stochastic Dual Coordinate Ascent for Regularized Loss Minimization ICML 2014 Gradient Hard Thresholding Pursuit for Sparsity-Constrained Optimization ICML 2014 Communication-Efficient Distributed Optimization using an Approximate Newton-type Method ICML 2014 A Convergence Rate Analysis for 
LogitBoost, MART and Their Variant ICML 2014 Compressed Counting Meets Compressed Sensing COLT 2014 Truncated Power Method for Sparse Eigenvalue Problems JMLR 2013 Accelerating Stochastic Gradient Descent using Predictive Variance Reduction NIPS 2013 Stochastic Gradient Descent for Non-smooth Optimization: Convergence Results and Optimal Averaging Schemes ICML 2013 Accelerated Mini-Batch Stochastic Dual Coordinate Ascent NIPS 2013 Stochastic Dual Coordinate Ascent Methods for Regularized Loss Minimization JMLR 2013 Random Design Analysis of Ridge Regression COLT 2012 Selective Labeling via Error Bound Minimization NIPS 2012 Learning with Structured Sparsity JMLR 2011 Learning to Search Efficiently in High Dimensions NIPS 2011 Greedy Model Averaging NIPS 2011 Spectral Methods for Learning Multivariate Latent Tree Structure NIPS 2011 Analysis of Multi-stage Convex Relaxation for Sparse Regularization JMLR 2010 Deep Coding Network NIPS 2010 Agnostic Active Learning Without Constraints NIPS 2010 Sparse Online Learning via Truncated Gradient JMLR 2009 On the Consistency of Feature Selection using Greedy Least Squares Regression JMLR 2009 Nonlinear Learning using Local Coordinate Coding NIPS 2009 Multi-Label Prediction via Compressed Sensing NIPS 2009 Multi-stage Convex Relaxation for Learning with Sparse Regularization NIPS 2008 Adaptive Forward-Backward Greedy Algorithm for Sparse Learning with Linear Models NIPS 2008 Sparse Online Learning via Truncated Gradient NIPS 2008 On the Effectiveness of Laplacian Normalization for Graph Semi-supervised Learning JMLR 2007 The Epoch-Greedy Algorithm for Multi-armed Bandits with Side Information NIPS 2007 A General Boosting Method and its Application to Learning Ranking Functions for Web Search NIPS 2007 Learning on Graph with Laplacian Regularization NIPS 2006 A Discriminative Global Training Algorithm for Statistical MT ACL 2006 A Discriminative Global Training Algorithm for Statistical MT COLING 2006 A Localized Prediction 
Model for Statistical Machine Translation ACL 2005 A Framework for Learning Predictive Structures from Multiple Tasks and Unlabeled Data JMLR 2005 A High-Performance Semi-Supervised Learning Method for Text Chunking ACL 2005 Statistical Analysis of Some Multi-Category Large Margin Classification Methods JMLR 2004 Greedy Algorithms for Classification -- Consistency, Convergence Rates, and Adaptivity JMLR 2003 How to get a Chinese Name (Entity): Segmentation and Combination Issues EMNLP 2003 Generalization Error Bounds for Bayesian Mixture Algorithms JMLR 2003 A Robust Risk Minimization based Named Entity Recognition System CONLL 2003 Named Entity Recognition through Classifier Combination CONLL 2003 Updating an NLP system to fit new domains: an empirical study on the sentence segmentation problem CONLL 2003 Text Chunking based on a Generalization of Winnow JMLR 2002 Recommender Systems Using Linear Classifiers JMLR 2002 Covering Number Bounds of Certain Regularized Linear Function Classes JMLR 2002 Text Chunking using Regularized Winnow ACL 2001