conftrace_

Yi Yang

401 papers · 2011–2026 · 19 top CS/AI conferences

Achievements

🗺️ Taxonomy Completionist (31) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (7) 🐣 Hot Topic Early Bird

🏃 Academic Marathon (15) 🏠 Conference Loyalist (31) 🌟 Keyword Trendsetter Combo (6) 🤝 Dynamic Duo (53) 👑 Triple Crown 🧬 Topic Evolution 🏆 Keyword Champion (3) 🏆 Grand Slam 👥 Mega-Team (24) 🌱 Topic Pioneer 🔬 Deep Specialist (46) 🚀 Conference Pioneer 🔥 Unstoppable (14) ❓ The Questioner (5) 💎 Century Club (387) 🗃️ Keyword Collector (76) ⚡ Prolific Year (35) 📈 Trend Setter

Conferences

CVPR (103) ICCV (65) AAAI (38) ACL (32) NIPS (31) EMNLP (29) ECCV (27) IJCAI (21) ICLR (18) NAACL (11) ICML (8) IJCNLP (6) WACV (3) INTERSPEECH (2) JMLR (2) COLING (2) EACL (1) AISTATS (1) ACML (1)

Top co-authors

Linchao Zhu (53) Wenguan Wang (37) Zongxin Yang (28) Hehe Fan (27) Fan Ma (21) Xiaohan Wang (20) Yifan Sun (20) Yunchao Wei (16) Ruijie Quan (15) Xuanyi Dong (14)

Research topics

Privacy (3) Robotics (2) Differential Privacy (1) Core AI (1)

Keywords

semantic segmentation (25) domain adaptation (25) video understanding (24) large language model (22) convolutional neural network (22) representation learning (21) diffusion model (16) zero-shot learning (15) person re-identification (15) contrastive learning (15) object detection (12) multimodal learning (12) few-shot learning (12) attention mechanism (12) action recognition (12) unsupervised learning (10) self-supervised learning (10) text classification (9) knowledge distillation (9) adversarial learning (9)

Papers

Oscillation Inversion: Training-Free Image and Video Enhancement Through Oscillated Latents in Large Flow Models AAAI 2026
Bayes-Optimal Fair Classification with Multiple Sensitive Features AAAI 2026
Beyond Chunking: Discourse-Aware Hierarchical Retrieval for Long Document Question Answering ACL 2026
LBLLM: Lightweight Binarization of Large Language Models via Three-Stage Distillation ACL 2026
One Refiner to Unlock Them All: Inference-Time Reasoning Elicitation via Reinforcement Query Refinement ACL 2026
OMIBench: Benchmarking Olympiad-Level Multi-Image Reasoning in Large Vision-Language Models ACL 2026
HiMo: High-Speed Objects Motion Compensation in Point Clouds (Abstract Reprint) AAAI 2026
MathFlow: Enhancing the Perceptual Flow of MLLMs for Visual Mathematical Problems ACL 2026
KV-Embedding: Training-free Text Embedding via Internal KV Re-routing in Decoder-only LLMs ACL 2026
FlowMorph: Revealing an Optimizable Flow Latent Space for Controlled Image Morphing WACV 2026
Revealing the Numeracy Gap: An Empirical Investigation of Text Embedding Models EACL 2026
Breaking the Modality Barrier: Generative Modeling for Accurate Molecule Retrieval from Mass Spectra AAAI 2026
Melodia: Training-Free Music Editing Guided by Attention Probing in Diffusion Models AAAI 2026
Insert Anything: Image Insertion via In-Context Editing in DiT AAAI 2026
DLVINet: Advancing Dual-Lens Video Inpainting Beyond Parallax Constraints AAAI 2026
Evaluating and Aligning Human Economic Risk Preferences in LLMs EMNLP 2025
DiffVsgg: Diffusion-Driven Online Video Scene Graph Generation CVPR 2025
ReFu: Recursive Fusion for Exemplar-Free 3D Class-Incremental Learning WACV 2025
Bias A-head? Analyzing Bias in Transformer-Based Language Model Attention Heads NAACL 2025
Representation Learning with Mutual Influence of Modalities for Node Classification in Multi-Modal Heterogeneous Networks IJCAI 2025
Drafting and Revision: Advancing High-Fidelity Video Inpainting IJCAI 2025
ZeroMamba: Exploring Visual State Space Model for Zero-Shot Learning AAAI 2025
Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models AAAI 2025
Autonomous LLM-Enhanced Adversarial Attack for Text-to-Motion AAAI 2025
BrainGuard: Privacy-Preserving Multisubject Image Reconstructions from Brain Activities AAAI 2025
LLM Agents Can Be Choice-Supportive Biased Evaluators: An Empirical Study AAAI 2025
Prompt-Aware Controllable Shadow Removal IJCAI 2025
DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization ICML 2025
Holistic Physics Solver: Learning PDEs in a Unified Spectral-Physical Space ICML 2025
Origin Identification for Text-Guided Image-to-Image Diffusion Models ICML 2025
Reaction Graph: Towards Reaction-Level Modeling for Chemical Reactions with 3D Structures ICML 2025
Learning without Isolation: Pathway Protection for Continual Learning ICML 2025
Know the Unknown: An Uncertainty-Sensitive Method for LLM Instruction Tuning ACL 2025
Adapting General-Purpose Embedding Models to Private Datasets Using Keyword-based Retrieval ACL 2025
Achieving binary weight and activation for LLMs using Post-Training Quantization ACL 2025
Sparse Rewards Can Self-Train Dialogue Agents ACL 2025
Bridging the LLM Accessibility Divide? Performance, Fairness, and Cost of Closed versus Open LLMs for Automated Essay Scoring ACL 2025
PersonaTwin: A Multi-Tier Prompt Conditioning Framework for Generating and Evaluating Personalized Digital Twins ACL 2025
Long-horizon Visual Instruction Generation with Logic and Attribute Self-reflection ICLR 2025
Transformer-based Speech Model Learns Well as Infants and Encodes Abstractions through Exemplars in the Poverty of the Stimulus Environment COLING 2025
VideoGrain: Modulating Space-Time Attention for Multi-Grained Video Editing ICLR 2025
TDDBench: A Benchmark for Training data detection ICLR 2025
Hydra-SGG: Hybrid Relation Assignment for One-stage Scene Graph Generation ICLR 2025
OSDA Agent: Leveraging Large Language Models for De Novo Design of Organic Structure Directing Agents ICLR 2025
3DIS: Depth-Driven Decoupled Image Synthesis for Universal Multi-Instance Generation ICLR 2025
Adversarial Mixup Unlearning ICLR 2025
Underwater Visual SLAM with Depth Uncertainty and Medium Modeling ICCV 2025
Towards Human-like Virtual Beings: Simulating Human Behavior in 3D Scenes ICCV 2025
DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models ICCV 2025
NeRF Is a Valuable Assistant for 3D Gaussian Splatting ICCV 2025
From Trial to Triumph: Advancing Long Video Understanding via Visual Context Sample Scaling and Self-reward Alignment ICCV 2025
BVINet: Unlocking Blind Video Inpainting with Zero Annotations ICCV 2025
UniGlyph: Unified Segmentation-Conditioned Diffusion for Precise Visual Text Synthesis ICCV 2025
TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation ICCV 2025
Hierarchical Event Memory for Accurate and Low-latency Online Video Temporal Grounding ICCV 2025
MaGS: Reconstructing and Simulating Dynamic 3D Objects with Mesh-adsorbed Gaussian Splatting ICCV 2025
MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh ICCV 2025
Dual Reciprocal Learning of Language-based Human Motion Understanding and Generation ICCV 2025
TAPNext: Tracking Any Point (TAP) as Next Token Prediction ICCV 2025
From Image to Video: An Empirical Study of Diffusion Representations ICCV 2025
Gaussian-based World Model: Gaussian Priors for Voxel-Based Occupancy Prediction and Future Motion Prediction ICCV 2025
R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization ICCV 2025
MC-Bench: A Benchmark for Multi-Context Visual Grounding in the Era of MLLMs ICCV 2025
DecoupledESC: Enhancing Emotional Support Generation via Strategy-Response Decoupled Preference Optimization EMNLP 2025
MASTER: Multi-Agent Security Through Exploration of Roles and Topological Structures - A Comprehensive Framework EMNLP 2025
Dropping Experts, Recombining Neurons: Retraining-Free Pruning for Sparse Mixture-of-Experts LLMs EMNLP 2025
Video2Roleplay: A Multimodal Dataset and Framework for Video-Guided Role-playing Agents EMNLP 2025
SPARK: Simulating the Co-evolution of Stance and Topic Dynamics in Online Discourse with LLM-based Agents EMNLP 2025
Identifying Pre-training Data in LLMs: A Neuron Activation-Based Detection Framework EMNLP 2025
Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy CVPR 2025
FinMTEB: Finance Massive Text Embedding Benchmark EMNLP 2025
Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head Generation CVPR 2025
Adapting Text-to-Image Generation with Feature Difference Instruction for Generic Image Restoration CVPR 2025
SKDream: Controllable Multi-view and 3D Generation with Arbitrary Skeletons CVPR 2025
GraphMimic: Graph-to-Graphs Generative Modeling from Videos for Policy Learning CVPR 2025
DroneSplat: 3D Gaussian Splatting for Robust 3D Reconstruction from In-the-Wild Drone Imagery CVPR 2025
EnergyMoGen: Compositional Human Motion Generation with Energy-Based Diffusion Model in Latent Space CVPR 2025
Scene Map-based Prompt Tuning for Navigation Instruction Generation CVPR 2025
EconNLI: Evaluating Large Language Models on Economics Reasoning ACL 2024
Enhancing Hallucination Detection through Perturbation-Based Synthetic Data Generation in System Responses ACL 2024
VillagerAgent: A Graph-Based Multi-Agent Framework for Coordinating Complex Task Dependencies in Minecraft ACL 2024
FragRel: Exploiting Fragment-level Relations in the External Memory of Large Language Models ACL 2024
Exploring the Relationship between In-Context Learning and Instruction Tuning EMNLP 2024
Neural Clustering based Visual Representation Learning CVPR 2024
DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models (Exemplified as A Video Agent) ICML 2024
Improving Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning ICML 2024
TOPA: Extending Large Language Models for Video Understanding via Text-Only Pre-Alignment NIPS 2024
Image Copy Detection for Diffusion Models NIPS 2024
Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models ICLR 2024
Deep SE(3)-Equivariant Geometric Reasoning for Precise Placement Tasks ICLR 2024
Clustering for Protein Representation Learning CVPR 2024
LSK3DNet: Towards Effective and Efficient 3D Perception with Large Sparse Kernels CVPR 2024
CapHuman: Capture Your Moments in Parallel Universes CVPR 2024
Entangled View-Epipolar Information Aggregation for Generalizable Neural Radiance Fields CVPR 2024
MS-DETR: Efficient DETR Training with Mixed Supervision CVPR 2024
Volumetric Environment Representation for Vision-Language Navigation CVPR 2024
VISTA-LLAMA: Reducing Hallucination in Video Language Models via Equal Distance to Visual Tokens CVPR 2024
Epipolar-Free 3D Gaussian Splatting for Generalizable Novel View Synthesis NIPS 2024
SeFlow: A Self-Supervised Scene Flow Method in Autonomous Driving ECCV 2024
General and Task-Oriented Video Segmentation ECCV 2024
Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion ECCV 2024
Nonverbal Interaction Detection ECCV 2024
Navigation Instruction Generation with BEV Perception and Large Language Models ECCV 2024
VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models NIPS 2024
VLMimic: Vision Language Models are Visual Imitation Learner for Fine-grained Actions NIPS 2024
DRIP: Unleashing Diffusion Priors for Joint Foreground and Alpha Prediction in Image Matting NIPS 2024
TAPVid-3D: A Benchmark for Tracking Any Point in 3D NIPS 2024
Vision-Language Navigation with Energy-Based Policy NIPS 2024
Moving Off-the-Grid: Scene-Grounded Video Representations NIPS 2024
FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention NIPS 2024
DataStealing: Steal Data from Diffusion Models in Federated Learning with Multiple Trojans NIPS 2024
Human-Object Interaction Detection Collaborated with Large Relation-driven Diffusion Models NIPS 2024
Controllable Navigation Instruction Generation with Chain of Thought Prompting ECCV 2024
HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting ECCV 2024
Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts ECCV 2024
Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data ECCV 2024
Depth-Aware Blind Image Decomposition for Real-World Adverse Weather Recovery ECCV 2024
VividDreamer: Invariant Score Distillation for Hyper-Realistic Text-to-3D Generation ECCV 2024
Beyond Surface Similarity: Detecting Subtle Semantic Shifts in Financial Narratives NAACL 2024
Connecting the Dots: Inferring Patent Phrase Similarity with Retrieved Phrase Graphs NAACL 2024
Interpretable3D: An Ad-Hoc Interpretable Classifier for 3D Point Clouds AAAI 2024
Stitching Segments and Sentences towards Generalization in Video-Text Pre-training AAAI 2024
DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval AAAI 2024
Improving Bird's Eye View Semantic Segmentation by Task Decomposition CVPR 2024
Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval CVPR 2024
Learning from One Continuous Video Stream CVPR 2024
MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis CVPR 2024
Revealing the Two Sides of Data Augmentation: An Asymmetric Distillation-based Win-Win Solution for Open-Set Recognition IJCAI 2024
SIFU: Side-view Conditioned Implicit Function for Real-world Usable Clothed Human Reconstruction CVPR 2024
Clustering Propagation for Universal Medical Image Segmentation CVPR 2024
Psychometry: An Omnifit Model for Image Reconstruction from Human Brain Activity CVPR 2024
Automated Tone Transcription and Clustering with Tone2Vec EMNLP 2024
CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent Layers ACL 2024
MS2SL: Multimodal Spoken Data-Driven Continuous Sign Language Production ACL 2024
JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery ICCV 2023
Learning Symmetry-Aware Geometry Correspondences for 6D Object Pose Estimation ICCV 2023
Fast and Accurate Factual Inconsistency Detection Over Long Documents EMNLP 2023
FinEntity: Entity-level Sentiment Classification for Financial Texts EMNLP 2023
One Is All: Bridging the Gap between Neural Radiance Fields Architectures with Progressive Volume Distillation AAAI 2023
Semi-attention Partition for Occluded Person Re-identification AAAI 2023
Stroke Extraction of Chinese Character Based on Deep Structure Deformable Image Registration AAAI 2023
A Benchmark and Asymmetrical-Similarity Learning for Practical Image Copy Detection AAAI 2023
Global-correlated 3D-decoupling Transformer for Clothed Avatar Reconstruction NIPS 2023
Neural-Logic Human-Object Interaction Detection NIPS 2023
Analogical Inference Enhanced Knowledge Graph Embedding AAAI 2023
TransHP: Image Classification with Hierarchical Prompting NIPS 2023
PointGPT: Auto-regressively Generative Pre-training from Point Clouds NIPS 2023
Exploring Hypergraph of Earnings Call for Risk Prediction (Student Abstract) AAAI 2023
Debiasing Intrinsic Bias and Application Bias Jointly via Invariant Risk Minimization (Student Abstract) AAAI 2023
Perception Test: A Diagnostic Benchmark for Multimodal Video Models NIPS 2023
Hyperbolic Space with Hierarchical Margin Boosts Fine-Grained Learning from Coarse Labels NIPS 2023
DAC-DETR: Divide the Attention Layers and Conquer NIPS 2023
Pyramid Diffusion Models for Low-light Image Enhancement IJCAI 2023
Video Object Segmentation in Panoptic Wild Scenes IJCAI 2023
Bidirectional Cross-Modal Knowledge Exploration for Video Recognition With Pre-Trained Vision-Language Models CVPR 2023
Global-to-Local Modeling for Video-Based 3D Human Pose and Shape Estimation CVPR 2023
FedSeg: Class-Heterogeneous Federated Learning for Semantic Segmentation CVPR 2023
Text Augmented Spatial Aware Zero-shot Referring Image Segmentation EMNLP 2023
Logic-induced Diagnostic Reasoning for Semi-supervised Semantic Segmentation ICCV 2023
Continuous-Discrete Convolution for Geometry-Sequence Modeling in Proteins ICLR 2023
Suppressing the Heterogeneity: A Strong Feature Extractor for Few-shot Segmentation ICLR 2023
Decompose to Generalize: Species-Generalized Animal Pose Estimation ICLR 2023
DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training ICLR 2023
Efficient Multimodal Fusion via Interactive Prompting CVPR 2023
PointListNet: Deep Learning on 3D Point Lists CVPR 2023
LANA: A Language-Capable Navigator for Instruction Following and Generation CVPR 2023
Joint Video Multi-Frame Interpolation and Deblurring Under Unknown Exposure Time CVPR 2023
Context-Aware Pretraining for Efficient Blind Image Decomposition CVPR 2023
MIST: Multi-Modal Iterative Spatial-Temporal Transformer for Long-Form Video Question Answering CVPR 2023
ProD: Prompting-To-Disentangle Domain Knowledge for Cross-Domain Few-Shot Image Classification CVPR 2023
Unified Mask Embedding and Correspondence Learning for Self-Supervised Video Segmentation CVPR 2023
DETR With Additional Global Aggregation for Cross-Domain Weakly Supervised Object Detection CVPR 2023
Adversarially Masking Synthetic To Mimic Real: Adaptive Noise Injection for Point Cloud Segmentation Adaptation CVPR 2023
Bird's-Eye-View Scene Graph for Vision-Language Navigation ICCV 2023
Causal-Debias: Unifying Debiasing in Pretrained Language Models and Fine-tuning via Causal Invariant Learning ACL 2023
WhitenedCSE: Whitening-based Contrastive Learning of Sentence Embeddings ACL 2023
Exploiting Contrastive Learning and Numerical Evidence for Confusing Legal Judgment Prediction EMNLP 2023
Predict the Future from the Past? On the Temporal Data Distribution Shift in Financial Sentiment Classifications EMNLP 2023
Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and Segmentation ICCV 2023
TransHuman: A Transformer-based Human Representation for Generalizable Neural Human Rendering ICCV 2023
Clustering based Point Cloud Representation Learning for 3D Analysis ICCV 2023
TAPIR: Tracking Any Point with Per-Frame Initialization and Temporal Refinement ICCV 2023
GETAvatar: Generative Textured Meshes for Animatable Human Avatars ICCV 2023
LogicSeg: Parsing Visual Semantics with Neural Logic Learning and Reasoning ICCV 2023
Distilling DETR with Visual-Linguistic Knowledge for Open-Vocabulary Object Detection ICCV 2023
Compositional Feature Augmentation for Unbiased Scene Graph Generation ICCV 2023
Rethinking Point Cloud Registration as Masking and Reconstruction ICCV 2023
Omnidirectional Information Gathering for Knowledge Transfer-Based Audio-Visual Navigation ICCV 2023
MAAL: Multimodality-Aware Autoencoder-Based Affordance Learning for 3D Articulated Objects ICCV 2023
Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation ICCV 2023
Action Sensitivity Learning for Temporal Action Localization ICCV 2023
Gloss-Free End-to-End Sign Language Translation ACL 2023
Is ChatGPT a Financial Expert? Evaluating Language Models on Financial Natural Language Processing EMNLP 2023
H2FA R-CNN: Holistic and Hierarchical Feature Alignment for Cross-Domain Weakly Supervised Object Detection CVPR 2022
Feature-Proxy Transformer for Few-Shot Segmentation NIPS 2022
TAP-Vid: A Benchmark for Tracking Any Point in a Video NIPS 2022
GMMSeg: Gaussian Mixture based Generative Semantic Segmentation Models NIPS 2022
Decoupling Features in Hierarchical Propagation for Video Object Segmentation NIPS 2022
Divide-and-Regroup Clustering for Domain Adaptive Person Re-identification AAAI 2022
Monocular Camera-Based Point-Goal Navigation by Learning Depth Channel and Cross-Modality Pyramid Fusion AAAI 2022
Auto-Debias: Debiasing Masked Language Models with Automated Biased Prompts ACL 2022
Buy Tesla, Sell Ford: Assessing Implicit Stock Market Preference in Pre-trained Language Models ACL 2022
Deep Hierarchical Semantic Segmentation CVPR 2022
Multi-View Consistent Generative Adversarial Networks for 3D-Aware Image Synthesis CVPR 2022
Locality-Aware Inter- and Intra-Video Reconstruction for Self-Supervised Correspondence Learning CVPR 2022
Unified Transformer Tracker for Object Tracking CVPR 2022
Learning Memory-Augmented Unidirectional Metrics for Cross-Modality Person Re-Identification CVPR 2022
Automated Progressive Learning for Efficient Training of Vision Transformers CVPR 2022
Large-Scale Video Panoptic Segmentation in the Wild: A Benchmark CVPR 2022
Learning To Learn by Jointly Optimizing Neural Architecture and Weights CVPR 2022
SEEG: Semantic Energized Co-Speech Gesture Generation CVPR 2022
A Simple Episodic Linear Probe Improves Visual Recognition in the Wild CVPR 2022
Compositional Temporal Grounding With Structured Variational Cross-Graph Correspondence Learning CVPR 2022
Visual Abductive Reasoning CVPR 2022
MHR-Net: Multiple-Hypothesis Reconstruction of Non-rigid Shapes from 2D Views ECCV 2022
Instance As Identity: A Generic Online Paradigm for Video Instance Segmentation ECCV 2022
Sparse Teachers Can Be Dense with Knowledge EMNLP 2022
Rethinking Multi-Modal Alignment in Multi-Choice VideoQA from Feature and Sample Perspectives EMNLP 2022
PLATO-Ad: A Unified Advertisement Text Generation Framework with Multi-Task Prompt Learning EMNLP 2022
BARLE: Background-Aware Representation Learning for Background Shift Out-of-Distribution Detection EMNLP 2022
Switch to Generalize: Domain-Switch Learning for Cross-Domain Few-Shot Classification ICLR 2022
Beam-Guided TasNet: An Iterative Speech Separation Framework with Multi-Channel Output INTERSPEECH 2022
Triggerless Backdoor Attack for NLP Tasks with Clean Labels NAACL 2022
Benchmarking Intersectional Biases in NLP NAACL 2022
Removing Raindrops and Rain Streaks in One Go CVPR 2021
Differentiable Multi-Granularity Human Representation Learning for Instance-Aware Human Semantic Parsing CVPR 2021
Constructing a Psychometric Testbed for Fair Natural Language Processing EMNLP 2021
Learning Numeracy: A Simple Yet Effective Number Embedding Approach Using Knowledge Graph EMNLP 2021
Point 4D Transformer Networks for Spatio-Temporal Modeling in Point Cloud Videos CVPR 2021
PartialFed: Cross-Domain Personalized Federated Learning via Partial Initialization NIPS 2021
Few-Shot Segmentation via Cycle-Consistent Transformer NIPS 2021
Associating Objects with Transformers for Video Object Segmentation NIPS 2021
CLIP: A Dataset for Extracting Action Items for Physicians from Hospital Discharge Notes IJCNLP 2021
OpenMix: Reviving Known Knowledge for Discovering Novel Visual Categories in an Open World CVPR 2021
Domain Consensus Clustering for Universal Domain Adaptation CVPR 2021
PR-RRN: Pairwise-Regularized Residual-Recursive Networks for Non-Rigid Structure-From-Motion ICCV 2021
AINet: Association Implantation for Superpixel Segmentation ICCV 2021
Interactive Prototype Learning for Egocentric Action Recognition ICCV 2021
Universal-Prototype Enhancing for Few-Shot Object Detection ICCV 2021
Super-Resolving Cross-Domain Face Miniatures by Peeking at One-Shot Exemplar ICCV 2021
A Multi-Mode Modulator for Multi-Domain Few-Shot Classification ICCV 2021
Sub-Bit Neural Networks: Learning To Compress and Accelerate Binary Neural Networks ICCV 2021
Vector-Decomposed Disentanglement for Domain-Invariant Object Detection ICCV 2021
Weakly Supervised Person Search With Region Siamese Networks ICCV 2021
Adaptive Hierarchical Graph Reasoning With Semantic Coherence for Video-and-Language Inference ICCV 2021
RFNet: Region-Aware Fusion Network for Incomplete Multi-Modal Brain Tumor Segmentation ICCV 2021
T2VLAD: Global-Local Sequence Alignment for Text-Video Retrieval CVPR 2021
Faster Meta Update Strategy for Noise-Robust Deep Learning CVPR 2021
Exploring Heterogeneous Clues for Weakly-Supervised Audio-Visual Video Parsing CVPR 2021
DOTS: Decoupling Operation and Topology in Differentiable Architecture Search CVPR 2021
DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-Scale Consistency CVPR 2021
Decoupled and Memory-Reinforced Networks: Towards Effective Feature Learning for One-Step Person Search AAAI 2021
Auto-Navigator: Decoupled Neural Architecture Search for Visual Navigation WACV 2021
Judgment Prediction via Injecting Legal Knowledge into Neural Networks AAAI 2021
PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences ICLR 2021
Modeling the Probabilistic Distribution of Unlabeled Data for One-shot Medical Image Segmentation AAAI 2021
CLIP: A Dataset for Extracting Action Items for Physicians from Hospital Discharge Notes ACL 2021
Action-Based Conversations Dataset: A Corpus for Building More In-Depth Task-Oriented Dialogue Systems NAACL 2021
VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild CVPR 2021
Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents ECCV 2020
Motion-Excited Sampler: Video Adversarial Attack with Sparked Prior ECCV 2020
Learning to Transfer Learn: Reinforcement Learning-Based Selection for Adaptive Transfer Learning ECCV 2020
Adversarial Localized Energy Network for Structured Prediction AAAI 2020
Person Tube Retrieval via Language Description AAAI 2020
Context Modulated Dynamic Networks for Actor and Action Video Segmentation with Language Queries AAAI 2020
Collaborative Video Object Segmentation by Foreground-Background Integration ECCV 2020
Dataless Short Text Classification Based on Biterm Topic Model and Word Embeddings IJCAI 2020
Unsupervised Scene Adaptation with Memory Regularization in vivo IJCAI 2020
Interpretable Operational Risk Classification with Semi-Supervised Variational Autoencoder ACL 2020
Interpreting Twitter User Geolocation ACL 2020
AARM: Action Attention Recalibration Module for Action Recognition ACML 2020
Neural Topic Model with Attention for Supervised Learning AISTATS 2020
Generating Plausible Counterfactual Explanations for Deep Transformers in Financial Text Classification COLING 2020
Adversarial Style Mining for One-Shot Unsupervised Domain Adaptation NIPS 2020
NAS-Bench-201: Extending the Scope of Reproducible Neural Architecture Search ICLR 2020
Query-efficient Meta Attack to Deep Neural Networks ICLR 2020
Symbiotic Attention with Privileged Information for Egocentric Action Recognition AAAI 2020
Random Erasing Data Augmentation AAAI 2020
FASTER Recurrent Networks for Efficient Video Classification AAAI 2020
EEMEFN: Low-Light Image Enhancement via Edge-Enhanced Multi-Exposure Fusion Network AAAI 2020
Pixel-Level Cycle Association: A New Perspective for Domain Adaptive Semantic Segmentation NIPS 2020
Self-paced Multi-view Co-training JMLR 2020
Consistent Structural Relation Learning for Zero-Shot Segmentation NIPS 2020
Simple and Effective Few-Shot Named Entity Recognition with Structured Nearest Neighbor Learning EMNLP 2020
Content-Consistent Matching for Domain Adaptive Semantic Segmentation ECCV 2020
SF-Net: Single-Frame Supervision for Temporal Action Localization ECCV 2020
Inter-Image Communication for Weakly Supervised Localization ECCV 2020
Memory Aggregation Networks for Efficient Interactive Video Object Segmentation CVPR 2020
Imitative Non-Autoregressive Modeling for Trajectory Forecasting and Imputation CVPR 2020
Gated Channel Transformation for Visual Recognition CVPR 2020
ActBERT: Learning Global-Local Video-Text Representations CVPR 2020
Learning Filter Pruning Criteria for Deep Convolutional Neural Networks Acceleration CVPR 2020
Salience-Guided Cascaded Suppression Network for Person Re-Identification CVPR 2020
Semantic Correspondence as an Optimal Transport Problem CVPR 2020
Inflated Episodic Memory With Region Self-Attention for Long-Tailed Visual Recognition CVPR 2020
Attract or Distract: Exploit the Margin of Open Set ICCV 2019
Dialog Intent Induction with Deep Multi-View Clustering EMNLP 2019
Adaptive Sparse Confidence-Weighted Learning for Online Feature Selection AAAI 2019
Connective Cognition Network for Directional Visual Commonsense Reasoning NIPS 2019
Recognizing Part Attributes With Insufficient Data ICCV 2019
Pose-Guided Feature Alignment for Occluded Person Re-Identification ICCV 2019
Teacher Supervises Students How to Learn From Partially Labeled Images for Facial Landmark Detection ICCV 2019
One-Shot Neural Architecture Search via Self-Evaluated Template Network ICCV 2019
Auto-ReID: Searching for a Part-Aware ConvNet for Person Re-Identification ICCV 2019
Dual Attention Matching for Audio-Visual Event Localization ICCV 2019
Significance-Aware Information Bottleneck for Domain Adaptive Semantic Segmentation ICCV 2019
Entangled Transformer for Image Captioning ICCV 2019
Very Long Natural Scenery Image Prediction by Outpainting ICCV 2019
Sim-Real Joint Reinforcement Transfer for 3D Indoor Navigation CVPR 2019
UnOS: Unified Unsupervised Optical-Flow and Stereo-Depth Estimation by Watching Videos CVPR 2019
DM-GAN: Dynamic Memory Generative Adversarial Networks for Text-To-Image Synthesis CVPR 2019
Contrastive Adaptation Network for Unsupervised Domain Adaptation CVPR 2019
Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration CVPR 2019
Taking a Closer Look at Domain Shift: Category-Level Adversaries for Semantics Consistent Domain Adaptation CVPR 2019
Joint Discriminative and Generative Learning for Person Re-Identification CVPR 2019
Searching for a Robust Neural Architecture in Four GPU Hours CVPR 2019
Invariance Matters: Exemplar Memory for Domain Adaptive Person Re-Identification CVPR 2019
Learning to Propagate Labels: Transductive Propagation Network for Few-Shot Learning ICLR 2019
A Semi-Markov Structured Support Vector Machine Model for High-Precision Named Entity Recognition ACL 2019
Syntax-Infused Variational Autoencoder for Text Generation ACL 2019
Video Interactive Captioning with Human Prompts IJCAI 2019
Generalized Majorization-Minimization for Non-Convex Optimization IJCAI 2019
What You Say and How You Say It Matters: Predicting Stock Volatility Using Verbal and Vocal Cues ACL 2019
Dialog Intent Induction with Deep Multi-View Clustering IJCNLP 2019
Network Pruning via Transformable Architecture Search NIPS 2019
A Robust and Efficient Algorithm for the PnL Problem Using Algebraic Distance to Approximate the Reprojection Distance AAAI 2019
A Bottom-Up Clustering Approach to Unsupervised Person Re-Identification AAAI 2019
Cubic LSTMs for Video Prediction AAAI 2019
Uncertainty Sampling for Action Recognition via Maximizing Expected Average Precision IJCAI 2018
Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks IJCAI 2018
A Unified Analysis of Stochastic Momentum Methods for Deep Learning IJCAI 2018
Deep Adversarial Attention Alignment for Unsupervised Domain Adaptation: the Benefit of Target Expectation Maximization ECCV 2018
Self-produced Guidance for Weakly-supervised Object Localization ECCV 2018
Adversarial Complementary Learning for Weakly Supervised Object Localization CVPR 2018
Style Aggregated Network for Facial Landmark Detection CVPR 2018
Collective Entity Disambiguation with Structured Gradient Tree Boosting NAACL 2018
Improve Neural Entity Recognition via Multi-Task Data Selection and Constrained Decoding NAACL 2018
Robust PCA by Manifold Optimization JMLR 2018
Macro-Micro Adversarial Network for Human Parsing ECCV 2018
Convolutional Neural Networks with Recurrent Neural Filters EMNLP 2018
RCAA: Relational Context-Aware Agents for Person Search ECCV 2018
Supervision-by-Registration: An Unsupervised Approach to Improve the Precision of Facial Landmark Detectors CVPR 2018
Beyond Part Models: Person Retrieval with Refined Part Pooling (and A Strong Convolutional Baseline) ECCV 2018
Exploit the Unknown Gradually: One-Shot Video-Based Person Re-Identification by Stepwise Learning CVPR 2018
Camera Style Adaptation for Person Re-Identification CVPR 2018
Compound Memory Networks for Few-shot Video Classification ECCV 2018
Generalizing A Person Retrieval Model Hetero- and Homogeneously ECCV 2018
Image-Image Domain Adaptation With Preserved Self-Similarity and Domain-Dissimilarity for Person Re-Identification CVPR 2018
Occlusion Aware Unsupervised Learning of Optical Flow CVPR 2018
Watching a Small Portion could be as Good as Watching All: Towards Efficient Video Classification IJCAI 2018
Complex Event Detection by Identifying Reliable Shots From Untrimmed Videos ICCV 2017
Learning Discriminative Latent Attributes for Zero-Shot Classification ICCV 2017
Recursive Spatial Transformer (ReST) for Alignment-Free Face Recognition ICCV 2017
Unlabeled Samples Generated by GAN Improve the Person Re-Identification Baseline in Vitro ICCV 2017
Few-Shot Object Recognition From Machine-Labeled Web Images CVPR 2017
Person Re-Identification in the Wild CVPR 2017
Bidirectional Multirate Reconstruction for Temporal Modeling in Videos CVPR 2017
More Is Less: A More Complicated Network With Less Inference Complexity CVPR 2017
Alibaba at IJCNLP-2017 Task 1: Embedding Grammatical Features into LSTMs for Chinese Grammatical Error Diagnosis Task IJCNLP 2017
Part-of-Speech Tagging for Historical English NAACL 2016
They Are Not Equally Reliable: Semantic Event Search Using Differentiated Concept Classifiers CVPR 2016
Toward Socially-Infused Information Extraction: Embedding Authors, Mentions, and Entities EMNLP 2016
Attention to Scale: Scale-Aware Semantic Image Segmentation CVPR 2016
Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks CVPR 2016
Voice Conversion Based on Matrix Variate Gaussian Mixture Model Using Multiple Frame Features INTERSPEECH 2016
Hierarchical Recurrent Neural Encoder for Video Representation With Application to Captioning CVPR 2016
Improving Topic Model Stability for Effective Document Exploration IJCAI 2016
CNN-RNN: A Unified Framework for Multi-Label Image Classification CVPR 2016
You Lead, We Exceed: Labor-Free Video Concept Learning by Jointly Exploiting Web Videos and Images CVPR 2016
A Discriminative CNN Video Representation for Event Detection CVPR 2015
Complex Event Detection using Semantic Saliency and Nearly-Isotonic SVM ICML 2015
Efficient Methods for Incorporating Knowledge into Topic Models EMNLP 2015
Inferring Painting Style with Multi-Task Dictionary Learning IJCAI 2015
Semantic Concept Discovery for Large-Scale Zero-Shot Event Detection IJCAI 2015
Scalable Maximum Margin Matrix Factorization by Active Riemannian Subspace Search IJCAI 2015
Efficient Methods for Inferring Large Sparse Topic Hierarchies ACL 2015
S-MART: Novel Tree-based Structured Learning Algorithms Applied to Tweet Entity Linking ACL 2015
S-MART: Novel Tree-based Structured Learning Algorithms Applied to Tweet Entity Linking IJCNLP 2015
Efficient Methods for Inferring Large Sparse Topic Hierarchies IJCNLP 2015
WikiQA: A Challenge Dataset for Open-Domain Question Answering EMNLP 2015
Unsupervised Multi-Domain Adaptation with Feature Embeddings NAACL 2015
Look and Think Twice: Capturing Top-Down Visual Attention With Feedback Convolutional Neural Networks ICCV 2015
Learning Like a Child: Fast Novel Visual Concept Learning From Sentence Descriptions of Images ICCV 2015
Depth-Based Hand Pose Estimation: Data, Methods, and Challenges ICCV 2015
Learning From Massive Noisy Labeled Data for Image Classification CVPR 2015
DevNet: A Deep Event Network for Multimedia Event Detection and Evidence Recounting CVPR 2015
Decomposable Nonlocal Tensor Dictionary Learning for Multispectral Image Denoising CVPR 2014
Fast Easy Unsupervised Domain Adaptation with Marginalized Structured Dropout ACL 2014
Event Detection using Multi-Level Relevance Labels and Multiple Features CVPR 2014
Parsing Occluded People CVPR 2014
Robust Tensor Clustering with Non-Greedy Maximization IJCAI 2013
A Log-Linear Model for Unsupervised Text Normalization EMNLP 2013
Complex Event Detection via Multi-source Video Attributes CVPR 2013
Harry Potter's Marauder's Map: Localizing and Tracking Multiple Persons-of-Interest by Nonnegative Discretization CVPR 2013
How Related Exemplars Help Complex Event Detection in Web Videos? ICCV 2013
Overcoming the Memory Bottleneck in Distributed Training of Latent Variable Models of Text NAACL 2013
Thinking of Images as What They Are: Compound Matrix Regression for Image Classification IJCAI 2013
Co-Regularized Ensemble for Feature Selection IJCAI 2013
Feature Weighting via Optimal Thresholding for Video Analysis ICCV 2013
Space-Time Robust Representation for Action Recognition ICCV 2013
Quality-biased Ranking of Short Texts in Microblogging Services IJCNLP 2011