Yi Yang
401 papers · 2011–2026 · 19 top CS/AI conferences
Achievements
Taxonomy Completionist (31)
Keyword Pioneer
Interdisciplinary Bridge
Renaissance Researcher (7)
Hot Topic Early Bird
Academic Marathon (15)
Conference Loyalist (31)
Keyword Trendsetter Combo (6)
Dynamic Duo (53)
Triple Crown
Topic Evolution
Keyword Champion (3)
Grand Slam
Mega-Team (24)
Topic Pioneer
Deep Specialist (46)
Conference Pioneer
Unstoppable (14)
The Questioner (5)
Century Club (387)
Keyword Collector (76)
Prolific Year (35)
Trend Setter
Conferences
CVPR (103)
ICCV (65)
AAAI (38)
ACL (32)
NIPS (31)
EMNLP (29)
ECCV (27)
IJCAI (21)
ICLR (18)
NAACL (11)
ICML (8)
IJCNLP (6)
WACV (3)
INTERSPEECH (2)
JMLR (2)
COLING (2)
EACL (1)
AISTATS (1)
ACML (1)
Keywords
semantic segmentation (25)
domain adaptation (25)
video understanding (24)
large language model (22)
convolutional neural network (22)
representation learning (21)
diffusion model (16)
zero-shot learning (15)
person re-identification (15)
contrastive learning (15)
object detection (12)
multimodal learning (12)
few-shot learning (12)
attention mechanism (12)
action recognition (12)
unsupervised learning (10)
self-supervised learning (10)
text classification (9)
knowledge distillation (9)
adversarial learning (9)
Papers
Oscillation Inversion: Training-Free Image and Video Enhancement Through Oscillated Latents in Large Flow Models
AAAI 2026
Bayes-Optimal Fair Classification with Multiple Sensitive Features
AAAI 2026
Beyond Chunking: Discourse-Aware Hierarchical Retrieval for Long Document Question Answering
ACL 2026
LBLLM: Lightweight Binarization of Large Language Models via Three-Stage Distillation
ACL 2026
One Refiner to Unlock Them All: Inference-Time Reasoning Elicitation via Reinforcement Query Refinement
ACL 2026
OMIBench: Benchmarking Olympiad-Level Multi-Image Reasoning in Large Vision-Language Models
ACL 2026
HiMo: High-Speed Objects Motion Compensation in Point Clouds (Abstract Reprint)
AAAI 2026
MathFlow: Enhancing the Perceptual Flow of MLLMs for Visual Mathematical Problems
ACL 2026
KV-Embedding: Training-free Text Embedding via Internal KV Re-routing in Decoder-only LLMs
ACL 2026
FlowMorph: Revealing an Optimizable Flow Latent Space for Controlled Image Morphing
WACV 2026
Revealing the Numeracy Gap: An Empirical Investigation of Text Embedding Models
EACL 2026
Breaking the Modality Barrier: Generative Modeling for Accurate Molecule Retrieval from Mass Spectra
AAAI 2026
Melodia: Training-Free Music Editing Guided by Attention Probing in Diffusion Models
AAAI 2026
Insert Anything: Image Insertion via In-Context Editing in DiT
AAAI 2026
DLVINet: Advancing Dual-Lens Video Inpainting Beyond Parallax Constraints
AAAI 2026
Evaluating and Aligning Human Economic Risk Preferences in LLMs
EMNLP 2025
DiffVsgg: Diffusion-Driven Online Video Scene Graph Generation
CVPR 2025
ReFu: Recursive Fusion for Exemplar-Free 3D Class-Incremental Learning
WACV 2025
Bias A-head? Analyzing Bias in Transformer-Based Language Model Attention Heads
NAACL 2025
Representation Learning with Mutual Influence of Modalities for Node Classification in Multi-Modal Heterogeneous Networks
IJCAI 2025
Drafting and Revision: Advancing High-Fidelity Video Inpainting
IJCAI 2025
ZeroMamba: Exploring Visual State Space Model for Zero-Shot Learning
AAAI 2025
Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models
AAAI 2025
Autonomous LLM-Enhanced Adversarial Attack for Text-to-Motion
AAAI 2025
BrainGuard: Privacy-Preserving Multisubject Image Reconstructions from Brain Activities
AAAI 2025
LLM Agents Can Be Choice-Supportive Biased Evaluators: An Empirical Study
AAAI 2025
Prompt-Aware Controllable Shadow Removal
IJCAI 2025
DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization
ICML 2025
Holistic Physics Solver: Learning PDEs in a Unified Spectral-Physical Space
ICML 2025
Origin Identification for Text-Guided Image-to-Image Diffusion Models
ICML 2025
Reaction Graph: Towards Reaction-Level Modeling for Chemical Reactions with 3D Structures
ICML 2025
Learning without Isolation: Pathway Protection for Continual Learning
ICML 2025
Know the Unknown: An Uncertainty-Sensitive Method for LLM Instruction Tuning
ACL 2025
Adapting General-Purpose Embedding Models to Private Datasets Using Keyword-based Retrieval
ACL 2025
Achieving binary weight and activation for LLMs using Post-Training Quantization
ACL 2025
Sparse Rewards Can Self-Train Dialogue Agents
ACL 2025
Bridging the LLM Accessibility Divide? Performance, Fairness, and Cost of Closed versus Open LLMs for Automated Essay Scoring
ACL 2025
PersonaTwin: A Multi-Tier Prompt Conditioning Framework for Generating and Evaluating Personalized Digital Twins
ACL 2025
Long-horizon Visual Instruction Generation with Logic and Attribute Self-reflection
ICLR 2025
Transformer-based Speech Model Learns Well as Infants and Encodes Abstractions through Exemplars in the Poverty of the Stimulus Environment
COLING 2025
VideoGrain: Modulating Space-Time Attention for Multi-Grained Video Editing
ICLR 2025
TDDBench: A Benchmark for Training data detection
ICLR 2025
Hydra-SGG: Hybrid Relation Assignment for One-stage Scene Graph Generation
ICLR 2025
OSDA Agent: Leveraging Large Language Models for De Novo Design of Organic Structure Directing Agents
ICLR 2025
3DIS: Depth-Driven Decoupled Image Synthesis for Universal Multi-Instance Generation
ICLR 2025
Adversarial Mixup Unlearning
ICLR 2025
Underwater Visual SLAM with Depth Uncertainty and Medium Modeling
ICCV 2025
Towards Human-like Virtual Beings: Simulating Human Behavior in 3D Scenes
ICCV 2025
DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models
ICCV 2025
NeRF Is a Valuable Assistant for 3D Gaussian Splatting
ICCV 2025
From Trial to Triumph: Advancing Long Video Understanding via Visual Context Sample Scaling and Self-reward Alignment
ICCV 2025
BVINet: Unlocking Blind Video Inpainting with Zero Annotations
ICCV 2025
UniGlyph: Unified Segmentation-Conditioned Diffusion for Precise Visual Text Synthesis
ICCV 2025
TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation
ICCV 2025
Hierarchical Event Memory for Accurate and Low-latency Online Video Temporal Grounding
ICCV 2025
MaGS: Reconstructing and Simulating Dynamic 3D Objects with Mesh-adsorbed Gaussian Splatting
ICCV 2025
MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh
ICCV 2025
Dual Reciprocal Learning of Language-based Human Motion Understanding and Generation
ICCV 2025
TAPNext: Tracking Any Point (TAP) as Next Token Prediction
ICCV 2025
From Image to Video: An Empirical Study of Diffusion Representations
ICCV 2025
Gaussian-based World Model: Gaussian Priors for Voxel-Based Occupancy Prediction and Future Motion Prediction
ICCV 2025
R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization
ICCV 2025
MC-Bench: A Benchmark for Multi-Context Visual Grounding in the Era of MLLMs
ICCV 2025
DecoupledESC: Enhancing Emotional Support Generation via Strategy-Response Decoupled Preference Optimization
EMNLP 2025
MASTER: Multi-Agent Security Through Exploration of Roles and Topological Structures - A Comprehensive Framework
EMNLP 2025
Dropping Experts, Recombining Neurons: Retraining-Free Pruning for Sparse Mixture-of-Experts LLMs
EMNLP 2025
Video2Roleplay: A Multimodal Dataset and Framework for Video-Guided Role-playing Agents
EMNLP 2025
SPARK: Simulating the Co-evolution of Stance and Topic Dynamics in Online Discourse with LLM-based Agents
EMNLP 2025
Identifying Pre-training Data in LLMs: A Neuron Activation-Based Detection Framework
EMNLP 2025
Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy
CVPR 2025
FinMTEB: Finance Massive Text Embedding Benchmark
EMNLP 2025
Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head Generation
CVPR 2025
Adapting Text-to-Image Generation with Feature Difference Instruction for Generic Image Restoration
CVPR 2025
SKDream: Controllable Multi-view and 3D Generation with Arbitrary Skeletons
CVPR 2025
GraphMimic: Graph-to-Graphs Generative Modeling from Videos for Policy Learning
CVPR 2025
DroneSplat: 3D Gaussian Splatting for Robust 3D Reconstruction from In-the-Wild Drone Imagery
CVPR 2025
EnergyMoGen: Compositional Human Motion Generation with Energy-Based Diffusion Model in Latent Space
CVPR 2025
Scene Map-based Prompt Tuning for Navigation Instruction Generation
CVPR 2025
EconNLI: Evaluating Large Language Models on Economics Reasoning
ACL 2024
Enhancing Hallucination Detection through Perturbation-Based Synthetic Data Generation in System Responses
ACL 2024
VillagerAgent: A Graph-Based Multi-Agent Framework for Coordinating Complex Task Dependencies in Minecraft
ACL 2024
FragRel: Exploiting Fragment-level Relations in the External Memory of Large Language Models
ACL 2024
Exploring the Relationship between In-Context Learning and Instruction Tuning
EMNLP 2024
Neural Clustering based Visual Representation Learning
CVPR 2024
DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models (Exemplified as A Video Agent)
ICML 2024
Improving Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning
ICML 2024
TOPA: Extending Large Language Models for Video Understanding via Text-Only Pre-Alignment
NIPS 2024
Image Copy Detection for Diffusion Models
NIPS 2024
Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models
ICLR 2024
Deep SE(3)-Equivariant Geometric Reasoning for Precise Placement Tasks
ICLR 2024
Clustering for Protein Representation Learning
CVPR 2024
LSK3DNet: Towards Effective and Efficient 3D Perception with Large Sparse Kernels
CVPR 2024
CapHuman: Capture Your Moments in Parallel Universes
CVPR 2024
Entangled View-Epipolar Information Aggregation for Generalizable Neural Radiance Fields
CVPR 2024
MS-DETR: Efficient DETR Training with Mixed Supervision
CVPR 2024
Volumetric Environment Representation for Vision-Language Navigation
CVPR 2024
VISTA-LLAMA: Reducing Hallucination in Video Language Models via Equal Distance to Visual Tokens
CVPR 2024
Epipolar-Free 3D Gaussian Splatting for Generalizable Novel View Synthesis
NIPS 2024
SeFlow: A Self-Supervised Scene Flow Method in Autonomous Driving
ECCV 2024
General and Task-Oriented Video Segmentation
ECCV 2024
Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion
ECCV 2024
Nonverbal Interaction Detection
ECCV 2024
Navigation Instruction Generation with BEV Perception and Large Language Models
ECCV 2024
VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models
NIPS 2024
VLMimic: Vision Language Models are Visual Imitation Learner for Fine-grained Actions
NIPS 2024
DRIP: Unleashing Diffusion Priors for Joint Foreground and Alpha Prediction in Image Matting
NIPS 2024
TAPVid-3D: A Benchmark for Tracking Any Point in 3D
NIPS 2024
Vision-Language Navigation with Energy-Based Policy
NIPS 2024
Moving Off-the-Grid: Scene-Grounded Video Representations
NIPS 2024
FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention
NIPS 2024
DataStealing: Steal Data from Diffusion Models in Federated Learning with Multiple Trojans
NIPS 2024
Human-Object Interaction Detection Collaborated with Large Relation-driven Diffusion Models
NIPS 2024
Controllable Navigation Instruction Generation with Chain of Thought Prompting
ECCV 2024
HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting
ECCV 2024
Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts
ECCV 2024
Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data
ECCV 2024
Depth-Aware Blind Image Decomposition for Real-World Adverse Weather Recovery
ECCV 2024
VividDreamer: Invariant Score Distillation for Hyper-Realistic Text-to-3D Generation
ECCV 2024
Beyond Surface Similarity: Detecting Subtle Semantic Shifts in Financial Narratives
NAACL 2024
Connecting the Dots: Inferring Patent Phrase Similarity with Retrieved Phrase Graphs
NAACL 2024
Interpretable3D: An Ad-Hoc Interpretable Classifier for 3D Point Clouds
AAAI 2024
Stitching Segments and Sentences towards Generalization in Video-Text Pre-training
AAAI 2024
DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval
AAAI 2024
Improving Bird's Eye View Semantic Segmentation by Task Decomposition
CVPR 2024
Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval
CVPR 2024
Learning from One Continuous Video Stream
CVPR 2024
MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis
CVPR 2024
Revealing the Two Sides of Data Augmentation: An Asymmetric Distillation-based Win-Win Solution for Open-Set Recognition
IJCAI 2024
SIFU: Side-view Conditioned Implicit Function for Real-world Usable Clothed Human Reconstruction
CVPR 2024
Clustering Propagation for Universal Medical Image Segmentation
CVPR 2024
Psychometry: An Omnifit Model for Image Reconstruction from Human Brain Activity
CVPR 2024
Automated Tone Transcription and Clustering with Tone2Vec
EMNLP 2024
CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent Layers
ACL 2024
MS2SL: Multimodal Spoken Data-Driven Continuous Sign Language Production
ACL 2024
JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery
ICCV 2023
Learning Symmetry-Aware Geometry Correspondences for 6D Object Pose Estimation
ICCV 2023
Fast and Accurate Factual Inconsistency Detection Over Long Documents
EMNLP 2023
FinEntity: Entity-level Sentiment Classification for Financial Texts
EMNLP 2023
One Is All: Bridging the Gap between Neural Radiance Fields Architectures with Progressive Volume Distillation
AAAI 2023
Semi-attention Partition for Occluded Person Re-identification
AAAI 2023
Stroke Extraction of Chinese Character Based on Deep Structure Deformable Image Registration
AAAI 2023
A Benchmark and Asymmetrical-Similarity Learning for Practical Image Copy Detection
AAAI 2023
Global-correlated 3D-decoupling Transformer for Clothed Avatar Reconstruction
NIPS 2023
Neural-Logic Human-Object Interaction Detection
NIPS 2023
Analogical Inference Enhanced Knowledge Graph Embedding
AAAI 2023
TransHP: Image Classification with Hierarchical Prompting
NIPS 2023
PointGPT: Auto-regressively Generative Pre-training from Point Clouds
NIPS 2023
Exploring Hypergraph of Earnings Call for Risk Prediction (Student Abstract)
AAAI 2023
Debiasing Intrinsic Bias and Application Bias Jointly via Invariant Risk Minimization (Student Abstract)
AAAI 2023
Perception Test: A Diagnostic Benchmark for Multimodal Video Models
NIPS 2023
Hyperbolic Space with Hierarchical Margin Boosts Fine-Grained Learning from Coarse Labels
NIPS 2023
DAC-DETR: Divide the Attention Layers and Conquer
NIPS 2023
Pyramid Diffusion Models for Low-light Image Enhancement
IJCAI 2023
Video Object Segmentation in Panoptic Wild Scenes
IJCAI 2023
Bidirectional Cross-Modal Knowledge Exploration for Video Recognition With Pre-Trained Vision-Language Models
CVPR 2023
Global-to-Local Modeling for Video-Based 3D Human Pose and Shape Estimation
CVPR 2023
FedSeg: Class-Heterogeneous Federated Learning for Semantic Segmentation
CVPR 2023
Text Augmented Spatial Aware Zero-shot Referring Image Segmentation
EMNLP 2023
Logic-induced Diagnostic Reasoning for Semi-supervised Semantic Segmentation
ICCV 2023
Continuous-Discrete Convolution for Geometry-Sequence Modeling in Proteins
ICLR 2023
Suppressing the Heterogeneity: A Strong Feature Extractor for Few-shot Segmentation
ICLR 2023
Decompose to Generalize: Species-Generalized Animal Pose Estimation
ICLR 2023
DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training
ICLR 2023
Efficient Multimodal Fusion via Interactive Prompting
CVPR 2023
PointListNet: Deep Learning on 3D Point Lists
CVPR 2023
LANA: A Language-Capable Navigator for Instruction Following and Generation
CVPR 2023
Joint Video Multi-Frame Interpolation and Deblurring Under Unknown Exposure Time
CVPR 2023
Context-Aware Pretraining for Efficient Blind Image Decomposition
CVPR 2023
MIST: Multi-Modal Iterative Spatial-Temporal Transformer for Long-Form Video Question Answering
CVPR 2023
ProD: Prompting-To-Disentangle Domain Knowledge for Cross-Domain Few-Shot Image Classification
CVPR 2023
Unified Mask Embedding and Correspondence Learning for Self-Supervised Video Segmentation
CVPR 2023
DETR With Additional Global Aggregation for Cross-Domain Weakly Supervised Object Detection
CVPR 2023
Adversarially Masking Synthetic To Mimic Real: Adaptive Noise Injection for Point Cloud Segmentation Adaptation
CVPR 2023
Bird's-Eye-View Scene Graph for Vision-Language Navigation
ICCV 2023
Causal-Debias: Unifying Debiasing in Pretrained Language Models and Fine-tuning via Causal Invariant Learning
ACL 2023
WhitenedCSE: Whitening-based Contrastive Learning of Sentence Embeddings
ACL 2023
Exploiting Contrastive Learning and Numerical Evidence for Confusing Legal Judgment Prediction
EMNLP 2023
Predict the Future from the Past? On the Temporal Data Distribution Shift in Financial Sentiment Classifications
EMNLP 2023
Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and Segmentation
ICCV 2023
TransHuman: A Transformer-based Human Representation for Generalizable Neural Human Rendering
ICCV 2023
Clustering based Point Cloud Representation Learning for 3D Analysis
ICCV 2023
TAPIR: Tracking Any Point with Per-Frame Initialization and Temporal Refinement
ICCV 2023
GETAvatar: Generative Textured Meshes for Animatable Human Avatars
ICCV 2023
LogicSeg: Parsing Visual Semantics with Neural Logic Learning and Reasoning
ICCV 2023
Distilling DETR with Visual-Linguistic Knowledge for Open-Vocabulary Object Detection
ICCV 2023
Compositional Feature Augmentation for Unbiased Scene Graph Generation
ICCV 2023
Rethinking Point Cloud Registration as Masking and Reconstruction
ICCV 2023
Omnidirectional Information Gathering for Knowledge Transfer-Based Audio-Visual Navigation
ICCV 2023
MAAL: Multimodality-Aware Autoencoder-Based Affordance Learning for 3D Articulated Objects
ICCV 2023
Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation
ICCV 2023
Action Sensitivity Learning for Temporal Action Localization
ICCV 2023
Gloss-Free End-to-End Sign Language Translation
ACL 2023
Is ChatGPT a Financial Expert? Evaluating Language Models on Financial Natural Language Processing
EMNLP 2023
H2FA R-CNN: Holistic and Hierarchical Feature Alignment for Cross-Domain Weakly Supervised Object Detection
CVPR 2022
Feature-Proxy Transformer for Few-Shot Segmentation
NIPS 2022
TAP-Vid: A Benchmark for Tracking Any Point in a Video
NIPS 2022
GMMSeg: Gaussian Mixture based Generative Semantic Segmentation Models
NIPS 2022
Decoupling Features in Hierarchical Propagation for Video Object Segmentation
NIPS 2022
Divide-and-Regroup Clustering for Domain Adaptive Person Re-identification
AAAI 2022
Monocular Camera-Based Point-Goal Navigation by Learning Depth Channel and Cross-Modality Pyramid Fusion
AAAI 2022
Auto-Debias: Debiasing Masked Language Models with Automated Biased Prompts
ACL 2022
Buy Tesla, Sell Ford: Assessing Implicit Stock Market Preference in Pre-trained Language Models
ACL 2022
Deep Hierarchical Semantic Segmentation
CVPR 2022
Multi-View Consistent Generative Adversarial Networks for 3D-Aware Image Synthesis
CVPR 2022
Locality-Aware Inter- and Intra-Video Reconstruction for Self-Supervised Correspondence Learning
CVPR 2022
Unified Transformer Tracker for Object Tracking
CVPR 2022
Learning Memory-Augmented Unidirectional Metrics for Cross-Modality Person Re-Identification
CVPR 2022
Automated Progressive Learning for Efficient Training of Vision Transformers
CVPR 2022
Large-Scale Video Panoptic Segmentation in the Wild: A Benchmark
CVPR 2022
Learning To Learn by Jointly Optimizing Neural Architecture and Weights
CVPR 2022
SEEG: Semantic Energized Co-Speech Gesture Generation
CVPR 2022
A Simple Episodic Linear Probe Improves Visual Recognition in the Wild
CVPR 2022
Compositional Temporal Grounding With Structured Variational Cross-Graph Correspondence Learning
CVPR 2022
Visual Abductive Reasoning
CVPR 2022
MHR-Net: Multiple-Hypothesis Reconstruction of Non-rigid Shapes from 2D Views
ECCV 2022
Instance As Identity: A Generic Online Paradigm for Video Instance Segmentation
ECCV 2022
Sparse Teachers Can Be Dense with Knowledge
EMNLP 2022
Rethinking Multi-Modal Alignment in Multi-Choice VideoQA from Feature and Sample Perspectives
EMNLP 2022
PLATO-Ad: A Unified Advertisement Text Generation Framework with Multi-Task Prompt Learning
EMNLP 2022
BARLE: Background-Aware Representation Learning for Background Shift Out-of-Distribution Detection
EMNLP 2022
Switch to Generalize: Domain-Switch Learning for Cross-Domain Few-Shot Classification
ICLR 2022
Beam-Guided TasNet: An Iterative Speech Separation Framework with Multi-Channel Output
INTERSPEECH 2022
Triggerless Backdoor Attack for NLP Tasks with Clean Labels
NAACL 2022
Benchmarking Intersectional Biases in NLP
NAACL 2022
Removing Raindrops and Rain Streaks in One Go
CVPR 2021
Differentiable Multi-Granularity Human Representation Learning for Instance-Aware Human Semantic Parsing
CVPR 2021
Constructing a Psychometric Testbed for Fair Natural Language Processing
EMNLP 2021
Learning Numeracy: A Simple Yet Effective Number Embedding Approach Using Knowledge Graph
EMNLP 2021
Point 4D Transformer Networks for Spatio-Temporal Modeling in Point Cloud Videos
CVPR 2021
PartialFed: Cross-Domain Personalized Federated Learning via Partial Initialization
NIPS 2021
Few-Shot Segmentation via Cycle-Consistent Transformer
NIPS 2021
Associating Objects with Transformers for Video Object Segmentation
NIPS 2021
CLIP: A Dataset for Extracting Action Items for Physicians from Hospital Discharge Notes
IJCNLP 2021
OpenMix: Reviving Known Knowledge for Discovering Novel Visual Categories in an Open World
CVPR 2021
Domain Consensus Clustering for Universal Domain Adaptation
CVPR 2021
PR-RRN: Pairwise-Regularized Residual-Recursive Networks for Non-Rigid Structure-From-Motion
ICCV 2021
AINet: Association Implantation for Superpixel Segmentation
ICCV 2021
Interactive Prototype Learning for Egocentric Action Recognition
ICCV 2021
Universal-Prototype Enhancing for Few-Shot Object Detection
ICCV 2021
Super-Resolving Cross-Domain Face Miniatures by Peeking at One-Shot Exemplar
ICCV 2021
A Multi-Mode Modulator for Multi-Domain Few-Shot Classification
ICCV 2021
Sub-Bit Neural Networks: Learning To Compress and Accelerate Binary Neural Networks
ICCV 2021
Vector-Decomposed Disentanglement for Domain-Invariant Object Detection
ICCV 2021
Weakly Supervised Person Search With Region Siamese Networks
ICCV 2021
Adaptive Hierarchical Graph Reasoning With Semantic Coherence for Video-and-Language Inference
ICCV 2021
RFNet: Region-Aware Fusion Network for Incomplete Multi-Modal Brain Tumor Segmentation
ICCV 2021
T2VLAD: Global-Local Sequence Alignment for Text-Video Retrieval
CVPR 2021
Faster Meta Update Strategy for Noise-Robust Deep Learning
CVPR 2021
Exploring Heterogeneous Clues for Weakly-Supervised Audio-Visual Video Parsing
CVPR 2021
DOTS: Decoupling Operation and Topology in Differentiable Architecture Search
CVPR 2021
DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-Scale Consistency
CVPR 2021
Decoupled and Memory-Reinforced Networks: Towards Effective Feature Learning for One-Step Person Search
AAAI 2021
Auto-Navigator: Decoupled Neural Architecture Search for Visual Navigation
WACV 2021
Judgment Prediction via Injecting Legal Knowledge into Neural Networks
AAAI 2021
PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences
ICLR 2021
Modeling the Probabilistic Distribution of Unlabeled Data for One-shot Medical Image Segmentation
AAAI 2021
CLIP: A Dataset for Extracting Action Items for Physicians from Hospital Discharge Notes
ACL 2021
Action-Based Conversations Dataset: A Corpus for Building More In-Depth Task-Oriented Dialogue Systems
NAACL 2021
VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild
CVPR 2021
Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents
ECCV 2020
Motion-Excited Sampler: Video Adversarial Attack with Sparked Prior
ECCV 2020
Learning to Transfer Learn: Reinforcement Learning-Based Selection for Adaptive Transfer Learning
ECCV 2020
Adversarial Localized Energy Network for Structured Prediction
AAAI 2020
Person Tube Retrieval via Language Description
AAAI 2020
Context Modulated Dynamic Networks for Actor and Action Video Segmentation with Language Queries
AAAI 2020
Collaborative Video Object Segmentation by Foreground-Background Integration
ECCV 2020
Dataless Short Text Classification Based on Biterm Topic Model and Word Embeddings
IJCAI 2020
Unsupervised Scene Adaptation with Memory Regularization in vivo
IJCAI 2020
Interpretable Operational Risk Classification with Semi-Supervised Variational Autoencoder
ACL 2020
Interpreting Twitter User Geolocation
ACL 2020
AARM: Action Attention Recalibration Module for Action Recognition
ACML 2020
Neural Topic Model with Attention for Supervised Learning
AISTATS 2020
Generating Plausible Counterfactual Explanations for Deep Transformers in Financial Text Classification
COLING 2020
Adversarial Style Mining for One-Shot Unsupervised Domain Adaptation
NIPS 2020
NAS-Bench-201: Extending the Scope of Reproducible Neural Architecture Search
ICLR 2020
Query-efficient Meta Attack to Deep Neural Networks
ICLR 2020
Symbiotic Attention with Privileged Information for Egocentric Action Recognition
AAAI 2020
Random Erasing Data Augmentation
AAAI 2020
FASTER Recurrent Networks for Efficient Video Classification
AAAI 2020
EEMEFN: Low-Light Image Enhancement via Edge-Enhanced Multi-Exposure Fusion Network
AAAI 2020
Pixel-Level Cycle Association: A New Perspective for Domain Adaptive Semantic Segmentation
NIPS 2020
Self-paced Multi-view Co-training
JMLR 2020
Consistent Structural Relation Learning for Zero-Shot Segmentation
NIPS 2020
Simple and Effective Few-Shot Named Entity Recognition with Structured Nearest Neighbor Learning
EMNLP 2020
Content-Consistent Matching for Domain Adaptive Semantic Segmentation
ECCV 2020
SF-Net: Single-Frame Supervision for Temporal Action Localization
ECCV 2020
Inter-Image Communication for Weakly Supervised Localization
ECCV 2020
Memory Aggregation Networks for Efficient Interactive Video Object Segmentation
CVPR 2020
Imitative Non-Autoregressive Modeling for Trajectory Forecasting and Imputation
CVPR 2020
Gated Channel Transformation for Visual Recognition
CVPR 2020
ActBERT: Learning Global-Local Video-Text Representations
CVPR 2020
Learning Filter Pruning Criteria for Deep Convolutional Neural Networks Acceleration
CVPR 2020
Salience-Guided Cascaded Suppression Network for Person Re-Identification
CVPR 2020
Semantic Correspondence as an Optimal Transport Problem
CVPR 2020
Inflated Episodic Memory With Region Self-Attention for Long-Tailed Visual Recognition
CVPR 2020
Attract or Distract: Exploit the Margin of Open Set
ICCV 2019
Dialog Intent Induction with Deep Multi-View Clustering
EMNLP 2019
Adaptive Sparse Confidence-Weighted Learning for Online Feature Selection
AAAI 2019
Connective Cognition Network for Directional Visual Commonsense Reasoning
NIPS 2019
Recognizing Part Attributes With Insufficient Data
ICCV 2019
Pose-Guided Feature Alignment for Occluded Person Re-Identification
ICCV 2019
Teacher Supervises Students How to Learn From Partially Labeled Images for Facial Landmark Detection
ICCV 2019
One-Shot Neural Architecture Search via Self-Evaluated Template Network
ICCV 2019
Auto-ReID: Searching for a Part-Aware ConvNet for Person Re-Identification
ICCV 2019
Dual Attention Matching for Audio-Visual Event Localization
ICCV 2019
Significance-Aware Information Bottleneck for Domain Adaptive Semantic Segmentation
ICCV 2019
Entangled Transformer for Image Captioning
ICCV 2019
Very Long Natural Scenery Image Prediction by Outpainting
ICCV 2019
Sim-Real Joint Reinforcement Transfer for 3D Indoor Navigation
CVPR 2019
UnOS: Unified Unsupervised Optical-Flow and Stereo-Depth Estimation by Watching Videos
CVPR 2019
DM-GAN: Dynamic Memory Generative Adversarial Networks for Text-To-Image Synthesis
CVPR 2019
Contrastive Adaptation Network for Unsupervised Domain Adaptation
CVPR 2019
Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration
CVPR 2019
Taking a Closer Look at Domain Shift: Category-Level Adversaries for Semantics Consistent Domain Adaptation
CVPR 2019
Joint Discriminative and Generative Learning for Person Re-Identification
CVPR 2019
Searching for a Robust Neural Architecture in Four GPU Hours
CVPR 2019
Invariance Matters: Exemplar Memory for Domain Adaptive Person Re-Identification
CVPR 2019
Learning to Propagate Labels: Transductive Propagation Network for Few-Shot Learning
ICLR 2019
A Semi-Markov Structured Support Vector Machine Model for High-Precision Named Entity Recognition
ACL 2019
Syntax-Infused Variational Autoencoder for Text Generation
ACL 2019
Video Interactive Captioning with Human Prompts
IJCAI 2019
Generalized Majorization-Minimization for Non-Convex Optimization
IJCAI 2019
What You Say and How You Say It Matters: Predicting Stock Volatility Using Verbal and Vocal Cues
ACL 2019
Dialog Intent Induction with Deep Multi-View Clustering
IJCNLP 2019
Network Pruning via Transformable Architecture Search
NIPS 2019
A Robust and Efficient Algorithm for the PnL Problem Using Algebraic Distance to Approximate the Reprojection Distance
AAAI 2019
A Bottom-Up Clustering Approach to Unsupervised Person Re-Identification
AAAI 2019
Cubic LSTMs for Video Prediction
AAAI 2019
Uncertainty Sampling for Action Recognition via Maximizing Expected Average Precision
IJCAI 2018
Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks
IJCAI 2018
A Unified Analysis of Stochastic Momentum Methods for Deep Learning
IJCAI 2018
Deep Adversarial Attention Alignment for Unsupervised Domain Adaptation: the Benefit of Target Expectation Maximization
ECCV 2018
Self-produced Guidance for Weakly-supervised Object Localization
ECCV 2018
Adversarial Complementary Learning for Weakly Supervised Object Localization
CVPR 2018
Style Aggregated Network for Facial Landmark Detection
CVPR 2018
Collective Entity Disambiguation with Structured Gradient Tree Boosting
NAACL 2018
Improve Neural Entity Recognition via Multi-Task Data Selection and Constrained Decoding
NAACL 2018
Robust PCA by Manifold Optimization
JMLR 2018
Macro-Micro Adversarial Network for Human Parsing
ECCV 2018
Convolutional Neural Networks with Recurrent Neural Filters
EMNLP 2018
RCAA: Relational Context-Aware Agents for Person Search
ECCV 2018
Supervision-by-Registration: An Unsupervised Approach to Improve the Precision of Facial Landmark Detectors
CVPR 2018
Beyond Part Models: Person Retrieval with Refined Part Pooling (and A Strong Convolutional Baseline)
ECCV 2018
Exploit the Unknown Gradually: One-Shot Video-Based Person Re-Identification by Stepwise Learning
CVPR 2018
Camera Style Adaptation for Person Re-Identification
CVPR 2018
Compound Memory Networks for Few-shot Video Classification
ECCV 2018
Generalizing A Person Retrieval Model Hetero- and Homogeneously
ECCV 2018
Image-Image Domain Adaptation With Preserved Self-Similarity and Domain-Dissimilarity for Person Re-Identification
CVPR 2018
Occlusion Aware Unsupervised Learning of Optical Flow
CVPR 2018
Watching a Small Portion could be as Good as Watching All: Towards Efficient Video Classification
IJCAI 2018
Complex Event Detection by Identifying Reliable Shots From Untrimmed Videos
ICCV 2017
Learning Discriminative Latent Attributes for Zero-Shot Classification
ICCV 2017
Recursive Spatial Transformer (ReST) for Alignment-Free Face Recognition
ICCV 2017
Unlabeled Samples Generated by GAN Improve the Person Re-Identification Baseline in Vitro
ICCV 2017
Few-Shot Object Recognition From Machine-Labeled Web Images
CVPR 2017
Person Re-Identification in the Wild
CVPR 2017
Bidirectional Multirate Reconstruction for Temporal Modeling in Videos
CVPR 2017
More Is Less: A More Complicated Network With Less Inference Complexity
CVPR 2017
Alibaba at IJCNLP-2017 Task 1: Embedding Grammatical Features into LSTMs for Chinese Grammatical Error Diagnosis Task
IJCNLP 2017
Part-of-Speech Tagging for Historical English
NAACL 2016
They Are Not Equally Reliable: Semantic Event Search Using Differentiated Concept Classifiers
CVPR 2016
Toward Socially-Infused Information Extraction: Embedding Authors, Mentions, and Entities
EMNLP 2016
Attention to Scale: Scale-Aware Semantic Image Segmentation
CVPR 2016
Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks
CVPR 2016
Voice Conversion Based on Matrix Variate Gaussian Mixture Model Using Multiple Frame Features
INTERSPEECH 2016
Hierarchical Recurrent Neural Encoder for Video Representation With Application to Captioning
CVPR 2016
Improving Topic Model Stability for Effective Document Exploration
IJCAI 2016
CNN-RNN: A Unified Framework for Multi-Label Image Classification
CVPR 2016
You Lead, We Exceed: Labor-Free Video Concept Learning by Jointly Exploiting Web Videos and Images
CVPR 2016
A Discriminative CNN Video Representation for Event Detection
CVPR 2015
Complex Event Detection using Semantic Saliency and Nearly-Isotonic SVM
ICML 2015
Efficient Methods for Incorporating Knowledge into Topic Models
EMNLP 2015
Inferring Painting Style with Multi-Task Dictionary Learning
IJCAI 2015
Semantic Concept Discovery for Large-Scale Zero-Shot Event Detection
IJCAI 2015
Scalable Maximum Margin Matrix Factorization by Active Riemannian Subspace Search
IJCAI 2015
Efficient Methods for Inferring Large Sparse Topic Hierarchies
ACL 2015
S-MART: Novel Tree-based Structured Learning Algorithms Applied to Tweet Entity Linking
ACL 2015
S-MART: Novel Tree-based Structured Learning Algorithms Applied to Tweet Entity Linking
IJCNLP 2015
Efficient Methods for Inferring Large Sparse Topic Hierarchies
IJCNLP 2015
WikiQA: A Challenge Dataset for Open-Domain Question Answering
EMNLP 2015
Unsupervised Multi-Domain Adaptation with Feature Embeddings
NAACL 2015
Look and Think Twice: Capturing Top-Down Visual Attention With Feedback Convolutional Neural Networks
ICCV 2015
Learning Like a Child: Fast Novel Visual Concept Learning From Sentence Descriptions of Images
ICCV 2015
Depth-Based Hand Pose Estimation: Data, Methods, and Challenges
ICCV 2015
Learning From Massive Noisy Labeled Data for Image Classification
CVPR 2015
DevNet: A Deep Event Network for Multimedia Event Detection and Evidence Recounting
CVPR 2015
Decomposable Nonlocal Tensor Dictionary Learning for Multispectral Image Denoising
CVPR 2014
Fast Easy Unsupervised Domain Adaptation with Marginalized Structured Dropout
ACL 2014
Event Detection using Multi-Level Relevance Labels and Multiple Features
CVPR 2014
Parsing Occluded People
CVPR 2014
Robust Tensor Clustering with Non-Greedy Maximization
IJCAI 2013
A Log-Linear Model for Unsupervised Text Normalization
EMNLP 2013
Complex Event Detection via Multi-source Video Attributes
CVPR 2013
Harry Potter's Marauder's Map: Localizing and Tracking Multiple Persons-of-Interest by Nonnegative Discretization
CVPR 2013
How Related Exemplars Help Complex Event Detection in Web Videos?
ICCV 2013
Overcoming the Memory Bottleneck in Distributed Training of Latent Variable Models of Text
NAACL 2013
Thinking of Images as What They Are: Compound Matrix Regression for Image Classification
IJCAI 2013
Co-Regularized Ensemble for Feature Selection
IJCAI 2013
Feature Weighting via Optimal Thresholding for Video Analysis
ICCV 2013
Space-Time Robust Representation for Action Recognition
ICCV 2013
Quality-biased Ranking of Short Texts in Microblogging Services
IJCNLP 2011