Feng Yang
35 papers · 2016–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
🏃 Academic Marathon (9) 🌍 Conference Polyglot (9) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (11)
🌈
Renaissance Researcher
(8)
🌍
Conference Polyglot
(9)
🏃
Academic Marathon
(9)
🌟
Keyword Trendsetter Combo
(3)
🤝
Dynamic Duo
(12)
🔥
Unstoppable
(6)
💎
Century Club
(31)
🚀
Conference Pioneer
🗃️
Keyword Collector
(178)
⚡
Prolific Year
(6)
Conferences
CVPR (13)
AAAI (5)
ICCV (5)
INTERSPEECH (4)
ECCV (3)
MICCAI (2)
ACML (1)
NIPS (1)
WACV (1)
Top co-authors
Research topics
Keywords
diffusion model
(4)
deep learning
(3)
text-to-image generation
(3)
image restoration
(3)
image watermarking
(3)
vision-language model
(3)
reward model
(3)
human feedback
(2)
contrastive learning
(2)
image generation
(2)
copyright protection
(2)
speech intelligibility
(2)
knowledge distillation
(2)
parameter efficiency
(2)
model robustness
(1)
object detection
(1)
zero-shot learning
(1)
noise suppression
(1)
causal inference
(1)
domain generalization
(1)
Papers
OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action Model
AAAI 2026
Injection Without Distortion: Geometrically Constrained Knowledge Enhancement for Vision-Language Models
AAAI 2026
Domain-Aware Multi-View Contrastive Representation Learning for Protein Subcellular Localization Prediction
AAAI 2026
Dynamic Geometric Equivariant Network for Full-Atom Antibody Design
AAAI 2026
Calibrated Multi-Preference Optimization for Aligning Diffusion Models
CVPR 2025
Motion Artifact Removal in Pixel-Frequency Domain via Alternate Masks and Diffusion Model
AAAI 2025
MedGCD: Generalized Category Discovery in Medical Imaging
MICCAI 2025
CounterPC: Counterfactual Feature Realignment for Unsupervised Domain Adaptation on Point Clouds
ICCV 2025
3D-GSW: 3D Gaussian Splatting for Robust Watermarking
CVPR 2025
Focus-N-Fix: Region-Aware Fine-Tuning for Text-to-Image Generation
CVPR 2025
Cropper: Vision-Language Model for Image Cropping through In-Context Learning
CVPR 2025
ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling
ECCV 2024
WateRF: Robust Watermarks in Radiance Fields for Protection of Copyrights
CVPR 2024
Rich Human Feedback for Text-to-Image Generation
CVPR 2024
Optical Diffusion Models for Image Generation
NIPS 2024
Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation
ECCV 2024
Decoupled Training for Semi-supervised Medical Image Segmentation with Worst-Case-Aware Learning
MICCAI 2024
SVDiff: Compact Parameter Space for Diffusion Fine-Tuning
ICCV 2023
VILA: Learning Image Aesthetics From User Comments With Vision-Language Pretraining
CVPR 2023
Re-mine, Learn and Reason: Exploring the Cross-modal Semantic Correlations for Language-guided HOI detection
ICCV 2023
MaxViT: Multi-axis Vision Transformer
ECCV 2022
A Self-improving Skin Lesions Diagnosis Framework
Via Pseudo-labeling and Self-distillation
ACML 2022
Deep 3D-to-2D Watermarking: Embedding Messages in 3D Meshes and Extracting Them From 2D Renderings
CVPR 2022
MAXIM: Multi-Axis MLP for Image Processing
CVPR 2022
Adversarially Adaptive Normalization for Single Domain Generalization
CVPR 2021
Rich Features for Perceptual Quality Assessment of UGC Videos
CVPR 2021
COMISR: Compression-Informed Video Super-Resolution
ICCV 2021
MUSIQ: Multi-Scale Image Quality Transformer
ICCV 2021
Perceptual Contributions of Vowels and Consonant-Vowel Transitions in Understanding Time-Compressed Mandarin Sentences
INTERSPEECH 2021
Multi-Path Neural Networks for On-Device Multi-Domain Visual Classification
WACV 2021
GIFnets: Differentiable GIF Encoding Framework
CVPR 2020
Distortion Agnostic Deep Watermarking
CVPR 2020
Acoustic Features Associated with Sustained Vowel and Continuous Speech Productions by Chinese Children with Functional Articulation Disorders
INTERSPEECH 2018
Impaired Categorical Perception of Mandarin Tones and its Relationship to Language Ability in Autism Spectrum Disorders
INTERSPEECH 2016
Assessing Level-Dependent Segmental Contribution to the Intelligibility of Speech Processed by Single-Channel Noise-Suppression Algorithms
INTERSPEECH 2016