Mintong Kang
15 papers · 2022–2025 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+4 more ↓ Show less ↑
π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (18) π§ Keyword Pioneer π Conference Polyglot (5) π Cross-Pollinator (14)
π₯
Mega-Team
(25)
π€
Dynamic Duo
(13)
π
Century Club
(15)
β‘
Prolific Year
(8)
Conferences
ICLR (5)
ICML (4)
NIPS (4)
EMNLP (1)
ICCV (1)
Top co-authors
Keywords
diffusion model
(2)
adversarial robustness
(1)
convex optimization
(1)
toxicity detection
(1)
singular value decomposition
(1)
distributional robustness
(1)
pareto optimality
(1)
distributionally robust optimization
(1)
adversarial attack
(1)
bias mitigation
(1)
gradient backpropagation
(1)
evasion attack
(1)
coalition formation
(1)
non-iid datum
(1)
knowledge removal
(1)
fair generation
(1)
adaptive guidance
(1)
certified fairness
(1)
large language model
(1)
non-iid setting
(1)
Papers
AdvAgent: Controllable Blackbox Red-teaming on Web Agents
ICML 2025
FairGen: Controlling Sensitive Attributes for Fair Generations in Diffusion Models via Adaptive Latent Guidance
EMNLP 2025
FG-OrIU: Towards Better Forgetting via Feature-Gradient Orthogonality for Incremental Unlearning
ICCV 2025
AdvWave: Stealthy Adversarial Jailbreak Attack against Large Audio-Language Models
ICLR 2025
MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models
ICLR 2025
$R^2$-Guard: Robust Reasoning Enabled LLM Guardrail via Knowledge-Enhanced Logical Reasoning
ICLR 2025
EIA: ENVIRONMENTAL INJECTION ATTACK ON GENERALIST WEB AGENTS FOR PRIVACY LEAKAGE
ICLR 2025
ShieldAgent: Shielding Agents via Verifiable Safety Policy Reasoning
ICML 2025
C-RAG: Certified Generation Risks for Retrieval-Augmented Language Models
ICML 2024
Certifiably Byzantine-Robust Federated Conformal Prediction
ICML 2024
COLEP: Certifiably Robust Learning-Reasoning Conformal Prediction via Probabilistic Circuits
ICLR 2024
DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models
NIPS 2023
DiffAttack: Evasion Attacks Against Diffusion-Based Adversarial Purification
NIPS 2023
Certifying Some Distributional Fairness with Subpopulation Decomposition
NIPS 2022
Fairness in Federated Learning via Core-Stability
NIPS 2022