Research Explorer

Iterative Label Refinement Matters More than Preference Optimization under Weak Supervision

Yaowen Ye, Cassidy Laidlaw, Jacob Steinhardt

2025 ICLR

Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning

Yuheng Zhang, Dian Yu, Baolin Peng et al.

2025 ICLR

Iterative Substructure Extraction for Molecular Relational Learning with Interactive Graph Information Bottleneck

Shuai Zhang, Junfeng Fang, Xuqiang Li et al.

2025 ICLR

IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation

Xinchen Zhang, Ling Yang, Guohao Li et al.

2025 ICLR

IterGen: Iterative Semantic-aware Structured LLM Generation with Backtracking

Shubham Ugare, Rohan Gumaste, Tarun Suresh et al.

2025 ICLR

It Helps to Take a Second Opinion: Teaching Smaller LLMs To Deliberate Mutually via Selective Rationale Optimisation

Sohan Patnaik, Milan Aggarwal, Sumit Bhatia et al.

2025 ICLR

IV-mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis

Shitong Shao, Zikai Zhou, Bai LiChen et al.

2025 ICLR

Jailbreak Antidote: Runtime Safety-Utility Balance via Sparse Representation Adjustment in Large Language Models

Guobin Shen, Dongcheng Zhao, Yiting Dong et al.

2025 ICLR

Jailbreaking as a Reward Misspecification Problem

Zhihui Xie, Jiahui Gao, Lei Li et al.

2025 ICLR

Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks

Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion

2025 ICLR

Jamba: Hybrid Transformer-Mamba Language Models

Barak Lenz, Opher Lieber, Alan Arazi et al.

2025 ICLR

JetFormer: An autoregressive generative model of raw images and text

Michael Tschannen, André Susano Pinto, Alexander Kolesnikov

2025 ICLR

Joint Fine-tuning and Conversion of Pretrained Speech and Language Models towards Linear Complexity

Mutian He, Philip N. Garner

2025 ICLR

Joint Gradient Balancing for Data Ordering in Finite-Sum Multi-Objective Optimization

Hansi Yang, James Kwok

2025 ICLR

Joint Graph Rewiring and Feature Denoising via Spectral Resonance

Jonas Linkerhägner, Cheng Shi, Ivan Dokmanić

2025 ICLR

Joint Reward and Policy Learning with Demonstrations and Human Feedback Improves Alignment

Chenliang Li, Siliang Zeng, Zeyi Liao et al.

2025 ICLR

JPEG Inspired Deep Learning

Ahmed H. Salamah, Kaixiang Zheng, Yiwen Liu et al.

2025 ICLR

JudgeBench: A Benchmark for Evaluating LLM-Based Judges

Sijun Tan, Siyuan Zhuang, Kyle Montgomery et al.

2025 ICLR

Judge Decoding: Faster Speculative Sampling Requires Going Beyond Model Alignment

Gregor Bachmann, Sotiris Anagnostidis, Albert Pumarola et al.

2025 ICLR

JudgeLM: Fine-tuned Large Language Models are Scalable Judges

Lianghui Zhu, Xinggang Wang, Xinlong Wang

2025 ICLR

Jump Your Steps: Optimizing Sampling Schedule of Discrete Diffusion Models

Yong-Hyun Park, Chieh-Hsin Lai, Satoshi Hayakawa et al.

2025 ICLR

Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge

Jiayi Ye, Yanbo Wang, Yue Huang et al.

2025 ICLR

KAA: Kolmogorov-Arnold Attention for Enhancing Attentive Graph Neural Networks

Taoran Fang, Tianhong Gao, Chunping Wang et al.

2025 ICLR

KAN: Kolmogorov–Arnold Networks

Ziming Liu, Yixuan Wang, Sachin Vaidya et al.

2025 ICLR

KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models

Fan Wang, Juyong Jiang, Chansung Park et al.

2025 ICLR
