Papers
WIST: Web-Grounded Iterative Self-Play Tree for Domain-Targeted Reasoning Improvement
Fangyuan Li, Pengfei Li, Shijie Wang et al.
Would LLMs be Good Historical Linguists and Chinese Dialect Learners?
Yicheng Liu, Shumin Shi, Youchao Zhou et al.
Writing-RL: Advancing Long-form Writing via Adaptive Curriculum Reinforcement Learning
Xuanyu Lei, Chenliang Li, Yuning Wu et al.
WSDPO: A Generative Word Sense Disambiguation Framework with Chain-of-Thought and Preference Optimization
Kunpeng Kang, Shuaimin Li, Kaiyuan Zhang et al.
XMark: Reliable Multi-Bit Watermarking for LLM-Generated Texts
Jiahao Xu, Rui Hu, Olivera Kotevska et al.
XOXO: Stealthy Cross-Origin Context Poisoning Attacks against AI Coding Assistants
Adam Štorek, Mukur Gupta, Noopur Bhatt et al.
XToM: Exploring the Multilingual Theory of Mind for Large Language Models
Chunkit Chan, Yauwai Yim, Hongchuan Zeng et al.
XtraGPT: Context-Aware and Controllable Academic Paper Revision via Human-AI Collaboration
Nuo Chen, Andre Lin HuiKai, Jiaying Wu et al.
XY-Tokenizer: Mitigating the Semantic-Acoustic Conflict in Low-Bitrate Speech Codecs
Yitian Gong, Luozhijie Jin, Kuangwei Chen et al.
YIELD: A Large-Scale Dataset and Evaluation Framework for Information Elicitation Agents
Victor De Lima, Grace Hui Yang
You Can Have a Second Chance: Unbiased and Multi-bit Watermarking for Diffusion Language Models with Regret-based Remasking
Ke Yang, Dongyang Liang, Jing Yu et al.
Your Inference Request Will Become a Black Box: Confidential Inference for Cloud-based Large Language Models
Chung-ju Huang, Huiqiang Zhao, Yuanpeng He et al.
Your Reasoning Benchmark May Not Test Reasoning: Revealing Perception Bottleneck in Abstract Reasoning Benchmarks
Xinhe Wang, Jin Huang, Xingjian Zhang et al.
Your Reasoning Model is Secretly a Reward Model - Optimization-Free Verification from Experience
Zhenwen Liang, Ruosen Li, Yujun Zhou et al.
Your Reasoning Model Knows What Counts: Self-Guided Chain-of-Thought Pruning for Efficient Reasoning
Zi-Ao Ma, Xian-Ling Mao, Tian Lan et al.
Your Students Don’t Use LLMs Like You Wish They Did
Sebastian Kobler, Matthew Clemson, Angela Sun et al.
Z3D: Zero-Shot 3D Visual Grounding from Images
Nikita Drozdov, Andrey Lemeshko, Nikita Gavrilov et al.
ZARA: Training-Free Motion Time-Series Reasoning via Evidence-Grounded LLM Agents
Zechen Li, Baiyu Chen, Hao Xue et al.
Zero-Shot Detection of LLM-Generated Text using Temperature Sensitivity
Shixuan Ma, Jiahao Li, Zhendong Mao et al.
Zero-shot Jianzi Recognition as Structured Visual Information Extraction in Open Compositional Symbolic Systems
Zehan Li, Fu Zhang, Zhijun Liu et al.
Zero-shot Large Language Models for Automatic Readability Assessment
Riley Grossman, Yi Chen
Zero-Shot Multimodal Retrieval with Multi-Scale Contextual Representations
Sourajit Saha, Tejas Gokhale
ZoomR: Memory Efficient Reasoning through Multi-Granularity Key Value Retrieval
David H. Yang, Yuxuan Zhu, Mohammad Mohammadi Amiri et al.
100-LongBench: Are de facto Long-Context Benchmarks Literally Evaluating Long-Context Ability?
Van Yang, Hongye Jin, Shaochen Zhong et al.
111DUT at SemEval-2025 Task 8: Hierarchical Chain-of-Thought Reasoning and Multi-Model Deliberation for Robust TableQA
Jiaqi Yao, Erchen Yu, Yicen Tian et al.