Papers
Thinking with DistilQwen: A Tale of Four Distilled Reasoning and Reward Model Series
Wenrui Cai, Chengyu Wang, Junbing Yan et al.
Think in Safety: Unveiling and Mitigating Safety Alignment Collapse in Multimodal Large Reasoning Model
Xinyue Lou, You Li, Jinan Xu et al.
ThinkQE: Query Expansion via an Evolving Thinking Process
Yibin Lei, Tao Shen, Andrew Yates
Think Right, Not More: Test-Time Scaling for Numerical Claim Verification
Primakov Chungkham, Venktesh V, Vinay Setty et al.
Think-Search-Patch: A Retrieval-Augmented Reasoning Framework for Repository-Level Code Repair
Bojian Xiong, Yikun Lei, Xikai Liu et al.
ThinkSLM: Towards Reasoning in Small Language Models
Gaurav Srivastava, Shuxiang Cao, Xuan Wang
ThinkSwitcher: When to Think Hard, When to Think Fast
Guosheng Liang, Longguang Zhong, Ziyi Yang et al.
ThinkTuning: Instilling Cognitive Reflections without Distillation
Aswin Rrv, Jacob Dineen, Divij Handa et al.
Think Twice, Generate Once: Safeguarding by Progressive Self-Reflection
Hoang Phan, Victor Li, Qi Lei
Think, Verbalize, then Speak: Bridging Complex Thoughts and Comprehensible Speech
Tony Woo, Sehun Lee, Kang-wook Kim et al.
Think Wider, Detect Sharper: Reinforced Reference Coverage for Document-Level Self-Contradiction Detection
Yuhao Chen, Yuanjie Lyu, Shuochen Liu et al.
Third-Person Appraisal Agent: Simulating Human Emotional Reasoning in Text with Large Language Models
Simin Hong, Jun Sun, Hongyang Chen
This is not a Disimprovement: Improving Negation Reasoning in Large Language Models via Prompt Engineering
Joshua Jose Dias Barreto, Abhik Jana
Thought calibration: Efficient and confident test-time scaling
Menghua Wu, Cai Zhou, Stephen Bates et al.
Thread: A Logic-Based Data Organization Paradigm for How-To Question Answering with Retrieval Augmented Generation
Kaikai An, Fangkai Yang, Liqun Li et al.
Threading the Needle: Reweaving Chain-of-Thought Reasoning to Explain Human Label Variation
Beiduo Chen, Yang Janet Liu, Anna Korhonen et al.
Through the Valley: Path to Effective Long CoT Training for Small Language Models
Renjie Luo, Jiaxi Li, Chen Huang et al.
Thunder-DeID: Accurate and Efficient De-identification Framework for Korean Court Judgments
Sungeun Hahm, Heejin Kim, Gyuseong Lee et al.
TIDES: Technical Information Discovery and Extraction System
Jihee Kim, Subeen Park, Hakyung Lee et al.
Time Is Effort: Estimating Human Post-Editing Time for Grammar Error Correction Tool Evaluation
Ankit Vadehra, Bill Johnson, Gene Saunders et al.
Time to Revisit Exact Match
Auss Abbood, Zaiqiao Meng, Nigel Collier
Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games
Niv Eckhaus, Uri Berger, Gabriel Stanovsky
Tiny Budgets, Big Gains: Parameter Placement Strategy in Parameter Super-Efficient Fine-Tuning
Jinman Zhao, Xueyan Zhang, Jiaru Li et al.
TinyScientist: An Interactive, Extensible, and Controllable Framework for Building Research Agents
Haofei Yu, Keyang Xuan, Fenghai Li et al.