Papers
The Unintended Trade-off of AI Alignment: Balancing Hallucination Mitigation and Safety in LLMs
Omar Mahmoud, Ali Khalil, Thommen George Karimpanal et al.
Think Hard Only When Needed: A Hybrid Best-of-N and Beam Search for Efficient Test-Time Compute
Hyewon Suh, Chaojian Li, Cheng-Jhih Shih et al.
Thinking Beyond the Local: Multi-View Instructed Adaptive Reasoning in KG-Enhanced LLMs
Minghan Zhang, Shu Zhao, Zhen Yang et al.
Thinking Long, but Short: Stable Sequential Test-Time Scaling for Large Reasoning Models
Michael R. Metel, Yufei Cui, Boxing Chen et al.
Think Just Enough: Leveraging Self-Assessed Confidence for Adaptive Reasoning in Language Models
Junyeob Kim, Sang-goo Lee, Taeuk Kim
ThinkNote: Enhancing Knowledge Integration and Utilization of Large Language Models via Constructivist Cognition Modeling
Zhipeng Xu, Zhenghao Liu, Yukun Yan et al.
ThinkPilot: Steering Reasoning Models via Automated Think-prefixes Optimization
Sunzhu Li, Zhiyu Lin, Jiale Zhao et al.
Thunder-NUBench: A Benchmark for LLMs’ Sentence-Level Negation Understanding
Yeonkyoung So, Gyuseong Lee, Sungmok Jung et al.
TimeMachine-bench: A Benchmark for Evaluating Model Capabilities in Repository-Level Migration Tasks
Ryo Fujii, Makoto Morishita, Kazuki Yano et al.
TimeRes: A Turkish Benchmark For Evaluating Temporal Understanding of Large Language Models
Habib Yağız Demir, Ümit Atlamaz, Susan Üsküdarlı
TIPA: Typologically Informed Parameter Aggregation
Stef Accou, Wessel Poelman
Tokenisation of Turkic Copula Constructions in Universal Dependencies
Cagri Coltekin, Furkan Akkurt, Bermet Chontaeva et al.
Tokenizer-Aware Cross-Lingual Adaptation of Decoder-Only LLMs through Embedding Relearning and Swapping
Fan Jiang, Honglin Yu, Grace Y Chung et al.
Token-Level Precise Attack on RAG: Searching for the Best Alternatives to Mislead Generation
Zizhong Li, Haopeng Zhang, Jiawei Zhang
Token Pruning for Improving Graph-Generating State Space Model Performance
Monish Beegamudre, Jack Zheng, Margaret Capetz
Token-Wise Kernels (TWiKers) for Vicinity-Aware Attention in Transformers
Kuangdai Leng, Jia Bi, Samuel Pinilla et al.
To make someone do something: mining alert-style directives in Bulgarian social media for low-resource language modelling
Ruslana Margova, Stanislav Penkov
ToolDreamer: Instilling LLM Reasoning Into Tool Retrievers
Saptarshi Sengupta, Zhengyu Zhou, Jun Araki et al.
Too Long, Didn’t Model: Decomposing LLM Long Context Understanding With Novels
Sil Hamilton, Rebecca Hicke, Mia Ferrante et al.
Too Many Frames, Not All Useful: Efficient Strategies for Long-Form Video QA
Jongwoo Park, Kanchana Ranasinghe, Kumara Kahatapitiya et al.