conftrace_

Papers

Mathematical Proof as a Litmus Test: Revealing Failure Modes of Advanced Large Reasoning Models ACL 2026 CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents ACL 2026 Learning Diverse Responses with Prefix-Conditioned Supervised Fine-Tuning ACL 2026 CiteGuard: Faithful Citation Attribution for LLMs via Retrieval-Augmented Validation ACL 2026 TAMP: Token-Adaptive Layerwise Pruning in Multimodal Large Language Models ACL 2025 Unveiling the Lack of LVLM Robustness to Fundamental Visual Variations: Why and Path Forward ACL 2025 The Law of Knowledge Overshadowing: Towards Understanding, Predicting and Preventing LLM Hallucination ACL 2025 The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination ACL 2025 MAC-Tuning: LLM Multi-Compositional Problem Reasoning with Enhanced Knowledge Boundary Awareness EMNLP 2025 Let’s Reason Formally: Natural-Formal Hybrid Reasoning Enhances LLM’s Math Capability EMNLP 2025 End-to-End Optimization for Multimodal Retrieval-Augmented Generation via Reward Backpropagation EMNLP 2025 MedEBench: Diagnosing Reliability in Text-Guided Medical Image Editing EMNLP 2025 CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering NAACL 2025 Aligning LLMs with Individual Preferences via Interaction COLING 2025 VLM2-Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues ACL 2025 MMBoundary: Advancing MLLM Knowledge Boundary Awareness through Reasoning Step Confidence Calibration ACL 2025 Self-Correction is More than Refinement: A Learning Framework for Visual and Language Reasoning Tasks ACL 2025 ADEPT: A DEbiasing PrompT Framework AAAI 2023 A Zero-Shot Claim Detection Framework Using Question Answering COLING 2022