Papers
Visual Enhanced Entity-Level Interaction Network for Multimodal Summarization
Haolong Yan, Binghao Tang, Boda Lin et al.
Visual Grounding for User Interfaces
Yijun Qian, Yujie Lu, Alexander Hauptmann et al.
Visual Grounding Helps Learn Word Meanings in Low-Data Regimes
Chengxu Zhuang, Evelina Fedorenko, Jacob Andreas
Visually-Aware Context Modeling for News Image Captioning
Tingyu Qu, Tinne Tuytelaars, Marie-Francine Moens
Visually Guided Generative Text-Layout Pre-training for Document Intelligence
Zhiming Mao, Haoli Bai, Lu Hou et al.
VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for Vietnamese Natural Language Understanding
Phong Nguyen-Thuan Do, Son Quoc Tran, Phu Gia Hoang et al.
Volcano: Mitigating Multimodal Hallucination through Self-Feedback Guided Revision
Seongyun Lee, Sue Hyun Park, Yongrae Jo et al.
VOLIMET: A Parallel Corpus of Literal and Metaphorical Verb-Object Pairs for English–German and English–French
Prisca Piccirilli, Alexander Fraser, Sabine Schulte im Walde
VOLTA: Improving Generative Diversity by Variational Mutual Information Maximizing Autoencoder
Yueen Ma, DaFeng Chi, Jingjing Li et al.
WangLab at MEDIQA-CORR 2024: Optimized LLM-based Programs for Medical Error Detection and Correction
Augustin Toma, Ronald Xie, Steven Palayew et al.
WangLab at MEDIQA-M3G 2024: Multimodal Medical Answer Generation using Large Language Models
Ronald Xie, Steven Palayew, Augustin Toma et al.
WaterJudge: Quality-Detection Trade-off when Watermarking Large Language Models
Piotr Molenda, Adian Liusie, Mark Gales
Wav2pos: Exploring syntactic analysis from audio for Highland Puebla Nahuatl
Robert Pugh, Varun Sreedhar, Francis Tyers
WebWISE: Unlocking Web Interface Control for LLMs via Sequential Exploration
Heyi Tao, Sethuraman T V, Michal Shlapentokh-Rothman et al.
Weighted Layer Averaging RoBERTa for Black-Box Machine-Generated Text Detection
Ayan Datta, Aryan Chandramania, Radhika Mamidi
Weight-Inherited Distillation for Task-Agnostic BERT Compression
Taiqiang Wu, Cheng Hou, Shanshan Lao et al.
Werkzeug at SemEval-2024 Task 8: LLM-Generated Text Detection via Gated Mixture-of-Experts Fine-Tuning
Youlin Wu, Kaichun Wang, Kai Ma et al.
What Are We Measuring When We Evaluate Large Vision-Language Models? An Analysis of Latent Factors and Biases
Anthony Tiong, Junqi Zhao, Boyang Li et al.
What Causes the Failure of Explicit to Implicit Discourse Relation Recognition?
Wei Liu, Stephen Wan, Michael Strube
whatdoyoumeme at SemEval-2024 Task 4: Hierarchical-Label-Aware Persuasion Detection using Translated Texts
Nishan Chatterjee, Marko Pranjic, Boshko Koloski et al.
What Drives Performance in Multilingual Language Models?
Sina Bagheri Nezhad, Ameeta Agrawal
What explains the success of cross-modal fine-tuning with ORCA?
Paloma García-de-Herreros, Vagrant Gautam, Philipp Slusallek et al.
What if you said that differently?: How Explanation Formats Affect Human Feedback Efficacy and User Perception
Chaitanya Malaviya, Subin Lee, Dan Roth et al.
What Makes Math Word Problems Challenging for LLMs?
Kv Aditya Srivatsa, Ekaterina Kochmar