Co-occurring keywords
Papers
Would I Lie To You? Inference Time Alignment of Language Models using Direct Preference Heads
NIPS 2024
OntoFact: Unveiling Fantastic Fact-Skeleton of LLMs via Ontology-Driven Reinforcement Learning
AAAI 2024
Achieving $\tilde{O}(1/\epsilon)$ Sample Complexity for Constrained Markov Decision Process
NIPS 2024
Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use
EMNLP 2024
Reinforcement Learning based Data Augmentation for Noise Robust Speech Emotion Recognition
INTERSPEECH 2024
ROAR: Reinforcing Original to Augmented Data Ratio Dynamics for Wav2vec2.0 Based ASR
INTERSPEECH 2024
Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning
NAACL 2024