CoRL 2024

Trajectory Improvement and Reward Learning from Comparative Language Feedback