Data Science Lab

ARES: Alternating Reinforcement Learning and Supervised Fine-Tuning for Enhanced Multi-Modal Chain-of-Thought Reasoning Through Diverse AI Feedback

EMNLP 2024

Mar 31, 2025
#Multi-Modal #Reinforcement Learning

Room 408-1, Engineering Building 4, 55 Hanyangdaehak-ro (Sa-dong), Sangnok-gu, Ansan-si, Gyeonggi-do 15588