Data Science Lab
HOME
RESEARCH
PEOPLE
PUBLICATIONS
SEMINAR
Light
Dark
ARES : Alternating Reinforcement Learning and Supervised Fine-Tuning for Enhanced Multi-Modal Chain-of-Thought Reasoning Through Diverse AI Feedback
EMNLP 2024
Mar 31, 2025
About 1 min
#Multi-Modal
#Reinforcement Learning
MM-Embed: Universal Multimodal Retrieval with Multimodal...
RouteLLM: Learning to Route LLMs with...