Data Science Lab

Multimodal Procedural Planning via Dual Text-Image Prompting

Jan 07, 2025
#Multimodal

Room 408-1, Engineering Building 4, 55 Hanyangdaehak-ro (Sa-dong), Sangnok-gu, Ansan-si, Gyeonggi-do 15588