Wenyan Li
Multimodal
The Role of Data Curation in Image Captioning
Image captioning models are typically trained by treating all samples equally, neglecting to account for mismatched or otherwise …
Wenyan Li, Jonas F Lotz, Chen Qiu, Desmond Elliott
MAP: Low-data Regime Multimodal Learning with Adapter-based Pre-training and Prompting
Pretrained vision-language (VL) models have shown impressive results on various multi-modal downstream tasks recently. Many of the …
Wenyan Li, Dong Li, Wanjing Li, Yuanjie Wang, Hai Jie, Yiran Zhong