First-Author Publications
- New! EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
  Fuxiao Liu*, Min Shi*, Shihao Wang, Shijia Liao, Subhashree Radhakrishnan, De-An Huang, Hongxu Yin, Karan Sapra, Yaser Yacoob, Humphrey Shi, Bryan Catanzaro, Andrew Tao, Jan Kautz, Zhiding Yu*, Guilin Liu*
- New! Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
  Fuxiao Liu*, Kevin Lin, Linjie Li, Jianfeng Wang, Yaser Yacoob, Lijuan Wang
- New! HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
  Fuxiao Liu*, Tianrui Guan*, Xiyang Wu, Ruiqi Xian, Xijun Wang, Zongxia Li, Lichang Chen, Yaser Yacoob, Dinesh Manocha, Tianyi Zhou
- New! MMC: Advancing Multimodal Chart Understanding with Large-scale Instruction Tuning
  Fuxiao Liu*, Xiaoyang Wang, Wenlin Yao, Jianshu Chen, Kaiqiang Song, Sangwoo Cho, Yaser Yacoob, Dong Yu
- DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents
  Fuxiao Liu*, Hao Tan, Chris Tensmeyer
- COVID-VTS: Fact Extraction and Verification on Short Video Platforms
  Fuxiao Liu*, Yaser Yacoob, Abhinav Shrivastava
- Visual News: Benchmark and Challenges in News Image Captioning
  Fuxiao Liu*, Yinghan Wang, Tianlu Wang, Vicente Ordonez