Xinxiao Wu received her Ph.D. from the school of computer science, Beijing Institute of Technology in July 2010. From August 2010 to October 2011, She worked as a Post-PhD student research fellow in Nanyang Technological University, Singapore. She joined the School of Computer Science, Beijing Institute of Technology in 2012. She is currently a Professor. She has obtained Excellent PhD studental Dissertation Award from the Chinese Association for Artificial Intelligence. She has published more than 60 papers in top conferences and journals on computer vision and artificial intelligence: ICCV, CVPR, ECCV, NeurIPS, AAAI, IJCAI, ACM MM, IJCV, IEEE TIP, IEEE TMM, IEEE TNNLS, IEEE TCSVT, IEEE TCYB. Her research work has been supported by many research grants as principal investigator, which includes the National Natural Science Foundation (NSFC), the Ministry of Education PhD studental Fund, and many school-enterprise projects, etc. She also servers on the editorial boards of IEEE Transactions on Multimedia. Her current research interests include computer vision, multi-modal reasoning, multi-modal agent and machine learning.