Computer Science Speaking Skills Talk February 27, 2024 4:00pm — 5:00pm Location: In Person - Reddy Conference Room, Gates Hillman 4405 Speaker: JIELIN QIU , Ph.D. Student, Computer Science Department, Carnegie Mellon University https://www.cs.cmu.edu/~jielinq/ Multimodal Learning in Alignment, Robustness, and Generalization In the modern era of data-driven AI technologies, multimodal intelligence has emerged as a powerful paradigm. Multimodal intelligence is artificial intelligence that studies computer agents able to demonstrate intelligence capabilities such as understanding, reasoning, and planning through multimodal experiences and data. In this talk, we will discuss these questions: (1) How do we explore the inner semantic alignment between different domains? How can the learned alignment help advance multimodal applications? (2) How robust are the multimodal models? How can we improve the models' robustness in real-world applications? (3) How do we generalize the knowledge of one learned domain to another unlearned domain? Presented in Partial Fulfillment of the CSD Speaking Skills Requirement Add event to Google Add event to iCal