Computer Science Speaking Skills Talk
— 5:00pm
Location:
In Person
-
Reddy Conference Room, Gates Hillman 4405
Speaker:
JIELIN QIU
,
Ph.D. Student, Computer Science Department, Carnegie Mellon University
https://www.cs.cmu.edu/~jielinq/
Multimodal Learning in Alignment, Robustness, and Generalization
In the modern era of data-driven AI technologies, multimodal intelligence has emerged as a powerful paradigm. Multimodal intelligence is artificial intelligence that studies computer agents able to demonstrate intelligence capabilities such as understanding, reasoning, and planning through multimodal experiences and data.
In this talk, we will discuss these questions: (1) How do we explore the inner semantic alignment between different domains? How can the learned alignment help advance multimodal applications? (2) How robust are the multimodal models? How can we improve the models' robustness in real-world applications? (3) How do we generalize the knowledge of one learned domain to another unlearned domain?
Presented in Partial Fulfillment of the CSD Speaking Skills Requirement