PhD Student in Computer Science | University of Southern California
Researching Large Vision-Language Models, Multimodal Learning, and Mechanistic Interpretability
I'm a second-year PhD student at the GLAMOR Lab in the Thomas Lord Computer Science Department at the University of Southern California. I am fortunate to be advised by Professor Jesse Thomason.
My research interests span Large Vision-Language Models (VLMs), multimodal learning, interpretability. I am particularly interested in how VLMs perceive and reason in real-world 3D environments, and in developing interpretability methods to uncover the mechanisms behind why certain VLMs exhibit stronger spatial reasoning abilities, as well as why others struggle with specific spatial tasks. I'm also interested in how Multimodal Large Language Models integrate diverse modalities to form coherent representations that support complex reasoning.
University of Southern California
Advisor: Prof. Jesse Thomason | Focus: Large Vision-Language Models and Multimodal Learning
Tsinghua University
Advisors: Prof. Wenwu Zhu, Prof. Xin Wang
South China University of Technology
I'm always interested in discussing research opportunities, collaborations, or just talking about AI and LLM!
Feel free to reach out via email or connect with me on the platforms below.