Shiqian Su

I am a first-year Ph.D. student in the Department of Electronic Engineering at Tsinghua University, where I am fortunate to be advised by Prof. Jifeng Dai. I am also doing research intern at SenseTime Research since 2024.

I completed my bachelor’s degree in the Department of Physics at Tsinghua in 2024. Additionally,I was an exchange student at the Department of Information Technology and Electrical Engineering at ETH Zürich in 2023.

My research focuses on multi-modal models and embodied intelligence within the realms of deep learning and computer vision. This field continuously presents new challenges and opportunities for innovation, and I am excited to contribute to the community’s efforts in realizing Artificial General Intelligence (AGI).

Feel free to explore our team and works at: https://fundamentalvision.github.io/

I believe AI has the potential to revolutionize how people interact with one another and the world around them. The future is full of possibilities, and I am enthusiastic about exploring startup opportunities. If you are interested in this field, don’t hesitate to get in touch!

News


Selected Publications

HoVLE

HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding

Tao, C.*, Su, S.*, Zhu, X.*, Zhang, C., Chen, Z., Liu, J., ... & Dai, J.
CVPR 2025 / Paper / Model

Data Scaling Laws

Learning 1D Causal Visual Representation with De-focus Attention Networks

Tao, C.*, Zhu, X.*, Su, S.*, Lu, L., Tian, C., Luo, X., ... & Dai, J.
NeurIPS 2024 / Paper / Code


Honors and Awards