Cordelia Schmid

Fellowship
Hans Fischer Senior Fellowship

Appointment
2025

Institution
Inria

Hosts
Prof. Majid Khadiv; Prof. Daniel Cremers

Focus Group
Autonomous Humanoid Workers

Short CV

Cordelia Schmid holds an M.S. degree in Computer Science from the University of Karlsruhe and a Doctorate, also in Computer Science, from the Institut National Polytechnique de Grenoble (INPG). Her doctoral thesis received the best thesis award from INPG in 1996. She received the Habilitation degree in 2001. Dr. Schmid was a post-doctoral research assistant in the Robotics Research Group of Oxford University in 1996-1997. Since 1997 she has held a permanent research position at Inria, where she is a research director.

Dr. Schmid is a member of the German National Academy of Sciences, Leopoldina, and a fellow of IEEE and the ELLIS society. She was awarded the Longuet-Higgins prize in 2006, 2014 and 2016, the Koenderink prize in 2018 and the Helmholtz prize in 2023, all for fundamental contributions in computer vision that have withstood the test of time. Dr. Schmid has been an associate editor for IEEE PAMI (2001-2005) and for IJCV (2004-2012), editor-in-chief of IJCV (2013-2018), a program chair of IEEE CVPR 2005 and ECCV 2012, and a general chair of IEEE CVPR 2015, ECCV 2020 and ICCV 2023. Since 2018 she has held a joint appointment with Google Research.

Selected Awards

2025, Archimedes Science Award, Dresden
2025, ACM Athena Lecturer Award
2024, Heinrich-Hertz-Gastprofessur, Karlsruher Institut für Technologie (KIT)
2024, European Inventor Award, Research category, awarded by the European Patent Office
2023, Körber European Science Prize
2021, PAMI Distinguished Researcher Award
2020, Royal Society Milner Award
2016, Grand Prix Inria-Académie des sciences
2015, Humboldt Research Award, Alexander von Humboldt Foundation, Germany
2013, ERC Advanced Grant ALLEGRO

Research Interests

Computer Vision, Visual Understanding, Robotics, Vision-Language-Guided Robotics, Machine Learning, Artificial Intelligence.

Selected Publications

Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy. R. Garcia, S. Chen and C. Schmid. IEEE International Conference on Robotics and Automation, 2025.
ViViDex: Learning Vision-based Dexterous Manipulation from Human Videos. Z. Chen, S. Chen, E. Arlaud, I. Laptev and C. Schmid. IEEE International Conference on Robotics and Automation, 2025.
Streaming Dense Video Captioning. X. Zhou, A. Arnab, S. Buch, S. Yan, A. Myers, X. Xiong, A. Nagrani and C. Schmid. IEEE Conference on Computer Vision and Pattern Recognition, 2024.
Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning. A. Yang, A. Nagrani, P.-H. Seo, A. Miech, J. Pont-Tuset, I. Laptev, J. Sivic and C. Schmid. IEEE Conference on Computer Vision and Pattern Recognition, 2023.
Instruction-Driven History-Aware Policies for Robotic Manipulations. P.-L. Guhur, S. Chen, R. Garcia, M. Tapaswi, I. Laptev and C. Schmid. Conference on Robot Learning, 2022.
ViViT: A Video Vision Transformer. A. Arnab, M. Dehghani, G. Heigold, C. Sun, M. Lučić and C. Schmid. IEEE International Conference on Computer Vision, 2021.
Episodic Transformer for Vision-and-Language Navigation. A. Pashevich, C. Schmid and C. Sun. IEEE International Conference on Computer Vision, 2021.
Multi-modal Transformer for Video Retrieval. V. Gabeur, C. Sun, K. Alahari and C. Schmid. European Conference on Computer Vision, 2020.
VideoBERT: A Joint Model for Video and Language Representation Learning. C. Sun, A. Myers, C. Vondrick, K. Murphy and C. Schmid. IEEE International Conference on Computer Vision, 2019.
AVA: A Video Dataset of Spatio-Temporally Localized Atomic Visual Actions. C. Gu, C. Sun, D. Ross, C. Vondrick, C. Pantofaru, Y. Li, S. Vijayanarasimhan, G. Toderici, S. Ricco, R. Sukthankar, C. Schmid and J. Malik. IEEE Conference on Computer Vision and Pattern Recognition, 2018.