Cordelia Schmid

Fellowship
Hans Fischer Senior Fellowship
Appointment
2025
Institution
INRIA
Department
THOTH project-team
Hosts
Prof. Majid Khadiv; Prof. Daniel Cremers
Focus Group
Autonomous Humanoid Workers
Short CV
Cordelia Schmid holds an M.S. degree in Computer Science from the University of Karlsruhe and a Doctorate, also in Computer Science, from the Institut National Polytechnique de Grenoble (INPG). Her doctoral thesis received the best thesis award from INPG in 1996. She received the Habilitation degree in 2001. Dr. Schmid was a post-doctoral research assistant in the Robotics Research Group of Oxford University in 1996-1997. Since 1997 she has held a permanent research position at Inria, where she is a research director.
Dr. Schmid is a member of the German National Academy of Sciences Leopoldina and a fellow of IEEE and the ELLIS society. She was awarded the Longuet-Higgins prize in 2006, 2014 and 2016, the Koenderink prize in 2018 and the Helmholtz prize in 2023, all for fundamental contributions in computer vision that have withstood the test of time. Dr. Schmid has served as an associate editor for IEEE PAMI (2001-2005) and for IJCV (2004-2012), as editor-in-chief of IJCV (2013-2018), as a program chair of IEEE CVPR 2005 and ECCV 2012, and as a general chair of IEEE CVPR 2015, ECCV 2020 and ICCV 2023. Since 2018 she has held a joint appointment with Google Research.
Selected Awards
- 2025, Archimedes Science Award, Dresden
- 2025, ACM Athena Lecturer Award
- 2024, Heinrich-Hertz-Gastprofessur, Karlsruher Institut für Technologie (KIT)
- 2024, European Inventor Award, research category, awarded by the European Patent Office
- 2023, Körber European Science Prize
- 2021, PAMI Distinguished Researcher Award
- 2020, Royal Society Milner Award
- 2016, Grand Prix Inria-Académie des sciences
- 2015, Humboldt Research Award, Alexander von Humboldt Foundation, Germany
- 2013, ERC Advanced Grant ALLEGRO
Research Interests
Computer Vision, Visual Understanding, Robotics, Vision-Language-Guided Robotics, Machine Learning, Artificial Intelligence.
Selected Publications
- Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy. R. Garcia, S. Chen and C. Schmid. IEEE International Conference on Robotics and Automation, 2025.
- ViViDex: Learning Vision-based Dexterous Manipulation from Human Videos. Z. Chen, S. Chen, E. Arlaud, I. Laptev and C. Schmid. IEEE International Conference on Robotics and Automation, 2025.
- Streaming Dense Video Captioning. X. Zhou, A. Arnab, S. Buch, S. Yan, A. Myers, X. Xiong, A. Nagrani and C. Schmid. IEEE Conference on Computer Vision and Pattern Recognition, 2024.
- Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning. A. Yang, A. Nagrani, P.-H. Seo, A. Miech, J. Pont-Tuset, I. Laptev, J. Sivic and C. Schmid. IEEE Conference on Computer Vision and Pattern Recognition, 2023.
- Instruction-Driven History-Aware Policies for Robotic Manipulations. P.-L. Guhur, S. Chen, R. Garcia, M. Tapaswi, I. Laptev and C. Schmid. Conference on Robot Learning, 2022.
- ViViT: A Video Vision Transformer. A. Arnab, M. Dehghani, G. Heigold, C. Sun, M. Lučić and C. Schmid. IEEE International Conference on Computer Vision, 2021.
- Episodic Transformer for Vision-and-Language Navigation. A. Pashevich, C. Schmid and C. Sun. IEEE International Conference on Computer Vision, 2021.
- Multi-modal Transformer for Video Retrieval. V. Gabeur, C. Sun, K. Alahari and C. Schmid. European Conference on Computer Vision, 2020.
- VideoBERT: A Joint Model for Video and Language Representation Learning. C. Sun, A. Myers, C. Vondrick, K. Murphy and C. Schmid. IEEE International Conference on Computer Vision, 2019.
- AVA: A Video Dataset of Spatio-Temporally Localized Atomic Visual Actions. C. Gu, C. Sun, D. Ross, C. Vondrick, C. Pantofaru, Y. Li, S. Vijayanarasimhan, G. Toderici, S. Ricco, R. Sukthankar, C. Schmid and J. Malik. IEEE Conference on Computer Vision and Pattern Recognition, 2018.