Lightning Pose: improved animal pose estimation via semi-supervised learning, Bayesian ensembling and cloud-native open-source tools

Dan Biderman; Matthew R Whiteway; Cole Hurwitz; Nicholas Greenspan; Robert S Lee; Ankit Vishnubhotla; Richard Warren; Federico Pedraja; Dillon Noone; Michael M Schartner; Julia M Huntenburg; Anup Khanal; Guido T Meijer; Jean-Paul Noel; Alejandro Pan-Vazquez; Karolina Z Socha; Anne E Urai; International Brain Laboratory; John P Cunningham; Nathaniel B Sawtell; Liam Paninski

doi:10.1038/s41592-024-02319-1

Nat Methods. 2024 Jul;21(7):1316-1328. doi: 10.1038/s41592-024-02319-1. Epub 2024 Jun 25.

Authors

Dan Biderman^#¹, Matthew R Whiteway^#², Cole Hurwitz³, Nicholas Greenspan³, Robert S Lee⁴, Ankit Vishnubhotla³, Richard Warren³, Federico Pedraja³, Dillon Noone³, Michael M Schartner⁵, Julia M Huntenburg⁶, Anup Khanal⁷, Guido T Meijer⁵, Jean-Paul Noel⁸, Alejandro Pan-Vazquez⁹, Karolina Z Socha¹⁰, Anne E Urai¹¹; International Brain Laboratory; John P Cunningham³, Nathaniel B Sawtell³, Liam Paninski³

Collaborators

International Brain Laboratory:
Larry Abbot, Luigi Acerbi, Valeria Aguillon-Rodriguez, Mandana Ahmadi, Jaweria Amjad, Dora Angelaki, Jaime Arlandis, Zoe C Ashwood, Kush Banga, Hailey Barrell, Hannah M Bayer, Brandon Benson, Julius Benson, Jai Bhagat, Dan Birman, Niccolò Bonacchi, Kcenia Bougrova, Julien Boussard, Sebastian A Bruijns, E Kelly Buchanan, Robert Campbell, Matteo Carandini, Joana A Catarino, Fanny Cazettes, Gaelle A Chapuis, Anne K Churchland, Yang Dan, Felicia Davatolhagh, Peter Dayan, Sophie Denève, Eric E J DeWitt, Ling Liang Dong, Tatiana Engel, Michele Fabbri, Mayo Faulkner, Robert Fetcho, Ila Fiete, Charles Findling, Laura Freitas-Silva, Surya Ganguli, Berk Gercek, Naureen Ghani, Ivan Gordeliy, Laura M Haetzel, Kenneth D Harris, Michael Hausser, Naoki Hiratani, Sonja Hofer, Fei Hu, Felix Huber, Cole Hurwitz, Anup Khanal, Christopher S Krasniak, Sanjukta Krishnagopal, Michael Krumin, Debottam Kundu, Agnès Landemard, Christopher Langdon, Christopher Langfield, Inês Laranjeira, Peter Latham, Petrina Lau, Hyun Dong Lee, Ari Liu, Zachary F Mainen, Amalia Makri-Cottington, Hernando Martinez-Vergara, Brenna McMannon, Isaiah McRoberts, Guido T Meijer, Maxwell Melin, Leenoy Meshulam, Kim Miller, Nathaniel J Miska, Catalin Mitelut, Zeinab Mohammadi, Thomas Mrsic-Flogel, Masayoshi Murakami, Jean-Paul Noel, Kai Nylund, Farideh Oloomi, Alejandro Pan-Vazquez, Liam Paninski, Alberto Pezzotta, Samuel Picard, Jonathan W Pillow, Alexandre Pouget, Florian Rau, Cyrille Rossant, Noam Roth, Nicholas A Roy, Kamron Saniee, Rylan Schaeffer, Michael M Schartner, Yanliang Shi, Carolina Soares, Karolina Z Socha, Cristian Soitu, Nicholas A Steinmetz, Karel Svoboda, Marsa Taheri, Charline Tessereau, Anne E Urai, Erdem Varol, Miles J Wells, Steven J West, Matthew R Whiteway, Charles Windolf, Olivier Winter, Ilana Witten, Lauren E Wool, Zekai Xu, Han Yu, Anthony M Zador, Yizi Zhang

Affiliations

¹ Columbia University, New York, NY, USA. db3236@cumc.columbia.edu.
² Columbia University, New York, NY, USA. m.whiteway@columbia.edu.
³ Columbia University, New York, NY, USA.
⁴ Lightning.ai, New York, NY, USA.
⁵ Champalimaud Centre for the Unknown, Lisbon, Portugal.
⁶ Max Planck Institute for Biological Cybernetics, Tübingen, Germany.
⁷ University of California, Los Angeles, Los Angeles, CA, USA.
⁸ New York University, New York, NY, USA.
⁹ Princeton University, Princeton, NJ, USA.
¹⁰ University College London, London, UK.
¹¹ Leiden University, Leiden, the Netherlands.

^# Contributed equally.

PMID: 38918605
DOI: 10.1038/s41592-024-02319-1

Abstract

Contemporary pose estimation methods enable precise measurements of behavior via supervised deep learning with hand-labeled video frames. Although effective in many cases, the supervised approach requires extensive labeling and often produces outputs that are unreliable for downstream analyses. Here, we introduce 'Lightning Pose', an efficient pose estimation package with three algorithmic contributions. First, in addition to training on a few labeled video frames, we use many unlabeled videos and penalize the network whenever its predictions violate motion continuity, multiple-view geometry and posture plausibility (semi-supervised learning). Second, we introduce a network architecture that resolves occlusions by predicting pose on any given frame using surrounding unlabeled frames. Third, we refine the pose predictions post hoc by combining ensembling and Kalman smoothing. Together, these components render pose trajectories more accurate and scientifically usable. We released a cloud application that allows users to label data, train networks and process new videos directly from the browser.
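
As a rough illustration of two ideas named in the abstract, the sketch below computes a motion-continuity penalty on a single keypoint trajectory and a per-frame mean and variance across an ensemble of model predictions. It is a minimal pure-Python sketch, not the authors' implementation: the function names, the hinge-style threshold `epsilon` and the toy trajectories are all assumptions made for illustration, and the Kalman smoothing step is not shown.

```python
import math
from statistics import fmean, pvariance


def temporal_penalty(track, epsilon=5.0):
    """Motion-continuity penalty for one keypoint (hypothetical helper).

    `track` is a list of (x, y) predictions on consecutive frames.
    Frame-to-frame jumps up to `epsilon` pixels are not penalized;
    larger jumps are penalized linearly. The threshold form is an
    assumption for this sketch.
    """
    if len(track) < 2:
        return 0.0
    excess = []
    for (x0, y0), (x1, y1) in zip(track, track[1:]):
        jump = math.hypot(x1 - x0, y1 - y0)  # Euclidean jump in pixels
        excess.append(max(0.0, jump - epsilon))
    return fmean(excess)


def ensemble_mean_var(tracks):
    """Per-frame mean and population variance across ensemble members.

    `tracks` holds one trajectory per model, all of equal length.
    Returns (means, variances), each a list of (x, y) tuples.
    """
    means, variances = [], []
    for frame_preds in zip(*tracks):  # predictions of all models on one frame
        xs = [p[0] for p in frame_preds]
        ys = [p[1] for p in frame_preds]
        means.append((fmean(xs), fmean(ys)))
        variances.append((pvariance(xs), pvariance(ys)))
    return means, variances


# Toy data (made up): a smooth trajectory and one with implausible jumps.
smooth = [(0.0, 0.0), (3.0, 4.0), (6.0, 8.0)]    # 5 px per frame
jumpy = [(0.0, 0.0), (30.0, 40.0), (0.0, 0.0)]   # 50 px per frame

print(temporal_penalty(smooth))  # no jump exceeds epsilon
print(temporal_penalty(jumpy))   # both jumps exceed epsilon

# Two hypothetical ensemble members predicting the same keypoint.
model_a = [(0.0, 0.0), (10.0, 10.0)]
model_b = [(2.0, 0.0), (10.0, 14.0)]
means, variances = ensemble_mean_var([model_a, model_b])
print(means, variances)
```

In a semi-supervised setup, a penalty of this kind would be evaluated on unlabeled video and added to the supervised loss. The per-frame ensemble variance could in turn serve as a frame-specific noise estimate for a Kalman smoother, so that frames where the models disagree are smoothed more heavily.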

MeSH terms

Algorithms*
Animals
Bayes Theorem*
Behavior, Animal
Cloud Computing
Deep Learning
Image Processing, Computer-Assisted / methods
Posture / physiology
Software
Supervised Machine Learning
Video Recording* / methods
