Keypoint-MoSeq: parsing behavior by linking point tracking to pose dynamics

Caleb Weinreb; Jonah E Pearl; Sherry Lin; Mohammed Abdal Monium Osman; Libby Zhang; Sidharth Annapragada; Eli Conlin; Red Hoffmann; Sofia Makowska; Winthrop F Gillis; Maya Jay; Shaokai Ye; Alexander Mathis; Mackenzie W Mathis; Talmo Pereira; Scott W Linderman; Sandeep Robert Datta

doi:10.1038/s41592-024-02318-2

Keypoint-MoSeq: parsing behavior by linking point tracking to pose dynamics

Nat Methods. 2024 Jul;21(7):1329-1339. doi: 10.1038/s41592-024-02318-2. Epub 2024 Jul 12.

Authors

Caleb Weinreb¹, Jonah E Pearl¹, Sherry Lin¹, Mohammed Abdal Monium Osman¹, Libby Zhang^{2

3}, Sidharth Annapragada¹, Eli Conlin¹, Red Hoffmann¹, Sofia Makowska¹, Winthrop F Gillis¹, Maya Jay¹, Shaokai Ye⁴, Alexander Mathis⁴, Mackenzie W Mathis⁴, Talmo Pereira⁵, Scott W Linderman^{6

7}, Sandeep Robert Datta⁸

Affiliations

¹ Department of Neurobiology, Harvard Medical School, Boston, MA, USA.
² Department of Electrical Engineering, Stanford University, Stanford, CA, USA.
³ Wu Tsai Neurosciences Institute, Stanford University, Stanford, CA, USA.
⁴ Brain Mind and Neuro-X Institute, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland.
⁵ Salk Institute for Biological Studies, La Jolla, CA, USA.
⁶ Wu Tsai Neurosciences Institute, Stanford University, Stanford, CA, USA. scott.linderman@stanford.edu.
⁷ Department of Statistics, Stanford University, Stanford, CA, USA. scott.linderman@stanford.edu.
⁸ Department of Neurobiology, Harvard Medical School, Boston, MA, USA. srdatta@hms.harvard.edu.

Abstract

Keypoint tracking algorithms can flexibly quantify animal movement from videos obtained in a wide variety of settings. However, it remains unclear how to parse continuous keypoint data into discrete actions. This challenge is particularly acute because keypoint data are susceptible to high-frequency jitter that clustering algorithms can mistake for transitions between actions. Here we present keypoint-MoSeq, a machine learning-based platform for identifying behavioral modules ('syllables') from keypoint data without human supervision. Keypoint-MoSeq uses a generative model to distinguish keypoint noise from behavior, enabling it to identify syllables whose boundaries correspond to natural sub-second discontinuities in pose dynamics. Keypoint-MoSeq outperforms commonly used alternative clustering methods at identifying these transitions, at capturing correlations between neural activity and behavior and at classifying either solitary or social behaviors in accordance with human annotations. Keypoint-MoSeq also works in multiple species and generalizes beyond the syllable timescale, identifying fast sniff-aligned movements in mice and a spectrum of oscillatory behaviors in fruit flies. Keypoint-MoSeq, therefore, renders accessible the modular structure of behavior through standard video recordings.

MeSH terms

Algorithms*
Animals
Behavior, Animal* / physiology
Drosophila melanogaster / physiology
Humans
Machine Learning*
Male
Mice
Movement / physiology
Video Recording* / methods

Abstract

MeSH terms

Grants and funding