A machine learning framework to analyze hyperspectral stimulated Raman scattering microscopy images of expressed human meibum

Alba Alfonso-García; Jerry Paugh; Marjan Farid; Sumit Garg; James V Jester; Eric O Potma

doi:10.1002/jrs.5118

A machine learning framework to analyze hyperspectral stimulated Raman scattering microscopy images of expressed human meibum

J Raman Spectrosc. 2017 Jun;48(6):803-812. doi: 10.1002/jrs.5118. Epub 2017 Apr 11.

Authors

Alba Alfonso-García^{1

2}, Jerry Paugh³, Marjan Farid⁴, Sumit Garg⁴, James V Jester^{1

4}, Eric O Potma²

Affiliations

¹ Department of Biomedical Engineering, University of California, Irvine.
² Department of Chemistry, University of California, Irvine.
³ Southern California College of Optometry at Marshall B. Ketchum University, Fullerton.
⁴ Gavin Herbert Eye Institute, University of California, Irvine.

Abstract

We develop and discuss a methodology for batch-level analysis of hyperspectral stimulated Raman scattering (hsSRS) data sets of human meibum in the CH-stretching vibrational range. The analysis consists of two steps. The first step uses a training set (n=19) to determine chemically meaningful reference spectra that jointly constitute a basis set for the sample. This procedure makes use of batch-level vertex component analysis (VCA), followed by unsupervised k-means clustering to express the data set in terms of spectra that represent lipid and protein mixtures in changing proportions. The second step uses a random forest classifier to rapidly classify hsSRS stacks in terms of the pre-determined basis set. The overall procedure allows a rapid quantitative analysis of large hsSRS data sets, enabling a direct comparison among samples using a single set of reference spectra. We apply this procedure to assess 50 specimens of expressed human meibum, rich in both protein and lipid, and show that the batch-level analysis reveals marked variation among samples that potentially correlate with meibum health quality.

Keywords: human meibum; hyperspectral stimulated Raman scattering microscopy; machine learning; multi-image analysis.

Abstract

Grants and funding