TY - JOUR
T1 - Development and Validation of a Respiratory-Responsive Vocal Biomarker–Based Tool for Generalizable Detection of Respiratory Impairment
T2 - Independent Case-Control Studies in Multiple Respiratory Conditions including Asthma, Chronic Obstructive Pulmonary Disease, and COVID-19
AU - Kaur, Savneet
AU - Larsen, Erik
AU - Harper, James
AU - Purandare, Bharat
AU - Uluer, Ahmet
AU - Hasdianda, Mohammad Adrian
AU - Umale, Nikita Arun
AU - Killeen, James
AU - Castillo, Edward
AU - Jariwala, Sunit
N1 - Funding Information:
SK, BP, AU, MAH, NAU, JK, and EC have no competing interests as defined by Nature Research or other interests that might be perceived to influence the interpretation of the article. EL is employed by Sonde Health Inc and holds stock options in Sonde Health Inc. JH is a current employee of Sonde Health Inc (founder and Chief Operating Officer) who receives both cash and stock options from the company as part of his compensation package. SJ has received grant support from the National Institutes of Health, Agency for Healthcare Research and Quality, Stony Wold-Herbert Fund, Patient-Centered Outcomes Research Institute, American Lung Association, Price Family Fund, Genentech, Astra Zeneca, Sonde Health, and Einstein Clinical and Translational Science Award and National Center for Advancing Translational Sciences and has served as a consultant and member of a scientific advisory board for Teva and Sanofi.
Publisher Copyright:
©Savneet Kaur, Erik Larsen, James Harper, Bharat Purandare, Ahmet Uluer, Mohammad Adrian Hasdianda, Nikita Arun Umale, James Killeen, Edward Castillo, Sunit Jariwala.
PY - 2023
Y1 - 2023
N2 - Background: Vocal biomarker–based machine learning approaches have shown promising results in the detection of various health conditions, including respiratory diseases, such as asthma. Objective: This study aimed to determine whether a respiratory-responsive vocal biomarker (RRVB) model platform initially trained on an asthma and healthy volunteer (HV) data set can differentiate patients with active COVID-19 infection from asymptomatic HVs by assessing its sensitivity, specificity, and odds ratio (OR). Methods: A logistic regression model using a weighted sum of voice acoustic features was previously trained and validated on a data set of approximately 1700 patients with a confirmed asthma diagnosis and a similar number of healthy controls. The same model has shown generalizability to patients with chronic obstructive pulmonary disease, interstitial lung disease, and cough. In this study, 497 participants (female: n=268, 53.9%; <65 years old: n=467, 94%; Marathi speakers: n=253, 50.9%; English speakers: n=223, 44.9%; Spanish speakers: n=25, 5%) were enrolled across 4 clinical sites in the United States and India and provided voice samples and symptom reports on their personal smartphones. The participants included patients who are symptomatic COVID-19 positive and negative as well as asymptomatic HVs. The RRVB model performance was assessed by comparing it with the clinical diagnosis of COVID-19 confirmed by reverse transcriptase–polymerase chain reaction. Results: The ability of the RRVB model to differentiate patients with respiratory conditions from healthy controls was previously demonstrated on validation data in asthma, chronic obstructive pulmonary disease, interstitial lung disease, and cough, with ORs of 4.3, 9.1, 3.1, and 3.9, respectively. The same RRVB model in this study in COVID-19 performed with a sensitivity of 73.2%, specificity of 62.9%, and OR of 4.64 (P<.001). Patients who experienced respiratory symptoms were detected more frequently than those who did not experience respiratory symptoms and completely asymptomatic patients (sensitivity: 78.4% vs 67.4% vs 68%, respectively). Conclusions: The RRVB model has shown good generalizability across respiratory conditions, geographies, and languages. Results using data set of patients with COVID-19 demonstrate its meaningful potential to serve as a prescreening tool for identifying individuals at risk for COVID-19 infection in combination with temperature and symptom reports. Although not a COVID-19 test, these results suggest that the RRVB model can encourage targeted testing. Moreover, the generalizability of this model for detecting respiratory symptoms across different linguistic and geographic contexts suggests a potential path for the development and validation of voice-based tools for broader disease surveillance and monitoring applications in the future.
AB - Background: Vocal biomarker–based machine learning approaches have shown promising results in the detection of various health conditions, including respiratory diseases, such as asthma. Objective: This study aimed to determine whether a respiratory-responsive vocal biomarker (RRVB) model platform initially trained on an asthma and healthy volunteer (HV) data set can differentiate patients with active COVID-19 infection from asymptomatic HVs by assessing its sensitivity, specificity, and odds ratio (OR). Methods: A logistic regression model using a weighted sum of voice acoustic features was previously trained and validated on a data set of approximately 1700 patients with a confirmed asthma diagnosis and a similar number of healthy controls. The same model has shown generalizability to patients with chronic obstructive pulmonary disease, interstitial lung disease, and cough. In this study, 497 participants (female: n=268, 53.9%; <65 years old: n=467, 94%; Marathi speakers: n=253, 50.9%; English speakers: n=223, 44.9%; Spanish speakers: n=25, 5%) were enrolled across 4 clinical sites in the United States and India and provided voice samples and symptom reports on their personal smartphones. The participants included patients who are symptomatic COVID-19 positive and negative as well as asymptomatic HVs. The RRVB model performance was assessed by comparing it with the clinical diagnosis of COVID-19 confirmed by reverse transcriptase–polymerase chain reaction. Results: The ability of the RRVB model to differentiate patients with respiratory conditions from healthy controls was previously demonstrated on validation data in asthma, chronic obstructive pulmonary disease, interstitial lung disease, and cough, with ORs of 4.3, 9.1, 3.1, and 3.9, respectively. The same RRVB model in this study in COVID-19 performed with a sensitivity of 73.2%, specificity of 62.9%, and OR of 4.64 (P<.001). Patients who experienced respiratory symptoms were detected more frequently than those who did not experience respiratory symptoms and completely asymptomatic patients (sensitivity: 78.4% vs 67.4% vs 68%, respectively). Conclusions: The RRVB model has shown good generalizability across respiratory conditions, geographies, and languages. Results using data set of patients with COVID-19 demonstrate its meaningful potential to serve as a prescreening tool for identifying individuals at risk for COVID-19 infection in combination with temperature and symptom reports. Although not a COVID-19 test, these results suggest that the RRVB model can encourage targeted testing. Moreover, the generalizability of this model for detecting respiratory symptoms across different linguistic and geographic contexts suggests a potential path for the development and validation of voice-based tools for broader disease surveillance and monitoring applications in the future.
KW - COVID-19
KW - RRVB
KW - artificial intelligence
KW - asthma
KW - eHealth
KW - mHealth
KW - machine learning
KW - mobile health
KW - mobile phone
KW - respiratory
KW - respiratory symptom
KW - respiratory-responsive vocal biomarker
KW - smartphones
KW - sound
KW - speech
KW - vocal
KW - vocal biomarkers
KW - voice
UR - http://www.scopus.com/inward/record.url?scp=85152624608&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85152624608&partnerID=8YFLogxK
U2 - 10.2196/44410
DO - 10.2196/44410
M3 - Article
C2 - 36881540
AN - SCOPUS:85152624608
SN - 1439-4456
VL - 25
JO - Journal of Medical Internet Research
JF - Journal of Medical Internet Research
M1 - e44410
ER -