TY - JOUR
T1 - Imputing cognitive impairment in SPARK, a large autism cohort
AU - the SPARK Consortium
AU - Shu, Chang
AU - Green Snyder, Lee Anne
AU - Shen, Yufeng
AU - Chung, Wendy K.
AU - Abbeduto, Leonard
AU - Aberbach, Gabriella
AU - Aberle, Shelley
AU - Acampado, John
AU - Ace, Andy
AU - Ahlers, Kaitlyn
AU - Albright, Charles
AU - Alessandri, Michael
AU - Alvarez, Nicolas
AU - Amaral, David
AU - Amatya, Alpha
AU - Andrus, Alicia
AU - Anglo, Claudine
AU - Annett, Rob
AU - Arzate, Eduardo
AU - Astrovskaya, Irina
AU - Baalman, Kelli
AU - Baer, Melissa
AU - Baraghoshi, Gabriele
AU - Bardett, Nicole
AU - Barnes, Sarah
AU - Bashar, Asif
AU - Bates, Heidi
AU - Beard, Katie
AU - Becerra, Juana
AU - Beckwith, Malia
AU - Beeson, Landon
AU - Beeson, Josh
AU - Bell, Brandi
AU - Belli, Monica
AU - Bentley, Dawn
AU - Berger, Natalie
AU - Berman, Anna
AU - Bernier, Raphael
AU - BerryKravis, Elizabeth
AU - Berwanger, Mary
AU - Birdwell, Shelby
AU - Blank, Elizabeth
AU - Booker, Stephanie
AU - Bordofsky, Aniela
AU - Bower, Erin
AU - Bradley, Catherine
AU - Brewster, Stephanie
AU - Brooks, Elizabeth
AU - Shulman, Lisa
AU - Valicenti-Mcdermott, Maria
N1 - Funding Information:
We are grateful to all the families in SPARK, the SPARK clinical sites and SPARK staff. We appreciate obtaining access to the SPARK phenotypic data set on SFARI Base. Approved researchers can obtain the SPARK population data set described in this study by applying at https://base.sfari.org.
Publisher Copyright:
© 2021 International Society for Autism Research and Wiley Periodicals LLC.
PY - 2022/1
Y1 - 2022/1
N2 - Diverse large cohorts are necessary for dissecting subtypes of autism, and intellectual disability is one of the most robust endophenotypes for analysis. However, current cognitive assessment methods are not feasible at scale. We developed five commonly used machine learning models to predict cognitive impairment (FSIQ<80 and FSIQ<70) and FSIQ scores among 521 children with autism using parent-reported online surveys in SPARK, and evaluated them in an independent set (n = 1346) with a missing data rate up to 70%. We assessed accuracy, sensitivity, and specificity by comparing predicted cognitive levels against clinical IQ data. The elastic-net model has good performance (AUC = 0.876, sensitivity = 0.772, specificity = 0.803) using 129 predictive features to impute cognitive impairment (FSIQ<80). Top-ranked predictive features included parent-reported language and cognitive levels, age at autism diagnosis, and history of services. Prediction of FSIQ<70 and FSIQ scores also showed good performance. We show cognitive levels can be imputed with high accuracy for children with autism, using commonly collected parent-reported data and standardized surveys. The current model offers a method for large-scale autism studies seeking estimates of cognitive ability when standardized psychometric testing is not feasible. Lay Summary: Children with autism who have more severe learning challenges or cognitive impairment have different needs that are important to consider in research studies. When children in our study were missing standardized cognitive testing scores, we were able to use machine learning with other information to correctly “guess” when they have cognitive impairment about 80% of the time. We can use this information in research in the future to develop more appropriate treatments for children with autism and cognitive impairment.
AB - Diverse large cohorts are necessary for dissecting subtypes of autism, and intellectual disability is one of the most robust endophenotypes for analysis. However, current cognitive assessment methods are not feasible at scale. We developed five commonly used machine learning models to predict cognitive impairment (FSIQ<80 and FSIQ<70) and FSIQ scores among 521 children with autism using parent-reported online surveys in SPARK, and evaluated them in an independent set (n = 1346) with a missing data rate up to 70%. We assessed accuracy, sensitivity, and specificity by comparing predicted cognitive levels against clinical IQ data. The elastic-net model has good performance (AUC = 0.876, sensitivity = 0.772, specificity = 0.803) using 129 predictive features to impute cognitive impairment (FSIQ<80). Top-ranked predictive features included parent-reported language and cognitive levels, age at autism diagnosis, and history of services. Prediction of FSIQ<70 and FSIQ scores also showed good performance. We show cognitive levels can be imputed with high accuracy for children with autism, using commonly collected parent-reported data and standardized surveys. The current model offers a method for large-scale autism studies seeking estimates of cognitive ability when standardized psychometric testing is not feasible. Lay Summary: Children with autism who have more severe learning challenges or cognitive impairment have different needs that are important to consider in research studies. When children in our study were missing standardized cognitive testing scores, we were able to use machine learning with other information to correctly “guess” when they have cognitive impairment about 80% of the time. We can use this information in research in the future to develop more appropriate treatments for children with autism and cognitive impairment.
UR - http://www.scopus.com/inward/record.url?scp=85117321099&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85117321099&partnerID=8YFLogxK
U2 - 10.1002/aur.2622
DO - 10.1002/aur.2622
M3 - Article
C2 - 34636158
AN - SCOPUS:85117321099
SN - 1939-3806
VL - 15
SP - 156
EP - 170
JO - Autism Research
JF - Autism Research
IS - 1
ER -