This study investigates whether AI-driven analysis of speech can accurately predict clinical diagnoses and assess risk for various mental or behavioral health conditions, including attention-deficit/hyperactivity disorder (ADHD), autism spectrum disorder, bipolar disorder, generalized anxiety disorder, major depressive disorder, obsessive compulsive disorder (OCD), post-traumatic stress disorder (PTSD), and schizophrenia. We aim to develop tools that can support clinicians in making more accurate and efficient diagnoses.
Study Type
OBSERVATIONAL
Enrollment
500
A comprehensive machine-learning tool aimed at providing probability estimates for several compatible disorders, including Attention Deficit Hyperactivity Disorder (ADHD), Autism Spectrum Disorder (ASD), Bipolar Affective Disorder (BPAD), Generalized Anxiety Disorder (GAD), Major Depressive Disorder (MDD), Obsessive Compulsive Disorder (OCD), Post-Traumatic Stress Disorder (PTSD), and Schizophrenia Spectrum Disorders (SSD). By offering a multi-diagnostic assessment based on speech analysis, Solicue aims to assist clinicians in navigating this complexity and potentially identifying conditions that might otherwise be overlooked in initial assessments. Solicue leverages machine learning to analyze a wide range of clinically relevant speech features, including linguistic content, prosodic elements (such as pitch, rhythm, and intonation), and other paralinguistic features.
Mercuria is designed to stratify the risk of bipolar disorder in individuals presenting with depressive symptoms. This is a critical clinical need, as misdiagnosis of bipolar disorder as unipolar depression is common and can lead to inappropriate treatment, potentially worsening outcomes. By analyzing speech patterns characteristic of bipolar disorder, Mercuria aims to provide an additional tool for clinicians to differentiate between these conditions more accurately, guiding appropriate treatment decisions. Mercuria leverages machine learning to analyze a wide range of clinically relevant speech features, including linguistic content, prosodic elements (such as pitch, rhythm, and intonation), and other paralinguistic features.
The Brookline Center
Brookline, Massachusetts, United States
Allwell Behavioral Health Services
Zanesville, Ohio, United States
Speech Battery ("PSY-10") audio
The speech battery consists of prompt-based tasks designed to elicit speech responses from participants in the form of monologues. This includes text reading, recall, and picture description tasks.
Time frame: At initial assessment
Clinical diagnosis
Clinician diagnosis will be recorded for each participant at first assessment, 3-month, and 6-month follow-up. Diagnoses will be made according to ICD-11 or DSM-5 criteria for the compatible disorders: ADHD, ASD, BPAD, GAD, MDD, OCD, PTSD, and SSD. Additional relevant labels such as other mental health disorders, clinical high risk (CHR) and substance use may be recorded.
Time frame: 0 months, 3 months, 6 months
Performance of AI models
The performance of the Mercuria and Solicue AI models will be evaluated using performance metrics of accuracy, balanced accuracy, sensitivity (recall), specificity, positive predictive value (precision), negative predictive value, F1 score, AUC-ROC. Predicted labels will be compared with the ground truth clinical diagnoses obtained from the participating mental health clinics. Confidence acceptance threshold will be set.
Time frame: 0 months, 3 months, 6 months
Patient Health Questionnaire-9 (PHQ-9)
The PHQ-9 is a 9-item self-reported questionnaire that assesses the severity of depressive symptoms.
Time frame: At initial assessment
Mood Disorder Questionnaire (MDQ)
The MDQ is a 15-item self-report screening instrument designed to detect bipolar spectrum disorders. It consists of 13 yes/no questions about manic symptoms, followed by two questions about the co-occurrence and impact of these symptoms.
Time frame: At initial assessment
DSM-5 Level 1 Cross-Cutting Symptom Measure (DSM-XC)
The DSM-5 Level 1 Cross-Cutting Symptom Measure is a 23-item self-report questionnaire that screens for 13 psychiatric domains, including depression, anxiety, and substance use.
Time frame: At initial assessment
Reported Distress
To assess the safety of online speech assessment during clinical evaluation at initial intake. The safety of online speech assessment will be measured by severity of reported distress measured using the User Feedback Form (UFF).
Time frame: After initial assessment
This platform is for informational purposes only and does not constitute medical advice. Always consult a qualified healthcare professional.