This study aims to establish a standardized, synchronized data collection system for pediatric symptom questionnaires, cough sounds, and breath sounds, and to construct a multimodal database of pediatric respiratory diseases including both disease cases and healthy controls. Using the final research labels determined by clinicians' diagnoses, health status assessments, and research team review as the reference standard, this study will develop and validate a multimodal assisted diagnostic model for common pediatric respiratory diseases based on symptom questionnaires, cough sounds, and breath sounds. The study will primarily evaluate the diagnostic performance of the model in distinguishing healthy children from children with respiratory diseases, screening for asthma and asthma-related cough, and identifying pneumonia, tracheitis/bronchitis, upper airway-related diseases, and common causes of chronic cough. It will also assess the incremental value of cough sounds and breath sounds beyond symptom questionnaire information.
This is a prospective, observational diagnostic accuracy study to be conducted at Shanghai Children's Medical Center. The study population will include children presenting with cough, wheezing, fever with respiratory symptoms, nasal congestion, rhinorrhea, sore throat, or other respiratory complaints, as well as healthy children recruited during routine health examinations. The study will establish a standardized and synchronized data collection workflow for pediatric symptom questionnaires, cough sounds, and breath sounds. All enrolled participants will complete a structured symptom questionnaire, undergo cough sound recording using a smartphone application, and undergo breath sound recording using an electronic stethoscope under unified protocols. Demographic and clinical information, including age, sex, disease duration, major symptoms, medical history, allergy history, family history, medication use, final clinical diagnosis, or health status assessment, will also be collected to construct a multimodal database of pediatric respiratory diseases including both disease cases and healthy controls. Based on this database, the study will develop a stepwise multimodal assisted diagnostic framework using a combination of conventional statistical learning and deep learning methods. Three diagnostic models will be constructed and compared: a symptom questionnaire-only model, a symptom questionnaire plus cough sound model, and a multimodal model integrating symptom questionnaires, cough sounds, and breath sounds. Using the final research labels determined by clinicians' diagnoses, health status assessments, and research team review as the reference standard, the study will evaluate the diagnostic performance of these models in distinguishing healthy children from children with respiratory diseases, screening for asthma and asthma-related cough, and identifying pneumonia, tracheitis/bronchitis, upper airway-related diseases, and common causes of chronic cough. Model performance will be assessed using AUC, AUPRC, sensitivity, specificity, positive predictive value, negative predictive value, F1 score, and accuracy. The study will further investigate the incremental value of cough sounds and breath sounds beyond symptom questionnaire information, and assess model stability and generalizability across different age groups, clinical settings, and device conditions. The findings are expected to provide evidence for the optimization, clinical translation, and potential home-based extension of multimodal artificial intelligence-assisted diagnostic models for pediatric respiratory diseases. The study will not interfere with routine clinical care, and the model outputs will not be used for real-time clinical decision-making.
Study Type
OBSERVATIONAL
Enrollment
1,400
Participants will complete a structured symptom questionnaire, cough sound recording using a smartphone application, and breath sound recording using an electronic stethoscope. These data will be used to develop and evaluate a multimodal assisted diagnostic model and will not guide real-time clinical decision-making.
Shanghai Children's Medical Center
Shanghai, Shanghai Municipality, China
Area under the receiver operating characteristic curve of the multimodal assisted diagnostic model
The area under the receiver operating characteristic curve will be used to evaluate the diagnostic performance of the multimodal assisted diagnostic model integrating symptom questionnaire, cough sound, and breath sound data. The model will be assessed against the final clinical research label for distinguishing children with respiratory diseases from healthy controls and for identifying major pediatric respiratory disease categories.
Time frame: Through study completion, an average of 1 year
Area under the precision-recall curve of the multimodal assisted diagnostic model
The area under the precision-recall curve will be used to evaluate the diagnostic performance of the multimodal assisted diagnostic model, particularly in settings with imbalanced disease categories. The model will be assessed against the final clinical research label.
Time frame: Through study completion, an average of 1 year
Sensitivity of the multimodal assisted diagnostic model
Time frame: Through study completion, an average of 1 year
Specificity of the multimodal assisted diagnostic model
Time frame: Through study completion, an average of 1 year
Positive predictive value of the multimodal assisted diagnostic model
Time frame: Through study completion, an average of 1 year
Negative predictive value of the multimodal assisted diagnostic model
Time frame: Through study completion, an average of 1 year
F1 score of the multimodal assisted diagnostic model
Time frame: Through study completion, an average of 1 year
Accuracy of the multimodal assisted diagnostic model
Time frame: Through study completion, an average of 1 year
This platform is for informational purposes only and does not constitute medical advice. Always consult a qualified healthcare professional.