Findings
With recent remarkable advances in diagnostic and treatment technologies for genetic syndromes, a timely diagnosis has become crucial [
1]. However, establishing an accurate diagnosis remains a lengthy, expensive, and expert-dependent process, especially considering the increasing number of possible rare syndromes [
2]. Facial dysmorphology often offers valuable diagnostic clues, but interpreting these facial features and identifying specific genetic syndromes can pose a challenge, even for genetic experts [
3,
4]. Recent studies have demonstrated the considerable potential of deep learning-based facial analysis technologies as diagnostic tools for genetic syndromes [
1-
6]. The aim of this study was to contribute additional data on the utility of facial analysis technology for genetic diagnosis in a typical pediatric genetic clinic.
A retrospective review was carried out from September 2020 to August 2022 of patients treated at the clinic for rare genetic diseases at Chungbuk National University Hospital. We identified children under 18 years of age who had not previously received a definitive genetic diagnosis and underwent a comprehensive genetic evaluation. These patients sought consultation at the clinic for rare genetic diseases for various reasons, including unexplained developmental delay, intellectual disability, and craniofacial dysmorphism. The clinician's discretion guided the conventional diagnostic investigations. Chromosomal microarray (CMA) and/or targeted panel sequencing were employed as first-line tests, while whole-genome sequencing (WGS) was utilized as a second-line test when the genetic cause remained unidentified. Additional confirmatory tests, such as G-banded karyotyping and methylation polymerase chain reaction assay, were conducted as needed. All parents or legal guardians provided written informed consent for image publication. Furthermore, at least one full frontal face image was collected beforehand to validate the facial recognition software for patients who later received a confirmed genetic diagnosis. These photos, along with pertinent clinical non-facial features, were uploaded to the CLINIC application of the Face2Gene (F2G) web interface (
https://www.face2gene.com/). The resulting analysis was then correlated with the molecular diagnosis. The app presents the degree of similarity as a bar plot, indicating "high," "medium," or "low" levels. The study adhered to the Declaration of Helsinki and received approval from the Institutional Review Board of Chungbuk National University Hospital (2020-04-005). Written informed consent was obtained from all patients’ parents for research purposes and publication, including photographs with recognizable faces.
Frontal facial photographs of 23 children, suspected of having a genetic syndrome with craniofacial dysmorphism, were available for review (14 females; median age, 5.5 years; range, 0.1 to 16.6). Genetic testing was conducted based on the clinical diagnosis made by the clinician. Of these children, 12 received a genetic diagnosis through the first-tier test. The remaining 11 underwent second-tier testing, with two of them receiving a final genetic diagnosis. The overall diagnostic yield of this study was 60.9% (14/23).
Table 1 presents the clinical characteristics and genetic findings of the 14 patients who received a final genetic diagnosis. The cases that made it to the top 10 lists are also indicated, along with the level of similarity in gestalt and feature. Nine out of the fourteen (64.3%) patients had conditions that were correctly included in the top 10 differential diagnoses by the F2G system. Among the patients who matched the top 10 lists, the most common disease was congenital myopathies (three cases), followed by Williams syndrome (two cases), Rett syndrome (two cases), Prader-Willi syndrome (one case), and oculocutaneous albinism (one case). The majority of the syndromes (88.9%) were diagnosed with the first-line genetic test, based on the clinician's clinical diagnosis. Interestingly, for the diseases included in the top 10 lists, the gestalt similarity was only 55.6%. However, the non-facial phenotypic features added by the clinician showed a similarity of more than a medium level in all patients. Four children were diagnosed with syndromes that did not make it to the top 10 differential diagnoses. These included 16p11.2 deletion syndrome (two cases), seizure threshold 2 (SZT2)-related early onset epileptic encephalopathy (one case), and Gabriele-de Vries syndrome (one case).
This study illustrates the integration of facial analysis technology into the practical clinical workflow for Korean children who are strongly suspected of having a genetic syndrome. A definitive genetic diagnosis was reached in 60.9% of children exhibiting distinct facial features, following a systematic genetic work-up. Of these children, 64.3% had conditions that were included in the top 10 syndromes suggested by F2G.
Since the first publication by Boehringer et al. [
7] in 2011, several studies have assessed facial phenotyping software. This software uses patient photographs and automated computer tools to identify genetic syndromes [
2,
7-
9]. F2G CLINIC, a deep learning-based system, is a freely available online application for facial phenotyping in patients with genetic syndromes. It is widely used by geneticists [
2,
5,
10]. This software has been successfully validated for various genetic syndromes, including Cornelia de Lange syndrome, Williams-Beuren syndrome, Noonan syndrome, and Down syndrome [
11-
14]. Following the report of Gurovich et al. [
2] that F2G successfully suggested the correct syndrome in 90.5% of cases, several clinical studies have been conducted on various ethnic groups. A German study involving 323 patients diagnosed with 17 different genetic syndromes demonstrated DeepGestalt’s high top 10 sensitivity at 91% [
5]. In a smaller Turkish study, 48% (12/25 patients) were correctly matched within a suggested list of 30 diseases [
3]. A Canadian research group achieved a slightly lower diagnostic yield of 57% in its top 10 [
4]. A recent Japanese study reported a top 10 sensitivity rate of 86.6% (52/60 patients) in a routine clinical setting [
6]. In the study by Porras et al. [
15], the dataset included 1,400 children with 128 genetic conditions, and the average detection rate was 88%. However, the accuracy of these tools has been found to be lower in Asian populations (82%) compared to White (90%) and Hispanic populations (91%).
Our data demonstrated a relatively low sensitivity (64.3%), a finding that is consistent with previous studies that reported inconsistent results in real clinical settings. This fluctuation in detection rate can be attributed to several factors. First, facial dysmorphology presents a significant challenge due to its variations according to age, race, and ethnicity [
11-
14]. Porras et al. [
15] suggested that the lower-than-average accuracy in non-White populations could be due to the limited number of corresponding patients in their dataset. The development of race-specific facial phenotype models could enhance the accuracy of this technology. Second, the effectiveness of deep learning varies depending on the rarity of the genetic syndrome itself. This makes it challenging to detect newly identified rare genetic diseases, except for those that have been extensively studied previously. Indeed, certain syndromes consistently appear in the top suggested syndromes [
4]. Our study underscores the limited utility of this technology in diagnosing various rare diseases using advanced diagnostic tools such as CMA and WGS. Third, the additional input of clinical features by clinicians is extremely valuable. In our study, cases that matched the top 10 list showed only a partial similarity of gestalt, but a 100% similarity above the medium level in annotated phenotypic features. In instances where a syndrome is suggested with a medium-high probability, clinicians should critically evaluate whether the suggestion aligns with the patient's phenotype [
16]. This underscores that the utility of this software is enhanced when clinicians supplement the physical examination and patient history, rather than relying solely on facial photography.
The small number of subjects included in our study is a limitation. Thus, in cases where direct comparison of similarity is necessary, it may be essential to directly compare the similarity bars. Another limitation is that we expressed the similarity of gestalts and features in three broad levels, rather than using specific scores. This approach means that even within the same level of similarity, there can be significant differences in scores. Therefore, it may be necessary to directly compare similarity bars when required.
In conclusion, our data underscore the utility of facial analysis technology as an auxiliary tool in pediatric genetic clinics at the point-of-care. We also underscore several considerations for enhancing the accuracy of this machine learning-based screening tool for genetic diseases. By augmenting the quality and volume of training datasets, computer-assisted pattern recognition platforms can function as invaluable decision support tools, bolstering clinicians’ confidence in genetic diagnostics.
CONFLICT OF INTEREST
Jon Soo Kim is an editorial board member of the journal, but he was not involved in the peer reviewer selection, evaluation, or decision process of this article. No other potential conflicts.
Acknowledgments
Whole-genome sequencing was supported by the Bio & Medical Technology Development Program of the National Research Foundation (NRF) funded by the Ministry of Health and Welfare, Ministry of Science and ICT, Ministry of Trade Industry and Energy, Korea Disease Control and Prevention Agency (No. 2020M3E5D7086780, The National Project of Bio Big Data).
This work was supported by a research grant from Chungbuk National University Hospital in 2020.
Table 1.
Summary of 14 patients receiving genetic diagnosis
Case |
Sex |
Age (yr) |
HPO phenotypea
|
Genetic finding |
Diagnosis (OMIM) |
Similarity |
Top 10 lists |
Gestalt |
Feature |
F01 |
F |
3 |
Global DD, feeding difficulties in infancy, FTT, hypotonia |
PV_KLHL40 |
Nemaline myopathy (#615340) |
None |
Med |
Included |
c.1582G>A(p.Glu528Lys) Hom |
F02 |
F |
1.1 |
Global DD, ID, feeding difficulty in infancy, FTT, hypotonia, strabismus, seizure |
LPV_YY1 |
Gabriele-de Vries syndrome (#617557) |
- |
- |
- |
c.1130A>G(p.His377Arg) |
|
Het, de novo
|
|
F05 |
M |
16.6 |
Global DD, ID, autistic behavior, seizure |
PV_SZT2 |
Developmental epileptic encephalopathy (#615476) |
- |
- |
- |
c.2507del(p.Ser836Metfs*95) Het |
|
LPV_SZT2 |
|
c.6553C>T(p.Arg2185Trp) Het |
|
F06 |
M |
0.4 |
Nystagmus, albinism |
LPV_OCA2 |
Oculocutaneous albinism (#203200) |
None |
High |
Included |
c.1160C>T(p.Thr387Met) Het |
|
Exon 20-24 deletion of OCA Het |
|
F07 |
F |
16.6 |
Global DD, ID, autistic behavior, stereotypical body rocking |
P_arr 7q11.23 |
Williams syndrome (#194050) |
Med |
Med |
Included |
(72,589,903-74,392,574)×1 |
|
F08 |
F |
8.5 |
Myopathy, feeding difficulty in infancy, lumbar hyperlordosis, waddling gait |
PV_RYR1 c.5653G>T(p.Glu1885*) |
Central core disease (#255320) |
None |
Med |
Included |
VUS_RYR1 c.7487C>T(p.Pro2496Leu) |
|
Het (compound) |
|
F09 |
F |
9.9 |
Myopathy, feeding difficulty in infancy, lumbar hyperlordosis, waddling gait |
PV_RYR1 c.5653G>T(p.Glu1885*) |
Central core disease (#255320) |
None |
Med |
Included |
VUS_RYR1 c.7487C>T(p.Pro2496Leu) |
|
Het (compound) |
|
F11 |
F |
0.1 |
Premature birth, hemivertebrae |
LP_arr 16q11.2 |
16p11.2 deletion syndrome (#611913) |
- |
- |
- |
(29,580,020-30,177,240)×1 |
|
F16 |
F |
7.2 |
Global DD, ID, autistic behavior, stereotypical hand wringing, seizure |
PV_FOXG1 |
Rett syndrome (#613454) |
Med |
Med |
Included |
c.256del(p.Gln86Argfs*106) |
|
Het, de novo
|
|
F19 |
F |
0.1 |
Hearing impairment, umbilical hernia |
P_arr 7q11.23 |
Williams syndrome (#194050) |
Med |
Med |
Included |
(72,718,277-74,141,603)×1 |
|
F20 |
M |
6.6 |
Global DD, ID, autistic behavior |
LP_arr 16p12.2 |
16p12.2 deletion syndrome (#136570) |
- |
- |
- |
(21581028_21946045)×1 |
|
F21 |
F |
5.5 |
Global DD, ID, ectopic kidney, short stature |
LP_arr 16p12.2 |
16p12.2 deletion syndrome (#136570) |
- |
- |
- |
(21,405,327-21,816,543)×1 |
|
F22 |
M |
6 |
Hypotonia, feeding difficulty in infancy, cryptorchidism |
LP_arr15q11.2 |
Prader-Willi syndromeb (#176270) |
Low |
Med |
Included |
(22,817,870-102,397,317)×2 hmz |
|
Uniparental disomy |
|
F23 |
F |
11 |
Global DD, ID, autistic behavior, stereotypical hand wringing, seizure |
PV_MECP2 |
Rett syndrome (#312750) |
Med |
Med |
Included |
c.880C>T(p.Arg294*) Het |
|
References
4. Marwaha A, Chitayat D, Meyn MS, Mendoza-Londono R, Chad L. The point-of-care use of a facial phenotyping tool in the genetics clinic: enhancing diagnosis and education with machine learning. Am J Med Genet A 2021;185:1151-8.
7. Boehringer S, Guenther M, Sinigerova S, Wurtz RP, Horsthemke B, Wieczorek D. Automated syndrome detection in a set of clinical facial photographs. Am J Med Genet A 2011;155A:2161-9.
10. Hurst AC. Facial recognition software in clinical dysmorphology. Curr Opin Pediatr 2018;30:701-6.
15. Porras AR, Rosenbaum K, Tor-Diez C, Summar M, Linguraru MG. Development and evaluation of a machine learning-based point-of-care screening tool for genetic syndromes in children: a multinational retrospective study. Lancet Digit Health 2021;3:e635-43.