Resumen:
|
[EN] Background Genomics-based clinical diagnosis has emerged as a novel medical approach to improve diagnosis and treatment. However, advances in sequencing techniques have increased the generation of genomics data
dramatically. ...[+]
[EN] Background Genomics-based clinical diagnosis has emerged as a novel medical approach to improve diagnosis and treatment. However, advances in sequencing techniques have increased the generation of genomics data
dramatically. This has led to several data management problems, one of which is data dispersion (i.e., genomics data
is scattered across hundreds of data repositories). In this context, geneticists try to remediate the above-mentioned
problem by limiting the scope of their work to a single data source they know and trust. This work has studied
the consequences of focusing on a single data source rather than considering the many diferent existing genomics
data sources.
Methods The analysis is based on the data associated with two groups of disorders (i.e., oncology and cardiology)
accessible from six well-known genomic data sources (i.e., ClinVar, Ensembl, GWAS Catalog, LOVD, CIViC, and CardioDB). Two dimensions have been considered in this analysis, namely, completeness and concordance. Completeness has been evaluated at two levels. First, by analyzing the information provided by each data source with regard
to a conceptual schema data model (i.e., the schema level). Second, by analyzing the DNA variations provided by each
data source as related to any of the disorders selected (i.e., the data level). Concordance has been evaluated by comparing the consensus among the data sources regarding the clinical relevance of each variation and disorder.
Results The data sources with the highest completeness at the schema level are ClinVar, Ensembl, and CIViC. ClinVar
has the highest completeness at the data level data source for the oncology and cardiology disorders. However, there
are clinically relevant variations that are exclusive to other data sources, and they must be considered in order to provide the best clinical diagnosis. Although the information available in the data sources is predominantly concordant,
discordance among the analyzed data exist. This can lead to inaccurate diagnoses.
Conclusion Precision medicine analyses using a single genomics data source leads to incomplete results. Also, there
are concordance problems that threaten the correctness of the genomics-based diagnosis results.
[-]
|
Agradecimientos:
|
This work was supported by the Valencian Innovation Agency and Innovation through the OGMIOS project (INNEST/2021/57), the Generalitat Valenciana through the CoMoDiD project (CIPROM/2021/023), and the GVA-Predoctoral ...[+]
This work was supported by the Valencian Innovation Agency and Innovation through the OGMIOS project (INNEST/2021/57), the Generalitat Valenciana through the CoMoDiD project (CIPROM/2021/023), and the GVA-Predoctoral Research Grant (ACIF/2021/117), and the Spanish State Research Agency through the DELFOS (PDC2021-121243-I00) and SREC (PID2021-123824OBI00) projects, MICIN/AEI/10.13039/501 100011033 and co-fnanced with ERDF and the European Union Next Generation EU/PRTR.
[-]
|