Repositorio Dspace

A knowledge graph-based data harmonization framework for secondary data reuse

Mostrar el registro sencillo del ítem

dc.contributor.author Abad-Navarro,Francisco
dc.contributor.author Martinez-Costa,Catalina
dc.date.accessioned 2025-10-20T14:40:16Z
dc.date.available 2025-10-20T14:40:16Z
dc.date.issued 2024-01
dc.identifier.citation Abad-Navarro F, Martínez-Costa C. A knowledge graph-based data harmonization framework for secondary data reuse. Computer Methods and Programs in Biomedicine. enero de 2024;243:107918.
dc.identifier.issn 0169-2607
dc.identifier.uri https://sms.carm.es/ricsmur/handle/123456789/20478
dc.description.abstract Background and objective: The adoption of new technologies in clinical care systems has propitiated the availability of a great amount of valuable data. However, this data is usually heterogeneous, requiring its harmonization to be integrated and analysed. We propose a semantic-driven harmonization framework that (1) enables the meaningful sharing and integration of healthcare data across institutions and (2) facilitates the analysis and exploitation of the shared data. Methods: The framework includes an ontology-based common data model (i.e. SCDM), a data transformation pipeline and a semantic query system. Heterogeneous datasets, mapped to different terminologies, are integrated by using an ontology-based infrastructure rooted in a top-level ontology. A graph database is generated by using these mappings, and web-based semantic query system facilitates data exploration. Results: Several datasets from different European institutions have been integrated by using the framework in the context of the European H2020 Precise4Q project. Through the query system, data scientists were able to explore data and use it for building machine learning models. Conclusions: The flexible data representation using RDF, together with the formal semantic underpinning provided by the SCDM, have enabled the semantic integration, query and advanced exploitation of heterogeneous data in the context of the Precise4Q project.
dc.language.iso eng
dc.publisher ELSEVIER IRELAND LTD
dc.rights Atribución-NoComercial-SinDerivadas 3.0 España
dc.rights.uri http://creativecommons.org/licenses/by-nc-nd/3.0/es/ *
dc.subject.mesh Pattern Recognition, Automated
dc.subject.mesh Knowledge Bases
dc.subject.mesh Data Management
dc.subject.mesh Machine Learning
dc.subject.mesh Delivery of Health Care
dc.subject.mesh Semantics
dc.title A knowledge graph-based data harmonization framework for secondary data reuse
dc.type info:eu-repo/semantics/article
dc.identifier.pmid 37981455
dc.relation.publisherversion https://dx.doi.org/10.1016/j.cmpb.2023.107918
dc.type.version info:eu-repo/semantics/publishedVersion
dc.identifier.doi 10.1016/j.cmpb.2023.107918
dc.journal.title Computer Methods and Programs in Biomedicine
dc.identifier.essn 1872-7565


Ficheros en el ítem

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem

Atribución-NoComercial-SinDerivadas 3.0 España Excepto si se señala otra cosa, la licencia del ítem se describe como Atribución-NoComercial-SinDerivadas 3.0 España

Buscar en DSpace


Búsqueda avanzada

Listar

Mi cuenta