Efficient large-scale biomedical ontology matching with anchor-based biomedical ontology partitioning and compact geometric semantic genetic programming
Created by W.Langdon from
gp-bibliography.bib Revision:1.8414
- @Article{Xue:2024:jii,
-
author = "Xingsi Xue and Donglei Sun and Achyut Shankar and
Wattana Viriyasitavat and Patrick Siarry",
-
title = "Efficient large-scale biomedical ontology matching
with anchor-based biomedical ontology partitioning and
compact geometric semantic genetic programming",
-
journal = "Journal of Industrial Information Integration",
-
year = "2024",
-
volume = "41",
-
pages = "100637",
-
keywords = "genetic algorithms, genetic programming, Large-scale
biomedical ontology matching, Anchor-based ontology
partitioning, Compact geometric semantic genetic
programming, Dominance improvement ratio",
-
ISSN = "2452-414X",
-
URL = "
https://www.sciencedirect.com/science/article/pii/S2452414X24000815",
-
DOI = "
doi:10.1016/j.jii.2024.100637",
-
abstract = "Biomedical ontology offers a structured framework to
model the biomedical knowledge in a machine-readable
format. However, the heterogeneity inherent in
biomedical ontologies hinders their communication.
Biomedical Ontology Matching (BOM) can address this
issue by identifying equivalent concepts in biomedical
ontologies. Recently, Evolutionary Algorithms (EAs)
based matching techniques have exhibited their
effectiveness in finding high-quality matching results.
However, due to the vast number of entities, and
intricate relationships between entities, it is
difficult for traditional EAs to efficiently solve the
BOM problem. To tackle this challenge, this paper
proposes an efficient BOM method to automatically match
large-scale biomedical ontologies. First, a novel
anchor-based biomedical ontology partitioning method is
developed to transform the large-scale BOM problem into
several small-scale matching tasks, reducing the search
space of the matching phase. Second, a new Compact
Geometric Semantic Genetic Programming (CGSGP) is
proposed to efficiently construct high-level Similarity
Feature for BOM, which can significantly reduce the
computational complexity. Lastly, a new fitness
function composed of the approximated evaluation metric
and the Dominance Improvement Ratio (DIR) is
introduced, which can overcome the solution's bias
improvement and enable the simultaneous matching of
multiple pairs of sub-ontologies without requiring the
standard alignment. The experiment verifies our
approach's performance on the Ontology Alignment
Evaluation Initiative (OAEI)'s Anatomy, Large Biomed
and Disease and Phenotype datasets. The experimental
results show that our method can efficiently determine
high-quality BOM results across different test cases,
whose performance significantly outperforms the
state-of-the-art BOM techniques",
- }
Genetic Programming entries for
Xingsi Xue
Donglei Sun
Achyut Shankar
Wattana Viriyasitavat
Patrick Siarry
Citations