Abstract
An efficient computation of ontology mappings requires optimized algorithms and significant computing resources especially for large life science ontologies. We describe how we optimized n-gram matching for computing the similarity of concept names and synonyms in our match system GOMMA. Furthermore, we outline how to enable a highly parallel string matching on Graphical Processing Units (GPU). The evaluation on the OAEI LargeBio match task demonstrates the high effectiveness of the proposed optimizations and that the use of GPUs in addition to standard processors enables significant performance improvements.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Ding, B., König, A.C.: Fast set intersection in memory. PVLDB 4(4) (2011)
Ehrig, M., Staab, S.: QOM – quick ontology mapping. In: McIlraith, S.A., Plexousakis, D., van Harmelen, F. (eds.) ISWC 2004. LNCS, vol. 3298, pp. 683–697. Springer, Heidelberg (2004)
Euzenat, J., Shvaiko, P.: Ontology matching. Springer, New York (2007)
Forchhammer, B., et al.: Duplicate Detection on GPUs. In: BTW (2013)
Foundation Model of Anatomy, http://fma.biostr.washington.edu/
Gross, A., Hartung, M., Kirsten, T., Rahm, E.: GOMMA Results for OAEI 2012. In: Proc. 7th Ontology Matching Workshop (2012)
Gross, A., Hartung, M., Kirsten, T., Rahm, E.: On matching large life science ontologies in parallel. In: Lambrix, P., Kemp, G. (eds.) DILS 2010. LNCS, vol. 6254, pp. 35–49. Springer, Heidelberg (2010)
He, B., et al.: Relational joins on graphics processors. In: Proc. SIGMOD (2008)
Hu, W., Qu, Y., Cheng, G.: Matching large ontologies: A divide-and-conquer approach. Data & Knowledge Engineering 67(1) (2008)
Kirsten, T., Gross, A., Hartung, M., Rahm, E.: GOMMA: A Component-based Infrastructure for managing and analyzing Life Science Ontologies and their Evolution. Journal of Biomedical Semantics 2, 6 (2011)
NCI Thesaurus, http://ncit.nci.nih.gov/
Ngomo, A.-C.N., Kolb, L., Heino, N., Hartung, M., Auer, S., Rahm, E.: When to reach for the cloud: Using parallel hardware for link discovery. In: Cimiano, P., Corcho, O., Presutti, V., Hollink, L., Rudolph, S. (eds.) ESWC 2013. LNCS, vol. 7882, pp. 275–289. Springer, Heidelberg (2013)
Noy, N., et al.: BioPortal: ontologies and integrated data resources at the click of a mouse. Nucleic Acids Research 37(suppl. 2) (2009)
Ontology Alignment Evaluation Initiative, http://oaei.ontologymatching.org/
Owens, J., et al.: GPU computing. Proceedings of the IEEE 96(5) (2008)
Rahm, E.: Towards Large Scale Schema and Ontology Matching. In: Schema Matching and Mapping. Springer (2011)
Vouzis, P., Sahinidis, N.: GPU-BLAST: using graphics processors to accelerate protein sequence alignment. Bioinformatics 27(2) (2011)
Xiao, C., Wang, W., Lin, X., Yu, J.X., Wang, G.: Efficient similarity joins for near-duplicate detection. ACM Trans. Database Syst. 36(3) (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hartung, M., Kolb, L., Groß, A., Rahm, E. (2013). Optimizing Similarity Computations for Ontology Matching - Experiences from GOMMA. In: Baker, C.J.O., Butler, G., Jurisica, I. (eds) Data Integration in the Life Sciences. DILS 2013. Lecture Notes in Computer Science(), vol 7970. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39437-9_7
Download citation
DOI: https://doi.org/10.1007/978-3-642-39437-9_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-39436-2
Online ISBN: 978-3-642-39437-9
eBook Packages: Computer ScienceComputer Science (R0)