Improved Sentence Similarity Measurement in the Medical Field Based on Syntactico-Semantic Knowledge

  • Conference paper
Intelligent Systems Design and Applications (ISDA 2021)

Part of the book series: Lecture Notes in Networks and Systems (LNNS, volume 418)

Abstract

Computing semantic sentence similarity plays a vital role in a range of text mining applications. In the clinical domain, Semantic Textual Similarity makes it possible to detect and eliminate redundant information, which may reduce cognitive burden and improve the clinical decision-making process. Several methods have been proposed to measure sentence similarity based on semantic knowledge and learning models. Despite these efforts, the results of these methods remain unsatisfactory, because much relevant knowledge, such as semantic class, thematic role, and syntactico-semantic knowledge like semantic predicates, is not taken into account. In this paper, we propose a novel method for measuring semantic similarity between clinical sentences, based on deep learning and on syntactico-semantic knowledge such as semantic arguments and thematic roles. An experiment carried out on the MedSTS dataset yielded better results, showing a high correlation (r = 0.89) with human ratings.
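
To make the evaluation criterion concrete, the following minimal sketch shows how a correlation between system scores and human ratings can be computed, assuming that r in the abstract denotes the Pearson correlation coefficient. The function name `pearson_r` and the toy score lists are hypothetical and purely illustrative; they are not taken from the paper or from MedSTS, and the 0 to 5 rating scale is an assumption.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of co-deviations and sums of squared deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical toy data (NOT from the paper): human similarity ratings on an
# assumed 0-5 scale and the scores a similarity model might assign.
human_ratings = [0.0, 1.5, 3.0, 4.5, 5.0]
model_scores = [0.2, 1.0, 3.3, 4.0, 4.9]

if __name__ == "__main__":
    print(f"r = {pearson_r(model_scores, human_ratings):.2f}")
```

A value close to 1 means the model ranks and spaces sentence pairs much as the annotators do; a value near 0 means no linear relationship.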

Author information

Corresponding author

Correspondence to Wafa Wali.

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

Cite this paper

Wali, W., Gargouri, B. (2022). Improved Sentence Similarity Measurement in the Medical Field Based on Syntactico-Semantic Knowledge. In: Abraham, A., Gandhi, N., Hanne, T., Hong, TP., Nogueira Rios, T., Ding, W. (eds) Intelligent Systems Design and Applications. ISDA 2021. Lecture Notes in Networks and Systems, vol 418. Springer, Cham. https://doi.org/10.1007/978-3-030-96308-8_83
