Algorithms for Linguistic Description of Categorical Data

Artificial Intelligence in Project Management and Making Decisions (UCIENCIA 2021)

Abstract

The paper proposes a method comprising five algorithms that produce composite linguistic summaries from categorical data. Each composite summary expresses an Evidence, Contrast, or Emphasis relation between at least two constituent summaries. The constituent summaries are instances of the classical LDS protoforms, built here from frequent L1 item sets and association rules obtained by applying an association rule mining algorithm. To verify that the method is feasible to implement, we ran a use case on a dataset of 2128 cases from the Economic Chamber of the Provincial People’s Court of Havana. The results were consistent with expectations: the method obtained 18 Evidence relations, 11 Contrast relations, and 16 Emphasis relations. We also evaluated the interpretability of the composite summaries obtained in the use case, measuring how accurately the relation type implicit in each summary was identified and how understandable the summaries were. In both cases, the results were positive.
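To make the idea of constituent summaries more concrete, the following is a minimal sketch, not the chapter's algorithms. It assumes a toy set of categorical records (invented here, not the court dataset), a simple protoform of the shape "Q of the cases have S" for frequent single items, and an extended protoform "Q of the cases with R have S" for association rules. The function names, the crisp quantifier thresholds, and the sentence templates are all assumptions made for illustration; the abstract does not specify them, and the step that combines constituents into Evidence, Contrast, or Emphasis relations is not sketched because the abstract does not describe it.

```python
from collections import Counter

# Toy categorical records (hypothetical; not the dataset used in the chapter).
records = [
    {"type": "contract", "outcome": "upheld"},
    {"type": "contract", "outcome": "upheld"},
    {"type": "contract", "outcome": "dismissed"},
    {"type": "damages", "outcome": "upheld"},
    {"type": "contract", "outcome": "upheld"},
]


def frequent_items(rows, min_support):
    """Return {(attribute, value): support} for single items meeting min_support."""
    counts = Counter(item for row in rows for item in row.items())
    n = len(rows)
    return {item: c / n for item, c in counts.items() if c / n >= min_support}


def rule_confidence(rows, antecedent, consequent):
    """Confidence of the rule antecedent -> consequent (both (attribute, value) pairs)."""
    covered = [r for r in rows if r.get(antecedent[0]) == antecedent[1]]
    if not covered:
        return 0.0
    hits = sum(1 for r in covered if r.get(consequent[0]) == consequent[1])
    return hits / len(covered)


def quantifier(p):
    """Map a proportion to a linguistic quantifier (assumed crisp thresholds)."""
    if p >= 0.9:
        return "Almost all"
    if p >= 0.6:
        return "Most"
    if p >= 0.4:
        return "About half"
    if p >= 0.1:
        return "Some"
    return "Few"


def simple_summary(item, support):
    """Render a frequent single item as 'Q of the cases have S'."""
    attr, value = item
    return f"{quantifier(support)} of the cases have {attr} = {value}"


def extended_summary(antecedent, consequent, confidence):
    """Render a rule as 'Q of the cases with R have S'."""
    return (
        f"{quantifier(confidence)} of the cases with "
        f"{antecedent[0]} = {antecedent[1]} have {consequent[0]} = {consequent[1]}"
    )


items = frequent_items(records, min_support=0.5)
for item, support in items.items():
    print(simple_summary(item, support))

conf = rule_confidence(records, ("type", "contract"), ("outcome", "upheld"))
print(extended_summary(("type", "contract"), ("outcome", "upheld"), conf))
```

In a fuller setting the quantifier would typically be a fuzzy set with a degree of truth rather than a crisp threshold, and the item sets and rules would come from a mining algorithm such as Apriori rather than from direct counting.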



Author information

Corresponding author

Correspondence to Carlos R. Rodríguez Rodríguez.

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Cite this chapter

Rodríguez Rodríguez, C.R., Zuev, D.S., Peña Abreu, M. (2022). Algorithms for Linguistic Description of Categorical Data. In: Piñero Pérez, P.Y., Bello Pérez, R.E., Kacprzyk, J. (eds) Artificial Intelligence in Project Management and Making Decisions. UCIENCIA 2021. Studies in Computational Intelligence, vol 1035. Springer, Cham. https://doi.org/10.1007/978-3-030-97269-1_5
