Informatica logo


Login Register

  1. Home
  2. Issues
  3. Volume 33, Issue 2 (2022)
  4. A New Quality Measure and Visualization ...

Informatica

Information Submit your article For Referees Help ATTENTION!
  • Article info
  • Full article
  • Related articles
  • Cited by
  • More
    Article info Full article Related articles Cited by

A New Quality Measure and Visualization of the Short-Quantified Sentences of Natural Language on Maps – A Case on COVID-19 Data
Volume 33, Issue 2 (2022), pp. 321–342
Miroslav Hudec   Kristína Malovcová   Rijad Trumic   Eva Rakovská  

Authors

 
Placeholder
https://doi.org/10.15388/22-INFOR492
Pub. online: 20 June 2022      Type: Research Article      Open accessOpen Access

Received
1 July 2021
Accepted
1 June 2022
Published
20 June 2022

Abstract

Maps are a common tool for visualizing various statistical figures that describe development in our society. Domain experts, journalists, and general public can pose questions on how to emphasize regions where, for instance, most young patients have long stayed in hospitals. One of the visualization’s problems is expressing validities of short-quantified sentences for regions on maps. The truth value of a summary assigns a value from the unit interval, which makes it suitable for interpretation on maps by hues of a selected colour, but it does not reflect the data distribution among regions. To meet this goal, a new quality measure covering data distribution among districts and its aggregation by the ordinal sums of conjunctive and disjunctive functions with the truth value is proposed and documented on examples. The next proposal is a relative quantifier expressing significant proportion of entities. This model is applied to the interpretation of COVID-19 cases development in the Slovak Republic on real data from one health insurance company. Finally, this article discusses the applicability of the proposed approach in other areas where the interpretation of summarized sentences on maps is beneficial.

References

 
Aggarwal, C.C. (2015). Data Mining: The Textbook. Springer, Cham.
 
Alonso, J.M., Castiello, C., Magdalena, L., Mencer, C. (2021). Explainable Fuzzy Systems. Springer, Cham.
 
Beliakov, G., Pradera, A., Calvo, T. (2007). Aggregation Functions: A Guide for Practitioners. Springer, Berlin Heidelberg.
 
Birkhoff, G. (1967). Lattice Theory. Colloqium Publications, Vol. XXV, 3rd ed. American Mathematical Society, Providence.
 
Boran, F.E., Akay, D., Yager, R.R. (2016). An overview of methods for linguistic summarization with fuzzy sets. International Journal of General Systems, 61, 356–377.
 
Bugarín, A., Marín, N., Sánchez, D., Trivino, G. (2015). Aspects of quality evaluation in linguistic descriptions of data. In: Proceedings of the 2015 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE 2015), Istanbul, Turkey, August 2–5.
 
Clifford, A.H. (1954). Naturally totally ordered commutative semigroups. American Journal of Mathematics, 76, 631–646.
 
De Baets, B., Mesiar, R. (2002). Ordinal sums of aggregation operators. In: Technologies for Constructing Intelligent Systems, Vol. 2. Springer, Berlin Heidelberg, pp. 137–147.
 
Dujmović, J. (2018). Soft Computing Evaluation Logic: The LSP Decision Method and Its Applications. John Wiley and Sons, Hoboken.
 
Eurostat (2003). Regions Nomenclature of Territorial Units for Statistics. Office for Official Publications of the European Communities, Luxembourg.
 
Fodor, J., Yager, R.R., Rybalov, A. (1997). Structure of uninorms. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 5, 411–427.
 
Hirota, K., Pedrycz, W. (1999). Fuzzy computing for data mining. Proceedings of IEEE, 87, 1575–1600.
 
Holzinger, A., Malle, B., Kieseberg, P., Roth, P.M., Müller, H., Reihs, R., Zatloukal, K. (2017). Machine learning and knowledge extraction in digital pathology needs an integrative approach. In: Holzinger, A., Goebel, R., Ferri, M., Palade, V. (Eds.), Towards Integrative Machine Learning and Knowledge Extraction. Springer, Cham, pp. 13–50.
 
Hudec, M. (2017). Merging validity and coverage for measuring quality of data summaries. In: Kulczycki, P., Kóczy, L.T., Mesiar, R., Kacprzyk, J. (Eds.), Information Technology and Computational Physics. Springer, Cham, pp. 71–85.
 
Hudec, M., Bednárová, E., Holzinger, A. (2018). Augmenting statistical data dissemination by short quantified sentences of natural language. Journal of Official Statistics, 34(4), 981–1010.
 
Hudec, M., Mináriková, E., Mesiar, R., Saranti, A., Holzinger, A. (2021). Classification by ordinal sums of conjunctive and disjunctive functions for explainable AI and interpretable machine learning solutions. Knowledge–Based Systems, 220, 106916.
 
Jain, A., Keller, J.M. (2015). On the computation of semantically ordered truth values of linguistic protoform summaries. In: Proceedings of the 2015 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE 2015), Istanbul, Turkey, August 2–5.
 
Kacprzyk, J., Strykowski, P. (1999). Linguistic data summaries for intelligent decision support. In: Proceedings of the fourth European Workshop on Fuzzy Decision Analysis and Recognition Technology for Management, Planning and Optimization (EFDAN 1999), Dortmund, Germany, June 14–15.
 
Kacprzyk, J., Yager, R.R. (2001). Linguistic summaries of data using fuzzy logic. International Journal of General Systems, 30(2), 133–154.
 
Kacprzyk, J., Zadrożny, S. (2005). Linguistic database summaries and their protoforms: towards natural language based knowledge discovery tools. Information Sciences, 173, 281–304.
 
Kacprzyk, J., Zadrożny, S. (2010). Modern data-driven decision support systems: the role of computing with words and computational linguistics. International Journal of General Systems, 39(4), 133–154.
 
Kacprzyk, J., Wilbik, A., Zadrożny, S. (2006). Linguistic summarization of trends: a fuzzy logic based approach. In: Proceedings of the 11th Information Processing and Management of Uncertainty in Knowledge Based Systems (IPMU 2006), Paris, France, July 2–7.
 
Klement, E.P., Mesiar, R., Pap, E. (1996). On the relationship of associative compensatory operators to triangular norms and connorms. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 4, 129–144.
 
Klement, E.P., Mesiar, R., Pap, E. (2000). Triangular Norms. Kluwer, Dordrecht.
 
Lesot, M.-J., Moyse, G., Bouchon-Meunier, B. (2016). Interpretability of fuzzy linguistic summaries. Fuzzy Sets and Systems, 292, 307–317.
 
Liétard, L. (2006). A new definition for linguistic summaries of data. In: Proceedings of the 2008 IEEE International Conference on Fuzzy Systems, Hong Kong, China, June 1–6.
 
Paliulionis, V. (2000). Intelligent GIS: architectural issues and implementation methods. Informatica, 11, 269–280.
 
Radojević, D. (2008). Interpolative realization of Boolean algebra as a consistent frame for gradation and/or fuzziness. In: Nikravesh, M., Kacprzyk, J., Zadeh, L.A. (Eds.), Forging New Frontiers: Fuzzy Pioneers II, Studies in Fuzziness and Soft Computing. Springer-Verlag, Berlin Heidelberg, pp. 295–318.
 
Rasmussen, D., Yager, R.R. (1997). Summary SQL – a fuzzy tool for data mining. Intelligent Data Analysis, 1, 49–58.
 
Reiter, E. (2017). Non-experts struggle with information graphics. https://ehudreiter.com/2017/10/02/non-experts-struggle-graphs/. Accessed 4 February 2021.
 
Ruspini, E.H. (1969). A new approach to clustering. Information and Control, 15(1), 22–32.
 
Vaisman, A., Zimányi, E. (2014). Data Warehouse Systems – Design and Implementation. Springer, Berlin Heidelberg.
 
Wilbik, A., Havens, T.C., Wilkin, T. (2020). On a paradox of extended linguistic summaries. In: Proceedings of the 2020 IEEE International Conference on Fuzzy Systems, Glasgow, UK, July 19–24.
 
Wu, D., Mendel, J.M., J., J. (2010). Linguistic summarization using if-then rules. In: Proceedings of the 2010 IEEE International Conference on Fuzzy Systems, Barcelona, Spain, July 18–23.
 
Yager, R.R. (1982). A new approach to the summarization of data. Information Sciences, 28, 69–86.
 
Yager, R.R., Rybalov, A. (1996). Uninorm aggregation operators. Fuzzy Sets and Systems, 80, 111–120.
 
Zadeh, L.A. (1965). Fuzzy sets. Information and Control, 8, 338–353.
 
Zadeh, L.A. (1975). The concept of a linguistic variable and its application to approximate reasoning: Part I. Information Sciences, 8, 199–249.
 
Zadeh, L.A. (1983). A computational approach to fuzzy quantifiers in natural languages. Computers & Mathematics with Applications, 9, 149–184.
 
Zadrożny, S., De Tré, G., De Caluwe, R., Kacprzyk, J. (2008). An overview of fuzzy approaches to flexible database querying. In: Galindo, J. (Ed.), Handbook of Research on Fuzzy Information Processing in Databases. Information Science Reference, Hershey, pp. 34–54.

Biographies

Hudec Miroslav
miroslav.hudec@vsb.cz

M. Hudec is an associate professor at the University of Economics in Bratislava, Faculty of Economic Informatics (Slovak Republic) and VSB – Technical University of Ostrava, Faculty of Economics (Czech Republic). He received the PhD degree from the University of Belgrade (Serbia). His work is mainly focused on fuzzy logic, knowledge discovery and business intelligence. He is a member of program committees of several international conferences and serves as an associate editor in Applied Soft Computing. He has published more than 50 articles including a monograph in Springer.

Malovcová Kristína
malovcova.kristina@dovera.sk

K. Malovcová received a master’s degree from the University of Economics in Bratislava, Faculty of Economic Informatics (Slovak Republic) in June 2021. In her final thesis she dealt with the flexible calculation of linguistic summaries and interpretation of results on a map. She works as a junior analyst at the health insurance company DÔVERA zdravotná poisťovňa, a.s., Slovak Republic.

Trumic Rijad
rijad.trumic@vsb.cz

R. Trumic holds a master in business administration from the University of Applied Sciences Munich, Germany. Currently, he is a PhD student at the Faculty of Economics, VSB – Technical University of Ostrava, Czech Republic. He has been working in supply chain management for BMW in Munich for many years and his research is mainly related to cost avoidance in change management.

Rakovská Eva
eva.rakovska@euba.sk

E. Rakovská received her PhD degree in applied informatics at the University of Economics in Bratislava (Slovak Republic) in 2010. She received a master’s degree in mathematics at Comenius University in 1986. She worked as a programmer and IT developer in various companies, including a hospital. She is an assistant professor at the University of Economics in Bratislava, Faculty of Economic Informatics since 2001. Her research interests are oriented toward applied informatics and knowledge engineering. The main impact is on the soft computing application and expert systems.


Full article Related articles Cited by PDF XML
Full article Related articles Cited by PDF XML

Copyright
© 2022 Vilnius University
by logo by logo
Open access article under the CC BY license.

Keywords
linguistic summary visualization on maps quality measure of summary aggregation function relative quantifier COVID-19

Funding
This work was partially supported by the Ministry of Education, Youth and Sports of the Czech Republic under Grant SGS No. SP2021/86; and partially supported by the Ministry of Education, Science, Research and Sport of the Slovak Republic under Grant 025EU-4/2021.

Metrics
since January 2020
836

Article info
views

431

Full article
views

441

PDF
downloads

119

XML
downloads

Export citation

Copy and paste formatted citation
Placeholder

Download citation in file


Share


RSS

INFORMATICA

  • Online ISSN: 1822-8844
  • Print ISSN: 0868-4952
  • Copyright © 2023 Vilnius University

About

  • About journal

For contributors

  • OA Policy
  • Submit your article
  • Instructions for Referees
    •  

    •  

Contact us

  • Institute of Data Science and Digital Technologies
  • Vilnius University

    Akademijos St. 4

    08412 Vilnius, Lithuania

    Phone: (+370 5) 2109 338

    E-mail: informatica@mii.vu.lt

    https://informatica.vu.lt/journal/INFORMATICA
Powered by PubliMill  •  Privacy policy