Abstract
VisIRML is a visual analytic system to classify and display unstructured data. Subject matter experts define topics by iteratively training a machine learning (ML) classifier by labeling of sample articles facilitated via information retrieval (IR) query expansion—i.e. semi-supervised machine learning. The resulting classifier produces high quality labels better than comparable semi-supervised learning techniques. While multiple visualization approaches were considered to depict these articles, users exhibited a strong preference for a map-based representation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Moere, A.V.: Towards designing persuasive ambient visualization. In: Issues in the Design and Evaluation of Ambient Information Systems Workshop. Citeseer (2007)
Hagerman, C., Brath, R., Langevin, S.: Visual analytic system for subject matter expert document tagging using information retrieval and semi-supervised machine learning. In: 2019 23rd International Conference Information Visualisation (IV), 2019, pp. 234–240 (2019). https://doi.org/10.1109/IV.2019.00047
Lang, A.: Aesthetics in Information Visualization, Same Book As Above
Kosara, R.: Visualization criticism-the missing link between information visualization and art. In: 2007 11th International Conference Information Visualization (IV'07), pp. 631–636. IEEE (2007)
Bafadikanya, B.: Attractive visualization. In: Trends in information visualization. In: Baur, D., Sedlmair, M., Wimmer, R., Chen, Y.-X., Streng, S., Boring, S., De Luca, A., Butz, A., (eds.), Technical ReportLMU-MI-2010-1, Apr. 2010. ISSN 1862-5207. University of Munich, Department of Computer Science, Media Informatics Group (2010)
Viegas, F.: Artifacts of the Presence Era, flickr.com, CC-BY-2.0 by Viegas (2009)
Lozano-Hemmer, R.: Pulse. Flickr.com. CC-BY-SA-2.0 by Anokarina (2019)
The Nasdaq Stock Market, Inc. NASDAQ MarketSite Tower. © Copyright 2000 reprinted with the permission of The Nasdaq Stock Market, Inc., Photo credit: Peter Aaron/ Esto
Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press (2009)
Pustejovsky, J., Stubbs, A.: Natural Language Annotation for Machine Learning. O’Reilly Media (2012)
Pujara, J., London, B., Getoor, L.: Reducing label cost by combining feature labels and crowdsourcing. In: Proceedings of the 28th International Conference on Machine Learning (2011)
Settles, B.: Active Learning Literature Survey. University of Wisconsin, Madison (2010)
Li, X., Liu, B.: Learning to classify texts using positive and unlabeled data. In: Proceedings of the 18th International Joint Conference on Artificial Intelligence (2003)
Brath, R., Matusiak, M.: Automated annotations. In: An IEEE VIS Workshop on Visualization for Communication (VisComm) (2018)
Edward Segel and Jeffrey Heer: Narrative visualization: telling stories with data. IEEE Trans. Visual Comput. Graph. 16(6), 1139–1148 (2010)
UCI Machine Learning Repository: Reuters-21578 Text Categorization Collection Data Set. archive.ics.uci.edu, 2016. https://archive.ics.uci.edu/ml/datasets/Reuters-21578+Text+Categorization+Collection.
Leetaru, K., Schrodt, P.A.: Gdelt: Global data on events, location, and tone, 1979–2012. In: ISA Annual Convention, vol. 2, no. 4. Citeseer (2013)
Ramos, J.: Using tf-idf to determine word relevance in document queries. In: Proceedings of the First Instructional Conference on Machine Learning 2003 Dec 3, vol. 242, No. 1, pp. 29–48 (2003)
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient Estimation of Word Representations in Vector Space. arXiv:1301.3781
Completion Suggester Elasticsearch Reference. Elastic.co, 2016. https://www.elastic.co/guide/en/elasticsearch/reference/2.1/search-suggesters-completion.html
Dai, A.M., Olah, C., Le, Q.V.: Document embedding with paragraph vectors (2015). arXiv preprint arXiv:1507.07998
Avarikioti, G., Emiris, I.Z., Psarros, I., Samaras, G.: Practical linear-space Approximate Near Neighbors in high dimension (2016). arXiv preprint arXiv:1612.07405
Hearst, M.: Search User Interfaces. Cambridge University Press (2009)
Shneiderman, B.: The eyes have it: a task by data type taxonomy for information visualizations. In: The Craft of Information Visualization 2003 Jan 1, pp. 364–371. Morgan Kaufmann
Meng, X., Bradley, J., Yavuz, B., Sparks, E., Venkataraman, S., Liu, D., Freeman, J., Tsai, D.B., Amde, M., Owen, S., Xin, D., Xin, R., Franklin, M.J., Zadeh, R., Zaharia, M., Talwalkar, A.: MLlib: machine learning in apache spark. J. Mach. Learn. Res. 17(1), 1235–1241 (2016)
Lau, A., Moere, A.V.: Towards a model of information aesthetics in information visualization. In: 2007 11th International Conference Information Visualization (IV'07). IEEE (2007)
Munzner, T.: A nested model for visualization design and validation. IEEE Trans. Visual Comput. Graph. 15(6), 921–928 (2009)
Pousman, Z., Stasko, J., Mateas, M.: Casual information visualization: Depictions of data in everyday life. IEEE Trans. Visual. Comput. Graph. 13(6), 1145–1152 (2007)
Song, Y.: Automatic tag recommendation algorithms for social recommender systems. ACM Trans. Comput. Logic (2008)
Guan, Z., Wang, C., Bu, J., Chen, C., Yang, K., Cai, D., He, X.: Document recommendation in social tagging services. In: Proceedings of the 19th International Conference on World Wide Web (WWW ‘10), pp. 391–400. ACM, New York (2010)
Ha-Thuc, V., Mejova, Y., Harris, C., Srinivasan, P.: News event modeling and tracking in the social web with ontological guidance. 2010 IEEE Fourth International Conference on Semantic Computing, pp. 414–419 (2010)
Ha-Thuc, V., Renders, J.M.: Large-scale hierarchical text classification without labeled data. In: Proceedings of the Fourth ACM International Conference on Web Search and Data Mining (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Hagerman, C., Brath, R., Langevin, S. (2022). VisIRML: Visualization with an Interactive Information Retrieval and Machine Learning Classifier. In: Kovalerchuk, B., Nazemi, K., Andonie, R., Datia, N., Banissi, E. (eds) Integrating Artificial Intelligence and Visualization for Visual Knowledge Discovery. Studies in Computational Intelligence, vol 1014. Springer, Cham. https://doi.org/10.1007/978-3-030-93119-3_13
Download citation
DOI: https://doi.org/10.1007/978-3-030-93119-3_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-93118-6
Online ISBN: 978-3-030-93119-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)