Abstract
The theory of concept (or Galois) lattices provides a simple and formal approach to conceptual clustering. In this paper we present GALOIS, a system that automates and applies this theory. The algorithm utilized by GALOIS to build a concept lattice is incremental and efficient, each update being done in time at most quadratic in the number of objects in the lattice. Also, the algorithm may incorporate background information into the lattice, and through clustering, extend the scope of the theory. The application we present is concerned with information retrieval via browsing, for which we argue that concept lattices may represent major support structures. We describe a prototype user interface for browsing through the concept lattice of a document-term relation, possibly enriched with a thesaurus of terms. An experimental evaluation of the system performed on a medium-sized bibliographic database shows good retrieval performance and a significant improvement after the introduction of background knowledge.
Article PDF
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
References
Anderson, J., & Matessa, M. (1992). Exploration of an incremental Bayesian algorithm for categorization. Machine Learning, 9:275–308.
Carpineto, C., & Romano, G. (1993a). GALOIS:An order-theoretic approach to conceptual clustering. Proceedings of the Tenth International Conference on Machine Learning (pp. 33–40), Amherst, MA: Morgan Kaufmann.
Carpineto, C., & Romano, G. (1993b). Adding background knowledge to a Galois lattice classification structure (Technical Report 5T02093). Rome, Italy: Fondazione Ugo Bordoni.
Carpineto, C., & Romano, G. (1994). Dynamically bounding browsable retrieval spaces: an application to Galois lattices. Proceedings of RIAO94: Intelligent Multimedia Information Retrieval Systems and Management (pp. 520–533). New York, N.Y.: Jouve.
Crouch, D., Crouch, C., & Andreas, G. (1989). The use of cluster hierarchies in hypertext information retrieval. Proceedings of the ACM Hypertext’ 89 Conference (pp. 225–237). Pittsburgh, PA: AC
Cutting, D., Karger, D., Pedersen, J., & Tukey, J. (1992). Scatter/Gather: A cluster-based approach to browsing large document collections. Proceedings of the Fifteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 318–329). New York, N.Y.: ACM SIGIR Forum.
Davey, B., & Priestley, H. (1990). Introduction to Lattices and Order, Cambridge University Press, Cambridge, Great Britain.
Fisher, D. (1987). Knowledge acquisition via incremental conceptual clustering. Machine Learning, 2:139–172.
Frei, H., & Jauslin, J. (1983). Graphical presentation of information and services: a user oriented interface. Information Technology: Research and Development, 2:23–42.
Furnas, G. (1985). Experience with an adaptive indexing scheme. Proceedings of ACM CHI'85 Conference on Human Factors in Computings Systems (pp. 130–135). San Francisco, CA: ACM.
Furnas, G. (1986). Generalized fisheye views. Proceedings of ACM CHI'86 Conference on Human Factors in Computing Systems (pp. 16–23). Boston, MA: ACM.
Ganascia, J.-G. (1987). CHARADE: A rule system learning system. Proceedings of the Tenth International Conference on Artificial Intelligence (pp. 345–347). Milan, Italy: Morgan Kaufmann.
Ganter, B. (1988). Composition and decomposition of data in formal concept analysis. In H. Bock (Ed.), Classification and Related Methods of Data Analysis. North-Holland.
Gennari, J., Langley, P., & Fisher, D. (1989). Models of incremental concept formation. Artificial Intelligence, 40:12–61.
Godin, R., Gecsei, J., & Pichet, C. (1989). Design of a browsing interface for information retrieval. Proceedings of the Twelfth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 32–39). Cambridge, MA: ACM SIGIR Forum.
Godin, R., Missaoui, R., & Alaoui, H. (1991). Learning algorithms using a Galois lattice structure. Proceedings of the 1991 IEEE International Conference on Tools for AI (pp. 22–29). San Jose, CA: IEEE Computer Society Press.
Godin, R., Missaoui, R., & April, A. (1993). Experimental comparison of navigation in a Galois lattice with conventional information retrieval methods. International Journal of Man-machine Studies, 38:747–767.
Gu´enoche, A. (1990). Construction du treillis de Galois d'une relation binaire. Math´matiques et Sciences Humaines, 109:41
Hadzikadic, M., & Yun, D. (1989). Concept formation by incremental conceptual clustering. Proceedings of the Eleventh International Joint Conference on Artificial Intelligence (pp. 831–836). Detroit, MI: Morgan Kaufman
Hanson, S., & Bauer, M. (1989). Conceptual clustering, categorization, and polymorphy. Machine Learning, 3:343–372.
Hull, D. (1993). Using statistical testing in the evaluation of retrieval experiments. Proceedings of the Sixteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 329–338). Pittsburgh, PA: ACM SIGIR Forum, 1993.
Lebowitz, M. (1986). Concept learning in a rich input domain: generalization-based memory. In R.S. Michalski, J.G. Carbonell, & T.M. Mitchell (Eds.), Machine Learning: An Artificial Intelligence Approach (Vol. 2). Morgan Kaufmann, San Mateo, CA.
Levinson, R. (1984). A self-organizing retrieval system for graphs. Proceedings of the Fourth National Conference on Artificial Intelligence (pp. 203–206). Austin, TX: Morgan Kaufmann.
Maarek, Y., Berry, D., & Kaiser, G. (1991). An information retrieval approach for automatically constructing software libraries. IEEE Transactions on Software Engineering, 17(8):800–813.
Marchionini, G., & Shneiderman, B. (1988). Finding facts vs. browsing knowledge in hypertext systems. IEEE Computer, 21:70–80.
Martin, J., & Billman, D. (1994). Acquiring and combining overlapping concepts. Machine Learning, 16(1-2):121–155.
McKusick, K., & Langley, P. (1991). Constraints on tree structure in concept formation. Proceedings of the Thirteenth International Joint Conference on Artificial Intelligence (pp. 810–816). Sidney, Australia: Morgan Kaufmann.
Michalski, R.S. (1983). A theory and methodology of inductive learning. Artificial Intelligence, 20:111–161.
Michalski, R.S., Stepp, R. (1983). Learning from observation: conceptual clustering. In R.S. Michalski, J.G. Carbonell, & T. M. Mitchell (Eds.), Machine Learning: An Artificial Intelligence Approach (Vol. 1). Palo Alto, CA: Tioga Publishing.
Mitchell, T.M. (1982). Generalization as search. Artificial Intelligence, 18:203–226.
Nielsen, J. (1990). Hypertext & Hypermedia, Academic Press, San Diego, CA.
Oosthuizen, G., & McGregor, D. (1988). Induction through knowledge base normalisation. In Proceedings of the Eight European Conference on Artificial Intelligence (pp. 396–401). Munich, Germany: Pitman.
Pedersen, G. (1993). A browser for bibliographic information retrieval based on an application of lattice theory. Proceedings of the Sixteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 270-279). Pittsburgh, PA.
Piatetsky-Shapiro, G., & Frawley, W. (Eds.) (1992). Knowledge Discovery in Databases, AAAI Press, Palo Alto, CA.
Salton, G., & McGill, M. (1983). Introduction to Modern Information Retrieval, McGraw Hill, New York, N.Y.
Shneiderman, B. (1987). Designing the User Interface: Strategies for Effective Human-Computer Interaction, Addison Wesley, Reading, MA.
Stepp, R., & Michalski, R.S. (1986). Conceptual clustering: inventing goal-oriented classifications of structured objects. In R.S. Michalski, J.G. Carbonell, & T.M. Mitchell (Eds.), Machine Learning: An Artificial Intelligence Approach (Vol. 2). Morgan Kaufmann, San Mateo, CA.
Tague-Sutcliffe, J. (1992). The pragmatics of information retrieval experimentation, revisited. Information Processing & Management, 28(4):467–490.
Thompson, K., Langley, P., & Iba, W. (1991). Using background knowledge in concept formation. Proceedings of the Eight International Workshop on Machine Learning (pp. 554–558). Evanston, IL: Morgan Kaufmann.
Thompson, R., & Croft, B. (1989). Support for browsing in an intelligent text retrieval system. International Journal of Man-machine Studies, 30:639–668.
van Rijsbergen, J., & Croft, B. (1975). Document clustering: An evaluation of some experiments with the Cranfield 1400 collection. Information Processing & Management, 11:171–182.
Wille, R. (1984). Line diagrams of hierarchical concept systems. International Classification, 2:77–86.
Willet, P. (1988). Recent trends in hierarchic document clustering: A critical review. Information Processing & Management, 24(5):577–597.
Yu, C., Meng, W., & Park, S. (1989). A framework for effective retrieval. ACMTransactions on Database Systems, 14(2):147–167.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Carpineto, C., Romano, G. A Lattice Conceptual Clustering System and Its Application to Browsing Retrieval. Machine Learning 24, 95–122 (1996). https://doi.org/10.1023/A:1018050230279
Issue Date:
DOI: https://doi.org/10.1023/A:1018050230279