Skip to main content

Discovering Relationships Among Catalogs

  • Conference paper
Discovery Science (DS 2004)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3245))

Included in the following conference series:

Abstract

When we have a large amount of information, we usually use categories with a hierarchy, in which all information is assigned. The Yahoo! Internet directory is one such example. This paper proposes a new method of integrating two catalogs with hierarchical categories. The proposed method uses not only the contents of information but also the structures of both hierarchical categories. In order to evaluate the proposed method, we conducted experiments using two actual Internet directories, Yahoo! and Google. The results show improved performance compared with the previous approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Agrawal, R., Srikant, R.: On integrating catalogs. In: Proc. of the Tenth Int. WWW Conf., pp. 603–612 (2001)

    Google Scholar 

  2. Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines. Cambridge University Press, Cambridge (2000)

    Google Scholar 

  3. dmoz (2003), http://dmoz.org/

  4. Doan, A., Madhavan, J., Domingos, P., Halevy, A.: Learning to map between ontologies on the semantic web. In: Proc. of the 11th Int. WWW Conf. (2002)

    Google Scholar 

  5. Fleiss, J.: Statistical Methods for Rates and Proportions. John Wiley & Sons, Chichester (1973)

    MATH  Google Scholar 

  6. Google (2003), http://directory.google.com/

  7. Ichise, R., Takeda, H., Honiden, S.: Integrating multiple internet directories by instance-based learning. In: Proc. of the 18th Int. Joint Conf. on AI, pp. 22-28 (2003)

    Google Scholar 

  8. Koller, D., Sahami, M.: Hierarchically classifying documents using very few words. In: Proc. of the 14th Int. Conf. on Machine Learning, pp. 170–178 (1997)

    Google Scholar 

  9. McCallum, A.K.: Bow: A toolkit for statistical language modeling, text retrieval, classification and clustering mccallum/bow/ (1996), http://www.cs.cmu.edu/

  10. McCallum, A.K., Rosenfeld, R., Mitchell, T.M., Ng, A.Y.: Improving text classification by shrinkage in a hierarchy of classes. In: Proc. of the 15th Int. Conf. on Machine Learning, pp. 359–367 (1998)

    Google Scholar 

  11. McGuinness, D.L., Fikes, R., Rice, J., Wilder, S.: An environment for merging and testing large ontologies. In: Proc. of the Conf. on Principles of Knowledge Representation and Reasoning, pp. 483–493 (2000)

    Google Scholar 

  12. Mitchell, T.M.: Machine Learning. McGraw-Hill, New York (1997)

    MATH  Google Scholar 

  13. Noy, N.F., Musen, M.A.: Prompt: Algorithm and tool for automated ontology merging and alignment. In: Proc. of the 17th National Conf. on AI, pp. 450–455 (2000)

    Google Scholar 

  14. Omelayenko, B., Fensel, D.: An analysis of B2B catalogue integration problems. In: Proc. of the Int. Conf. on Enterprise Information Systems, pp. 945–952 (2001)

    Google Scholar 

  15. Stumme, G., Madche, A.: FCA-Merge: Bottom-up merging of ontologies. In: Proc. of the 17th Int. Joint Conf. on AI, pp. 225–230 (2001)

    Google Scholar 

  16. Sun, A., Lim, E.: Hierarchical Text Classification and Evaluation. In: Proc. of IEEE Int. Conf. on Data Mining, pp. 521–528 (2001)

    Google Scholar 

  17. Yahoo! (2003), http://www.yahoo.com/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ichise, R., Hamasaki, M., Takeda, H. (2004). Discovering Relationships Among Catalogs. In: Suzuki, E., Arikawa, S. (eds) Discovery Science. DS 2004. Lecture Notes in Computer Science(), vol 3245. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30214-8_33

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-30214-8_33

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-23357-2

  • Online ISBN: 978-3-540-30214-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics