Abstract
Clustering Web services, that is, grouping together services with similar functionality, helps improve both the accuracy and the efficiency of Web service search engines. An important limitation of existing Web service clustering approaches is that they rely solely on WSDL (Web Services Description Language) documents. A recent trend is to use user-contributed tagging data to improve the performance of service clustering. Nonetheless, these approaches fail to fully leverage the information carried by the tagging data and hence improve clustering performance only marginally. In this paper, we propose a novel approach that seamlessly integrates tagging data and WSDL documents through augmented Latent Dirichlet Allocation (LDA). We also develop three strategies for preprocessing the tagging data before it is integrated into the LDA framework for clustering. Comprehensive experiments based on real data and the implementation of a Web service search engine demonstrate the effectiveness of the proposed LDA-based service clustering approach.
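To make the idea of clustering services through topics concrete, the following is a minimal sketch only, not the WT-LDA model proposed in the paper. It runs plain LDA (collapsed Gibbs sampling) over a bag of words built from each service's WSDL terms plus its tags, then assigns each service to its most probable topic. The toy services, the function names, and the `tag_weight` knob (which simply repeats tag tokens) are all hypothetical and are not taken from the paper or from its three preprocessing strategies.

```python
import random

def lda_gibbs(docs, num_topics, iters=200, alpha=0.1, beta=0.01, seed=0):
    """Plain LDA via collapsed Gibbs sampling; returns per-document topic distributions."""
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    vocab_size = len(vocab)
    doc_topic = [[0] * num_topics for _ in docs]               # tokens of doc d in topic k
    topic_word = [[0] * vocab_size for _ in range(num_topics)]  # count of word w in topic k
    topic_total = [0] * num_topics                              # tokens in topic k
    assignments = []
    # Random initial topic for every token.
    for d, doc in enumerate(docs):
        zs = []
        for w in doc:
            k = rng.randrange(num_topics)
            zs.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[w]] += 1
            topic_total[k] += 1
        assignments.append(zs)
    # Resample each token's topic conditioned on all other assignments.
    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                wid = word_id[w]
                k = assignments[d][i]
                doc_topic[d][k] -= 1
                topic_word[k][wid] -= 1
                topic_total[k] -= 1
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][wid] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][wid] += 1
                topic_total[k] += 1
    # Smoothed topic proportions per document (each row sums to 1).
    theta = []
    for d, doc in enumerate(docs):
        denom = len(doc) + num_topics * alpha
        theta.append([(doc_topic[d][t] + alpha) / denom for t in range(num_topics)])
    return theta

def cluster_services(services, num_topics, tag_weight=2, **lda_kwargs):
    """Assign each service to its most probable topic.

    `services` maps a service name to (wsdl_terms, tags). Repeating tags
    `tag_weight` times is an illustrative assumption, not the paper's method.
    """
    names = list(services)
    docs = [services[n][0] + services[n][1] * tag_weight for n in names]
    theta = lda_gibbs(docs, num_topics, **lda_kwargs)
    return {n: max(range(num_topics), key=lambda t: row[t])
            for n, row in zip(names, theta)}

# Hypothetical toy data: (terms extracted from WSDL, user-contributed tags).
services = {
    "WeatherForecast": (["get", "weather", "forecast", "city"], ["weather", "forecast"]),
    "ClimateInfo": (["get", "temperature", "city", "weather"], ["weather", "climate"]),
    "StockQuote": (["get", "stock", "quote", "symbol"], ["finance", "stock"]),
    "CurrencyRate": (["get", "exchange", "rate", "currency"], ["finance", "currency"]),
}

if __name__ == "__main__":
    print(cluster_services(services, num_topics=2))
```

Appending tags to the WSDL terms is the simplest way to let both sources influence the topics; a model that treats tags as a separate observed variable, as the abstract's "augmented" LDA suggests, would need a different sampler.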
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chen, L., Wang, Y., Yu, Q., Zheng, Z., Wu, J. (2013). WT-LDA: User Tagging Augmented LDA for Web Service Clustering. In: Basu, S., Pautasso, C., Zhang, L., Fu, X. (eds) Service-Oriented Computing. ICSOC 2013. Lecture Notes in Computer Science, vol 8274. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-45005-1_12
DOI: https://doi.org/10.1007/978-3-642-45005-1_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-45004-4
Online ISBN: 978-3-642-45005-1
eBook Packages: Computer Science, Computer Science (R0)