Abstract
Web pages are typically designed for visual interaction – they include many visual elements to guide the reader. However, when they are accessed in alternative forms such as in audio, these elements are not available and therefore they become inaccessible. This paper presents our ontology-based heuristic approach that automatically identifies visual elements of web pages and their roles. Our architecture has three major components: 1. automatic identification of visual elements of web pages, 2. automatic generation of heuristics as Jess rules from an ontology and 3. application of these heuristic rules to web pages for automatic annotation of visual elements and their roles. This paper first explains our architecture in detail and then presents our both technical and user evaluations of the proposed approach and architecture. Our technical evaluation shows that complexity is an important performance factor in role detection and our user evaluation shows that our proposed system has around 80% receptive accuracy, but the proposed knowledge base could be further improved for better accuracy.
An Erratum for this chapter can be found at http://dx.doi.org/10.1007/978-3-642-39200-9_51
Chapter PDF
Similar content being viewed by others
References
Kovacevic, M., Diligenti, M., Gori, M., Milutinovic, V.: Recognition of common areas in a web page using visual information: a possible application in a page classification. In: ICDM 2002, pp. 250–257. IEEE Computer Society, Washington, DC (2002)
Yin, X., Lee, W.S.: Understanding the function of web elements for mobile content delivery using random walk models. In: WWW 2005, pp. 1150–1151. ACM (2005)
Chen, Y., Xie, X., Ma, W.Y., Zhang, H.J.: Adapting web pages for small-screen devices. IEEE Internet Computing 9(1), 50–56 (2005)
Xiang, P., Shi, Y.: Recovering semantic relations from web pages based on visual cues. In: IUI 2006, pp. 342–344. ACM (2006)
Cai, D., Yu, S., Wen, J.R., Ma, W.Y.: Vips: a vision based page segmentation algorithm, MSR-TR-2003-79, Microsoft Research (2003)
Craig, J., Cooper, M.: Accessible rich internet applications (WAI-ARIA) 1.0 (2010), http://www.w3.org/TR/2010/WD-wai-aria-20100916/complete (retrieved on January 15, 2013)
Ahmadi, H., Kong, J.: Efficient web browsing on small screens. In: AVI 2008, pp. 23–30. ACM (2008)
Xiao, Y., Tao, Y., Li, W.: A dynamic web page adaptation for mobile device based on web2.0. In: ASEA 2008, pp. 119–122. IEEE Computer Society, USA (2008)
Chen, J., Zhou, B., Shi, J., Zhang, H., Fengwu, Q.: Function-based object model towards website adaptation. In: WWW 2001, pp. 587–596. ACM (2001)
Lin, S.H., Ho, J.M.: Discovering informative content blocks from web documents. In: SIGKDD 2002, pp. 588–593. ACM (2002)
Burget, R., Rudolfova, I.: Web page element classification based on visual features. In: ACIIDS 2009, pp. 67–72 (April 2009)
Liu, B., Chin, C.W., Ng, H.T.: Mining topic-specific concepts and definitions on the web. In: WWW 2003, pp. 251–260. ACM (2003)
Yi, L., Liu, B., Li, X.: Eliminating noisy information in web pages for data mining. In: SIGKDD 2003, pp. 296–305. ACM (2003)
Takagi, H., Asakawa, C., Fukuda, K., Maeda, J.: Site-wide annotation: reconstructing existing pages to be accessible. In: SIGACCESS 2002, pp. 81–88. ACM (2002)
Harper, S., Yesilada, Y.: Web authoring for accessibility (WAfA). JWS 5(3), 175–179 (2007)
Yesilada, Y., Harper, S., Goble, C., Stevens, R.: Screen readers cannot see. In: Koch, N., Fraternali, P., Wirsing, M. (eds.) ICWE 2004. LNCS, vol. 3140, pp. 445–458. Springer, Heidelberg (2004)
Plessers, P., Casteleyn, S., Yesilada, Y., Troyer, O.D., Stevens, R., Harper, S., Goble, C.: Accessibility: A web engineering approach. In: WWW 2005, Chiba, Japan, pp. 353–362 (2005)
Yesilada, Y., Stevens, R., Harper, S., Goble, C.: Evaluating DANTE: Semantic transcoding for visually disabled users. ACM TOCHI 14(3) (2007)
Alcic, S., Conrad, S.: Page segmentation by web content clustering. In: WIMS 2011, pp. 24:1–24:9. ACM (2011)
Kohlschütter, C., Nejdl, W.: A densitometric approach to web page segmentation. In: CIKM 2008, pp. 1173–1182. ACM (2008)
Yu, S., Cai, D., Wen, J.R., Ma, W.Y.: Improving pseudo-relevance feedback in web information retrieval using web page segmentation. In: WWW 2003, pp. 11–18. ACM (2003)
Friedman-Hill, E.: Jess the rule engine for the java platform (2008), http://herzberg.ca.sandia.gov/ (retrieved on November 27, 2012)
Michailidou, E.: ViCRAM: Visual Complexity Rankings and Accessibility Metrics. PhD thesis (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Akpınar, M.E., Yeşilada, Y. (2013). Heuristic Role Detection of Visual Elements of Web Pages. In: Daniel, F., Dolog, P., Li, Q. (eds) Web Engineering. ICWE 2013. Lecture Notes in Computer Science, vol 7977. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39200-9_12
Download citation
DOI: https://doi.org/10.1007/978-3-642-39200-9_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-39199-6
Online ISBN: 978-3-642-39200-9
eBook Packages: Computer ScienceComputer Science (R0)