Abstract
Domain specific information is increasingly available on the Web in form of document repositories. In specialized domains such as agriculture, bio-medical sciences and health-care, this information is required by various domain experts. Health-care experts such as researchers and practitioners require it during health-care delivery and for educational purposes. These users differ from the Web users and database users. Most of the existing document repositories on the Web have alphabetical and keyword based searches. These are not sufficient for the expert users with precise and complex queries, who require in-depth results within time constraints. Their information needs can be supported by providing user-level schema. Such a schema can support database-style high-level query languages over these repositories. Seeking specialized domain-specific information through queries is gaining importance. In this paper, a model for on-line document repositories is proposed. Queries can be performed with in-depth results. The model can be replicated to similarly structured document repositories in any given domain.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
References
A.D.A.M. Medical Encyclopedia (2011), http://www.drugs.com/medical_encyclopedia.html
Hanbury, A.: Medical information retrieval: an instance of domain-specific search. In: Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, USA, pp. 1191–1192 (2012)
Cai, D., Yu, S., Wen, J., Ma, W.-Y.: Extracting content structure for web pages based on visual representation. In: Zhou, X., Zhang, Y., Orlowska, M.E. (eds.) APWeb 2003. LNCS, vol. 2642, pp. 406–417. Springer, Heidelberg (2003)
Jenkins, C., Corritore, C.L., Wiedenbeck, S.: Patterns of Information Seeking on the Web: A Qualitative Study of Domain Expertise and Web Expertise. IT and Society 1(3), 64–89 (2003)
Fisher, D., DeLine, R., Czerwinski, M., Drucker, S.: Interactions with Big Data Analytics. Interactions 19(3), 50–59 (2012)
Braga, D., Campi, A., Ceri, S.: XQBE (XQuery By Example): A visual Interface to the Standard XML Query Language. ACM Trans. Database Syst. 30(2), 398–443 (2005)
Cai, D., Yu, S., Wen, J.-R., Ma, W.-Y.: Block-based Web Search. In: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Sheffield, United Kingdom, pp. 456–463 (2004)
Freire, S.M., Sundvall, E., Karlsson, D., Lambrix, P.: Performance of XML Databases for Epidemiological Queries in Archetype-Based EHRs. In: Proceedings Scandinavian Conference on Health Informatics, vol. 70, pp. 51–57 (2012)
Health Illustrated Encyclopedia (2011), http://adam.about.net/encyclopedia/
Health Line Medical Encyclopedia (2011), http://www.healthline.com/
HTML DOM Tutorial (2011), http://www.w3schools.com/htmldom/default.asp
Zadeh, L.A.: Fuzzy sets and information granularity. In: Fuzzy Sets, Fuzzy Logic, and Fuzzy Systems, pp. 433–448 (1996)
Laurent, M., Vickers, T.J.: Seeking Health Information On-line: Does Wikipedia Matter? Journal of the American Medical Informatics Association 16, 471–479 (2009)
Gschwandtner, M., Kritz, M., Boyer, C.: Requirements of the Health Professional Research, Technical Report D8.1.2, Khresmoi Project (2011)
MEDLINE (October 2012), http://www.nlm.nih.gov/bsd/pmresources.html
Medical World Search (2011), http://www.mwsearch.com/mwsframetemplate.htm?
Merck Manual, Home Health Handbook (August 2013), http://www.merckmanuals.com/home/index.html
Middle Georgia Orthopaedics Encyclopedia (2011), http://www.mgo.md/encyclopedia.cfm
National Library of Medicine Encyclopedia (October 2012), http://www.nlm.nih.gov/medlineplus/
National Library of Medicine, NLM (2011), http://www.nlm.nih.gov/
Proceedings of CIDR 2009, Fourth Biennial Conference on Innovative Data Systems Research, Asilomar, CA, USA, January 4-7 (2009)
PubMed (2011), http://www.ncbi.nlm.nih.gov/pubmed
White, R.W., Dumais, S., Teevan, J.: How Medical Expertise Influences Web Search Interaction. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 791–792 (2008)
Cohen, S., Kanza, Y., Kogan, Y., Nutt, W., Sagiv, Y., Serebrenik, A.: EquiX-A Search and Query Language for XML. Journal of the American Society for Information Science and Technology 53 (2000)
Abiteboul, S., Buneman, P., Suciu, D.: Data on the Web: from Relations to Semistructured Data and XML (2000)
University of Maryland, Medical Center, Encyclopedia (2011), http://www.umm.edu/ency/
Yan, X., Lau, R.Y.K., Song, D., Li, X., Ma, J.: Toward a Semantic Granularity Model for Domain-specific Information Retrieval. ACM Trans. Inf. Syst. 29(3), 15:1–15:46 (2011)
XQuery FLOWR Expressions (2011), http://www.w3schools.com/xquery/xquery_flwor.asp
Yao, J.: Granular Computing. In: Proceedings of IEEE International Conference on Information Granulation and Granular Relationships, vol. 1, pp. 326–329 (2005)
Zou, J., Le, D., Thoma, G.R.: Combining DOM tree and Geometric Layout Analysis for on-line Medical Journal Article Segmentation. In: Proceedings of the 6th ACM/IEEE-CS Joint Conference on Digital Libraries, Chapel Hill, NC, USA, pp. 119–128 (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Madaan, A., Chu, W. (2014). Handling Domain Specific Document Repositories for Application of Query Languages. In: Madaan, A., Kikuchi, S., Bhalla, S. (eds) Databases in Networked Information Systems. DNIS 2014. Lecture Notes in Computer Science, vol 8381. Springer, Cham. https://doi.org/10.1007/978-3-319-05693-7_10
Download citation
DOI: https://doi.org/10.1007/978-3-319-05693-7_10
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-05692-0
Online ISBN: 978-3-319-05693-7
eBook Packages: Computer ScienceComputer Science (R0)