Abstract
We propose a workflow for an individual sociologist to be able to use quantitative content analysis in small-scale short-term research projects. The key idea of the approach is to generate a domain-oriented dictionary for researchers with limited resources. The workflow starts like a typical one and then deviates to include content analysis. First, the researcher performs deductive analysis which results in an interview guide. Second, the researcher conducts the small number of interviews to collect a domain-oriented labelled text corpus. Third, a domain-oriented dictionary is generated for the following content analysis. We propose and compare a number of methods to automatically extract a domain-oriented dictionary from a labelled corpus. Some properties of the proposed workflow are empirically studied based on a sociological research on volunteering in Russia.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
References
Arora, S., Ge, R., Halpern, Y., Mimno, D., Moitra, A., Sontag, D., Wu, Y., Zhu, M.: A practical algorithm for topic modeling with provable guarantees. arXiv preprint arXiv:1212.4777 (2012)
Arora, S., Ge, R., Moitra, A.: Learning topic models-going beyond svd. In: 2012 IEEE 53rd Annual Symposium on Foundations of Computer Science (FOCS), pp. 1–10. IEEE (2012)
Arsirij, E., Antoshhuk, S., Ignatenko, O., Trofimov, B.: Avtomatizacija razrabotki i obnovlenija semanticheskogo jadra sajta s dinamicheskim kontentom. Shtuchnijintelekt (2012)
Basili, R., Cammisa, M., Moschitti, A.: A Semantic Kernel to Classify Texts with Very Few Training Examples. Informatica (Slovenia) 30, 163–172 (2006)
Baziz, M., Boughanem, M., Aussenac-Gilles, N.: Conceptual indexing based on document content representation. In: Crestani, F., Ruthven, I. (eds.) CoLIS 2005. LNCS, vol. 3507, pp. 171–186. Springer, Heidelberg (2005)
Bengston, D.N., Xu, Z.: Changing national forest values: a content analysis. Research Paper NC-323. St. Paul, MN: US Dept. of Agriculture, Forest Service, North Central Forest Experiment Station (2006)
Berelson, B.: Content analysis in communication research (1952)
von dem Berge, B., Poguntke, T., Obert, P., Tipei, D.: Measuring intra-party democracy
Cristianini, N., Shawe-Taylor, J., Lodhi, H.: Latent semantic kernels. Journal of Intelligent Information Systems 18(2–3), 127–152 (2002)
Khalifa, O., Corne, D.W., Chantler, M., Halley, F.: Multi-objective topic modeling. In: Purshouse, R.C., Fleming, P.J., Fonseca, C.M., Greco, S., Shaw, J. (eds.) EMO 2013. LNCS, vol. 7811, pp. 51–65. Springer, Heidelberg (2013)
Kuznecov, A.M.: Strukturno-semanticheskie parametry v leksike: na materiale anglijskogo jazyka. Nauka (1980)
Kvale, S., Brinkmann, S.: Interviews: Learning the craft of qualitative research interviewing. Sage (2009)
Manning, C.D., Raghavan, P., Schütze, H., et al.: Introduction to information retrieval, vol. 1. Cambridge university press Cambridge (2008)
Neuendorf, K.: Computer content analysis programs (2015). http://academic.csuohio.edu/kneuendorf/content/cpuca/ccap.html (Accessed July 13, 2015])
Newman, D., Karimi, S., Cavedon, L.: External evaluation of topic models. In: Australasian Doc. Comp. Symp., 2009. Citeseer (2009)
Newman, D., Lau, J.H., Grieser, K., Baldwin, T.: Automatic evaluation of topic coherence. In: Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 100–108. Association for Computational Linguistics (2010)
Sebastiani, F.: Machine learning in automated text categorization. ACM Computing Surveys 34(1), 1–47 (2002)
Stemler, S.: An overview of content analysis. Practical Assessment, Research & Evaluation 7(17), 137–146 (2001)
Voroncov, K.V., Potapenko, A.A.: Reguljarizacija verojatnostnyh tematicheskih modelej dlja povyshenija interpretiruemosti i opredelenija chisla tem. Mezhdunarodnaja konferencija po komp’juternoj lingvistike “Dialog”, pp. 676–687 (2014)
Vorontsov, K., Potapenko, A.: Pregularization, robustness and sparsity of probabilistic topic models. Computer Research and Modeling 4(4), 693–706 (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Saburova, M., Maysuradze, A. (2015). A Low Effort Approach to Quantitative Content Analysis. In: Klinov, P., Mouromtsev, D. (eds) Knowledge Engineering and Semantic Web. KESW 2015. Communications in Computer and Information Science, vol 518. Springer, Cham. https://doi.org/10.1007/978-3-319-24543-0_13
Download citation
DOI: https://doi.org/10.1007/978-3-319-24543-0_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24542-3
Online ISBN: 978-3-319-24543-0
eBook Packages: Computer ScienceComputer Science (R0)