Abstract
The goal of this paper is to describe a method to automatically extract all basic attributes namely actor, action, object, time and location which belong to an activity, and the transition between activities in each sentence retrieved from Japanese CGM (consumer generated media). Previous work had some limitations, such as high setup cost, inability of extracting all attributes, limitation on the types of sentences that can be handled, and insufficient consideration of interdependency among attributes. To resolve these problems, this paper proposes a novel approach that treats the activity extraction as a sequence labeling problem, and automatically makes its own training data. This approach has advantages such as domain-independence, scalability, and unnecessary hand-tagged data. Since it is unnecessary to fix the positions and the number of the attributes in activity sentences, this approach can extract all attributes and transitions between activities by making only a single pass over its corpus.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Matsuo, Y., Okazaki, N., Izumi, K., Nakamura, Y., Nishimura., T., Hasida, K.: Inferring long-term user properties based on users’ location history. In: Proc. IJCAI 2007, pp. 2159–2165 (2007)
Pentney, W., Kautz, H., Philipose, M., Popescu, A., Wang, S.: Sensor-Based Understanding of Daily Life via Large-Scale Use of Common Sense. In: Proc. AAAI 2006 (2006)
Pentney, W., Philipose, M., Bilmes, J., Kautz, H.: Learning Large Scale Common Sense Models of Everyday Life. In: Proc. AAAI 2007 (2007)
KDDI, Corp.: My Life Assist Service (2009), http://www.kddilabs.jp/english/tech/frontier.html
Perkowitz, M., Philipose, M., Fishkin, K., Patterson, D.J.: Mining Models of Human Activities from the Web. In: Proc. WWW 2004 (2004)
Kurashima, T., Fujimura, K., Okuda, H.: Discovering Association Rules on Experiences from Large-Scale Weblogs Entries. In: ECIR 2009, pp. 546–553 (2009)
Pasca, M., Lin, D., Bigham, J., Lifchits, A., Jain, A.: Organizing and Searching the World Wide Web of Facts - Step One: the One-Million Fact Extraction Challenge. In: Proc. AAAI 2006, pp. 1400–1405 (2006)
Etzioni, O., Cafarella, M., Downey, D., Kok, S., Popescu, A., Shaked, T., Soderland, S., Weld, D., Yates, A.: Methods for Domain-Independent Information Extraction from the Web: An Experimental Comparison. In: Proc. AAAI 2004 (2004)
Banko, M., Etzioni, O.: The Tradeoffs Between Open and Traditional Relation Extraction. In: Proc. ACL 2008 (2008)
Kawamura, T., Nguyen, M.T., Ohsuga, A.: Building of Human Activity Correlation Map from Weblogs. In: Proc. ICSOFT (2009)
Poslad, S.: Ubiquitous Computing Smart Devices, Environments and Interactions. Wiley, Chichester (2009), ISBN: 978-0-470-03560-3
Ozok, A.A., Zaphiris, P.: Online Communities and Social Computing. In: Third International Conference, OCSC 2009, Held as Part of HCI International 2009, San Diego, CA, USA. Springer, Heidelberg (2009), ISBN-10: 3642027733
Banko, M., Cafarella, M.J., Soderland, S., Broadhead, M., Etzioni, O.: Open information extraction from the Web. In: Proc. IJCAI 2007, pp. 2670–2676 (2007)
Brin, S.: Extracting Patterns and Relations from the World Wide Web. In: WebDB Workshop at 6th International Conference on Extending Database Technology, EDBT 1998, Valencia, Spain, pp. 172–183 (1998)
Agichtein, E., Gravano, L.: Snowball: Extracting relations from large plain-text collections. In: Proc. ACM DL 2000 (2000)
Peppers, D., Rogers, M.: The One to One Future. Broadway Business (1996), ISBN-10: 0385485662
Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields Probabilistic models for segmenting and labeling sequence data. In: Proc. ICML, pp. 282–289 (2001)
Sha, F., Pereira, F.: Shallow parsing with conditional random fields. In: Proc. HLTNAACL, pp. 213–220 (2003)
McCallum, A., Li, W.: Early results for named entity recognition with conditional random fields, feature induction and Web-enhanced lexicons. In: Proc. CoNLL (2003)
Kudo, T., Yamamoto, K., Matsumoto, Y.: Applying Conditional Random Fields to Japanese Morphologiaical Analysis. In: IPSJ SIG Notes, pp. 89–96 (2004)
Fuchi, T., Takagi, S.: Japanese morphological analyzer using word co-occurence-JTAG. In: Proc. ACL 1998, pp. 409–413 (1998)
Kudo, T., Matsumoto, Y.: Japanese Dependency Analysis using Cascaded Chunking. In: Proc. CoNLL 2002, pp. 63–69 (2002)
Hiroyuki, Y., Hideyuki, T., Hiromitsu, S.: An individual behavioral pattern to provide ubiquitous service in intelligent space. WSEAS Transactions on Systems, 562–569 (2007)
NTTDocomo, Inc.: My Life Assist Service (2009), http://www.igvpj.jp/contents_en/activity09/ms09/list/personal/ntt-docomo-inc-1.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
The, N.M., Kawamura, T., Nakagawa, H., Tahara, Y., Ohsuga, A. (2010). Self-supervised Mining of Human Activity from CGM. In: Kang, BH., Richards, D. (eds) Knowledge Management and Acquisition for Smart Systems and Services. PKAW 2010. Lecture Notes in Computer Science(), vol 6232. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15037-1_6
Download citation
DOI: https://doi.org/10.1007/978-3-642-15037-1_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15036-4
Online ISBN: 978-3-642-15037-1
eBook Packages: Computer ScienceComputer Science (R0)