Abstract
Question classification is a key component of open-domain question-answering systems and has become a research focus in natural language processing. The task is to assign each question a class label according to the semantic type of its expected answer. Because classification precision suffers from the coarse annotation granularity of syntactic features and from noise in lexical features, we propose new classification features based on fine-grained PoS annotation of nouns and interrogative pronouns. We first refine the annotation granularity of the syntactic features, then extract high-frequency head words together with their fine-grained PoS tags to produce new features that reduce the noise in the lexical features. A new feature extraction algorithm based on fine-grained PoS annotation is applied to improve the precision of feature extraction. Experimental results demonstrate the effectiveness of the proposed method for both Chinese and English question classification.
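The abstract gives only an outline of the feature extraction step, so the following is a minimal sketch of the general idea rather than the authors' algorithm. It assumes questions arrive already tagged as (word, tag) pairs. The tag names (`wh_entity`, `n_place`, and so on), the function names, and the `min_count` threshold are all hypothetical and chosen for illustration only.

```python
from collections import Counter

# Hypothetical fine-grained tag prefixes (assumptions, not the paper's tag set).
NOUN_PREFIX = "n_"   # e.g. "n_place", "n_event"
WH_PREFIX = "wh_"    # e.g. "wh_person", "wh_entity"


def frequent_head_words(tagged_questions, min_count=2):
    """Return lowercased nouns that occur at least min_count times in the corpus."""
    counts = Counter(
        word.lower()
        for question in tagged_questions
        for word, tag in question
        if tag.startswith(NOUN_PREFIX)
    )
    return {word for word, count in counts.items() if count >= min_count}


def extract_features(question, head_words):
    """Build features from fine-grained tags of interrogatives and nouns.

    A noun contributes its word form only if it is a frequent head word;
    rarer nouns contribute only their tag, which limits lexical noise.
    """
    features = []
    for word, tag in question:
        if tag.startswith(WH_PREFIX):
            features.append(f"wh_tag={tag}")
        elif tag.startswith(NOUN_PREFIX):
            features.append(f"noun_tag={tag}")
            if word.lower() in head_words:
                features.append(f"head={word.lower()}")
    return features


# Toy tagged questions (illustrative data, not from the paper).
training = [
    [("Which", "wh_entity"), ("city", "n_place"), ("hosted", "v"),
     ("the", "d"), ("games", "n_event")],
    [("What", "wh_entity"), ("city", "n_place"), ("is", "v"), ("largest", "a")],
    [("Who", "wh_person"), ("wrote", "v"), ("Hamlet", "n_work")],
]

head_words = frequent_head_words(training)
features = extract_features(training[0], head_words)
```

In this toy corpus only "city" passes the frequency threshold, so "games" is represented by its tag alone. The resulting feature strings could then be fed to any standard classifier.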
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Le, J., Niu, Z., Zhang, C. (2014). Question Classification Based on Fine-Grained PoS Annotation of Nouns and Interrogative Pronouns. In: Pham, DN., Park, SB. (eds) PRICAI 2014: Trends in Artificial Intelligence. PRICAI 2014. Lecture Notes in Computer Science, vol 8862. Springer, Cham. https://doi.org/10.1007/978-3-319-13560-1_54
DOI: https://doi.org/10.1007/978-3-319-13560-1_54
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13559-5
Online ISBN: 978-3-319-13560-1
eBook Packages: Computer Science, Computer Science (R0)