Based on Support Vector and Word Features New Word Discovery Research

Chengcheng, Li; Yuanfang, Xu

doi:10.1007/978-3-642-35795-4_36


Part of the book series: Communications in Computer and Information Science (CCIS, volume 320)

Included in the following conference series:

International Conference on Trustworthy Computing and Services


Abstract

Ambiguity and unknown-word recognition are persistent difficulties in Chinese word segmentation. This paper extracts and quantifies new-word pattern features, together with various word-internal patterns, from the positive and negative samples of a training corpus, and then trains a support vector machine (SVM) on these feature vectors to obtain a new-word classifier. On the test corpus, new-word candidates are extracted and selected using the absolute discounting method. Each candidate is quantified with the word patterns extracted from the training corpus and classified by the trained SVM, and a set of filtering rules is then applied to produce the final new-word recognition results.
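The abstract does not say how absolute discounting is applied to candidate extraction, so the following is only a minimal sketch under stated assumptions: absolute discounting is used to smooth character-bigram probabilities, and adjacent character pairs that score above a threshold and are missing from a known lexicon are kept as new-word candidates. The function names, the discount `d = 0.75`, the threshold, and the restriction to two-character candidates are all hypothetical choices made for illustration, not details taken from the paper. The SVM classification and rule-filtering stages are not shown.

```python
from collections import Counter


def absolute_discount_bigram(corpus, d=0.75):
    """Build an absolute-discounted character-bigram model.

    Assumed form (not taken from the paper):
        P(cur | prev) = max(c(prev, cur) - d, 0) / c(prev)
                        + (d * N(prev) / c(prev)) * P_unigram(cur)
    where N(prev) is the number of distinct characters seen after prev.
    """
    unigrams = Counter()
    bigrams = Counter()
    for sentence in corpus:
        unigrams.update(sentence)
        bigrams.update(zip(sentence, sentence[1:]))
    total = sum(unigrams.values())

    history = Counter()    # how often each character occurs as a bigram history
    followers = Counter()  # number of distinct continuations per history
    for (prev, _cur), count in bigrams.items():
        history[prev] += count
        followers[prev] += 1

    def prob(prev, cur):
        p_unigram = unigrams[cur] / total
        if history[prev] == 0:
            # Unseen history: fall back to the unigram estimate.
            return p_unigram
        discounted = max(bigrams[(prev, cur)] - d, 0.0) / history[prev]
        backoff_weight = d * followers[prev] / history[prev]
        return discounted + backoff_weight * p_unigram

    return prob


def candidate_bigrams(corpus, lexicon, prob, threshold=0.5):
    """Collect adjacent character pairs that are not in the lexicon and
    whose smoothed bigram probability reaches the threshold."""
    candidates = set()
    for sentence in corpus:
        for prev, cur in zip(sentence, sentence[1:]):
            pair = prev + cur
            if pair not in lexicon and prob(prev, cur) >= threshold:
                candidates.add(pair)
    return sorted(candidates)
```

In a pipeline like the one the abstract outlines, the strings returned by `candidate_bigrams` would next be turned into feature vectors and passed to the trained SVM. The discounted mass `d * N(prev) / c(prev)` is redistributed through the unigram model, so the probabilities for a given history still sum to one over the observed vocabulary.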



Author information

Authors and Affiliations

School of Computer and Information Engineering, Inner Mongolia Normal University, Hohhot, China
Li Chengcheng & Xu Yuanfang


Editor information

Editors and Affiliations

Beijing University of Posts and Telecommunications, Beijing, China
Yuyu Yuan & Xu Wu
The School of Telecommunications Engineering, Beijing University of Posts and Telecommunications, P.O. Box 128, 100876, Beijing, China
Yueming Lu


About this paper

Cite this paper

Chengcheng, L., Yuanfang, X. (2013). Based on Support Vector and Word Features New Word Discovery Research. In: Yuan, Y., Wu, X., Lu, Y. (eds) Trustworthy Computing and Services. ISCTCS 2012. Communications in Computer and Information Science, vol 320. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35795-4_36


DOI: https://doi.org/10.1007/978-3-642-35795-4_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35794-7
Online ISBN: 978-3-642-35795-4
eBook Packages: Computer Science, Computer Science (R0)
