Abstract
Entities and relationships are important structures that can be extracted from a text corpus to represent the factual knowledge inside the corpus. Effective and efficient mining of entity and relation structures from text helps gain insights from large volumes of text data (which are infeasible for humans to read through and digest), and enables many downstream applications for understanding, exploring, and analyzing the text content. Data analysts and government agents may want to identify person, organization, and location entities in everyday news articles and generate concise and timely summaries of news events. Biomedical researchers who cannot digest the large number of newly published research papers in their areas need an effective way to extract the different relationships among proteins, drugs, and diseases so as to follow the new claims and facts presented in the research community. However, text data is highly variable: corpora that cover topics from different domains, or that are written in different genres or languages, have typically required a wide range of language resources, such as grammars, vocabularies, and gazetteers, for effective processing. The massive and messy nature of text data poses significant challenges to creating tools for automated structuring of unstructured content that scale with text volume.
Copyright information
© 2018 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Ren, X., Han, J. (2018). Conclusions. In: Mining Structures of Factual Knowledge from Text. Synthesis Lectures on Data Mining and Knowledge Discovery. Springer, Cham. https://doi.org/10.1007/978-3-031-01912-8_14
DOI: https://doi.org/10.1007/978-3-031-01912-8_14
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-00784-2
Online ISBN: 978-3-031-01912-8
eBook Packages: Synthesis Collection of Technology (R0), eBColl Synthesis Collection 8