Abstract
If we want to apply natural language processing and information retrieval methods to historical texts, we have to cope with the three issues discussed in Chapter 3:
1. spellings that differ from today’s orthography (difference);
2. spellings that are highly variable and often inconsistent (variance); and
3. uncertainty such as transcription artifacts and digitization errors.
Copyright information
© 2012 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Piotrowski, M. (2012). Handling Spelling Variation. In: Natural Language Processing for Historical Texts. Synthesis Lectures on Human Language Technologies. Springer, Cham. https://doi.org/10.1007/978-3-031-02146-6_6
DOI: https://doi.org/10.1007/978-3-031-02146-6_6
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-01018-7
Online ISBN: 978-3-031-02146-6
eBook Packages: Synthesis Collection of Technology (R0), eBColl Synthesis Collection 4