Cleaning Words with R: Stemming, Lemmatization & Replacing with More Common Synonym

*Recommendation
2018
R software
Text mining
Author: Steve Simon

Published: February 20, 2018

Figure 1. Excerpt from website

In many text mining or natural language processing applications, you will run into words that are very similar but are counted separately, such as win, winner, and winning. Stemming lets you combine such words into a single category by reducing each one to a shared stem. This blog post gives a nice overview of stemming and, as its title indicates, also covers lemmatization and replacing words with a more common synonym.

Bernhard Learns. Cleaning Words with R: Stemming, Lemmatization & Replacing with More Common Synonym. Blog post. April 27, 2017. Available in html format.

An earlier version is available here.
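To make the idea of stemming concrete, here is a minimal sketch in R. It is not taken from the blog post: it assumes the SnowballC package is installed, and the `tokens` vector is a made-up example. It compares word counts before and after stemming.

```r
# Assumption: the SnowballC package is installed
# (install.packages("SnowballC")).
library(SnowballC)

# Hypothetical tokens, invented for illustration only
tokens <- c("win", "wins", "winning", "winner", "win")

# Raw counts treat every surface form as a separate term
raw_counts <- table(tokens)

# wordStem() applies the Porter stemming algorithm to each token
stems <- wordStem(tokens, language = "porter")

# Counts after stemming: forms that share a stem are pooled
stem_counts <- table(stems)

print(raw_counts)
print(stem_counts)
```

A rule-based stemmer will not necessarily merge every related form. Depending on the algorithm, a word like winner may keep its own stem, which is one reason to also consider lemmatization or a synonym lookup.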