Text mining, also known as text data mining (TDM) or text analytics, is the
process of deriving high-quality information from text. It involves "the
discovery by computer of new, previously unknown information, by automatically
extracting information from different written resources." Written resources
may include websites, books, emails, reviews, and articles. High-quality
information is typically obtained by identifying patterns and trends through
means such as statistical pattern learning. According to Hotho et al. (2005), there
are three perspectives on text mining: information extraction, data mining,
and knowledge discovery in databases (KDD).
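
As a rough illustration of how a statistical pattern might be derived from written resources, the following minimal sketch weights terms with TF-IDF (term frequency times inverse document frequency), one common technique among many. It is not drawn from Hotho et al. (2005); the sample reviews, the function names `tokenize` and `tf_idf`, and the simple letters-only tokenizer are all assumptions made for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into runs of ASCII letters (a deliberately crude tokenizer)."""
    return re.findall(r"[a-z]+", text.lower())


def tf_idf(documents):
    """Return one {term: weight} dict per document, using tf * log(N / df)."""
    tokenized = [tokenize(doc) for doc in documents]
    n_docs = len(tokenized)

    # Document frequency: in how many documents each term appears at least once.
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    scores = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        scores.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return scores


# Hypothetical written resources (short product reviews).
documents = [
    "The battery life of this phone is great",
    "The battery died after one week",
    "Great screen and great camera",
]

scores = tf_idf(documents)
for doc, weights in zip(documents, scores):
    # Show the three highest-weighted terms for each review.
    top_terms = sorted(weights, key=weights.get, reverse=True)[:3]
    print(doc, "->", top_terms)
```

Terms that occur in every document receive a weight of zero under this formula, while terms concentrated in a few documents are weighted higher, which is one simple way that distinguishing patterns can be surfaced from raw text.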