#Features This is a basic Text processing API. It provides the following features
-
It provides a Tokenizer class and Word Frequency calculator which has helper methods to 1a. Tokenize entire file into word lists and word frequency maps given file name 1b. Tokenize sentences into words
-
It provides TwoGramGenerator class which has helper methods to 2a. Generate all two grams in a file , given file name 2b. Give frequency map of two grams
-
It provides a PalindromeGenerator which scans a file and generates frequency distribution of all palindromes contained in a file.
#HowToUse The repository has a set of testing methods within the testing package which demonstrate the use of all methods on an sample input.txt file