A Text Parser should include the following functionalities: Tokenizer: Reads document into memory, tokenizes to separate words; returns token stream. Basic tokenization rules: • remove numbers • ignore if word contains numbers… (Budget: $10 – $30 USD, Jobs: Java)
