Convert documents to lowercase
converts each uppercase character in the input documents to the corresponding
lowercase character, and leaves all other characters unchanged.newDocuments
= lower(documents
)
decodeHTMLEntities
| erasePunctuation
| eraseTags
| eraseURLs
| tokenizedDocument
| upper