Remove selected words from documents or bag-of-words model
removes the specified words from newDocuments
= removeWords(documents
,words
)documents
. The function,
by default, is case sensitive.
removes words ignoring case using any of the previous syntaxes.newDocuments
= removeWords(___,'IgnoreCase',true)
removes words by specifying the numeric or logical indices
newDocuments
= removeWords(documents
,idx
)idx
of the words in
documents.Vocabulary
. This syntax is the same as
newDocuments =
removeWords(documents,documents.Vocabulary(idx))
.
bagOfNgrams
| bagOfWords
| normalizeWords
| removeEmptyDocuments
| removeInfrequentWords
| removeLongWords
| removeNgrams
| removeShortWords
| stopWords
| tokenizedDocument