Replace n-grams in documents
updates the specified documents by replacing the n-grams newDocuments
= replaceNgrams(documents
,oldNgrams
,newNgrams
)oldNgrams
with
the corresponding n-grams in newNgrams
. The function, by default, is
case sensitive.
replaces the n-grams newDocuments
= replaceNgrams(documents
,oldNgrams
,newNgrams
,'IgnoreCase',true)oldNgrams
ignoring case.
decodeHTMLEntities
| normalizeWords
| removeWords
| replaceWords
| tokenizedDocument