Add sentence numbers to documents
Use addSentenceDetails to add sentence information to documents.
The function supports English, Japanese, German, and Korean text.
updatedDocuments = addSentenceDetails(documents) detects the sentence boundaries in documents and updates the token details. To get the sentence details from updatedDocuments, use tokenDetails.
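The basic workflow can be sketched as follows. This is a minimal example, assuming Text Analytics Toolbox is installed; the input strings are made up for illustration, and the exact set of variables in the returned table may vary by release.

```matlab
% Create a tokenized document array from two illustrative strings.
str = [
    "This is the first sentence. This is the second sentence."
    "A single-sentence document."];
documents = tokenizedDocument(str);

% Detect sentence boundaries and record them in the token details.
updatedDocuments = addSentenceDetails(documents);

% Retrieve the token details as a table. The sentence information
% is expected in a variable such as SentenceNumber.
tdetails = tokenDetails(updatedDocuments);
head(tdetails)
```

Inspecting the table with head is a quick way to confirm that each token has been assigned to a sentence before continuing with further analysis.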
updatedDocuments = addSentenceDetails(documents,Name,Value) specifies additional options using one or more name-value pair arguments.
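A name-value call might look like the sketch below. The text above does not list the available options, so the 'Abbreviations' option name and its string value are assumptions based on the toolbox's related abbreviations function. Check the function's reference page for the supported names and value types before relying on this.

```matlab
% Illustrative input containing an abbreviation that ends with a period.
documents = tokenizedDocument("See Fig. 2 for details. The results are clear.");

% ASSUMPTION: 'Abbreviations' is a supported name-value option that
% accepts a string array of abbreviations to treat as non-terminating.
updatedDocuments = addSentenceDetails(documents,'Abbreviations',"Fig.");

% Inspect the resulting sentence assignments.
tdetails = tokenDetails(updatedDocuments)
```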
Use addSentenceDetails before using the lower, upper, erasePunctuation, normalizeWords, removeWords, and removeStopWords functions as addSentenceDetails uses information that is removed by these functions.
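The recommended ordering can be sketched as a short preprocessing pipeline. This is an illustrative example with a made-up input string: sentence detection runs first, while capitalization and punctuation are still present, and the normalization steps follow.

```matlab
% Illustrative input with two sentences.
str = "The first sentence ends here. The second one follows!";
documents = tokenizedDocument(str);

% Step 1: add sentence details while case and punctuation are intact.
documents = addSentenceDetails(documents);

% Step 2: apply preprocessing that removes this information.
documents = lower(documents);
documents = erasePunctuation(documents);

% The remaining tokens should keep the sentence details added in step 1.
tdetails = tokenDetails(documents)
```

Running the steps in the opposite order would leave addSentenceDetails without the punctuation and case cues it uses to find sentence boundaries.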
If emoticons or emoji characters appear after a terminating punctuation character, then the function splits the sentence after the emoticons and emoji.
abbreviations | addEntityDetails | addLanguageDetails | addLemmaDetails | addPartOfSpeechDetails | addTypeDetails | splitSentences | tokenDetails | tokenizedDocument