Extract part-of-speech information from MeCab output for Japanese
posTags = textanalytics.ja.mecabToPOS(words,info)
posTags = textanalytics.ja.mecabToPOS(words,info) extracts part-of-speech information given MeCab output in the format returned by the MeCab-ipadic dictionary.
posTags
words
info
collapse all
Input tokens, specified as a string vector.
Data Types: string
string
Information struct with the following fields:
Feature – String vector of tokens of the same size as words containing the MeCab output lines in ChaSen format without the split tokens themselves.
Feature
PartOfSpeech – Numerical code used inside the MeCab-ipadic dictionary for the part-of-speech classification.
PartOfSpeech
Data Types: struct
struct
Extracted part-of-speech information, returned as a categorical vector the same size as words.
addPartOfSpeechDetails | mecabOptions | textanalytics.ja.mecabToLemma | textanalytics.ja.mecabToNER | tokenizedDocument
addPartOfSpeechDetails
mecabOptions
textanalytics.ja.mecabToLemma
textanalytics.ja.mecabToNER
tokenizedDocument