textanalytics.ja.mecabToPOS

Extract part-of-speech information from MeCab output for Japanese

Syntax

posTags = textanalytics.ja.mecabToPOS(words,info)

Description

posTags = textanalytics.ja.mecabToPOS(words,info) extracts part-of-speech information given MeCab output in the format returned by the MeCab-ipadic dictionary.

Input Arguments

collapse all

`words` — Input tokens
string vector

Input tokens, specified as a string vector.

Data Types: string

`info` — Information struct
struct

Information struct with the following fields:

Feature – String vector of tokens of the same size as words containing the MeCab output lines in ChaSen format without the split tokens themselves.
PartOfSpeech – Numerical code used inside the MeCab-ipadic dictionary for the part-of-speech classification.

Data Types: struct

Output Arguments

collapse all

`posTags` — Extracted part-of-speech information
categorical vector

Extracted part-of-speech information, returned as a categorical vector the same size as words.

Documentation

textanalytics.ja.mecabToPOS

Syntax

Description

Input Arguments

`words` — Input tokens
string vector

`info` — Information struct
struct

Output Arguments

`posTags` — Extracted part-of-speech information
categorical vector

See Also

Topics

Text Analytics Toolbox Documentation

Support

Documentation

textanalytics.ja.mecabToPOS

Syntax

Description

Input Arguments

words — Input tokens string vector

info — Information struct struct

Output Arguments

posTags — Extracted part-of-speech information categorical vector

See Also

Topics

Text Analytics Toolbox Documentation

Support

`words` — Input tokens
string vector

`info` — Information struct
struct

`posTags` — Extracted part-of-speech information
categorical vector