Extract keywords using RAKE
tbl = rakeKeywords(documents) extracts keywords and respective scores using the Rapid Automatic Keyword Extraction (RAKE) algorithm. The function supports English, Japanese, German, and Korean text. To learn how to use rakeKeywords for other languages, see Language Considerations.

tbl = rakeKeywords(documents,Name,Value) specifies additional options using one or more name-value pair arguments.
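As a minimal sketch of both syntaxes, the following assumes Text Analytics Toolbox is installed and that the input is a tokenizedDocument array. The sample sentences are made up for illustration, and 'MaxNumKeywords' is used here as one example of a name-value option.

```matlab
% Illustrative sample text (not from any real data set).
textData = [
    "MATLAB provides really useful tools for engineers."
    "Scientists use many useful MATLAB toolboxes."];

% Tokenize without removing stop words or punctuation, since the
% default RAKE settings rely on them as delimiters.
documents = tokenizedDocument(textData);

% Basic syntax: extract keywords and scores for every document.
tbl = rakeKeywords(documents);

% Name-value syntax: keep at most three keywords per document.
tblTop = rakeKeywords(documents,'MaxNumKeywords',3);

% Show the first few rows of each result.
disp(head(tbl))
disp(head(tblTop))
```

The returned table lists each keyword together with the number of the document it came from and its score, so the rows can be filtered or sorted like any other table.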
Tip
The rakeKeywords function, by default, extracts keywords using stop words and punctuation characters. When using the default values for the 'Delimiters' and 'MergingDelimiters' options, do not remove stop words or punctuation characters from the input text.
You can experiment with different keyword extraction algorithms to see what works best with your data. Because the RAKE algorithm uses a delimiter-based approach to extract candidate keywords, the extracted keywords can be very long. Alternatively, you can try extracting keywords using the TextRank algorithm, which starts with individual tokens as candidate keywords and then merges them when appropriate. To extract keywords using TextRank, use the textrankKeywords function. To learn more, see Extract Keywords from Text Data Using TextRank.
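One way to compare the two approaches is to run both functions on the same documents and inspect the results side by side. The sketch below assumes Text Analytics Toolbox is installed; the sample text is invented for illustration, and joining multiword keywords into single strings is only a readability step, not something either function requires.

```matlab
% Illustrative sample text (not from any real data set).
textData = [
    "MATLAB provides really useful tools for engineers."
    "Scientists use many useful MATLAB toolboxes."];
documents = tokenizedDocument(textData);

% Extract keywords with each algorithm using default options.
tblRake = rakeKeywords(documents);
tblTextRank = textrankKeywords(documents);

% Multiword keywords are stored one word per column; join them
% into single strings so the tables are easier to read.
if size(tblRake.Keyword,2) > 1
    tblRake.Keyword = strip(join(tblRake.Keyword));
end
if size(tblTextRank.Keyword,2) > 1
    tblTextRank.Keyword = strip(join(tblTextRank.Keyword));
end

% Display the first rows from each algorithm for comparison.
disp(head(tblRake))
disp(head(tblTextRank))
```

Scores from the two algorithms are computed differently, so compare the keyword lists themselves and not the raw score values.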
[1] Rose, Stuart, Dave Engel, Nick Cramer, and Wendy Cowley. "Automatic keyword extraction from individual documents." Text mining: applications and theory 1 (2010): 1-20.