Convert HTML and XML entities into characters
newStr = decodeHTMLEntities(str)
example
newStr = decodeHTMLEntities(str) replaces HTML and XML character entities and numeric character references in the elements of str with their Unicode equivalent.
newStr
str
collapse all
Replace HTML character entities with their Unicode equivalent.
str = ["<>" "R&D"]; newStr = decodeHTMLEntities(str)
newStr = 1x2 string "<>" "R&D"
Replace HTML numeric character references with their Unicode equivalent. Unicode character with hex code   is a space.
 
str = "R D"; newStr = decodeHTMLEntities(str)
newStr = "R D"
Input text, specified as a string array, character vector, or cell array of character vectors.
Example: ["An example of a short sentence."; "A second short sentence."]
["An example of a short sentence."; "A second short sentence."]
Data Types: string | char | cell
string
char
cell
Output text, returned as a string array, a character vector, or cell array of character vectors. str and newStr have the same data type.
erasePunctuation | eraseTags | eraseURLs | lower | tokenizedDocument | upper
erasePunctuation
eraseTags
eraseURLs
lower
tokenizedDocument
upper
You have a modified version of this example. Do you want to open this example with your edits?