Create table from file
creates
a table by reading column oriented data from a file.T
= readtable(filename
)
readtable
determines the file format from
the file extension:
.txt
, .dat
,
or .csv
for delimited text files
.xls
, .xlsb
, .xlsm
, .xlsx
, .xltm
, .xltx
,
or .ods
for spreadsheet files
readtable
creates one variable in T
for each column in
the file and reads variable names from the first row of the file. By default,
readtable
creates variables that have data types that are appropriate for
the data values detected in each column of the input file.
creates a table from a file with additional options specified by one or more name-value pair
arguments. For example, you can specify whether T
= readtable(___,Name,Value
)readtable
reads the first row
of the file as variable names or as data.
To set specific import options for your data, you can either use the
opts
object or you can specify name-value pairs. When you specify name-value
pairs in addition to opts
, then readtable
supports only
these name-value pairs:
Text and spreadsheet parameters — ReadVariableNames
,
ReadRowNames
Text only parameters — DateLocale
, Encoding
Spreadsheet only parameters — Sheet
,
UseExcel
Load the file myCsvTable.dat
and preview its contents in a text editor. A screen shot is shown below. Notice that the file contains comma-separated column oriented data.
filename = 'myCsvTable.dat';
Create a table from the comma-separated text file. The resulting table T
contains one variable for each column in the file and readtable
treats the entries in the first line of the file as variable names.
T = readtable(filename)
T=5×6 table
LastName Gender Age Height Weight Smoker
____________ ______ ___ ______ ______ ______
{'Smith' } {'M'} 38 71 176 1
{'Johnson' } {'M'} 43 69 163 0
{'Williams'} {'F'} 38 64 131 0
{'Jones' } {'F'} 40 67 133 0
{'Brown' } {'F'} 49 64 119 0
Starting in R2020a, the readtable
function read an input file as though it automatically called the detectImportOptions
function on the file. It can detect data types, discard extra header lines, and fill in missing values.
For example, preview the file headersAndMissing.txt
in a text editor. The file has a line with column names and another line with headers. The last two rows have gaps where the previous rows have data values.
Create a table from the file. The readtable
function discards the headers. Also, it fills in gaps with appropriate missing values—a NaN
value for numeric variables, and an empty character vector for text.
T = readtable('headersAndMissing.txt')
T=5×6 table
LastName Gender Age Height Weight Smoker
___________ __________ ___ ______ ______ ______
{'Wu' } {'M' } 38 71 176 1
{'Johnson'} {'M' } 43 69 163 0
{'Sanchez'} {'F' } 38 64 131 0
{'Brown' } {'F' } NaN 67 133 0
{'Picard' } {0x0 char} NaN 64 119 0
To restore the default behavior from previous releases, specify the 'Format','auto'
name-value pair argument. readtable
reads the headers as data, and as a result, it converts all the table variables to text.
T = readtable('headersAndMissing.txt','Format','auto')
T=6×6 table
LastName Gender Age Height Weight Smoker
___________ __________ __________ _______ _______ ___________
{'string' } {'string'} {'int' } {'int'} {'int'} {'boolean'}
{'Wu' } {'M' } {'38' } {'71' } {'176'} {'1' }
{'Johnson'} {'M' } {'43' } {'69' } {'163'} {'0' }
{'Sanchez'} {'F' } {'38' } {'64' } {'131'} {'0' }
{'Brown' } {'F' } {0x0 char} {'67' } {'133'} {'0' }
{'Picard' } {0x0 char} {0x0 char} {'64' } {'119'} {'0' }
For more information, see the Compatibility Considerations section on this page.
Preview the file mySpaceDelimTable.txt
in a text editor. A screen shot is shown below. Notice that the file contains space delimited, column oriented data.
Create a table from the space delimited text file. The readtable
function assigns the default variable names Var1
to Var5
because the file does not contain detectable column names in its first row.
T = readtable('mySpaceDelimTable.txt')
T=3×5 table
Var1 Var2 Var3 Var4 Var5
_____ ____ ____ ______ _________
{'M'} 45 45 {'NY'} {'true' }
{'F'} 41 32 {'CA'} {'false'}
{'M'} 40 34 {'MA'} {'false'}
Load the file myCsvTable.dat
and preview its contents in a text editor. A screen shot is shown below. Notice that the file contains comma-separated column oriented data.
filename = 'myCsvTable.dat';
Create a table from the comma-separated text file. Import the first two columns as character vectors, the third column as uint32
, and the next two columns as double-precision, floating-point numbers. Import the entries of the last column as character vectors.
T = readtable(filename,'Format','%s%s%u%f%f%s')
T=5×6 table
LastName Gender Age Height Weight Smoker
____________ ______ ___ ______ ______ ______
{'Smith' } {'M'} 38 71 176 {'1'}
{'Johnson' } {'M'} 43 69 163 {'0'}
{'Williams'} {'F'} 38 64 131 {'0'}
{'Jones' } {'F'} 40 67 133 {'0'}
{'Brown' } {'F'} 49 64 119 {'0'}
The conversion specifiers are %s
for a cell array of character vectors, %f
for double
, and %u
for uint32
.
Read German dates from a file and add them to a table as English dates.
Preview the file german_dates.txt
in a text editor. A screen shot is shown below. Notice that the first column of values contains dates in German and the second and third columns are numeric values.
Read the sample file using readtable
. The conversion specifiers is %D
dates and %f
for floating-point values. Specify the file encoding using the FileEncoding
name-value pair argument. Specify the format and locale of the dates using the DateLocale
name-value pair argument.
T = readtable('german_dates.txt',... 'Format','%{dd MMMM yyyy}D %f %f',... 'FileEncoding','ISO-8859-15',... 'DateLocale','de_DE')
T=3×3 table
Var1 Var2 Var3
________________ ____ _____
01 January 2014 20.2 100.5
01 February 2014 21.6 102.7
01 March 2014 20.7 99.8
Create a table from a spreadsheet that contains variable names in the first row and row names in the first column.
T = readtable('patients.xls','ReadRowNames',true);
Display the first five rows and first four variables of the table.
T(1:5,1:4)
ans=5×4 table
Gender Age Location Height
__________ ___ _____________________________ ______
Smith {'Male' } 38 {'County General Hospital' } 71
Johnson {'Male' } 43 {'VA Hospital' } 69
Williams {'Female'} 38 {'St. Mary's Medical Center'} 64
Jones {'Female'} 40 {'VA Hospital' } 67
Brown {'Female'} 49 {'County General Hospital' } 64
View the DimensionNames
property of the table.
T.Properties.DimensionNames
ans = 1x2 cell
{'LastName'} {'Variables'}
'LastName'
is the name in the first column of the first row of the spreadsheet.
Create a table using data from a specified region of the spreadsheet patients.xls
. Use the data from the 5-by-3 rectangular region between the corners C2
and E6
. Do not use the first row of this region as variable names.
T = readtable('patients.xls',... 'Range','C2:E6',... 'ReadVariableNames',false)
T = Var1 Var2 Var3 ____ ___________________________ ____ 38 'County General Hospital' 71 43 'VA Hospital' 69 38 'St. Mary's Medical Center' 64 40 'VA Hospital' 67 49 'County General Hospital' 64
T
contains default variable names.
Create import options, tailor the data types for multiple variables, and then read the data.
Create an import options object from a text file.
opts = detectImportOptions('airlinesmall.csv')
opts = DelimitedTextImportOptions with properties: Format Properties: Delimiter: {','} Whitespace: '\b\t ' LineEnding: {'\n' '\r' '\r\n'} CommentStyle: {} ConsecutiveDelimitersRule: 'split' LeadingDelimitersRule: 'keep' TrailingDelimitersRule: 'ignore' EmptyLineRule: 'skip' Encoding: 'ISO-8859-1' Replacement Properties: MissingRule: 'fill' ImportErrorRule: 'fill' ExtraColumnsRule: 'addvars' Variable Import Properties: Set types by name using setvartype VariableNames: {'Year', 'Month', 'DayofMonth' ... and 26 more} VariableTypes: {'double', 'double', 'double' ... and 26 more} SelectedVariableNames: {'Year', 'Month', 'DayofMonth' ... and 26 more} VariableOptions: Show all 29 VariableOptions Access VariableOptions sub-properties using setvaropts/getvaropts VariableNamingRule: 'modify' Location Properties: DataLines: [2 Inf] VariableNamesLine: 1 RowNamesColumn: 0 VariableUnitsLine: 0 VariableDescriptionsLine: 0 To display a preview of the table, use preview
Examine the Type
property of the variables TaxiIn
and TaxiOut
.
getvaropts(opts,{'TaxiIn','TaxiOut'})
ans = 1x2 TextVariableImportOptions array with properties: Name Type FillValue TreatAsMissing QuoteRule Prefixes Suffixes EmptyFieldRule WhitespaceRule
Change the type of the variables TaxiIn
and TaxiOut
to double
.
opts = setvartype(opts,{'TaxiIn','TaxiOut'},'double');
Specify the subset of variables to import and examine.
opts.SelectedVariableNames = {'TaxiIn','TaxiOut'};
Use the readtable
function along with the options object to import the selected variables. Display a summary of the table.
T = readtable('airlinesmall.csv',opts);
summary(T)
Variables: TaxiIn: 123523x1 double Values: Min 0 Median 5 Max 1451 NumMissing 37383 TaxiOut: 123523x1 double Values: Min 0 Median 13 Max 755 NumMissing 37364
Detect import options for a spreadsheet file, specify the variables to import, and then read the data.
Create an import options object from a file.
opts = detectImportOptions('patients.xls')
opts = SpreadsheetImportOptions with properties: Sheet Properties: Sheet: '' Replacement Properties: MissingRule: 'fill' ImportErrorRule: 'fill' Variable Import Properties: Set types by name using setvartype VariableNames: {'LastName', 'Gender', 'Age' ... and 7 more} VariableTypes: {'char', 'char', 'double' ... and 7 more} SelectedVariableNames: {'LastName', 'Gender', 'Age' ... and 7 more} VariableOptions: Show all 10 VariableOptions Access VariableOptions sub-properties using setvaropts/getvaropts VariableNamingRule: 'modify' Range Properties: DataRange: 'A2' (Start Cell) VariableNamesRange: 'A1' RowNamesRange: '' VariableUnitsRange: '' VariableDescriptionsRange: '' To display a preview of the table, use preview
Modify the options object to specify which variables to import.
opts.SelectedVariableNames = {'Systolic','Diastolic'};
Use readtable
along with the options object to import the specified variables.
T = readtable('patients.xls',opts);
summary(T)
Variables: Systolic: 100x1 double Values: Min 109 Median 122 Max 138 Diastolic: 100x1 double Values: Min 68 Median 81.5 Max 99
You can read tabular data while preserving variable names that include any characters, including spaces and non-ASCII characters. First, create a table with arbitrary variable names and write the table to a text file. Then, read the tabular data back while preserving the original variable names.
Create a table containing arbitrary variable names and write the tabular data to the file 'sampletable.txt'
.
LastName = {'Sanchez';'Johnson';'Li';'Diaz';'Brown'}; Age = [38;43;38;40;49]; Smoker = logical([1;0;1;0;1]); varNames = {'Last Name','Age','Smoker (1 or 0)'}; T = table(LastName,Age,Smoker,'VariableNames',varNames)
T=5×3 table
Last Name Age Smoker (1 or 0)
___________ ___ _______________
{'Sanchez'} 38 true
{'Johnson'} 43 false
{'Li' } 38 true
{'Diaz' } 40 false
{'Brown' } 49 true
writetable(T,'sampletable.txt')
Read the tabular data back using readtable
. By default, MATLAB® converts any variable names that include spaces and non-ASCII characters into valid MATLAB® identifiers. For example, MATLAB® converts the variable names 'Last Name'
to 'LastName'
, and 'Smoker (1 or 0)'
to 'Smoker_1or0_'
. To read the tabular data while preserving variable names, set the 'VariableNamingRule'
parameter to preserve
.
T_preserve = readtable('sampletable.txt',"VariableNamingRule","preserve")
T_preserve=5×3 table
Last Name Age Smoker (1 or 0)
___________ ___ _______________
{'Sanchez'} 38 1
{'Johnson'} 43 0
{'Li' } 38 1
{'Diaz' } 40 0
{'Brown' } 49 1
You can read hexadecimal and binary numbers from a file and store them as numeric variables in a table. The readtable
function automatically reads hexadecimal and binary numbers when they include the 0x
and 0b
prefixes respectively. The numeric values are stored using integer data types. You can also use import options to read such numbers when they do not have prefixes.
Preview the hexAndBinary.txt
file in a text editor. It has columns of hexadecimal and binary numbers with prefixes, and one column without.
Read the file using readtable
. The function detects the numbers with 0x
and 0b
prefixes and stores them as integers. The third column does not have prefixes so its values are treated as text.
T = readtable('hexAndBinary.txt')
T=3×4 table
Var1 Var2 Var3 Var4
_____ ____ ________ ___________
255 255 {'C7F9'} {'Device1'}
256 4 {'05BF'} {'Device2'}
43981 129 {'F00F'} {'Device3'}
The readtable
function stores the numeric values in different integer classes (uint16
and uint8
) because T.Var1
has a value that requires more than 8 bits of storage.
class(T.Var1)
ans = 'uint16'
class(T.Var2)
ans = 'uint8'
To specify the data types for storing the numeric values imported from hexadecimal and binary numbers, use the 'HexType'
and 'BinaryType'
name-value pair arguments. Store the values as signed 32-bit integers.
T = readtable('hexAndBinary.txt','HexType','int32','BinaryType','int32'); class(T.Var1)
ans = 'int32'
class(T.Var2)
ans = 'int32'
You can use import options to detect hexadecimal and binary numbers without prefixes, and specify storage for them. Create an import options object for hexAndBinary.txt
.
opts = detectImportOptions('hexAndBinary.txt')
opts = DelimitedTextImportOptions with properties: Format Properties: Delimiter: {','} Whitespace: '\b\t ' LineEnding: {'\n' '\r' '\r\n'} CommentStyle: {} ConsecutiveDelimitersRule: 'split' LeadingDelimitersRule: 'keep' TrailingDelimitersRule: 'ignore' EmptyLineRule: 'skip' Encoding: 'UTF-8' Replacement Properties: MissingRule: 'fill' ImportErrorRule: 'fill' ExtraColumnsRule: 'addvars' Variable Import Properties: Set types by name using setvartype VariableNames: {'Var1', 'Var2', 'Var3' ... and 1 more} VariableTypes: {'auto', 'auto', 'char' ... and 1 more} SelectedVariableNames: {'Var1', 'Var2', 'Var3' ... and 1 more} VariableOptions: Show all 4 VariableOptions Access VariableOptions sub-properties using setvaropts/getvaropts VariableNamingRule: 'modify' Location Properties: DataLines: [1 Inf] VariableNamesLine: 0 RowNamesColumn: 0 VariableUnitsLine: 0 VariableDescriptionsLine: 0 To display a preview of the table, use preview
To specify that the third column should be imported as hexadecimal values, despite the lack of a prefix, use the setvaropts
function to modify the variable type for the third variable of the table. Set the variable type to int32
.
opts = setvaropts(opts,3,'NumberSystem','hex','Type','int32')
opts = DelimitedTextImportOptions with properties: Format Properties: Delimiter: {','} Whitespace: '\b\t ' LineEnding: {'\n' '\r' '\r\n'} CommentStyle: {} ConsecutiveDelimitersRule: 'split' LeadingDelimitersRule: 'keep' TrailingDelimitersRule: 'ignore' EmptyLineRule: 'skip' Encoding: 'UTF-8' Replacement Properties: MissingRule: 'fill' ImportErrorRule: 'fill' ExtraColumnsRule: 'addvars' Variable Import Properties: Set types by name using setvartype VariableNames: {'Var1', 'Var2', 'Var3' ... and 1 more} VariableTypes: {'auto', 'auto', 'int32' ... and 1 more} SelectedVariableNames: {'Var1', 'Var2', 'Var3' ... and 1 more} VariableOptions: Show all 4 VariableOptions Access VariableOptions sub-properties using setvaropts/getvaropts VariableNamingRule: 'modify' Location Properties: DataLines: [1 Inf] VariableNamesLine: 0 RowNamesColumn: 0 VariableUnitsLine: 0 VariableDescriptionsLine: 0 To display a preview of the table, use preview
Read the file and import the third column as numeric values, not text.
T = readtable('hexAndBinary.txt',opts)
T=3×4 table
Var1 Var2 Var3 Var4
_____ ____ _____ ___________
255 255 51193 {'Device1'}
256 4 1471 {'Device2'}
43981 129 61455 {'Device3'}
filename
— Name of file to readName of the file to read, specified as a character vector or a string scalar.
Depending on the location of your file, filename
can take on one of these forms.
Location | Form | ||||||||
---|---|---|---|---|---|---|---|---|---|
Current folder or folder on the MATLAB® path | Specify the name of the file in Example: | ||||||||
File in a folder | If the file is not in the current folder or in a folder on the MATLAB path, then specify the full or relative path name
in Example:
Example:
| ||||||||
Remote Location | If the file is stored at a remote location, then
Based on your remote location,
For more information, see Work with Remote Data. Example:
|
If filename
includes the file extension, then the
importing function determines the file format from the extension. Otherwise,
you must specify the 'FileType'
name-value pair arguments
to indicate the type of file.
On Windows® systems with Microsoft® Excel® software, the importing function reads any Excel spreadsheet file format recognized by your version of Excel.
If your system does not have Excel for Windows or if you are using MATLAB
Online™, the importing function operates with the
UseExcel
property set to false
,
and reads only .xls, .xlsx, .xlsm, .xltx, and .xltm
files.
For delimited text files, the importing function converts empty fields in
the file to either NaN
(for a numeric variable) or an
empty character vector (for a text variable). All lines in the text file
must have the same number of delimiters. The importing function ignores
insignificant white space in the file.
Data Types: char
| string
opts
— File import optionsSpreadsheetImportOptions
| DelimitedtextImportOptions
| FixedWidthImportOptions
File import options, specified as an SpreadsheetImportOptions
,
DelimitedTextImportOptions
, or
FixedWidthImportOptions
object created by the detectImportOptions
function. The opts
object
contains properties that control the data import process. For more information on the
properties of each object, see the appropriate object page.
Type of Files | Output |
---|---|
Spreadsheet files | SpreadsheetImportOptions object |
Text files | DelimitedTextImportOptions object |
Fixed-width text files | FixedWidthImportOptions object |
Specify optional
comma-separated pairs of Name,Value
arguments. Name
is
the argument name and Value
is the corresponding value.
Name
must appear inside quotes. You can specify several name and value
pair arguments in any order as
Name1,Value1,...,NameN,ValueN
.
'ReadVariableNames',false
indicates
that the first row of the file does not correspond to variable names.When reading:
Text files, only these parameter names apply: FileType
,
ReadVariableNames
, ReadRowNames
,
TreatAsEmpty
, DatetimeType
,
Delimiter
, HeaderLines
, Format
,
EmptyValue
, MultipleDelimsAsOne
,
CollectOutput
, CommentStyle
,
ExpChars
, LineEnding
, DateLocale
,
Encoding
, HexType
, and
BinaryType
.
Spreadsheet files, only these parameter names apply: FileType
,
ReadVariableNames
, ReadRowNames
,
TreatAsEmpty
, DatetimeType
, Sheet
,
Range
, UseExcel
, and
TextType
.
Text or Spreadsheet files with the opts
import options, only these
parameter names apply: ReadVariableNames
, ReadRowNames
,
DateLocale
, Encoding
, Sheet
, and
UseExcel
.
'FileType'
— Type of file'spreadsheet'
| 'text'
| 'delimitedtext'
| 'fixedwidth'
Type of file, specified as the comma-separated pair consisting of
'FileType'
and one of these values.
Value | Import Options for File |
---|---|
'spreadsheet' | Return a |
'text' | Return a |
'delimitedtext' | Return a |
'fixedwidth' | Return a |
Use the 'FileType'
name-value pair argument when
filename
does not include the file extension, or when the extension is
not one of these:
.txt
, .dat
, or .csv
for text
files
.xls
, .xlsb
, .xlsm
,
.xlsx
, .xltm
, .xltx
, or
.ods
for spreadsheet files
Example: 'FileType','text'
Data Types: char
| string
'ReadVariableNames'
— Read first row as variable namestrue
| false
| 1
| 0
Indicator for reading the first row as variable names, specified as the comma-separated
pair consisting of 'ReadVariableNames'
and either true
,
false
, 1
, or 0
. If unspecified,
readtable
automatically detects the presence of variable
names.
Indicator | Description |
---|---|
| Use when the first row of the region to read contains the variable names for
the table. |
| Use when the first row of the region to read contains data in the table.
|
unspecified | When left unspecified, readtable automatically detects
true or false and proceeds accordingly. |
Note: If both the
'ReadVariableNames'
and 'ReadRowNames'
logical
indicators are true
, then readtable
saves the name in
the first column of the first row of the region to read as the first dimension name in the
property, T.Properties.DimensionNames
.
If you specify the ReadVariableNames
argument in addition to
opts
the import options, then the readtable
behavior
changes based on the specification:
If ReadVariableNames
is true
, then read the
variable names from the specified file by using the VariableNamesRange
or the VariableNamesLine
property of the import options object.
If ReadVariableNames
is false
, then read the
variable names from the VariableNames
property of the import options
object.
'ReadRowNames'
— Indicator for reading the first column as row namesfalse
(default) | true
| 0
| 1
Indicator for reading first column as row names, specified as the comma-separated pair
consisting of 'ReadRowNames'
and either false
,
true
, 0
, or 1
.
Indicator | Description |
---|---|
| Use when the first column of the region to read contains data, and not the row names for the table. |
| Use when the first column of the region to read contains the row names for the table. |
unspecified | When left unspecified, readtable assumes
false . |
Note: If both the
'ReadVariableNames'
and 'ReadRowNames'
logical
indicators are true
, then readtable
saves the name in
the first column of the first row of the region to read as the first dimension name in the
property, T.Properties.DimensionNames
.
If you specify the ReadRowNames
argument in addition to
opts
the import options , then the readtable
behavior
changes based on the specification:
If ReadRowNames
is true
, then read the row
names from the specified file by using the RowNamesRange
or the
RowNameColumn
property of the import options object.
If ReadRowNames
is false
, then do not import
row names.
'TreatAsEmpty'
— Placeholder text to treat as empty valuePlaceholder text to treat as an empty value, specified as the comma-separated pair
consisting of 'TreatAsEmpty'
and a character vector, cell array of
character vectors, string, or string array. Table elements corresponding to these characters
are set to NaN
.
'TreatAsEmpty'
only applies to numeric columns in the file, and
readtable
does not accept numeric literals, such as
'-99'
.
Example: 'TreatAsEmpty','N/A'
or
'TreatAsEmpty',"N/A"
sets N/A
within numeric columns
to NaN
.
Example: 'TreatAsEmpty',{'.','NA','N/A'}
or
'TreatAsEmpty',[".","NA","N/A"]
sets .
,
NA
and N/A
within numeric columns to
NaN
.
Data Types: char
| string
'TextType'
— Type for imported text data 'char'
(default) | 'string'
Type for imported text data, specified as the comma-separated pair consisting of 'TextType'
and either 'char'
or 'string'
.
'char'
— Import text data into MATLAB as character vectors.
'string'
— Import text data into MATLAB as string arrays.
Example: 'TextType','char'
'DatetimeType'
— Type for imported date and time data'datetime'
(default) | 'text'
| 'exceldatenum'
(spreadsheet files only)Type for imported date and time data, specified as the comma-separated pair consisting of 'DatetimeType'
and one of these values: 'datetime'
, 'text'
, or 'exceldatenum'
. The value 'exceldatenum'
is applicable only for spreadsheet files, and is not valid for text files.
Value | Type for Imported Date and Time Data |
---|---|
'datetime' | MATLAB For more information, see |
'text' | If
|
'exceldatenum' | Excel serial date numbers A serial date number is a single number equal to the number of days from a given reference date. Excel serial date numbers use a different reference date than MATLAB serial date numbers. For more information on Excel dates, see |
Data Types: char
| string
'VariableNamingRule'
— Flag to preserve variable names'modify'
(default) | 'preserve'
Flag to preserve variable names, specified as the comma-separated pair consisting of
VariableNamingRule
and either true
, or
false
.
'preserve'
— Preserve variable names that are not valid
MATLAB identifiers such as variable names that include spaces and
non-ASCII characters.
'modify'
— Convert invalid variable names (as
determined by the isvarname
function) to
valid MATLAB identifiers.
Starting in R2019b, variable names and row names can include any characters, including
spaces and non-ASCII characters. Also, they can start with any characters, not just
letters. Variable and row names do not have to be valid MATLAB identifiers (as determined by the isvarname
function). To preserve these variable names and row names, set
the value of VariableNamingRule
to
'preserve'
.
Data Types: char
| string
'Delimiter'
— Field delimiter characterField delimiter character, specified as the comma-separated pair consisting of
'Delimiter'
and a character vector, a cell array of character vectors, or
a string. Specify Delimiter
using any valid character such as a comma
','
or a period '.'
.
This table lists some commonly used field delimiter characters.
Specifier | Field Delimiter |
---|---|
| Comma |
| Space |
| Tab |
| Semicolon |
| Vertical bar |
unspecified | If unspecified, |
To treat consecutive delimiters as a single delimiter, specify
Delimiter
as a cell array of character vectors. In addition, you must
also specify the MultipleDelimsAsOne
option.
Example: 'Delimiter',','
or
'Delimiter','comma'
Data Types: char
| string
| cell
'HeaderLines'
— Lines to skipLines to skip at beginning of the file, specified as the comma-separated pair consisting
of 'HeaderLines'
and a positive integer. If unspecified,
readtable
automatically detects the number of lines to skip.
Data Types: single
| double
'Format'
— Column format'auto'
Column format of the file, specified as the comma-separated pair consisting of
'Format'
and a character vector or a string scalar having one or more
conversion specifiers, or 'auto'
. The conversion specifiers are the same
as the specifiers accepted by the textscan
function.
Specifying the format can significantly improve speed for some large files. If you do
not specify a value for Format
, then readtable
uses
%q
to interpret nonnumeric columns. The %q
specifier
reads the text and omits double quotation marks ("
) if appropriate.
If you do not specify the 'Format'
name-value pair, then the
readtable
function behaves as though you have used the results of the
detectImportOptions
function to import the data. For more information
on the consequences of this behavior, see Compatibility
Considerations.
If you specify 'Format','auto'
, then the variables created are
double
arrays, cell array of character vectors, or
datetime
arrays, depending on the data. If the entire column is
numeric, variables are imported as double
. If any element in a column is
not numeric, the variables are imported as cell arrays of character vectors, or as
datetime
arrays if the values represent dates and times.
Data Types: char
| string
'EmptyValue'
— Returned value for empty numeric fieldsNaN
(default) | scalarReturned value for empty numeric fields in delimited text files,
specified as the comma-separated pair consisting of 'EmptyValue'
and
a scalar.
'MultipleDelimsAsOne'
— Multiple delimiter handling0 (false)
(default) | 1 (true)
Multiple delimiter handling, specified as the comma-separated
pair consisting of 'MultipleDelimsAsOne'
and either true
or false
.
If true
, then the importing function treats consecutive
delimiters as a single delimiter. Repeated delimiters separated by
white-space are also treated as a single delimiter. You must also
specify the Delimiter
option.
Example: 'MultipleDelimsAsOne',1
'CollectOutput'
— Logical indicator determining data concatenationfalse
(default) | true
Logical indicator determining data concatenation, specified
as the comma-separated pair consisting of 'CollectOutput'
and
either true
or false
. If true
,
then the importing function concatenates consecutive output cells
of the same fundamental MATLAB class into a single array.
'CommentStyle'
— Symbols designating text to ignoreSymbols designating text to ignore, specified as the comma-separated
pair consisting of 'CommentStyle'
and a character
vector, cell array of character vectors, string, or string array.
For example, specify a character such as '%'
to ignore text following the
symbol on the same line. Specify a cell array of two character vectors, such as
{'/*','*/'}
, to ignore any text between those sequences.
MATLAB checks for comments only at the start of each field, not within a field.
Example: 'CommentStyle',{'/*','*/'}
Data Types: char
| string
'ExpChars'
— Exponent characters'eEdD'
(default) | character vector | stringExponent characters, specified as the comma-separated pair consisting
of 'ExpChars'
and a character vector or string.
The default exponent characters are e
, E
, d
,
and D
.
Data Types: char
| string
'LineEnding'
— End-of-line charactersEnd-of-line characters, specified as the comma-separated pair consisting of
'LineEnding'
and a character vector or string. The character
vector must be '\r\n'
or it must specify a single character. Common
end-of-line characters are a newline character ('\n'
) or a carriage
return ('\r'
). If you specify '\r\n'
, then the
importing function treats any of \r
, \n
, and the
combination of the two (\r\n
) as end-of-line characters.
The default end-of-line sequence is \n
, \r
, or
\r\n
, depending on the contents of your file.
If there are missing values and an end-of-line sequence at the end of the last line in
a file, then the importing function returns empty values for those fields. This ensures
that individual cells in output cell array, C
, are the same
size.
Example: 'LineEnding',':'
Data Types: char
| string
'DateLocale'
— Locale for reading datesLocale for reading dates, specified as the comma-separated pair consisting of
'DateLocale'
and a character vector or a string scalar of the
form
, where:xx
_YY
YY
is an uppercase ISO 3166-1 alpha-2 code
indicating a country.
xx
is a lowercase ISO 639-1 two-letter code
indicating a language.
For a list of common values for the locale, see the Locale
name-value pair argument for the datetime
function.
When using the %D
format specifier to read text as
datetime
values, use DateLocale
to specify the
locale in which the importing function should interpret month and day-of-week names and
abbreviations.
If you specify the DateLocale
argument in addition to
opts
the import options, then the importing function uses the
specified value for the DateLocale
argument, overriding the locale
defined in the import options.
Example: 'DateLocale','ja_JP'
'Encoding'
— Character encoding scheme'UTF-8'
| 'ISO-8859-1'
| 'windows-1251'
| 'windows-1252'
| ...Character encoding scheme associated with the file, specified as the comma-separated
pair consisting of 'Encoding'
and 'system'
or a
standard character encoding scheme name. When you do not specify any encoding, the
readtable
function uses automatic character set detection to determine
the encoding when reading the file.
If you specify the 'Encoding'
argument in addition to the import
options, then the readtable
function uses the specified value for
'Encoding'
, overriding the encoding defined in the import options.
Example: 'Encoding','UTF-8'
uses UTF-8 as the encoding.
Example: 'Encoding','system'
uses the system default
encoding.
Data Types: char
| string
'DurationType'
— Output data type of duration data'duration'
(default) | 'text'
Output data type of duration data from text files, specified as the comma-separated pair
consisting of 'DurationType'
and either 'duration'
or
'text'
.
Value | Type for Imported Duration Data |
---|---|
'duration' | MATLAB
For more information, see |
'text' | If
|
Data Types: char
| string
'HexType'
— Output data type of hexadecimal data'auto'
(default) | 'text'
| 'int8'
| 'int16'
| ...Output data type of hexadecimal data, specified as the comma-separated pair consisting of 'HexType'
and one of the values listed in the table.
The input file represents hexadecimal values as text, using either 0x
or
0X
as a prefix and the characters
0
-9
,
a
-f
, and A
-F
as digits. (Uppercase and lowercase letters represent the same digits—for example,
'0xf'
and '0xF'
both represent
15
.)
The importing function converts the hexadecimal values to the data type specified by
the value of 'HexType'
.
Value of | Data Type of Output Table Variables |
---|---|
| data type detected automatically |
| unaltered input text |
| 8-bit integer, signed |
| 16-bit integer, signed |
| 32-bit integer, signed |
| 64-bit integer, signed |
| 8-bit integer, unsigned |
| 16-bit integer, unsigned |
| 32-bit integer, unsigned |
| 64-bit integer, unsigned |
Example: 'HexType','uint16'
converts text representing hexadecimal values (such as '0xFF'
) to unsigned 16-bit integers (such as 255
) in the output table.
Data Types: char
| string
'BinaryType'
— Output data type of binary data'auto'
(default) | 'text'
| 'int8'
| 'int16'
| ...Output data type of binary data, specified as the comma-separated pair consisting of
'BinaryType'
and one of the values listed in the table.
The input file represents binary values as text, using either 0b
or
0B
as a prefix and the characters 0
and
1
as digits.
The importing function converts the binary values to the data type specified by the
value of 'BinaryType'
.
Value of | Data Type of Output Table Variables |
---|---|
| data type detected automatically |
| unaltered input text |
| 8-bit integer, signed |
| 16-bit integer, signed |
| 32-bit integer, signed |
| 64-bit integer, signed |
| 8-bit integer, unsigned |
| 16-bit integer, unsigned |
| 32-bit integer, unsigned |
| 64-bit integer, unsigned |
Example: 'BinaryType','uint16'
converts text representing binary
values (such as '0b11111111'
) to unsigned 16-bit integers (such as
255
) in the output table.
Data Types: char
| string
'TrimNonNumeric'
— Remove nonnumeric charactersfalse
(default) | true
Remove nonnumeric characters from a numeric variable, specified as a logical true
or false
.
Example: If name-value pair is specified as 'TrimNonNumeric',true
, then the
importing function reads '$500/-'
as
500
.
Data Types: logical
'Sheet'
— Worksheet to read1
(default) | positive integer | character vector | stringWorksheet to read, specified as the comma-separated pair consisting of
'Sheet'
and a positive integer indicating the worksheet index or a
character vector or string containing the worksheet name. The worksheet name cannot contain a
colon (:
). To determine the names of sheets in a spreadsheet file, use
sheets = sheetnames(filename)
. For more information, see sheetnames
.
If you specify the Sheet
argument in addition
to opts
the import options, then the readtable
function
uses the specified value for Sheet
argument, overriding
the sheet name defined in the import options.
Example: 'Sheet'
, 2
Example: 'Sheet'
, 'MySheetName'
Example: 'Sheet'
, "MySheetName"
Data Types: char
| string
| single
| double
| int8
| int16
| int32
| int64
| uint8
| uint16
| uint32
| uint64
'Range'
— Portion of worksheet to readPortion of the worksheet to read, indicated as a rectangular area specified by a comma
separated pair consisting of 'Range'
and a character vector or string
scalar in one of the following forms.
Ways to specify Range | Description |
---|---|
Starting Cell | Specify the starting cell for the data as a character vector or string scalar or a two element numeric vector.
Using the starting cell, the importing function automatically detects the extent of the data by beginning the import at the start cell and ending at the last empty row or footer range. Example:
|
Rectangular Range | Specify the range using the syntax Example: |
Unspecified or Empty | If unspecified, Example: Note: Used Range refers
to the rectangular portion of the spreadsheet that actually contains
data. |
Row Range | You can identify range by specifying the beginning and
ending rows using Excel row designators. Then Example: |
Column Range | You can identify range by specifying the beginning and
ending columns using Excel column designators. Then Example: |
Excel’s Named Range | In Excel, you can create names to identify ranges
in the spreadsheet. For instance, you can select a rectangular portion
of the spreadsheet and call it Example: |
Example: 'Range'
, 'A1:F10'
Example: 'Range'
, "A1:F10"
—
Data Types: char
| string
'UseExcel'
— Flag to start instance of Microsoft Excel for Windowsfalse
(default) | true
Flag to start an instance of Microsoft
Excel for Windows when reading spreadsheet data, specified as the comma-separated pair consisting of 'UseExcel'
and either true
, or false
.
You can set the 'UseExcel'
parameter to one of these values:
true
— The importing function starts an instance of
Microsoft
Excel when reading the
file.
false
— The importing function does not start an
instance of Microsoft
Excel when reading the file. When
operating in this mode, the importing function functionality differs in the
support of file formats and interactive features, such as formulas and
macros.
UseExcel |
|
|
---|---|---|
Supported file formats |
|
|
Support for interactive features, such as formulas and macros | Yes | No |
When reading from spreadsheet files on Windows platforms, if you want to start an instance of Microsoft
Excel, then set the 'UseExcel'
parameter to
true
.
T
— Output tableOutput table, returned as a table. The table can store metadata such as descriptions, variable
units, variable names, and row names. For more information, see the Properties section
of table
.
Large files in XLSX format sometimes load slowly. For better import and export performance, Microsoft recommends that you use the XLSB format.
detectImportOptions
function used by default to import tabular dataBehavior changed in R2020a
By default, the readtable
function uses the results of the
detectImportOptions
function to import tabular data. In essence, these two
function calls behave identically.
T = readtable(filename) T = readtable(filename,detectImportOptions(filename))
In R2020a, there are several differences between the default behavior of
readtable
and its default behavior in previous releases.
Description of Input Fields or Rows | Default R2020a | Default Behavior in Previous Releases |
---|---|---|
First row does not have text to assign as names of output table variables | Assigns the names | Converts the values in the first row of data values to the names of output table variables |
Multiple rows of text as header lines |
|
|
Empty fields | Treat as missing values for detected data type | Treat as empty character vectors or strings |
Values in quotes | Treat as detected data type | Treat as text |
Text that cannot be converted | Treat as missing values for detected data type | Treat as text |
Nonnumeric character trails numeric character without delimiter between them | Treat characters as nonnumeric | Treat numeric and nonnumeric characters as though delimiter separated them |
Input text file has lines with different number of delimiters | Returns output table with extra variables | Raises error message |
To call readtable
with the default behavior it had up to R2019b, use
the 'Format','auto'
name-value pair argument.
T = readtable(filename,'Format','auto')
detectImportOptions
| Import
Tool | preview
| readcell
| readmatrix
| readtimetable
| readvars
| setvaropts
| table
| textscan
| writetable
You have a modified version of this example. Do you want to open this example with your edits?