How to batch convert table columns to categorical

I have a large table (large number of columns and very large number of rows), many of these columns contain simple cell arrays with a very limited number of different options (such as 'ON" and 'OFF' or similar).
I would like to convert all these columns to categorical in an automated way, i.e. extracting column names from
Table.Properties.VariableNames
and not explicitly writing the (large) number of column names in dot notation.
I have seen that
>> ddd2 = categorical(MyTable{:,MyTable.Properties.VariableNames(1)});
works fine (i.e. ddd2 is a correct categorical array), but, at this point, I cannot put this categorical array back in my table. I tried:
>> MyTable{:,MyTable.Properties.VariableNames{1}}= ddd2;
The following error occurred converting from categorical to cell:
Conversion to cell from categorical is not possible.
but it returns the error you see above... On the other hand, if I use dot notation it works fine, i.e.
MyTable.Column1 = ddd2;
yields the desired results. I found a workaround, that works perfectly, but I would like to know if there is a more direct and elegant way to solve this problem.... this is the workaround:
sz0=size(MyTable);
kk=0;
for( i=1:sz0(2) )
if( iscell(MyTable{:,i}) )
ddd2 = categorical(MyTable{:,MyTable.Properties.VariableNames(i)});
eval(sprintf('MyTable.%s = ddd2;',MyTable.Properties.VariableNames{i}));
kk = kk+1;
end
end
fprintf( ' %d columns transformed to categorical\n', kk );
it works...

1 Kommentar

Dear Giancarlo,
thanks for your tip on including the Table.Properties.VariableNames in a for loop. You know, it was the easy to create a cell array for the 'vars' input of the change variable types built-in function.
Best.

Melden Sie sich an, um zu kommentieren.

 Akzeptierte Antwort

Sean de Wolski
Sean de Wolski am 19 Sep. 2018

2 Stimmen

In 18b there is a convenience function to do exactly this:
Note, that it may be possible to just create your table with categoricals. How are you creating the table?

7 Kommentare

Wow, thank you for the immediate answer (I will upgrade, then asap).
The table is created by reading a Mysql db using a select() command, I am not sure, but it does not seem I have many options to preprocess the arriving data ?
Thank you again, did not see that function !
I'm running MATLAB r2019a and I get an error.
Undefined function or variable 'convertvars'.
Aldema, is the first input of your convertvars call a table or timetable array? That function is only defined when the first input is a table or timetable array.
I have the same error, and I'm inputting a table into convertvars.
Dear Sean,
thank you. You reminded me of the built-in function, best!

Melden Sie sich an, um zu kommentieren.

Weitere Antworten (0)

Kategorien

Produkte

Version

R2018a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by