Rearranging rows side by side based on a column value

Question

Sunny am 7 Feb. 2019

0
Verknüpfen

Direkter Link zu dieser Frage

https://de.mathworks.com/matlabcentral/answers/443775-rearranging-rows-side-by-side-based-on-a-column-value

Kommentiert: Sunny am 12 Feb. 2019

Hi,

I have an excel file with 300k samples (rows) and 40 columns. The first column is ID which has duplicate values and last column is about status and has binary values either 0 or 1.

I am looking to scan through this file and if the status column of first row is 0 it should copy the next row columns from 2 to 39 (excluising ID and status) and paste it where first row ends first row if the belong to same ID. This should happen for every other row with status 0 and it should copy only data related to same ID. Please see example below. From the expected output you can obvserve for ID 35 we didn't append anyt value for first sample as the status is 1 and for ID 35 the third sample is also not appended even if status is 0 as its the last row related to 35 and we cannot append ID 45 values,

  ID Col1 Col2 Col3 Col4 Status
993  65  130    0     1 
993  65  24     1     0 
993  65  7      1     0 
993  65  9      1     0 
993  65  19     1     0 
993  65  58     0     0 

Expected Output:

 ID Col1 Col2 Col3 Col 4 Status    Col1 Col2 Col3 Col4 
993  65  130   0     1 
993  65  24    1     0        993  65   7     1
993  65  7     1     0         
993  65  9     1     0        993  65  19     1 
993  65  19    1     0        993  65  58     0     
993  65  58    0     0 

Thanks

4 Kommentare
2 ältere Kommentare anzeigen2 ältere Kommentare ausblenden

Guillaume am 8 Feb. 2019

Oh, I didn't realise you wanted the filtered columns to be appended to the right of the same file. While it's perfectly doable, are you really sure you want this? I wouldn't think that repeated data and data with gaps in the rows is very practical? Wouldn't you rather have it as a separate file (with no gaps)?

As to the headers, it's up to you if you want them or not. It's a slightly different approach (table vs matrix) but the same amount of code either way.

Sunny am 8 Feb. 2019

Bearbeitet: Sunny am 9 Feb. 2019

Guillaume thanks. i can have it as a seperate file . Your suggestion is correct as I would have deleted rows with gaps.

Melden Sie sich an, um zu kommentieren.

Melden Sie sich an, um diese Frage zu beantworten.

Answer 1

Guillaume am 11 Feb. 2019

0
Verknüpfen

Direkter Link zu dieser Antwort

https://de.mathworks.com/matlabcentral/answers/443775-rearranging-rows-side-by-side-based-on-a-column-value#answer_360492

In MATLAB Online öffnen

I'm assuming that status 1 only happens once per ID. I'm also assuming that all rows of an ID are together.

t = readtable('Input_File.xlsx');  %read input data
[~, ~, subs] = unique(t.ID);  %assign unique ID from 1 to n to each ID of table
hasstatus1 = accumarray(subs, t.Status, [], @any);  %find IDs that have a status of 1
endrows = accumarray(subs, (1:height(t))', [], @max);  %find last row of each ID
rowstodelete = t.Status == 1;  %mark rows with status 1 for deletion
rowstodelete(endrows(hasstatus1)) = true;  %and last row of ID which have a status of 1

From there, you can create a new table with only the rows and columns you want:

newtable = t(~rowstodelete, 2:end-1);  %2:end-1 as you want to get rid of 1st and last column
writetable(newtable, 'NewFile.xlsx');

Or append to the existing table with gaps in row. This forces all appended columns to be cell arrays, which is more awkward:

newcontent = num2cell(t{:, 2:end-1});
newcontent(rowstodelete, :) = {[]};
newtable = [t, cell2table(newcontent, 'VariableNames', compose('%s_1', string(t.Properties.VariableNames(2:end-1))))];
writetable(newtable, 'NewFile2.xlsx');

1 Kommentar
-1 ältere Kommentare anzeigen-1 ältere Kommentare ausblenden

Sunny am 12 Feb. 2019

Thanks, it worked perfectly.

Melden Sie sich an, um zu kommentieren.

Rearranging rows side by side based on a column value

4 Kommentare
2 ältere Kommentare anzeigen2 ältere Kommentare ausblenden

Akzeptierte Antwort

1 Kommentar
-1 ältere Kommentare anzeigen-1 ältere Kommentare ausblenden

Weitere Antworten (0)

Siehe auch

Kategorien

Tags

Produkte

Version

Community Treasure Hunt

Rearranging rows side by side based on a column value

4 Kommentare 2 ältere Kommentare anzeigen2 ältere Kommentare ausblenden

Akzeptierte Antwort

1 Kommentar -1 ältere Kommentare anzeigen-1 ältere Kommentare ausblenden

Weitere Antworten (0)

Siehe auch

Kategorien

Tags

Produkte

Version

Community Treasure Hunt

4 Kommentare
2 ältere Kommentare anzeigen2 ältere Kommentare ausblenden

1 Kommentar
-1 ältere Kommentare anzeigen-1 ältere Kommentare ausblenden