Indices of the values for which two conditions are true
1 Ansicht (letzte 30 Tage)
Ältere Kommentare anzeigen
Emiliya Taskova
am 11 Mai 2020
Kommentiert: Emiliya Taskova
am 11 Mai 2020
I have a table of data where the first column is an abbreviated name (abbrev), second column is the first word of the full name (firstword) and the third is a particular number corresponding to that name. I would like to clean my data from duplicates that have the same 'abbrev' and 'firstword' and sum up the numbers for these duplicates. Some entries may have the same abbreviated name but a different first word- e.g. 'rr' and 'Roger' and 'Dodger' and vice versa, that's why I want to introduce this condition that both the first name and the first word have to match for an entry to be considered a duplicate.
Or in other words from this data:
abbrev =
{'yw' }
{'rr' }
{'yw' }
{'rr' }
firstword =
{'yellow'}
{'Roger' }
{'yellow'}
{'Dodger' }
number =
5
10
1
3
I want to get this:
abbrev =
{'yw' }
{'rr' }
{'rr' }
firstword =
{'yellow'}
{'Roger' }
{'Dodger' }
number =
6
10
3
Thank you in advance!
0 Kommentare
Akzeptierte Antwort
Peng Li
am 11 Mai 2020
tbl = table(abbrev(:), firstword(:), number(:));
[gp, outTbl] = findgroups(tbl(:, 1:2));
outTbl.sum = splitapply(@sum, tbl.(3), gp)
outTbl =
3×3 table
Var1 Var2 sum
______ __________ ___
{'rr'} {'Dodger'} 3
{'rr'} {'Roger' } 10
{'yw'} {'yellow'} 6
Weitere Antworten (0)
Siehe auch
Kategorien
Mehr zu Numeric Types finden Sie in Help Center und File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!