How do I pull out from a table, the total mean of one item based on a particular condition (for several subjects), and then compute the difference in mean values?

Question

Monique Chatterton on 22 Oct 2021


Direct link to this question

https://de.mathworks.com/matlabcentral/answers/1569793-how-do-i-pull-out-from-a-table-the-total-mean-of-one-item-based-on-a-particular-condition-for-seve

Commented: Monique Chatterton on 23 Oct 2021

I have a large table of data (example below).

For each "Subject", I need to determine 1) the mean of all "Item A" values that are 'High' in "Condition Z", and 2) the mean of all "Item A" values that are 'Low' in "Condition Z". For example, for Subject 1 the answers would be 25 (the mean of Item A over the 'High' "Condition Z" rows) and 50 (the mean of Item A over the 'Low' "Condition Z" rows).

I then need to generate a table that, for each "Subject", lists the difference between these two values: (mean of all "Item A" values that are 'High' in "Condition Z") - (mean of all "Item A" values that are 'Low' in "Condition Z"). Thank you very much!

Subject    Item A    Condition Z    Restudied    Class
   1         30         High            Y          A
   1         20         High            N          B
   1         45         Low             N          A
   1         55         Low             Y          B
   2         17         High            Y          A
   2         45         Low             N          B
   2         13         High            Y          A
   3         56         High            Y          A
   3         12         Low             N          B
   3         34         Low             N          B
   3         15         High            N          A


Answer 1

Dave B on 22 Oct 2021


Direct link to this answer

https://de.mathworks.com/matlabcentral/answers/1569793-how-do-i-pull-out-from-a-table-the-total-mean-of-one-item-based-on-a-particular-condition-for-seve#answer_814448

Edited: Dave B on 22 Oct 2021

You can do the first part with groupsummary:

% Example data mirroring the question's table (only the columns needed here)
Subject = [1 1 1 1 2 2 2 3 3 3 3]';
ItemA = [30 20 45 55 17 45 13 56 12 34 15]';
ConditionZ = ["High" "High" "Low" "Low" "High" "Low" "High" "High" "Low" "Low" "High"]';
t = table(Subject,ItemA,ConditionZ)
t = 11×3 table
    Subject    ItemA    ConditionZ
    _______    _____    __________

       1        30        "High"  
       1        20        "High"  
       1        45        "Low"   
       1        55        "Low"   
       2        17        "High"  
       2        45        "Low"   
       2        13        "High"  
       3        56        "High"  
       3        12        "Low"   
       3        34        "Low"   
       3        15        "High"  
tsummary=groupsummary(t,["Subject" "ConditionZ"],"mean","ItemA")
tsummary = 6×4 table
    Subject    ConditionZ    GroupCount    mean_ItemA
    _______    __________    __________    __________

       1         "High"          2              25   
       1         "Low"           2              50   
       2         "High"          2              15   
       2         "Low"           1              45   
       3         "High"          2            35.5   
       3         "Low"           2              23   

You might be tempted to do the second part by just subtracting tsummary.mean_ItemA(tsummary.ConditionZ=="High") - tsummary.mean_ItemA(tsummary.ConditionZ=="Low"), but I think a nicer solution is to make two separate tables and join them, which makes sure that the rows for each subject line up.
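To see why the direct subtraction is fragile, here is a minimal sketch using hypothetical data (not the question's table) in which subject 2 has no "Low" rows. The variable names s, hi, lo and sizesMatch are made up for illustration.

```matlab
% Hypothetical data: subject 2 has no "Low" rows
Subject = [1 1 2 3 3]';
ItemA = [30 50 17 56 12]';
ConditionZ = ["High" "Low" "High" "High" "Low"]';
s = groupsummary(table(Subject,ItemA,ConditionZ),["Subject" "ConditionZ"],"mean","ItemA");

hi = s.mean_ItemA(s.ConditionZ=="High");  % one value each for subjects 1, 2 and 3
lo = s.mean_ItemA(s.ConditionZ=="Low");   % values for subjects 1 and 3 only

% hi - lo would error here because a 3-by-1 and a 2-by-1 vector cannot be subtracted
sizesMatch = isequal(size(hi), size(lo))
```

The size mismatch is the lucky case, because it fails loudly. If one subject lacked "High" and another lacked "Low", the two vectors would have the same length and the subtraction would silently pair up different subjects.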

thigh = tsummary(tsummary.ConditionZ=="High",:)
thigh = 3×4 table
    Subject    ConditionZ    GroupCount    mean_ItemA
    _______    __________    __________    __________

       1         "High"          2              25   
       2         "High"          2              15   
       3         "High"          2            35.5   
tlow = tsummary(tsummary.ConditionZ=="Low",:)
tlow = 3×4 table
    Subject    ConditionZ    GroupCount    mean_ItemA
    _______    __________    __________    __________

       1         "Low"           2             50    
       2         "Low"           1             45    
       3         "Low"           2             23    
tdiff=join(tlow,thigh,'Keys','Subject')
tdiff = 3×7 table
    Subject    ConditionZ_tlow    GroupCount_tlow    mean_ItemA_tlow    ConditionZ_thigh    GroupCount_thigh    mean_ItemA_thigh
    _______    _______________    _______________    _______________    ________________    ________________    ________________

       1            "Low"                2                 50                "High"                2                    25      
       2            "Low"                1                 45                "High"                2                    15      
       3            "Low"                2                 23                "High"                2                  35.5      
tdiff.difference = tdiff.mean_ItemA_thigh - tdiff.mean_ItemA_tlow
tdiff = 3×8 table
    Subject    ConditionZ_tlow    GroupCount_tlow    mean_ItemA_tlow    ConditionZ_thigh    GroupCount_thigh    mean_ItemA_thigh    difference
    _______    _______________    _______________    _______________    ________________    ________________    ________________    __________

       1            "Low"                2                 50                "High"                2                    25              -25   
       2            "Low"                1                 45                "High"                2                    15              -30   
       3            "Low"                2                 23                "High"                2                  35.5             12.5   
tdiff=tdiff(:,[1 end]) % just for display, no reason to eliminate the intermediate columns
tdiff = 3×2 table
    Subject    difference
    _______    __________

       1           -25   
       2           -30   
       3          12.5   
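As an alternative to splitting and joining, the summary can be pivoted so that each condition becomes its own column. This is only a sketch, assuming the same example data; unstack is a different technique from the join shown above, ConditionZ is converted to categorical here as a precaution, and twide is a name made up for illustration.

```matlab
% Same example data as above, with ConditionZ stored as categorical
Subject = [1 1 1 1 2 2 2 3 3 3 3]';
ItemA = [30 20 45 55 17 45 13 56 12 34 15]';
ConditionZ = categorical(["High" "High" "Low" "Low" "High" "Low" "High" "High" "Low" "Low" "High"]');
t = table(Subject,ItemA,ConditionZ);

% Mean of ItemA per subject and condition
tsummary = groupsummary(t,["Subject" "ConditionZ"],"mean","ItemA");

% Drop GroupCount, then spread the means into one column per condition
twide = unstack(tsummary(:,["Subject" "ConditionZ" "mean_ItemA"]),"mean_ItemA","ConditionZ");

% One row per subject, so the subtraction is aligned by construction
twide.difference = twide.High - twide.Low
```

With this layout, a subject that is missing one of the conditions should end up with NaN in that column, and therefore NaN as its difference, rather than being paired with another subject's value.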

4 Comments (2 older comments hidden)

dpb on 23 Oct 2021

Yeah, this (relatively) recent penchant to introduce so many (almost) overlapping functionalities in common toolboxes or even in base product is maddening in causing so much overhead and bloat in the dictionary space; much of which then ends up being deprecated but never going away. The early incarnation of the table as the Statistics TB dataset and its implementation of a different categorical class is a prime example.

While such were indeed needed, it would have been far better for the TB and the base product to have been integrated together even if it meant delaying some TB functionality for a while.

The seeming lack of an overall comprehensive design and direction is becoming a real issue in my mind; the disparate user interfaces and inconsistencies continue to multiply.

$0.02, imo, ymmv, etc., etc., ...

Monique Chatterton on 23 Oct 2021

This worked beautifully! Thank you so much for this answer and the additional information and explanations!


Answer 2

dpb on 22 Oct 2021


Direct link to this answer

https://de.mathworks.com/matlabcentral/answers/1569793-how-do-i-pull-out-from-a-table-the-total-mean-of-one-item-based-on-a-particular-condition-for-seve#answer_814458


% tData is the table holding your data
tMEANS=rowfun(@mean,tData,'GroupingVariables',{'ConditionZ'},'InputVariables','ItemA','OutputVariableNames','GroupedMeans');

See the doc for all the skinny on grouping variables and rowfun and friends...you might also find groupsummary of interest.
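The rowfun call above groups by ConditionZ only, so it returns one mean per condition across all subjects. A sketch of the per-subject variant the question asks for, assuming tData holds the example data from the question (the construction of tData below is an assumption, since the answer does not define it):

```matlab
% Example data; tData stands in for the asker's full table
Subject = [1 1 1 1 2 2 2 3 3 3 3]';
ItemA = [30 20 45 55 17 45 13 56 12 34 15]';
ConditionZ = ["High" "High" "Low" "Low" "High" "Low" "High" "High" "Low" "Low" "High"]';
tData = table(Subject,ItemA,ConditionZ);

% Group by both Subject and ConditionZ so each subject gets its own High and Low mean
tMEANS = rowfun(@mean,tData,'GroupingVariables',{'Subject','ConditionZ'}, ...
    'InputVariables','ItemA','OutputVariableNames','GroupedMeans')
```

The result has one row per subject and condition, with the mean in GroupedMeans, so the High minus Low difference can then be computed from it in the same way as from a groupsummary result.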
