what do lines that double back on themselves mean box plots
16 Ansichten (letzte 30 Tage)
Ältere Kommentare anzeigen
cgenes
am 13 Mai 2019
Bearbeitet: Adam Danz
am 15 Aug. 2023
HI
i'm using anova1 and it produces plots like this
it's a dumb question: but what do the lines mean that double back on themselves? red line in median i thin whiskers are outliers and the boxes should be 75 and 25 percentiles i thought
thanks
Charlie

1 Kommentar
Akzeptierte Antwort
Adam Danz
am 13 Mai 2019
Bearbeitet: Adam Danz
am 15 Aug. 2023
It's not a dumb question.
The notches on the sides of a box plot can be interpreted as a comparison interval around the median values. The height of the notch is the median +/- 1.57 x IQR/sqrt(n) where IQR is the interquartile range defined by the 25th and 75th percentiles and n is the number of data points [1]. If the notches of two boxes do not overlap, there is strong evidence that their medians "significantly differ"[2], a phrase that is working its way out of statistics [3].
The reason why your notches extend beyond the 25th and/or 75th percentiles is due to the uncertainty of the true median value. This often happens if your sample size is small (because you're dividing by the sqrt(n) to calculate the notch height).

image above: [2]
To apply that to your box plots:

Update: a pictoral explanation of boxplot notches has been added to Matlab's documentation [4,5].

4 Kommentare
John Kearns
am 28 Jul. 2023
I have just started looking into box plots/charts and this is the first description I've found that's clear to me at all. Thank you for taking the time to make descriptive figures, this is very helpful.
Weitere Antworten (0)
Siehe auch
Kategorien
Mehr zu Analysis of Variance and Covariance finden Sie in Help Center und File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!