cdfplot

Empirical cumulative distribution function (cdf) plot

Syntax

cdfplot(x)

cdfplot(ax,x)

h = cdfplot(___)

[h,stats] = cdfplot(___)

Description

cdfplot(x) creates an empirical cumulative distribution function (cdf) plot for the data in x. For a value t in x, the empirical cdf F(t) is the proportion of the values in x less than or equal to t.

example

cdfplot(ax,x) plots into the axes specified by ax instead of the current axes (gca). (since R2024a)

h = cdfplot(___) returns a handle of the empirical cdf plot line object. Use h to query or modify properties of the object after you create it. For a list of properties, see Line Properties.

[h,stats] = cdfplot(___) also returns a structure including summary statistics for the data in x.

Examples

collapse all

Compare Empirical cdf to Theoretical cdf

Open Live Script

Plot the empirical cdf of a sample data set and compare it to the theoretical cdf of the underlying distribution of the sample data set. In practice, a theoretical cdf can be unknown.

Generate a random sample data set from the extreme value distribution with a location parameter of 0 and a scale parameter of 3.

rng('default')  % For reproducibility
y = evrnd(0,3,100,1);

Plot the empirical cdf of the sample data set and the theoretical cdf on the same figure.

cdfplot(y)
hold on
x = linspace(min(y),max(y));
plot(x,evcdf(x,0,3))
legend('Empirical CDF','Theoretical CDF','Location','best')
hold off

Figure contains an axes object. The axes object with title Empirical CDF, xlabel x, ylabel F(x) contains 2 objects of type line. These objects represent Empirical CDF, Theoretical CDF.

The plot shows the similarity between the empirical cdf and the theoretical cdf.

Alternatively, you can use the ecdf function. The ecdf function also plots the 95% confidence intervals estimated by using Greenwood's Formula. For details, see Algorithms.

ecdf(y,'Bounds','on')
hold on
plot(x,evcdf(x,0,3))
grid on
title('Empirical CDF')
legend('Empirical CDF','Lower Confidence Bound','Upper Confidence Bound','Theoretical CDF','Location','best')
hold off

Figure contains an axes object. The axes object with title Empirical CDF, xlabel x, ylabel F(x) contains 4 objects of type stair, line. These objects represent Empirical CDF, Lower Confidence Bound, Upper Confidence Bound, Theoretical CDF.

Test for Standard Normal Distribution

Open Live Script

Perform the one-sample Kolmogorov-Smirnov test by using kstest. Confirm the test decision by visually comparing the empirical cumulative distribution function (cdf) to the standard normal cdf.

Load the examgrades data set. Create a vector containing the first column of the exam grade data.

load examgrades
test1 = grades(:,1);

Test the null hypothesis that the data comes from a normal distribution with a mean of 75 and a standard deviation of 10. Use these parameters to center and scale each element of the data vector, because kstest tests for a standard normal distribution by default.

x = (test1-75)/10;
h = kstest(x)

h = logical
   0

The returned value of h = 0 indicates that kstest fails to reject the null hypothesis at the default 5% significance level.

Plot the empirical cdf and the standard normal cdf for a visual comparison.

cdfplot(x)
hold on
x_values = linspace(min(x),max(x));
plot(x_values,normcdf(x_values,0,1),'r-')
legend('Empirical CDF','Standard Normal CDF','Location','best')

Figure contains an axes object. The axes object with title Empirical CDF, xlabel x, ylabel F(x) contains 2 objects of type line. These objects represent Empirical CDF, Standard Normal CDF.

The figure shows the similarity between the empirical cdf of the centered and scaled data vector and the cdf of the standard normal distribution.

Input Arguments

collapse all

`x` — Input data
numeric vector

Input data, specified as a numeric vector.

Data Types: single | double

`ax` — Target axes
`Axes` object

Since R2024a

Target axes, specified as an Axes object. If you do not specify the axes, then cdfplot uses the current axes (gca).

Output Arguments

collapse all

`h` — Handle of plot line object
chart line object

Handle of the empirical cdf plot line object, returned as a chart line object. Use h to query or modify properties of the object after you create it. For a list of properties, see Line Properties.

`stats` — Summary statistics
structure

Summary statistics for the data in x, returned as a structure with the following fields:

Field	Description
`min`	Minimum value
`max`	Maximum value
`mean`	Sample mean
`median`	Sample median (50th percentile)
`std`	Sample standard deviation

Tips

cdfplot is useful for examining the distribution of a sample data set. You can overlay a theoretical cdf on the same plot of cdfplot to compare the empirical distribution of the sample to the theoretical distribution. For an example, see Compare Empirical cdf to Theoretical cdf.
The kstest, kstest2, and lillietest functions compute test statistics derived from an empirical cdf. cdfplot is useful in helping you to understand the output from these functions. For an example, see Test for Standard Normal Distribution.

Alternative Functionality

You can use the ecdf function to find the empirical cdf values and create an empirical cdf plot. The ecdf function enables you to indicate censored data and compute the confidence bounds for the estimated cdf values.

Version History

Introduced before R2006a

expand all

R2024a: Specify target axes

Specify the target axes for the plot by using the ax input argument.

cdfplot

Syntax

Description

Examples

Compare Empirical cdf to Theoretical cdf

Test for Standard Normal Distribution

Input Arguments

`x` — Input data
numeric vector

`ax` — Target axes
`Axes` object

Output Arguments

`h` — Handle of plot line object
chart line object

`stats` — Summary statistics
structure

Tips

Alternative Functionality

Version History

R2024a: Specify target axes

See Also

Topics

cdfplot

Syntax

Description

Examples

Compare Empirical cdf to Theoretical cdf

Test for Standard Normal Distribution

Input Arguments

x — Input data numeric vector

ax — Target axes Axes object

Output Arguments

h — Handle of plot line object chart line object

stats — Summary statistics structure

Tips

Alternative Functionality

Version History

R2024a: Specify target axes

See Also

Topics

`x` — Input data
numeric vector

`ax` — Target axes
`Axes` object

`h` — Handle of plot line object
chart line object

`stats` — Summary statistics
structure