Residual Plot Choices
main topics
 

Minitab generates residual plots that you can use to examine the goodness of model fit. You can choose the following residual plots:

·    Histogram of residuals. An exploratory tool to show general characteristics of the data, including:

-    Typical values, spread or variation, and shape

-    Unusual values in the data

Long tails in the plot may indicate skewness in the data. If one or two bars are far from the others, those points may be outliers. Because the appearance of the histogram changes depending on the number of intervals used to group the data, use the normal probability plot and goodness-of-fit tests to assess the normality of the residuals.

·    Normal probability plot of residuals. The points in this plot should generally form a straight line if the residuals are normally distributed. If the points on the plot depart from a straight line, the normality assumption may be invalid. If your data have fewer than 50 observations, the plot may display curvature in the tails even if the residuals are normally distributed. As the number of observations decreases, the probability plot may show substantial variation and nonlinearity even if the residuals are normally distributed. Use the probability plot and goodness-of-fit tests, such as the Anderson-Darling statistic, to assess whether the residuals are normally distributed.

You can display the Anderson-Darling statistic (AD) on the plot, which can indicate whether the data are normal. If the p-value is lower than the chosen a-level, the data do not follow a normal distribution. To display the Anderson-Darling statistic, choose Tools > Options >  Linear Models > Residual Plots. For additional tests of normality, see Stat > Basic Statistics > Normality Test.

·    Residuals versus fits. This plot should show a random pattern of residuals on both sides of 0. If a point lies far from the majority of points, it may be an outlier. Also, there should not be any recognizable patterns in the residual plot. The following may indicate error that is not random:

-    a series of increasing or decreasing points

-    a predominance of positive residuals, or a predominance of negative residuals

-    patterns, such as increasing residuals with increasing fits

·    Residuals versus order. This is a plot of all residuals in the order that the data was collected and can be used to find non-random error, especially of time-related effects. A positive correlation is indicated by a clustering of residuals with the same sign. A negative correlation is indicated by rapid changes in the signs of consecutive residuals.

·    Four in one. Select this option to produce a normal plot of residuals, a histogram of residuals, a plot of residuals versus fits, and a plot of residuals versus order in one graph window.

·    Residuals versus other variables. This is a plot of all residuals versus another variable. Plot the residuals against:

-    Each predictor to look for curvature or differences in the magnitude of the residuals

-    Important variables left out of the model to see if they have critical additional effects on the response.

If certain residual values are of concern, you can brush your graph to identify them. See graph brushing.