The key difference between the residual sum of squares and the total sum of squares lies in what each measures within a statistical model. The total sum of squares (TSS) reflects the overall variability in the observed data, whereas the residual sum of squares (RSS) measures the variability in the errors (or residuals) between the observed data and the predicted values from the model.
Here's a breakdown:
Understanding Total Sum of Squares (TSS)
The total sum of squares quantifies the total variation in a dataset. It is the sum of the squared deviations of the individual data points from the mean of the dependent variable. Think of it as the total 'spread' in your data before any modeling.
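In symbols, for $n$ observations $y_1, \dots, y_n$ with mean $\bar{y}$:

$$\text{TSS} = \sum_{i=1}^{n} (y_i - \bar{y})^2$$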
Key Characteristics of TSS
- Measures overall variability: How spread out the data is from its average.
- Calculated before modeling: It's a measure of the inherent variation present in the data.
- No consideration of any model: It doesn't take into account any predictive relationship.
Understanding Residual Sum of Squares (RSS)
The residual sum of squares, on the other hand, measures how well a statistical model fits the data. It calculates the sum of the squares of the differences (the residuals) between the observed values and the values predicted by the model. This is often used to assess how much of the variability is unexplained by the model.
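With $\hat{y}_i$ denoting the model's prediction for observation $i$:

$$\text{RSS} = \sum_{i=1}^{n} (y_i - \hat{y}_i)^2$$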
Key Characteristics of RSS
- Measures model error: How much the predicted values deviate from the actual values.
- Calculated after modeling: Requires a model to be constructed.
- Indicates model fit: A lower RSS signifies a better fit (less unexplained variability).
Table Comparing TSS and RSS
| Feature | Total Sum of Squares (TSS) | Residual Sum of Squares (RSS) |
|---|---|---|
| What it Measures | Total variability in the data | Error/variability not captured by the model |
| Timing | Calculated before a model is created | Calculated after fitting a model |
| Relationship to Model | Independent of any model | Dependent on the specific model |
| Interpretation | Overall spread of the data | How well the model fits the data |
| Formula Focus | Deviations from the mean of the data | Deviations from the model's predicted values |
Practical Insights
- Good models strive for low RSS: A well-fitting model should minimize the residual sum of squares, meaning the model is capturing the underlying patterns in the data effectively.
- RSS helps calculate R-squared: The relationship between TSS and RSS is used to calculate R-squared ($R^2$), which measures the proportion of the total variance in the dependent variable that is predictable from the independent variables in the model (see the formula below).
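Written out, the relationship is:

$$R^2 = 1 - \frac{\text{RSS}}{\text{TSS}}$$

An $R^2$ close to 1 means the model leaves very little of the total variability unexplained.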
Example
Imagine you're trying to predict house prices based on square footage.
- TSS: If you simply calculate the mean house price and then sum the squared deviations of each observed price from that mean, without using square footage at all, you are measuring the TSS. It shows the overall variation in house prices.
- RSS: If you build a model that predicts house price from square footage, you then sum the squared deviations of the actual prices from the prices predicted by your model. This is the RSS. The smaller this sum, the better your model explains the relationship between size and price (a short numerical sketch follows this list).
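As a minimal sketch of this example, here is one way the calculation might look in Python. The square-footage and price figures are made up for illustration, and NumPy's `polyfit` stands in for whatever fitting method you actually use:

```python
import numpy as np

# Hypothetical data: square footage and sale price (made-up numbers).
sqft  = np.array([1100, 1400, 1600, 1900, 2300, 2700], dtype=float)
price = np.array([200_000, 245_000, 270_000, 310_000, 365_000, 420_000], dtype=float)

# Fit a simple linear model: price ≈ intercept + slope * sqft.
slope, intercept = np.polyfit(sqft, price, deg=1)
predicted = intercept + slope * sqft

# TSS: squared deviations of observed prices from their mean (no model involved).
tss = np.sum((price - price.mean()) ** 2)

# RSS: squared deviations of observed prices from the model's predictions.
rss = np.sum((price - predicted) ** 2)

# R-squared: proportion of the total variability explained by the model.
r_squared = 1 - rss / tss

print(f"TSS = {tss:,.0f}")
print(f"RSS = {rss:,.0f}")
print(f"R^2 = {r_squared:.3f}")
```

Because RSS can never exceed TSS for a least-squares fit with an intercept, $R^2$ falls between 0 and 1, and a smaller RSS pushes it toward 1.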
In summary, the TSS represents the total variability of the data, while the RSS represents the portion of that variability that is not explained by a specific model. As stated in the reference, "The total sum of squares (TSS) measures how much variation there is in the observed data, while the residual sum of squares measures the variation in the error between the observed data and modeled values." Lowering the RSS is a key goal in model building.