Any summary of linear regression ought to at least point out that the "cost function", which depending on the circumstances is often viewed as modeling the variance in the data, is typically limited to one data coordinate. That is, the separations to the fit line to be minimized go entirely along the y-axis; which has the effect of assuming that there's perfect knowledge of the x value. And while that can be the case for some sampling protocols, it is also often not the case. So please consider including something like:<p>If there is uncertainty in both the x and y coordinates then one needs to pursue alternate approaches which admit variation in both, one of the most popular being "Deming regression"[1]<p>[1] <a href="https://en.wikipedia.org/wiki/Deming_regression" rel="nofollow">https://en.wikipedia.org/wiki/Deming_regression</a>