Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

7.5b) #94

Open
nickblink opened this issue Apr 13, 2018 · 2 comments
Open

7.5b) #94

nickblink opened this issue Apr 13, 2018 · 2 comments

Comments

@nickblink
Copy link

solutions say g1 is expected to have smaller test RSS because of one less degree of freedom but this is not necessarily true. For example, if the true DGP is cubic, then g2 will fit better.

@Jeffalltogether
Copy link

I agree with @nickblink.

When lambda is maximized, the penalty term is maximized, forcing the final form of g to have an nth-degree derivative = 0 (else the RSS = infinity).

For g1 that is a 1st degree polynomial and for g2 that is a 2nd degree polynomial.

The best fit to the test data will occur when the degree of polynomial that represents the true distribution is used, which could be 1st or 2nd degree depending on the underlying distribution of the data.

@leowang396
Copy link

Agreed.

The difference between g1 and g2 is the degree of their polynomial, but overfitting does not always occur. The function that wins the bias-variance trade-off in the context of given data set will give the best test RSS.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants