
I noticed you are standardizing your dataset before your test/train split. This is an example of information leakage: the mean and standard deviation are computed with the test examples included, which lets your model overfit by learning the test example distribution. See http://www.eggie5.com/97-model-evaluation-information-leakin...
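To make the fix concrete, here is a minimal sketch of the leak-free ordering: split first, then compute the scaling statistics from the training portion only and reuse them on the test portion. The toy data and the helper names `fit_standardizer` and `standardize` are my own assumptions for illustration, not taken from your code.

```python
import random
import statistics


def fit_standardizer(values):
    """Compute (mean, std) from the given values only (hypothetical helper)."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)
    if std == 0:
        std = 1.0  # avoid division by zero on constant features
    return mean, std


def standardize(values, mean, std):
    """Apply previously fitted statistics to any list of values."""
    return [(v - mean) / std for v in values]


# Toy single-feature dataset (assumed; stands in for the real data).
random.seed(0)
data = [random.gauss(50.0, 10.0) for _ in range(100)]

# 1. Split FIRST, so the test examples never influence the statistics.
random.shuffle(data)
train, test = data[:80], data[80:]

# 2. Fit the scaler on the training portion only.
mean, std = fit_standardizer(train)

# 3. Apply the same training statistics to both portions.
train_scaled = standardize(train, mean, std)
test_scaled = standardize(test, mean, std)
```

With this ordering the scaled training set has zero mean and unit variance, while the scaled test set generally does not, which is expected: it is being treated like unseen data.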



Noticed a couple typos:

Second paragraph - 'argures' should be 'argues', and 'resonse' should be 'response'.



