Interesting. So for training they use features: > In the past, how often did thi... | Hacker News

Hacker News new | past | comments | ask | show | jobs | submit

login

Tarq0n on July 11, 2020 | parent | context | favorite | on: Testing Firefox more efficiently with machine lear...

Interesting. So for training they use features:

> In the past, how often did this test fail when the same files were touched?

> How far in the directory tree are the source files from the test files?

> How often in the VCS history were the source files modified together with the test files?

But for prediction all they input is a tuple (TEST, PATCH), and XGboost works fine without the additional features?

dmurray on July 11, 2020 [–]

I think they're deriving the additional features at prediction time. The test and patch don't contain all the information you need to compute the features, but they contain sufficient information when combined with a big static lookup table. At least that's the way I read it; agree it could be clearer.

Consider applying for YC's Spring batch! Applications are open till Feb 11.
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact