Hacker News new | past | comments | ask | show | jobs | submit login

You will find no understanding in the grown trees-- by definition, they're each seeing a different angle of the data, and each node a different subset of available features.

You can, however, calculate importance scores for the features used. Brennan's original paper gives a good algorithm for doing this (in short: for each tree, permuting the data along some feature for an out of bag sample and seeing how much worse it does.)




(Pardon me/blame autocorrect: Breiman.)




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: