Quick question: did you try to find any correlation with post time or seasonality? I recall that previous examinations of top/viral HN and Reddit posts found some signal along that dimension. (I noticed someone mentions this in the Twitter thread too, but I didn't see a response.)
Additionally, did you find the overall average (all up) to be a better predictor than the average per user or per category?
Apologies for grilling you here, I should frankly dig in myself, but if you happen to feel like indulging me it's much appreciated :)
I didn’t try adding in any metadata to the models yet (including time, date, and author). I was just trying to work with the text content of the posts.