> That's much easier to intuit and use to guide experiments than actually build into the bandit algo.
IDK about your intuition, but for most other people, it gets in the way of statistics.
The "loss function" is just as easy to calculate for A/B tests as for multi-armed bandit. The value of user doing A is $X, the value of B is $Y, and the value of C is $Z.
IDK about your intuition, but for most other people, it gets in the way of statistics.
The "loss function" is just as easy to calculate for A/B tests as for multi-armed bandit. The value of user doing A is $X, the value of B is $Y, and the value of C is $Z.