#include "Parameters.hpp"
#include <boost/random/mersenne_twister.hpp>
#include <boost/random/discrete_distribution.hpp>
friend ostream& operator<<(ostream&, const UCB&);
Implementation of the UCB (upper confidence bound) algorithm for multi-armed bandits.
void reward(double)
Provide reward for the most recent choice.
size_t choice
Store the last choice made.
static boost::random::mt19937 random_generator
Random source.
bool choose_next
Belt and braces: warn if choose() and reward() are called in the wrong order.
size_t choose()
Choose using the current state.
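
The choose/reward protocol described above could be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the actual implementation of the UCB class: the class name `UcbSketch`, the members `counts`, `sums` and `total`, and the use of the standard UCB1 index (mean reward plus sqrt(2 ln t / n_i)) are all assumptions. Only `choose()`, `reward(double)`, `choice` and `choose_next` are taken from the listing. The Boost random source is left out, so ties here resolve deterministically to the lowest arm index.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <iostream>
#include <limits>
#include <vector>

// Hypothetical sketch: everything except choose(), reward(), choice and
// choose_next is an assumption made for illustration.
class UcbSketch {
public:
    explicit UcbSketch(std::size_t n_arms)
        : counts(n_arms, 0), sums(n_arms, 0.0) {
        assert(n_arms > 0);
    }

    // Choose using the current state: play any untried arm first, otherwise
    // take the arm with the largest UCB1 index (assumed formula).
    std::size_t choose() {
        if (!choose_next)
            std::cerr << "warning: choose() called before reward()\n";
        choose_next = false;
        double best = -std::numeric_limits<double>::infinity();
        for (std::size_t i = 0; i < counts.size(); ++i) {
            if (counts[i] == 0) {  // untried arm: play it first
                choice = i;
                return choice;
            }
            const double mean = sums[i] / counts[i];
            const double bonus =
                std::sqrt(2.0 * std::log(static_cast<double>(total)) / counts[i]);
            if (mean + bonus > best) {
                best = mean + bonus;
                choice = i;
            }
        }
        return choice;
    }

    // Provide reward for the most recent choice.
    void reward(double r) {
        if (choose_next)
            std::cerr << "warning: reward() called before choose()\n";
        counts[choice] += 1;
        sums[choice] += r;
        total += 1;
        choose_next = true;
    }

private:
    std::vector<std::size_t> counts;  // pulls per arm (assumed member)
    std::vector<double> sums;         // summed reward per arm (assumed member)
    std::size_t total = 0;            // total pulls so far (assumed member)
    std::size_t choice = 0;           // the last choice made
    bool choose_next = true;          // true when choose() is expected next
};
```

A caller would alternate the two calls: `arm = bandit.choose();` followed by `bandit.reward(observed);`. Calling either one twice in a row only prints a warning in this sketch, matching the "warn" wording in the member description rather than aborting.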