Connect++ 0.4.0
A fast, readable connection prover for first-order logic.
Implementation of the UCB algorithm for multiarmed bandits.
#include <UCB.hpp>
| Public Member Functions | |
| size_t | get_choice () | 
| size_t | choose () | 
| Choose using the current state. | |
| void | reward (double) | 
| Provide reward for the most recent choice. | |
| Private Attributes | |
| bool | choose_next | 
| Belt-and-braces: warn if choose/reward happen in the wrong order. | |
| size_t | choice | 
| Store the last choice made. | |
| Static Private Attributes | |
| static boost::random::mt19937 | random_generator | 
| Random source. | |
| Friends | |
| ostream & | operator<< (ostream &, const UCB &) | 
Implementation of the UCB algorithm for multiarmed bandits.
It's easy enough to find a description of this algorithm. The implementation here is pretty much straight from my book.
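For readers who want the idea in concrete form, the following is a minimal sketch of the textbook UCB1 rule, not the Connect++ implementation. The class name `UCB1Sketch`, the constructor taking a number of arms, the `pulls` accessor, the helper `run_demo` and the exploration constant 2 are all assumptions made for illustration. The sketch breaks ties by taking the lowest-numbered arm, whereas the real class holds a random generator. It also throws on a misordered call, where the real class is documented only as warning.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <limits>
#include <stdexcept>
#include <vector>

// Hypothetical illustration only: this is NOT the Connect++ UCB class.
class UCB1Sketch {
public:
    explicit UCB1Sketch(std::size_t num_arms)
        : counts(num_arms, 0), totals(num_arms, 0.0) {
        if (num_arms == 0) throw std::invalid_argument("need at least one arm");
    }

    // Pick the arm with the largest index: mean reward + sqrt(2 ln t / n_i).
    std::size_t choose() {
        if (!choose_next) throw std::logic_error("choose() called twice without reward()");
        choose_next = false;
        ++rounds;  // t counts the current round as well (an assumption of this sketch)
        double best = -std::numeric_limits<double>::infinity();
        for (std::size_t i = 0; i < counts.size(); ++i) {
            // An untried arm gets an infinite index, so every arm is played once first.
            double index = std::numeric_limits<double>::infinity();
            if (counts[i] > 0) {
                const double n = static_cast<double>(counts[i]);
                const double t = static_cast<double>(rounds);
                index = totals[i] / n + std::sqrt(2.0 * std::log(t) / n);
            }
            // Strict comparison: ties go to the lowest-numbered arm.
            if (index > best) { best = index; choice = i; }
        }
        return choice;
    }

    // Record the reward for the arm returned by the most recent choose().
    void reward(double r) {
        if (choose_next) throw std::logic_error("reward() called before choose()");
        ++counts[choice];
        totals[choice] += r;
        choose_next = true;
    }

    std::size_t get_choice() const { return choice; }
    std::size_t pulls(std::size_t arm) const { return counts.at(arm); }

private:
    std::vector<std::size_t> counts;  // times each arm has been rewarded
    std::vector<double> totals;       // summed reward per arm
    std::size_t rounds = 0;           // number of choose() calls so far
    std::size_t choice = 0;           // last arm chosen
    bool choose_next = true;          // true when choose() is the expected next call
};

// Deterministic two-armed demo: arm 0 always pays 0.1, arm 1 always pays 0.9.
UCB1Sketch run_demo(std::size_t num_rounds) {
    UCB1Sketch bandit(2);
    const double payout[2] = {0.1, 0.9};
    for (std::size_t t = 0; t < num_rounds; ++t) {
        const std::size_t arm = bandit.choose();
        assert(arm < 2);
        bandit.reward(payout[arm]);
    }
    return bandit;
}
```

The calling pattern is a strict alternation: each `choose()` is followed by exactly one `reward(r)` for the arm it returned. In the demo the better-paying arm ends up with most of the pulls, while the square-root term still sends an occasional pull to the weaker arm.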
| size_t UCB::choose | ( | ) | 
Choose using the current state.
Definition at line 42 of file UCB.cpp.
| void UCB::reward | ( | double | r | ) | 