PoliciesMultiPlayers.Scenario1 module

Scenario1: make a set of M experts with the following behavior, for K = 2 arms: at every round, one of them is chosen uniformly to predict arm 0, and the rest predict 1.

  • Reference: Beygelzimer, A., Langford, J., Li, L., Reyzin, L., & Schapire, R. E. (2011, April). Contextual Bandit Algorithms with Supervised Learning Guarantees. In AISTATS (pp. 19-26).

class PoliciesMultiPlayers.Scenario1.OneScenario1(mother, playerId)[source]

Bases: PoliciesMultiPlayers.ChildPointer.ChildPointer

OneScenario1: at every round, one of them is chosen uniformly to predict arm 0, and the rest predict 1.

__init__(mother, playerId)[source]

Initialize self. See help(type(self)) for accurate signature.

__str__()[source]

Return str(self).

__repr__()[source]

Return repr(self).

__module__ = 'PoliciesMultiPlayers.Scenario1'
class PoliciesMultiPlayers.Scenario1.Scenario1(nbPlayers, nbArms, lower=0.0, amplitude=1.0)[source]

Bases: PoliciesMultiPlayers.BaseMPPolicy.BaseMPPolicy

Scenario1: make a set of M experts with the following behavior, for K = 2 arms: at every round, one of them is chosen uniformly to predict arm 0, and the rest predict 1.

  • Reference: Beygelzimer, A., Langford, J., Li, L., Reyzin, L., & Schapire, R. E. (2011, April). Contextual Bandit Algorithms with Supervised Learning Guarantees. In AISTATS (pp. 19-26).

__init__(nbPlayers, nbArms, lower=0.0, amplitude=1.0)[source]
  • nbPlayers: number of players to create (in self._players).

Examples:

>>> s = Scenario1(10)
  • To get a list of usable players, use s.children.

  • Warning: s._players is for internal use

__str__()[source]

Return str(self).

_startGame_one(playerId)[source]

Forward the call to self._players[playerId].

_getReward_one(playerId, arm, reward)[source]

Forward the call to self._players[playerId].

_choice_one(playerId)[source]

Forward the call to self._players[playerId].

__module__ = 'PoliciesMultiPlayers.Scenario1'