PyXAB - A Python Library for X-Armed Bandit and Online Blackbox Optimization Algorithms
Yahoo! news article recommendation system using LinUCB
Bandit algorithms
Python implementation of the UCB, EXP3, and Epsilon-Greedy algorithms
A comprehensive Python library implementing a variety of contextual and non-contextual multi-armed bandit algorithms—including LinUCB, Epsilon-Greedy, Upper Confidence Bound (UCB), Thompson Sampling, KernelUCB, NeuralLinearBandit, and DecisionTreeBandit—designed for reinforcement learning applications
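For readers new to the topic, a minimal non-contextual example helps ground entries like the two above: the sketch below runs UCB1 on a Bernoulli bandit. The `UCB1` class and the toy environment are illustrative assumptions only, not the API of any library listed on this page.

```python
import numpy as np

class UCB1:
    """Minimal UCB1 agent for a K-armed Bernoulli bandit (illustrative sketch)."""

    def __init__(self, n_arms):
        self.counts = np.zeros(n_arms)   # number of pulls per arm
        self.values = np.zeros(n_arms)   # running mean reward per arm
        self.t = 0                       # total pulls so far

    def select_arm(self):
        self.t += 1
        # Pull each arm once before applying the confidence bound.
        untried = np.where(self.counts == 0)[0]
        if len(untried) > 0:
            return int(untried[0])
        bonus = np.sqrt(2.0 * np.log(self.t) / self.counts)
        return int(np.argmax(self.values + bonus))

    def update(self, arm, reward):
        self.counts[arm] += 1
        # Incremental update of the empirical mean reward.
        self.values[arm] += (reward - self.values[arm]) / self.counts[arm]

# Toy run: three Bernoulli arms with hidden success probabilities (made up).
rng = np.random.default_rng(0)
probs = [0.2, 0.5, 0.7]
agent = UCB1(n_arms=len(probs))
for _ in range(1000):
    arm = agent.select_arm()
    reward = float(rng.random() < probs[arm])
    agent.update(arm, reward)
print(agent.counts, agent.values)
```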
🐯REPLICA of "Auction-based combinatorial multi-armed bandit mechanisms with strategic arms"
Deep contextual bandits in PyTorch: Neural Bandits, Neural Linear, and Linear Full Posterior Sampling with comprehensive benchmarking on synthetic and real datasets
This repository implements the most popular MAB and CMAB algorithms and lets you watch how they run; it is a good starting point for anyone beginning to learn these topics.
Personal reimplementation of some ML algorithms for learning purposes
A short implementation of bandit algorithms - ETC, UCB, MOSS and KL-UCB
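Of the strategies named in the entry above, Explore-Then-Commit (ETC) is the simplest; a hedged sketch follows, where the `pull` callable, arm probabilities, and round counts are chosen purely for illustration and are not taken from that repository.

```python
import numpy as np

def explore_then_commit(pull, n_arms, m, horizon):
    """Explore-Then-Commit (ETC): pull every arm m times, then commit to the
    arm with the highest empirical mean for the remaining rounds.
    `pull(arm)` is any callable returning a stochastic reward."""
    means = np.zeros(n_arms)
    total = 0.0
    t = 0
    # Exploration phase: m rounds per arm, played round-robin.
    for i in range(m):
        for arm in range(n_arms):
            r = pull(arm)
            means[arm] += (r - means[arm]) / (i + 1)  # incremental mean
            total += r
            t += 1
    # Commitment phase: play the empirically best arm until the horizon.
    best = int(np.argmax(means))
    for _ in range(horizon - t):
        total += pull(best)
    return best, total

# Toy usage with Bernoulli arms (hypothetical probabilities).
rng = np.random.default_rng(1)
probs = [0.3, 0.55, 0.6]
pull = lambda a: float(rng.random() < probs[a])
print(explore_then_commit(pull, n_arms=3, m=50, horizon=5000))
```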
Python library of bandits and RL agents in different real-world environments
Pricing and advertising strategy for an airline company's e-commerce platform, based on Multi-Armed Bandit (MAB) algorithms and Gaussian Processes. Simulations include non-stationary environments.
Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms
Multi-Objective Multi-Armed Bandit
The official code repo for HyperAgent for neural bandits and GPT-HyperAgent for content moderation.
Implementation for NeurIPS 2020 paper "Locally Differentially Private (Contextual) Bandits Learning" (https://arxiv.org/abs/2006.00701)
DPE code - Code used in "Optimal Algorithms for Multiplayer Multi-Armed Bandits" (AISTATS 2020)
Comparative analysis of Markov decision processes & intelligent agents
Python implementations of Reinforcement Learning algorithms -- bandit algorithms, MDPs, Dynamic Programming (value/policy iteration), and Model-free Control (off-policy Monte Carlo, Q-learning)
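Since the entry above also covers tabular MDP methods, here is a hedged sketch of value iteration; the transition tensor, rewards, and discount factor are made-up toy values for illustration and do not come from that repository.

```python
import numpy as np

def value_iteration(P, R, gamma=0.9, tol=1e-8):
    """Value iteration on a tabular MDP (illustrative sketch).
    P[s, a, s'] are transition probabilities, R[s, a] expected rewards."""
    V = np.zeros(P.shape[0])
    while True:
        # Q(s, a) = R(s, a) + gamma * sum_s' P(s' | s, a) * V(s')
        Q = R + gamma * (P @ V)
        V_new = Q.max(axis=1)
        if np.max(np.abs(V_new - V)) < tol:
            break
        V = V_new
    policy = Q.argmax(axis=1)   # greedy policy w.r.t. the converged values
    return V, policy

# Toy 2-state, 2-action MDP (hypothetical numbers).
P = np.array([[[0.9, 0.1], [0.2, 0.8]],
              [[0.8, 0.2], [0.1, 0.9]]])
R = np.array([[1.0, 0.0],
              [0.0, 2.0]])
V, pi = value_iteration(P, R)
print(V, pi)
```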
Add a description, image, and links to the bandit-algorithms topic page so that developers can more easily learn about it.
To associate your repository with the bandit-algorithms topic, visit your repo's landing page and select "manage topics."