reinforcement-learning

Kevin_et_Mimi_le_minotaure

2026-01-30 14:04:34 UTC

Educational project in algorithms and artificial intelligence that consists of building an agent (Kevin) capable of solving a maze.

The project explores several approaches: maze generation (DFS, Prim); pathfinding algorithms (A*, Dijkstra); supervised learning (imitation learning with a CNN); and reinforcement learning (Deep Q-Network).

Visualization tools generate images and GIFs showing Kevin moving through the maze.

Project carried out as part of the AI Developer (Développeur en Intelligence Artificielle) training program at Simplon.

roswell

2026-01-30 11:15:00 UTC

A flexible, AI-friendly robotics toolkit that supports real-time environments and real-world (IRL) hardware.

pqn-control-cdc2025

2025-09-01 13:43:47 UTC

Code associated with the CDC 2025 paper "Control of a bi-stable genetic system via parallelized reinforcement learning".

Polymicrobial Infection

2022-05-23 22:55:31 UTC

Suppressing bacteria in microbial communities with reinforcement learning

libsia

2020-12-17 05:55:52 UTC

SIA - C++/Python library for model-based stochastic estimation and optimal control

NeuralNetTest

2020-11-19 11:01:27 UTC

Test project for neural networks - Handwritten digit recognition on MNIST dataset

real-life-reacher

2020-06-13 10:32:37 UTC

A very simple example of using RL in real life using servo motors.

Rc Car

2020-06-09 09:13:40 UTC

Learn to drive a radio-controlled car using only a camera. Uses a DonkeyCar simulation, SAC and a VAE. By Winder Research.

kullback-leibler-divergence-examples

2020-03-26 15:30:05 UTC

Examples demonstrating Kullback-Leibler divergence.

gym-simple-cliffworld

2020-03-10 18:50:30 UTC

A simplified version of "Cliffworld" as an OpenAI Gym environment.

rl-function-approximation

2020-03-06 10:21:51 UTC

A simple implementation of vanilla greedy-GQ, a reinforcement learning algorithm.

gym-shopping-cart

2019-12-28 10:11:55 UTC

An OpenAI Gym environment for shopping cart reinforcement learning.

cpprb

2019-01-12 04:04:12 UTC

Fast Flexible Replay Buffer Library

gymbag

2017-08-02 05:57:03 UTC

Simple and efficient data recording for OpenAI Gym reinforcement learning environments. https://doctorj.gitlab.io/gymbag/