reinforcement-learning https://gitlab.com/explore/projects/topics/10632 2026-03-04T16:47:47Z
Kevin_et_Mimi_le_minotaure https://gitlab.com/ahmad-training-2026/kevin_et_mimi_le_minotaure 2026-01-30 14:04:34 UTC
<p data-sourcepos="1:1-1:137" dir="auto">Educational project in algorithms and artificial intelligence: building an agent (Kevin) that can solve a maze.</p>
<p data-sourcepos="3:1-3:39" dir="auto">The project explores several approaches:</p>
 
<ul>
<li>Maze generation (DFS, Prim)</li>
<li>Pathfinding algorithms (A*, Dijkstra)</li>
<li>Supervised learning (imitation learning with a CNN)</li>
<li>Reinforcement learning (Deep Q-Network)</li>
</ul>
 
<p data-sourcepos="9:1-9:122" dir="auto">Visualization tools generate images and GIFs showing Kevin moving through the maze.</p>
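To make two of the approaches listed above concrete, here is a minimal Python sketch of DFS maze generation followed by A* pathfinding. It is not taken from the project's repository: the function names `generate_maze` and `a_star`, the dictionary-of-passages maze representation, and the Manhattan heuristic are all assumptions chosen for illustration.

```python
import heapq
import random


def generate_maze(width, height, seed=None):
    """Carve a maze with an iterative depth-first search (hypothetical helper).

    Returns a dict mapping each cell (x, y) to the set of neighbouring
    cells it is connected to, i.e. cells with no wall in between.
    """
    rng = random.Random(seed)
    passages = {(x, y): set() for x in range(width) for y in range(height)}
    start = (0, 0)
    visited = {start}
    stack = [start]
    while stack:
        x, y = stack[-1]
        unvisited = [
            cell
            for cell in ((x + 1, y), (x - 1, y), (x, y + 1), (x, y - 1))
            if cell in passages and cell not in visited
        ]
        if not unvisited:
            stack.pop()  # dead end: backtrack to the previous cell
            continue
        nxt = rng.choice(unvisited)
        passages[(x, y)].add(nxt)  # knock down the wall in both directions
        passages[nxt].add((x, y))
        visited.add(nxt)
        stack.append(nxt)
    return passages


def a_star(passages, start, goal):
    """Return a shortest path from start to goal, or None if unreachable."""

    def heuristic(cell):
        # Manhattan distance never overestimates on a unit-cost grid.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    frontier = [(heuristic(start), 0, start)]
    came_from = {start: None}
    cost = {start: 0}
    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell == goal:
            path = []
            while cell is not None:
                path.append(cell)
                cell = came_from[cell]
            return path[::-1]
        if g > cost[cell]:
            continue  # stale queue entry, a cheaper route was already found
        for nxt in passages[cell]:
            new_g = g + 1
            if new_g < cost.get(nxt, float("inf")):
                cost[nxt] = new_g
                came_from[nxt] = cell
                heapq.heappush(frontier, (new_g + heuristic(nxt), new_g, nxt))
    return None


maze = generate_maze(6, 5, seed=0)
path = a_star(maze, (0, 0), (5, 4))
print(len(path) - 1, "steps from entrance to exit")
```

Because the DFS carves exactly one passage into each newly visited cell, the resulting maze is a spanning tree of the grid, so there is a single route between any two cells and A* is guaranteed to find it.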
<p data-sourcepos="11:1-11:99" dir="auto">Project carried out as part of the Développeur en Intelligence Artificielle (AI Developer) training program at Simplon.</p>
roswell https://gitlab.com/jeremymika/roswell 2026-01-30 11:15:00 UTC
<p data-sourcepos="1:1-1:101" dir="auto">A flexible robotics toolkit that supports real-time environments and is AI friendly for IRL hardware.</p>
pqn-control-cdc2025 https://gitlab.com/lugagnelab/pqn-control-cdc2025 2025-09-01 13:43:47 UTC
<p data-sourcepos="1:1-1:120" dir="auto">Code associated with the CDC 2025 paper "Control of a bi-stable genetic system via parallelized reinforcement learning".</p>
Polymicrobial Infection https://gitlab.com/rl-4-microinfect/poly-infect 2022-05-23 22:55:31 UTC
<p data-sourcepos="1:1-1:73" dir="auto">Suppressing bacteria in microbial communities with reinforcement learning</p>
libsia https://gitlab.com/parkerowan/libsia 2020-12-17 05:55:52 UTC
<p data-sourcepos="1:1-1:82" dir="auto">SIA - C++/Python library for model-based stochastic estimation and optimal control</p>
NeuralNetTest https://gitlab.com/ChriZ98/NeuralNetTest 2020-11-19 11:01:27 UTC
<p data-sourcepos="1:1-1:81" dir="auto">Test project for neural networks - Handwritten digit recognition on MNIST dataset</p>
real-life-reacher https://gitlab.com/WinderAI/rl/projects/real-life-reacher 2020-06-13 10:32:37 UTC
<p data-sourcepos="1:1-1:66" dir="auto">A very simple example of using RL in real life using servo motors.</p>
Rc Car https://gitlab.com/WinderAI/rl/projects/rc-car 2020-06-09 09:13:40 UTC
<p data-sourcepos="1:1-1:164" dir="auto">Learn to drive a radio-controlled car using only a camera. Uses a DonkeyCar simulation, SAC and a VAE. 
By <a href="https://WinderResearch.com" rel="nofollow noreferrer noopener" target="_blank">Winder Research</a>.</p>
kullback-leibler-divergence-examples https://gitlab.com/WinderAI/rl/kullback-leibler-divergence-examples 2020-03-26 15:30:05 UTC
<p data-sourcepos="1:1-1:51" dir="auto">Examples demonstrating Kullback-Leibler divergence.</p>
gym-simple-cliffworld https://gitlab.com/WinderAI/rl/environments/gym-simple-cliffworld 2020-03-10 18:50:30 UTC
<p data-sourcepos="1:1-1:65" dir="auto">A simplified version of "Cliffworld" in an OpenAI Gym Environment</p>
rl-function-approximation https://gitlab.com/WinderAI/rl/algorithms/rl-function-approximation 2020-03-06 10:21:51 UTC
<p data-sourcepos="1:1-1:81" dir="auto">A simple implementation of vanilla greedy-GQ, a reinforcement learning algorithm.</p>
gym-shopping-cart https://gitlab.com/WinderAI/rl/gym-shopping-cart 2019-12-28 10:11:55 UTC
<p data-sourcepos="1:1-1:55" dir="auto">An OpenAI Gym for Shopping Cart Reinforcement Learning.</p>
cpprb https://gitlab.com/ymd_h/cpprb 2019-01-12 04:04:12 UTC
<p data-sourcepos="1:1-1:35" dir="auto">Fast Flexible Replay Buffer Library</p>
gymbag https://gitlab.com/doctorj/gymbag 2017-08-02 05:57:03 UTC
<p data-sourcepos="1:1-2:33" dir="auto">Simple and efficient data recording for OpenAI Gym reinforcement learning environments.
<a href="https://doctorj.gitlab.io/gymbag/" rel="nofollow noreferrer noopener" target="_blank">https://doctorj.gitlab.io/gymbag/</a></p>