reinforcement-learning https://gitlab.com/explore/projects/topics/10632 2026-03-04T16:47:47Z Kevin_et_Mimi_le_minotaure https://gitlab.com/ahmad-training-2026/kevin_et_mimi_le_minotaure 2026-01-30 14:04:34 UTC <p data-sourcepos="1:1-1:137" dir="auto">Projet pédagogique en algorithmie et intelligence artificielle consistant à créer un agent (Kevin) capable de résoudre un labyrinthe.</p>&#x000A;<p data-sourcepos="3:1-3:39" dir="auto">Le projet explore plusieurs approches :</p>&#x000A; &#x000A;Génération de labyrinthes (DFS, Prim)&#x000A;Algorithmes de recherche de chemin (A*, Dijkstra)&#x000A;Apprentissage supervisé (Imitation Learning avec CNN)&#x000A;Apprentissage par renforcement (Deep Q-Network)&#x000A; &#x000A;<p data-sourcepos="9:1-9:122" dir="auto">Des outils de visualisation permettent de générer des images et des GIFs montrant Kevin se déplacer dans le labyrinthe.</p>&#x000A;<p data-sourcepos="11:1-11:99" dir="auto">Projet réalisé dans le cadre de la formation Développeur en Intelligence Artificielle (Simplon).</p> roswell https://gitlab.com/jeremymika/roswell 2026-01-30 11:15:00 UTC <p data-sourcepos="1:1-1:101" dir="auto">A flexible robotics toolkit that supports real-time environments and is AI friendly for IRL hardware.</p> pqn-control-cdc2025 https://gitlab.com/lugagnelab/pqn-control-cdc2025 2025-09-01 13:43:47 UTC <p data-sourcepos="1:1-1:120" dir="auto">Code associated with the CDC 2025 paper "Control of a bi-stable genetic system via parallelized reinforcement learning".</p> Polymicrobial Infection https://gitlab.com/rl-4-microinfect/poly-infect 2022-05-23 22:55:31 UTC <p data-sourcepos="1:1-1:73" dir="auto">Suppressing bacteria in microbial communities with reinforcement learning</p> libsia https://gitlab.com/parkerowan/libsia 2020-12-17 05:55:52 UTC <p data-sourcepos="1:1-1:82" dir="auto">SIA - C++/Python library for model-based stochastic estimation and optimal control</p> NeuralNetTest https://gitlab.com/ChriZ98/NeuralNetTest 2020-11-19 11:01:27 UTC <p data-sourcepos="1:1-1:81" dir="auto">Test project for neural networks - Handwritten digit recognition on MNIST dataset</p> real-life-reacher https://gitlab.com/WinderAI/rl/projects/real-life-reacher 2020-06-13 10:32:37 UTC <p data-sourcepos="1:1-1:66" dir="auto">A very simple example of using RL in real life using servo motors.</p> Rc Car https://gitlab.com/WinderAI/rl/projects/rc-car 2020-06-09 09:13:40 UTC <p data-sourcepos="1:1-1:164" dir="auto">Learn to drive an radio-controlled car using only a camera. Uses a DonkeyCar simulation, SAC and a VAE. By <a href="https://WinderResearch.com" rel="nofollow noreferrer noopener" target="_blank">Winder Research</a>.</p> kullback-leibler-divergence-examples https://gitlab.com/WinderAI/rl/kullback-leibler-divergence-examples 2020-03-26 15:30:05 UTC <p data-sourcepos="1:1-1:51" dir="auto">Examples demonstrating Kullback-Leibler divergence.</p> gym-simple-cliffworld https://gitlab.com/WinderAI/rl/environments/gym-simple-cliffworld 2020-03-10 18:50:30 UTC <p data-sourcepos="1:1-1:65" dir="auto">A simplified version of "Cliffworld" in an OpenAI Gym Environment</p> rl-function-approximation https://gitlab.com/WinderAI/rl/algorithms/rl-function-approximation 2020-03-06 10:21:51 UTC <p data-sourcepos="1:1-1:81" dir="auto">A simple implementation of vanilla greedy-GQ, a reinforcement learning algorithm.</p> gym-shopping-cart https://gitlab.com/WinderAI/rl/gym-shopping-cart 2019-12-28 10:11:55 UTC <p data-sourcepos="1:1-1:55" dir="auto">An OpenAI Gym for Shopping Cart Reinforcement Learning.</p> cpprb https://gitlab.com/ymd_h/cpprb 2019-01-12 04:04:12 UTC <p data-sourcepos="1:1-1:35" dir="auto">Fast Flexible Replay Buffer Library</p> gymbag https://gitlab.com/doctorj/gymbag 2017-08-02 05:57:03 UTC <p data-sourcepos="1:1-2:33" dir="auto">Simple and efficient data recording for OpenAI Gym reinforcement learning environments.&#x000A;<a href="https://doctorj.gitlab.io/gymbag/" rel="nofollow noreferrer noopener" target="_blank">https://doctorj.gitlab.io/gymbag/</a></p>