data-science https://gitlab.com/explore/projects/topics/12803 2026-02-26T23:10:05Z manufacturing-downtime-analysis https://gitlab.com/arisharazakhan-cpu/manufacturing-downtime-analysis 2026-02-26 22:59:37 UTC <p data-sourcepos="1:1-1:287" dir="auto">Supervised learning pipeline for rare event operational failure prediction, integrating leakage resistant preprocessing, class weighted modeling, precision recall threshold calibration, ROC AUC benchmarking, and permutation based feature importance to analyze production stress drivers.</p> Insta Data https://gitlab.com/ahmad-training-2026/insta-data 2026-02-22 05:37:02 UTC <p data-sourcepos="1:1-1:145" dir="auto">Projet d’analyse du comportement des utilisateurs Instagram à partir d’un large dataset synthétique (plus d’un million d’utilisateurs).</p>&#x000A;<p data-sourcepos="3:1-3:19" dir="auto">Le projet explore :</p>&#x000A; &#x000A;Analyse exploratoire des données (EDA)&#x000A;Prédictions de variables comportementales (stress, âge, clics publicitaires, revenu)&#x000A;Identification de profils utilisateurs avec clustering (KMeans)&#x000A;Interprétation des résultats statistiques en lien avec la vie réelle&#x000A; &#x000A;<p data-sourcepos="9:1-10:100" dir="auto">Méthodologie :&#x000A;visualisation, corrélations, sélection de modèles, entraînement, évaluation et interprétation.</p>&#x000A;<p data-sourcepos="12:1-12:72" dir="auto">Technologies : Python, Pandas, NumPy, Matplotlib, Scikit-learn, Jupyter.</p>&#x000A;<p data-sourcepos="14:1-14:99" dir="auto">Projet réalisé dans le cadre de la formation Développeur en Intelligence Artificielle (Simplon).</p> DS-for-LA https://gitlab.com/psark/ds-for-la 2026-02-18 14:27:01 UTC <p data-sourcepos="1:1-1:63" dir="auto">A practical, linear-algebra-first introduction to data science.</p> Next JS Perso Portfolio https://gitlab.com/SalahBerr/nextjs-portfolio-perso 2026-02-07 09:44:05 UTC <p data-sourcepos="1:1-2:227" dir="auto"><gl-emoji title="rocket" data-name="rocket" data-unicode-version="6.0">🚀</gl-emoji> Salah Eddine Berredjem - Portfolio&#x000A;A modern, high-performance portfolio website built with Next.js 14 featuring stunning visuals, smooth animations, and optimized performance. Showcasing expertise in Web Frontend Development, AI/ML Engineering, and Data Science.</p> Pingouins https://gitlab.com/ahmad-training-2026/pingouins 2026-02-03 12:30:49 UTC <p data-sourcepos="1:1-1:130" dir="auto">API FastAPI permettant d’identifier des espèces de pingouins à partir de données tabulaires optionnelles et/ou d’une image.</p>&#x000A;<p data-sourcepos="3:1-3:85" dir="auto">L’API renvoie une liste d’espèces probables avec leurs probabilités et inclut :</p>&#x000A; &#x000A;Classification Machine Learning (RF, KNN, LR)&#x000A;Fusion multimodale (tabulaire + image)&#x000A;Journalisation des requêtes/réponses (SQLite)&#x000A;Interface web pour consulter les logs&#x000A;Interface web pour tester les prédictions&#x000A;Notebooks pour exploration et entraînement&#x000A; &#x000A;<p data-sourcepos="11:1-11:99" dir="auto">Projet réalisé dans le cadre de la formation Développeur en Intelligence Artificielle (Simplon).</p> neural-network https://gitlab.com/akinetic/neural-network 2025-12-11 12:01:29 UTC <p data-sourcepos="1:1-1:231" dir="auto">The efficient alternative to Neural Networks. Implements SLRM (Segmented Linear Regression Model) for neural compression and non-linear data modeling, achieving high precision with a fraction of the parameters of a traditional ANN.</p> insight_agent https://gitlab.com/jonahlierz/insight_agent 2025-12-10 13:48:12 UTC <p data-sourcepos="1:1-1:104" dir="auto">Tool-driven analytics agent using LLM orchestration, schema-based tools, and a semantic data dictionary.</p> mups https://gitlab.com/mhnv/mups 2025-11-13 23:28:22 UTC <p data-sourcepos="1:1-1:169" dir="auto">Turn any Python project folder into a reproducible job template that creates timestamped, isolated run directories with configurable environments and copy/link behavior.</p> Emergency Flood Prediction https://gitlab.com/bwerapol/Emergency-Flood-Prediction 2025-11-11 20:13:10 UTC <p data-sourcepos="1:1-1:251" dir="auto">[2011] Emergency flood forecasting system from Thailand's 2011 crisis. Provided 5-day ARIMA predictions with spatial interpolation across Bangkok, enabling 13 million residents to protect homes when official models failed, shared via public platforms.</p> climate predicton models https://gitlab.com/bwerapol/climate-predicton-model 2025-11-11 20:09:11 UTC <p data-sourcepos="1:1-1:287" dir="auto">[2019-2024] Climate prediction framework using statistical downscaling of GCM data. Combines ARIMA time series, machine learning regression, and stochastic weather generation for 5-day forecasts with spatial interpolation capabilities. The pilot area is the eastern seaboard of Thailand.</p> Data_Analysis_School https://gitlab.com/KLimPALE/Data_Analysis_School 2025-07-11 19:53:21 UTC <p data-sourcepos="1:1-1:58" dir="auto">Fundamental theory and practice in Data Science (DS). <gl-emoji title="abacus" data-name="abacus" data-unicode-version="11.0">🧮</gl-emoji></p> Trainee_TerraLab_DA_WillianYamauti https://gitlab.com/willian.yamauti/trainee_terralab_da 2025-06-07 12:53:06 UTC <p data-sourcepos="1:1-1:259" dir="auto">Este repositório foi criado como parte do programa de Trainee do TerraLab para área de Data Analytics. O objetivo é introduzir os conceitos fundamentais de Análise de Dados por meio de 5 sprints semanais, com foco em Python, Git e visualização de dados.</p> ml-model-netflix-recommendation-system https://gitlab.com/aydie/ml-model-netflix-recommendation-system 2025-05-14 16:03:51 UTC <p data-sourcepos="1:1-2:283" dir="auto">Project information&#x000A;This is the final source code for my model deployment—a Streamlit application for the Netflix Movie Recommendation System, or we can use the Flask framework. The complete model training, exploratory data analysis (EDA), and data preprocessing are available in my GitHub repository.</p>&#x000A;<p data-sourcepos="4:1-4:31" dir="auto">GitHub: github.com/aydiegithub/</p>&#x000A;<p data-sourcepos="6:1-6:45" dir="auto">Live Demo: aydie.in/ml/netflix-recommendation</p>&#x000A;<p data-sourcepos="8:1-8:46" dir="auto">Contact: <a data-sourcepos="8:10-8:26" href="mailto:business@aydie.in">business@aydie.in</a> 9036469492 aydie.in</p> turkiye-car-market-2020 https://gitlab.com/KARSTERRR/turkiye-car-market-2020 2025-05-05 15:51:57 UTC <p data-sourcepos="1:1-1:334" dir="auto">A data science project focused on analyzing a car market dataset from Turkey in 2020. The goal is to explore the data, apply various analytical techniques, and derive insights. The specific direction of analysis will be determined through exploration, with potential for building predictive models or visualizing trends in the market.</p> Kaggle_Playground https://gitlab.com/ibra-kdbra/kaggle_playground 2025-05-01 21:06:58 UTC <p data-sourcepos="1:1-1:77" dir="auto">This repo will have all resources, labs, data which I use/d on Kaggle Network</p> Scalable Machine Learning with SparkML - Census Income Classification https://gitlab.com/cvasu-showcase/scalable_machine_learning_with_SparkML_Census_Income_Classification 2025-04-19 03:56:57 UTC <p data-sourcepos="1:1-1:370" dir="auto">Built a complete machine learning pipeline in SparkML using the Adult Census dataset (~48k rows, 14 features). Implemented data preprocessing, feature encoding, cross-validation, and model training with Logistic Regression and Random Forest. Evaluated models with metrics such as AUC and F1-score. Reflected on scalability trade-offs and optimizations in distributed ML.</p> Lumina https://gitlab.com/pedroflorencio/lumina 2025-03-21 01:55:00 UTC <p data-sourcepos="1:1-1:293" dir="auto">Comparação entre Vision Transformers e Métodos Clássicos de Visão Computacional na Segmentação de Exsudatos Lipídicos em Imagens de Retinopatia Diabética. Trabalho de Conclusão de Curso para obtenção de título de bacharel em Engenharia Elétrica na Universidade Federal do Ceará.</p> web-hypoteste https://gitlab.com/data-science-apps/web-hypoteste 2024-09-25 11:50:37 UTC <p data-sourcepos="1:1-1:137" dir="auto">Aplicativo que facilita a realização do teste de hipótese, através de testes T, com uma interface amigável e de fácil utilização.</p> Data Science ML Component Pipeline https://gitlab.com/gitlab-data/ds-component-pipeline 2024-03-25 14:35:43 UTC <p data-sourcepos="1:1-1:96" dir="auto">Data Science / Machine Learning Pipeline component for training and deploying ML models using CI</p> All in 1 DataScience https://gitlab.com/wayofthewayne/all-in-1-datascience 2023-09-03 14:27:40 UTC <p data-sourcepos="1:1-1:228" dir="auto">This repo is a mix of several data science tools. There is a mix of web-scraping of data that is then cleaned, and used to analyze the property market in malta, using prediction models, visualisations and statistical analysis.</p>&#x000A;<p data-sourcepos="3:1-3:150" dir="auto">There is also visualisations for chess data from the 1980's till 2021. Moreover, there is twitter data, which is then stored in the neo4J nosql dbms.</p>&#x000A;<p data-sourcepos="5:1-5:169" dir="auto">No data is presented in the git, only the results. Code with the data can be found at: <a href="https://drive.google.com/file/d/15EQnRtsngDsFDD_A7g4N1fwCuXI0f_Xi/view?usp=sharing" rel="nofollow noreferrer noopener" target="_blank">https://drive.google.com/file/d/15EQnRtsngDsFDD_A7g4N1fwCuXI0f_Xi/view?usp=sharing</a></p>