Skip to content

This is a folder for my projects/documents on springboard data science career track

Notifications You must be signed in to change notification settings

ZackCode/datasci_workshop

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

54 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

datasci_workshop

This repository contains my notes and support documents for the data science career tack course from Springboard.com. It also serves as an index (and a showcase) for my capstone projects, my mini-projects for data science career track course on Springboard.com mainly, and on other courses or projects.

Capstone Projects

A capstone project is a relatively large project where multiple aspects of data science are employed, including data wrangling, inferential statistics, and machine learning.

Capstone Project 1 (In progress):

This project comes from a Kaggle competition. Walmart wants to know the correlation between weather condition and its sales record. More specifically, Walmart wants a model that can "accurately predict the sales of 111 potentially weather-sensitive products (like umbrellas, bread, and milk) around the time of major weather events at 45 of their retail locations".

To this end, in this capstone project, I would like to accomplish the following objects: 1.A model that will predict product sales in major weather events 2.Evaluate how the weather condition affect the sales

File location: https://github.com/ZackCode/Capstone1_Walmert_Sales

Capstone Project 2 (In progress):

This project comes from the Kaggle competition "Data Science Bowl 2017". This year's topic is lung cancer detection. The goal defined by the competition is:

"Using a data set of thousands of high-resolution lung scans provided by the National Cancer Institute, participants will develop algorithms that accurately determine when lesions in the lungs are cancerous. "

To this end, in this capstone project, I would like to accomplish the following objects: 1.A model that will detect lung cancer. 2.Compare the performance versus other submissions.

File location: https://github.com/ZackCode/Capstone2_Cancer_Detection

Mini-projects

A mini project is a small project where the task is limited to a certain area of data science.

Section 5 Data Wrangling Projects

  1. JSON Based Data Exercises Project report location: https://github.com/ZackCode/mini_project_data_wrangling_json/blob/master/sliderule_dsi_json_exercise.ipynb
  2. SQL Practice Cases at modeanalytics.com Project location: https://github.com/ZackCode/SQL_mini_projects_From_Mode/tree/master/Analysis_1

Section 8 Inferential Statistics and Hypothesis Testing Projects

  1. Human Body Temperature Using EDA: Project location: https://github.com/ZackCode/mini_project_human_temp
  2. Examine Racial Discrimination Using EDA: Project Location: https://github.com/ZackCode/mini_project_racial_disc
  3. Reduce Hospital Readmissions Using EDA : Project Location: https://github.com/ZackCode/mini_project_hospital_readmit

Section 10 Machine Learning Projects

  1. Linear Regression Using Boston Housing Data Set: Project Location: https://github.com/ZackCode/mini_project_linear_regression
  2. Heights and Weights Using Logistic Regression: Project Location: https://github.com/ZackCode/mini_project_logistic_regression
  3. Predicting Movie Ratings from Reviews Using Naive Bayes: Project Location: https://github.com/ZackCode/mini_project_naive_bayes
  4. Customer Segmentation Using Clustering: Project Location: https://github.com/ZackCode/mini_project_clustering

Section 18 Data Science at Scale

  1. MapReduce with Spark: Project Location: https://github.com/ZackCode/spark

Support Documents

To be added

Notes

To be added

About

This is a folder for my projects/documents on springboard data science career track

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published