Titanic Data Preprocessing and Feature Engineering

100DaysofML-Day11

This project walks through data preprocessing and feature engineering on the Titanic dataset. The goal is to clean and transform the raw data so that it is suitable as input to machine learning models.

Dataset

The dataset used in this project is the Titanic dataset, which contains information about the passengers aboard the Titanic. You can download it from Kaggle.

Steps

Import the necessary libraries
Load the dataset
Explore the data
Perform data preprocessing and feature engineering:
- Handle missing values
- Create new features
- Encode categorical variables
Visualize the results
Perform a simple unit test
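
The preprocessing steps listed above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the column names follow the Kaggle Titanic CSV, while the `preprocess` function, the imputation choices (median `Age`, most frequent `Embarked`), and the `FamilySize` and `IsAlone` features are assumptions chosen for the example.

```python
import pandas as pd


def preprocess(df: pd.DataFrame) -> pd.DataFrame:
    """Return a cleaned copy of a Titanic-style DataFrame (illustrative only)."""
    out = df.copy()

    # Handle missing values (assumed strategy): median Age, most frequent Embarked.
    out["Age"] = out["Age"].fillna(out["Age"].median())
    out["Embarked"] = out["Embarked"].fillna(out["Embarked"].mode()[0])

    # Create new features (assumed): family size and a travelling-alone flag.
    out["FamilySize"] = out["SibSp"] + out["Parch"] + 1
    out["IsAlone"] = (out["FamilySize"] == 1).astype(int)

    # Encode categorical variables: binary map for Sex, one-hot for Embarked.
    out["Sex"] = out["Sex"].map({"male": 0, "female": 1})
    out = pd.get_dummies(out, columns=["Embarked"], prefix="Embarked", dtype=int)
    return out


# Tiny hand-made sample in the Kaggle column layout; not real passenger data.
sample = pd.DataFrame({
    "Survived": [0, 1, 1, 0],
    "Pclass": [3, 1, 3, 2],
    "Sex": ["male", "female", "female", "male"],
    "Age": [22.0, 38.0, None, 30.0],
    "SibSp": [1, 1, 0, 0],
    "Parch": [0, 0, 0, 0],
    "Fare": [7.25, 71.28, 7.92, 13.0],
    "Embarked": ["S", "C", None, "S"],
})

cleaned = preprocess(sample)
print(cleaned.head())
```

With the real data, `sample` would be replaced by something like `pd.read_csv("train.csv")`, where the file name is whatever you saved the Kaggle download as.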

Dependencies

Pandas
Matplotlib
Seaborn

Visualization

The project includes visualization using Matplotlib and Seaborn to help you understand the distribution of features and their relationships with the target variable (Survived).
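
As a rough idea of what such plots might look like, the sketch below draws survival rate by sex and passenger counts by class. The choice of plots and the small hand-made sample are assumptions for illustration; the notebook may plot different features.

```python
import matplotlib

# Headless backend so the sketch also runs without a display; omit in a notebook.
matplotlib.use("Agg")

import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns

# Tiny hand-made sample in the Kaggle column layout; not real passenger data.
sample = pd.DataFrame({
    "Survived": [0, 1, 1, 0, 1, 0, 0, 1],
    "Pclass": [3, 1, 3, 2, 1, 3, 2, 1],
    "Sex": ["male", "female", "female", "male",
            "female", "male", "male", "female"],
})

fig, axes = plt.subplots(1, 2, figsize=(10, 4))

# The mean of the 0/1 Survived column per group is that group's survival rate.
sns.barplot(data=sample, x="Sex", y="Survived", ax=axes[0])
axes[0].set_title("Survival rate by sex")

# Count passengers per class, split by outcome.
sns.countplot(data=sample, x="Pclass", hue="Survived", ax=axes[1])
axes[1].set_title("Passenger count by class and outcome")

fig.tight_layout()
# In a notebook the figure renders inline; in a script, call fig.savefig(...) instead.
```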

Unit Test

A simple unit test is included to ensure the correctness of the data preprocessing and feature engineering steps. The test checks that the resulting DataFrame has the expected columns and the correct number of columns.
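
A test of this kind could look like the sketch below, written with the standard `unittest` module. The `preprocess` function, the `EXPECTED_COLUMNS` list, and the sample rows are hypothetical stand-ins, since the notebook's actual function and column set are not shown here.

```python
import unittest

import pandas as pd

# Hypothetical expected output columns for this illustrative pipeline.
EXPECTED_COLUMNS = ["Survived", "Pclass", "Sex", "Age", "SibSp", "Parch", "FamilySize"]


def preprocess(df: pd.DataFrame) -> pd.DataFrame:
    """Stand-in for the notebook's preprocessing (assumed steps)."""
    out = df.copy()
    out["Age"] = out["Age"].fillna(out["Age"].median())
    out["Sex"] = out["Sex"].map({"male": 0, "female": 1})
    out["FamilySize"] = out["SibSp"] + out["Parch"] + 1
    return out


class TestPreprocess(unittest.TestCase):
    def setUp(self):
        # Hand-made rows in the Kaggle column layout; not real passenger data.
        raw = pd.DataFrame({
            "Survived": [0, 1],
            "Pclass": [3, 1],
            "Sex": ["male", "female"],
            "Age": [22.0, None],
            "SibSp": [1, 0],
            "Parch": [0, 2],
        })
        self.result = preprocess(raw)

    def test_expected_columns(self):
        self.assertListEqual(sorted(self.result.columns), sorted(EXPECTED_COLUMNS))

    def test_column_count(self):
        self.assertEqual(self.result.shape[1], len(EXPECTED_COLUMNS))


# Run through a TextTestRunner so the sketch works in a notebook cell as well as a script.
suite = unittest.defaultTestLoader.loadTestsFromTestCase(TestPreprocess)
outcome = unittest.TextTestRunner(verbosity=2).run(suite)
```

Checking both the column names and the column count catches accidental extra columns as well as missing ones.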
