Stroke

Predicting stroke risk from a Kaggle data set.

Stroke Risk

Description

The purpose of this project is to analyze a Kaggle data set and build a predictive model of stroke risk that medical professionals could use to target risk mitigation efforts.

Data

The Kaggle data set was provided by fedesoriano.

The Southern Stroke Mortality Mapping Data was provided by the CDC.

EDA

A pandas-profiling report is available.

Code

The Python code is available as a Jupyter notebook.

If you have trouble with GitHub rendering the file, please try here.

Documentation

Executive Report

Presentation

Executive Presentation

Instructions

To run this notebook locally, install Jupyter, download the data set, install all the library dependencies, and update the file locations in the notebook so that it loads the code and data from your local copies.
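Before opening the notebook, a quick pre-flight check can confirm that the data file and the libraries are in place. The following is a minimal sketch using only the Python standard library. The data path and the package list are assumptions made for illustration, not values taken from this project, so adjust both to match your download location and the notebook's actual imports.

```python
from importlib.util import find_spec
from pathlib import Path

# Hypothetical values: change both to match your local setup.
DATA_PATH = Path("data") / "healthcare-dataset-stroke-data.csv"  # assumed file name
REQUIRED_PACKAGES = ["pandas", "numpy", "sklearn"]  # assumed dependency list


def missing_packages(names):
    """Return the top-level module names that cannot be imported here."""
    return [name for name in names if find_spec(name) is None]


def check_setup(data_path=DATA_PATH, packages=REQUIRED_PACKAGES):
    """Collect setup problems; an empty list means the notebook should be runnable."""
    problems = [f"missing package: {name}" for name in missing_packages(packages)]
    if not Path(data_path).is_file():
        problems.append(f"data file not found: {data_path}")
    return problems


if __name__ == "__main__":
    # Print each problem on its own line; no output means nothing was flagged.
    for problem in check_setup():
        print(problem)
```

If the script reports a missing package, install it with your package manager of choice. If it reports a missing data file, either move the download or edit `DATA_PATH` and the matching path in the notebook.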

Try Anaconda.

Model Folders

Tools

Credits

Surgery image by Olga Guryanova on Unsplash