Tanzanian Water Pumps Classification
Problem Statement
Using data from the Tanzanian Ministry of Water, can you predict which pumps are functional, which need some repairs, and which don’t work at all? Predict one of these three classes based on a number of variables about what kind of pump is operating, when it was installed, and how it is managed. A smart understanding of which water points will fail can improve maintenance operations and ensure that clean, potable water is available to communities across Tanzania.
The methodology I used for my project follows the OSEMN framework, which can be broken down as follows:
- Obtain (import the data)
- Scrub (clean the data, deal with missing values and data types)
- Explore (answer descriptive questions using EDA)
- Model (build our predictive model)
- iNterpret (comment on our model and findings)
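To make the five steps above more concrete, here is a minimal sketch of how they might fit together in Python with pandas and scikit-learn. It is not the project's actual code: the data is a small synthetic stand-in, and the column names (`construction_year`, `pump_type`, `management`, `status_group`), the label strings, the convention that a year of 0 means "unknown", and the helper `scrub` are all assumptions made for illustration.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

LABELS = ["functional", "needs repair", "non functional"]  # assumed label names

# Obtain: synthetic stand-in for the real dataset (all columns are hypothetical).
rng = np.random.default_rng(0)
n = 300
pump_types = ["handpump", "submersible", None]  # None simulates a missing value
year = rng.choice([0, 1985, 2000, 2010], size=n)  # 0 is assumed to mean "unknown"
p_fail = np.where(year == 2010, 0.1, 0.6)  # newer pumps fail less often (made up)
u = rng.random(n)
status = np.where(u < p_fail, LABELS[2],
                  np.where(u < p_fail + 0.1, LABELS[1], LABELS[0]))
df = pd.DataFrame({
    "construction_year": year,
    "pump_type": [pump_types[i] for i in rng.integers(0, 3, size=n)],
    "management": rng.choice(["vwc", "private", "wug"], size=n),
    "status_group": status,
})


def scrub(raw):
    """Scrub: treat year 0 as missing, then fill the missing values."""
    clean = raw.copy()
    years = clean["construction_year"].mask(clean["construction_year"] == 0)
    clean["construction_year"] = years.fillna(years.median())
    clean["pump_type"] = clean["pump_type"].fillna("unknown")
    return clean


clean = scrub(df)

# Explore: share of each status within each pump type.
print(pd.crosstab(clean["pump_type"], clean["status_group"], normalize="index"))

# Model: one-hot encode the categorical columns and fit a random forest.
X = pd.get_dummies(clean.drop(columns="status_group"))
y = clean["status_group"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)
model = RandomForestClassifier(n_estimators=100, random_state=0)
model.fit(X_train, y_train)
pred = model.predict(X_test)
accuracy = accuracy_score(y_test, pred)
print(f"Hold-out accuracy: {accuracy:.2f}")

# iNterpret: which features the forest relied on most.
importances = pd.Series(model.feature_importances_, index=X.columns)
print(importances.sort_values(ascending=False))
```

Because the data here is randomly generated, the printed accuracy and feature importances say nothing about real pumps. The sketch only shows the shape of the workflow, and the same steps would apply once the real data is loaded in place of the synthetic frame.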
Below are all the steps which I implemented:
I tried several other models as well, but the Random Forest Classifier performed best for me. I hope you find this article helpful. Please feel free to contact me at kristinelpetrosyan@gmail.com.