Justin_Hamilton_Portfolio

Click Here to view my Portfolio Repository with all Project Folders

There is currently one project, but this repo is intended to be updated with multiple projects.

Project 1: Home Credit Default Analysis - Kaggle Project for MSBA Capstone (Linked)

Project Overview:

Project Objective: Create a predictive model for Home Credit (Loan Brokerage) to identify safe candidates for lending. The model will use customer financial behavior data to determine whether a customer is likely to default on a loan. The project aims to increase revenue, enhance customer experience, and reduce default rates.

Business Problem: Home Credit faces challenges in identifying safe borrowers among customers who are unfamiliar with banking. Lending to individuals with a higher likelihood of loan default leads to decreased profits and negative customer experiences.

Analytic Problem:

The project focuses on predicting the creditworthiness of customers based on their financial behavior data. The target variable is binary, where 1 represents customers with payment difficulties (not trustworthy borrowers) and 0 represents trustworthy borrowers with a positive repayment history.

The objective is to develop a classification model that accurately identifies good borrowers.

Link to a more detailed Business Problem Statement

Solution to Business Problem

Our group’s solution to the business problem involved implementing a Gradient Boosting Model. This model yielded a Kaggle score of 0.656 and an AUC of 0.669, indicating its reasonable performance.

Based on our model’s analysis, we identified several important factors that significantly influence the likelihood of loan default:

  1. A high number of enquiries were made to the Credit Bureau just one day before the loan application
  2. Clients provide a work phone during the application process
  3. Clients residing in regions of the city with lower ratings, as determined by their address
  4. Loans used for purchasing high-priced goods
  5. A large number of enquiries were made to the Credit Bureau within one hour prior to the application
  6. Mismatch between the client’s permanent address and contact address

These findings can be crucial for Home Credit to assess the creditworthiness of applicants and make informed lending decisions that will lead to minimized risk, cost, increasing revenue, and profit for the company.

Link to Features Ranked by Importance Graph

Individual Contribution

Throughout the phases of this project, I contributed by,

Exploratory Data Analysis:

Link to Jupyter Notebook of EDA Individual Contributions

Link to HTML Version of EDA Individual Contributions

Modeling:

Link to Modeling Individual Contributions in Jupyter Notebook

Link to Modeling Individual Contribution in HTML File form

Presentation

Link to PowerPoint Slide Deck Individual Portion

Impact/Business Value:

Implementing this predictive model enables Home Credit to reduce costs by accurately identifying customers likely to default based on their historical data. It also minimizes the risk of granting loans to high-risk individuals. By using the model’s predictions, Home Credit can selectively offer loans only to customers projected to have a low likelihood of defaulting. This strategic decision-making process effectively mitigates the risk of loan defaults, reduces non-payment costs, and increases revenue by lending to dependable borrowers. Consequently, this combination of risk reduction, cost optimization, and increased revenue leads to a substantial increase in overall profits for Home Credit.

Difficulties Along the Way

Throughout this project, our group faced significant challenges related to the limited memory and processing capacity of our individual laptops. Each team member was using a laptop with minimal storage and computing power. As a result, running the models sometimes took up to 8 hours, leading to extended waiting times and difficulties in fine-tuning and training the models to achieve the desired results. The prolonged execution times hindered our ability to efficiently iterate and optimize the models according to our specific requirements.

Key Take Aways

Throughout this project my main key takeaways were that