⚖️ EquiLend AI — Open Source ML Challenge 2026

Bridging the credit gap with fair, explainable, alternative-data machine learning.

🌍 The Problem

Traditional credit scoring excludes millions of "credit invisible" individuals by relying solely on historical banking data. EquiLend AI addresses this by leveraging alternative data — utility payments, cash flow consistency, and digital footprints — to assess creditworthiness fairly and transparently.

🚀 The Challenge

A Streamlit UI prototype is provided as your starting point. The core "brain" is currently a mathematical placeholder with 4 critical logical flaws, and the data pipeline does not yet exist.

Your Mission

Fix the Core Logic — Resolve the 4 logical bugs in src/app.py
Build the ML Pipeline — Replace the formula with a robust, fair, optimized ML engine
Complete All 15 Tasks — Cover data ingestion, preprocessing, modeling, and evaluation

🐛 Task 00 — Hidden Logical Bugs (Fix First)

Before building the ML pipeline, resolve these 4 bugs in src/app.py:

#	Bug	Description
1	Division by Zero	Score crashes when `utility_bill` is entered as `0`
2	Age Guard Bypass	System allows scoring for users under 18
3	Linear Scaling Flaw	Simple ratio formula must be replaced with a trained ML model
4	State Persistence	Decisions disappear on refresh — must be saved to MongoDB

🛠 Tech Stack

Layer	Tools
Language	Python 3.10+
Frontend	Streamlit
ML Libraries	Scikit-learn, XGBoost, LightGBM, Imbalanced-learn (SMOTE), SHAP
Database	MongoDB Atlas (NoSQL)
Testing & Data	Pytest, Faker, Pandas

📂 Repository Structure

EquiLend-AI/
├── scripts/
│   └── generate_data.py          # RUN FIRST — generates synthetic dataset
├── src/
│   ├── app.py                    # Streamlit Dashboard UI
│   ├── data_ingestion/           # Task 01: MongoDB connection
│   ├── preprocessing/            # Tasks 02, 03, 04, 07: Cleaning & Encoding
│   ├── models/                   # Tasks 05, 06, 10, 11: Training logic
│   └── evaluation/               # Tasks 08, 09, 13: SHAP & Fairness logic
├── tests/                        # Task 12: Unit Tests
├── .env.example                  # MongoDB URI template
├── requirements.txt              # Python dependencies
├── Fairness_Report.md            # Task 15: Final fairness metrics
└── README.md

⚡ Getting Started

1. Prerequisites & Virtual Environment

Ensure Python 3.10+ is installed, then set up your environment:

# Create virtual environment
python -m venv venv

# Activate — Windows
.\venv\Scripts\activate

# Activate — Mac/Linux
source venv/bin/activate

2. Install Dependencies

pip install -r requirements.txt

3. Generate the Mock Dataset

Run this before building any models. It creates a synthetic dataset with intentional missing values and class imbalances:

python scripts/generate_data.py

This generates data/equilend_mock_data.csv. The data/ folder is git-ignored for security.

4. Setup Environment Variables

cp .env.example .env

Open .env and add your MongoDB Atlas URI.

5. Run the Dashboard

python -m streamlit run src/app.py

🏆 Evaluation Rubric

Criteria	🥇 Gold	🥈 Silver	🥉 Bronze
Model Quality	Optimized XGBoost/LightGBM with AUC > 0.85	Basic Random Forest	Hard-coded logic
Explainability	Interactive SHAP plots in Streamlit	Static feature importance in console	None
Fairness	Bias detection script + `Fairness_Report.md`	Fairness mentioned in README only	No bias checking
Security/Eng	Pytest validation + secure `.env` MongoDB	Basic try-except blocks	Hard-coded credentials

📤 Submission Guidelines

Fork this repository
Complete all 15 tasks in the designated src/ subfolders
Populate Fairness_Report.md with your model's final metrics
Submit a Pull Request with a summary of your XGBoost architecture and fairness outcomes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

⚖️ EquiLend AI — Open Source ML Challenge 2026

🌍 The Problem

🚀 The Challenge

Your Mission

🐛 Task 00 — Hidden Logical Bugs (Fix First)

🛠 Tech Stack

📂 Repository Structure

⚡ Getting Started

1. Prerequisites & Virtual Environment

2. Install Dependencies

3. Generate the Mock Dataset

4. Setup Environment Variables

5. Run the Dashboard

🏆 Evaluation Rubric

📤 Submission Guidelines

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 59 Commits
data		data
scripts		scripts
src		src
tests		tests
.env.example		.env.example
.gitignore		.gitignore
Fairness_Report.md		Fairness_Report.md
README.md		README.md
model_utils.py		model_utils.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

⚖️ EquiLend AI — Open Source ML Challenge 2026

🌍 The Problem

🚀 The Challenge

Your Mission

🐛 Task 00 — Hidden Logical Bugs (Fix First)

🛠 Tech Stack

📂 Repository Structure

⚡ Getting Started

1. Prerequisites & Virtual Environment

2. Install Dependencies

3. Generate the Mock Dataset

4. Setup Environment Variables

5. Run the Dashboard

🏆 Evaluation Rubric

📤 Submission Guidelines

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages