datascore

ML readiness scoring for tabular datasets.

Point it at a DataFrame and get a structured report telling you whether your data is ready for ML training — and if not, exactly why.

Install

pip install datascore

Usage

from datascore import score

report = score(df, target="churn")
report.show()

Output

datascore Report

Rows: 7043 | Features: 21 | Target: Churn
Score: 85/100 — READY

WARNINGS

High cardinality: customerID has 7043 unique values
High cardinality: TotalCharges has 6531 unique values
High skew in SeniorCitizen: 1.8332

INFO

No constant features detected
No infinite values detected
Class balance: 73/27

What it checks

Category      Checks
Completeness  Missing values, high missing rate per column
Integrity     Duplicate rows, constant features, infinite values
ML Readiness  Class imbalance, target leakage risk, high cardinality
Distribution  Skew, outliers per column
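
To give a feel for what these categories mean in practice, here is a minimal pandas sketch of a few of them. It is not datascore's implementation: the helper name `basic_checks`, the `cardinality_ratio` threshold of 0.9, and the choice to apply the cardinality check only to non-numeric columns are all assumptions made for illustration. Target leakage and outlier detection are left out.

```python
import numpy as np
import pandas as pd


def basic_checks(df: pd.DataFrame, target: str,
                 cardinality_ratio: float = 0.9) -> dict:
    """Hypothetical helper (not part of datascore): collect raw signals."""
    features = df.drop(columns=[target])
    numeric = features.select_dtypes(include="number")
    non_numeric = features.select_dtypes(exclude="number")
    return {
        # Completeness: fraction of missing values per column
        "missing_rate": df.isna().mean().to_dict(),
        # Integrity: rows that exactly repeat an earlier row
        "duplicate_rows": int(df.duplicated().sum()),
        # Integrity: features holding a single distinct value
        "constant_features": [
            c for c in features.columns
            if features[c].nunique(dropna=False) <= 1
        ],
        # Integrity: number of +inf / -inf cells in numeric features
        "infinite_values": int(
            np.isinf(numeric.to_numpy(dtype=float)).sum()
        ),
        # ML readiness: share of each target class
        "class_balance": df[target].value_counts(normalize=True).to_dict(),
        # ML readiness: non-numeric features that are almost all unique
        # (the 0.9 ratio is an assumed threshold)
        "high_cardinality": [
            c for c in non_numeric.columns
            if non_numeric[c].nunique() > cardinality_ratio * len(df)
        ],
        # Distribution: sample skewness per numeric feature
        "skew": numeric.skew().to_dict(),
    }
```

Each entry is a raw measurement. Deciding which measurements count as blockers, warnings, or info is a separate step that this sketch does not attempt.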

Scoring

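The score starts at 100. Each blocker deducts 15 points and each warning deducts 5. The sample report above has three warnings and no blockers, so it scores 100 - 3 × 5 = 85.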

Score   Verdict
80-100  READY
50-79   NEEDS WORK
0-49    NOT READY
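
The arithmetic above can be sketched in a few lines. The function name `readiness_score` is hypothetical, and clamping the result at 0 is an assumption based on the verdict table bottoming out at 0; the source does not say how datascore handles more deductions than points.

```python
def readiness_score(blockers: int, warnings: int) -> tuple[int, str]:
    """Hypothetical sketch of the scoring rule described in this README."""
    # Start at 100, subtract 15 per blocker and 5 per warning.
    # Assumption: the score is clamped so it never drops below 0.
    score = max(0, 100 - 15 * blockers - 5 * warnings)

    # Map the score onto the verdict bands from the table.
    if score >= 80:
        verdict = "READY"
    elif score >= 50:
        verdict = "NEEDS WORK"
    else:
        verdict = "NOT READY"
    return score, verdict


# The sample report: no blockers, three warnings.
print(readiness_score(blockers=0, warnings=3))  # (85, 'READY')
```

Under this rule a dataset with no blockers can carry up to four warnings and still be READY, while two blockers alone already drop it to NEEDS WORK.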

Why not Great Expectations or Pandera?

Those tools validate data against rules you define. datascore tells you what the problems are without you having to know what to look for first. Assessment, not validation.
