This project predicts Air Quality Levels using Machine Learning algorithms such as:
- π² Decision Tree Classifier
- π³ Random Forest Classifier
The system analyzes environmental and pollution-related parameters like temperature, humidity, PM2.5, PM10, NOβ, CO, and AQI to classify air quality into categories such as:
- Good
- Moderate
- Unhealthy for Sensitive Groups
- Unhealthy
- Hazardous
- Load and analyze air quality datasets
- Perform data preprocessing
- Train Machine Learning models
- Evaluate model performance
- Visualize important insights
- Predict air quality for new samples
A supervised learning algorithm that creates decision rules based on dataset features.
An ensemble learning algorithm that combines multiple decision trees for improved accuracy and reduced overfitting.
| Feature Name | Description |
|---|---|
| temperature_c | Temperature in Celsius |
| humidity_pct | Relative Humidity (%) |
| wind_speed_kmh | Wind Speed (km/h) |
| pm25 | Fine Particulate Matter |
| pm10 | Coarse Particulate Matter |
| no2 | Nitrogen Dioxide |
| co | Carbon Monoxide |
| aqi | Air Quality Index |
| city | City Name |
air_quality_label
Possible labels:
- Good
- Moderate
- Unhealthy for Sensitive
- Unhealthy
- Hazardous
- Python
- Pandas
- NumPy
- Matplotlib
- Seaborn
- Scikit-learn
Install all dependencies using:
pip install pandas numpy matplotlib seaborn scikit-learn