Air Quality Health Analysis¶
A machine learning project for analyzing the relationship between air quality and health outcomes in Spanish provinces. This project combines air quality, health, and socioeconomic datasets into a unified dataset for ML analysis.
Documentation¶
- Project Documentation - Complete documentation
Project Structure¶
This project follows a standard ML project structure:
- ETL Pipeline: Complete data extraction, transformation, and loading pipeline
- Modeling: (To be implemented)
- Monitoring: (To be implemented)
Installation¶
Prerequisites¶
- Python 3.11+
- pip
Setup¶
# Clone the repository
git clone git@github.com:alvarog2491/air-quality-health-analysis.git
cd air-quality-health-analysis
# Install the project
pip install -e .
# Install with ETL dependencies
pip install -e .[etl]
# Install with development dependencies
pip install -e .[dev]
Quick Start¶
Run the complete ETL pipeline:
# After installation, use the CLI command
air-quality-etl
# Or run manually (from project root)
cd src
python3 -m etl_pipeline.main_orchestrator
License¶
MIT License
Academic Use¶
This project is part of a university thesis research.