Skip to content

Air Quality Health Analysis

Tests Status Docs

A machine learning project for analyzing the relationship between air quality and health outcomes in Spanish provinces. This project combines air quality, health, and socioeconomic datasets into a unified dataset for ML analysis.

Documentation

Project Structure

This project follows a standard ML project structure:

  • ETL Pipeline: Complete data extraction, transformation, and loading pipeline
  • Modeling: (To be implemented)
  • Monitoring: (To be implemented)

Installation

Prerequisites

  • Python 3.11+
  • pip

Setup

# Clone the repository
git clone git@github.com:alvarog2491/air-quality-health-analysis.git
cd air-quality-health-analysis

# Install the project
pip install -e .

# Install with ETL dependencies
pip install -e .[etl]

# Install with development dependencies
pip install -e .[dev]

Quick Start

Run the complete ETL pipeline:

# After installation, use the CLI command
air-quality-etl

# Or run manually (from project root)
cd src
python3 -m etl_pipeline.main_orchestrator

License

MIT License

Academic Use

This project is part of a university thesis research.