This project automates the collection, processing, and analysis of CAISO's interconnection queue data.

## Features

- Automated weekly data collection from the CAISO website
- Historical data tracking with dated snapshots
- Comprehensive analysis and KPI generation
- GitHub Actions automation for consistent data updates
- Email notifications for pipeline status
## Quick Start

- Ensure Docker and pipenv are installed
- Clone the repository
- Build the image:

  ```
  docker build -t caiso-queue:latest .
  ```

- Run the complete pipeline:

  ```
  docker run --rm ^
    -v %CD%/reports:/app/reports ^
    -v %CD%/raw:/app/raw ^
    -e SMTP_HOST=... ^
    -e SMTP_USER=... ^
    -e SMTP_PASS=... ^
    -e NOTIFICATION_EMAIL=... ^
    caiso-queue:latest ^
    sh -c "python scripts/run_pipeline.py && python scripts/analyze_queue.py && python scripts/cleanup_raw.py"
  ```
## GitHub Actions Automation

The project includes a GitHub Actions workflow that:

- Downloads the latest CAISO queue report every Monday
- Processes and analyzes the data
- Generates updated reports
- Commits changes back to the repository
For setup instructions, refer to the Quick Start section at the top of this document.
## Generated Reports

The analysis generates the following KPIs in the `reports/` directory (a computation sketch for two of them follows the list):

- **Capacity by Fuel Type** (`capacity_by_fuel.csv`): Aggregated capacity in MW for each fuel type combination in the queue.
- **Project Count by Status** (`project_count_by_status.csv`): Number of projects and total MW capacity, grouped by application status.
- **Top 5 ISO Zones** (`top5_iso_zones.csv`): The 5 ISO zones with the highest active capacity.
- **Weekly Queue Growth** (`weekly_queue_growth.csv`): Weekly growth in MW capacity added to the queue.
- **Cancellation Rate** (`cancellation_rate.csv`): Ratio of withdrawn projects to active projects, measured in MW.
- **Average Lead Time** (`average_lead_time.csv`): Average number of days between receipt of the interconnection request and the queue date.
- **Top Projects by Net MW** (`top_projects_by_net_mw.csv`): The 10 largest projects by net MW contribution to the grid, including project name, location, fuel type, and status.
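As an illustration, here is a minimal sketch of how two of these KPIs might be computed with pandas. The table name (`queue`) and column names (`status`, `net_mw`, `request_date`, `queue_date`) are assumptions, not the project's actual schema:

```python
# Illustrative KPI computation; table and column names are assumptions.
import sqlite3

import pandas as pd

with sqlite3.connect("data/caiso_queue.db") as conn:
    df = pd.read_sql_query("SELECT * FROM queue", conn)

# Cancellation rate: withdrawn MW relative to active MW.
withdrawn_mw = df.loc[df["status"] == "WITHDRAWN", "net_mw"].sum()
active_mw = df.loc[df["status"] == "ACTIVE", "net_mw"].sum()
cancellation_rate = withdrawn_mw / active_mw

# Average lead time: days between request receipt and queue date.
lead_days = (
    pd.to_datetime(df["queue_date"]) - pd.to_datetime(df["request_date"])
).dt.days

print(f"cancellation rate: {cancellation_rate:.2%}")
print(f"average lead time: {lead_days.mean():.1f} days")
```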
## Pipeline Components

- **Data Collection** (`data_collection.py`)
  - Downloads the latest queue report from the CAISO website
  - Saves it with a date suffix for historical tracking
  - Maintains a standard filename for compatibility
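A minimal sketch of this step, assuming the `requests` library; the report URL and filenames are placeholders, not the real CAISO endpoint:

```python
# Sketch of the collection step; REPORT_URL is a placeholder.
import shutil
from datetime import date
from pathlib import Path

import requests

REPORT_URL = "https://example.com/caiso-queue-report.xlsx"  # placeholder

raw_dir = Path("raw")
raw_dir.mkdir(exist_ok=True)

# Dated snapshot for historical tracking.
snapshot = raw_dir / f"caiso_queue_{date.today():%Y%m%d}.xlsx"
response = requests.get(REPORT_URL, timeout=60)
response.raise_for_status()
snapshot.write_bytes(response.content)

# Stable filename so downstream steps need no date logic.
shutil.copyfile(snapshot, raw_dir / "caiso_queue_latest.xlsx")
```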
- **Data Processing** (`parse_queue.py`)
  - Parses the multi-sheet Excel workbook
  - Handles complex header structures
  - Loads data into a SQLite database
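A sketch of the parsing approach with pandas; the two-row header, the sheet layout, and the `queue` table name are assumptions about the workbook:

```python
# Sketch of the parsing step; header depth and table name are assumptions.
import sqlite3

import pandas as pd

# Read every sheet of the workbook into a dict of DataFrames.
sheets = pd.read_excel(
    "raw/caiso_queue_latest.xlsx",
    sheet_name=None,  # None -> {sheet_name: DataFrame}
    header=[0, 1],    # assume a two-row header
)

frames = []
for name, frame in sheets.items():
    # Flatten the two-row header into single column names.
    frame.columns = [
        " ".join(str(part).strip() for part in col).strip()
        for col in frame.columns
    ]
    frames.append(frame)

with sqlite3.connect("data/caiso_queue.db") as conn:
    pd.concat(frames, ignore_index=True).to_sql(
        "queue", conn, if_exists="replace", index=False
    )
```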
- **Analysis** (`analyze_queue.py`)
  - Generates standardized reports and KPIs
  - Outputs CSV files for further analysis
  - Tracks changes over time
- **Maintenance** (`cleanup_raw.py`)
  - Manages historical data retention
  - Cleans up old raw files
  - Maintains optimal storage usage
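A sketch of the retention logic; the 90-day window and the filename pattern are assumptions:

```python
# Sketch of the cleanup step; retention window and pattern are assumptions.
from datetime import datetime, timedelta
from pathlib import Path

RETENTION_DAYS = 90
cutoff = datetime.now() - timedelta(days=RETENTION_DAYS)

for path in Path("raw").glob("caiso_queue_*.xlsx"):
    stamp = path.stem.removeprefix("caiso_queue_")
    try:
        snapshot_date = datetime.strptime(stamp, "%Y%m%d")
    except ValueError:
        continue  # skip files without a date suffix (e.g. the "latest" copy)
    if snapshot_date < cutoff:
        path.unlink()
```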
## Configuration

The pipeline emails a status notification after each run. To configure this for the GitHub Actions workflow (a sketch of how these secrets might be used follows the list):

- Go to your repository on GitHub
- Navigate to Settings → Secrets and variables → Actions
- Add the following secrets:
  - `SMTP_HOST`: Your SMTP server address
  - `SMTP_USER`: SMTP username
  - `SMTP_PASS`: SMTP password
  - `NOTIFICATION_EMAIL`: Notification recipient
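As an illustration, a minimal sketch of sending the status email with these values, assuming Python's standard `smtplib` with STARTTLS on port 587 (the port and message text are assumptions):

```python
# Sketch: send the pipeline status email using the secrets above.
# Port 587/STARTTLS and the message text are assumptions.
import os
import smtplib
from email.message import EmailMessage

msg = EmailMessage()
msg["Subject"] = "CAISO queue pipeline: run complete"
msg["From"] = os.environ["SMTP_USER"]
msg["To"] = os.environ["NOTIFICATION_EMAIL"]
msg.set_content("Pipeline finished. Updated KPIs are in reports/.")

with smtplib.SMTP(os.environ["SMTP_HOST"], 587) as smtp:
    smtp.starttls()
    smtp.login(os.environ["SMTP_USER"], os.environ["SMTP_PASS"])
    smtp.send_message(msg)
```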
For local runs:

- Create a `.env` file:

  ```
  SMTP_HOST=smtp.example.com
  SMTP_USER=your_user
  SMTP_PASS=your_pass
  NOTIFICATION_EMAIL=you@example.com
  ```

- Run with the environment file:

  ```
  docker run --rm ^
    --env-file .env ^
    -v %CD%/reports:/app/reports ^
    -v %CD%/raw:/app/raw ^
    caiso-queue:latest ^
    sh -c "python scripts/run_pipeline.py"
  ```
## Testing

- Place a sample XLSX file in the `raw/` directory
- Run the pipeline locally to verify:
  - Data collection and parsing
  - Report generation
  - Database updates
- Check `data/caiso_queue.db` for processed data
- Verify the reports in the `reports/` directory
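For a quick sanity check that the database was populated, something like the following works (the `queue` table name is an assumption about the schema):

```python
# Sketch: verify the pipeline wrote rows to the SQLite database.
# The table name "queue" is an assumption.
import sqlite3

with sqlite3.connect("data/caiso_queue.db") as conn:
    (count,) = conn.execute("SELECT COUNT(*) FROM queue").fetchone()
print(f"queue table contains {count} rows")
```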