Code

Enable Data Union (EDU) code is stored in a collection of GitHub repositories.

The edu_project_template repository is a template for an implementation repository. Use it to generate an implementation-specific repository, which then stores non-secure configuration and customizations. The remaining repositories are packages that the implementation repository imports into the project and uses as-is.

All of the repositories are listed below with links, followed by a diagram that shows how they fit together to build the overall EDU product.

Implementation template

| Git Repository | Description | GitHub Link |
|---|---|---|
| edu_project_template | Implementation template that imports relevant packages and provides space for extension / customization | https://github.com/edanalytics/edu_project_template |

dbt packages

| Git Repository | Description | GitHub Link |
|---|---|---|
| edu_edfi_source | dbt package for basic unopinionated transformation / unnesting of Ed-Fi data | https://github.com/edanalytics/edu_edfi_source |
| edu_wh | dbt package that creates a configurable analytics data warehouse | https://github.com/edanalytics/edu_wh |
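
As a rough illustration of how an implementation repository might import the two dbt packages above, the fragment below uses dbt's standard `packages.yml` format with `git` and `revision` keys. This is only a sketch: the file contents and the `<release-tag>` values are placeholders assumed for illustration and are not taken from edu_project_template, so check the template repository for the actual package configuration.

```yaml
# packages.yml (hypothetical sketch, not copied from edu_project_template)
packages:
  # Unopinionated transformation / unnesting of Ed-Fi data
  - git: "https://github.com/edanalytics/edu_edfi_source.git"
    revision: "<release-tag>"  # placeholder: pin to a real tag, branch, or commit

  # Configurable analytics data warehouse
  - git: "https://github.com/edanalytics/edu_wh.git"
    revision: "<release-tag>"  # placeholder: pin to a real tag, branch, or commit
```

With a file like this in place, running `dbt deps` from the project directory downloads the pinned packages.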

Airflow setup packages

| Git Repository | Description | GitHub Link |
|---|---|---|
| edfi_api_client | Standalone Python package for interacting with the Ed-Fi API | https://github.com/edanalytics/edfi_api_client |
| edu_edfi_airflow | Provides Airflow hooks and operators for extract-load from the Ed-Fi API to the data warehouse | https://github.com/edanalytics/edu_edfi_airflow |
| ea_airflow_util | Provides Airflow DAGs for other operations like dbt runs and alerts | https://github.com/edanalytics/ea_airflow_util |

Infrastructure packages

| Git Repository | Description | GitHub Link |
|---|---|---|
| edu_db_setup | Scripts to set up database infrastructure and roles within Snowflake | Not yet public; contact us at edu@edanalytics.org for access |
| edu_cloudformation | CloudFormation templates to set up EDU Airflow infrastructure in AWS | Not yet public; contact us at edu@edanalytics.org for access |

Repository / Package Structure

System Diagram