Getting Started for Developers#
Welcome to the EDU getting started guide for developers! If you're deploying, securing, or customizing an EDU data warehouse, you're in the right place. This guide builds on what you saw in the general overview and focuses specifically on what developers need to know before diving into infrastructure, Airflow, dbt, and warehouse configuration.
What you'll be responsible for as a developer#
At a high level, developers take the EDU framework and make it run for a specific education agency. That includes:
- Setting up the cloud environment EDU runs in
- Installing and configuring EDU’s packages
- Managing Airflow, data warehouse, and repository settings
- Extending or customizing the data model when needed
- Ensuring the pipeline runs reliably day to day
What the EDU Deployment Environment Includes#
Before you install anything, it helps to understand the main components that make up an EDU deployment. Once EDU is set up, you’ll typically have:
- An AWS environment running Apache Airflow, with pre-built DAGs that:
- Pull data from the Ed-Fi ODS
- Trigger dbt runs in the data warehouse
- An S3 bucket used for staging raw data before loading it into the warehouse
- A data warehouse, where both raw and modeled tables live
- An implementation GitHub repository, which stores configuration, overrides, and any custom development for your deployment
These pieces give EDU a place to read data from, a place to run code, and a place to store results, everything the pipeline needs to operate reliably.
Getting Started Sections#
| Section | Description |
|---|---|
| Setup and Secure | Learn how to deploy and secure your EDU data warehouse infrastructure |
| Manage and Extend | Learn how to customize, extend, and maintain your EDU implementation |
Choose a section above to begin your journey as an EDU developer.