Sky-Pipe
Summary
Sky-Pipe is a Prefect Dataflow Pipeline that integrates Google Cloud Platform and dbt.
Specifically, the Sky-Pipe fetches daily exchange data from CoinMarketCap.com and
loads this data into Google Cloud Platform, and transforms it with dbt. This pipeline
also utilizes GitHub Actions for dev ops.
Technologies Used
- Google Cloud Secrets Manager
- Google Cloud Artifact Registry
- Python 3.10
- Prefect
Prefect Flow Steps
- Fetch data from CoinMarketCap.com
- Write data to Pandas DataFrame
- Load DataFrame to Parquet file
- Upload Parquet file to Google Cloud Storage bucket
- Load Parquet file from Google Cloud Storage to BigQuery via Load Job
- Trigger dbt job to run transformations within BigQuery
GitHub Actions
- deploys Prefect Agent Docker Image and deploys Prefect Agent to
Google Compute Engine
- deploys Prefect Flows Docker Image and the actual Prefect Flow Deployment
for CoinMarketCap.com.
- deploys Docker Image to Artifact Registry
- Deploy Container to Compute Engine
- deploys Docker Image from Artifact Registry to Compute Engine
- deploys Prefect Deployments to Prefect Cloud