Modern Data Engineering Stack

Overview of the leading cloud ecosystems and preferred data tools behind today's data architectures.

Amazon Web Services

S3: Object Storage
Glue: ETL/Catalog
Redshift: Warehouse
Athena: Query
Lambda: Serverless
Kinesis: Streams
SageMaker: ML Platform
Bedrock: Gen AI

Google Cloud Platform

GCS: Object Storage
BigQuery: Warehouse
Dataflow: Processing
Pub/Sub: Ingestion
Functions: Serverless
Vertex AI: AI & ML

Microsoft Azure

Azure Blob: Object Storage
Synapse: Warehouse
Fabric: Unified Analytics
ADF: Integration
Azure SQL: Relational
Cosmos: NoSQL DB
AI Foundry: AI Platform

Data Tools

Snowflake
Kafka
Airflow
dbt
Postgres
MongoDB
Databricks
MySQL
Redis