Knowledge Base

Results for "transform"

Articles, FAQs, project case studies, and service deep-dives.

Search results

12 results
Article Data & Platform ★ Featured

Introduction to Data Pipelines

What a data pipeline is, the core stages, and when to build vs buy.

data pipeline ETL ELT ingest transform
2 views Mar 30, 2026
Article Data & Platform

Getting Started with dbt (data build tool)

Models, tests, documentation, and the dbt workflow for transforming warehouse data.

dbt data build tool ELT SQL transformation
3 views Mar 30, 2026
Article Data & Platform

DuckDB — Blazing Fast Local Analytics

When to reach for DuckDB instead of Spark, and how to use it effectively.

DuckDB analytics local Parquet S3
3 views Mar 30, 2026
Article Data & Platform

Feature Stores — Bridging Data Engineering and ML

What a feature store is, online vs offline stores, and when to build vs buy.

feature store ML platform Feast training-serving skew online store
3 views Mar 30, 2026
Article Product Engineering

API Gateway — Responsibilities and Implementation Patterns

Authentication, rate limiting, routing, request aggregation, and when not to use a gateway.

API gateway Kong AWS API Gateway BFF rate limiting
3 views Mar 30, 2026
Article Data & Platform

Testing Strategy for Data Pipelines

Unit tests, integration tests, data contract tests, and regression testing for pipelines.

testing data pipeline dbt unit tests integration tests
3 views Mar 30, 2026
Article Data & Platform

Implementing Data Lineage Tracking

Column-level lineage, tools, and why it is critical for debugging and compliance.

data lineage OpenLineage DataHub dbt column lineage
2 views Mar 30, 2026
Article Data & Platform

ETL vs ELT — Which Pattern Should You Use?

Understand the difference between Extract-Transform-Load and Extract-Load-Transform and when each fits.

ETL ELT data warehouse dbt Snowflake
2 views Mar 30, 2026
Article Data & Platform

Apache Spark — Core Concepts and When to Use It

RDDs, DataFrames, Spark SQL, and the use cases where Spark is the right tool.

Spark Apache Spark DataFrames distributed compute Spark SQL
2 views Mar 30, 2026
Article Product Engineering

Event Sourcing and CQRS — Practical Implementation

Event store design, projection rebuilding, and operational realities.

event sourcing CQRS event store projection snapshot
2 views Mar 30, 2026
Article Data & Platform

Running Data Workloads on Kubernetes

Spark on K8s, Airflow on K8s, resource requests, and storage patterns.

Kubernetes K8s Spark Airflow KubernetesExecutor
2 views Mar 30, 2026
Article Product Engineering

Serverless Architecture — When Functions Work and When They Don't

Cold starts, event-driven patterns, cost model, and the right use cases.

serverless AWS Lambda cold start event-driven FaaS
2 views Mar 30, 2026