Search results
20 results
Data Warehouse Modelling — Star Schema and Dimensional Design
Facts, dimensions, slowly changing dimensions, and why modelling choices matter for query performance.
Apache Iceberg — The Open Table Format Explained
Snapshots, schema evolution, partition evolution, time travel, and compaction.
Introduction to Data Pipelines
What a data pipeline is, the core stages, and when to build vs buy.
Database Schema Migration Strategies
Expand-contract pattern, zero-downtime migrations, and tooling.
Multi-Tenancy Patterns — Database-per-Tenant, Schema-per-Tenant, and Row-Level
Tradeoffs for SaaS data isolation, compliance, and operational complexity.
Getting Started with dbt (data build tool)
Models, tests, documentation, and the dbt workflow for transforming warehouse data.
Implementing Data Lineage Tracking
Column-level lineage, tools, and why it is critical for debugging and compliance.
Data Lake vs Data Warehouse vs Lakehouse
Practical comparison of the three architectures and how to choose.
Data Observability — Detecting Silent Pipeline Failures
Freshness, volume, distribution, schema, and lineage monitoring for data reliability.
Delta Lake — ACID Transactions for Your Data Lake
Transaction log, upserts, schema enforcement, and time travel on S3.
Event Sourcing and CQRS — Practical Implementation
Event store design, projection rebuilding, and operational realities.
Schema Registry and Avro for Kafka Data Contracts
Why schema management matters for streaming pipelines and how to implement it.
Migrating from MySQL to PostgreSQL
Schema translation, data migration, and common incompatibilities to address.
MongoDB Schema Design Patterns
Embedding vs referencing, the subset pattern, and indexing strategy.
GraphQL vs REST — When to Use Each
Comparing query flexibility, over-fetching, tooling, and operational complexity.
Parquet vs CSV — Why Columnar Storage Matters
How Parquet's columnar format reduces storage costs and speeds up analytical queries.
Data & Platform — Service Overview
Pipelines, vector stores, governance, and privacy-first data design.
Testing Strategy for Data Pipelines
Unit tests, integration tests, data contract tests, and regression testing for pipelines.
OpenAPI Spec-First API Development
Write the contract before writing code — benefits, tooling, and workflow.
Data Contracts — Formalising Agreements Between Producers and Consumers
Schema, SLAs, semantics, and how to enforce data contracts in practice.