Knowledge Base

Results for "column lineage"

Articles, FAQs, project case studies, and service deep-dives.

Search results

21 results
Article Product Engineering ★ Featured

Database Schema Migration Strategies

Expand-contract pattern, zero-downtime migrations, and tooling.

database migration expand-contract zero downtime Flyway Liquibase
47 views May 24, 2026
Article Data & Platform ★ Featured

Data Governance — Principles and Practical Implementation

Ownership, cataloguing, lineage tracking, and access control at scale.

data governance data catalog lineage DataHub access control
45 views May 24, 2026
Article Data & Platform ★ Featured

PostgreSQL Performance Tuning Fundamentals

Indexing strategy, EXPLAIN ANALYZE, vacuum, and configuration settings that matter most.

PostgreSQL performance indexing EXPLAIN vacuum
47 views May 24, 2026
Article Product Engineering ★ Featured

SQL Query Optimisation — Indexes, Execution Plans, and N+1

Practical techniques for making slow queries fast.

SQL query optimisation indexes N+1 EXPLAIN
45 views May 24, 2026
Article Data & Platform ★ Featured

Privacy-First Data Design — PII Handling Patterns

Tokenisation, pseudonymisation, encryption at rest, and right-to-deletion workflows.

PII privacy GDPR pseudonymisation tokenisation
51 views May 24, 2026
Article Data & Platform ★ Featured

Apache Iceberg — The Open Table Format Explained

Snapshots, schema evolution, partition evolution, time travel, and compaction.

Apache Iceberg open table format time travel schema evolution ACID
47 views May 24, 2026
Article Product Engineering ★ Featured

Multi-Tenancy Patterns — Database-per-Tenant, Schema-per-Tenant, and Row-Level

Tradeoffs for SaaS data isolation, compliance, and operational complexity.

multi-tenancy SaaS database per tenant row-level security PostgreSQL
46 views May 23, 2026
Article Data & Platform ★ Featured

Data Warehouse Modelling — Star Schema and Dimensional Design

Facts, dimensions, slowly changing dimensions, and why modelling choices matter for query performance.

data warehouse star schema dimensional modeling SCD fact table
44 views May 19, 2026
Article Data & Platform

Getting Started with dbt (data build tool)

Models, tests, documentation, and the dbt workflow for transforming warehouse data.

dbt data build tool ELT SQL transformation
53 views May 24, 2026
Article Data & Platform

Data Observability — Detecting Silent Pipeline Failures

Freshness, volume, distribution, schema, and lineage monitoring for data reliability.

data observability freshness volume distribution Monte Carlo
41 views May 24, 2026
Article Data & Platform

Snowflake Best Practices for Cost and Performance

Virtual warehouses, clustering, query optimization, and controlling spend.

Snowflake cost optimization virtual warehouse clustering query tuning
46 views May 24, 2026
Article Computer Repair

Fix 100% CPU Usage in Windows

Identify what is consuming CPU and permanently resolve the issue.

CPU usage 100% CPU performance WMI Windows Defender
44 views May 24, 2026
Article Product Engineering

API Pagination — Cursor, Offset, and Keyset Patterns

When each method works, performance tradeoffs, and implementation details.

pagination cursor offset keyset API design
44 views May 24, 2026
Article Data & Platform

Parquet vs CSV — Why Columnar Storage Matters

How Parquet's columnar format reduces storage costs and speeds up analytical queries.

Parquet CSV columnar storage compression PyArrow
47 views May 24, 2026
Article Data & Platform

Building a Data Catalog with DataHub

Ingestion, metadata, search, and making your catalog actually useful.

DataHub data catalog metadata lineage discoverability
44 views May 24, 2026
Article Data & Platform

BigQuery Cost and Performance Optimization

Partitioned tables, clustered tables, slot usage, and avoiding full scans.

BigQuery GCP partitioning clustering cost optimization
43 views May 24, 2026
Service Service Descriptions

Data & Platform — Service Overview

Pipelines, vector stores, governance, and privacy-first data design.

data platform pipelines vector store governance privacy
41 views May 23, 2026
Article Data & Platform

Data Platform Cost Optimization Strategies

Reducing Snowflake, S3, Spark, and Kafka spend without sacrificing performance.

cost optimization Snowflake S3 Spark Kafka
45 views May 22, 2026
Article Data & Platform

Amazon Redshift — Architecture and Query Optimization

Distribution styles, sort keys, VACUUM, ANALYZE, and WLM tuning.

Redshift AWS distribution key sort key VACUUM
50 views May 21, 2026
Article Data & Platform

Delta Lake — ACID Transactions for Your Data Lake

Transaction log, upserts, schema enforcement, and time travel on S3.

Delta Lake ACID upsert MERGE time travel
44 views May 19, 2026
Article Data & Platform

Implementing Data Lineage Tracking

Column-level lineage, tools, and why it is critical for debugging and compliance.

data lineage OpenLineage DataHub dbt column lineage
44 views May 19, 2026