Knowledge Base

Results for "column lineage"

Articles, FAQs, project case studies, and service deep-dives.

Search results

21 results
Article Data & Platform ★ Featured

Data Warehouse Modelling — Star Schema and Dimensional Design

Facts, dimensions, slowly changing dimensions, and why modelling choices matter for query performance.

data warehouse star schema dimensional modeling SCD fact table
1 view Mar 30, 2026
Article Data & Platform ★ Featured

Apache Iceberg — The Open Table Format Explained

Snapshots, schema evolution, partition evolution, time travel, and compaction.

Apache Iceberg open table format time travel schema evolution ACID
1 view Mar 30, 2026
Article Data & Platform ★ Featured

PostgreSQL Performance Tuning Fundamentals

Indexing strategy, EXPLAIN ANALYZE, vacuum, and configuration settings that matter most.

PostgreSQL performance indexing EXPLAIN vacuum
1 view Mar 30, 2026
Article Data & Platform ★ Featured

Privacy-First Data Design — PII Handling Patterns

Tokenisation, pseudonymisation, encryption at rest, and right-to-deletion workflows.

PII privacy GDPR pseudonymisation tokenisation
1 view Mar 30, 2026
Article Product Engineering ★ Featured

Database Schema Migration Strategies

Expand-contract pattern, zero-downtime migrations, and tooling.

database migration expand-contract zero downtime Flyway Liquibase
1 view Mar 30, 2026
Article Data & Platform ★ Featured

Data Governance — Principles and Practical Implementation

Ownership, cataloguing, lineage tracking, and access control at scale.

data governance data catalog lineage DataHub access control
1 view Mar 30, 2026
Article Product Engineering ★ Featured

Multi-Tenancy Patterns — Database-per-Tenant, Schema-per-Tenant, and Row-Level

Tradeoffs for SaaS data isolation, compliance, and operational complexity.

multi-tenancy SaaS database per tenant row-level security PostgreSQL
1 view Mar 30, 2026
Article Product Engineering ★ Featured

SQL Query Optimisation — Indexes, Execution Plans, and N+1

Practical techniques for making slow queries fast.

SQL query optimisation indexes N+1 EXPLAIN
2 views Mar 30, 2026
Article Data & Platform

Getting Started with dbt (data build tool)

Models, tests, documentation, and the dbt workflow for transforming warehouse data.

dbt data build tool ELT SQL transformation
1 view Mar 30, 2026
Article Data & Platform

Implementing Data Lineage Tracking

Column-level lineage, tools, and why it is critical for debugging and compliance.

data lineage OpenLineage DataHub dbt column lineage
1 view Mar 30, 2026
Article Computer Repair

Fix 100% CPU Usage in Windows

Identify what is consuming CPU and permanently resolve the issue.

CPU usage 100% CPU performance WMI Windows Defender
1 view Mar 30, 2026
Article Data & Platform

Snowflake Best Practices for Cost and Performance

Virtual warehouses, clustering, query optimization, and controlling spend.

Snowflake cost optimization virtual warehouse clustering query tuning
2 views Mar 30, 2026
Article Data & Platform

Data Observability — Detecting Silent Pipeline Failures

Freshness, volume, distribution, schema, and lineage monitoring for data reliability.

data observability freshness volume distribution Monte Carlo
1 view Mar 30, 2026
Article Data & Platform

Delta Lake — ACID Transactions for Your Data Lake

Transaction log, upserts, schema enforcement, and time travel on S3.

Delta Lake ACID upsert MERGE time travel
1 view Mar 30, 2026
Article Data & Platform

Amazon Redshift — Architecture and Query Optimization

Distribution styles, sort keys, VACUUM, ANALYZE, and WLM tuning.

Redshift AWS distribution key sort key VACUUM
1 view Mar 30, 2026
Article Data & Platform

Parquet vs CSV — Why Columnar Storage Matters

How Parquet's columnar format reduces storage costs and speeds up analytical queries.

Parquet CSV columnar storage compression PyArrow
1 view Mar 30, 2026
Article Product Engineering

API Pagination — Cursor, Offset, and Keyset Patterns

When each method works, performance tradeoffs, and implementation details.

pagination cursor offset keyset API design
1 view Mar 30, 2026
Service Service Descriptions

Data & Platform — Service Overview

Pipelines, vector stores, governance, and privacy-first data design.

data platform pipelines vector store governance privacy
2 views Mar 30, 2026
Article Data & Platform

Data Platform Cost Optimization Strategies

Reducing Snowflake, S3, Spark, and Kafka spend without sacrificing performance.

cost optimization Snowflake S3 Spark Kafka
1 view Mar 30, 2026
Article Data & Platform

Building a Data Catalog with DataHub

Ingestion, metadata, search, and making your catalog actually useful.

DataHub data catalog metadata lineage discoverability
1 view Mar 30, 2026
Article Data & Platform

BigQuery Cost and Performance Optimization

Partitioned tables, clustered tables, slot usage, and avoiding full scans.

BigQuery GCP partitioning clustering cost optimization
1 view Mar 30, 2026