Search results
35 resultsPostgreSQL Performance Tuning Fundamentals
Indexing strategy, EXPLAIN ANALYZE, vacuum, and configuration settings that matter most.
Data Warehouse Modelling — Star Schema and Dimensional Design
Facts, dimensions, slowly changing dimensions, and why modelling choices matter for query performance.
SQL Query Optimisation — Indexes, Execution Plans, and N+1
Practical techniques for making slow queries fast.
Multi-Tenancy Patterns — Database-per-Tenant, Schema-per-Tenant, and Row-Level
Tradeoffs for SaaS data isolation, compliance, and operational complexity.
Building a Data Quality Framework
Dimensions of data quality, validation layers, and monitoring in production pipelines.
Apache Iceberg — The Open Table Format Explained
Snapshots, schema evolution, partition evolution, time travel, and compaction.
Privacy-First Data Design — PII Handling Patterns
Tokenisation, pseudonymisation, encryption at rest, and right-to-deletion workflows.
Database Schema Migration Strategies
Expand-contract pattern, zero-downtime migrations, and tooling.
Schema Registry and Avro for Kafka Data Contracts
Why schema management matters for streaming pipelines and how to implement it.
Recover Deleted Files Without Paid Software
Use Recuva, TestDisk, and Shadow Copies to get files back.
Change Data Capture (CDC) — Debezium and Log-Based CDC
How CDC works, why it beats polling, and how to implement it with Debezium.
Implementing Data Lineage Tracking
Column-level lineage, tools, and why it is critical for debugging and compliance.
Parquet vs CSV — Why Columnar Storage Matters
How Parquet's columnar format reduces storage costs and speeds up analytical queries.
Data Platform Cost Optimization Strategies
Reducing Snowflake, S3, Spark, and Kafka spend without sacrificing performance.
Building a Data Catalog with DataHub
Ingestion, metadata, search, and making your catalog actually useful.
Fix NTFS Errors and File System Corruption
Repair partition table and NTFS filesystem corruption using built-in and free tools.
BigQuery Cost and Performance Optimization
Partitioned tables, clustered tables, slot usage, and avoiding full scans.
Convert a Windows 10 Boot Disk from Legacy BIOS (MBR) to UEFI (GPT)
Step-by-step guide to switching an existing Windows 10 installation from MBR/Legacy BIOS boot to GPT/UEFI — without reinstalling Windows — using Microsoft's bu…
Implementing Data Retention Policies
Legal requirements, technical implementation, and automated deletion workflows.
Delta Lake — ACID Transactions for Your Data Lake
Transaction log, upserts, schema enforcement, and time travel on S3.
Migrating from MySQL to PostgreSQL
Schema translation, data migration, and common incompatibilities to address.
Event-Driven Data Architecture Patterns
Event sourcing, CQRS, outbox pattern, and when event-driven beats request/response.
Amazon Redshift — Architecture and Query Optimization
Distribution styles, sort keys, VACUUM, ANALYZE, and WLM tuning.
Background Job Queue Design
Idempotency, retries, dead letter queues, and job observability.
New SSD Not Showing Up in Windows
Initialize, partition, and format a new SSD that does not appear in File Explorer.
Data Observability — Detecting Silent Pipeline Failures
Freshness, volume, distribution, schema, and lineage monitoring for data reliability.
Snowflake Best Practices for Cost and Performance
Virtual warehouses, clustering, query optimization, and controlling spend.
Getting Started with dbt (data build tool)
Models, tests, documentation, and the dbt workflow for transforming warehouse data.
Time-Series Databases — InfluxDB vs TimescaleDB vs ClickHouse
Comparing purpose-built and general-purpose solutions for time-series data.
PostgreSQL Replication — Streaming, Logical, and Read Replicas
Set up read replicas, understand WAL, and choose between streaming and logical replication.
Trino (formerly PrestoSQL) — Federated SQL Across Data Sources
Architecture, connectors, query federation, and performance tuning.
Data Lake vs Data Warehouse vs Lakehouse
Practical comparison of the three architectures and how to choose.
Materialised Views — When and How to Use Them
Incremental refresh, use cases, and implementation across Postgres, Snowflake, and dbt.
Extracting Microservices from a Monolith
The strangler fig pattern, identifying seams, and avoiding the distributed monolith.
Event Sourcing and CQRS — Practical Implementation
Event store design, projection rebuilding, and operational realities.