Search results
22 resultsHow we run a one-week research spike
The exact process we use to de-risk a technical bet in five days.
Building a Data Quality Framework
Dimensions of data quality, validation layers, and monitoring in production pipelines.
Introduction to Data Pipelines
What a data pipeline is, the core stages, and when to build vs buy.
CI/CD Pipeline Design — From Commit to Production
Stages, gates, deployment strategies, and keeping pipelines fast.
Dependency Management and Supply Chain Security
Lock files, vulnerability scanning, SBOM, and keeping dependencies up to date.
API Testing Strategy — Unit, Integration, Contract, and E2E
Building a test pyramid that catches real bugs without slowing delivery.
Do you sign NDAs?
Standard policy on confidentiality and IP.
Event Sourcing and CQRS — Practical Implementation
Event store design, projection rebuilding, and operational realities.
Data Lake vs Data Warehouse vs Lakehouse
Practical comparison of the three architectures and how to choose.
Getting Started with dbt (data build tool)
Models, tests, documentation, and the dbt workflow for transforming warehouse data.
Monitoring and Alerting for Data Pipelines
What to monitor, SLIs/SLOs for data, and building effective alerting.
Trino (formerly PrestoSQL) — Federated SQL Across Data Sources
Architecture, connectors, query federation, and performance tuning.
Container Registry Management and Image Lifecycle
Tagging conventions, vulnerability scanning, retention policies, and registry options.
Building a Data Catalog with DataHub
Ingestion, metadata, search, and making your catalog actually useful.
Data Contracts — Formalising Agreements Between Producers and Consumers
Schema, SLAs, semantics, and how to enforce data contracts in practice.
Docker Containerisation Best Practices
Writing efficient Dockerfiles, multi-stage builds, security hardening, and image size reduction.
Change Data Capture (CDC) — Debezium and Log-Based CDC
How CDC works, why it beats polling, and how to implement it with Debezium.
Feature Stores — Bridging Data Engineering and ML
What a feature store is, online vs offline stores, and when to build vs buy.
Fixing CPU Overheating — Thermal Paste and Cooler Guide
How to diagnose thermal throttling, replace thermal paste, and improve CPU cooler performance.
Extracting Microservices from a Monolith
The strangler fig pattern, identifying seams, and avoiding the distributed monolith.
How long does a typical project take?
Timeline expectations from kick-off to launch.
WebSockets — Building Real-Time Features
Connection lifecycle, heartbeats, reconnection logic, and scaling with Redis pub/sub.