Knowledge Base

 Results for "vector database"

Articles, FAQs, project case studies, and service deep-dives.

Main site

Search results

27 results
Article Data & Platform ★ Featured

Graph Databases — When to Use Neo4j Over Relational

Nodes, edges, Cypher queries, and use cases where graph beats SQL.

Neo4j graph database Cypher knowledge graph fraud detection
1 views Mar 30, 2026
Article Data & Platform ★ Featured

Introduction to Data Pipelines

What a data pipeline is, the core stages, and when to build vs buy.

data pipeline ETL ELT ingest transform
1 views Mar 30, 2026
Article Applied AI & ML ★ Featured

What is Retrieval-Augmented Generation (RAG)?

A plain-English explanation of RAG: why it beats pure LLM memory for production knowledge systems.

RAG LLM embeddings vector database retrieval
2 views Mar 30, 2026
Article Product Engineering ★ Featured

Database Schema Migration Strategies

Expand-contract pattern, zero-downtime migrations, and tooling.

database migration expand-contract zero downtime Flyway Liquibase
1 views Mar 30, 2026
Article Product Engineering ★ Featured

Multi-Tenancy Patterns — Database-per-Tenant, Schema-per-Tenant, and Row-Level

Tradeoffs for SaaS data isolation, compliance, and operational complexity.

multi-tenancy SaaS database per tenant row-level security PostgreSQL
1 views Mar 30, 2026
Article Data & Platform ★ Featured

Choosing a vector database: pgvector vs Pinecone vs Weaviate

A practical comparison across dimensions that matter for production RAG systems.

vector database pgvector Pinecone Weaviate embeddings
3 views Mar 30, 2026
Article Product Engineering ★ Featured

SQL Query Optimisation — Indexes, Execution Plans, and N+1

Practical techniques for making slow queries fast.

SQL query optimisation indexes N+1 EXPLAIN
2 views Mar 30, 2026
Article Data & Platform

Change Data Capture (CDC) — Debezium and Log-Based CDC

How CDC works, why it beats polling, and how to implement it with Debezium.

CDC change data capture Debezium Kafka WAL
1 views Mar 30, 2026
Article Data & Platform

DuckDB — Blazing Fast Local Analytics

When to reach for DuckDB instead of Spark, and how to use it effectively.

DuckDB analytics local Parquet S3
1 views Mar 30, 2026
Article Product Engineering

Database Connection Pooling — Why It Matters and How to Configure It

Pool sizing, connection lifetime, and debugging pool exhaustion.

connection pool database HikariCP PgBouncer pool sizing
1 views Mar 30, 2026
Article Data & Platform

Secrets Management for Data Platforms

HashiCorp Vault, AWS Secrets Manager, and patterns for rotating credentials safely.

secrets management Vault AWS Secrets Manager credentials rotation
1 views Mar 30, 2026
Article Data & Platform

Airflow Best Practices for Production Pipelines

Idempotency, backfilling, SLA misses, and common pitfalls to avoid.

Airflow best practices idempotency backfill SLA
1 views Mar 30, 2026
Article Data & Platform

Real-Time Analytics Architecture Patterns

Lambda, Kappa, HTAP, and choosing the right pattern for sub-second analytics.

real-time analytics ClickHouse Druid Flink HTAP
1 views Mar 30, 2026
Article Data & Platform

Migrating from MySQL to PostgreSQL

Schema translation, data migration, and common incompatibilities to address.

MySQL PostgreSQL migration pgloader schema translation
1 views Mar 30, 2026
Article Product Engineering

The Twelve-Factor App — Principles for Modern Services

How the twelve factors apply to real production services today.

twelve-factor twelve-factor app cloud native stateless config
1 views Mar 30, 2026
Article Product Engineering

Logging Best Practices for Production Services

Structured logging, log levels, correlation IDs, and log aggregation.

logging structured logs correlation ID log levels Loki
1 views Mar 30, 2026
Article Product Engineering

Extracting Microservices from a Monolith

The strangler fig pattern, identifying seams, and avoiding the distributed monolith.

microservices monolith strangler fig distributed system architecture
1 views Mar 30, 2026
Article Data & Platform

Time-Series Databases — InfluxDB vs TimescaleDB vs ClickHouse

Comparing purpose-built and general-purpose solutions for time-series data.

time-series InfluxDB TimescaleDB ClickHouse metrics
1 views Mar 30, 2026
Article Data & Platform

Running Data Workloads on Kubernetes

Spark on K8s, Airflow on K8s, resource requests, and storage patterns.

Kubernetes K8s Spark Airflow KubernetesExecutor
1 views Mar 30, 2026
Article Product Engineering

API Pagination — Cursor, Offset, and Keyset Patterns

When each method works, performance tradeoffs, and implementation details.

pagination cursor offset keyset API design
1 views Mar 30, 2026
Service Service Descriptions

Data & Platform — Service Overview

Pipelines, vector stores, governance, and privacy-first data design.

data platform pipelines vector store governance privacy
2 views Mar 30, 2026
Article Data & Platform

Vector Embeddings — How They Work and Where They Live

From text to vectors, similarity search, and choosing the right embedding model.

embeddings vector search ANN HNSW MTEB
1 views Mar 30, 2026
Article Product Engineering

Implementing Search — From Basic SQL to Elasticsearch

Full-text search progression from LIKE queries to dedicated search engines.

search full-text search Elasticsearch PostgreSQL vector search
1 views Mar 30, 2026
Article Product Engineering

API Testing Strategy — Unit, Integration, Contract, and E2E

Building a test pyramid that catches real bugs without slowing delivery.

API testing unit tests integration tests contract testing Pact
1 views Mar 30, 2026
Article Data & Platform

Building a Data Catalog with DataHub

Ingestion, metadata, search, and making your catalog actually useful.

DataHub data catalog metadata lineage discoverability
1 views Mar 30, 2026
Article Product Engineering

API Idempotency — Safe Retries for Mutations

Idempotency keys, implementation, and which HTTP methods are idempotent by definition.

idempotency API Idempotency-Key POST retries
1 views Mar 30, 2026
Article Product Engineering

Database Connection Patterns in PHP

PDO, prepared statements, connection pooling, and transaction management.

PHP PDO prepared statements transactions database
1 views Mar 30, 2026