Open Source • Apache 2.0

A Search & Analytics
Polystore

Self-tuning database for agentic search and analytics workloads.

A unified, open source stack that blends search, analytics, and low-latency access at scale.

90%
Cost Savings
< 1s
Search Latency
PB
Scale
100%
API Compatible

KalDB Excels At

Built for the toughest log analytics challenges

Self-Hosted Deployments

Run in your own AWS, GCP, or Azure account. Full data sovereignty and compliance control.

Dramatic Cost Savings

90% lower TCO than Elasticsearch. Pay only for S3 storage and active compute.

Petabyte Scale

Built for massive log volumes. Scale indexing and query compute independently.

Enterprise Ready

Production-proven at Slack. SSO, RBAC, and compliance certifications available.

The Log Analytics Challenge

Traditional solutions force painful tradeoffs

10x
More Log Volume

Microservices and observability have exploded log data. Legacy solutions can't keep up economically.

100x
Longer Retention

Compliance requires years of retention. Per-GB storage costs become astronomical.

0
Latency Tolerance

When incidents happen, every second counts. Search must be instant, not minutes.

Legacy Search Limitations

Why teams are switching to KalDB

Complexity

Cluster management, shard rebalancing, and capacity planning require dedicated teams.

Scaling Issues

Coupled compute and storage means over-provisioning. Scaling is slow and expensive.

Licensing Concerns

Elastic's license changes left many teams scrambling for alternatives.

Technical Debt

Years of patches and workarounds. Upgrading is painful and risky.

Performance at Scale

Query times degrade as data grows. Hot-warm architectures add complexity.

Pricing Opacity

SaaS providers charge per-GB ingestion. Bills spike unpredictably.

Vendor Lock-in

Proprietary features trap your data. Migration becomes a major project.

Data Sovereignty

SaaS means your logs leave your infrastructure. Compliance becomes complicated.

Outdated Architecture

Not built for cloud-native. S3 is an afterthought, not a foundation.

Built For Your Use Case

Proven at scale across industries

Self-Hosted Search

Deploy in your own cloud account with full control. Meet compliance requirements without compromising on features.

  • Data never leaves your VPC
  • SOC2, HIPAA, GDPR ready
  • Full infrastructure control

Observability at Scale

Handle millions of events per second. Correlate logs, metrics, and traces across your entire infrastructure.

  • Sub-second queries at PB scale
  • Grafana & Kibana compatible
  • Real-time alerting

Security Analytics

Detect threats and investigate incidents in real-time. Retain security logs for years at minimal cost.

  • SIEM integration ready
  • Long-term retention at $0.023/GB
  • Forensic investigation

Enterprise-Grade Infrastructure

Built for mission-critical workloads

99.99%
Uptime SLA
90%
Cost Reduction
10x
Less Infrastructure
11+
Integrations

Works With Your Stack

Seamless integration with tools you already use

AWS
AWS
GCP
GCP
Azure
Azure
Kubernetes
Kubernetes
Docker
Docker
Terraform
Terraform
Prometheus
Prometheus
Grafana
Grafana
Ansible
Ansible
Jenkins
Jenkins
GitHub
GitHub
Fluent Bit

Cloud-Native Architecture

Built from the ground up for S3

The Ingestion Path

1
Durable Write to S3
Logs are written durably to S3 first for long-term retention and compliance
2
Real-time Indexing
Lucene indexes built on-demand for query time ranges (typically within seconds)
3
Index Caching
Active indexes cached locally for fast repeated queries

The Query Path

1
Index Lookup
Queries hit cached Lucene indexes for fast results (< 1s)
2
On-Demand Indexing
Older logs indexed on-demand from S3 when queried
3
Independent Scaling
Query and indexing compute scale independently based on load

Architecture Philosophy

KalDB decouples compute from storage to deliver cost-efficient search at scale. By leveraging S3 for durable storage and building indexes on-demand, you only pay for active compute and storage. Perfect for log-heavy workloads where retention is long but queries are sporadic.

Get Started in Minutes

Self-host KalDB with Docker or run in your cloud

Local Deployment

# Clone and start
git clone https://github.com/slackhq/kaldb
cd kaldb
docker-compose up
✓ Running on localhost:8080