Self-Hosting

Deploy Recall on your own infrastructure with Docker or Kubernetes.

Self-hosting Recall requires an Enterprise subscription. Contact sales@tryrecall.com for pricing and deployment options.

Requirements

Resource	Small	Standard	Production
CPU	2 cores	4 cores	8+ cores
RAM	12 GB	16 GB	32+ GB
Storage	20 GB SSD	50 GB SSD	100+ GB SSD
Docker	20.10+	20.10+	Latest

Small: Development, testing, single user (1-5 users) Standard: Teams (5-50 users), moderate workloads Production: Large teams (50+ users), high availability, heavy workflow execution

Resource requirements are driven by workflow execution (isolated-vm sandboxing), file processing (in-memory document parsing), and vector operations (pgvector). Memory is typically the constraining factor rather than CPU. Production telemetry shows the main app uses 4-8 GB average with peaks up to 12 GB under heavy load.

Quick Start

git clone https://github.com/tryrecall/recall.git && cd recall
docker compose -f docker-compose.prod.yml up -d

Open http://localhost:3000

Deployment Options

Docker

Deploy with Docker Compose on any server

Kubernetes

Deploy with Helm on Kubernetes clusters

Cloud Platforms

Railway, DigitalOcean, AWS, Azure, GCP guides

Architecture

Component	Port	Description
recall	3000	Main application
copilot	3003	AI assistant
realtime	3002	WebSocket server
db	5432	PostgreSQL with pgvector
migrations	-	Database migrations (runs once)

Common Questions

At minimum you need 2 CPU cores, 12 GB RAM, 20 GB SSD storage, and Docker 20.10 or later. Memory is typically the constraining factor due to workflow execution (isolated-vm sandboxing), file processing, and vector operations (pgvector).

Recall uses PostgreSQL 17 with the pgvector extension for vector similarity search. The Docker setup uses the pgvector/pgvector:pg17 image, which comes with the vector extension pre-installed.

Three secrets are required: BETTER_AUTH_SECRET (authentication), ENCRYPTION_KEY (data encryption), and INTERNAL_API_SECRET (service-to-service auth). Generate each with openssl rand -hex 32. You also need to set NEXT_PUBLIC_APP_URL and BETTER_AUTH_URL to your domain.

The realtime service is a dedicated WebSocket server that runs on port 3002. It handles real-time communication for features like live workflow execution updates. It has a 1 GB memory limit and requires its own DATABASE_URL and BETTER_AUTH_SECRET configuration.

The migrations container runs once at startup (restart: no) and executes bun run db:migrate to apply database schema changes. It depends on the database being healthy before running and must complete successfully before the main application starts.

Yes. Recall supports Ollama for local model inference. Use docker-compose.ollama.yml instead of docker-compose.prod.yml. It offers both GPU (with NVIDIA support) and CPU-only profiles, and automatically pulls gemma3:4b as a starter model.