Overview
This guide walks you through deploying Smallest Self-Host using Docker Compose. You’ll have a fully functional speech-to-text service running in under 15 minutes.Ensure you’ve completed all prerequisites before
starting this guide.
Step 1: Create Project Directory
Create a directory for your deployment:Step 2: Login to Container Registry
Authenticate with the Smallest container registry using credentials provided by support:Step 3: Create Environment File
Create a.env file with your license key:
your-license-key-here with the actual license key provided by Smallest.ai.
Step 4: Create Docker Compose File
- Lightning ASR (Standard)
Best for: Fast inference, real-time applicationsCreate a
docker-compose.yml file:docker-compose.yml
Step 5: Additional Configuration for Lightning ASR
- Lightning ASR
Add the model URL to your The MODEL_URL is provided by Smallest.ai support.
.env file (required for Lightning ASR):Step 6: Start Services
Launch all services with Docker Compose:Step 7: Monitor Startup
Watch the logs to monitor startup progress:1
Redis Ready
2
License Proxy Ready
3
Model Service Ready
Lightning ASR:
4
API Server Ready
Common Startup Issues
GPU Not Found
GPU Not Found
Error: If this fails, reinstall NVIDIA Container Toolkit.
could not select device driver "nvidia"Solution:License Validation Failed
License Validation Failed
Error:
License validation failedSolution:- Verify LICENSE_KEY in
.envis correct - Check internet connectivity
- Ensure firewall allows HTTPS to console-api.smallest.ai
Model Download Failed
Model Download Failed
Error:
Failed to download modelSolution:- Verify MODEL_URL in
.envis correct - Check disk space:
df -h - Check internet connectivity
Port Already in Use
Port Already in Use
Error: Either stop the conflicting service or change the port in docker-compose.yml
port is already allocatedSolution:
Check what’s using the port:
