Overview
Before deploying Smallest Self-Host ASR on Kubernetes, ensure your cluster meets the requirements and you have the necessary tools and credentials.
Kubernetes Cluster Requirements
Minimum Cluster Specifications
Kubernetes Version
v1.19 or higher (v1.24+ recommended)
Node Count
Minimum 2 nodes
- 1 CPU node (control plane/general)
- 1 GPU node (Lightning ASR)
Total Resources
Minimum cluster capacity
- 8 CPU cores
- 32 GB RAM
- 1 NVIDIA GPU
Storage
Persistent volume support
- Storage class available
- 100 GB minimum capacity
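The version requirement above can be checked from a script. The following is a minimal sketch, not part of the Smallest tooling: `version_at_least` is a hypothetical helper name, and it compares only the major and minor numbers. The commented usage lines assume kubectl already points at the target cluster and read the kubelet version of the first node, which may differ slightly from the API server version.

```shell
# Succeeds (exit 0) when version $1 is at least version $2.
# Only major.minor are compared; a leading "v" and any patch suffix are ignored.
version_at_least() {
  have="${1#v}"; want="${2#v}"
  have_major="${have%%.*}"; rest="${have#*.}"; have_minor="${rest%%[!0-9]*}"
  want_major="${want%%.*}"; rest="${want#*.}"; want_minor="${rest%%[!0-9]*}"
  [ "$have_major" -gt "$want_major" ] ||
    { [ "$have_major" -eq "$want_major" ] && [ "$have_minor" -ge "$want_minor" ]; }
}

# Example usage against a live cluster:
#   node_version="$(kubectl get nodes -o jsonpath='{.items[0].status.nodeInfo.kubeletVersion}')"
#   version_at_least "$node_version" 1.19 && echo "Kubernetes version OK"
#
# To eyeball CPU and memory capacity per node:
#   kubectl get nodes -o custom-columns='NAME:.metadata.name,CPU:.status.capacity.cpu,MEMORY:.status.capacity.memory'
```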
Required Tools
Install the following tools on your local machine:
Helm
Helm 3.0 or higher is required. It is available for macOS, Linux, and Windows.
kubectl
Kubernetes CLI tool for cluster management. It is available for macOS, Linux, and Windows.
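Once both tools are installed, a quick check that they are on your PATH can save time later. This is a sketch under the assumption of a POSIX shell; `missing_tools` is a hypothetical helper name introduced here for illustration.

```shell
# Prints one line per tool that is not found on PATH.
# Returns non-zero if at least one tool is missing.
missing_tools() {
  rc=0
  for tool in "$@"; do
    if ! command -v "$tool" >/dev/null 2>&1; then
      echo "missing: $tool"
      rc=1
    fi
  done
  return "$rc"
}

# Example usage:
#   missing_tools helm kubectl && helm version --short && kubectl version --client
```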
Cluster Access
Configure kubectl
Ensure kubectl is configured to access your cluster.
Test Cluster Access
Verify you have sufficient permissions. The permission check should return yes.
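A permission check of this kind can be scripted with `kubectl auth can-i`, which answers yes or no for the current user. The sketch below is an illustration only: `can_create_all` is a hypothetical helper, and the resource types in the usage comment are assumptions, since the exact permissions the chart needs are not listed here.

```shell
# Succeeds only if `kubectl auth can-i create <resource>` answers "yes"
# for every resource passed as an argument.
can_create_all() {
  for resource in "$@"; do
    answer="$(kubectl auth can-i create "$resource" 2>/dev/null || true)"
    if [ "$answer" != "yes" ]; then
      echo "cannot create: $resource"
      return 1
    fi
  done
}

# Example usage against a live cluster:
#   kubectl config current-context   # confirm which cluster kubectl targets
#   kubectl cluster-info             # confirm the API server is reachable
#   can_create_all deployments services secrets
```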
GPU Support
NVIDIA GPU Operator
For Kubernetes clusters, install the NVIDIA GPU Operator to manage GPU resources.
The Smallest Self-Host Helm chart includes the GPU Operator as an optional dependency. You can enable it during installation or install it separately.
Verify GPU Nodes
Check that GPU nodes are properly labeled. Each GPU node should show nvidia.com/gpu in its capacity.
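One way to confirm this without reading through full node descriptions is to print the nvidia.com/gpu capacity per node and count the nodes that report at least one GPU. This is a sketch; `gpu_node_count` is a hypothetical helper name, and it assumes the GPU Operator (or device plugin) is already running so that nodes advertise the resource.

```shell
# Prints the number of nodes whose capacity reports at least one nvidia.com/gpu.
# Nodes without the resource show "<none>" in the GPU column and are skipped.
gpu_node_count() {
  kubectl get nodes --no-headers \
    -o custom-columns='NAME:.metadata.name,GPU:.status.capacity.nvidia\.com/gpu' |
    awk '$2 ~ /^[0-9]+$/ && $2 > 0 { n++ } END { print n + 0 }'
}

# Example usage against a live cluster:
#   [ "$(gpu_node_count)" -ge 1 ] && echo "GPU node available"
```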
Credentials
Obtain the following from Smallest.ai before installation:
License Key
Container Registry Credentials
Credentials to pull Docker images from quay.io:
- Username
- Password
These are set in values.yaml.
Model URLs
Storage Requirements
Storage Class
Verify a storage class is available. The output should show at least one storage class, either marked (default) or otherwise available.
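The storage class listing can be summarized in one line. The sketch below relies on kubectl appending "(default)" to the name of the default storage class in its table output; `storage_class_summary` is a hypothetical helper name.

```shell
# Prints how many storage classes exist and how many are marked as default.
storage_class_summary() {
  kubectl get storageclass --no-headers |
    awk '{ total++ } /\(default\)/ { defaults++ }
         END { printf "%d storage class(es), %d default\n", total, defaults }'
}

# Example usage against a live cluster:
#   storage_class_summary
```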
For AWS Deployments
If deploying on AWS EKS, you’ll need:
- EBS CSI Driver for block storage
- EFS CSI Driver for shared file storage (recommended for model storage)
Network Requirements
Required Ports
Ensure the following ports are accessible within the cluster:

| Port | Service | Purpose |
|---|---|---|
| 7100 | API Server | Client API requests |
| 2269 | Lightning ASR | Internal ASR processing |
| 3369 | License Proxy | Internal license validation |
| 6379 | Redis | Internal caching |
External Access
The License Proxy requires outbound HTTPS access to:
- console-api.smallest.ai (port 443)
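A basic reachability probe for this endpoint can be written with curl. This is a sketch; `can_reach_https` is a hypothetical helper name. It only confirms that a TLS connection and an HTTP exchange complete, regardless of the HTTP status code. Running it from a workstation only approximates cluster egress, so for a faithful result run the same curl command from a pod inside the cluster.

```shell
# Succeeds if an HTTPS request to host $1 completes within 10 seconds.
# The HTTP status code is ignored; only connectivity and TLS are tested.
can_reach_https() {
  curl --silent --show-error --max-time 10 --output /dev/null "https://$1/"
}

# Example usage:
#   can_reach_https console-api.smallest.ai && echo "outbound HTTPS OK"
```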
Optional Components
Prometheus & Grafana
For monitoring and autoscaling based on custom metrics:
- Prometheus Operator (included in chart)
- Grafana (included in chart)
- Prometheus Adapter (included in chart)
- Custom metrics-based autoscaling
- Advanced monitoring dashboards
- Performance visualization
Cluster Autoscaler
For automatic node scaling on AWS EKS:
- IAM role with autoscaling permissions
- IRSA (IAM Roles for Service Accounts) configured
Namespace
Decide on a namespace for deployment:
- Default Namespace: deploy to the default namespace.
- Custom Namespace: deploy to a dedicated namespace.
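If you choose a custom namespace, it can be created ahead of time in an idempotent way. This is a sketch; `ensure_namespace` is a hypothetical helper, and the namespace name `smallest` in the usage comment is only an example, not a name the chart requires.

```shell
# Creates namespace $1 only if it does not already exist.
ensure_namespace() {
  kubectl get namespace "$1" >/dev/null 2>&1 || kubectl create namespace "$1"
}

# Example usage against a live cluster:
#   ensure_namespace smallest
```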
Verification Checklist
Before proceeding, ensure:
1. Cluster Access
2. GPU Nodes Available
3. Helm Installed
4. Storage Available
5. Credentials Ready
- License key obtained
- Container registry credentials
- Model download URL
6. Sufficient Resources
AWS-Specific Prerequisites
If deploying on AWS EKS, see:
AWS EKS Setup
Complete guide for setting up EKS cluster with GPU support
What’s Next?
Once all prerequisites are met, proceed to the quick start:
Quick Start
Deploy Smallest Self-Host with Helm

