Common Issues

Overview

This guide provides quick solutions to the most common issues encountered with Smallest Self-Host across Docker and Kubernetes deployments.

Installation Issues

License Key Invalid

Symptoms:

License validation failed
Invalid license key
Services fail to start

Quick Fix:

Verify License Key

Check for extra spaces, quotes, or line breaks

printf '%s' "$LICENSE_KEY" | wc -c

The count should match the exact length of your key. printf is used rather than echo so that no trailing newline is counted.
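As a rough sketch of the whitespace and quote check, the helper below inspects a key string and prints one line per formatting problem it finds. The name `check_key_format` is hypothetical and not part of Smallest Self-Host. It only looks at formatting and says nothing about whether the license server will accept the key.

```shell
# Report common formatting problems in a license key string.
# Prints one line per problem found; prints nothing for a clean key.
# check_key_format is an illustrative helper, not a shipped tool.
check_key_format() {
  local key="$1"
  [ -n "$key" ] || echo "is empty"
  case $key in
    *[[:space:]]*) echo "contains whitespace or a line break" ;;
  esac
  case $key in
    *\"*|*\'*) echo "contains a quote character" ;;
  esac
}
```

Call it as `check_key_format "$LICENSE_KEY"`; no output means none of these problems was detected.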

Test Manually

Query the license server directly:

curl -H "Authorization: Bearer ${LICENSE_KEY}" https://console-api.smallest.ai/validate

Contact Support

If the key appears correct, email support@smallest.ai and include your license key and error logs.

Image Pull Failed

Symptoms:

ImagePullBackOff
unauthorized: authentication required
manifest unknown

Quick Fix:

Docker:

docker login quay.io
Username: your-username
Password: your-password

docker pull quay.io/smallestinc/lightning-asr:latest

Kubernetes:

Verify the secret exists:

kubectl get secrets -n smallest | grep registry

Recreate if needed:

kubectl create secret docker-registry registry-secret \
  --docker-server=quay.io \
  --docker-username=your-username \
  --docker-password=your-password \
  --docker-email=your-email \
  -n smallest

Model Download Failed

Symptoms:

Lightning ASR stuck at startup
Failed to download model
Connection timeout

Quick Fix:

Verify URL:
```
curl -I $MODEL_URL
```
Check disk space:
```
df -h
```
At least 30 GB of free space is required.
Test network:
```
wget --spider $MODEL_URL
```
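The disk space requirement can also be checked from a script. This is a minimal sketch, assuming the model is stored on the filesystem that holds the path you pass in; `has_free_space` is an illustrative name, not a shipped tool.

```shell
# Succeed when the filesystem holding a path has at least min_gb free.
# min_gb defaults to 30, the figure this guide quotes for the model download.
has_free_space() {
  local path="$1"
  local min_gb="${2:-30}"
  local avail_kb
  # df -Pk prints POSIX-format output in 1024-byte blocks;
  # column 4 of the second line is the available space.
  avail_kb=$(df -Pk "$path" | awk 'NR == 2 { print $4 }')
  [ "$avail_kb" -ge $((min_gb * 1024 * 1024)) ]
}
```

For example, `has_free_space /var/lib/docker || echo "not enough space"`, where the path is a placeholder for wherever your deployment stores models.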

Increase timeout (Kubernetes):

lightningAsr:
  env:
    - name: DOWNLOAD_TIMEOUT
      value: "3600"

Runtime Issues

High Latency

Symptoms:

Requests taking >1 second
Slow transcription
Timeouts

Quick Fix:

Check GPU Utilization

nvidia-smi
kubectl exec -it <lightning-asr-pod> -- nvidia-smi

If GPU util < 50%:

Model not loaded properly
CPU bottleneck
Check logs for errors

If GPU util > 90%:

Scale up replicas
Add more GPU nodes
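The two thresholds above can be turned into a small helper for scripted checks. This is a sketch only: `gpu_util_hint` is a hypothetical name, and the messages simply restate the guidance in this section.

```shell
# Map a GPU utilization percentage (integer) to the guidance in this section.
gpu_util_hint() {
  local util="$1"
  if [ "$util" -lt 50 ]; then
    echo "low: check model loading, CPU bottlenecks, and logs"
  elif [ "$util" -gt 90 ]; then
    echo "high: scale up replicas or add GPU nodes"
  else
    echo "ok"
  fi
}
```

One way to feed it a live value is `gpu_util_hint "$(nvidia-smi --query-gpu=utilization.gpu --format=csv,noheader,nounits | head -n 1)"`, which reads the first GPU only.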

Check Resource Limits

kubectl describe pod <pod-name> | grep -A5 "Limits"

Increase if needed:

lightningAsr:
  resources:
    limits:
      memory: 16Gi
      cpu: 8

Enable GPU Persistence Mode

sudo nvidia-smi -pm 1

Or in Kubernetes:

gpu-operator:
  driver:
    env:
      - name: NVIDIA_DRIVER_CAPABILITIES
        value: "compute,utility"

Out of Memory

Symptoms:

Pod killed (exit code 137)
OOMKilled status
Memory errors in logs
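Exit code 137 follows the usual convention of 128 plus the number of the signal that ended the process: 137 - 128 = 9, and signal 9 is SIGKILL, which is what the kernel sends when a container exceeds its memory limit. The sketch below decodes any such exit code; `exit_code_signal` is an illustrative name.

```shell
# Decode an exit code above 128 into the number of the terminating signal.
# Exit codes of 128 or below did not come from a signal, so print "none".
exit_code_signal() {
  local code="$1"
  if [ "$code" -gt 128 ]; then
    echo $((code - 128))
  else
    echo "none"
  fi
}
```

Running `kill -l "$(exit_code_signal 137)"` then prints the name of that signal.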

Quick Fix:

Increase memory limit:

lightningAsr:
  resources:
    limits:
      memory: 20Gi
    requests:
      memory: 16Gi

Check for memory leaks:
```
kubectl top pod <pod-name>
```
Restart pod:
```
kubectl delete pod <pod-name>
```

Connection Refused

Symptoms:

Cannot connect to API
Connection refused
Service unavailable

Quick Fix:

Check Services Running

kubectl get pods -n smallest
docker compose ps

All pods and containers should show Running or Up.

Verify Endpoints

kubectl get endpoints -n smallest
curl http://localhost:7100/health

Check Firewall

sudo iptables -L
netstat -tuln | grep 7100
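A plain `grep 7100` also matches ports such as 17100 or unrelated columns. As a stricter sketch, the hypothetical helper below reads `netstat -tuln` output on stdin and succeeds only when the local address column ends in the exact port. It assumes the Linux net-tools column layout, where the local address is the fourth field.

```shell
# Read `netstat -tuln` output on stdin and succeed when a socket is bound
# to the given port. Assumes the local address is column 4.
port_is_listening() {
  local port="$1"
  awk -v p=":$port" '
    $4 ~ (p "$") { found = 1 }   # local address ends in ":<port>"
    END { exit !found }          # exit 0 only if a match was seen
  '
}
```

Usage: `netstat -tuln | port_is_listening 7100 && echo "port 7100 is bound"`.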

Performance Issues

Slow Autoscaling

Symptoms:

HPA not scaling fast enough
Pods stuck in Pending
Cluster Autoscaler delayed

Quick Fix:

Reduce HPA stabilization:

scaling:
  auto:
    lightningAsr:
      hpa:
        scaleUpStabilizationWindowSeconds: 0

Check that metrics are available:

kubectl get hpa
kubectl get --raw "/apis/custom.metrics.k8s.io/v1beta1"

Verify node capacity:

kubectl describe nodes | grep -A5 "Allocated resources"

Request Queue Building Up

Symptoms:

Increasing active requests
Users experiencing delays
HPA shows high metrics

Quick Fix:

Manual scale up:

kubectl scale deployment lightning-asr --replicas=10

Check autoscaling limits:

scaling:
  auto:
    lightningAsr:
      hpa:
        maxReplicas: 20

Add cluster capacity:

eksctl scale nodegroup --cluster=smallest-cluster --name=gpu-nodes --nodes=5

Data Issues

Transcription Quality Poor

Symptoms:

Low confidence scores
Incorrect transcriptions
Missing words

Quick Fix:

Check audio quality:
- Sample rate: 16 kHz minimum (44.1 kHz recommended)
- Format: WAV or FLAC preferred
- Channels: Mono for best results
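For WAV input, the channel count and sample rate can be read straight from the file header. The sketch below is a rough illustration under a stated assumption: the file uses the canonical 44-byte PCM header, with the channel count at bytes 22-23 and the sample rate at bytes 24-27, both little-endian. Files with extra chunks before the format chunk will give wrong values. `wav_info` is a hypothetical name.

```shell
# Print "<channels> <sample_rate>" for a WAV file with a canonical 44-byte header.
wav_info() {
  local file="$1"
  # Read 6 bytes starting at offset 22 as unsigned decimal values,
  # then load them into the positional parameters $1..$6.
  set -- $(od -An -t u1 -j 22 -N 6 "$file")
  # Combine the little-endian bytes into integers.
  local channels=$(( $1 + ($2 << 8) ))
  local rate=$(( $3 + ($4 << 8) + ($5 << 16) + ($6 << 24) ))
  echo "$channels $rate"
}
```

A result such as `1 16000` would mean mono audio at 16 kHz, which meets the minimum listed above.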

Enable punctuation:

{
  "url": "...",
  "punctuate": true,
  "language": "en"
}

Verify correct language:
```
{
  "url": "...",
  "language": "es"
}
```
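The two request bodies above can be combined and generated from a script. This is a minimal sketch: `build_listen_body` is an illustrative helper, and it inserts values as-is, so it assumes the URL and language code contain no double quotes or backslashes.

```shell
# Print a JSON request body with the url, language, and punctuate fields.
# Arguments: audio URL, language code (default en), punctuate flag (default true).
build_listen_body() {
  local url="$1"
  local language="${2:-en}"
  local punctuate="${3:-true}"
  printf '{"url": "%s", "language": "%s", "punctuate": %s}\n' \
    "$url" "$language" "$punctuate"
}
```

The output can be passed to the transcription endpoint used elsewhere in this guide, for example `curl -X POST http://localhost:7100/v1/listen -H "Authorization: Token ${LICENSE_KEY}" -H "Content-Type: application/json" -d "$(build_listen_body "$AUDIO_URL" es)"`, where `AUDIO_URL` is a placeholder variable.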

Missing Timestamps

Symptoms:

No word-level timing data
Unable to sync with video

Quick Fix:

Enable timestamps in the request:

{
  "url": "...",
  "timestamps": true
}

Response will include:

{
  "words": [
    {"word": "Hello", "start": 0.0, "end": 0.5}
  ]
}

Network Issues

Cannot Reach License Server

Symptoms:

Grace period activated
Connection to license server failed
Services still work but log warnings

Quick Fix:

Test connectivity:

curl -v https://console-api.smallest.ai

Check firewall rules:
- Allow outbound HTTPS (port 443)
- Whitelist console-api.smallest.ai

Review network policies (Kubernetes):
```
kubectl get networkpolicy -n smallest
```

Monitor grace period:

kubectl logs -l app=license-proxy -n smallest | grep -i "grace"

Slow Downloads

Symptoms:

Model download taking >30 minutes
Audio file upload slow

Quick Fix:

Use faster network:
- AWS S3 in same region
- CloudFront CDN

Enable parallel downloads:

lightningAsr:
  env:
    - name: DOWNLOAD_WORKERS
      value: "4"

Use shared storage (Kubernetes):

models:
  volumes:
    aws:
      efs:
        enabled: true

Quick Diagnostics

One-Command Health Check

curl http://localhost:7100/health && \
  kubectl get pods -n smallest && \
  kubectl top nodes && \
  kubectl top pods -n smallest
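Because the commands above are joined with `&&`, the chain stops at the first failure and the remaining checks never run. As an alternative sketch, the hypothetical `run_check` helper below runs one labelled command, hides its output, and prints OK or FAIL, so every check gets reported.

```shell
# Run a labelled check, print OK or FAIL, and return the result
# without aborting the checks that follow.
run_check() {
  local label="$1"
  shift
  if "$@" >/dev/null 2>&1; then
    echo "OK   $label"
  else
    echo "FAIL $label"
    return 1
  fi
}

# Example (each line runs regardless of earlier failures):
#   run_check "API health" curl -fsS http://localhost:7100/health
#   run_check "pods"       kubectl get pods -n smallest
#   run_check "node usage" kubectl top nodes
#   run_check "pod usage"  kubectl top pods -n smallest
```

The trade-off is that command output is suppressed; rerun a failing command by hand to see its details.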

Collect All Logs

kubectl logs -l app=lightning-asr -n smallest --tail=100 > asr-logs.txt
kubectl logs -l app=api-server -n smallest --tail=100 > api-logs.txt
kubectl logs -l app=license-proxy -n smallest --tail=100 > license-logs.txt

Test Transcription

curl -X POST http://localhost:7100/v1/listen \
  -H "Authorization: Token ${LICENSE_KEY}" \
  -H "Content-Type: application/json" \
  -d '{"url": "https://www2.cs.uic.edu/~i101/SoundFiles/StarWars60.wav"}'

Getting Help

If issues persist:

Debugging Guide

Advanced debugging techniques

Logs Analysis

Interpret logs and error messages

Contact Support

Email: support@smallest.ai

Include:

Error logs
System information
Steps to reproduce
