Overview
Understanding log messages is crucial for diagnosing issues. This guide helps you interpret logs from each component and identify common error patterns.Log Levels
All components use standard log levels:| Level | Description | Example |
|---|---|---|
DEBUG | Detailed diagnostic info | Variable values, function calls |
INFO | Normal operation events | Request received, model loaded |
WARNING | Potential issues | Slow response, retry attempt |
ERROR | Error that needs attention | Failed request, connection error |
CRITICAL | Severe error | Service crash, unrecoverable error |
Lightning ASR Logs
Successful Startup
Request Processing
Common Errors
GPU Not Found
GPU Not Found
- Check
nvidia-smiworks - Verify GPU device plugin (Kubernetes)
- Check NVIDIA Container Toolkit (Docker)
Out of GPU Memory
Out of GPU Memory
- Reduce concurrent requests
- Use larger GPU (A10 vs T4)
- Scale horizontally (more pods)
Model Download Failed
Model Download Failed
- Verify MODEL_URL
- Check disk space:
df -h - Test URL:
curl -I $MODEL_URL - Use shared storage (EFS)
Audio Processing Error
Audio Processing Error
- Verify audio format (WAV, MP3, FLAC supported)
- Check file is not corrupted
- Ensure proper sample rate (16kHz+)
API Server Logs
Successful Startup
Request Handling
Common Errors
Authentication Failed
Authentication Failed
- Verify
Authorization: Token <key>header - Check license key is correct
- Renew expired license
No ASR Workers
No ASR Workers
- Check Lightning ASR pods:
kubectl get pods - Scale up replicas
- Check HPA configuration
Request Timeout
Request Timeout
- Check Lightning ASR logs
- Increase timeout
- Scale up pods
License Proxy Logs
Successful Validation
Usage Reporting
Common Errors
License Validation Failed
License Validation Failed
- Verify LICENSE_KEY is correct
- Check license hasn’t expired
- Contact [email protected]
Connection Failed
Connection Failed
- Test:
curl https://console-api.smallest.ai - Check firewall allows HTTPS
- Restore connectivity before grace period expires
Grace Period Expiring
Grace Period Expiring
- Restore network connectivity immediately
- Check firewall rules
- Contact support if persistent
Redis Logs
Normal Operation
Common Errors
Memory Limit Reached
Memory Limit Reached
- Increase memory limit
- Enable eviction policy
- Clear old keys
Persistence Issues
Persistence Issues
- Increase disk space
- Disable persistence if not needed
- Clean up old snapshots
Log Pattern Analysis
Error Rate Analysis
Count errors in last 1000 lines:Performance Analysis
Extract response times:Request Tracking
Follow a specific request ID:Log Aggregation
Using stern
Install stern:Using Loki (if installed)
Query logs via LogQL:Structured Logging
Parse JSON Logs
If logs are in JSON format:Filter by Field
Log Retention
Configure Log Rotation
Docker:docker-compose.yml
Export Logs
Save logs for analysis:Debugging Log Issues
No Logs Appearing
Check pod is running:Logs Truncated
Increase log size limits:Best Practices
Use Structured Logging
Use Structured Logging
Prefer JSON format for easier parsing:
Include Context
Include Context
Always include relevant context in logs:
- Request ID
- Component name
- Timestamp
- User/session info (if applicable)
Set Appropriate Levels
Set Appropriate Levels
Use correct log levels:
- DEBUG: Development only
- INFO: Normal operation
- WARNING: Potential issues
- ERROR: Actual problems
- CRITICAL: Service-breaking issues
Aggregate Logs
Aggregate Logs
Use centralized logging:
- ELK Stack (Elasticsearch, Logstash, Kibana)
- Loki + Grafana
- CloudWatch Logs (AWS)
- Cloud Logging (GCP)

