🚀 Vaja Early Adopter API

Welcome to the Vaja Early Adopter Program!

Access GPU-accelerated Text-to-Speech services through our production-ready API. All services run on dedicated GPU infrastructure for optimal performance.

🔑 Authentication

All API requests require authentication using your Vaja API key in the Authorization header:

Authorization: Bearer YOUR_VAJA_API_KEY
Getting Your API Key:
Contact your Vaja representative to receive your unique API key for early adopter access.

🎙️ Text-to-Speech Services

We provide 4 different TTS engines optimized for different use cases:

1. Kokoro-82M TTS NEW

Lightweight & Fast - 82 Million Parameters

Available Voices (11 total)

GET https://early.vaja.ai/api/tts/kokoro/voices

Voice Options:

Synthesize Speech

POST https://early.vaja.ai/api/tts/kokoro

Example Request:

curl -X POST https://early.vaja.ai/api/tts/kokoro \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "text": "Hello! Welcome to Vaja AI services.",
    "voice": "af_bella"
  }' \
  --output output.wav

Request Parameters:

3. Coqui XTTS v2

Multi-lingual Text-to-Speech

Synthesize Speech

POST https://early.vaja.ai/api/tts/coqui

Example Request:

curl -X POST https://early.vaja.ai/api/tts/coqui \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "text": "Hello from Coqui TTS!",
    "language": "en"
  }' \
  --output output.wav

Request Parameters:

Supported Languages:

4. LiveTranslate TTS

Real-time Translation with TTS

Endpoint Structure

POST https://early.vaja.ai/api/tts/livetranslate/*
Contact your Vaja representative for specific LiveTranslate endpoint documentation and usage examples.

📊 Service Comparison

Service Speed Quality Languages Best For
Kokoro-82M ⚡⚡⚡ Very Fast ⭐⭐⭐⭐ Excellent English Fast English TTS with multiple voices
StyleTTS2 ⚡⚡⚡ 6x Faster ⭐⭐⭐⭐⭐ Best English Production apps needing SSML
Coqui XTTS ⚡⚡ Moderate ⭐⭐⭐⭐ Great 15+ Languages Multi-language support
LiveTranslate ⚡⚡ Moderate ⭐⭐⭐ Good Multi-lingual Translation + TTS combined

🔧 Technical Specifications

Audio Output Format

API Specifications

Response Times (Approximate)

Service Short Text (1-2 sentences) Medium Text (paragraph)
Kokoro-82M ~0.5-1s ~1-2s
StyleTTS2 ~0.5-1.5s ~1.5-3s
Coqui XTTS ~1-2s ~2-4s

📝 Quick Start Example

Save this script as test_vaja_tts.sh to test all services:

#!/bin/bash

# Replace with your actual API key
API_KEY="YOUR_VAJA_API_KEY"

# Test Kokoro-82M (Female Voice)
echo "Testing Kokoro-82M with female voice..."
curl -X POST https://early.vaja.ai/api/tts/kokoro \
  -H "Authorization: Bearer $API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"text":"Testing Kokoro female voice","voice":"af_bella"}' \
  -o kokoro_female.wav

# Test Kokoro-82M (Male Voice)
echo "Testing Kokoro-82M with male voice..."
curl -X POST https://early.vaja.ai/api/tts/kokoro \
  -H "Authorization: Bearer $API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"text":"Testing Kokoro male voice","voice":"am_adam"}' \
  -o kokoro_male.wav

# Test StyleTTS2
echo "Testing StyleTTS2..."
curl -X POST https://early.vaja.ai/api/tts/styletts2 \
  -H "Authorization: Bearer $API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"text":"Testing StyleTTS2 synthesis","voice":"af_sky"}' \
  -o styletts2.wav

# Test Coqui (English)
echo "Testing Coqui XTTS in English..."
curl -X POST https://early.vaja.ai/api/tts/coqui \
  -H "Authorization: Bearer $API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"text":"Testing Coqui English","language":"en"}' \
  -o coqui_english.wav

# Test Coqui (Spanish)
echo "Testing Coqui XTTS in Spanish..."
curl -X POST https://early.vaja.ai/api/tts/coqui \
  -H "Authorization: Bearer $API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"text":"Hola, esto es una prueba","language":"es"}' \
  -o coqui_spanish.wav

echo "✓ Done! Check the generated WAV files"

⚠️ Best Practices

Security

Performance

Error Handling

📚 Common Use Cases

1. Voice Assistant / Chatbot

Recommended: StyleTTS2 or Kokoro-82M

Fast response times and high quality make these ideal for interactive applications.

2. Multi-lingual Content

Recommended: Coqui XTTS v2

Support for 15+ languages with consistent voice quality.

3. Audiobook / Long-form Content

Recommended: StyleTTS2 with SSML

SSML support allows for natural pauses, emphasis, and prosody control.

4. Real-time Translation + Speech

Recommended: LiveTranslate TTS

Combined translation and speech synthesis in one step.

🆘 Support & Resources

Need Help?

For API keys, technical support, or questions about the Early Adopter program, contact your Vaja representative.

Useful Resources