Veqa · A Machplace Product

Voice AI agents that hold a real conversation.

Replace IVR menus with multilingual Voice AI agents that understand context, handle interruptions naturally, and never forget where the conversation left off — deployed on dedicated NVIDIA GPUs at your facility under a managed subscription.

Request Early Access See How It Works

What Veqa does differently

Four capabilities that make Veqa feel like a real conversation — not an IVR menu reading a script.

Natively multilingual

English, Spanish, French, and Cantonese without language-switching menus. Language is detected from the first utterance and routed automatically.

Interruptible with grace

Customers can cut in mid-sentence. The agent stops talking within 30–60 ms, listens, responds, and resumes naturally.

Full-call task memory

Multi-step procedures are tracked as structured task plans. On resume, the agent picks up at the exact step it left off.

Emotion-aware speech

TTS intonation adapts to message sentiment: confirmations of delays sound apologetic; good news sounds genuinely warm.

How a Veqa call works

Three layers running on NVIDIA hardware, with a streaming ASR → LLM → TTS pipeline.

Step 1

Listen with precision

Streaming ASR (NVIDIA Parakeet for English; faster-whisper for Spanish/French; SenseVoice for Cantonese) transcribes the caller in real-time, with low-latency barge-in detection via Silero VAD.

Step 2

Understand in context

Llama-3.3-Nemotron-Super-49B (NVIDIA-tuned) tracks the full conversation as a structured task plan. Multi-step procedures are state-machine tracked so resumption works exactly where the agent left off.

Step 3

Respond with emotion

F5-TTS, Kokoro, and CosyVoice 2 produce TTS with intonation tuned to message sentiment. Apologies sound sincere; confirmations sound warm; instructions sound clear and patient.

Plans

Veqa is sold as a deployed product, not as professional services. Pick the shape that fits your call volume and compliance posture.

Starter

For pilots and proofs of concept.

Single language, single GPU

Up to 25 concurrent calls
1 language at a time
Single RTX PRO 6000 Blackwell node
Email support, next-business-day response

Request pilot

Professional

For production call flows at small to mid volume.

Two-language production

Up to 50 concurrent calls
2 languages active simultaneously
Single RTX PRO 6000 Blackwell node
Email + chat support, 4-hour business-hours response
Standard call-flow analytics dashboard

Request access

Scale

For production call flows at high volume.

Multi-language, multi-GPU

Up to 250 concurrent calls
All four supported languages active
Multi-GPU cluster with automatic failover
Custom voice cloning per brand
Business-hours phone support

Talk to sales

Enterprise

For regulated and high-volume operations.

Air-gapped + custom integrations

Unlimited concurrent calls
Air-gapped on-prem deployment supported
Custom knowledge-base & CRM integrations
Custom-trained domain-specific LLM fine-tuning
24/7 on-call coverage + SLA

View Full Pricing & Comparison →

Early Access

Bring Veqa to your call flow.

We're onboarding a small cohort of early-access partners in healthcare, financial services, and regulated B2B. Tell us about your call volumes and compliance requirements and we'll be in touch within one business day.

Request Early Access

[email protected] · St. Petersburg, Florida