Veqa · A Machplace Product

Voice AI agents that hold a real conversation.

Replace IVR menus with multilingual Voice AI agents that understand context, handle interruptions naturally, and never forget where the conversation left off — deployed on dedicated NVIDIA GPUs at your facility under a managed subscription.

What Veqa does differently

Four capabilities that make Veqa feel like a real conversation — not an IVR menu reading a script.

Natively multilingual

English, Spanish, French, and Cantonese without language-switching menus. Language is detected from the first utterance and routed automatically.

Interruptible with grace

Customers can cut in mid-sentence. The agent stops talking within 30–60 ms, listens, responds, and resumes naturally.

Full-call task memory

Multi-step procedures are tracked as structured task plans. On resume, the agent picks up at the exact step it left off.

Emotion-aware speech

TTS intonation adapts to message sentiment: confirmations of delays sound apologetic; good news sounds genuinely warm.

How a Veqa call works

Three layers running on NVIDIA hardware, with a streaming ASR → LLM → TTS pipeline.

Step 1

Listen with precision

Streaming ASR (NVIDIA Parakeet for English; faster-whisper for Spanish/French; SenseVoice for Cantonese) transcribes the caller in real-time, with low-latency barge-in detection via Silero VAD.

Step 2

Understand in context

Llama-3.3-Nemotron-Super-49B (NVIDIA-tuned) tracks the full conversation as a structured task plan. Multi-step procedures are state-machine tracked so resumption works exactly where the agent left off.

Step 3

Respond with emotion

F5-TTS, Kokoro, and CosyVoice 2 produce TTS with intonation tuned to message sentiment. Apologies sound sincere; confirmations sound warm; instructions sound clear and patient.

Plans

Veqa is sold as a deployed product, not as professional services. Pick the shape that fits your call volume and compliance posture.

Starter

For pilots and proofs of concept.

Single language, single GPU

  • Up to 25 concurrent calls
  • 1 language at a time
  • Single RTX PRO 6000 Blackwell node
  • Email support, next-business-day response
Request pilot
Most Popular

Professional

For production call flows at small to mid volume.

Two-language production

  • Up to 50 concurrent calls
  • 2 languages active simultaneously
  • Single RTX PRO 6000 Blackwell node
  • Email + chat support, 4-hour business-hours response
  • Standard call-flow analytics dashboard
Request access

Scale

For production call flows at high volume.

Multi-language, multi-GPU

  • Up to 250 concurrent calls
  • All four supported languages active
  • Multi-GPU cluster with automatic failover
  • Custom voice cloning per brand
  • Business-hours phone support
Talk to sales

Enterprise

For regulated and high-volume operations.

Air-gapped + custom integrations

  • Unlimited concurrent calls
  • Air-gapped on-prem deployment supported
  • Custom knowledge-base & CRM integrations
  • Custom-trained domain-specific LLM fine-tuning
  • 24/7 on-call coverage + SLA
Contact us

View Full Pricing & Comparison →

Early Access

Bring Veqa to your call flow.

We're onboarding a small cohort of early-access partners in healthcare, financial services, and regulated B2B. Tell us about your call volumes and compliance requirements and we'll be in touch within one business day.

[email protected] · St. Petersburg, Florida