Enterprise Transcription

The model that transcribes your audio
is not the right model.

Every language has a best model. Most tools pick one and stick with it.
Werrex routes each file to the lowest-WER model for that language — automatically, at scale, with full audit trail.

5.33% English WER
(Granite 4.1)
9 Languages
routed
SOC2 Built in
from day one
EN HI ZH OUTPUT [file.wav] → transcript.txt WER: 5.33%
The Problem

Every tool picks one model.

Whisper for everything. Fast, broad, but not the best at any language. Every file gets the same treatment regardless of whether a lower-WER model exists for that language.

Enterprise buyers demand proof.

SOC2 Type II. ISO 27001. Data residency. Retention policies. Most transcription tools are retrofitted for compliance — built on top of existing architectures, not designed for it.

Werrex does neither.

It routes each file to the model with the lowest measured WER for that language. English to Granite. Hindi to Whisper Turbo. Sanskrit to Whisper — because that's what works. Architecture designed for compliance from scratch.

Routing logic — simplified
const routeAudio = async (audio, lang) => {
  const model = MODEL_MAP[lang] // pre-benchmarked per language
  const chunks = await chunkAudio(audio)      // VAD-based segmentation
  const results = await transcribeAll(chunks, model)
  return assemble(results, { timestamps: true, speaker_id: true })
}
Supported Languages

Nine languages. The right model for each.

No single model leads across all languages. Werrex benchmarks each candidate and routes to the one with the lowest measured WER — per language, per file.

South Asian
English Granite Speech 4.1 2B WER 5.33%
Hindi Whisper Large V3 Turbo Benchmarked
Tamil Whisper Large V3 Turbo Benchmarked
Sanskrit Whisper Large V3 Turbo Benchmarked
Mandarin Whisper Large V3 Turbo Benchmarked
European
Spanish Whisper Turbo / Granite Benchmarked
French Whisper Turbo / Granite Benchmarked
German Whisper Turbo / Granite Benchmarked
Italian Whisper Turbo / Granite Benchmarked

Model selection is dynamic — benchmarks are re-run quarterly as new model versions release.

Architecture

How a file becomes a transcript.

01

Ingest

Client uploads audio file. Server validates format, runs format detection, and assigns a job ID with full audit trail from the first byte.

02

Chunk

VAD-based segmentation breaks large files into speech-only chunks, preserving speaker boundaries and word-level timestamps. No dead air, no model waste.

03

Route

Language detection runs on a sample. Model selector picks the pre-benchmarked best model for that language. Chunk dispatched to GPU queue.

04

Assemble

Transcribed chunks merge with preserved timestamps and speaker IDs. Compliance metadata attached. Result returned via secure download link.

GPU Batch Inference

Containerized model serving on GPU nodes. Auto-scaling queue handles files from minutes to hours. No cold starts on hot path.

Multi-Tenant Ready

Single-tenant cloud or fully on-prem deployment. Data never leaves the client's infrastructure in on-prem mode. SOC2 scoped to API layer.

Model Benchmarking

Automated WER evaluation on target language datasets runs monthly. Model map updates automatically when a new version beats the baseline.

Secure Delivery

Signed, time-limited download URLs. Transcript payload encrypted at rest with per-tenant keys. Audit log covers every read/write operation.

Compliance is architecture, not a checkbox.

While most transcription tools bolt on compliance after the fact, Werrex was designed with it from the first line of code. Every control maps to a defined framework.

SOC 2 Type II
  • Annual audit by independent assessor
  • 365-day retention policy per job
  • Role-based access with MFA
  • Incident response automation
  • Encryption in transit (TLS 1.3) and at rest (AES-256)
ISO 27001
  • Information Security Management System (ISMS) built in
  • Risk assessment for every new feature
  • Vendor security review before third-party integrations
  • Secure deletion with cryptographic overwrite
  • Continuous surveillance audit schedule
Data Residency
  • On-prem deployment option for strict sovereignty
  • Region-locked processing (US, EU, APAC)
  • No cross-border data transfer in restricted deployments
  • Per-tenant encryption keys (BYOK)
  • Customer-managed key rotation
SOC2 Type II certification target: Q4 2026. ISO 27001 certification target: Q2 2027. See roadmap for details.

Every transcription tool makes a choice for you.
Werrex makes the right one.

Built by engineers who understand that accuracy isn't a feature — it's the product. For enterprises that can't afford a wrong word in a transcript.

English WER leaderboard 5.33% — Granite 4.1 2B
Multilingual breadth 99+ languages via Whisper
Architecture Compliance-first, GPU batch
Deployment Cloud or fully on-prem