Enterprise Transcription

The model that transcribes your audio
is not the right model.

Every language has a best model. Most tools pick one and stick with it.
Werrex routes each file to the lowest-WER model for that language: automatically, at scale, and with a full audit trail.

5.33% English WER
(Granite 4.1)

9 Languages
routed

SOC 2 Built in
from day one

The Problem

Every tool picks one model.

Whisper for everything. Fast, broad, but not the best at every language. Every file gets the same treatment regardless of whether a lower-WER model exists for that language.

Enterprise buyers demand proof.

SOC 2 Type II. ISO 27001. Data residency. Retention policies. Most transcription tools are retrofitted for compliance: controls are layered on top of an existing architecture rather than designed into it.

Werrex does neither: no single model, no retrofitted compliance.

It routes each file to the model with the lowest measured WER for that language. English to Granite. Hindi to Whisper Turbo. Sanskrit to Whisper — because that's what works. Architecture designed for compliance from scratch.

Routing logic — simplified

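// MODEL_MAP, chunkAudio, transcribeAll, and assemble are defined elsewhere.
const routeAudio = async (audio, lang) => {
  const model = MODEL_MAP[lang] // pre-benchmarked per language
  if (!model) throw new Error(`No benchmarked model for language: ${lang}`)
  const chunks = await chunkAudio(audio) // VAD-based segmentation
  const results = await transcribeAll(chunks, model) // same model for every chunk
  return assemble(results, { timestamps: true, speaker_id: true })
}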

Supported Languages

Nine languages. The right model for each.

No single model leads across all languages. Werrex benchmarks each candidate and routes to the one with the lowest measured WER — per language, per file.

English and Asian

English Granite Speech 4.1 2B WER 5.33%

Hindi Whisper Large V3 Turbo Benchmarked

Tamil Whisper Large V3 Turbo Benchmarked

Sanskrit Whisper Large V3 Turbo Benchmarked

Mandarin Whisper Large V3 Turbo Benchmarked

European

Spanish Whisper Turbo / Granite Benchmarked

French Whisper Turbo / Granite Benchmarked

German Whisper Turbo / Granite Benchmarked

Italian Whisper Turbo / Granite Benchmarked

Model selection is dynamic — benchmarks are re-run quarterly as new model versions release.
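
As a rough illustration, the table above could be held as a lookup keyed by language code. The ISO 639-1 keys, the model ID strings, and the `candidatesFor` helper are assumptions made for this sketch, not Werrex's actual configuration. The European rows keep both listed candidates, in no particular order, because the page does not say which one wins.

```javascript
// Hypothetical candidate table mirroring the language list above.
// Model IDs are illustrative labels, not real endpoint names.
const WHISPER_TURBO = 'whisper-large-v3-turbo'
const GRANITE = 'granite-speech-4.1-2b'

const CANDIDATES = new Map([
  ['en', [GRANITE]],
  ['hi', [WHISPER_TURBO]],
  ['ta', [WHISPER_TURBO]],
  ['sa', [WHISPER_TURBO]],
  ['zh', [WHISPER_TURBO]],
  // Two candidates are listed for these languages; order here is arbitrary.
  ['es', [WHISPER_TURBO, GRANITE]],
  ['fr', [WHISPER_TURBO, GRANITE]],
  ['de', [WHISPER_TURBO, GRANITE]],
  ['it', [WHISPER_TURBO, GRANITE]],
])

// Return the candidate models for a language, or fail loudly if it is not routed.
const candidatesFor = (lang) => {
  const models = CANDIDATES.get(lang)
  if (!models) throw new Error(`Unsupported language: ${lang}`)
  return models
}
```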

Architecture

How a file becomes a transcript.

Ingest

Client uploads an audio file. The server detects and validates the format, then assigns a job ID. The audit trail starts at the first byte.
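
A minimal sketch of the ingest step, assuming format detection is done on the file's leading bytes and that the audit trail is a list of event records. The function names, the event name `ingest.accepted`, and the record shape are all hypothetical; only a handful of container signatures are checked.

```javascript
const { randomUUID } = require('node:crypto')

// Identify a container from its leading bytes. Only a few formats are covered.
const detectFormat = (buf) => {
  const ascii = (start, end) => buf.subarray(start, end).toString('latin1')
  if (ascii(0, 4) === 'RIFF' && ascii(8, 12) === 'WAVE') return 'wav'
  if (ascii(0, 4) === 'fLaC') return 'flac'
  if (ascii(0, 4) === 'OggS') return 'ogg'
  if (ascii(0, 3) === 'ID3') return 'mp3' // MP3 carrying an ID3v2 tag
  return null
}

// Validate the upload, assign a job ID, and open the audit trail with its first event.
const ingest = (buf, now = new Date()) => {
  const format = detectFormat(buf)
  if (!format) throw new Error('Unsupported or unrecognized audio format')
  const jobId = randomUUID()
  const audit = [
    { jobId, event: 'ingest.accepted', format, bytes: buf.length, at: now.toISOString() },
  ]
  return { jobId, format, audit }
}
```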

Chunk

VAD-based segmentation breaks large files into speech-only chunks, preserving speaker boundaries and word-level timestamps. No dead air, no model waste.
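
The page does not say which VAD Werrex uses. The sketch below swaps in the simplest possible technique, an energy threshold over fixed-size frames, to show how speech-only spans with timestamps fall out of the signal. The function name, frame size, and threshold are assumptions, and speaker boundaries are not handled here.

```javascript
// Energy-based voice activity detection: a frame counts as speech
// when its RMS amplitude exceeds the threshold.
const detectSpeech = (samples, sampleRate, { frameMs = 20, threshold = 0.02 } = {}) => {
  const frameLen = Math.floor((sampleRate * frameMs) / 1000)
  const usable = samples.length - (samples.length % frameLen) // ignore a trailing partial frame
  const segments = []
  let start = null
  for (let i = 0; i < usable; i += frameLen) {
    let sum = 0
    for (let j = i; j < i + frameLen; j++) sum += samples[j] * samples[j]
    const isSpeech = Math.sqrt(sum / frameLen) > threshold
    if (isSpeech && start === null) start = i
    if (!isSpeech && start !== null) {
      segments.push({ start: start / sampleRate, end: i / sampleRate })
      start = null
    }
  }
  // Close a segment that runs to the end of the audio.
  if (start !== null) segments.push({ start: start / sampleRate, end: usable / sampleRate })
  return segments // times in seconds
}
```

Each returned span can then be cut out as its own chunk, with `start` kept as the chunk's offset into the original file.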

Route

Language detection runs on a sample. The model selector picks the pre-benchmarked best model for that language. Chunks are then dispatched to the GPU queue.

Assemble

Transcribed chunks merge with preserved timestamps and speaker IDs. Compliance metadata attached. Result returned via secure download link.
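
One way the merge could work, assuming each chunk result records its offset into the source file and word timings relative to the chunk. The `assembleTranscript` name and the result shape are hypothetical.

```javascript
// Merge per-chunk results into one transcript with absolute timestamps.
// Chunks may arrive out of order from the GPU queue, so sort by offset first.
const assembleTranscript = (chunkResults) => {
  const words = [...chunkResults]
    .sort((a, b) => a.offset - b.offset)
    .flatMap((chunk) =>
      chunk.words.map((w) => ({
        text: w.text,
        speaker: w.speaker,
        start: chunk.offset + w.start, // shift chunk-relative time to file time
        end: chunk.offset + w.end,
      }))
    )
  return { text: words.map((w) => w.text).join(' '), words }
}
```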

GPU Batch Inference

Containerized model serving on GPU nodes. Auto-scaling queue handles files from minutes to hours. No cold starts on the hot path.

Deployment Options

Single-tenant cloud or fully on-prem deployment. Data never leaves the client's infrastructure in on-prem mode. SOC 2 scoped to API layer.

Model Benchmarking

Automated WER evaluation on target language datasets runs monthly. Model map updates automatically when a new version beats the baseline.
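
For reference, WER is the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The sketch below computes it and applies the "beats the baseline" rule. The `updateModelMap` helper and the `{ model, wer }` record shape are assumptions, and text normalization (casing, punctuation) is left out.

```javascript
// Word error rate: word-level edit distance / number of reference words.
const wer = (reference, hypothesis) => {
  const ref = reference.trim().split(/\s+/).filter(Boolean)
  const hyp = hypothesis.trim().split(/\s+/).filter(Boolean)
  if (ref.length === 0) throw new Error('Reference transcript is empty')
  // prev[j] = distance between the reference words seen so far and the first j hypothesis words.
  let prev = Array.from({ length: hyp.length + 1 }, (_, j) => j)
  for (let i = 1; i <= ref.length; i++) {
    const curr = [i]
    for (let j = 1; j <= hyp.length; j++) {
      const cost = ref[i - 1] === hyp[j - 1] ? 0 : 1
      curr[j] = Math.min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost)
    }
    prev = curr
  }
  return prev[hyp.length] / ref.length
}

// Promote a candidate only when it strictly beats the current baseline for that language.
const updateModelMap = (modelMap, lang, candidate) => {
  const current = modelMap[lang]
  if (current && candidate.wer >= current.wer) return modelMap
  return { ...modelMap, [lang]: candidate }
}
```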

Secure Delivery

Signed, time-limited download URLs. Transcript payload encrypted at rest with per-tenant keys. Audit log covers every read/write operation.
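
Signed, time-limited URLs are commonly built with an HMAC over the path and an expiry time. The sketch below shows that general pattern using Node's built-in `crypto` module. The query parameter names, the signing format, and the function names are assumptions, not Werrex's actual scheme, and the path is assumed to be URL-safe already.

```javascript
const { createHmac, timingSafeEqual } = require('node:crypto')

// HMAC-SHA256 over the path and expiry, so neither can be changed without the secret.
const sign = (secret, path, expires) =>
  createHmac('sha256', secret).update(`${path}\n${expires}`).digest('hex')

// Build a download URL that stops validating after ttlSeconds.
const signUrl = (secret, path, ttlSeconds, now = Date.now()) => {
  const expires = Math.floor(now / 1000) + ttlSeconds
  return `${path}?expires=${expires}&sig=${sign(secret, path, expires)}`
}

// Reject expired links first, then compare signatures in constant time.
const verifyUrl = (secret, url, now = Date.now()) => {
  const { pathname, searchParams } = new URL(url, 'https://example.invalid')
  const expires = Number(searchParams.get('expires'))
  const sig = searchParams.get('sig') ?? ''
  if (!Number.isInteger(expires) || expires < Math.floor(now / 1000)) return false
  const expected = Buffer.from(sign(secret, pathname, expires), 'hex')
  const given = Buffer.from(sig, 'hex')
  return given.length === expected.length && timingSafeEqual(given, expected)
}
```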

Enterprise Ready

Compliance is architecture, not a checkbox.

While most transcription tools bolt on compliance after the fact, Werrex was designed with it from the first line of code. Every control maps to a defined framework.

SOC 2 Type II

Annual audit by independent assessor
365-day retention policy per job
Role-based access with MFA
Incident response automation
Encryption in transit (TLS 1.3) and at rest (AES-256)

ISO 27001

Information Security Management System (ISMS) built in
Risk assessment for every new feature
Vendor security review before third-party integrations
Secure deletion with cryptographic overwrite
Continuous surveillance audit schedule

Data Residency

On-prem deployment option for strict sovereignty
Region-locked processing (US, EU, APAC)
No cross-border data transfer in restricted deployments
Per-tenant encryption keys (BYOK)
Customer-managed key rotation
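
To make per-tenant encryption concrete, here is a minimal sketch using Node's built-in `crypto` module. The page specifies AES-256 but not the cipher mode; GCM is assumed here because it also authenticates the data. The function names and the shape of the returned record are hypothetical, and key storage, BYOK import, and rotation are out of scope.

```javascript
const { createCipheriv, createDecipheriv, randomBytes } = require('node:crypto')

// Encrypt a transcript with one tenant's 32-byte key using AES-256-GCM.
const encryptForTenant = (tenantKey, plaintext) => {
  const iv = randomBytes(12) // fresh 96-bit nonce for every message
  const cipher = createCipheriv('aes-256-gcm', tenantKey, iv)
  const ciphertext = Buffer.concat([cipher.update(plaintext, 'utf8'), cipher.final()])
  return { iv, ciphertext, tag: cipher.getAuthTag() }
}

// Decrypt and authenticate. Throws if the key is wrong or the data was altered.
const decryptForTenant = (tenantKey, { iv, ciphertext, tag }) => {
  const decipher = createDecipheriv('aes-256-gcm', tenantKey, iv)
  decipher.setAuthTag(tag)
  return Buffer.concat([decipher.update(ciphertext), decipher.final()]).toString('utf8')
}
```

Because each tenant has its own key, a record encrypted for one tenant fails authentication when decrypted with another tenant's key.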

SOC 2 Type II certification target: Q4 2026. ISO 27001 certification target: Q2 2027. See roadmap for details.

Every transcription tool makes a choice for you.
Werrex makes the right one.

Built by engineers who understand that accuracy isn't a feature — it's the product. For enterprises that can't afford a wrong word in a transcript.

English WER leaderboard 5.33% — Granite 4.1 2B

Multilingual breadth 99+ languages via Whisper

Architecture Compliance-first, GPU batch

Deployment Cloud or fully on-prem

The model that transcribes your audio is not the right model.