Project Khutba:
Real-Time Prayer Translation App

Plavno developed Project Khutba — an innovative mobile and web platform enabling mosques to broadcast prayers in real time with instant translation into multiple languages.The system allows mosque administrators to schedule prayer sessions, automatically launch live streams, and provide both live and on-demand access to translated prayers, making religious content accessible to global audiences

Let’s Build Your Solution

Plug-and-play Solutions

Universal live‑speech translation engine with sub‑second perceived latency

  • Sub‑second feel • RTF < 1 across languages

  • Modular ASR → NMT → TTS (or speech‑to‑speech)

  • Scales from 1 speaker to thousands of concurrent listeners

  • Multi‑region WebRTC/WebSocket streaming built for hostile networks

  • Designed for quick implementation with REST/gRPC/WebRTC SDKs

<span>Universal</span> live‑speech translation engine with sub‑second perceived latency
01

Problem

Organizations need real‑time multilingual access for sermons, lectures, town halls, and events — without spinning up a bespoke speech stack. Khutba’s goal: deliver human‑friendly latency, domain‑faithful translation, and industrial reliability in a form factor that product teams can integrate quickly

Problem
02

Challenge

Plavno identified 4 challenges that consistently break live multilingual experiences:

  • Sub‑second feel • RTF < 1 across languages

  • Modular ASR → NMT → TTS (or speech‑to‑speech)

  • Scales from 1 speaker to thousands of concurrent listeners

  • Multi‑region WebRTC/WebSocket streaming built for hostile networks

  • Designed for quick implementation with REST/gRPC/WebRTC SDKs

Challenge

Solution

Plavno’s solution is a modular, low‑latency pipeline you can drop in fast — engineered to preserve a live feel, keep domain terms correct, and scale across regions. Before the deep dive, here’s what it delivers:

Ingest & Scale

    • Speaker → Cloud ingest: The speaker streams audio over WebRTC to an EC2 cluster.

    • Outer Scaler: Returns the optimal machine URL and pre‑warms the pipeline.

    • Inner Scaler: Fans listeners across the RTC worker pool (ports 5001…5000+n) and spins up STT.

Streaming Stack

    • Front: VAD/diarization to segment speech.

    • ASR: Conformer‑Transducer with CTC alignment & auto‑punctuation.

    • MT: Prefix‑to‑prefix Simultaneous Translation (low look‑ahead).

    • Aggregation: The Vocalizer channel aggregator renders each language via Azure Neural TTS and sends live transcripts over WebSockets.

    • Output: Auto‑generate audio + VTT/SRT subtitles to S3 when the session ends.

Developer Experience

    • Pluggable modules: LEGO‑like pipeline components you can swap in/out.

    • Modes: Choose ASR → NMT → TTS or end‑to‑end speech‑to‑speech when prosody matters.

    • Integrations: Ship via REST, gRPC, or WebRTC SDKs.

Features

Hard Problems We Solved

Speaking‑rate shifts & code‑switching

Speaking‑rate shifts & code‑switching

Online re‑beaming + reliability‑based segmentation

Rare terminology

Rare terminology

RAG lexicons with a reviewer feedback loop

Cold starts

Cold starts

Model pre‑warm and audio pre‑buffer to avoid first‑word lag

Hostile networks/NAT

Hostile networks/NAT

STUN/TURN, adaptive bitrate, jitter buffers, and health checks

Value

Quality & Fidelity 

Advanced AI models and specialized techniques ensure accurate, contextually-aware translations across languages and domains

Hybrid Multilingual Cores

Hybrid Multilingual Cores

NLLB / M2M models with mixture of experts (MoE) heads for specialized language pairs

LoRA
Language Families
Dialect Support
MVP
LoRA Adapters

LoRA Adapters

Fine-tuned low-rank adaptation for language families and regional dialects

NLLB
M2M
MoE Architecture
MVP
Constrained Decoding

Constrained Decoding

Precise handling of proper nouns, places, and formatting consistency

Named Entity Recognition
Format Preservation
MVP
RAG Glossaries

RAG Glossaries

Retrieval-augmented generation for domain-specific terminology and context

LoRA
Language Families
Dialect Support
MVP

Benchmarks

Scale & Reliability

Enterprise-grade infrastructure designed to handle massive concurrent loads while maintaining consistent sub-second response times

Language-aware Routing

Language-aware Routing

Intelligent worker sharding based on language pairs and processing requirement

Optimized resource allocation
Multi-region infrastructure

Multi-region infrastructure

Distributed jitter buffers and health checks across geographic regions

Global latency reduction
NestJS Monitoring

NestJS Monitoring

Real-time QoS guardrails with automatic scaling and performance tracking

Proactive issue prevention
Burst Load Handling

Burst Load Handling

Maintains RTF < 1 performance even during traffic spikes and peak usage

Consistent user experience
RTF < 1

Real-time factor maintained under load

99.9%

Uptime across regions

1 000+

Concurrent listeners supported

Application

Where It’s Used 

Religious & Cultural

Religious & Cultural

Sermons, community gatherings

Conferences & Events

Conferences & Events

Summits, expos, academic forums

Government & Public Services

Government & Public Services

Councils, courts, emergency broadcasts

Education

Education

Universities, MOOCs, virtual classrooms

Corporate

Corporate

Global town halls, training, investor briefings

Media & Entertainment

Media & Entertainment

Sports commentary, theatre, live shows

Delivery Crew

Project Team

High-performing developers for growing companies

Competitive Ability

Key Performance Stats

Real-world performance metrics that demonstrate the system`s capabilities in production environments

01

Audio Ingestion

< 50ms

02

Speech Recognition

< 200ms

03

Translation & TTS

< 300ms

04

Total Delivery

< 550ms

Throughput & Acceleration

Throughput & Acceleration

    • 16x Cerebras Acceleration

    • 1000+ Concurrent Users

    • 50+ Language Pairs

    • 1K req / sec Peak Throughput

AI Quality Stack

AI Quality Stack

    • NLLB Base Translation Model

    • MoE Expert Specialization

    • LoRA Dialect Adaptation

    • RAG Domain Terms

Delivery Automation

Delivery Automation

    • NLLB Live Audio Stream

    • VTT: WebTT Captions

    • SRT: Subtitle Files

    • Auto-generated Post-event

Results

Leading developers driving success for dynamic businesses

Live experience

Live experience

Sub‑second perceived latency that feels conversational

Accuracy under pressure

Accuracy under pressure

Maintains domain fidelity during fast speech and code‑switching

Operational simplicity

Operational simplicity

Plug‑in SDKs; deploy without backend surgery

Audience scale

Audience scale

Thousands of concurrent listeners per session, multi‑region

Post‑event assets

Post‑event assets

Clean audio + VTT/SRT delivered automatically

Tools We Used

Technology stack

Compute & Streaming

Compute & Streaming

AWS EC2
WebRTC
Websocket
STUN / TURN
ASR

ASR

Conformer‑Transducer ASR
CTC Alignment
MT

MT

Simultaneous NMT
NLLB / M2M
MoE
LoRA
Mobile App

Mobile App

React Native (specific libraries available)
Acceleration

Acceleration

Cerebras Waferscale
Backend & Monitoring

Backend & Monitoring

NestJS
TTS & Storage

TTS & Storage

Azure Neural TTS
AWS S3
Quality Layer

Quality Layer

RAG
Constrained Decoding
bg image
bg image

Project Estimator

Answer several questions and get a free estimate

  • The estimated time to launch the product

  • Clear vision of functionality you need

  • 15% discount on your first sprint

Get AI Estimate

Frequently Asked Questions

Quick Answers

Find answers to your common concerns

How many people can join?

Up to 50 in regular rooms; in Conference Mode, multiple speakers with thousands of listeners.

Can we store recordings or transcripts?

Session artifacts can be stored in AWS S3 when enabled; retention is configurable.

What latency should we expect?

Sub‑second perceived delay in typical networks, thanks to WebRTC and streaming STT/NMT/TTS.

What SDKs are available?

TLS, token‑based auth, RBAC, Cloudflare WAF/CDN, and isolated rooms; access is scoped by roles.

What outputs are available?

Translated audio (TTS) and on‑screen captions; listeners can switch languages.

How does it behave on poor networks?

Use Conference Mode to assign speaker roles and broadcast to thousands with live translation.

Which platforms are supported?

Web app (React) and mobile app (React Native, details TBD).

About Plavno

Why choose Plavno?

Proven by our
customers feedback

clutch.co
AI-first Delivery

AI-first Delivery

Senior engineers + proven AI components to accelerate time-to-value.

800+ Projects Delivered

800+ Projects Delivered

From MVPs to enterprise platforms at global scale.

Full-stack Team

Full-stack Team

From extension UX to GPU pipelines and global scale.

Testimonials

We are trusted by our customers

“They really understand what we need. They’re very professional.”

The 3D configurator has received positive feedback from customers. Moreover, it has generated 30% more business and increased leads significantly, giving the client confidence for the future. Overall, Plavno has led the project seamlessly. Customers can expect a responsible, well-organized partner.
Read more on Clutch

Sergio Artimenia

Commercial Director, RNDpoint

Sergio Artimenia

“We appreciated the impactful contributions of Plavno.”

Plavno's efforts in addressing challenges and implementing effective solutions have played a crucial role in the success of T-Rize. The outcomes achieved have exceeded expectations, revolutionizing the investment sector and ensuring universal access to financial opportunities
Watch video review on YouTube

Thien Duy Tran

Product Manager, T-Rize Group

Thien Duy Tran

“We are very satisfied with their excellent work”

Through the partnership with Plavno, we built a system used by more than 40 million connected channels. Throughout the engagement, the team was communicative and quick in responding to our concerns. Overall, we were highly satisfied with the results of collaboration.
Read more on Clutch

Michael Bychenok

CEO, MediaCube

Michael Bychenok

“They have a clear understanding of what the end user needs.”

Plavno's codes and designs are user-friendly, and they complete all deliverables within the deadline. They are easy to work with and easily adapt to existing workflows, and the client values their professionalism and expertise. Overall, the team has delivered everything that was promised.
Read more on Clutch

Helen Lonskaya

Head of Growth, Codabrasoft LLC

Helen Lonskaya

“The app was delivered on time without any serious issues.”

The MVP app developed by Plavno is excellent and has all the functionality required. Plavno has delivered on time and ensured a successful execution via regular updates and fast problem-solving. The client is so satisfied with Plavno's work that they'll work with them on developing the full app.
Read more on Clutch

Mitya Smusin

Founder, 24hour.dev

Mitya Smusin

Contact Us

This is what will happen, after you submit form

Need a custom consultation? Ask me!

Plavno has a team of experts that ready to start your project. Ask me!

Vitaly Kovalev

Vitaly Kovalev

Sales Manager

Schedule a call

Get in touch

Fill in your details below or find us using these contacts. Let us know how we can help.

No more than 3 files may be attached up to 3MB each.
Formats: doc, docx, pdf, ppt, pptx.
Send request