UK Business

DeepL introduces instant voice translation with new Voice-to-Voice tech

The era of waiting for a human interpreter to bridge a language gap in a live conversation may be coming to a close. Language AI specialist DeepL has launched a comprehensive real-time voice translation suite, a strategic move that pitches the company directly against tech giants like Google and Microsoft in the race to dominate AI-powered communication.

The New Frontier: Voice-to-Voice Translation

DeepL Voice-to-Voice is designed to tackle what the company calls one of the last major barriers in organisations: live spoken translation. The suite comprises several products targeting different scenarios, all built on what DeepL claims is a superior AI translation core. The company’s proprietary technology, which reportedly uses Convolutional Neural Networks (CNNs) for better capture of linguistic nuance, forms the foundation of the new voice service.

For virtual meetings, DeepL Voice for Meetings integrates with platforms like Microsoft Teams and Zoom, allowing each participant to speak in their native tongue while others hear the translation. An early access programme for this feature began in June. For in-person or ad-hoc discussions, Voice for Conversations offers a mobile and web-based solution, now generally available, which is particularly useful in environments where installing apps is restricted.

A notable feature is Group Conversations, which facilitates multilingual training or workshops. Participants join via a QR code for real-time, simultaneous translation, a tool aimed at frontline workers needing shared understanding. This became generally available on April 30. For businesses wanting to embed the technology, a Voice-to-Voice API is available through an ongoing early access programme, enabling integration into internal tools or customer contact centres.

Further enhancing accuracy, a customisation feature launching on May 7 allows companies to ensure specific terminology—like industry jargon or product names—is captured and translated correctly, even in fast-paced, technical speech. This integrates with DeepL’s existing translation glossaries.

Critically for enterprise adoption, DeepL emphasises robust security, stating its voice technology does not use customer data to train models and does not permanently store transcription data after a call ends.

Performance and Market Reception

DeepL is backing its launch with performance claims from independent evaluators. In blind tests commissioned by DeepL and conducted by Slator, 96% of professional linguists preferred DeepL Voice over the native translation solutions from Google, Microsoft, and Zoom, citing superior fluency and contextual accuracy. The solutions for Zoom and Teams scored 96.4 and 96.3 out of 100 respectively. A separate Slator benchmark found DeepL Voice delivered a 76% average reduction in high-severity errors compared to other platforms.

The service supports over 40 languages, including all 24 official EU languages, as well as Vietnamese, Thai, Arabic, Norwegian, Hebrew, Bengali, and Tagalog.

Early adopters report significant impacts on workflow. Yoichi Okuyama, Head of DX System Department at Pioneer, stated that removing language friction led to more active participation and faster decision-making in global teams. At Mondelēz International, Geoffrey Wright highlighted the speed and confidentiality DeepL brought to handling sensitive documents, accelerating organisation-wide adoption. Other major clients include Porsche, NEC, and Deutsche Bahn.

Evolving the Core Translator Platform

Alongside the voice launch, DeepL is evolving its flagship product into what it terms the next-generation DeepL Translator platform. This aims to act as an end-to-end translation infrastructure for enterprises, moving beyond a simple tool to a centralised, AI-first system integrated into daily workflows.

The platform addresses several pain points: its Translation Flow feature allows content to be translated instantly within existing systems; a Translation Quality Assessment tool gives teams visibility into a translation’s reliability; and an ongoing improvements function learns from user edits to constantly enhance output.

The economic argument is being made forcefully. A commissioned Total Economic Impact study by Forrester Consulting suggested a composite organisation could see a 345% return on investment over three years using DeepL’s tools, primarily through efficiency savings.

Company Background and Future Roadmap

DeepL was founded in Cologne in 2017 by CEO Jarek Kutylowski and has grown to become a major player in language AI, now with over 1,000 employees. It achieved ‘unicorn’ status with a $2 billion valuation following a $300 million Series C funding round in May 2024, bringing total funding to $415 million. Its investors include Benchmark, IVP, and Index Ventures.

The company is not standing still. It is reportedly developing an end-to-end voice translation model to bypass the current three-step pipeline (speech-to-text, translation, text-to-speech) to reduce latency further. A voice-preservation feature, maintaining a speaker’s vocal characteristics in translation, is planned for late 2026. DeepL has also launched a marketplace for API-based apps and is developing an AI agent capable of autonomously using office systems based on natural language commands.

With this expansion from text into real-time speech, DeepL is betting that its reputation for superior translation quality can help it carve out a dominant position in a market long targeted by the world’s largest technology firms.

Thaddeus Norwell

Business & Technology Writer
Thaddeus Norwell is a business and technology writer based in London, UK. He reports on business trends, digital innovation, and regulatory developments shaping the UK economy, focusing on practical outcomes rather than speculation. His work explores how technology and policy affect companies, markets, and consumers.
· Market and regulatory analysis, fintech sector reporting, enterprise technology coverage
· UK corporate landscape, tax and fiscal policy, interest rates and mortgages, AI regulation, cybersecurity threats, startup ecosystem

Related Articles

Back to top button