Next.js
SaaS
AI
Voice Recognition
Performance Optimization
United Kingdom
SaaS
Code refactoring
SEO optimization
CMS Optimization
1 year, ongoing
Speechmatics is a leading provider of automatic speech recognition (ASR) technology, offering high-accuracy, real-time speech-to-text solutions in over 50 languages. Their technology is widely used across industries such as media, CCaaS and EdTech, and can be deployed flexibly in both cloud and on-premise environments.
They deliver highly accurate speech-to-text transcription, even in difficult audio environments. This makes them a top choice for businesses looking to improve voice-driven applications.
As the platform grew quickly, Speechmatics faced a few challenges. They needed to make their technology easier and more natural to use by focusing on voice interaction, so users wouldn't need to rely on things like touchscreens or a mouse.
They also had to resolve performance issues on their website, which were hurting SEO and complicating the technical implementation of tools such as Google Tag Manager.
Since Speechmatics focused on backend ASR tech, building their own frontend team wasn’t practical. They needed a reliable partner with frontend expertise to smoothly integrate with their backend and fix website performance issues.
The challenge wasn’t just about building a user-friendly interface; it also required strong knowledge of speech recognition and AI.
That's where our developers stepped in, creating a proof of concept for an AI agent that transformed how users interact with their language model.
Understanding What Is What
The goal was to create an AI agent that allowed voice communication with their language model, eliminating the need for traditional input methods.
We started by thoroughly analyzing the existing technology stack and aligning it with Speechmatics' business goals: increasing user interaction and improving the performance of the platform. This ensured the project started with a clear understanding of the requirements.
Developing the AI Agent
To address the challenge, we designed and developed an AI agent application that enabled users to communicate using voice commands. This app provided flexibility by allowing users to speak and receive voice responses, which eliminated the need for a touchscreen or mouse.
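To make the speak-and-respond loop more concrete, here is a minimal sketch of how the turn-taking in a voice-first agent could be modelled. It is not the code built for Speechmatics. The phase names, event names and the `nextState` function are all hypothetical, and the rule that a user can interrupt the agent mid-reply is our assumption for illustration.

```typescript
// Hypothetical conversation phases for a voice-first agent (illustrative names only).
type Phase = "idle" | "listening" | "thinking" | "speaking";

// Hypothetical events the UI layer would emit; not taken from any real SDK.
type AgentEvent =
  | { type: "userStartedSpeaking" }
  | { type: "userStoppedSpeaking"; transcript: string }
  | { type: "replyReady"; reply: string }
  | { type: "playbackFinished" };

interface AgentState {
  phase: Phase;
  lastTranscript: string;
  lastReply: string;
}

const initialState: AgentState = { phase: "idle", lastTranscript: "", lastReply: "" };

// Pure reducer: returns the next state and ignores events that do not fit the current phase.
function nextState(state: AgentState, event: AgentEvent): AgentState {
  switch (event.type) {
    case "userStartedSpeaking":
      // Assumption: speaking over the agent interrupts playback and starts a new turn.
      return state.phase === "idle" || state.phase === "speaking"
        ? { ...state, phase: "listening" }
        : state;
    case "userStoppedSpeaking":
      return state.phase === "listening"
        ? { ...state, phase: "thinking", lastTranscript: event.transcript }
        : state;
    case "replyReady":
      return state.phase === "thinking"
        ? { ...state, phase: "speaking", lastReply: event.reply }
        : state;
    case "playbackFinished":
      return state.phase === "speaking" ? { ...state, phase: "idle" } : state;
  }
}
```

Keeping the transitions in one pure function means the microphone, language model and audio playback can each be swapped out without touching the conversation logic.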
Enhancing Accessibility and Usability
In addition to voice-based communication, our team integrated options for keyboard input and real-time transcription of spoken conversations. These features enhanced the accessibility of the application, giving users more control over how they interacted with the system.
This approach ensured the solution was adaptable to various user environments and needs, improving overall usability.
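As a rough illustration of how typed input and live transcription can share one conversation view, the sketch below keeps spoken and typed messages in a single log and shows in-progress speech as provisional text. The `ConversationLog` class and its method names are hypothetical; a real speech-to-text service defines its own message format for partial and final transcripts.

```typescript
// Hypothetical entry shape; not the format of any particular speech-to-text API.
type Source = "voice" | "keyboard";

interface Entry {
  source: Source;
  text: string;
}

class ConversationLog {
  private entries: Entry[] = [];
  private partial = ""; // in-progress speech that may still be revised

  // Replace the provisional text while the user is still speaking.
  updatePartial(text: string): void {
    this.partial = text;
  }

  // Commit recognised speech and clear the provisional text.
  commitSpeech(text: string): void {
    const trimmed = text.trim();
    if (trimmed) this.entries.push({ source: "voice", text: trimmed });
    this.partial = "";
  }

  // Typed messages go into the same log as spoken ones.
  addTyped(text: string): void {
    const trimmed = text.trim();
    if (trimmed) this.entries.push({ source: "keyboard", text: trimmed });
  }

  // Lines to display: committed entries first, then any provisional speech.
  render(): string[] {
    const lines = this.entries.map((e) => `[${e.source}] ${e.text}`);
    if (this.partial) lines.push(`[voice] ${this.partial}...`);
    return lines;
  }
}
```

Because both input methods write to the same log, a user can switch between speaking and typing mid-conversation without losing context.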
The key outcomes include:
Increased speech recognition accuracy and efficiency through cutting-edge AI technologies.
Greater flexibility and innovation to meet a wide range of client demands.
Improved website performance, resulting in better SEO and user engagement.
Streamlined implementation of Google Tag Manager for more effective tracking and analytics.