Case-Study: Text-to-Speech Inference Optimisation on Edge (Under NDA)

See how our team built a real-time Kazakh text-to-speech solution using ONNX, deep learning, and several optimisation methods.

Written by TechnoLynx Published on 12 Mar 2024

Problem

Our client, a small government-backed startup, aimed to improve accessibility for people with visual impairments. The project goal was to extend multi-platform screen-reading tools with support for the Kazakh language on consumer devices, as part of a broader effort to improve device access for these users.

At first glance, it seemed like a standard text-to-speech (TTS) task. But the Kazakh language posed unique challenges.

Kazakh is not widely supported by major platforms. Few high-quality pre-trained models exist. Existing tools lacked native support.

At the same time, the client required fast deployment across Android, Windows, and iOS. The schedule was tight and the quality bar was high.

We needed a working Kazakh TTS that produced natural speech and ran in real time across platforms. The system also had to meet memory and performance limits on mobile devices. Our task was clear: build a reliable solution using existing resources while meeting real-time user needs.

Challenges and Constraints

The project had several technical constraints, all of which shaped our approach.

Limited Time

The key obstacle was the tight deadline. The client approached us with a firm release date already set, leaving limited time to prepare. Because there was not enough time to develop custom models or retrain existing ones from scratch, we used pre-trained models for Kazakh.

Format Compatibility

Our deployment required converting the PyTorch models into ONNX and CoreML formats. ONNX was chosen for Android and Windows. CoreML was needed for iOS. This process involved reworking inputs, outputs, and inference pipelines.

iOS Memory Limitations

The iOS screen reader framework (AVSpeechSynthesisProviderAudioUnit) had strict memory limits. The original model could not run within these constraints. This issue had to be addressed during implementation.
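To see why a memory ceiling of this kind can rule out a model, it helps to estimate how much memory the weights alone need. The sketch below is illustrative only: the parameter count, memory budget, and runtime overhead are hypothetical placeholders, not figures from this project, and `weights_size_mb` and `fits_budget` are names made up for the example.

```python
# Rough estimate of the memory needed just to hold a model's weights.
# All numbers used in the demo are hypothetical, not project measurements.

BYTES_PER_WEIGHT = {"float32": 4, "float16": 2, "int8": 1}


def weights_size_mb(num_parameters: int, dtype: str) -> float:
    """Return the size of the weights in MiB for a given storage dtype."""
    return num_parameters * BYTES_PER_WEIGHT[dtype] / (1024 * 1024)


def fits_budget(num_parameters: int, dtype: str, budget_mb: float,
                runtime_overhead_mb: float = 0.0) -> bool:
    """Check whether weights plus an assumed runtime overhead fit the budget."""
    total_mb = weights_size_mb(num_parameters, dtype) + runtime_overhead_mb
    return total_mb <= budget_mb


if __name__ == "__main__":
    params = 15_000_000   # hypothetical parameter count
    budget = 50.0         # hypothetical memory budget in MiB
    overhead = 10.0       # hypothetical buffers, activations, runtime
    for dtype in BYTES_PER_WEIGHT:
        size = weights_size_mb(params, dtype)
        ok = fits_budget(params, dtype, budget, overhead)
        print(f"{dtype}: {size:.1f} MiB of weights, fits budget: {ok}")
```

With these placeholder numbers the float32 weights exceed the budget while the float16 and int8 variants fit, which is the kind of trade-off a hard limit forces.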

Audio Quality

Natural-sounding speech was a core requirement. The Kazakh TTS had to be usable for formal communication, not just casual content. Robotic or artificial tones were unacceptable.

Outdated Application Layers

Some helpful open-source tools existed but used old build systems and dependencies. We had to update them or rewrite core components to ensure stability and compatibility.

Toolchain

To meet these challenges, we used the following tools:

  • PyTorch (for pre-trained model checkpoints)

  • ONNX Runtime (for Android and Windows deployment)

  • CoreML Tools (for iOS optimisation)

  • FFmpeg (for audio pre/post-processing)

  • Apple’s Xcode and AVFoundation (for iOS integration)

Solution

Two questions guided the solution: How can we deploy a Kazakh TTS model on three platforms using existing tools? How can we ensure high-quality, real-time speech under memory limits?

For Android and Windows, we converted the PyTorch model into ONNX, then reduced the number of layers and quantised the weights. This cut the model size from 230 MB to 97 MB and the average inference time from 320 ms to 130 ms.
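Where the savings from quantising weights come from can be shown with a minimal sketch. It assumes symmetric per-tensor int8 quantisation, which is one common scheme and not necessarily the one used in this project; the function names and the example weights are made up for illustration.

```python
from array import array


def quantise_int8(weights):
    """Symmetric per-tensor quantisation: floats -> int8 codes plus one scale."""
    max_abs = max(abs(w) for w in weights)
    # Map the largest magnitude to 127; fall back to 1.0 for an all-zero tensor.
    scale = max_abs / 127 if max_abs > 0 else 1.0
    codes = array("b", (max(-127, min(127, round(w / scale))) for w in weights))
    return codes, scale


def dequantise(codes, scale):
    """Recover approximate float values from the int8 codes."""
    return [c * scale for c in codes]


if __name__ == "__main__":
    weights = [0.8, -0.31, 0.05, -1.27, 0.64]   # made-up example weights
    codes, scale = quantise_int8(weights)
    restored = dequantise(codes, scale)

    float32_bytes = array("f", weights).itemsize * len(weights)
    int8_bytes = codes.itemsize * len(codes)

    print("int8 codes:", list(codes))
    print("max rounding error:", max(abs(a - b) for a, b in zip(weights, restored)))
    print("storage in bytes:", float32_bytes, "->", int8_bytes)
```

Storing each weight in one byte instead of four is where the size reduction comes from, at the cost of a rounding error of at most half the scale per weight. In practice this step is applied to the exported model with a toolkit (ONNX Runtime ships quantisation utilities, for example) rather than by hand.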

For iOS, the same model was too heavy. The screen reader framework imposed hard limits. We decided to build a standalone app for iOS. This let us run the model outside the system’s built-in memory limits.

The compromise was that the app could not integrate directly with VoiceOver. Still, it provided full text-to-speech functionality with natural voice quality.

Our solution passed internal tests and met all core requirements. Users could now listen to Kazakh content read aloud with high clarity. Deployment worked across all platforms.

Results

The project ultimately resulted in a high-quality text-to-speech system available on Android, Windows, and iOS. While the iOS version required a standalone application due to memory limitations, the overall solution was deemed a success. Our engineers reduced the size from 65 MB to a set of ONNX models ranging from 10 to 20 MB and a 3.6 MB CoreML model.

It let users hear natural-sounding speech in Kazakh. This made devices easier to use for people with visual impairments.

The generated speech was of sufficient quality to be practically indistinguishable from human speech, helping users interact with their devices more naturally and comfortably. The speech had to serve both for casual reading and in formal settings such as school or business.
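To put the reported model sizes in relative terms, the arithmetic is sketched below. It uses only the size figures quoted in this section; the helper name `reduction_percent` is made up for the example.

```python
def reduction_percent(before_mb: float, after_mb: float) -> float:
    """Percentage by which the size shrank relative to the original."""
    return (before_mb - after_mb) / before_mb * 100


if __name__ == "__main__":
    original_mb = 65.0   # original size reported in this section
    reported_mb = {
        "largest ONNX model": 20.0,
        "smallest ONNX model": 10.0,
        "CoreML model": 3.6,
    }
    for name, size_mb in reported_mb.items():
        pct = reduction_percent(original_mb, size_mb)
        print(f"{name}: {size_mb} MB, {pct:.1f}% smaller than the original")
```

By this calculation the ONNX models are roughly 69% to 85% smaller than the original, and the CoreML model roughly 94% smaller.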

Future Steps

Our team proposed a number of potential improvements for future projects. Specifically, we suggested continuing to optimise the model for iOS devices and exploring alternative ways to reduce memory consumption. Additionally, as AI voice technologies continue to evolve, future projects could incorporate more advanced deep learning models to enhance the quality of text-to-speech software even further.

Outlook and Lessons

This case shows how careful planning and the right tools can solve practical language support problems. The project had clear goals. We adapted pre-trained models, updated legacy tools, and worked within device limits. The result was a working system used by real people.

Text-to-speech systems like this are part of everyday life. They help with accessibility, education, and customer support. Our work here demonstrates that we can include even languages with limited support.

The role of AI in this field is growing. As models improve, the voices will sound even more natural. Devices will run faster with smaller models. This helps everyone, especially users with special access needs.

We can apply the approach we used in this project to other languages or platforms. Any group of individuals needing voice support can benefit. Whether it’s in a government system, classroom, or personal device, this kind of tool can help.

Why This Case Study Matters

This case study shows how AI helps in real life. It focuses on a clear goal, a specific group of users, and measurable results. The work followed practical steps, asked direct questions, and solved real problems.

We believe this case shows how small changes make a big difference. Quality AI does not have to be complex. It just has to work. That’s what we did here, and that’s what we aim to do in every project.

Text-to-Speech: A Broader Perspective

Text-to-speech technologies have gained increasing importance in recent years, thanks to the rise of AI technologies. These systems are used not only to support accessibility for those with visual impairments but also in a wide range of other industries. For example, text-to-speech software is used in customer service applications, virtual assistants, and content generation. A free text-to-speech solution can help businesses save time by automating the reading of documents, reports, or other text-heavy content.

AI voice technology has come a long way from its early, robotic-sounding origins. Today’s systems use deep learning to generate voices that sound more human-like, and they can operate in multiple languages, making them suitable for a wide range of global applications. By developing solutions that can handle niche languages like Kazakh, companies can expand their markets and serve previously underserved populations.

Our Perspective

At TechnoLynx, we apply structured methods. We combine computer science with human-centred design. Each individual case teaches us how to work better and faster. We rely on deep learning, efficient frameworks, and strong quality control.

If you are working on a project that includes AI voice technology, accessibility, or language support, get in touch. We’re ready to help you build something that works for your users, not just your specs. Contact us now to discuss your needs and find the ultimate solution!

