Benedetto Manca
A review of the state of the speech synthesis technology landscape – Neural TTS on the edge
Gerazov, Branislav;Mura, Antonello;Pagliara, Silvio
Last
2025-01-01
Abstract
The landscape of speech synthesis technology, particularly neural Text-to-Speech (TTS), has seen rapid advancements in recent years. This review examines the current state of neural TTS systems and their availability for edge on-device deployment. Traditionally, neural TTS models have required substantial computational resources, limiting their application to server-based cloud implementations. However, recent innovations in model architecture and synthesis techniques are making it possible to deploy these systems on edge devices with limited processing power. These developments are crucial for applications requiring low latency, in which it is necessary that data is processed locally without reliance on cloud services. One important use-case is in assistive technology such as Augmented and Alternative Communication (AAC) for users with speech impairments and screen readers for the visually impaired.| File | Size | Format | |
|---|---|---|---|
| A Review of the State of the Speech Synthesis Technology Landscape – Neural TTS on the Edge.pdf Solo gestori archivio
Type: versione editoriale
Size 370.19 kB
Format Adobe PDF
|
370.19 kB | Adobe PDF | & nbsp; View / Open Request a copy |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
University of Cagliari