A review of the state of the speech synthesis technology landscape – Neural TTS on the edge

Gerazov, Branislav; Mura, Antonello; Pagliara, Silvio
Last
2025-01-01

Abstract

The landscape of speech synthesis technology, particularly neural Text-to-Speech (TTS), has advanced rapidly in recent years. This review examines the current state of neural TTS systems and their availability for on-device deployment at the edge. Traditionally, neural TTS models have required substantial computational resources, which limited them to server-based cloud implementations. However, recent innovations in model architecture and synthesis techniques are making it possible to deploy these systems on edge devices with limited processing power. These developments are crucial for applications that require low latency, where data must be processed locally without reliance on cloud services. One important use case is assistive technology, such as Augmentative and Alternative Communication (AAC) for users with speech impairments and screen readers for visually impaired users.
2025
English
Technology for inclusion and participation for all: recent achievements and future directions. AAATE 2025, part II
9783032016317
9783032016324
Springer Nature Switzerland
Cham
Branislav Gerazov et al.
Katerina Mavrou, Pedro Encarnação
76
83
8
AAATE 2025 Conference, Technology for Inclusion and Participation for All: Recent Achievements and Future Directions
Anonymous experts
10-12 September 2025
Nicosia, Cyprus
international
scientific
Neural speech synthesis; Text to speech; Augmentative and Alternative Communication (AAC); Screen readers; On-device
Goal 4: Quality education
Goal 10: Reduced inequalities
4 Contribution in Conference Proceedings (Proceeding)::4.1 Contribution in conference proceedings
Gerazov, Branislav; Lazareva, Vanesa; Dimitrovska, Marija Markovska; Taskovski, Dimitar; Mavrou, Katerina; Theodorou, Eleni; Zanfardino, Francesco; Sp ...
273
16
4.1 Contribution in conference proceedings
reserved
info:eu-repo/semantics/conferencePaper
Files in This Item:
File Size Format  
A Review of the State of the Speech Synthesis Technology Landscape – Neural TTS on the Edge.pdf

Archive managers only

Type: publisher's version
Size 370.19 kB
Format Adobe PDF

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
