Autori: Cutugno, Francesco , Giordano Orsini, Luigi Maria , Norman Vitale, Vincenzo
Titolo: Large scale acoustic models: A new perspective
Periodico: Sistemi intelligenti
Anno: 2023 - Fascicolo: 2 - Pagina iniziale: 401 - Pagina finale: 412

Large Language Models (LLM), such as ChatGPT, generate texts answering to a prompt after being trained through exposition to a huge amount of texts. Similar approaches are applied in Automatic Speech Recognition (ASR) systems which are trained with unprocessed and unlabeled audio data without supervision. The deriving process recalls what a newborn could do to learn speech structure when immersed in the acoustic environment. In parallel with LLM, we refer to this architecture as Large Acoustic Models (LAM). Taking from psycholinguistics literature, we will draw a further parallel between modern ASR and human behaviors introducing the paradigm of artificial language learning. Lastly, a new approach to ASR will be presented, focusing on linguistic theories underlying natural speech.


Premi sulle icone a fianco dei nomi per visualizzare i libri scritti dall'autore



SICI: 1120-9550(2023)2<401:LSAMAN>2.0.ZU;2-2
Testo completo: https://www.rivisteweb.it/download/article/10.1422/108137
Testo completo alternativo: https://www.rivisteweb.it/doi/10.1422/108137

Esportazione dati in Refworks (solo per utenti abilitati)

Record salvabile in Zotero

Biblioteche ACNP che possiedono il periodico
Le Biblioteche aderenti
foto biblioteca

Università degli studi [Cassino] : CSB di Area Giuridico-Economica : Biblioteca
Campus Folcara - Via S. Angelo, snc
03043 - Cassino