BIP! Finder - Listen, attend and spell: A neural network for large vocabulary conversational speech recognition

2016 • Listen, attend and spell: A neural network for large vocabulary conversational speech recognition

Authors: William Chan, Navdeep Jaitly, Quoc V. Le, Oriol Vinyals

Venue: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Type: Publication

Abstract: We present Listen, Attend and Spell (LAS), a neural speech recognizer that transcribes speech utterances directly to characters without pronunciation models, HMMs or other components of traditional speech recognizers. In LAS, the neural network architecture subsumes the acoustic, pronunciation and language models making it not only an end-to-end trained system but an end-to-end model. In contrast to DNN-HMM, CTC and most other models, LAS makes no independence assumptions about the probability distribution of the output character sequences give...
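
As a rough illustration of what "no independence assumptions" can mean for a character-level recognizer, the chain-rule factorization below conditions each output character on the input and on every character emitted before it. The symbols x, y and T are notation assumed here for illustration and do not appear in this record; the truncated abstract text is left as it is.

```latex
% Assumed notation (not from this record):
%   \mathbf{x} = input acoustic sequence
%   \mathbf{y} = (y_1, \dots, y_T) = output character sequence
\begin{equation}
  P(\mathbf{y} \mid \mathbf{x})
    = \prod_{i=1}^{T} P\left(y_i \mid \mathbf{x},\, y_{<i}\right)
\end{equation}
```

By contrast, a model that assumes conditional independence between output symbols would drop the dependence on the earlier characters y_{<i} in each factor.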

Topics: Speech recognition, Artificial intelligence, Natural language processing

DOI: 10.1109/icassp.2016.7472621

External links: Crossref, OpenAIRE

Topic-specific impact indicators

Popularity: This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
Influence: This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
Citation Count: This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
Impulse: This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
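
This page does not say how the indicators are computed, so the following is only a minimal sketch of how two citation-based measures could be derived from a citation network. The toy data, the function names, and the three-year window used for the "initial momentum" count are all assumptions made for illustration, not the site's actual method.

```python
from collections import Counter

# Hypothetical toy citation network: (citing_id, citing_year, cited_id).
CITATIONS = [
    ("p2", 2016, "p1"),
    ("p3", 2017, "p1"),
    ("p4", 2019, "p1"),
    ("p4", 2019, "p2"),
    ("p5", 2021, "p1"),
]

# Hypothetical publication years of the cited papers.
PUB_YEAR = {"p1": 2016, "p2": 2016}


def citation_count(citations):
    """Total number of citations each paper has received."""
    return Counter(cited for _, _, cited in citations)


def early_citations(citations, pub_year, window_years=3):
    """Citations received within `window_years` of publication.

    The window length is an assumption; the page only says the
    indicator covers the period directly after publication.
    """
    counts = Counter()
    for _, year, cited in citations:
        start = pub_year[cited]
        if start <= year < start + window_years:
            counts[cited] += 1
    return counts


if __name__ == "__main__":
    print(dict(citation_count(CITATIONS)))
    print(dict(early_citations(CITATIONS, PUB_YEAR)))
```

Measures such as "Popularity" and "Influence" are described above as depending on the citation network as a whole, so a plain count like this would not be enough to reproduce them.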