BIP! Finder - Neural Speech Synthesis with Transformer Network

2019 • Neural Speech Synthesis with Transformer Network

Authors: Naihan Li; Shujie Liu 0001; Yanqing Liu; Sheng Zhao; Ming Liu

Venue: Proceedings of the AAAI Conference on Artificial Intelligence

Type: Publication

Abstract: Although end-to-end neural text-to-speech (TTS) methods (such as Tacotron2) are proposed and achieve state-of-theart performance, they still suffer from two problems: 1) low efficiency during training and inference; 2) hard to model long dependency using current recurrent neural networks (RNNs). Inspired by the success of Transformer network in neural machine translation (NMT), in this paper, we introduce and adapt the multi-head attention mechanism to replace the RNN structures and also the original attention mechanism in Tacotron2. With the h... (read more)

Impact: 3.527774E-7 6.0557085E-8 424 188 / Attention: 0 24

Topics: Speech recognition Artificial intelligence

DOI: 10.1609/aaai.v33i01.33016706

External links: Crossref OpenAIRE

BibTex PDF

Topic-specific impact indicators

Popularity:
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
Influence:
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
Citation Count:
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
Impulse:
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.