SSW9 Accepted Papers

SSW9 Accepted Papers

Investigating Spectral Amplitude Modulation Phase Hierarchy Features in Speech Synthesis
Alexandros Lazaridis, Milos Cernak, Pierre-Edouard Honnet and Philip N. Garner
Wideband Harmonic Model: Alignment and Noise Modeling for High Quality Speech Synthesis
Slava Shechtman, Alex Sorin
Using instantaneous frequency and aperiodicity detection to estimate F0 for high-quality speech synthesis
Hideki Kawahara, Yannis Agiomyrgiannakis, Heiga Zen
Non-filter waveform generation from cepstrum using spectral phase reconstruction
Yasuhiro Hamada, Nobutaka Ono, Shigeki Sagayama
Wide Passband Design for Cosine-Modulated Filter Banks in Sinusoidal Speech Synthesis
Nobuyuki Nishizawa and Tomonori Yazaki
A Pulse Model in Log-domain for a Uniform Synthesizer
Gilles Degottex, Pierre Lanchantin, Mark Gales
A hybrid harmonics-and-bursts modelling approach to speech synthesis
Jonas Beskow and Harald Berthelsen
Automatic, model-based detection of pause-less phrase boundaries from fundamental frequency and duration features
Mahsa Sadat Elyasi Langarani, and Jan van Santen
Mandarin Prosodic Phrase Prediction based on Syntactic Trees
Zhengchen Zhang, Fuxiang Wu, Chenyu Yang, Minghui Dong, and Fugen Zhou
Emphasis recreation for TTS using intonation atoms
Pierre-Edouard Honnet and Philip N. Garner
Synthesising Filled Pauses: Representation and Datamixing
Rasmus Dall, Marcus Tomalin, Mirjam Wester
Prediction of Emotions from Text using Sentiment Analysis for Expressive Speech Synthesis
Eva Vanmassenhove, Joao P. Cabral, Fasih Haider
Prosodic and Spectral iVectors for Expressive Speech Synthesis
Igor Jauk, Antonio Bonafonte
Emotional Voice Conversion Using Neural Networks with Different Temporal Scales of F0 based on Wavelet Transform
Zhaojie Luo, Tetsuya Takiguchi, Yasuo Ariki, Toru Nakashika
Novel Pre-processing using Outlier Removal in Voice Conversion
Sushant V. Rao, Nirmesh J. Shah, Hemant A. Patil
Nonaudible murmur enhancement based on statistical voice conversion and noise suppression with external noise monitoring
Yusuke Tajiri, Tomoki Toda
Multidimensional scaling of systems in the Voice Conversion Challenge 2016
Mirjam Wester, Zhizheng Wu, Junichi Yamagishi
An Automatic Voice Conversion Evaluation Strategy Based on Perceptual Background Noise Distortion and Speaker Similarity
Dong-Yan Huang, Lei Xie, Yvonne Siu Wa Lee, Jie Wu, Huaiping Ming, Xiaohai Tian, Shaofei Zhang, Chuang Ding, Mei Li, Quy Hy Nguyen, Minghui Dong, Haizhou LI
Non-intrusive Quality Assessment of Synthesized Speech using Spectral Features and Support Vector Regression
Meet H. Soni, Hemant A. Patil
How to select a good voice for TTS
Sunhee Kim
A Comparative Study of the Performance of HMM, DNN, and RNN based Speech Synthesis Systems Trained on Very Large Speaker-Dependent Corpora
Xin Wang, Shinji Takaki, Junichi Yamagishi
Investigating Very Deep Highway Networks for Parametric Speech Synthesis
Xin Wang, Shinji Takaki, Junichi Yamagishi
DNN-based Speech Synthesis for Indian Languages from ASCII text
Srikanth Ronanki, Siva Reddy, Bajibabu Bollepalli, Simon King
Parallel and cascaded deep neural networks for text-to-speech synthesis
Manuel Sam Ribeiro, Oliver Watts, Junichi Yamagishi
On the impact of phoneme alignment in DNN-based speech synthesis
Mei Li, Zhizheng Wu, Lei Xie
Temporal modeling in neural network based statistical parametric speech synthesis
Keiichi Tokuda, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku
Contextual Representation using Recurrent Neural Network Hidden State for Statistical Parametric Speech Synthesis
Sivanand Achanta, Rambabu Banoth, Ayushi Pandey, Anandaswarup Vadapalli, and Suryakanth V Gangashetty
Speaker Adaptation of Various Components in Deep Neural Network based Speech Synthesis
Shinji Takaki, SangJin Kim, Junichi Yamagishi
Multi-output RNN-LSTM for multiple speaker speech synthesis with \alpha-interpolation model
Santiago Pascual, Antonio Bonafonte
Investigating RNN-based speech enhancement methods for noise-robust Text-to-Speech
Cassia Valentini-Botinhao, Xin Wang, Shinji Takaki, Junichi Yamagishi
Development and evaluation of a statistical parametric synthesis system for operatic singing in German
Michael Pucher, Fernando Villavicencio, Junichi Yamagishi
Experiments with Cross-lingual Systems for Synthesis of Code-Mixed Text
Sunayana Sitaram, Sai Krishna Rallabandi, Shruti Rijhwani, Alan W Black
Jerk Minimization for Acoustic-To-Articulatory Inversion
Avni Rajpal and Hemant A. Patil
Utterance Selection Techniques for TTS Systems Using Found Speech
Pallavi Baljekar, Alan W. Black
Open-Source Consumer-Grade Indic Text To Speech
Andrew Wilkinson, Alok Parlikar, Sunayana Sitaram, Tim White, Alan W Black, Suresh Bazaj
Merlin: An Open Source Neural Network Speech Synthesis System
Zhizheng Wu, Oliver Watts, Simon King
WikiSpeech - enabling open source text-to-speech for Wikipedia
John Andersson, Sebastian Berlin, André Costa, Harald Berthelsen, Hanna Lindgren, Nikolaj Lindberg, Jonas Beskow, Jens Edlund, and Joakim Gustafson
ISCA

International Speech Communication Association.

SynSIG: promoting the study of Speech Synthesis