SSW9 Accepted Papers
Investigating Spectral Amplitude Modulation Phase Hierarchy Features in Speech Synthesis |
Alexandros Lazaridis, Milos Cernak, Pierre-Edouard Honnet and Philip N. Garner |
Wideband Harmonic Model: Alignment and Noise Modeling for High Quality Speech Synthesis |
Slava Shechtman, Alex Sorin |
Using instantaneous frequency and aperiodicity detection to estimate F0 for high-quality speech synthesis |
Hideki Kawahara, Yannis Agiomyrgiannakis, Heiga Zen |
Non-filter waveform generation from cepstrum using spectral phase reconstruction |
Yasuhiro Hamada, Nobutaka Ono, Shigeki Sagayama |
Wide Passband Design for Cosine-Modulated Filter Banks in Sinusoidal Speech Synthesis |
Nobuyuki Nishizawa and Tomonori Yazaki |
A Pulse Model in Log-domain for a Uniform Synthesizer |
Gilles Degottex, Pierre Lanchantin, Mark Gales |
A hybrid harmonics-and-bursts modelling approach to speech synthesis |
Jonas Beskow and Harald Berthelsen |
Automatic, model-based detection of pause-less phrase boundaries from fundamental frequency and duration features |
Mahsa Sadat Elyasi Langarani, and Jan van Santen |
Mandarin Prosodic Phrase Prediction based on Syntactic Trees |
Zhengchen Zhang, Fuxiang Wu, Chenyu Yang, Minghui Dong, and Fugen Zhou |
Emphasis recreation for TTS using intonation atoms |
Pierre-Edouard Honnet and Philip N. Garner |
Synthesising Filled Pauses: Representation and Datamixing |
Rasmus Dall, Marcus Tomalin, Mirjam Wester |
Prediction of Emotions from Text using Sentiment Analysis for Expressive Speech Synthesis |
Eva Vanmassenhove, Joao P. Cabral, Fasih Haider |
Prosodic and Spectral iVectors for Expressive Speech Synthesis |
Igor Jauk, Antonio Bonafonte |
Emotional Voice Conversion Using Neural Networks with Different Temporal Scales of F0 based on Wavelet Transform |
Zhaojie Luo, Tetsuya Takiguchi, Yasuo Ariki, Toru Nakashika |
Novel Pre-processing using Outlier Removal in Voice Conversion |
Sushant V. Rao, Nirmesh J. Shah, Hemant A. Patil |
Nonaudible murmur enhancement based on statistical voice conversion and noise suppression with external noise monitoring |
Yusuke Tajiri, Tomoki Toda |
Multidimensional scaling of systems in the Voice Conversion Challenge 2016 |
Mirjam Wester, Zhizheng Wu, Junichi Yamagishi |
An Automatic Voice Conversion Evaluation Strategy Based on Perceptual Background Noise Distortion and Speaker Similarity |
Dong-Yan Huang, Lei Xie, Yvonne Siu Wa Lee, Jie Wu, Huaiping Ming, Xiaohai Tian, Shaofei Zhang, Chuang Ding, Mei Li, Quy Hy Nguyen, Minghui Dong, Haizhou LI |
Non-intrusive Quality Assessment of Synthesized Speech using Spectral Features and Support Vector Regression |
Meet H. Soni, Hemant A. Patil |
How to select a good voice for TTS |
Sunhee Kim |
A Comparative Study of the Performance of HMM, DNN, and RNN based Speech Synthesis Systems Trained on Very Large Speaker-Dependent Corpora |
Xin Wang, Shinji Takaki, Junichi Yamagishi |
Investigating Very Deep Highway Networks for Parametric Speech Synthesis |
Xin Wang, Shinji Takaki, Junichi Yamagishi |
DNN-based Speech Synthesis for Indian Languages from ASCII text |
Srikanth Ronanki, Siva Reddy, Bajibabu Bollepalli, Simon King |
Parallel and cascaded deep neural networks for text-to-speech synthesis |
Manuel Sam Ribeiro, Oliver Watts, Junichi Yamagishi |
On the impact of phoneme alignment in DNN-based speech synthesis |
Mei Li, Zhizheng Wu, Lei Xie |
Temporal modeling in neural network based statistical parametric speech synthesis |
Keiichi Tokuda, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku |
Contextual Representation using Recurrent Neural Network Hidden State for Statistical Parametric Speech Synthesis |
Sivanand Achanta, Rambabu Banoth, Ayushi Pandey, Anandaswarup Vadapalli, and Suryakanth V Gangashetty |
Speaker Adaptation of Various Components in Deep Neural Network based Speech Synthesis |
Shinji Takaki, SangJin Kim, Junichi Yamagishi |
Multi-output RNN-LSTM for multiple speaker speech synthesis with \alpha-interpolation model |
Santiago Pascual, Antonio Bonafonte |
Investigating RNN-based speech enhancement methods for noise-robust Text-to-Speech |
Cassia Valentini-Botinhao, Xin Wang, Shinji Takaki, Junichi Yamagishi |
Development and evaluation of a statistical parametric synthesis system for operatic singing in German |
Michael Pucher, Fernando Villavicencio, Junichi Yamagishi |
Experiments with Cross-lingual Systems for Synthesis of Code-Mixed Text |
Sunayana Sitaram, Sai Krishna Rallabandi, Shruti Rijhwani, Alan W Black |
Jerk Minimization for Acoustic-To-Articulatory Inversion |
Avni Rajpal and Hemant A. Patil |
Utterance Selection Techniques for TTS Systems Using Found Speech |
Pallavi Baljekar, Alan W. Black |
Open-Source Consumer-Grade Indic Text To Speech |
Andrew Wilkinson, Alok Parlikar, Sunayana Sitaram, Tim White, Alan W Black, Suresh Bazaj |
Merlin: An Open Source Neural Network Speech Synthesis System |
Zhizheng Wu, Oliver Watts, Simon King |
WikiSpeech - enabling open source text-to-speech for Wikipedia |
John Andersson, Sebastian Berlin, André Costa, Harald Berthelsen, Hanna Lindgren, Nikolaj Lindberg, Jonas Beskow, Jens Edlund, and Joakim Gustafson |
Latest news
Keynote talks
We are pleased to announce Oriol Guasch, Alex Acero and Quoc V. Lee as Keynote Speakers for SSW9.August 26th 2016
Presentation Guidelines
Poster and oral presentation guidelines are published hereAugust 4th 2016
Call for Demos
If you want to make a demo presentation, please take a look at the Call for DemosJuly 25th 2016
Oral and Poster Sessions
The oral and poster sessions have been published.July 22th 2016
Accepted Papers
The list of accepted papers has been published.Deadline for camera-ready papers: July 26th
July 15th 2016
Registration
The registration system is open. Early registration ends on August 10th.July 9th 2016