A Neural TTS System with Parallel Prosody Transfer from Unseen Speakers

Slava Shechtman1 and Raul Fernandez2
1IBM Research, Haifa, Israel
2IBM Research, Yorktown Heights, NY, USA

Accepted to Interspeech 2023

Audio Samples


# Prosody Target Voice Target Reference Systems Proposed Systems
Ref HPC0-TTS HPC0-D0 HPC0-D1 HPC1-D0 HPC1-D1 HPC2-D0 HPC2-D1
1 F
M
2 F
M
3 F
M
4 F
M
5 F
M
6 F
M
7 F
M
8 F
M
9 F
M
10 F
M
11 F
M
12 F
M