TACOTRON2_GRIFFINLIM_PHONE_LJSPEECH

torchaudio.pipelines.TACOTRON2_GRIFFINLIM_PHONE_LJSPEECH

Phoneme-based TTS pipeline that uses Tacotron2 trained on LJSpeech [Ito and Johnson, 2017] for 1,500 epochs, with GriffinLim as the vocoder.

The text processor encodes the input text as phonemes. It uses DeepPhonemizer to convert graphemes to phonemes. The model (en_us_cmudict_forward) was trained on CMUDict.
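To make the idea of phoneme-based encoding concrete, here is a minimal pure-Python sketch. It is not the torchaudio text processor or DeepPhonemizer: the lookup table `TOY_G2P`, the symbol list `SYMBOLS`, and the functions `to_phonemes` and `encode` are all hypothetical names invented for illustration, and a two-word dictionary stands in for the trained grapheme-to-phoneme model. The phonemes are ARPAbet-style symbols with stress marks omitted.

```python
import re

# Hypothetical toy lexicon standing in for a trained grapheme-to-phoneme model.
TOY_G2P = {
    "hello": ["HH", "AH", "L", "OW"],
    "world": ["W", "ER", "L", "D"],
}

# Hypothetical symbol table: index 0 is a padding symbol, " " separates words.
SYMBOLS = ["_", " ", "!", "AH", "D", "ER", "HH", "L", "OW", "W"]
SYMBOL_TO_ID = {symbol: index for index, symbol in enumerate(SYMBOLS)}


def to_phonemes(text):
    """Convert text to a flat list of phoneme, space, and punctuation symbols."""
    phonemes = []
    for token in re.findall(r"[a-z]+|!|\s+", text.lower()):
        if token in TOY_G2P:
            phonemes.extend(TOY_G2P[token])
        elif token.isspace():
            phonemes.append(" ")
        elif token == "!":
            phonemes.append("!")
        # Words missing from the toy lexicon are skipped here; a trained
        # model would predict a pronunciation for them instead.
    return phonemes


def encode(text):
    """Map each symbol to its integer ID, the form a TTS model consumes."""
    return [SYMBOL_TO_ID[symbol] for symbol in to_phonemes(text)]


print(to_phonemes("Hello world!"))
print(encode("Hello world!"))  # [6, 3, 7, 8, 1, 9, 5, 7, 4, 2]
```

The sketch only shows the two stages (graphemes to phonemes, then phonemes to integer IDs); the actual symbol set and ID assignment used by the pipeline may differ.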

You can find the training script here. The text processor is set to “english_phonemes”.
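A minimal usage sketch of this bundle follows, based on the `Tacotron2TTSBundle` interface (`get_text_processor`, `get_tacotron2`, `get_vocoder`). The wrapper function `synthesize` and the default file name `output.wav` are assumptions made for illustration. Running it requires `torch`, `torchaudio`, and the DeepPhonemizer package, and the pretrained weights are downloaded on first use; availability of the bundle may depend on the installed torchaudio version.

```python
def synthesize(text, out_path="output.wav"):
    """Synthesize `text` with the phoneme-based Tacotron2 + GriffinLim bundle.

    `synthesize` and `out_path` are illustrative names, not part of torchaudio.
    """
    # Imported inside the function so the heavy dependencies (and the
    # pretrained-weight download) are only needed when synthesis is run.
    import torch
    import torchaudio

    bundle = torchaudio.pipelines.TACOTRON2_GRIFFINLIM_PHONE_LJSPEECH
    processor = bundle.get_text_processor()  # phoneme-based text processor
    tacotron2 = bundle.get_tacotron2()       # text-to-spectrogram model
    vocoder = bundle.get_vocoder()           # GriffinLim vocoder

    with torch.inference_mode():
        # Text -> phoneme token IDs and their lengths.
        tokens, lengths = processor(text)
        # Token IDs -> mel spectrogram (alignments are not used here).
        spec, spec_lengths, _ = tacotron2.infer(tokens, lengths)
        # Mel spectrogram -> waveform.
        waveforms, _ = vocoder(spec, spec_lengths)

    torchaudio.save(out_path, waveforms, vocoder.sample_rate)
```

Calling `synthesize("Hello world! T T S stands for Text to Speech!")` would write the synthesized speech to `output.wav` in the current directory.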

Example - “Hello world! T T S stands for Text to Speech!”

Example - “The examination and testimony of the experts enabled the Commission to conclude that five shots may have been fired,”
