VCTK_092¶

class torchaudio.datasets.VCTK_092(root: str, mic_id: str = 'mic2', download: bool = False, url: str = 'https://datashare.is.ed.ac.uk/bitstream/handle/10283/3443/VCTK-Corpus-0.92.zip', audio_ext='.flac')[source]¶

VCTK 0.92 [Yamagishi et al., 2019] 資料集

參數:

root (str) – 找到資料集頂層目錄的根目錄。
mic_id (str, optional) – 麥克風 ID。可以是 "mic1" 或 "mic2"。(預設: "mic2")
download (bool, optional) – 是否下載資料集，如果資料集在根目錄路徑下找不到的話。(預設值：False)。
url (str, optional) – 下載資料集的網址。(預設值："https://datashare.is.ed.ac.uk/bitstream/handle/10283/3443/VCTK-Corpus-0.92.zip")
audio_ext (str, optional) – 如果資料集轉換成非預設的音訊格式，則可使用自訂的音訊副檔名。

注意

由於缺少對應的文字檔案，將跳過講者 p315 的所有語音。
由於缺少音訊檔案，將跳過 mic_id="mic2" 的 p280 的所有語音。
由於缺少音訊檔案，將跳過講者 p362 的部分語音。
另請參閱：https://datashare.is.ed.ac.uk/handle/10283/3443

getitem¶

VCTK_092.__getitem__(n: int) → Tuple[Tensor, int, str, str, str][source]¶

從資料集中載入第 n 個樣本。

參數:

n (int) – 要載入的樣本的索引

傳回值:

以下項目的 Tuple；

Tensor: 波形
int: 採樣率
str: 文字稿
str: 講者 ID
std: 語音 ID

VCTK_092¶

getitem¶

文件

教學

資源

VCTK_092¶

__getitem__¶

文件

教學

資源

getitem¶