CUCTCDecoder¶

class torchaudio.models.decoder.CUCTCDecoder[原始碼]¶

CUDA CTC 波束搜尋解碼器。

注意

若要建置解碼器，請使用 factory 函數 cuda_ctc_decoder()。

方法¶

CUCTCDecoder.__call__(log_prob: Tensor, encoder_out_lens: Tensor)[原始碼]¶

參數:

log_prob (torch.FloatTensor) – GPU 張量，形狀為 (batch, frame, num_tokens)，儲存標籤上的機率分佈序列；log_softmax(聲學模型的輸出)。
lengths (dpython:type torch.python:int32) – GPU 張量，形狀為 (batch, )，儲存每個批次中輸出張量在時間軸上的有效長度。

傳回:

批次中每個音訊序列的排序最佳假設清單。

傳回類型:

List[List[CUCTCHypothesis]]

class torchaudio.models.decoder.CUCTCHypothesis(tokens: List[int], words: List[str], score: float)[原始碼]¶

表示 CUCTC 波束搜尋解碼器 CUCTCDecoder 產生的假設。

使用 CUCTCHypothesis 的教學: 使用 CUDA CTC 解碼器進行 ASR 推論

使用 CUDA CTC 解碼器進行 ASR 推論