torch.nn.functional.gelu¶

torch.nn.functional.gelu(input, approximate='none') → Tensor¶

當 approximate 參數為 ‘none’ 時，它會逐元素地應用函數 $\text{GELU}(x) = x * \Phi(x)$

其中 $\Phi(x)$ 是高斯分佈的累積分佈函數。

當 approximate 參數為 ‘tanh’ 時，Gelu 使用以下公式估算：

\text{GELU}(x) = 0.5 * x * (1 + \text{Tanh}(\sqrt{2 / \pi} * (x + 0.044715 * x^3)))

文件