Tracing TensorDictModule¶
我們支援追蹤 TensorDictModule
的執行以建立 FX 圖。只需從 tensordict.prototype.fx
匯入 symbolic_trace
,而不是從 torch.fx
匯入。
注意
對 torch.fx
的支援是高度實驗性的,可能會發生變化。請謹慎使用,如果您嘗試並遇到問題,請提出 issue。
追蹤 TensorDictModule
¶
我們將使用概述中的一個範例來說明。我們建立一個 TensorDictModule
,追蹤它,並檢查圖和產生的程式碼。
追蹤 TensorDictModule¶
>>> import torch
>>> import torch.nn as nn
>>> from tensordict import TensorDict
>>> from tensordict.nn import TensorDictModule
>>> from tensordict.prototype.fx import symbolic_trace
>>> class Net(nn.Module):
... def __init__(self):
... super().__init__()
... self.linear = nn.LazyLinear(1)
...
... def forward(self, x):
... logits = self.linear(x)
... return logits, torch.sigmoid(logits)
>>> module = TensorDictModule(
... Net(),
... in_keys=["input"],
... out_keys=[("outputs", "logits"), ("outputs", "probabilities")],
... )
>>> graph_module = symbolic_trace(module)
>>> print(graph_module.graph)
graph():
%tensordict : [#users=1] = placeholder[target=tensordict]
%getitem : [#users=1] = call_function[target=operator.getitem](args = (%tensordict, input), kwargs = {})
%linear : [#users=2] = call_module[target=linear](args = (%getitem,), kwargs = {})
%sigmoid : [#users=1] = call_function[target=torch.sigmoid](args = (%linear,), kwargs = {})
return (linear, sigmoid)
>>> print(graph_module.code)
def forward(self, tensordict):
getitem = tensordict['input']; tensordict = None
linear = self.linear(getitem); getitem = None
sigmoid = torch.sigmoid(linear)
return (linear, sigmoid)
我們可以檢查每個模組的前向傳遞是否產生相同的輸出。
>>> tensordict = TensorDict({"input": torch.randn(32, 100)}, [32])
>>> module_out = module(tensordict, tensordict_out=TensorDict())
>>> graph_module_out = graph_module(tensordict, tensordict_out=TensorDict())
>>> assert (
... module_out["outputs", "logits"] == graph_module_out["outputs", "logits"]
... ).all()
>>> assert (
... module_out["outputs", "probabilities"]
... == graph_module_out["outputs", "probabilities"]
... ).all()
追蹤 TensorDictSequential
¶
我們也可以追蹤 TensorDictSequential
。在這種情況下,模組的整個執行過程都會被追蹤到單個圖中,從而消除了輸入 TensorDict
上的中間讀寫。
我們透過追蹤概述中的循序範例來示範。
Tracing TensorDictSequential¶
>>> import torch
>>> import torch.nn as nn
>>> from tensordict import TensorDict
>>> from tensordict.nn import TensorDictModule, TensorDictSequential
>>> from tensordict.prototype.fx import symbolic_trace
>>> class Net(nn.Module):
... def __init__(self, input_size=100, hidden_size=50, output_size=10):
... super().__init__()
... self.fc1 = nn.Linear(input_size, hidden_size)
... self.fc2 = nn.Linear(hidden_size, output_size)
...
... def forward(self, x):
... x = torch.relu(self.fc1(x))
... return self.fc2(x)
...
... class Masker(nn.Module):
... def forward(self, x, mask):
... return torch.softmax(x * mask, dim=1)
>>> net = TensorDictModule(
... Net(), in_keys=[("input", "x")], out_keys=[("intermediate", "x")]
... )
>>> masker = TensorDictModule(
... Masker(),
... in_keys=[("intermediate", "x"), ("input", "mask")],
... out_keys=[("output", "probabilities")],
... )
>>> module = TensorDictSequential(net, masker)
>>> graph_module = symbolic_trace(module)
>>> print(graph_module.code)
def forward(self, tensordict):
getitem = tensordict[('input', 'x')]
_0_fc1 = getattr(self, "0").module.fc1(getitem); getitem = None
relu = torch.relu(_0_fc1); _0_fc1 = None
_0_fc2 = getattr(self, "0").module.fc2(relu); relu = None
getitem_1 = tensordict[('input', 'mask')]; tensordict = None
mul = _0_fc2 * getitem_1; getitem_1 = None
softmax = torch.softmax(mul, dim = 1); mul = None
return (_0_fc2, softmax)
在這種情況下,生成的圖和程式碼會稍微複雜一些。我們可以如下可視化它(需要 pydot
)
可視化圖¶
>>> from torch.fx.passes.graph_drawer import FxGraphDrawer
>>> g = FxGraphDrawer(graph_module, "sequential")
>>> with open("graph.svg", "wb") as f:
... f.write(g.get_dot_graph().create_svg())
這會產生以下視覺化