Optimization
Transformation
class optimum.fx.optimization.Transformation
< source >( )
A torch.fx graph transformation.
It must implement the transform() method and be used as a callable.
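The contract above can be sketched in plain Python. This is a hypothetical stand-in for illustration only, not the real optimum.fx.optimization base class, and the "graph" here is just a list of node names rather than a torch.fx.GraphModule:

```python
# Minimal sketch of the Transformation pattern: subclasses implement
# transform() and instances are used as callables. Hypothetical stand-in,
# not the actual optimum implementation.
class Transformation:
    def __call__(self, graph_module, lint_and_recompile=True):
        # The real class also lints and recompiles the graph module here.
        return self.transform(graph_module)

    def transform(self, graph_module):
        raise NotImplementedError


class UppercaseNodeNames(Transformation):
    """Toy transformation: the "graph" is just a list of node names."""

    def transform(self, graph_module):
        return [name.upper() for name in graph_module]


transformation = UppercaseNodeNames()
result = transformation(["linear", "relu"])  # used as a callable
```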
__call__
< source >( graph_module: GraphModule lint_and_recompile: bool = True ) → torch.fx.GraphModule
get_transformed_nodes
< source >( graph_module: GraphModule ) → List[torch.fx.Node]
Returns the list of nodes that were transformed by this transformation.
transform
< source >( graph_module: GraphModule ) → torch.fx.GraphModule
transformed
< source >( node: Node ) → bool
ReversibleTransformation
class optimum.fx.optimization.ReversibleTransformation
< source >( )
A torch.fx graph transformation that is reversible.
It must implement the transform() and reverse() methods and be used as a callable.
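The reversible variant adds a reverse() method that undoes transform(); the __call__ dispatches between them via the reverse flag. A rough standalone sketch of that pattern (plain Python, hypothetical names, not the real optimum base class):

```python
# Hypothetical sketch of the reversible pattern: transform() and reverse()
# must be inverses of each other. Not the actual optimum implementation.
class ReversibleTransformation:
    def __call__(self, graph_module, lint_and_recompile=True, reverse=False):
        fn = self.reverse if reverse else self.transform
        return fn(graph_module)


class AddPrefix(ReversibleTransformation):
    def transform(self, graph_module):
        return ["fused_" + name for name in graph_module]

    def reverse(self, graph_module):
        return [name.removeprefix("fused_") for name in graph_module]


t = AddPrefix()
restored = t(t(["linear", "relu"]), reverse=True)  # round-trips to the input
```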
__call__
< source >( graph_module: GraphModule lint_and_recompile: bool = True reverse: bool = False ) → torch.fx.GraphModule
Marks a node as restored back to its original state.
reverse
< source >( graph_module: GraphModule ) → torch.fx.GraphModule
optimum.fx.optimization.compose
< source >( *args: Transformation inplace: bool = True )
Parameters
- args (Transformation) — The transformations to compose together.
- inplace (bool, defaults to True) — Whether the resulting transformation should be applied in place, or create a new graph module.
Composes a list of transformations together.
Example
>>> from transformers import BertModel
>>> from transformers.utils.fx import symbolic_trace
>>> from optimum.fx.optimization import ChangeTrueDivToMulByInverse, MergeLinears, compose
>>> model = BertModel.from_pretrained("bert-base-uncased")
>>> traced = symbolic_trace(
... model,
... input_names=["input_ids", "attention_mask", "token_type_ids"],
... )
>>> composition = compose(ChangeTrueDivToMulByInverse(), MergeLinears())
>>> transformed_model = composition(traced)
Transformations
class optimum.fx.optimization.MergeLinears
< source >( )
Transformation that merges linear layers taking the same input into one big linear layer.
Example
>>> from transformers import BertModel
>>> from transformers.utils.fx import symbolic_trace
>>> from optimum.fx.optimization import MergeLinears
>>> model = BertModel.from_pretrained("bert-base-uncased")
>>> traced = symbolic_trace(
... model,
... input_names=["input_ids", "attention_mask", "token_type_ids"],
... )
>>> transformation = MergeLinears()
>>> transformed_model = transformation(traced)
>>> restored_model = transformation(transformed_model, reverse=True)
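The merge amounts to stacking the weight matrices along the output dimension: applying the stacked matrix to the shared input yields the concatenation of the individual outputs. A small numeric sketch with plain Python lists (hypothetical helper, not the optimum implementation):

```python
def matvec(w, x):
    # Multiply a matrix (list of rows) by a vector.
    return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]


# Two linear layers that take the same input x...
w1 = [[1.0, 2.0]]                  # 1x2 weight
w2 = [[3.0, 4.0], [5.0, 6.0]]      # 2x2 weight
x = [1.0, 1.0]

# ...are equivalent to one big linear layer whose weight stacks w1 and w2:
merged = w1 + w2                    # 3x2 weight
merged_out = matvec(merged, x)      # concatenation of the two outputs
```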
class optimum.fx.optimization.FuseBiasInLinear
< source >( )
Transformation that fuses the bias into the weight of torch.nn.Linear.
Example
>>> from transformers import BertModel
>>> from transformers.utils.fx import symbolic_trace
>>> from optimum.fx.optimization import FuseBiasInLinear
>>> model = BertModel.from_pretrained("bert-base-uncased")
>>> traced = symbolic_trace(
... model,
... input_names=["input_ids", "attention_mask", "token_type_ids"],
... )
>>> transformation = FuseBiasInLinear()
>>> transformed_model = transformation(traced)
>>> restored_model = transformation(transformed_model, reverse=True)
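The trick behind this fusion: W x + b equals the augmented weight [W | b] applied to the input with a constant 1 appended. A numeric sketch in plain Python (illustrative only, not the optimum implementation):

```python
def linear(w, b, x):
    # Standard affine layer: one output per weight row, plus its bias.
    return [sum(wi * xi for wi, xi in zip(row, x)) + bi
            for row, bi in zip(w, b)]


w = [[1.0, 2.0], [3.0, 4.0]]
b = [10.0, 20.0]
x = [1.0, 2.0]

# Fuse the bias: append it as an extra weight column, and feed the input
# with a constant 1.0 appended.
w_fused = [row + [bi] for row, bi in zip(w, b)]
x_aug = x + [1.0]
fused_out = [sum(wi * xi for wi, xi in zip(row, x_aug)) for row in w_fused]
```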
class optimum.fx.optimization.ChangeTrueDivToMulByInverse
< source >( )
Transformation that changes truediv nodes to multiplication-by-inverse nodes when the denominator is static. This is, for example, sometimes the case for the scaling factor in attention layers.
Example
>>> from transformers import BertModel
>>> from transformers.utils.fx import symbolic_trace
>>> from optimum.fx.optimization import ChangeTrueDivToMulByInverse
>>> model = BertModel.from_pretrained("bert-base-uncased")
>>> traced = symbolic_trace(
... model,
... input_names=["input_ids", "attention_mask", "token_type_ids"],
... )
>>> transformation = ChangeTrueDivToMulByInverse()
>>> transformed_model = transformation(traced)
>>> restored_model = transformation(transformed_model, reverse=True)
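The rewrite itself is simple: when the denominator is a known constant, precompute its inverse once and replace the division with a multiplication. A toy sketch over a hypothetical (op, lhs, rhs) node list standing in for an fx graph:

```python
def change_truediv_to_mul_by_inverse(nodes):
    # Hypothetical graph rewrite over (op, lhs, rhs) tuples, illustrating
    # the idea behind the optimum transformation.
    rewritten = []
    for op, lhs, rhs in nodes:
        if op == "truediv" and isinstance(rhs, (int, float)):
            # Static denominator: fold 1/rhs into a multiplication.
            rewritten.append(("mul", lhs, 1.0 / rhs))
        else:
            # Dynamic denominator: leave the division untouched.
            rewritten.append((op, lhs, rhs))
    return rewritten


graph = [("truediv", "scores", 8.0), ("truediv", "a", "b")]
rewritten = change_truediv_to_mul_by_inverse(graph)
```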
class optimum.fx.optimization.FuseBatchNorm2dInConv2d
< source >( )
Transformation that fuses nn.BatchNorm2d following nn.Conv2d into a single nn.Conv2d.
The fusion is done only if the convolution has the batch normalization as its sole following node; for example, no fusion is performed when the convolution output is also consumed by another node.
Example
>>> from transformers.utils.fx import symbolic_trace
>>> from transformers import AutoModelForImageClassification
>>> from optimum.fx.optimization import FuseBatchNorm2dInConv2d
>>> model = AutoModelForImageClassification.from_pretrained("microsoft/resnet-50")
>>> model.eval()
>>> traced_model = symbolic_trace(
... model,
... input_names=["pixel_values"],
... disable_check=True
... )
>>> transformation = FuseBatchNorm2dInConv2d()
>>> transformed_model = transformation(traced_model)
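In eval mode the batch normalization is an affine function of the convolution output, so its frozen statistics can be folded into the convolution weight and bias: w' = w · γ/√(var+ε) and b' = (b − mean) · γ/√(var+ε) + β. A per-channel scalar sketch with made-up values (illustrative only, not the optimum implementation):

```python
import math

# Hypothetical frozen BatchNorm statistics for one channel (eval mode).
gamma, beta, mean, var, eps = 2.0, 1.0, 0.5, 4.0, 1e-5


def batch_norm(y):
    return gamma * (y - mean) / math.sqrt(var + eps) + beta


w, b = 3.0, 0.25                       # one conv weight/bias element
scale = gamma / math.sqrt(var + eps)   # shared folding factor
w_fused = w * scale
b_fused = (b - mean) * scale + beta

x = 1.5
original = batch_norm(w * x + b)       # conv followed by batch norm
fused = w_fused * x + b_fused          # single fused conv
```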
class optimum.fx.optimization.FuseBatchNorm1dInLinear
< source >( )
Transformation that fuses nn.BatchNorm1d following or preceding nn.Linear into a single nn.Linear.
The fusion is done only if the linear layer has the batch normalization as its sole following node, or the batch normalization has the linear layer as its sole following node; for example, no fusion is performed when the intermediate output is also consumed by another node.
Example
>>> from transformers.utils.fx import symbolic_trace
>>> from transformers import AutoModel
>>> from optimum.fx.optimization import FuseBatchNorm1dInLinear
>>> model = AutoModel.from_pretrained("nvidia/groupvit-gcc-yfcc")
>>> model.eval()
>>> traced_model = symbolic_trace(
... model,
... input_names=["input_ids", "attention_mask", "pixel_values"],
... disable_check=True
... )
>>> transformation = FuseBatchNorm1dInLinear()
>>> transformed_model = transformation(traced_model)