embedl_deploy.tensorrt package#
Subpackages:
embedl_deploy.tensorrt.modulespackageembedl_deploy.tensorrt.patternspackageActAddPatternAdaptiveAvgPoolPatternConvBNActPatternConvBNAddActPatternConvBNPatternDecomposeMultiheadAttentionPatternFlattenLinearToConv1x1PatternLayerNormPatternLinearActPatternLinearPatternMHAInProjectionPatternRemoveAssertPatternRemoveIdentityAdaptiveAvgPoolPatternRemoveIdentityPatternScaledDotProductAttentionPatternStemConvBNActMaxPoolPattern
Module contents:
TensorRT backend — curated pattern lists and convenience API.
Quick start:
import torch
from torchvision.models import resnet50
from embedl_deploy import transform
from embedl_deploy.tensorrt import TENSORRT_PATTERNS
model = resnet50(weights=None).eval()
deployed = transform(model, patterns=TENSORRT_PATTERNS).model
Pattern lists#
TENSORRT_PATTERNSComplete transformation pipeline: recomposition (lifting aten-level ops to
nn.Modulenodes), conversions (structural transforms), and fusions (Conv→BN→ReLU, etc.). This is the recommended list for most users and handlestorch.exportoutput correctly.TENSORRT_RECOMPOSITION_PATTERNSRecomposition-only patterns. Lift aten-level ops back into
nn.Modulenodes (fortorch.exportoutput). Already included inTENSORRT_PATTERNS.TENSORRT_CONVERSION_PATTERNSStructural conversions applied before fusion (e.g.
Flatten→Linear → Conv1×1→Flatten). Already included inTENSORRT_PATTERNS.TENSORRT_FUSION_PATTERNSFusion-only patterns (
Conv→BN→ReLU, Stem, residual, etc.). Already included inTENSORRT_PATTERNS.TENSORRT_QUANTIZED_PATTERNSQ/DQ stub insertion, propagation, deduplication, and surround.