Index

A | C | D | E | F | G | H | I | L | M | N | P | Q | R | S | T | W | Z

A
activation (embedl_deploy.quantize.QuantConfig attribute)
AdaptiveAvgPoolPattern (class in embedl_deploy.tensorrt.patterns)
alpha (embedl_deploy.quantize.SmoothQuantConfig attribute)
apply_transformation_plan() (in module embedl_deploy)

C
calibrate_qdq() (in module embedl_deploy.quantize)
calibrate_smooth_quant() (in module embedl_deploy.quantize)
calibration_method (embedl_deploy.quantize.TensorQuantConfig attribute)
CalibrationMethod (class in embedl_deploy.quantize)
compute_parameters() (embedl_deploy.quantize.QuantStub method)
configure() (in module embedl_deploy.quantize)
ConvBNActPattern (class in embedl_deploy.tensorrt.patterns)
ConvBNAddActPattern (class in embedl_deploy.tensorrt.patterns)
ConvBNPattern (class in embedl_deploy.tensorrt.patterns)

D
DecomposeMultiheadAttentionPattern (class in embedl_deploy.tensorrt.patterns)
disable_fake_quant() (in module embedl_deploy.quantize)

E
embedl_deploy
    module
embedl_deploy.quantize
    module
embedl_deploy.tensorrt
    module
embedl_deploy.tensorrt.modules
    module
embedl_deploy.tensorrt.patterns
    module
embedl_deploy.version
    module
embedl_deploy.version.public
    module
enable_fake_quant() (in module embedl_deploy.quantize)

F
FlattenLinearToConv1x1Pattern (class in embedl_deploy.tensorrt.patterns)
forward() (embedl_deploy.quantize.QuantStub method)
    (embedl_deploy.quantize.WeightFakeQuantize method)
    (embedl_deploy.tensorrt.modules.FusedAdaptiveAvgPool2d method)
    (embedl_deploy.tensorrt.modules.FusedConvBN method)
    (embedl_deploy.tensorrt.modules.FusedConvBNAct method)
    (embedl_deploy.tensorrt.modules.FusedConvBNActMaxPool method)
    (embedl_deploy.tensorrt.modules.FusedConvBNAddAct method)
    (embedl_deploy.tensorrt.modules.FusedLayerNorm method)
    (embedl_deploy.tensorrt.modules.FusedLinear method)
    (embedl_deploy.tensorrt.modules.FusedLinearAct method)
    (embedl_deploy.tensorrt.modules.FusedMHAInProjection method)
    (embedl_deploy.tensorrt.modules.FusedScaledDotProductAttention method)
freeze_bn_stats() (in module embedl_deploy.quantize)
FusedAdaptiveAvgPool2d (class in embedl_deploy.tensorrt.modules)
FusedConvBN (class in embedl_deploy.tensorrt.modules)
FusedConvBNAct (class in embedl_deploy.tensorrt.modules)
FusedConvBNActMaxPool (class in embedl_deploy.tensorrt.modules)
FusedConvBNAddAct (class in embedl_deploy.tensorrt.modules)
FusedLayerNorm (class in embedl_deploy.tensorrt.modules)
FusedLinear (class in embedl_deploy.tensorrt.modules)
FusedLinearAct (class in embedl_deploy.tensorrt.modules)
FusedMHAInProjection (class in embedl_deploy.tensorrt.modules)
FusedScaledDotProductAttention (class in embedl_deploy.tensorrt.modules)

G
get_transformation_plan() (in module embedl_deploy)

H
HISTOGRAM (embedl_deploy.quantize.CalibrationMethod attribute)

I
inputs_to_quantize (embedl_deploy.tensorrt.modules.FusedAdaptiveAvgPool2d attribute)
    (embedl_deploy.tensorrt.modules.FusedConvBN attribute)
    (embedl_deploy.tensorrt.modules.FusedConvBNAct attribute)
    (embedl_deploy.tensorrt.modules.FusedConvBNActMaxPool attribute)
    (embedl_deploy.tensorrt.modules.FusedConvBNAddAct attribute)
    (embedl_deploy.tensorrt.modules.FusedLayerNorm attribute)
    (embedl_deploy.tensorrt.modules.FusedLinear attribute)
    (embedl_deploy.tensorrt.modules.FusedLinearAct attribute)
    (embedl_deploy.tensorrt.modules.FusedMHAInProjection attribute)
    (embedl_deploy.tensorrt.modules.FusedScaledDotProductAttention attribute)
is_conversion (embedl_deploy.tensorrt.patterns.DecomposeMultiheadAttentionPattern attribute)
    (embedl_deploy.tensorrt.patterns.FlattenLinearToConv1x1Pattern attribute)
    (embedl_deploy.tensorrt.patterns.RemoveAssertPattern attribute)
    (embedl_deploy.tensorrt.patterns.RemoveDeadAssertPattern attribute)
    (embedl_deploy.tensorrt.patterns.RemoveIdentityAdaptiveAvgPoolPattern attribute)
    (embedl_deploy.tensorrt.patterns.RemoveIdentityPattern attribute)

L
LayerNormPattern (class in embedl_deploy.tensorrt.patterns)
LinearActPattern (class in embedl_deploy.tensorrt.patterns)
LinearPattern (class in embedl_deploy.tensorrt.patterns)

M
match() (embedl_deploy.tensorrt.patterns.AdaptiveAvgPoolPattern method)
    (embedl_deploy.tensorrt.patterns.ConvBNActPattern method)
    (embedl_deploy.tensorrt.patterns.ConvBNAddActPattern method)
    (embedl_deploy.tensorrt.patterns.ConvBNPattern method)
    (embedl_deploy.tensorrt.patterns.DecomposeMultiheadAttentionPattern method)
    (embedl_deploy.tensorrt.patterns.FlattenLinearToConv1x1Pattern method)
    (embedl_deploy.tensorrt.patterns.LayerNormPattern method)
    (embedl_deploy.tensorrt.patterns.LinearActPattern method)
    (embedl_deploy.tensorrt.patterns.LinearPattern method)
    (embedl_deploy.tensorrt.patterns.MHAInProjectionPattern method)
    (embedl_deploy.tensorrt.patterns.RemoveAssertPattern method)
    (embedl_deploy.tensorrt.patterns.RemoveDeadAssertPattern method)
    (embedl_deploy.tensorrt.patterns.RemoveIdentityAdaptiveAvgPoolPattern method)
    (embedl_deploy.tensorrt.patterns.RemoveIdentityPattern method)
    (embedl_deploy.tensorrt.patterns.ScaledDotProductAttentionPattern method)
    (embedl_deploy.tensorrt.patterns.StemConvBNActMaxPoolPattern method)
matches (embedl_deploy.TransformationPlan attribute)
    (embedl_deploy.TransformationResult attribute)
MHAInProjectionPattern (class in embedl_deploy.tensorrt.patterns)
MINMAX (embedl_deploy.quantize.CalibrationMethod attribute)
model (embedl_deploy.TransformationPlan attribute)
    (embedl_deploy.TransformationResult attribute)
module
    embedl_deploy
    embedl_deploy.quantize
    embedl_deploy.tensorrt
    embedl_deploy.tensorrt.modules
    embedl_deploy.tensorrt.patterns
    embedl_deploy.version
    embedl_deploy.version.public
ModulesToSkip (class in embedl_deploy.quantize)
MOVING_AVERAGE_MINMAX (embedl_deploy.quantize.CalibrationMethod attribute)

N
n_bits (embedl_deploy.quantize.TensorQuantConfig attribute)

P
per_channel (embedl_deploy.quantize.TensorQuantConfig attribute)
prefers_fp_input (embedl_deploy.tensorrt.modules.FusedLayerNorm attribute)
prepare_qat() (in module embedl_deploy.quantize)

Q
qdq_points (embedl_deploy.tensorrt.patterns.AdaptiveAvgPoolPattern attribute)
    (embedl_deploy.tensorrt.patterns.ConvBNActPattern attribute)
    (embedl_deploy.tensorrt.patterns.ConvBNAddActPattern attribute)
    (embedl_deploy.tensorrt.patterns.ConvBNPattern attribute)
    (embedl_deploy.tensorrt.patterns.LayerNormPattern attribute)
    (embedl_deploy.tensorrt.patterns.LinearActPattern attribute)
    (embedl_deploy.tensorrt.patterns.LinearPattern attribute)
    (embedl_deploy.tensorrt.patterns.MHAInProjectionPattern attribute)
    (embedl_deploy.tensorrt.patterns.ScaledDotProductAttentionPattern attribute)
    (embedl_deploy.tensorrt.patterns.StemConvBNActMaxPoolPattern attribute)
quant_max (embedl_deploy.quantize.TensorQuantConfig property)
quant_min (embedl_deploy.quantize.TensorQuantConfig property)
quant_range() (embedl_deploy.quantize.TensorQuantConfig method)
QuantConfig (class in embedl_deploy.quantize)
quantize() (in module embedl_deploy.quantize)
QuantStub (class in embedl_deploy.quantize)

R
RemoveAssertPattern (class in embedl_deploy.tensorrt.patterns)
RemoveDeadAssertPattern (class in embedl_deploy.tensorrt.patterns)
RemoveIdentityAdaptiveAvgPoolPattern (class in embedl_deploy.tensorrt.patterns)
RemoveIdentityPattern (class in embedl_deploy.tensorrt.patterns)
replace() (embedl_deploy.tensorrt.patterns.AdaptiveAvgPoolPattern method)
    (embedl_deploy.tensorrt.patterns.ConvBNActPattern method)
    (embedl_deploy.tensorrt.patterns.ConvBNAddActPattern method)
    (embedl_deploy.tensorrt.patterns.ConvBNPattern method)
    (embedl_deploy.tensorrt.patterns.DecomposeMultiheadAttentionPattern method)
    (embedl_deploy.tensorrt.patterns.FlattenLinearToConv1x1Pattern method)
    (embedl_deploy.tensorrt.patterns.LayerNormPattern method)
    (embedl_deploy.tensorrt.patterns.LinearActPattern method)
    (embedl_deploy.tensorrt.patterns.LinearPattern method)
    (embedl_deploy.tensorrt.patterns.MHAInProjectionPattern method)
    (embedl_deploy.tensorrt.patterns.RemoveAssertPattern method)
    (embedl_deploy.tensorrt.patterns.RemoveDeadAssertPattern method)
    (embedl_deploy.tensorrt.patterns.RemoveIdentityAdaptiveAvgPoolPattern method)
    (embedl_deploy.tensorrt.patterns.RemoveIdentityPattern method)
    (embedl_deploy.tensorrt.patterns.ScaledDotProductAttentionPattern method)
    (embedl_deploy.tensorrt.patterns.StemConvBNActMaxPoolPattern method)
report (embedl_deploy.TransformationResult attribute)

S
scale (embedl_deploy.quantize.QuantStub attribute)
ScaledDotProductAttentionPattern (class in embedl_deploy.tensorrt.patterns)
skip (embedl_deploy.quantize.QuantConfig attribute)
smooth (embedl_deploy.quantize.ModulesToSkip attribute)
smooth_quant (embedl_deploy.quantize.QuantConfig attribute)
SmoothQuantConfig (class in embedl_deploy.quantize)
StemConvBNActMaxPoolPattern (class in embedl_deploy.tensorrt.patterns)
stub (embedl_deploy.quantize.ModulesToSkip attribute)
symmetric (embedl_deploy.quantize.TensorQuantConfig attribute)

T
TensorQuantConfig (class in embedl_deploy.quantize)
transform() (in module embedl_deploy)
TransformationPlan (class in embedl_deploy)
TransformationResult (class in embedl_deploy)
tree (embedl_deploy.tensorrt.patterns.AdaptiveAvgPoolPattern attribute)
    (embedl_deploy.tensorrt.patterns.ConvBNActPattern attribute)
    (embedl_deploy.tensorrt.patterns.ConvBNAddActPattern attribute)
    (embedl_deploy.tensorrt.patterns.ConvBNPattern attribute)
    (embedl_deploy.tensorrt.patterns.DecomposeMultiheadAttentionPattern attribute)
    (embedl_deploy.tensorrt.patterns.FlattenLinearToConv1x1Pattern attribute)
    (embedl_deploy.tensorrt.patterns.LayerNormPattern attribute)
    (embedl_deploy.tensorrt.patterns.LinearActPattern attribute)
    (embedl_deploy.tensorrt.patterns.LinearPattern attribute)
    (embedl_deploy.tensorrt.patterns.MHAInProjectionPattern attribute)
    (embedl_deploy.tensorrt.patterns.RemoveAssertPattern attribute)
    (embedl_deploy.tensorrt.patterns.RemoveDeadAssertPattern attribute)
    (embedl_deploy.tensorrt.patterns.RemoveIdentityAdaptiveAvgPoolPattern attribute)
    (embedl_deploy.tensorrt.patterns.RemoveIdentityPattern attribute)
    (embedl_deploy.tensorrt.patterns.ScaledDotProductAttentionPattern attribute)
    (embedl_deploy.tensorrt.patterns.StemConvBNActMaxPoolPattern attribute)

W
weight (embedl_deploy.quantize.ModulesToSkip attribute)
    (embedl_deploy.quantize.QuantConfig attribute)
WeightFakeQuantize (class in embedl_deploy.quantize)

Z
zero_point (embedl_deploy.quantize.QuantStub attribute)