| This class contains some common functions
| for backend lowering and graph cutting
|
| \class A class that does bound shape inference
| given a C2 net. Depending on its type, each op
| has a maximum shape that it accepts. We
| define some initial bound for certain
| dimensions, for example max batch size or max
| sequence lookup size. The inference will
| first infer the input size and then propagate
| the bound shape down the network. For now the
| variable part (bound part) is the first
| dimension of the shape, which usually
| corresponds to the batch size or sequence
| lookup size.
| This struct stores the max bound size for batch
| in the general sense. max_batch_size is the
| upper bound of batch_size.
|
| max_seq_size is the upper bound of length of
| every item in a batch.
|
| Upper bound of length of a batch of items
| should be max_batch_size * max_seq_size.
| \brief Main graph matcher interface.
|
| This class solves a problem of finding
| a matching subgraph, which is specified in
| a text form.
| Each match is a struct of a subgraph and a
| map from the strings used in the query
| to NodeRefs in the subgraph.
|
| Note: the maps are injective but not necessarily
| bijective; if you use the same name
| in the query twice, only one will be mapped.
|
| See getMatches to generate these
| structs.
|
| TORCH_API nom::repr::NNModule convertToNNModule(
|     caffe2::NetDef& net,
|     std::unordered_map<std::string, nom::repr::NNGraph::NodeRef>* blobMapOut = nullptr);
|
| TORCH_API caffe2::NetDef convertToOnnxProto(nom::repr::NNModule&);
|
| TORCH_API std::unique_ptr<nom::repr::NeuralNetOperator>
| convertToOperatorDef(caffe2::OperatorDef op);
|
| This file sets up the optimization pass
| registry.
|
| You’ll want to either create a class
| that inherits from OptimizationPass
| and implements run, or use the
| REGISTER_OPT_PASS_FROM_FUNC(name, func)
| macro to register a function that takes
| in an NNModule*.
|
| If you need access to the workspace in
| the optimization you’ll need to use
| a different registry and inherit from
| WorkspaceOptimizationPass.
|
| Provides slicing info for the outputs.
| All the vector members should be of the
| same size as number of outputs of the
| Onnxifi op.
|
| @note
|
| subgraph always starts with ops and
| ends with tensors, except for the very
| first group, which can be all tensors
|
| Helpers for the convertToNNModule for use if
| you already have an NNModule.
|
| You probably don’t want to use these if you
| can use convertToNNModule instead.
| In-place modify TensorBoundShape
| to change shape size based on type
|
| Helper function for convertToNQLString
| function.
|
| It takes a list of nodes and returns a map
| node->unique_name. The new names are based on
| the existing ones, but are also unique.
| Construct a ShapeInfo instance from
| TensorShape and constructed dimType.
|
| The default first dimension of dimType is BATCH.
| Reason:
|
| We treat the first dimension of hinted shapes as
| BATCH.
|
| If there are shape hints on blobs in the
| workspace, they are already inserted as
| CONSTANT and will take effect here.
|
| For SEQ typed tensors, there are only a few of
| them and they will be handled by
| BoundShapeInferencer.
| Pass in an oldNet to copy all the attributes
| of that network.
|
| Be warned that transformations that modify the
| graph’s inputs or outputs are not reflected in
| changes to external_input or external_output.
| Use these functions instead of the registry
| directly.
|
| \brief Return a string representing the given
| graph \param g.
|
| The returned string is a valid NQL query.
| Explore the graph in topological order
| until we hit stopping nodes. This is
| based on Kahn’s algorithm:
|
| https://en.wikipedia.org/wiki/Topological_sorting#Kahn's_algorithm
|
| Precondition: nodes in current_frontier
| must satisfy in_degree == 0
|
| Extract shape info from tensorBoundShapes to
| a ShapeInfoMap.
|
| Change shape according to new max_batch_size
| and max_feature_len at the same time if
| necessary.
| Transform normal fp32 operators to
| fakefp16 operators.
|
| We have a variant of 2-input Int8Quantize and
| 4-input Int8FC where the last input points to
| a blob which contains the y_scale and
| y_zero_point.
|
| It originated from an online snapshot update but
| is creating complications for the onnxifi flow.
|
| Hence this pass is just to absorb the
| quantization params into the op itself and
| remove the last input.
| Generic activation fusion helper.
|
| -----------
| @param OperationT
|
| The operator to be fused.
| -----------
| @param ActivationT
|
| The activation to be fused.
| -----------
| @param nn
|
| Neural network module to be modified
| in place
| -----------
| @param should_fuse
|
| Given a conv op, check whether we want
| to fuse it with the subsequent relu or not
| -----------
| @param postprocess
|
| Functor to postprocess the conv node,
| attaching additional attributes if
| necessary
|
| -----------
| @brief
|
| This fuses Cast -> BatchOneHot -> Cast
| into a single call.
|
$$ X_{bn} = \frac{s(X - m)}{\sqrt{\sigma + \epsilon}} + b_{bn} $$
$$ X_{conv} = X * W + b_{conv} $$
thus, substituting $X$ with $X_{conv}$ in the BN equation we get:
$$ X_{bn} = X * \frac{sW}{\sqrt{\sigma + \epsilon}} + \frac{s(b_{conv} - m)}{\sqrt{\sigma + \epsilon}} + b_{bn} $$
or
$$ W' = W\frac{s}{\sqrt{\sigma + \epsilon}} $$
$$ b' = (b_{conv} - m)\frac{s}{\sqrt{\sigma + \epsilon}} + b_{bn} $$
| -----------
| @brief
|
| Create tensor-nodes in \param graph
| with names specified in \param names.
|
| -----------
| @return
|
| a name->NodeRef map.
|
| Mapping from fp32 ops to fakefp16 ops
| Helper function for convertToNQLString
| function.
|
| Given a node and a renameMap return the unique
| name for this node.
| \brief Return a short string name for the
| given \param node.
|
| The function works with both tensors and
| operators.
| Helper function for convertToNQLString
| function.
|
| Given a node and a renameMap return a string
| representing the node, which looks something
| like:
|
| %a = Op(%b, %c, %d)
| If the annotation doesn’t exist, attempt
| to add it
|
| Generates ShapeInfo from Blob.
| Given a net, with primary inputs and
| outputs defined in its external_inputs/outputs,
| and given the set of weights and extra
| weights (created during conversion
| to ONNX, if any), we check whether
| some of the weights are used in the net,
| and if so, we put them in the initialization_list
| and add them to the external_inputs too.
|
| -----------
| @param net
|
| [in] c2 net (cut off from a bigger net)
| -----------
| @param weights_in_ws
|
| [in] all the weights in the workspace
| -----------
| @param initialization_list
|
| [out] weights that need
| to be offloaded to the backend
| -----------
| @param total_inputs_vec
|
| [out] total #inputs of the net that don’t
| have a producer
|
| In-place modify TensorShape’s shape
| at a specific dimension
|
| Onnxifi transformation on the net and
| workspace. We also need the input
| data/shape to populate the shapes. In addition,
| we take a \p blocklist to control and mask
| what ops we want to consider in the onnxifi
| process. We can also set whether to use the ONNX
| proto or C2 proto through the ONNXIFI interface.
|
| The list is in the form of “0-3,5,6-7”,
| which means we will blocklist ops with
| net positions in [0,1,2,3,5,6,7]
|
| Split SparseLengthsSumSparse into
|
| SparseLengthsSumSparseLookup + SparseLengthsSum
|
| Convert ShapeInfo map to TensorShape map
| Check precedence between two vectors of
| TensorBoundShape::DimType.
|
| If it returns 1: right takes precedence over left.
| If it returns -1: left takes precedence over right.
| If it returns 0: no precedence between left and right.
| Helper function to clean up a net and run tvm transform.
| Wrap Quantized TensorShape into QTensorProto
| Wrap TensorShape into TensorProto