NOTE: This topic describes the usage of the C++ implementation of the Image Classification Sample Async. For the Python* implementation, refer to Image Classification Python* Sample Async.
The sample demonstrates how to use the new Infer Request API of the Inference Engine in applications. Refer to Integrate the Inference Engine New Request API with Your Application for details. It shows how to build an inference request and execute it 10 times in asynchronous mode, using classification networks as an example. Asynchronous mode might increase image throughput.
Batch mode is independent of asynchronous mode: asynchronous mode works efficiently with any batch size.
At startup, the sample application reads the command-line parameters and loads the specified network and input images (or a folder with images) to the Inference Engine plugin. The batch size of the network is set to the number of images read.
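The relationship between the number of input images and the batch size can be sketched in plain C++. This is only an illustration under stated assumptions: `Batch` and `MakeBatch` are hypothetical names, not part of the Inference Engine API, and each image is assumed to be already decoded, resized, and flattened to the same number of values.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical container for illustration; not an Inference Engine type.
struct Batch {
    std::size_t batch_size;   // number of images in the batch
    std::vector<float> data;  // image values stored back to back
};

// Packs already-flattened images into one contiguous buffer.
// The batch size is simply the number of images that were read.
Batch MakeBatch(const std::vector<std::vector<float>>& images) {
    Batch batch{images.size(), {}};
    if (images.empty()) {
        return batch;
    }
    const std::size_t image_size = images.front().size();
    batch.data.reserve(batch.batch_size * image_size);
    for (const auto& image : images) {
        // Assumption: every image was resized to the same input shape.
        assert(image.size() == image_size && "all images must have the same size");
        batch.data.insert(batch.data.end(), image.begin(), image.end());
    }
    return batch;
}
```

With three input images, `MakeBatch` yields `batch_size == 3` and a buffer three times the size of one image.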
Then, the sample creates an inference request object and assigns a completion callback to it. Inside the completion callback, the inference request is started again.
After that, the application starts inference for the first infer request and waits for the 10th inference request to complete.
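The callback-driven flow described above (start one request, restart it from its own completion callback, and wait until the 10th execution finishes) can be sketched with the C++ standard library alone. `MockInferRequest` and `RunChainedInferences` are hypothetical stand-ins written for this illustration; they are not the Inference Engine API, and the "inference" step is left empty.

```cpp
#include <cassert>
#include <condition_variable>
#include <cstddef>
#include <functional>
#include <mutex>
#include <thread>
#include <utility>

// Hypothetical stand-in for an infer request; NOT the Inference Engine API.
class MockInferRequest {
public:
    MockInferRequest() : worker_([this] { Run(); }) {}
    ~MockInferRequest() {
        { std::lock_guard<std::mutex> lock(mutex_); stop_ = true; }
        cv_.notify_all();
        worker_.join();
    }
    void SetCompletionCallback(std::function<void()> cb) { callback_ = std::move(cb); }
    // Queues one execution and returns immediately.
    void StartAsync() {
        { std::lock_guard<std::mutex> lock(mutex_); ++pending_; }
        cv_.notify_all();
    }

private:
    void Run() {
        for (;;) {
            {
                std::unique_lock<std::mutex> lock(mutex_);
                cv_.wait(lock, [this] { return stop_ || pending_ > 0; });
                if (stop_) return;
                --pending_;
            }
            // A real request would execute the network here.
            if (callback_) callback_();  // called without holding the lock
        }
    }
    std::mutex mutex_;
    std::condition_variable cv_;
    std::size_t pending_ = 0;
    bool stop_ = false;
    std::function<void()> callback_;
    std::thread worker_;  // declared last so it starts after the other members exist
};

// Chains `total` executions through the completion callback and
// blocks until the last one has finished. Returns the completed count.
std::size_t RunChainedInferences(std::size_t total) {
    if (total == 0) return 0;
    std::mutex done_mutex;
    std::condition_variable done_cv;
    bool done = false;
    std::size_t completed = 0;  // written only by the worker thread

    MockInferRequest request;
    request.SetCompletionCallback([&] {
        ++completed;
        if (completed < total) {
            request.StartAsync();  // re-submit from inside the callback
        } else {
            { std::lock_guard<std::mutex> lock(done_mutex); done = true; }
            done_cv.notify_one();
        }
    });
    request.StartAsync();  // only the first execution is started explicitly

    std::unique_lock<std::mutex> lock(done_mutex);
    done_cv.wait(lock, [&] { return done; });
    assert(completed == total);
    return completed;
}
```

Calling `RunChainedInferences(10)` mirrors the sample's flow: the main thread starts a single request and then only waits, while every later execution is triggered by the previous one's callback.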
When inference is done, the application outputs data to the standard output stream.
NOTE: By default, Inference Engine samples and demos expect input with BGR channel order. If you trained your model to work with RGB order, you need to either manually rearrange the default channel order in the sample or demo application, or reconvert your model using the Model Optimizer tool with the --reverse_input_channels argument specified. For more information about the argument, refer to the When to Reverse Input Channels section of Converting a Model Using General Conversion Parameters.
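Manually rearranging the channel order, as mentioned in the note above, amounts to swapping the first and third channel of every pixel. The following is a minimal sketch, assuming an interleaved 8-bit buffer with exactly three channels per pixel; `SwapRedAndBlue` is a hypothetical helper, not a function from the samples.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <utility>
#include <vector>

// Swaps the first and third channel of every pixel in place, so an
// interleaved B,G,R,B,G,R,... buffer becomes R,G,B,R,G,B,... (and back).
// Assumption: 3 interleaved 8-bit channels per pixel.
void SwapRedAndBlue(std::vector<std::uint8_t>& pixels) {
    assert(pixels.size() % 3 == 0 && "expected 3 interleaved channels per pixel");
    for (std::size_t i = 0; i + 2 < pixels.size(); i += 3) {
        std::swap(pixels[i], pixels[i + 2]);
    }
}
```

Because the operation is its own inverse, the same helper converts RGB data to BGR and BGR data to RGB.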
Running the application with the -h option yields the following usage message:
Running the application with an empty list of options yields the usage message given above and an error message.
NOTE: Before running the sample with a trained model, make sure the model is converted to the Inference Engine format (*.xml + *.bin) using the Model Optimizer tool.
The sample accepts models in ONNX format (.onnx) that do not require preprocessing.
To run inference on an image using a trained AlexNet network on an FPGA with fallback to CPU, use the following command:
By default, the application outputs the top-10 inference results for each infer request.
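Selecting the top-N results from a vector of class scores can be sketched as follows. This is an illustration only: `TopN` is a hypothetical helper, not the routine the sample uses, and it assumes the network output has already been copied into a `std::vector<float>` with one score per class.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <numeric>
#include <vector>

// Returns the indices of the n highest scores, best first.
// If n exceeds the number of classes, all indices are returned.
std::vector<std::size_t> TopN(const std::vector<float>& scores, std::size_t n) {
    std::vector<std::size_t> indices(scores.size());
    std::iota(indices.begin(), indices.end(), std::size_t{0});  // 0, 1, 2, ...
    n = std::min(n, indices.size());
    assert(n <= indices.size());  // guards the iterator arithmetic below
    // Only the first n positions need to be sorted.
    std::partial_sort(indices.begin(),
                      indices.begin() + static_cast<std::ptrdiff_t>(n),
                      indices.end(),
                      [&scores](std::size_t a, std::size_t b) {
                          return scores[a] > scores[b];
                      });
    indices.resize(n);
    return indices;
}
```

For the default behavior described above, the call would be `TopN(scores, 10)`; the returned indices can then be mapped to class labels.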