Convert ONNX* GPT-2 Model to the Intermediate Representation

Public pre-trained GPT-2 model is a large transformer-based language model with a simple objective: predict the next word, given all of the previous words within some text.

Download the Pre-Trained Base GPT-2 Model

To download the model, click Download on

To download the model and sample test data, click Download on

Convert ONNX* GPT-2 Model to IR

To generate the Intermediate Representation (IR) of the model GPT-2, run the Model Optimizer with the following parameters:

python3 --input_model gpt2-10.onnx --input_shape [X,Y,Z]