This is a text spotting model that simultaneously detects and recognizes text. The model detects symbol sequences separated by space and performs recognition without a dictionary. The model is built on top of the Mask-RCNN framework with additional attention-based text recognition head.
Symbols set is alphanumeric:
This model is a fully-convolutional encoder of text recognition head.
|Word spotting hmean ICDAR2015, without a dictionary||59.04%|
Hmean Word spotting is defined and measured according to the Incidental Scene Text (ICDAR2015) challenge.
input , shape: [1x64x28x28]. Text recognition features obtained from detection part.
output, shape: [1x256x64x64]. Encoded text recognition features.
[*] Other names and brands may be claimed as the property of others.