【个人开源】论文复现SRN：Towards Accurate Scene Text Recognition with Semantic Reasoning Networks

日期：2020-05-14 浏览：273 评论：0

核心提示：Towards Accurate Scene Text Recognition with Semantic Reasoning Networkscodehttps://github.com/chenjun2hao/SRN.pytorchUnofficial PyTorch implementation of the paper, which integrates not only globa...人工智能

Towards Accurate Scene Text Recognition with Semantic Reasoning Networks

Code:https://github.com/chenjun2hao/SRN.pytorch

Unofficial PyTorch implementation of the paper, which integrates not only global semantic reasoning module but also parallel visual attention module and visual-semantic fusion decoder.the semanti reasoning network(SRN) can be trained end-to-end.

At present, the accuracy of the paper cannot be achieved. And i borrowed code from deep-text-recognition-benchmark

model

result

IIIT5k_3000	SVT	IC03_860	IC03_867	IC13_857	IC13_1015	IC15_1811	IC15_2077	SVTP	CUTE80
84.600	83.617	92.907	92.849	90.315	88.177	71.010	68.064	71.008	68.641

total_accuracy: 80.597

Feature

predict the character at once time
DistributedDataParallel training

Requirements

Pytorch >= 1.1.0

Test

download the evaluation data from deep-text-recognition-benchmark
download the pretrained model from Baidu, Password: d2qn
test on the evaluation data

python test.py --eval_data path-to-data --saved_model path-to-model

Train

download the training data from deep-text-recognition-benchmark
training from scratch

python train.py --train_data path-to-train-data --valid-data path-to-valid-data

Reference

bert_ocr.pytorch
deep-text-recognition-benchmark
2D Attentional Irregular Scene Text Recognizer
Towards Accurate Scene Text Recognition with Semantic Reasoning Networks

difference with the origin paper

use resnet for 1D feature not resnetFpn 2D feature
use add not gated unit for visual-semanti fusion decoder

other

It is difficult to achieve the accuracy of the paper, hope more people to try and share

打赏

所有权利归属于原作者，如文章来源标示错误或侵犯了您的权利请联系微信13520258486

更多>最近资讯中心

更多>最新资讯中心

0 条相关评论

• 机智云接入教程（基于FreeRTOS）	• uni-app 中英文切换
• Error: Unsupported server version: '5.7.26-l	• 基于嵌入式平台与深度学习的智能气象监测仪器设
• 物联网传感技术——压阻式传感器	• MAX4173笔记

• Esp8266天猫精灵_RGB灯_非点灯平台	• STM32F103 串口1和串口3对发数据配合蓝牙模块
• TMS570学习【1】了解什么是TMS570	• 新闻稿 \| Qt公司收购froglogic公司以巩固市场领
• [Java]SpringBoot2整合mqtt服务器EMQ实现消息订	• 苹果群控投屏同步操作原理及运用的平台APP分享

• Esp8266天猫精灵_RGB灯_非点灯平台	• STM32F103 串口1和串口3对发数据配合蓝牙模块
• TMS570学习【1】了解什么是TMS570	• 新闻稿 \| Qt公司收购froglogic公司以巩固市场领
• [Java]SpringBoot2整合mqtt服务器EMQ实现消息订	• 苹果群控投屏同步操作原理及运用的平台APP分享
• STM32查询式按键输入[直接用寄存器]	• Ubuntu系统 USB设备端口绑定
• 2021-04-14 第四次按键输入实验	• Flutter扫码功能完美实现