OpenCV4.5에서 향상된 텍스트 감지에 대해 조사 중

OpenCV4.5 의 릴리스 노트 에는 , 문자 인식이 개량된 것이 써 있다.

게다가 릴리스 노트를 따라가면, 이 개변의 PR에 도착할 수 있다.
htps : // 기주 b. 코 m/오펜 cv/오펜 cv/푸 l/17675

최근에는 이 PR의 설명이 자세하고 어떻게 추가·변경 기능이 좋아졌는지를 알기 쉽게 정리하고 있다.

영숫자에 대해서도 문자 인식은 지금까지도 많이 진행되어 왔지만, 필기 문자에 대해서도 성능이 향상되고 있다.

문자 인식은 다음 두 단계로 구성됩니다.
- 문자가 찍힌 영역을 감지합니다. (text detection)
- 해당 영역에 나타나는 문자 중 하나의 문자를 식별합니다. (text recognition)
- (또한, 단독의 문자 인식으로 구별하기 어려운 문자를, 사전을 사용해 특정한다.)

이 OpenCV의 구현은 위의 PR을 읽으면 알 수 있듯이 문자를 확정하기 전까지이다.

$ cd opencv/samples/dnn
$ opencv/samples/dnn$ python text_detection.py -h
usage: text_detection.py [-h] [--input INPUT] --model MODEL [--ocr OCR]
                         [--width WIDTH] [--height HEIGHT] [--thr THR]
                         [--nms NMS]

Use this script to run TensorFlow implementation
(https://github.com/argman/EAST) of EAST: An Efficient and Accurate Scene Text
Detector (https://arxiv.org/abs/1704.03155v2)The OCR model can be obtained
from converting the pretrained CRNN model to .onnx format from the github
repository https://github.com/meijieru/crnn.pytorchOr you can download trained
OCR model directly from https://drive.google.com/drive/folders/1cTbQ3nuZG-
EKWak6emD_s8_hHXWz7lAr?usp=sharing

optional arguments:
  -h, --help            show this help message and exit
  --input INPUT         Path to input image or video file. Skip this argument
                        to capture frames from a camera.
  --model MODEL, -m MODEL
                        Path to a binary .pb file contains trained detector
                        network.
  --ocr OCR             Path to a binary .pb or .onnx file contains trained
                        recognition network
  --width WIDTH         Preprocess input image by resizing to a specific
                        width. It should be multiple by 32.
  --height HEIGHT       Preprocess input image by resizing to a specific
                        height. It should be multiple by 32.
  --thr THR             Confidence threshold.
  --nms NMS             Non-maximum suppression threshold.

영문으로 좋으면 다음 해설 기사가 있다.

OpenCV Text Detection (EAST text detector)

qiita OpenCV를 Python으로 문자 위치를 영수증에서 가져옵니다.

OpenVino의 경우

필기 일본어를 인식하고 있습니다!

Reference

이 문제에 관하여(OpenCV4.5에서 향상된 텍스트 감지에 대해 조사 중), 우리는 이곳에서 더 많은 자료를 발견하고 링크를 클릭하여 보았다 https://qiita.com/nonbiri15/items/a4151bfcda4e5ce43335

텍스트를 자유롭게 공유하거나 복사할 수 있습니다.하지만 이 문서의 URL은 참조 URL로 남겨 두십시오.

우수한 개발자 콘텐츠 발견에 전념 (Collection and Share based on the CC Protocol.)

좋은 웹페이지 즐겨찾기

개발자 우수 사이트 수집

개발자가 알아야 할 필수 사이트 100선 추천 우리는 당신을 위해 100개의 자주 사용하는 개발자 학습 사이트를 정리했습니다