Products tagged with 'Scikit-learn'

Scikit-learn

Some case study

Picture of Reading Insurance Card

Reading Insurance Card

Reading information from Insurance Cards is a problem of OCR (Optical Character Recognition) for Japanese Words. In order to read the 3 information from Insurance Card, Deep Learning approach is used. In details, we use Tesseract engine to read all information from the card and then employ some combination/ modification features to improve the recognition results. The improvement is basically based on β€œtry and improve” process based on real data that we collected from Internet. β€’ Categorial information (for example, MNIST: 0-9) should follow a categorical distribution: 𝑐1βˆΌπΆπ‘Žπ‘‘ (𝐾=10, 𝑝=0.1) β€’ Shape information (rotation, width) should follow a uniform distribution, for example: 𝑐2, 𝑐3βˆΌπ‘ˆπ‘›π‘–π‘“ (βˆ’1,1)
Picture of Rubber Stamp Removal

Rubber Stamp Removal

The project aims to develop an engine that can automatically detect & remove rubber stamp from scanned/captured document images.. There are many challenging that we have to cope with in this project. For example, no any standards for rubbers so far (e.g. the variety of rubber shapes, colors) or especially dealing with both scanned and captured images are also a big challenging. After trying several methods and considering between two important metrics (accuracy and performance), we finally deployed YOLOv3 for object detection step and making use of K-means scikit-learn and OpenCV for output generation.
top