I hope you have a higher-resolution version available; in any case, the right method depends on how much your images vary.
Generally, letters have one specific color and are set against a background of a single other color; note that this characteristic is NOT used in the algorithm below.
I would preprocess the image with median-cut color quantization to get rid of anti-aliasing effects, JPEG artifacts and minor color variations.
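To make the quantization step concrete, here is a minimal median-cut sketch in pure Python (all function names are my own invention; in practice you would use a library routine such as Pillow's `Image.quantize`): repeatedly split the box of pixels with the widest channel range at its median, then map each pixel to the mean color of its box.

```python
# Minimal median-cut color quantization sketch (pure Python, hypothetical
# helper names). Real images would go through a library routine instead;
# this only illustrates how noisy colors collapse to a few clusters.

def median_cut(pixels, n_colors):
    """Quantize a list of (r, g, b) tuples down to n_colors representatives."""
    boxes = [list(pixels)]
    while len(boxes) < n_colors:
        # Pick the box with the widest single-channel range and split it.
        box = max(boxes, key=lambda b: max(
            max(p[c] for p in b) - min(p[c] for p in b) for c in range(3)))
        if len(box) < 2:
            break
        channel = max(range(3), key=lambda c:
                      max(p[c] for p in box) - min(p[c] for p in box))
        box.sort(key=lambda p: p[channel])
        mid = len(box) // 2
        boxes.remove(box)
        boxes += [box[:mid], box[mid:]]
    # Represent each box by its mean color.
    palette = [tuple(sum(p[c] for p in b) // len(b) for c in range(3))
               for b in boxes]
    # Map every pixel to the nearest palette color (squared distance).
    def nearest(p):
        return min(palette, key=lambda q: sum((p[c] - q[c]) ** 2 for c in range(3)))
    return [nearest(p) for p in pixels]

# Slightly noisy near-white and near-black pixels...
pixels = [(250, 250, 250), (245, 248, 252), (10, 5, 8),
          (0, 0, 0), (12, 9, 3), (255, 255, 255)]
quantized = median_cut(pixels, 2)
print(len(set(quantized)))  # -> 2: the variations collapse to two colors
```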
Then:
- Shrink the image by one pixel on each side to remove the black rim.
- Flood-fill from the upper-left corner as the seed, using a color that does not occur in the image (free a color first if all are used).
- Make the image binary: flood-fill color vs. all other colors.
- Label the connected non-flood-fill clusters.
- Loop over those clusters: if the bounding box is 'too' large (in X, Y or area), it is not text. This way you get rid of all the images and the vertical and horizontal lines.
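The steps above can be sketched end-to-end on a toy quantized "image" (pure Python, made-up names, 4-connectivity; thresholds are placeholders you would tune on real pages):

```python
# Toy sketch of the pipeline: flood-fill the page color from a corner,
# treat everything else as foreground, label clusters, filter by box size.
from collections import deque

def flood_fill(img, seed, fill):
    h, w = len(img), len(img[0])
    target = img[seed[0]][seed[1]]
    q = deque([seed])
    while q:
        y, x = q.popleft()
        if 0 <= y < h and 0 <= x < w and img[y][x] == target:
            img[y][x] = fill
            q += [(y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)]

def label_clusters(img, background):
    """Return one bounding box (ymin, xmin, ymax, xmax) per 4-connected cluster."""
    h, w = len(img), len(img[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for y in range(h):
        for x in range(w):
            if img[y][x] != background and not seen[y][x]:
                box, q = [y, x, y, x], deque([(y, x)])
                seen[y][x] = True
                while q:
                    cy, cx = q.popleft()
                    box = [min(box[0], cy), min(box[1], cx),
                           max(box[2], cy), max(box[3], cx)]
                    for ny, nx in ((cy-1, cx), (cy+1, cx), (cy, cx-1), (cy, cx+1)):
                        if 0 <= ny < h and 0 <= nx < w \
                                and img[ny][nx] != background and not seen[ny][nx]:
                            seen[ny][nx] = True
                            q.append((ny, nx))
                boxes.append(tuple(box))
    return boxes

# 1 = page color, 2 = small letter-like blob, 3 = big picture-like block
img = [[1, 1, 1, 1, 1, 1, 1, 1],
       [1, 2, 1, 1, 3, 3, 3, 3],
       [1, 2, 1, 1, 3, 3, 3, 3],
       [1, 1, 1, 1, 3, 3, 3, 3],
       [1, 1, 1, 1, 3, 3, 3, 3]]
FILL = 9                       # a color not used in the image
flood_fill(img, (0, 0), FILL)  # fill the page background from the corner
boxes = label_clusters(img, FILL)   # binarize (FILL vs. rest) + label
text_like = [b for b in boxes       # size filter: 'too' large -> not text
             if (b[2] - b[0] + 1) <= 3 and (b[3] - b[1] + 1) <= 3]
print(len(boxes), len(text_like))   # -> 2 1: the big block is rejected
```

The barcode problem mentioned below shows up here too: each thin bar would pass the size filter on its own.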
Unfortunately, 'TheGuardian' and other text set in blue is also lost, as are the text on the cup, the licence plate and the crossword. And the barcode will be identified as text...
For text detection I recommend the Stroke Width Transform, developed at Microsoft Research [http://research.microsoft.com/apps/pubs/default.aspx?id=149305]. The implementation is not easy, but the results are good. A modification of the same algorithm is described in [http://www.bmva.org/bmvc/2012/BMVC/paper063/index.html]; the main change is an extra step in the edge extraction.
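This is NOT the full Stroke Width Transform (the real algorithm traces rays along edge gradients across the whole image), but its core intuition can be shown in a few lines: text strokes have a nearly constant, small width, while picture regions do not. The sketch below approximates a pixel's stroke width as the shorter of its horizontal and vertical foreground runs; all names and thresholds are my own simplifications.

```python
# Crude approximation of the Stroke Width Transform idea on a binary
# component: width = min(horizontal run, vertical run) per foreground pixel.
# Text-like = widths nearly constant AND much thinner than the glyph box.

def stroke_widths(img):
    h, w = len(img), len(img[0])
    widths = []
    for y in range(h):
        for x in range(w):
            if not img[y][x]:
                continue
            l = x
            while l > 0 and img[y][l - 1]:
                l -= 1
            r = x
            while r < w - 1 and img[y][r + 1]:
                r += 1
            t = y
            while t > 0 and img[t - 1][x]:
                t -= 1
            b = y
            while b < h - 1 and img[b + 1][x]:
                b += 1
            widths.append(min(r - l + 1, b - t + 1))
    return widths

def looks_like_text(img, tolerance=1):
    ws = stroke_widths(img)
    if not ws:
        return False
    nearly_constant = max(ws) - min(ws) <= tolerance
    thin = max(ws) * 2 <= max(len(img), len(img[0]))  # stroke << glyph size
    return nearly_constant and thin

stroke = [[0, 1, 0, 0],   # a thin "I": stroke width 1 everywhere
          [0, 1, 0, 0],
          [0, 1, 0, 0]]
blob = [[1, 1, 1, 1],     # a solid block: "stroke" as wide as the region
        [1, 1, 1, 1],
        [1, 1, 1, 1]]
print(looks_like_text(stroke), looks_like_text(blob))  # -> True False
```

The real SWT additionally uses the gradient direction at each edge pixel to shoot the ray perpendicular to the stroke boundary, which is what makes it robust to rotated and curved text.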
AI methods could also be applied here, for example swarm methods: first segment the text from the picture, then extract the text from the image using an AI-based or genetic-algorithm-based method.