Intelligent Document Scanner

Devi Devapal; Aravind M. P.; Athul P.; Pranav S. Chandran; Vishak K. K.

Intelligent Document Scanner

Authors

Devi Devapal
Aravind M. P.
Athul P.
Pranav S. Chandran
Vishak K. K.

Abstract

Abstract
There is an irresistible trend in the present world for scanning the paper documents and convert it into a digitalized format. Most of these scanning treat the whole document as an entire image. In this scenario, we propose a novel “Intelligent Document Scanner” which automatically segment and classify the contents of the image document including texts, tables and images and store it as a PDF document with three sections that include all the above said contents that were extracted from the image. Segmentation consists of three steps which
consist of object extraction, object clustering and object filtering. Before object extraction, Mean-Shift filtering is performed for smoothening the document. To perform object clustering K-means algorithm is employed and to find the relationship between kernels, we employed Kernel Propagation algorithm. Classification includes seed point calculation and categorization of contents which is based on weighted priority. The categorized results are then stored as PDF document.

Keywords: Mean-Shift filtering, Object extraction, Object Clustering, K-means, Object Filter, Kernel Propagation, Seed Point, Categorization, Weight priority

Cite this Article
Devi Devapal, Aravind MP, Athul P et al. Intelligent Document Scanner. Journal of Computer Technology & Applications. 2016; 7(2): 47–57p.

Downloads

Requires Subscription PDF

Published

2019-07-16

Issue

Vol. 7 No. 2 (2016)

Section

Articles

License

Declaration and Copyright Transfer Form

(to be completed by authors)

I/ We, the undersigned author(s) of the submitted manuscript, hereby declare, that the above manuscript which is submitted for publication in the STM Journals(s), is not published already in part or whole (except in the form of abstract) in any journal or magazine for private or public circulation, and, is not under consideration of publication elsewhere.

I/We will not withdraw the manuscript after 1 week of submission as I have read the Author Guidelines and will adhere to the guidelines.
I/We Author(s ) have niether given nor will give this manuscript elsewhere for publishing after submitting in STM Journal(s).
I/ We have read the original version of the manuscript and am/ are responsible for the thought contents embodied in it. The work dealt in the manuscript is my/ our own, and my/ our individual contribution to this work is significant enough to qualify for authorship.
I/We also agree to the authorship of the article in the following order:

Author’s name

1. ________________

2. ________________

3. ________________

4. ________________

We Author(s) tick this box and would request you to consider it as our signature as we agree to the terms of this Copyright Notice, which will apply to this submission if and when it is published by this journal.

Intelligent Document Scanner

Authors

Abstract

Downloads

Published

Issue

Section

License

Developed By

Subscription

Language

Information