Monday, 19 December 2011

Document Analysis

Document Analysis is a discipline that combines image analysis and pattern recognition techniques to process and extract information from documents from different sources. Sources include either raster formats, after scanning paper-based documents, or electronic formats such as ps, html, pdf, etc. Document Analysis consists of three major research subfields: paper layout analysis, optical character recognition and graphics recognition.


Document Analysis is a discipline that combines image analysis and pattern recognition techniques to process and extract information from documents from different sources. Sources include either raster formats, after scanning paper-based documents, or electronic formats such as ps, html, pdf, etc. Document Analysis consists of three major research subfields: paper layout analysis, optical character recognition and graphics recognition. The Document Analysis Group of the CVC has research and development experience in the following concerns: symbol recognition, indexing and browsing by graphical content, sketchy interfaces, diagrammatic reasoning and visual languages for graphic documents, graphics recognition architectures, reading systems for forms and structured documents, camera-based OCR, fingerprint recognition.


Source: Computer Vision Center