[{"@context":"http:\/\/schema.org\/","@type":"BlogPosting","@id":"https:\/\/wiki.edu.vn\/en\/wiki12\/comparison-of-optical-character-recognition-software\/#BlogPosting","mainEntityOfPage":"https:\/\/wiki.edu.vn\/en\/wiki12\/comparison-of-optical-character-recognition-software\/","headline":"Comparison of optical character recognition software","name":"Comparison of optical character recognition software","description":"From Wikipedia, the free encyclopedia This comparison of optical character recognition software includes: OCR engines, that do the actual character","datePublished":"2019-05-28","dateModified":"2019-05-28","author":{"@type":"Person","@id":"https:\/\/wiki.edu.vn\/en\/wiki12\/author\/lordneo\/#Person","name":"lordneo","url":"https:\/\/wiki.edu.vn\/en\/wiki12\/author\/lordneo\/","image":{"@type":"ImageObject","@id":"https:\/\/secure.gravatar.com\/avatar\/cd810e53c1408c38cc766bc14e7ce26a?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/cd810e53c1408c38cc766bc14e7ce26a?s=96&d=mm&r=g","height":96,"width":96}},"publisher":{"@type":"Organization","name":"Enzyklop\u00e4die","logo":{"@type":"ImageObject","@id":"https:\/\/wiki.edu.vn\/wiki4\/wp-content\/uploads\/2023\/11\/book.png","url":"https:\/\/wiki.edu.vn\/wiki4\/wp-content\/uploads\/2023\/11\/book.png","width":600,"height":60}},"image":{"@type":"ImageObject","@id":"https:\/\/en.wikipedia.org\/wiki\/Special:CentralAutoLogin\/start?type=1x1","url":"https:\/\/en.wikipedia.org\/wiki\/Special:CentralAutoLogin\/start?type=1x1","height":"1","width":"1"},"url":"https:\/\/wiki.edu.vn\/en\/wiki12\/comparison-of-optical-character-recognition-software\/","wordCount":4028,"articleBody":"From Wikipedia, the free encyclopedia. This comparison of optical character recognition software includes: OCR engines, which do the actual character identification; layout analysis software, which divides scanned documents into zones suitable for OCR; graphical interfaces to one or more OCR engines; and software development kits that are 
used to add OCR capabilities to other software (e.g. forms processing applications, document imaging management systems, e-discovery systems, records management solutions)Sortable tableNameFounded yearLatest stable versionRelease yearLicenseOnlineWindowsMac OS XLinuxBSDAndroidiOSProgramming languageSDK?LanguagesFontsOutput FormatsNotesGoogle Drive OCR or Google Cloud Vision2015ProprietaryYesBrowserBrowserBrowserUnknown??UnknownYes200+All fontstextGoogle blog post[1][2]Tesseract19855.2.02022ApacheNoYesYesYesYes??C++, CYes100+[3]Any printed fontText, ALTO, hOCR,[4] PDF, others with different user interfaces[5] or the APICreated by Hewlett-Packard; under further development by Google[6]ABBYY FineReader1989162022ProprietaryYesYesYesNoYesYesYesC\/C++Yes192[7]All fontsDOC, DOCX, XLS, XLSX, PPTX, RTF, PDF, HTML, CSV, TXT, ODT, DjVu, EPUB, FB2[8]ABBYY also supplies SDKs for embedded and mobile devices. Professional, Corporate and Site License Editions for Windows, Express Edition for Mac.[9]E-aksharayan2010YesNoYesNo??14RTF, TXT, BRLAsprise OCR SDK1998152015ProprietaryYesYesYesYesYes??Java, C#,VB.NET, C\/C++\/DelphiYes20+[10]?Plain text, searchable PDF, XML[11]Java, C#, VB.NET, C\/C++\/Delphi SDKs for OCR and Barcode recognition on Windows, Linux, Mac OS X and Unix.[12]AnyDoc Software1989??ProprietaryNoYesNoNoNo??VBScript???Works with structured, semi-structured, and unstructured documents.CuneiForm19961.12011-04-19BSD variantNoYesYesYesYes??C\/C++Yes28Any printed fontHTML, hOCR, native, RTF, TeX, TXT[13]Enterprise-class system, can save text formatting and recognizes complicated tables of any structure Dynamsoft OCR SDK20038.22012ProprietaryYesYesNoNoNo??C\/C++Yes40+[14]?PDF, TXTOmniPage1970s19.22015ProprietaryYesYesYesYesNo??C\/C++, C#[15]Yes125[16]Machine and handprinted fontsDOC\/DOCX XLS\/XLSX PPTX RTF PDF PDF\/A Searchable PDF HTML Text XML ePUB MP3Product of Nuance CommunicationsMicrosoft Office OneNote 
20072011?2007ProprietaryNoYesNoNoNo??????GOCR20000.52[17]2018-10-15GPLYes[18]YesYesYesYes??C?20+?Ocrad?0.26[19]2017-03-31GPLYesNoYesYesYes??C++YesLatin alphabet?Command lineSmartScore199110.5.82015-07ProprietaryNoYesYesNoNo??????For musical scoresMicrosoft Office Document Imaging?Office 20072007ProprietaryNoYesNoNoNo??????Uses OmniPage[citation needed]Puma.NET??2009-10-29BSDNoYesNoNoNo??C#Yes28Any printed font.NET OCR SDK based on Cognitive Technologies’ CuneiForm recognition engine. Wraps Puma COM server and provides simplified API for .NET applicationsReadSoft???ProprietaryNoYesNoNoNo??????Scan, capture and classify business documents such as invoices, forms and purchase orders integrated with business processes.Scantron???ProprietaryNoYesNoNoNo??????For working with localized interfaces, corresponding language support is required.OCRFeeder2009-030.8.32014-12-22GPLNoNoNoYesNo??Python???Features a full user interface and has a command-line tool for automatic operations. Has its own segmentation algorithm but uses system-wide OCR engines like Tesseract or OcradOCRopus20071.3.32017-12-16ApacheNoNoYesYesYes??Python?All languages using Latin script (other languages can be trained)Normal Latin script and Fraktur (other scripts can be trained)TXT, hOCR,[20] PDF[21]Pluggable framework under active development, used for Google Books Evaluation: A 2016 analysis of the accuracy and reliability of the OCR packages Google Docs OCR, Tesseract, ABBYY FineReader, and Transym, using a dataset of 1227 images from 15 different categories, concluded that Google Docs OCR and ABBYY performed better than the 
others.[22]"},{"@context":"http:\/\/schema.org\/","@type":"BreadcrumbList","itemListElement":[{"@type":"ListItem","position":1,"item":{"@id":"https:\/\/wiki.edu.vn\/en\/wiki12\/#breadcrumbitem","name":"Enzyklop\u00e4die"}},{"@type":"ListItem","position":2,"item":{"@id":"https:\/\/wiki.edu.vn\/en\/wiki12\/comparison-of-optical-character-recognition-software\/#breadcrumbitem","name":"Comparison of optical character recognition software"}}]}]