Navigation Content

OCR SDK, Optical Text Recognition, and OCR for Developers

OmniPage Capture SDK

Established as the core technology behind all Nuance imaging products, OmniPage Capture SDK is widely recognized as the #1 imaging and OCR SDK toolkit on the market today.

OmniPage Capture SDK gives you everything you need to add robust imaging, OCR recognition, and PDF capabilities into your most critical applications as well as barcode recognition technology, intelligent character recognition, zonal recognition, and more.

Used by commercial vendors who are serious about high OCR accuracy and quality document imaging in their applications, the OmniPage Capture SDK provides scalable recognition, extraordinary PDF support, and a simple API that lets you create high-value, competitive products.

The OmniPage Capture SDK is available for Windows, Linux and Mac platforms. 

To learn more about the benefits of OCR integration and OCR for your developers, request your free OmniPage Capture SDK 19 evaluation today!

Request Free Evaluation



OCR for Developers: Benefits of OmniPage

  • The world’s most accurate OCR solution increases productivity, lowers costs, and maximizes ROI
  • Delivers everything you need for scanning, OCR, ICR, OMR, barcode, PDF, and document conversion
  • Enables developers to provide higher value to customers with new and enhanced functionality
  • Provides an easy-to-use API to shorten development cycles and accelerates time to market
  • Helps your organization differentiate itself from the competition with the most advanced scanning, OCR, and PDF technologies


What’s New in Version 19?

The OmniPage Capture Software Development Kit (SDK) has always been the gold standard for developers who want to add sophisticated optical character recognition (OCR), imaging, and PDF creation and conversion capabilities to their own applications quickly and easily. 

And with the release of the OmniPage Capture SDK 19, the best just got even better. 


Recognition: Forms Processing Made Easy; Enhanced Recognition Engines

  • Major enhancements to form-processing technologies, including the new Form Template Editor*
  • New template matching and data-extraction functions accelerate development efforts
  • Newly integrated Thai and Arabic OCR engines
  • Significant Asian accuracy improvements**:

- Character accuracy increased by up to 40%
- Layout accuracy increased by 45% for all Asian languages

  • Western language layout retention and document-formatting improvements: Significant enhancements to vertical text detection and recognition
  • Support for 17 new 1D barcodes and 2 new 2D barcodes, including QR and DataMatrix; 


Image Processing: Convert All the Data -- Even What You Can Hardly See

  • Camera and smartphone image-handling improvements

- Obtaining camera flag from EXIF information
- Automatic resolution (DPI) estimation
- Shading correction
- New binarization method
- Modified workflow for camera images for better OCR

  • JBIG2 and MRC compression improvements
  • Support for 32-bit bitmap input images
  • More robust PDF input – intelligently detect text and/or image in PDF and preserve any text in the PDF
  • Inclusion of the Scanning Enhancement Tools (SET), a convenient tool set for automatically enhancing the scanning quality*


Output: Work with the Format You Prefer

  • Support for PDF/A-1a, PDF/A-2a, PDF/A-3a, b, u (in addition to existing PDF A/2-b and PDF A/2-u support)  to improve compliance with government and industry standards
  • Support for Google Docs and Apple Pages
  • PDF file splitter to split OCR output files with maximum file size or page number*
  • Support for ePub output format to enhance the reading experience on eBook readers*
  • Support for MP3 audio output format with natural-sounding speech*
  • Formatted output is now supported on Linux and Mac platforms as well (DOCX, XLSX, HTML)


Improved Developer Experience: Faster, Easier, and More Intuitive

  • Many powerful user and developer experience enhancements:

- An intuitive and easy-to-use Distribution Wizard*
- More convenient and reliable license control
- Streamlined installation
- Consolidated and clearer documentation
- More convenient and productive API and settings
- WIA2 support for scanning
- Updated and improved licensing document
- Stability improvement

  • Native 64-bit binaries**

* Available in Windows version only.

** Also available in OmniPage Capture SDK v18.5 and higher.


Key Features for the OmniPage Capture SDK 19

The OmniPage Capture SDK offers a robust feature set to support all your document imaging needs. You get the power and accuracy of OmniPage - the most popular OCR software in the world - integrated into your applications, along with top-of-the line OCR engines and extensive PDF capabilities.

The strength of OmniPage Capture SDK extends beyond unrivaled accuracy, with additional features to streamline application development and provide added value to your product.

The most accurate and robust OCR available
OmniPage provides a scalable voting interface and significant throughput management capabilities. Combined with highly accurate machine-print OCR (OCR, OCR-A*, OCR-B* and MICR*), handprint (ICR), checkmark (OMR) and barcode (1D and 2D) recognition engines, the OmniPage Capture SDK delivers unmatched flexibility and accuracy.

Asian, Thai and Arabic OCR
The OmniPage Capture SDK Asian OCR module supports Simplified and Traditional Chinese, Japanese, and Korean. It can be used either as a standalone module or with the Western language kit. Thai and Arabic OCR modules are available as add-ons.

Support for the .NET Object Oriented Programming*
OmniPage Capture SDK 19 fully supports object oriented programming in .NET, C# and VB.NET. Sample recognition programs and sample viewers are included.

Multi-core and multi-thread processor support*
Better multi-threading and parallel processing on multi-page documents in the OmniPage Capture SDK let you exploit the full potential of your processing environment. In multi-page mode, OmniPage Capture SDK 19 runs faster than previous versions on a quad-core machine.

Pre-made user interfaces*
The OmniPage Capture SDK’s Professional Visual Toolbox gives you pre-made interfaces for creating and executing workflows, controlling scanning devices, and document processing. It includes visual controls for advanced OCR and image enhancement tools. Use this module to create OmniPage-compatible workflows and monitor their execution.

Workflow development and execution*
You can easily create complex image processing and OCR tasks and manage all parameters and settings. Then, adding OCR to your application can be just one workflow execution call. Workflow features also help balance the load on dual core and hyper-threaded systems to boost performance.

Logical Form Recognition technology and Form Template Editor
Our advanced logical form recognition (LFR) automates form template creation and streamlines form processing, providing significant savings in development time. The standalone Form Template Editor* helps both developers and end users to easily create, modify, test, and manage form templates.

Throughput management
Updated throughput capabilities provide significant advantages over other SDKs, allowing you to deploy optimal document imaging throughput for your application.

Integrated PDF toolkit
Extensive PDF capabilities - including unique PDF overlay matching that achieves near-100% accuracy in PDF conversion- allow you to significantly reduce development costs and time-to-market. The OmniPage Capture SDK also supports output to the PDF/archive (PDF/A) format and generates multi-raster-content PDFs optimized for file size and quality.

Format support
The OmniPage Capture SDK supports a wide range of image and application format, including BMP, GIF, TIF, PDF, HTML, Microsoft Office formats, XML, ePub* and more. These provide significant advantages over other SDKs, allowing you to achieve optimal image throughput for your applications.

Text-to-speech (TTS)*
The OmniPage Capture SDK is also the only OCR SDK that includes text-to-speech technologies. You applications can turn paper and digital documents into human-sounding audio files. Not only is this an important way to provide document support for disabled users, it allows everyone to save documents to files that can be played on personal computers and mobile devices, including Apple iPod.

These advanced features, along with breakthrough PDF capabilities that achieve up to 100% word accuracy in converting text-based PDF documents, enable you to significantly reduce the cost of development and time-to-market. That’s why the OmniPage Capture SDK is the most powerful and complete document-imaging SDK in the world.


* Available in Windows version only.

Tech Specs

For Windows

The functionalities of OmniPage Capture SDK can be accessed through C/C++ API, .NET API, or ActiveX interface. Support for  application development on Windows XP SP3 and above enables you to easily create applications with a wide variety of recognition technologies using a single set of developer tools.


Developer System Requirements

  • Windows XP SP3, Vista SP2 x86-x64, Windows 7 SP1 x86-x64, and Windows 8 x86-x64
  • Windows Server 2003 x86-x64, Windows Server 2008 x86-x64/R2, Windows Server 2012 x86-x64/R2
  • Intel or AMD 32-bit or 64-bit CPUs
  • Microsoft Visual C/C++ version .NET 2003/7.1, .NET 2005/8.0, 2008, 2010, Visual Studio 2012
  • Microsoft Visual Basic .NET   


Runtime System Requirements

  • Windows XP SP3, Vista SP2 x86-x64, Windows 7 SP1 x86-x64, Windows 8 x86-x64
  • Windows Server 2003 x86-x64, Windows Server 2008 x86-x64/R2, Windows Server 2012 x86-x64/R2
  • Intel or AMD 32-bit or 64-bit CPUs


For Linux

System Requirements

  • Intel or AMD 64-bit CPUs
  • Tested operating systems:
    • Fedora 20, 21
    • Debian 7.5, 7.7 and 8.1
    • Oracle Linux 6.5, 7.0
    • CentOS 6.3


For Mac

System Requirements

  • Intel 32-bit or 64-bit CPUs
  • OS X 10.8 or higher



Product Architecture

  • The OmniPage Capture SDK architecture accommodates multiple image-processing technologies through four main subsystems:
    • An image input subsystem for scanning* or importing images.
    • An image preprocessing subsystem for improving image quality prior to recognition.
    • A recognition subsystem that provides multiple recognition technologies for image processing.
    • An export subsystem to format the output from multiple recognition modules into a common format for conversion to popular word processing formats or text. 


Two programming interfaces are available with the OmniPage Capture SDK:

The C/C++ API provides control over image input, image preprocessing, recognition, and output and supports image processing on a page basis.

An ActiveX interface is provided for Visual C++ programmers. This interface includes all functionality of the C interface and offers document-processing capabilities so you can create solutions that manage documents more efficiently. This interface also expands the support of modern development environments, including managed environments like VB.NET or C#.

Professional Visual Toolbox*
In conjunction with the ActiveX interface, a set of controls, collectively called the Professional Visual Toolbox, is available as an add-on module. Pre-made controls allow you to reduce development time and speed time-to-market by allowing plug-in interfaces for your application.

    Pre-made Controls

    • Image viewing
    • Zone content validation
    • Image thumbnail viewing
    • Text verification and editing
    • Display statistical information and a draft of the document
    • Provide details and progress about the workflow being executed on the system
    • Create OmniPage-compatible workflows
    • Access and change output converter settings
    • Display and edit form fields and attributes


Image Input
The image input subsystem provides TWAIN, WIA, and ISIS scanner* and image-conversion interfaces. Both color and grayscale images can be handled by the OmniPage Capture SDK and you can send images from memory to the preprocessing and recognition processes. Input conversion for TIFF, TIFF/JPEG, TIFF-FX, PCX, DCX, BMP, ADF, JPEG, PNG and PDF image formats are available.

Image Pre-Processing
Image correction and pre-processing greatly enhances image quality and recognition accuracy. These capabilities include:

  • Rotate (90, 180, 270 degrees)
  • De-skew (auto and programmed)
  • Invert (auto and programmed)
  • De-speckle
  • Resolution enhancement

An interface for integrating additional image pre-processing technologies is also available and extends the system's functionality by permitting customization of the system's image processing capabilities.

Recognition Module Management
The OmniPage Capture SDK's component manager supports the integration of 12 individual recognition modules into your application. Modules for machine print OCR, ICR (handprint OCR), barcode, OMR (checkbox), OCR-A*, OCR-B* and E-13B (MICR)* are provided.

Asian OCR software is supported in the OmniPage Capture SDK, including Simplified and Traditional Chinese, Japanese, and Korean with full layout retention. Thai and Arabic languages are supported with Direct TXT output.

Output Processing
The OmniPage Capture SDK's output processing subsystem takes output from the recognition modules and converts it into a desired format, including TXT, XML, PDF, DOCX, XLSX, PPTX*, HTML, and many more. PDF output is supported in formats including:

  • PDF normal (text, image, and graphics)*
  • Image only
  • Searchable PDF (image on text)
  • Normal with image substitutes*
  • PDF 1.4 - 1.7
  • All conformations of PDF/A

* Available in Windows version only.


Product Configurations

Product Configurations

The OmniPage Capture SDK is available in three configurations with optional add-ons:


The Professional Recognition Kit

  • C/C++ Libraries
  • Two premade voting OCR (machine print) recognition modules
  • Access to individual OCR engines for application optimization
  • OCR-A, OCR-B, E-13B (MICR)
  • ICR (hand-printed character recognition)
  • OMR (checkbox recognition)
  • Barcode recognition


The Professional OCR Kit

  • C/C++ Libraries
  • Two premade voting OCR (machine print) recognition modules
  • Access to individual OCR engines for application optimization
  • OCR-A, OCR-B, E-12B (MICR)

Asian OCR Kit - This kit provides support for Japanese, Traditional and Simplified Chinese, Japanese and Korean OCR software with full layout retention and searchable PDF output.


Included Application Tool

Form Template Editor* - Improves form template creation, modification, testing and management.


Add-On Options

  • PDF Output Module - Adds support for PDF 1.7, PDF/A, export to PDF Normal*, PDF Image-only, PDF Image on Text formats, and high-compression rate PDF-MRC.
  • Professional Toolbox* - Provides a collection of visual controls to create and customize UI elements for Windows-based applications, including image display, manual zoning, and OCR proofreading tools.
  • Thai OCR Module - An add-on to Professional OCR or Professional Recognition kit for including Thai OCR engine in the application.
  • Arabic OCR Module - An add-on to Professional OCR or Professional Recognition kit for including Arabic OCR in the application.
* Available in Windows version only


Product Evaluation
Product Information
Product Videos

Form Template Editor

User Documentation

Licensing Tool

OmniPage Capture SDK

Choose your country.