• Post Reply Bookmark Topic Watch Topic
  • New Topic
programming forums Java Mobile Certification Databases Caching Books Engineering Micro Controllers OS Languages Paradigms IDEs Build Tools Frameworks Application Servers Open Source This Site Careers Other all forums
this forum made possible by our volunteer staff, including ...
  • Campbell Ritchie
  • Paul Clapham
  • Bear Bibeault
  • Liutauras Vilda
  • Devaka Cooray
  • Knute Snortum
  • Junilu Lacar
  • Henry Wong
Saloon Keepers:
  • Ron McLeod
  • Stephan van Hulst
  • Tim Moores
  • Carey Brown
  • Tim Holloway
  • salvin francis
  • Frits Walraven
  • Piet Souris

Recognize and extract text from images & scanned documents

Posts: 8
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
This task sounds like a trivial one, indeed. Mostly because there are many free and open-source tool available that solves such tasks.

However, what if developing a commercial application that you want to sell? The clients can depend on a large set of scanned documents, less popular languages, different fonts, glyphs, etc..
Can you invest the time to debug unknown code and afford broken deadlines!? Patching and fixing by yourself are exhausting. You’ll probably stuck some point in the infinite loop coding-testing-frustrating.

Aspose.OCR is a standalone library introduced to address all mentioned requirements. Firstly, this is the list of supported image file formats: JPG, JPEG, PNG, GIF, and TIFF.
All you have to do is to take a screenshot of the document page, or a part of it, and pass it to our OCR engine.

The code snippet is pretty much straight-forward

No need to be worried regarding platforms. .NET and C++ environments also provide instant and high fidelity text recognition. Enhance your ASP.NET, Windows Forms, WPF, C++ solutions the same as any other Java Framework.

Our development team proactively developing new features and extending the set of supported fonts & languages.
At the moment, the most popular fonts Arial, Times New Roman, Courier New, Verdana, Tahoma, and Calibri are fully supported. Also, the rich text formatting or different styles won’t confuse our product since it is capable to deal with it too.
One noticeable advantage is tuning up the filter sensitivity so you can accurately recognize the text even on the blurred images.

Of course, it is possible you experience some issues/bugs while tuning up the application. To avoid such scenarios, our Free Support will be thrilled to assist you and help you get the desired output.

In case of some urgent tasks, there are Paid Support that can help to prioritize issues you run into. The Paid Support service puts the escalated issues in front of the others to resolve them faster.
This is a recommended service when developing a large system used by many customers.

Therefore, please evaluate your temporary license and enjoy a 1-month trial for free. There are no limitations during this period so you can test all features.
I am going down to the lab. Do NOT let anyone in. Not even this tiny ad:
Java file APIs (DOC, XLS, PDF, and many more)
  • Post Reply Bookmark Topic Watch Topic
  • New Topic
Boost this thread!