› All Posts › OCR Software made by Abanoub Hanna - Software Developer

OCR Software made by Abanoub Hanna - Software Developer

Nov 17, 2023 · 494 words · 3 minute read

My currently available software tools ready to be used 🔗

img2txt ocr app for Android

A New Project for OCR on Android 🔗

The new project goals, including done and to-be-done, are:

Old Codebase
- use onActivityResult (modern code)
- use general/universal openUrl() method/function
- better way of cropping images
- share image into IMG2TXT app to extract text from it
- translations (English & Arabic), default is English
- About Me
- link to Google Play
v2.5.1
- first opensource version release is v2.5.1 on April 21st, 2024
- support Android 6 (Marshmallow) (SDK 23)
- support Android 7
- support Android 8
- support Android 9
- support Android 10
- support Android 11
- support Android 12
- support Android 13
v2.6.0
- support Android 14 (SDK 34)
- show thresholded/cleaned image (created by Tesseract)
v2.7.0
- release v2.7.0 on May 29th, 2024
- show bounding rectangles/boxes around each recognized word
v2.8.0
- image to text OCR app release v2.8.0 on May 31st, 2024 with enhanced user experience and bug fixes.
- simplify app UI layout
- new simpler theme
- show alert/notice if the mean_confidence of result text is less than 60%
- on-device Tesseract OCR (English & Arabic models)
- on-device Google vision API (latin scripted languages)
- on-device ML Kit (latin scripted languages)
- in case of Arabic language or both (Arabic+English), use Tesseract OCR
- fallback strategy in case of English language is ML Kit -> Google Vision -> TesseractOCR
- color-coded confidence/accuracy of the result text from ML Kit & Tesseract OCR
- prompt the app user to choose the language of text on the image before processing it
- customize crop-screen
- make result/extracted text editable
- add line breaks (newline) in extracted text (ML Kit & Tesseract OCR)
v2.9.0
- support Android 15 (SDK 35)
- save OCR history (aka : Detailed scanned images history)
v2.10.0
- fix: make sure OCR history is saved in local db
- feature: add reward ad (to collect points)
- ui: add coins count
- add 3 points every day
- 1 image scan consumes 1 point
- watch reward ads to gain points
- save points in sharedPreferences
- upgrade Tesseract4Android lib
Next Version
- rewrite app in Jetpack Compose
- batch processing (in bulk)
- PDF -> Images.foreach(ocr)
- expose more functions to Java : cpp files in tesseract4android/src/main/cpp/tesseract and java files in tesseract4android/src/main/java/com/googlecode/tesseract/android must be modified/added.
- support Hindi ( Indian language )
- support Farsi ( Persian language )
- save extracted text as PDF
- choose more than one image to OCR
- pre-process image with thresholding for more clarity and better results/accuracy/extracted text
- Ability to edit image before/after running OCR on it (manual)
- highest accuracy: +99% accurate on random real-world data.

The new OCR project app will be on Google Play. check it out here → OCR Text Scanner

A web app to OCR images and PDF 🔗

A new OCR text scanner SaaS (software as a service).

next
- supported languages
  - English
  - Arabic
  - Farsi (Persian)
  - Hindi (Indian)
- supported file formats
  - JPG / JPEG
  - PNG
  - PDF (monopage, multipage)
  - WEBP
  - HEIC
  - AVIF
- supported sources
  - file upload
  - web url
- color coded accuracy
- tasks queue with priority (free & premium)
- dedicated page to show status of all functionalities/features accuracy/status

Translations: العربية (كل برامج استخلاص النصوص من على الصور التي كتبها أبانوب حَنّا)