OCR Software made by Abanoub Hanna - Software Developer

ยท 494 words ยท 3 minute read

My currently available software tools ready to be used ๐Ÿ”—

A New Project for OCR on Android ๐Ÿ”—

The new project goals, including done and to-be-done, are:

  • Old Codebase
    • use onActivityResult (modern code)
    • use general/universal openUrl() method/function
    • better way of cropping images
    • share image into IMG2TXT app to extract text from it
    • translations (English & Arabic), default is English
    • About Me
    • link to Google Play
  • v2.5.1
    • first opensource version release is v2.5.1 on April 21st, 2024
    • support Android 6 (Marshmallow) (SDK 23)
    • support Android 7
    • support Android 8
    • support Android 9
    • support Android 10
    • support Android 11
    • support Android 12
    • support Android 13
  • v2.6.0
    • support Android 14 (SDK 34)
    • show thresholded/cleaned image (created by Tesseract)
  • v2.7.0
    • release v2.7.0 on May 29th, 2024
    • show bounding rectangles/boxes around each recognized word
  • v2.8.0
    • image to text OCR app release v2.8.0 on May 31st, 2024 with enhanced user experience and bug fixes.
    • simplify app UI layout
    • new simpler theme
    • show alert/notice if the mean_confidence of result text is less than 60%
    • on-device Tesseract OCR (English & Arabic models)
    • on-device Google vision API (latin scripted languages)
    • on-device ML Kit (latin scripted languages)
    • in case of Arabic language or both (Arabic+English), use Tesseract OCR
    • fallback strategy in case of English language is ML Kit -> Google Vision -> TesseractOCR
    • color-coded confidence/accuracy of the result text from ML Kit & Tesseract OCR
    • prompt the app user to choose the language of text on the image before processing it
    • customize crop-screen
    • make result/extracted text editable
    • add line breaks (newline) in extracted text (ML Kit & Tesseract OCR)
  • v2.9.0
    • support Android 15 (SDK 35)
    • save OCR history (aka : Detailed scanned images history)
  • v2.10.0
    • fix: make sure OCR history is saved in local db
    • feature: add reward ad (to collect points)
    • ui: add coins count
    • add 3 points every day
    • 1 image scan consumes 1 point
    • watch reward ads to gain points
    • save points in sharedPreferences
    • upgrade Tesseract4Android lib
  • Next Version
    • rewrite app in Jetpack Compose
    • batch processing (in bulk)
    • PDF -> Images.foreach(ocr)
    • expose more functions to Java : cpp files in tesseract4android/src/main/cpp/tesseract and java files in tesseract4android/src/main/java/com/googlecode/tesseract/android must be modified/added.
    • support Hindi ( Indian language )
    • support Farsi ( Persian language )
    • save extracted text as PDF
    • choose more than one image to OCR
    • pre-process image with thresholding for more clarity and better results/accuracy/extracted text
    • Ability to edit image before/after running OCR on it (manual)
    • highest accuracy: +99% accurate on random real-world data.

The new OCR project app will be on Google Play. check it out here โ†’ OCR Text Scanner

A web app to OCR images and PDF ๐Ÿ”—

A new OCR text scanner SaaS (software as a service).

  • next
    • supported languages
      • English
      • Arabic
      • Farsi (Persian)
      • Hindi (Indian)
    • supported file formats
      • JPG / JPEG
      • PNG
      • PDF (monopage, multipage)
      • WEBP
      • HEIC
      • AVIF
    • supported sources
      • file upload
      • web url
    • color coded accuracy
    • tasks queue with priority (free & premium)
    • dedicated page to show status of all functionalities/features accuracy/status
Share:
Translations:  ุงู„ุนุฑุจูŠุฉ (ูƒู„ ุจุฑุงู…ุฌ ุงุณุชุฎู„ุงุต ุงู„ู†ุตูˆุต ู…ู† ุนู„ู‰ ุงู„ุตูˆุฑ ุงู„ุชูŠ ูƒุชุจู‡ุง ุฃุจุงู†ูˆุจ ุญูŽู†ู‘ุง)