Free OCR Software created by Abanoub Hanna - Software Developer

What is OCR?

OCR stands for Optical Character Recognition. It is a technique for recognizing the characters of text in images, such as photos of printed pages and banners.

Why OCR?

Imagine holding a page of written paragraphs, sitting down at a computer, and typing every single letter from it on the keyboard. This is tedious and exhausting. Instead, you can use OCR software to extract the text from a photo of the page, then copy the recognized text and fix any typos. That’s much better.

What are the use cases for OCR apps?

Suppose you need to type something on your laptop, tablet, or smartphone, and the text you want to write down is already on a banner, a paper, or a wall. Instead of typing each letter, open the camera on your smartphone, aim it at the text, and take a well-lit photo. Import the image into the OCR app you installed, get the extracted text, and paste it into any text editor, such as LibreOffice Writer. Then edit the text however you like, and don’t forget to fix the typos. The longer the text, the more time OCR software saves you.

So OCR software, whether Android apps, Windows programs, CLI apps, web apps, or cross-platform apps, is great for:

  • data entry jobs
  • librarians
  • writers
  • journalists
  • students writing a report or a research paper

and many other use cases where it makes people's jobs easier.

OCR app for Android phones and tablets

I created the img2txt OCR app for Android. It started as an app that extracts text from images only, but it can now extract text from PDF documents too. The OCR app supports English, Arabic, Hindi, Persian, and many other languages. Moreover, it can extract paragraphs written in two or more languages from the same photographed page.

OCR app features and capabilities

  • crop images before scanning and recognizing the written characters
  • share an image from a studio app, gallery app, or files app into the IMG2TXT app to extract text from it
  • the app UI supports two languages, English and Arabic; the default language is English
  • supports all current Android releases, from Android 6 (Marshmallow, SDK 23) up to the latest Android release, as I update the app often
  • cleans up the photo before handing it to the Tesseract OCR engine for better OCR results
  • shows bounding boxes around each recognized word
  • shows a notice if the mean_confidence of the result text is less than 60%
  • on-device Tesseract OCR with English and Arabic models preloaded
  • on-device Google Vision API (Latin-script languages only)
  • on-device ML Kit (Latin-script languages)
  • for Arabic, or for both languages (Arabic + English), Tesseract OCR is used
  • for English, the fallback strategy is ML Kit -> Google Vision -> Tesseract OCR
  • color-coded confidence/accuracy of the result text from ML Kit and Tesseract OCR
  • prompts the user to choose the language of the text in the image before processing it
  • the extracted text is editable
  • adds line breaks (newlines) to the extracted text (ML Kit and Tesseract OCR)
  • saves OCR history (a detailed history of scanned images)
  • batch processing of images and PDFs
  • the OCR engine supports Hindi, Farsi (Persian), Arabic, English, and many other languages whose tessdata models you can download inside the app
  • choose more than one image to OCR
  • pre-processes images with thresholding for clearer input and more accurate extracted text
  • edit the image manually before or after running OCR on it
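
To make the engine selection, fallback order, and low-confidence notice from the list above more concrete, here is a minimal Python sketch. It is not the app's actual code (the app runs on Android). The names `OcrResult`, `engine_order`, `run_ocr`, and `confidence_color`, the language codes `eng` and `ara`, and the color bands are all assumptions made for illustration. Only the 60% threshold, the ML Kit -> Google Vision -> Tesseract order for English, and the use of Tesseract for Arabic come from the feature list. Sending every other language to Tesseract is also an assumption.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Set

LOW_CONFIDENCE_THRESHOLD = 60.0  # percent, taken from the feature list


@dataclass
class OcrResult:
    text: str
    mean_confidence: float  # 0 to 100
    engine: str


# Hypothetical engine signature: image bytes in, a result (or None) out.
Engine = Callable[[bytes], Optional[OcrResult]]


def engine_order(languages: Set[str]) -> List[str]:
    """Return the engines to try, in order, for the chosen languages."""
    if languages == {"eng"}:
        # English only: ML Kit first, then Google Vision, then Tesseract.
        return ["mlkit", "google_vision", "tesseract"]
    # Arabic, Arabic + English, and (assumption) any other language.
    return ["tesseract"]


def run_ocr(image: bytes, languages: Set[str],
            engines: Dict[str, Engine]) -> OcrResult:
    """Try each engine in order and return the first non-empty result."""
    for name in engine_order(languages):
        engine = engines.get(name)
        if engine is None:
            continue  # engine not available on this device
        try:
            result = engine(image)
        except Exception:
            continue  # treat a crash as a failure and fall through
        if result is not None and result.text.strip():
            return result
    raise RuntimeError("no OCR engine produced any text")


def needs_low_confidence_notice(result: OcrResult) -> bool:
    """True when the user should be warned about a weak result."""
    return result.mean_confidence < LOW_CONFIDENCE_THRESHOLD


def confidence_color(confidence: float) -> str:
    """Map a confidence to a color; the bands are illustrative only."""
    if confidence >= 80.0:
        return "green"
    if confidence >= LOW_CONFIDENCE_THRESHOLD:
        return "orange"
    return "red"
```

With stub engines plugged into the `engines` dictionary, `run_ocr` walks the chain until one of them returns text, and the caller can then use `needs_low_confidence_notice` to decide whether to show the notice.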

A web app to OCR images and PDFs

A new OCR text scanner, offered as SaaS (software as a service), will be released soon.

  • next
    • supported languages
      • English
      • Arabic
      • Farsi (Persian)
      • Hindi (Indian)
    • supported file formats
      • JPG / JPEG
      • PNG
      • PDF (single-page, multi-page)
      • WEBP
      • HEIC
      • AVIF
      • JXL (JPEG XL)
    • supported sources
      • file upload
      • web URL
    • color-coded accuracy
    • task queue with priority (free and premium)
    • dedicated page showing the status and accuracy of every feature
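
The web app is not released yet, so the following is only a rough Python sketch of how a task queue with free and premium priority could work. It is not the service's implementation. The `OcrTaskQueue` class, its method names, and the rule that premium tasks always run before free ones are assumptions. Tasks within the same tier keep their submission order.

```python
import heapq
import itertools
from typing import List, Optional, Tuple

# Lower number = served first. The ordering is an assumption.
TIER_PRIORITY = {"premium": 0, "free": 1}


class OcrTaskQueue:
    """A priority queue of OCR jobs, identified here by their source."""

    def __init__(self) -> None:
        self._heap: List[Tuple[int, int, str]] = []
        # A running counter keeps first-in, first-out order inside a tier.
        self._sequence = itertools.count()

    def submit(self, source: str, tier: str) -> None:
        """Queue a file upload or web URL for the given tier."""
        if tier not in TIER_PRIORITY:
            raise ValueError(f"unknown tier: {tier}")
        entry = (TIER_PRIORITY[tier], next(self._sequence), source)
        heapq.heappush(self._heap, entry)

    def next_task(self) -> Optional[str]:
        """Pop the most urgent task, or return None if the queue is empty."""
        if not self._heap:
            return None
        _, _, source = heapq.heappop(self._heap)
        return source

    def __len__(self) -> int:
        return len(self._heap)
```

For example, after submitting `a.png` (free), `b.pdf` (premium), and `c.jpg` (free) in that order, `next_task()` returns `b.pdf` first, then `a.png`, then `c.jpg`.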