ocrmypdf: generate a searchable PDF
ยท
171 words
ยท
1 minute read
What is ocrmypdf? ๐
ocrmypdf is a command-line app to generate a searchable PDF or PDF/A from a scanned PDF or an image of text.
More information: https://ocrmypdf.readthedocs.io/en/latest/cookbook.html .
Usage ๐
Create a new searchable PDF/A file from a scanned PDF or image file:
ocrmypdf path/to/input_file path/to/output.pdf
Replace a scanned PDF file with a searchable PDF file:
ocrmypdf path/to/file.pdf path/to/file.pdf
Skip pages of a mixed-format input PDF file that already contain text:
ocrmypdf --skip-text path/to/input.pdf path/to/output.pdf
Clean, de-skew, and rotate pages of a poor scan:
ocrmypdf --clean --deskew --rotate-pages path/to/input_file path/to/output.pdf
Set the metadata of the searchable PDF file:
ocrmypdf --title "title" --author "author" --subject "subject" --keywords "keyword; key phrase; ..." path/to/input_file path/to/output.pdf
Display help:
ocrmypdf --help
I hope you enjoyed reading this post as much as I enjoyed writing it. If you know a person who can benefit from this information, send them a link of this post. If you want to get notified about new posts, follow me on YouTube , Twitter (x) , LinkedIn , and GitHub .