ocrmypdf
  • Introduction
  • Release notes
  • Installing additional language packs

Usage

  • Cookbook
  • Advanced features
  • Batch processing
  • PDF security issues
  • Common error messages
ocrmypdf
  • Docs »
  • OCRmyPDF documentation
  • View page source

OCRmyPDF documentation¶

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched.

PDF is the best format for storing and exchanging scanned documents. Unfortunately, PDFs can be difficult to modify. OCRmyPDF makes it easy to apply image processing and OCR to existing PDFs.

  • Introduction
  • Release notes
  • Installing additional language packs

Usage

  • Cookbook
    • Basic examples
    • OCR images, not PDFs
    • Image processing
    • Improving OCR quality
  • Advanced features
    • Control of OCR options
    • Changing the PDF renderer
  • Batch processing
    • Batch jobs
    • Directory trees
    • Hot (watched) folders
  • PDF security issues
    • PDFs may contain malware
    • How OCRmyPDF processes PDFs
    • Using OCRmyPDF online or as a service
    • Password protection, digital signatures and certification
  • Common error messages
    • Page already has text
    • Input file ‘filename’ is not a valid PDF

Indices and tables¶

  • Index
  • Module Index
  • Search Page
Next

© Copyright 2018, James R. Barlow. Licensed under Creative Commons Attribution-ShareAlike 4.0.

Built with Sphinx using a theme provided by Read the Docs.