OCR (PDF → Markdown) with Ollama + deepseek-ocr

Converts a PDF to text (Markdown) by rendering each page to an image and sending it through the deepseek-ocr:latest model served by Ollama via its OpenAI-compatible API (Responses API).

Requirements

Ollama running locally and the model pulled:
- ollama pull deepseek-ocr:latest
On Linux, pdf2image requires Poppler:
- Debian/Ubuntu: sudo apt-get install poppler-utils

Installation (pipx)

pipx install git+https://github.com/arrase/OCR.git

Usage

ocr 2512.15741v1.pdf

Page selection

You can include/exclude pages (1-based) and use both at the same time; --include is applied first and then --exclude.

Examples:

# Only page 1
ocr --include 1 2512.15741v1.pdf

# Pages 1 to 5 except 3
ocr --include 1-5 --exclude 3 2512.15741v1.pdf

# Combinations
ocr --include 1,3,5-8 --exclude 6-7 2512.15741v1.pdf

Output: creates 2512.15741v1.md in the same directory.

Configuration

You can configure the tool using a YAML file. The tool looks for a configuration file in the following order:

Path specified via --config / -c.
~/ocr_config.yaml.

Default Configuration

A default configuration file is provided in config/default_config.yaml:

model: deepseek-ocr:latest
base_url: http://localhost:11434/v1
prompt: |
  Convert the document to markdown.

Environment variables (optional)

Environment variables override configuration files:

OLLAMA_BASE_URL
OLLAMA_MODEL

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
config		config
docs		docs
src/ocr_ollama		src/ocr_ollama
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OCR (PDF → Markdown) with Ollama + deepseek-ocr

Requirements

Installation (pipx)

Usage

Page selection

Configuration

Default Configuration

Environment variables (optional)

About

Uh oh!

Releases 2

Packages

Uh oh!

Languages

arrase/OCR

Folders and files

Latest commit

History

Repository files navigation

OCR (PDF → Markdown) with Ollama + deepseek-ocr

Requirements

Installation (pipx)

Usage

Page selection

Configuration

Default Configuration

Environment variables (optional)

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Languages

Packages