docs: Example on how to apply external OCR as post processing #2517

maxmnemonic · 2025-10-23T14:48:58Z

Example on how to apply OCR with "nanonets-ocr2-3b" via LM Studio in a post-processing of Docling Document.
It respects bounding boxes of layout, table cells, and key-values, and re-populate existing text (if any) with the text obtained from OCR for a given bounding box of an element.
This requires LM Studio running inference server with "nanonets-ocr2-3b" model pre-loaded.

Checklist:

Documentation has been updated, if necessary.
Examples have been added, if necessary.
Tests have been added, if necessary.

…with "nanonets-ocr2-3b" via LM Studio Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

github-actions · 2025-10-23T14:49:14Z

✅ DCO Check Passed

Thanks @maxmnemonic, all your commits are properly signed off. 🎉

mergify · 2025-10-23T14:49:33Z

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?(!)?:

codecov · 2025-10-23T14:56:09Z

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

dosubot · 2025-10-24T15:47:39Z

Related Documentation

Checked 3 published document(s). No updates required.

^{How did I do? Any feedback?}

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

…functions Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

PeterStaar-IBM

lgtm!

Example on how to apply to Docling Document OCR as a post-processing …

48cedcd

…with "nanonets-ocr2-3b" via LM Studio Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

maxmnemonic self-assigned this Oct 23, 2025

Added support of elements with multiple provenances

f9d67fe

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

maxmnemonic requested a review from cau-git October 24, 2025 12:36

cleaning up

44f81c9

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

maxmnemonic force-pushed the dev/post_process_ocr_example branch from 40ed3b8 to 44f81c9 Compare October 24, 2025 12:39

Maksym Lysak added 2 commits October 24, 2025 17:20

improved prompt for nanonets-ocr2-3b

14d988e

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

cleaning up

567f092

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

maxmnemonic changed the title ~~WIP: Example on how to apply OCR as post processing~~ docs: Example on how to apply OCR as post processing Oct 24, 2025

maxmnemonic marked this pull request as ready for review October 24, 2025 15:47

Maksym Lysak added 2 commits October 27, 2025 10:06

excluded example from CI

e95dcbc

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

updated class name

34e4268

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

maxmnemonic changed the title ~~docs: Example on how to apply OCR as post processing~~ docs: Example on how to apply external OCR as post processing Oct 27, 2025

Improved usability of the example, added simple cli, and some helper …

78c9abc

…functions Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>

PeterStaar-IBM approved these changes Nov 3, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

docs: Example on how to apply external OCR as post processing #2517

docs: Example on how to apply external OCR as post processing #2517

maxmnemonic commented Oct 23, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Oct 23, 2025 •

edited

Loading

Uh oh!

mergify bot commented Oct 23, 2025 •

edited

Loading

Uh oh!

codecov bot commented Oct 23, 2025

Uh oh!

dosubot bot commented Oct 24, 2025

Uh oh!

PeterStaar-IBM left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

docs: Example on how to apply external OCR as post processing #2517

Are you sure you want to change the base?

docs: Example on how to apply external OCR as post processing #2517

Conversation

maxmnemonic commented Oct 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Oct 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mergify bot commented Oct 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Merge Protections

🟢 Enforce conventional commit

Uh oh!

codecov bot commented Oct 23, 2025

Codecov Report

Uh oh!

dosubot bot commented Oct 24, 2025

Uh oh!

PeterStaar-IBM left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

maxmnemonic commented Oct 23, 2025 •

edited

Loading

github-actions bot commented Oct 23, 2025 •

edited

Loading

mergify bot commented Oct 23, 2025 •

edited

Loading