Skip to content

Conversation

@maxmnemonic
Copy link
Contributor

@maxmnemonic maxmnemonic commented Oct 23, 2025

Example on how to apply OCR with "nanonets-ocr2-3b" via LM Studio in a post-processing of Docling Document.
It respects bounding boxes of layout, table cells, and key-values, and re-populate existing text (if any) with the text obtained from OCR for a given bounding box of an element.
This requires LM Studio running inference server with "nanonets-ocr2-3b" model pre-loaded.

Checklist:

  • Documentation has been updated, if necessary.
  • Examples have been added, if necessary.
  • Tests have been added, if necessary.

…with "nanonets-ocr2-3b" via LM Studio

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
@maxmnemonic maxmnemonic self-assigned this Oct 23, 2025
@github-actions
Copy link
Contributor

github-actions bot commented Oct 23, 2025

DCO Check Passed

Thanks @maxmnemonic, all your commits are properly signed off. 🎉

@mergify
Copy link

mergify bot commented Oct 23, 2025

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

  • title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?(!)?:

@codecov
Copy link

codecov bot commented Oct 23, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
@maxmnemonic maxmnemonic requested a review from cau-git October 24, 2025 12:36
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
@maxmnemonic maxmnemonic force-pushed the dev/post_process_ocr_example branch from 40ed3b8 to 44f81c9 Compare October 24, 2025 12:39
Maksym Lysak added 2 commits October 24, 2025 17:20
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
@maxmnemonic maxmnemonic changed the title WIP: Example on how to apply OCR as post processing docs: Example on how to apply OCR as post processing Oct 24, 2025
@maxmnemonic maxmnemonic marked this pull request as ready for review October 24, 2025 15:47
@dosubot
Copy link

dosubot bot commented Oct 24, 2025

Related Documentation

Checked 3 published document(s). No updates required.

How did I do? Any feedback?  Join Discord

Maksym Lysak added 2 commits October 27, 2025 10:06
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
@maxmnemonic maxmnemonic changed the title docs: Example on how to apply OCR as post processing docs: Example on how to apply external OCR as post processing Oct 27, 2025
…functions

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
Copy link
Contributor

@PeterStaar-IBM PeterStaar-IBM left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants