feat: OCR as enrichment for pictures in simple pipeline (docx, pptx, html, etc) #2488

dolfim-ibm · 2025-10-17T14:12:03Z

This PR allows the OCR step to also run on pictures found in documents converted with the SimplePipeline, e.g. docx, pptx, html, etc.

Unfinished work (TODO):

Actually call the OCR model.
Each OCR model currently implements its logic in the call method. For this feature to work, it would be better to refactor and decouple some components.
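One possible shape for the decoupling mentioned in the TODO is sketched below. This is only an illustration under stated assumptions: `OcrEngine`, `OcrCell`, `Picture`, `FakeEngine`, and `enrich_pictures` are hypothetical names and are not taken from the docling code base. The idea is that recognition lives behind a small interface that takes an image, so a picture enrichment step can reuse it without going through page-specific logic.

```python
from dataclasses import dataclass, field
from typing import Iterable, Protocol


@dataclass
class OcrCell:
    """One recognized text snippet (hypothetical type, not docling's)."""
    text: str
    confidence: float


class OcrEngine(Protocol):
    """Hypothetical interface: recognition only, no page or pipeline logic."""

    def recognize(self, image: bytes) -> list[OcrCell]: ...


@dataclass
class Picture:
    """Stand-in for a picture item extracted from a docx/pptx/html document."""
    image: bytes
    ocr_cells: list[OcrCell] = field(default_factory=list)


class FakeEngine:
    """Toy engine for illustration: pretends the image bytes are the text."""

    def recognize(self, image: bytes) -> list[OcrCell]:
        text = image.decode("utf-8").strip()
        return [OcrCell(text=text, confidence=1.0)] if text else []


def enrich_pictures(pictures: Iterable[Picture], engine: OcrEngine) -> list[Picture]:
    """Run the shared engine on each picture and attach the recognized cells."""
    enriched = []
    for picture in pictures:
        picture.ocr_cells = engine.recognize(picture.image)
        enriched.append(picture)
    return enriched
```

With this split, a page-based OCR model and a picture enrichment model could both delegate to the same `recognize` implementation; whether that matches the refactoring the author has in mind is not stated in the PR.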

Checklist:

Documentation has been updated, if necessary.
Examples have been added, if necessary.
Tests have been added, if necessary.

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

mergify · 2025-10-17T14:12:38Z

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?(!)?:
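As a quick local check, the title rule above can be reproduced with Python's `re` module. This is a minimal sketch that assumes the `~=` operator behaves like a regular-expression search on the title; the helper name `is_conventional` is made up for illustration.

```python
import re

# Pattern copied verbatim from the merge protection rule.
CONVENTIONAL_TITLE = re.compile(
    r"^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?(!)?:"
)


def is_conventional(title: str) -> bool:
    """Return True if the title starts with a conventional-commit prefix."""
    return CONVENTIONAL_TITLE.search(title) is not None
```

The title of this PR starts with `feat:`, so it satisfies the pattern, while a title without a type prefix would not.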

github-actions · 2025-10-17T14:12:58Z

✅ DCO Check Passed

Thanks @dolfim-ibm, all your commits are properly signed off. 🎉

codecov · 2025-10-17T14:15:45Z

Codecov Report

❌ Patch coverage is 90.62500% with 3 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
docling/models/ocr_enrichment.py	89.28%	3 Missing ⚠️

📢 Thoughts on this report? Let us know!

add ocr as enrichment for pictures in simple pipeline

ee5aedc

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
