Preventing header, footers, page indicators being inserted into the text #1326

Nabaddon · 2025-07-14T13:03:23Z

Every page's header ("XYZ MANUAL"), footer ("XYT Manual - XXX"), and page number from my PDF have been inserted directly into the body text of the Markdown output.

Is there a way to have markitdown identify headers, footers, page indicators and similar elements, and prevent this behaviour?

Thank you!

Ocsa654 · 2025-07-16T06:18:11Z

At the moment, MarkItDown doesn't automatically detect or filter out headers, footers, or page numbers during PDF-to-Markdown conversion, so they end up in the body text.
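
Until something like that exists, one possible workaround is to clean the Markdown after conversion. The sketch below is not a MarkItDown feature: `strip_repeated_lines`, `min_repeats`, and `max_len` are hypothetical names and thresholds chosen for illustration. It assumes headers and footers show up as short lines that repeat (ignoring digits) on several pages, and that page numbers sit on a line of their own.

```python
import re
from collections import Counter

# Matches standalone page indicators such as "7", "Page 7", "7/120", "Page 7 of 120".
PAGE_NUMBER = re.compile(
    r"^\s*(?:page\s+)?\d+(?:\s*(?:/|of)\s*\d+)?\s*$", re.IGNORECASE
)


def _key(line: str) -> str:
    """Normalize a line so 'XYT Manual - 12' and 'XYT Manual - 13' compare equal."""
    return re.sub(r"\d+", "#", line.strip().lower())


def strip_repeated_lines(text: str, min_repeats: int = 3, max_len: int = 80) -> str:
    """Drop standalone page numbers and short lines that repeat across the text.

    Heuristic only: a short line of real content that repeats at least
    `min_repeats` times (or a body line that is just a number) is removed too.
    """
    lines = text.splitlines()

    # Count how often each short, non-empty line occurs (digits ignored).
    counts = Counter(
        _key(line)
        for line in lines
        if line.strip() and len(line.strip()) <= max_len
    )

    kept = []
    for line in lines:
        stripped = line.strip()
        if stripped and PAGE_NUMBER.match(stripped):
            continue  # standalone page indicator
        if stripped and len(stripped) <= max_len and counts[_key(line)] >= min_repeats:
            continue  # likely a repeated header or footer
        kept.append(line)
    return "\n".join(kept)
```

To try it, pass in the converted text, for example `strip_repeated_lines(MarkItDown().convert("manual.pdf").text_content)`, where `manual.pdf` is a placeholder path. Raising `min_repeats` makes the filter more conservative; it is worth diffing the output against the original on a few pages, since repeated table rows or horizontal rules would also match.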
