Newest 'pdf' Questions

0 votes

0 answers

32 views

On a PdfTable changing individual cells background color

Using PdfTable from SlapKit.PDF. How do I change a specific cell's background color? I setup my table as shown below. There is no problem there. This causes them to have a single uniform color. ...

BigGapave

1

asked 21 hours ago

0 votes

0 answers

35 views

Ghostscript heavily distorts colors of PDF image [closed]

Consider the following PDF which contains this image: Now I would like to compress the PDF using GhostScript by running this command: "C:\Program Files\gs\gs10.06.0\bin\gswin64c.exe" -...

robertspierre

5,791

asked yesterday

5 votes

1 answer

81 views

PDF Precise checking location letters with different ascents and descents

My question is I am trying to check a letters precise x, y so I can find if it's getting too close to a border I drew on the PDF. Code is below good for most cases, but some fonts have ascents and ...

Arlon Jakal

45

asked yesterday

Advice

0 votes

1 replies

10 views

Best practice for extracting structured numeric data from PDFs returned by an API for calculations

I’m working on a backend process where I need to: Fetch a PDF document from a third-party API Extract a small, known set of numeric values from the document Feed those values into deterministic ...

James Burch

1

asked Jan 31 at 23:11

1 vote

0 answers

59 views

PDF.js + iTextSharp: anonymization rectangles wrong on rotated (/Rotate) pages

I’m building a PDF anonymization feature: Frontend: PDF.js viewer where user draws blackout rectangles on a page. Backend: iTextSharp to draw black filled rectangles (anonymize). If the pdf has Rotate ...

user27910466

21

asked Jan 29 at 11:49

-3 votes

0 answers

41 views

Coordinate wise extraction [closed]

I have extracted content from a scanned pdf using paddle OCR , I got the coordinates in this .I wanted to obtain the output by preserving the pdf's layout . But I didn't get. I tried taking the median ...

SAI SIVA RUBHA

1

asked Jan 28 at 3:33

0 votes

0 answers

47 views

Text parsing in PDF

I am trying to parse the pdf and align the content, but I am facing issue while parsing. It reads line by line so I am not able to extract the content present inside that section. Expected content for ...

Abinash abi

1

asked Jan 28 at 1:19

1 vote

1 answer

72 views

Wrong characters from PDF when extracting text using PDF Pig in c#

I am using pdf pig to extract text from a user uploaded PDF. Purpose being to save time for manual reentry. The files are consistent enough to use this approach and search for identifiable keyword ...

Michael Yamokoski

21

asked Jan 22 at 21:24

0 votes

0 answers

67 views

Apache PDFBox 3.0.6: external pdf signing - The document was changed or altered error by signature validation

I am trying to integrate a external signature into a PDF using Apache PDFBox. I get the signature from an external service, which I call based on a base64 encoded SHA256 String from my pdf content. As ...

Visez Frunze

1

asked Jan 22 at 19:37

Tooling

1 vote

3 replies

144 views

How to create PDFs in chatGPT?

I want to create a math worksheet pdf, but chatGPT gives me either a poorly formatted word doc, or it gives me latex code, and I don't know how to run latex. I know gemini can you can use the canvas ...

Pulkit Agarwal

1

asked Jan 5 at 10:09

0 votes

0 answers

55 views

Make Pdf LTV without CRLs using DSS 6.0

I am using DSS 6.0: <dependency> <groupId>eu.europa.ec.joinup.sd-dss</groupId> <artifactId>dss-pades</artifactId> <version>6.0</version> </...

Shahid Ghafoor

3,123

asked Jan 5 at 4:56

-1 votes

2 answers

88 views

iOS PDF Kit PDF Compression not reducing PDF Size

I am trying to compress the PDF (to reduce its storage size) and using following code but instead of decreasing storage size, it is increasing it: import Foundation import PDFKit import CoreGraphics ...

Malwinder Singh

7,226

asked Dec 31, 2025 at 9:32

0 votes

0 answers

70 views

Reprocessing PDF Content Streams Results in Malformed PDF

I'm attempting to use pdf-lib to reprocess an existing PDF. Specifically, I'm modifying the content streams inside that PDF. The idea of my project is to force PDF/UA, and possibly PDF/A-1 later, ...

Jason

4,198

asked Dec 28, 2025 at 20:05

1 vote

4 answers

191 views

Split in Half Long PDF Texts with Poor OCR-ing to Generate Speaker Turns in the Correct Order (R)

I am trying to process text for quantitative text analysis. I need to read in pdfs of transcripts from WHO plenary meetings and process the text into a speaker turn dataframe, identifying the speaker ...

flâneur

363

asked Dec 24, 2025 at 16:47

1 vote

0 answers

73 views

WeasyPrint + pypdf: visible, non-editable escape characters on parentheses

I'm generating PDF form from HTML (WeasyPrint), filling the AcroForm fields with pypdf, then merging that page into a 4‑page template. However, when the data contain parentheses, pdf viewers (browser, ...

Yaourt

113

asked Dec 23, 2025 at 11:03

1 vote

1 answer

165 views

How to stretch a tmap wider in plot mode?

I have two questions about the sample tmap code below. How would I make the tmap fill the entire page width? How to avoid generating the blank 1st page? I've tried grid.arrange and I was able to ...

Nick Bear

127

asked Dec 13, 2025 at 7:02

0 votes

1 answer

136 views

pdf-lib: How to stack cropped PDF page chunks back onto a single page instead of creating hidden full pages?

simple web app to use for pdf chunking I’m using the pdf-lib npm package to crop and manipulate PDF files. My goal is to: List item. Crop multiple horizontal slices from each page. Re-export those ...

Crow V

59

asked Dec 10, 2025 at 23:35

Advice

1 vote

5 replies

129 views

How to build an API that signs a PDF using eMudhra DSC on a Hypersecure USB token?

I need to build an API (in any language — Python, Node.js, or .NET) that accepts a PDF file and returns the digitally signed PDF. The signature must be done using my eMudhra Digital Signature ...

Wanda Maximoff

21

asked Dec 6, 2025 at 7:57

-1 votes

1 answer

78 views

html2pdf.js on the fly css issue [closed]

I am attempting to save a pdf file from an editor I have put together at my website (aimed at movie scripts incidentally), and in putting together the css for the page, I insert css rules using ...

Ben Cahan

1

asked Dec 5, 2025 at 23:04

Advice

0 votes

6 replies

122 views

Determine whether checkbox is checked or not

I want to programatically determine whether checkbox is checked or not in a PDF. I have tried Tesseract & Google cloud vision API with Full Text Detection & Label Detection and could't find a ...

Saumini Navaratnam

8,955

asked Dec 4, 2025 at 11:23

0 votes

1 answer

139 views

Is it possible to create a valid PAdES-LT certification signature with DocMDP level 1 (“No changes allowed”)?

I’m trying to find out whether it is possible to create a PAdES-LT certification signature while setting the DocMDP permission level to 1 (No changes allowed). I’ve tested adding the DSS in the same ...

krillov

139

asked Dec 3, 2025 at 23:43

Tooling

0 votes

1 replies

47 views

Angular: How to Store Text Highlights in a PDF Viewer and Navigate Back to Them Later

Question I’m using ngx-extended-pdf-viewer to render PDFs inside an Angular application. I need to let users highlight text in the PDF, save those highlights as “references”, and later click a ...

Brayden de Koning

7

asked Dec 3, 2025 at 23:24

3 votes

1 answer

251 views

How can I display the Persian language fonts?

I use my custom Persian font in jsPDF, but characters for this font show me wrong. I hear The IranNastaliq font uses the 'cswh' feature for such ligatures. 'cswh' is default-off according to the ...

Chris Sermanni

45

asked Nov 29, 2025 at 4:51

Best practices

0 votes

2 replies

97 views

javascript : how to display and scroll a remote pdf, on a browser non dependant way

I wish to display in an html page and control its scrolling, the image of a pdf hosted on a server. I am currently using Mozilla’s pdf.js scripts, which work well for pdf files stored on my computer. ...

jean-marie Detrey

1

asked Nov 27, 2025 at 15:33

Best practices

1 vote

6 replies

119 views

Loading XDP, in PDF with multiple subforms, already certificate signed

A smart PDF is given. Adobe LiveCycle Designer ES 10.0 PDF https://mfinante.gov.ro/documents/2552173/3124827/ALOP_DocumentFundamentare_V03.pdf It's even more special, it has 2 different sections A and ...

Daniel

29

asked Nov 26, 2025 at 8:20

3 votes

1 answer

111 views

Extracting decimal and negative numbers from PDF, where decimals look like exponents in the PDF

I am processing a set of PDF files using Python. From each PDF I need to extract several monetary amounts and then write them to an Excel file. The problem is that in the PDF the decimals are written ...

Larisa Palimaru

31

asked Nov 24, 2025 at 11:53

1 vote

1 answer

81 views

Text overlaps and resets position when rendering multiple text chunks in a loopHow

I am building a client-side application ("SpeakFlow") that takes large PDF files, processes the text in chunks using an LLM to generate Markdown, and then compiles those chunks back into a ...

Chirag Singhal

103

asked Nov 20, 2025 at 9:42

Tooling

2 votes

1 replies

67 views

How to reduce memory consumption when processing PDFs with large embedded images using PDFBox?

I'm using Apache PDFBox to process PDFs that contain very large, embedded images (e.g., 6538x6570px = ~163MB when decompressed). The service consumes 290-446MB of memory during conversion, causing ...

user31898390

1

asked Nov 18, 2025 at 5:51

0 votes

0 answers

77 views

How to add [page] , [topage] in the body of PDF

I am confused about generating the PDF. I have a case Table of content A-1 .......................................................................... 2 B-2 ..............................................

LihnNguyen

883

asked Nov 17, 2025 at 12:53

Tooling

1 vote

3 replies

103 views

Convert complex pdf to an excel

I'm currently searching for a solution to read a pdf and convert it to an excel. For now I found the "tabula-py" library, which seems to be good. But I'm not quite sure because the pdf has ...

marskernel

1

asked Nov 15, 2025 at 15:19

0 votes

0 answers

51 views

DinkToPdf / wkhtmltopdf prints only 19 pages even with 64-bit DLL

I am generating a PDF in ASP.NET Core using DinkToPdf (wkhtmltopdf wrapper). My HTML report is around 20+ pages, but the generated PDF always stops at 19 pages. It never prints the full content. PDF ...

Arpit Lathiya

11

asked Nov 15, 2025 at 12:19

3 votes

3 answers

197 views

Storing biometric data in a PDF without breaking the digital signature

I'm developing a tool that stores a signer's biometric data inside a PDF together with the digital signature, but I'm unsure where this information should be embedded. The biometric data is captured ...

nex0

33

asked Nov 14, 2025 at 4:14

Advice

1 vote

0 replies

35 views

Sending pdf from mail.google.com via mail-sending service

I have a question. I need to create a new email at mail.google.com, enter the recipient name, upload a PDF, and send it. The recipient should receive the converted PDF. To do this, the email must pass ...

Ichi

69

asked Nov 13, 2025 at 18:07

Advice

0 votes

6 replies

147 views

MS access Split report and export each row as a PDF

I am completely new to Visual Basic as a coding language coming mainly from Java and SQL but rusty in both of them in current role I am working with more Microsoft Access Databases but I need to ...

Darren Murtagh

567

asked Nov 13, 2025 at 12:00

1 vote

1 answer

97 views

How do I speed up pdf2docx?

Right now, I have very simple code which takes a pdf, and turns it into a docx file at the same path with the same name. However, with larger pdfs over 100 pages, it's taking over a minute to convert. ...

user30589464

136

asked Nov 12, 2025 at 19:32

Tooling

1 vote

3 replies

76 views

Optimized convertion SVG in to PDF

I have a hundred SVG files that I want to distribute. To make it easier, I converted them into a single PDF. However, since the SVG files define some paths that are used many times, during the ...

AsrtoMichi

36

asked Nov 11, 2025 at 21:27

Tooling

0 votes

1 replies

71 views

How to add interactive components like button/input on top of PDF in Angular?

I am trying to create a UI when user click 'Sign' button, it will show a pop up modal with a PDF content display. I want to add a button on the sign field of PDF, if user click on it a sign pad appear ...

Tracy

3

asked Nov 6, 2025 at 6:48

2 votes

1 answer

92 views

How can I make pdfcrowd script to save on Windows Downloads Folder instead of the site folder [closed]

I am trying to create a page with pdfcrowd, which works well. The issue I have is that it saves the created PDF file inside the site folder. I want it to save to the Windows Downloads Folder instead. ...

Bubu

21

asked Nov 5, 2025 at 10:11

0 votes

0 answers

92 views

Cloudinary error: "Customer is marked as untrusted" or "Blocked for delivery" when accessing uploaded PDF files

I’m using Cloudinary to upload documents (PDFs, ZIPs, etc.) from my Next.js app. The upload succeeds, but when I try to access the file via URL like: In the Cloudinary Media Library, it says: Access ...

Kenil Mangukiya

1

asked Nov 4, 2025 at 4:13

3 votes

1 answer

112 views

"Cannot access a closed Stream" when trying to return PDF from controller using iText

I'm trying to copy pages from an existing PDF to a new one and return it. I'm writing to the new PDF with memory, but it returns an error stating Cannot access a closed Stream. Here is my code: var ...

Marwan Hashem

33

asked Nov 2, 2025 at 9:05

3 votes

1 answer

228 views

How to extract table from PDF with boxes into pandas dataframe

I have code that detects a table in a PDF that appears after a specific section, and parses the information in the table and copies it into a pandas dataframe. Now, I want to indicate whether a box is ...

user2813606

961

asked Nov 1, 2025 at 19:08

-1 votes

1 answer

114 views

Swift / CoreGraphics: semicircular arc draws correctly but arrow pointer position/rotation doesn't match BMI value in exported PDF

I'm drawing a semicircular BMI gauge in a PDF using UIGraphicsPDFRenderer and UIBezierPath. The colored arc and labels render correctly, but the arrow pointer (which should point to the arc position ...

he who remains

9

asked Oct 31, 2025 at 12:23

Best practices

1 vote

3 replies

74 views

How can I print very thin lines (e. g. 1/300 dpi) to PDF via Chrome? (Prevent rounding to full css-px)

I would like to print a 1/300 inch wide line with chrome (PDF). Unfortunately, the line is rounded up to a value of a css-pixel. (It has to be Chrome [not PRINCE XML], due to some advanced features…)

Adler

2,907

asked Oct 31, 2025 at 4:42

2 votes

0 answers

120 views

Applying JavaScript formatting to IronPDF form using PyMuPDF

I am using IronPdf (2025.6.1.5) to create fillable PDF forms. This involves creating HTML forms and converting them to PDFs with IronPDF. I am post-processing the PDFs with PyMuPDF (1.26.4) to apply ...

BalooRM

514

asked Oct 24, 2025 at 18:05

0 votes

1 answer

90 views

Can't find elements when a pdf is opened in tab (chromedriver)

When I open a PDF file via a normal link (e.g., https://unec.edu.az/application/uploads/2014/12/pdf-sample.pdf), Chrome opens the PDF in the same tab with Chrome's integrated PDF viewer component. In ...

bar2

10

asked Oct 16, 2025 at 15:53

1 vote

1 answer

114 views

Problems creating a PDF with all truetype fonts

I'm working on a program to create a PDF of all TrueType fonts. The program causes an error relating to the Tooth&Nail.ttf font which prevents the PDF from being created. Other fonts produce ...

ckx

130

asked Oct 16, 2025 at 10:00

0 votes

0 answers

140 views

PDF field misalignment between React-PDF editor and final signed PDF (fields shift down by one field height per placement)

I'm building a custom React-based PDF editor and signer using react-pdf and pdf-lib. The editor (PdfEditor.tsx) lets users drag-and-drop fields (Signature, Email, Date, etc.) on a PDF canvas. Later, ...

KINETIC

33

asked Oct 15, 2025 at 4:38

0 votes

0 answers

81 views

Remove signature and add picture of signature instead

I need to remove digital signatures from a PDF, but "print" the signature to the content beforehand (like print to "Microsoft PDF Printer" - print all Metadata and Signatures). In ...

Andreas Radakovits

1

asked Oct 9, 2025 at 14:20

1 vote

2 answers

201 views

How to generate bookmarks in a PDF?

I've been using Playwright to generate a document from HTML code with a table of content corresponding to the H1-6 tags I'm using. I was hoping that bookmarks in the PDF would be generated from those ...

Pierre Gidel

11

asked Oct 9, 2025 at 13:46

1 vote

1 answer

151 views

PDF link opens to exact page on Windows but not on mobile or tablet

I’m using a link to open a PDF file hosted on SharePoint (or any other web location) and I want it to open directly at a specific page. For example: https://example.com/myfile.pdf#page=5 On Windows ...

Meir

187

asked Oct 8, 2025 at 10:47

Collectives™ on Stack Overflow