51,165 questions
0
votes
0
answers
32
views
On a PdfTable changing individual cells background color
Using PdfTable from SlapKit.PDF. How do I change a specific cell's background color?
I setup my table as shown below. There is no problem there. This causes them to have a single uniform color. ...
0
votes
0
answers
35
views
Ghostscript heavily distorts colors of PDF image [closed]
Consider the following PDF which contains this image:
Now I would like to compress the PDF using GhostScript by running this command:
"C:\Program Files\gs\gs10.06.0\bin\gswin64c.exe" -...
5
votes
1
answer
81
views
PDF Precise checking location letters with different ascents and descents
My question is I am trying to check a letters precise x, y so I can find if it's getting too close to a border I drew on the PDF.
Code is below good for most cases, but some fonts have ascents and ...
Advice
0
votes
1
replies
10
views
Best practice for extracting structured numeric data from PDFs returned by an API for calculations
I’m working on a backend process where I need to:
Fetch a PDF document from a third-party API
Extract a small, known set of numeric values from the document
Feed those values into deterministic ...
1
vote
0
answers
59
views
PDF.js + iTextSharp: anonymization rectangles wrong on rotated (/Rotate) pages
I’m building a PDF anonymization feature:
Frontend: PDF.js viewer where user draws blackout rectangles on a page.
Backend: iTextSharp to draw black filled rectangles (anonymize).
If the pdf has Rotate ...
-3
votes
0
answers
41
views
Coordinate wise extraction [closed]
I have extracted content from a scanned pdf using paddle OCR , I got the coordinates in this .I wanted to obtain the output by preserving the pdf's layout . But I didn't get. I tried taking the median ...
0
votes
0
answers
47
views
Text parsing in PDF
I am trying to parse the pdf and align the content, but I am facing issue while parsing. It reads line by line so I am not able to extract the content present inside that section.
Expected content for ...
1
vote
1
answer
72
views
Wrong characters from PDF when extracting text using PDF Pig in c#
I am using pdf pig to extract text from a user uploaded PDF. Purpose being to save time for manual reentry. The files are consistent enough to use this approach and search for identifiable keyword ...
0
votes
0
answers
67
views
Apache PDFBox 3.0.6: external pdf signing - The document was changed or altered error by signature validation
I am trying to integrate a external signature into a PDF using Apache PDFBox.
I get the signature from an external service, which I call based on a base64 encoded SHA256 String from my pdf content. As ...
Tooling
1
vote
3
replies
144
views
How to create PDFs in chatGPT?
I want to create a math worksheet pdf, but chatGPT gives me either a poorly formatted word doc, or it gives me latex code, and I don't know how to run latex. I know gemini can you can use the canvas ...
0
votes
0
answers
55
views
Make Pdf LTV without CRLs using DSS 6.0
I am using DSS 6.0:
<dependency>
<groupId>eu.europa.ec.joinup.sd-dss</groupId>
<artifactId>dss-pades</artifactId>
<version>6.0</version>
</...
-1
votes
2
answers
88
views
iOS PDF Kit PDF Compression not reducing PDF Size
I am trying to compress the PDF (to reduce its storage size) and using following code but instead of decreasing storage size, it is increasing it:
import Foundation
import PDFKit
import CoreGraphics
...
0
votes
0
answers
70
views
Reprocessing PDF Content Streams Results in Malformed PDF
I'm attempting to use pdf-lib to reprocess an existing PDF. Specifically, I'm modifying the content streams inside that PDF. The idea of my project is to force PDF/UA, and possibly PDF/A-1 later, ...
1
vote
4
answers
191
views
Split in Half Long PDF Texts with Poor OCR-ing to Generate Speaker Turns in the Correct Order (R)
I am trying to process text for quantitative text analysis. I need to read in pdfs of transcripts from WHO plenary meetings and process the text into a speaker turn dataframe, identifying the speaker ...
1
vote
0
answers
73
views
WeasyPrint + pypdf: visible, non-editable escape characters on parentheses
I'm generating PDF form from HTML (WeasyPrint), filling the AcroForm fields with pypdf, then merging that page into a 4‑page template.
However, when the data contain parentheses, pdf viewers (browser, ...
1
vote
1
answer
165
views
How to stretch a tmap wider in plot mode?
I have two questions about the sample tmap code below.
How would I make the tmap fill the entire page width?
How to avoid generating the blank 1st page?
I've tried grid.arrange and I was able to ...
0
votes
1
answer
136
views
pdf-lib: How to stack cropped PDF page chunks back onto a single page instead of creating hidden full pages?
simple web app to use for pdf chunking
I’m using the pdf-lib npm package to crop and manipulate PDF files.
My goal is to:
List item.
Crop multiple horizontal slices from each page.
Re-export those ...
Advice
1
vote
5
replies
129
views
How to build an API that signs a PDF using eMudhra DSC on a Hypersecure USB token?
I need to build an API (in any language — Python, Node.js, or .NET) that accepts a PDF file and returns the digitally signed PDF.
The signature must be done using my eMudhra Digital Signature ...
-1
votes
1
answer
78
views
html2pdf.js on the fly css issue [closed]
I am attempting to save a pdf file from an editor I have put together at my website (aimed at movie scripts incidentally), and in putting together the css for the page, I insert css rules using ...
Advice
0
votes
6
replies
122
views
Determine whether checkbox is checked or not
I want to programatically determine whether checkbox is checked or not in a PDF. I have tried Tesseract & Google cloud vision API with Full Text Detection & Label Detection and could't find a ...
0
votes
1
answer
139
views
Is it possible to create a valid PAdES-LT certification signature with DocMDP level 1 (“No changes allowed”)?
I’m trying to find out whether it is possible to create a PAdES-LT certification signature while setting the DocMDP permission level to 1 (No changes allowed).
I’ve tested adding the DSS in the same ...
Tooling
0
votes
1
replies
47
views
Angular: How to Store Text Highlights in a PDF Viewer and Navigate Back to Them Later
Question
I’m using ngx-extended-pdf-viewer to render PDFs inside an Angular application. I need to let users highlight text in the PDF, save those highlights as “references”, and later click a ...
3
votes
1
answer
251
views
How can I display the Persian language fonts?
I use my custom Persian font in jsPDF, but characters for this font show me wrong.
I hear
The IranNastaliq font uses the 'cswh' feature for such ligatures. 'cswh' is default-off according to the ...
Best practices
0
votes
2
replies
97
views
javascript : how to display and scroll a remote pdf, on a browser non dependant way
I wish to display in an html page and control its scrolling, the image of a pdf hosted on a server.
I am currently using Mozilla’s pdf.js scripts, which work well for pdf files stored on my computer.
...
Best practices
1
vote
6
replies
119
views
Loading XDP, in PDF with multiple subforms, already certificate signed
A smart PDF is given. Adobe LiveCycle Designer ES 10.0 PDF
https://mfinante.gov.ro/documents/2552173/3124827/ALOP_DocumentFundamentare_V03.pdf
It's even more special, it has 2 different sections A and ...
3
votes
1
answer
111
views
Extracting decimal and negative numbers from PDF, where decimals look like exponents in the PDF
I am processing a set of PDF files using Python.
From each PDF I need to extract several monetary amounts and then write them to an Excel file.
The problem is that in the PDF the decimals are written ...
1
vote
1
answer
81
views
Text overlaps and resets position when rendering multiple text chunks in a loopHow
I am building a client-side application ("SpeakFlow") that takes large PDF files, processes the text in chunks using an LLM to generate Markdown, and then compiles those chunks back into a ...
Tooling
2
votes
1
replies
67
views
How to reduce memory consumption when processing PDFs with large embedded images using PDFBox?
I'm using Apache PDFBox to process PDFs that contain very large, embedded images (e.g., 6538x6570px = ~163MB when decompressed). The service consumes 290-446MB of memory during conversion, causing ...
0
votes
0
answers
77
views
How to add [page] , [topage] in the body of PDF
I am confused about generating the PDF. I have a case
Table of content
A-1 .......................................................................... 2
B-2 ..............................................
Tooling
1
vote
3
replies
103
views
Convert complex pdf to an excel
I'm currently searching for a solution to read a pdf and convert it to an excel.
For now I found the "tabula-py" library, which seems to be good. But I'm not quite sure because the pdf has ...
0
votes
0
answers
51
views
DinkToPdf / wkhtmltopdf prints only 19 pages even with 64-bit DLL
I am generating a PDF in ASP.NET Core using DinkToPdf (wkhtmltopdf wrapper).
My HTML report is around 20+ pages, but the generated PDF always stops at 19 pages. It never prints the full content.
PDF ...
3
votes
3
answers
197
views
Storing biometric data in a PDF without breaking the digital signature
I'm developing a tool that stores a signer's biometric data inside a PDF together with the digital signature, but I'm unsure where this information should be embedded.
The biometric data is captured ...
Advice
1
vote
0
replies
35
views
Sending pdf from mail.google.com via mail-sending service
I have a question.
I need to create a new email at mail.google.com, enter the recipient name, upload a PDF, and send it. The recipient should receive the converted PDF.
To do this, the email must pass ...
Advice
0
votes
6
replies
147
views
MS access Split report and export each row as a PDF
I am completely new to Visual Basic as a coding language coming mainly from Java and SQL but rusty in both of them
in current role I am working with more Microsoft Access Databases but I need to ...
1
vote
1
answer
97
views
How do I speed up pdf2docx?
Right now, I have very simple code which takes a pdf, and turns it into a docx file at the same path with the same name.
However, with larger pdfs over 100 pages, it's taking over a minute to convert. ...
Tooling
1
vote
3
replies
76
views
Optimized convertion SVG in to PDF
I have a hundred SVG files that I want to distribute. To make it easier, I converted them into a single PDF. However, since the SVG files define some paths that are used many times, during the ...
Tooling
0
votes
1
replies
71
views
How to add interactive components like button/input on top of PDF in Angular?
I am trying to create a UI when user click 'Sign' button, it will show a pop up modal with a PDF content display. I want to add a button on the sign field of PDF, if user click on it a sign pad appear ...
2
votes
1
answer
92
views
How can I make pdfcrowd script to save on Windows Downloads Folder instead of the site folder [closed]
I am trying to create a page with pdfcrowd, which works well. The issue I have is that it saves the created PDF file inside the site folder.
I want it to save to the Windows Downloads Folder instead. ...
0
votes
0
answers
92
views
Cloudinary error: "Customer is marked as untrusted" or "Blocked for delivery" when accessing uploaded PDF files
I’m using Cloudinary to upload documents (PDFs, ZIPs, etc.) from my Next.js app.
The upload succeeds, but when I try to access the file via URL like:
In the Cloudinary Media Library, it says:
Access ...
3
votes
1
answer
112
views
"Cannot access a closed Stream" when trying to return PDF from controller using iText
I'm trying to copy pages from an existing PDF to a new one and return it. I'm writing to the new PDF with memory, but it returns an error stating
Cannot access a closed Stream.
Here is my code:
var ...
3
votes
1
answer
228
views
How to extract table from PDF with boxes into pandas dataframe
I have code that detects a table in a PDF that appears after a specific section, and parses the information in the table and copies it into a pandas dataframe.
Now, I want to indicate whether a box is ...
-1
votes
1
answer
114
views
Swift / CoreGraphics: semicircular arc draws correctly but arrow pointer position/rotation doesn't match BMI value in exported PDF
I'm drawing a semicircular BMI gauge in a PDF using UIGraphicsPDFRenderer and UIBezierPath. The colored arc and labels render correctly, but the arrow pointer (which should point to the arc position ...
Best practices
1
vote
3
replies
74
views
How can I print very thin lines (e. g. 1/300 dpi) to PDF via Chrome? (Prevent rounding to full css-px)
I would like to print a 1/300 inch wide line with chrome (PDF). Unfortunately, the line is rounded up to a value of a css-pixel.
(It has to be Chrome [not PRINCE XML], due to some advanced features…)
2
votes
0
answers
120
views
Applying JavaScript formatting to IronPDF form using PyMuPDF
I am using IronPdf (2025.6.1.5) to create fillable PDF forms. This involves creating HTML forms and converting them to PDFs with IronPDF. I am post-processing the PDFs with PyMuPDF (1.26.4) to apply ...
0
votes
1
answer
90
views
Can't find elements when a pdf is opened in tab (chromedriver)
When I open a PDF file via a normal link (e.g., https://unec.edu.az/application/uploads/2014/12/pdf-sample.pdf), Chrome opens the PDF in the same tab with Chrome's integrated PDF viewer component.
In ...
1
vote
1
answer
114
views
Problems creating a PDF with all truetype fonts
I'm working on a program to create a PDF of all TrueType fonts. The program causes an error relating to the Tooth&Nail.ttf font which prevents the PDF from being created. Other fonts produce ...
0
votes
0
answers
140
views
PDF field misalignment between React-PDF editor and final signed PDF (fields shift down by one field height per placement)
I'm building a custom React-based PDF editor and signer using react-pdf and pdf-lib.
The editor (PdfEditor.tsx) lets users drag-and-drop fields (Signature, Email, Date, etc.) on a PDF canvas.
Later, ...
0
votes
0
answers
81
views
Remove signature and add picture of signature instead
I need to remove digital signatures from a PDF, but "print" the signature to the content beforehand (like print to "Microsoft PDF Printer" - print all Metadata and Signatures). In ...
1
vote
2
answers
201
views
How to generate bookmarks in a PDF?
I've been using Playwright to generate a document from HTML code with a table of content corresponding to the H1-6 tags I'm using. I was hoping that bookmarks in the PDF would be generated from those ...
1
vote
1
answer
151
views
PDF link opens to exact page on Windows but not on mobile or tablet
I’m using a link to open a PDF file hosted on SharePoint (or any other web location) and I want it to open directly at a specific page.
For example:
https://example.com/myfile.pdf#page=5
On Windows ...