Parsing PDF to Excel using C#

Question

I have a PDF file contains table for eg employee (empID, empName, Title). I want to parse these pdf file to excel and parsing that table in Excel to datatable in my code.

Stop asking for quick responses from complete strangers. The fact the something is urgent to you does not make it so for us. — Oded
– Oded, Commented Nov 7, 2010 at 13:43
i have tried to parse pdf using abcpdf.net ,,and it gives me a conversion of pdf to text file but unstructured because my pdf file contains multiple tables,,then i have a thought of converting pdf to excel file then dealing with excel file in my code — hatem
– hatem, Commented Nov 7, 2010 at 14:20
Can you post an example PDF that you are trying to extract from and then we may be able to give you some extra clues ? — Andrew Cash
– Andrew Cash, Commented Nov 8, 2010 at 5:04

Bobrovsky · Accepted Answer · 2010-11-07 19:53:23Z

1

If your file was created with Structured Content in it, then it may be possible to extract all the data as XML file and then import XML into Excel.

Otherwise, you pretty much left with bunch of text blocks and there is probably nothing you can do about it.

For more information check great article about PDF Text in JPedal's blog.

answered Nov 7, 2010 at 19:53

Bobrovsky

14.3k20 gold badges89 silver badges138 bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

hatem Over a year ago

unfortunatly it is unstructured pdf and i cant convert it to xml

Collectives™ on Stack Overflow

Parsing PDF to Excel using C#

1 Answer 1

1 Comment

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

1 Comment

Your Answer

Sign up or log in

Post as a guest

Related