I have a PDF file that contains a table, e.g. employee (empID, empName, Title). I want to convert this PDF file to Excel and then load that table from Excel into a DataTable in my code.

  • Stop asking for quick responses from complete strangers. The fact that something is urgent to you does not make it so for us. Commented Nov 7, 2010 at 13:43
  • I have tried parsing the PDF with abcpdf.net, and it converts the PDF to a text file, but the result is unstructured because my PDF file contains multiple tables. So I thought of converting the PDF to an Excel file and then working with the Excel file in my code. Commented Nov 7, 2010 at 14:20
  • Can you post an example of the PDF you are trying to extract from? Then we may be able to give you some extra clues. Commented Nov 8, 2010 at 5:04

1 Answer

If your file was created with Structured Content in it, then it may be possible to extract all the data as an XML file and then import that XML into Excel.

Otherwise, you are pretty much left with a bunch of text blocks, and there is probably nothing you can do about it.

For more information, check the great article about PDF text on JPedal's blog.
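
For the Structured Content route, the XML-to-spreadsheet step might look roughly like the sketch below. It is only an illustration under assumptions: the XML layout (`Table`, `TR`, `TH`, `TD` elements, named after the standard PDF structure types) and the sample rows are hypothetical, and the real export from your tool may be shaped differently. The helper names `table_rows` and `rows_to_csv` are made up for this example. It writes CSV, which Excel can open directly.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical XML standing in for what a tagged-PDF export might produce.
# The element names and the sample data are assumptions for illustration.
SAMPLE_XML = """<Document>
  <Table>
    <TR><TH>empID</TH><TH>empName</TH><TH>Title</TH></TR>
    <TR><TD>1</TD><TD>Alice</TD><TD>Engineer</TD></TR>
    <TR><TD>2</TD><TD>Bob</TD><TD>Analyst</TD></TR>
  </Table>
</Document>"""


def table_rows(xml_text):
    """Return every TR element as a list of cell strings (header row included)."""
    root = ET.fromstring(xml_text)
    rows = []
    for tr in root.iter("TR"):
        # Keep only header and data cells, in document order.
        cells = [(cell.text or "").strip() for cell in tr if cell.tag in ("TH", "TD")]
        rows.append(cells)
    return rows


def rows_to_csv(rows):
    """Serialize rows to CSV text that Excel can open."""
    buf = io.StringIO()
    csv.writer(buf, lineterminator="\n").writerows(rows)
    return buf.getvalue()


if __name__ == "__main__":
    print(rows_to_csv(table_rows(SAMPLE_XML)))
```

Note that `root.iter("TR")` collects rows from every table in the file, so a PDF with multiple tables would need the rows grouped per `Table` element instead.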

1 Comment

Unfortunately, it is an unstructured PDF and I can't convert it to XML.
