How to extract the whole table content from pdf file in C#?

**Leito** · Feb 8 '11, 10:47 AM

Look around the PDFBox solution. It is a DLL that you have to import in your project references.

See some help [here] for text extraction.
Download last version of PDFBox [here].

Post again if you need further help ;)

**Puneeth kamath** · Feb 8 '11, 11:10 AM

thanks for the reply sir,can we do it by using pdfclown library?

**Puneeth kamath** · Feb 8 '11, 11:19 AM

I saw that pdfbox its in java,but i want it in C# to capture the table content from pdf file by using pdfclown as a library.

**Leito** · Feb 9 '11, 03:27 PM

First, you're right, PDFBox is a Java library. But it can be used in order to read .pdf files with C#.

The complete, simply and well documented solution to use PDFBox can be found here.
I'm sure it will help you to understand how it works. ;)

---
I don't know the pdfClown library, so I won't be able to help you on this solution.

How to extract the whole table content from pdf file in C#?

How to extract the whole table content from pdf file in C#?

Comment

Comment

Comment

Comment