How to extract the whole table content from pdf file in C#?

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • Puneeth kamath
    New Member
    • Dec 2010
    • 10

    How to extract the whole table content from pdf file in C#?

    How to extract the whole table content from pdf file in C#?
  • Leito
    New Member
    • Apr 2010
    • 58

    #2
    Look around the PDFBox solution. It is a DLL that you have to import in your project references.

    See some help [here] for text extraction.
    Download last version of PDFBox [here].

    Post again if you need further help ;)

    Comment

    • Puneeth kamath
      New Member
      • Dec 2010
      • 10

      #3
      thanks for the reply sir,can we do it by using pdfclown library?

      Comment

      • Puneeth kamath
        New Member
        • Dec 2010
        • 10

        #4
        I saw that pdfbox its in java,but i want it in C# to capture the table content from pdf file by using pdfclown as a library.

        Comment

        • Leito
          New Member
          • Apr 2010
          • 58

          #5
          First, you're right, PDFBox is a Java library. But it can be used in order to read .pdf files with C#.

          The complete, simply and well documented solution to use PDFBox can be found here.
          I'm sure it will help you to understand how it works. ;)

          ---
          I don't know the pdfClown library, so I won't be able to help you on this solution.

          Comment

          Working...