extracting text from pdf files

  • runner7@fastmail.fm

    extracting text from pdf files

    Can anyone help me with how to extract text from pdf files using PHP or
    ColdFusion? Thanks for any help.

  • petersprc@gmail.com

    #2
    Re: extracting text from pdf files

    Hi,

    Try the Xpdf project. Run the pdftotext command in the shell to produce
    the text.
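
As a rough sketch of the pdftotext step, the shell function below wraps a single call and writes the text next to the PDF. The function name pdf_to_text and the PDFTOTEXT override variable are assumptions made for illustration; only the pdftotext command and its -layout option come from Xpdf itself.

```shell
# Hypothetical helper: convert one PDF into a .txt file beside it.
# PDFTOTEXT is an assumed override; by default the pdftotext binary is used.
pdf_to_text() {
    pdf="$1"
    txt="${pdf%.pdf}.txt"            # report.pdf -> report.txt
    # -layout asks pdftotext to keep the original physical layout.
    "${PDFTOTEXT:-pdftotext}" -layout "$pdf" "$txt" || return 1
    printf '%s\n' "$txt"             # print the path of the text file
}
```

Calling pdf_to_text report.pdf (a made-up file name) would leave the text in report.txt and print that path.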



    There are more tips at php.net/pdf.

    runner7@fastmail.fm wrote:
    Can anyone help me with how to extract text from pdf files using PHP or
    ColdFusion? Thanks for any help.


    • runner7

      #3
      Re: extracting text from pdf files

      petersprc@gmail.com wrote:
      Hi,

      Try the Xpdf project. Run the pdftotext command in the shell to produce
      the text.

      There are more tips at php.net/pdf.

      runner7@fastmail.fm wrote:
      Can anyone help me with how to extract text from pdf files using PHP or
      ColdFusion? Thanks for any help.

      I really appreciate this lead, thanks, but can I do this all
      programmatically without having to manually use a command line? I need
      to process hundreds of pdf files to text and then extract what I need
      from them.


      • Toby Inkster

        #4
        Re: extracting text from pdf files

        runner7 wrote:
        I really appreciate this lead, thanks, but can I do this all
        programmatically without having to manually use a command line? I need
        to process hundreds of pdf files to text and then extract what I need
        from them.

        Use PHP's system() function to run the command from your script.
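
For hundreds of files, the same pdftotext call can be looped. The sketch below is plain shell under assumed names (convert_all_pdfs and the PDFTOTEXT override are hypothetical); from PHP, system() could run pdftotext once per file in the same way, or launch a script like this one.

```shell
# Hypothetical batch helper: convert every *.pdf in a directory to .txt.
# PDFTOTEXT is an assumed override; by default the pdftotext binary is used.
convert_all_pdfs() {
    dir="$1"
    count=0
    for pdf in "$dir"/*.pdf; do
        [ -e "$pdf" ] || continue    # the glob matched nothing
        if "${PDFTOTEXT:-pdftotext}" "$pdf" "${pdf%.pdf}.txt"; then
            count=$((count + 1))
        else
            printf 'failed: %s\n' "$pdf" >&2   # report and keep going
        fi
    done
    printf '%s\n' "$count"           # number of files converted
}
```

A failed conversion is reported on stderr and skipped, so one damaged PDF should not stop the rest of the batch.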

        --
        Toby A Inkster BSc (Hons) ARCS
        Contact Me ~ http://tobyinkster.co.uk/contact


        • Thomas Merz

          #5
          Re: extracting text from pdf files

          runner7@fastmail.fm wrote:
          Can anyone help me with how to extract text from pdf files using PHP or
          ColdFusion? Thanks for any help.

          Our TET product extracts the text from PDF files. It includes a
          programming interface for PHP (and other languages), so you can fetch
          the text (along with coordinates, font, etc.) directly from your PHP
          script. A free evaluation version is available on our Web site.

          Thomas

          _______________________________________________________________
          Thomas Merz tm@pdflib.com http://www.pdflib.com
          PDFlib 7: Create PDF/A for archiving, format tables, and more!
          _______PDFlib - a library for generating PDF on the fly________
