PDF content to DB

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • DreamOn
    New Member
    • Jun 2015
    • 6

    PDF content to DB

    Hi guys. Is there anyway so that when i upload a PDF file to the DB i store its content on a DB field?

    Thanks in advance ;)
  • computerfox
    Contributor
    • Mar 2010
    • 276

    #2
    The first step would be to extract the content.
    This can be done via TET: http://www.pdflib.com/download/tet/
    Or you can convert to a Word document->change the file extension to zip->unzip->find the right xml files->gather data

    Then you can insert into your database.

    Comment

    • DreamOn
      New Member
      • Jun 2015
      • 6

      #3
      Could you explain how the conversion works?

      Edit: As i read there TET only works for PDF with max 10 pages and 1MB. Or not but i need to pay for that :/

      Comment

      • Luuk
        Recognized Expert Top Contributor
        • Mar 2012
        • 1043

        #4
        I suggest to take a look at this site:
        With PDF2TXT you can convert PDF file to text. Convert PDF to plain text PDF2TXT converts Adobe Acrobat PDF to plain text. It provides batch converting of Adobe Acrobat PDF files to plain text. PDF2TXT is easy to use for converting Adobe Acrobat PDF files into plain text. Read more... Batch converting of PDFs You


        It can convert a PDF to text. (Other tools like this exist too!, search for 'PDF to TEXT conversion')

        Comment

        • DreamOn
          New Member
          • Jun 2015
          • 6

          #5
          Is it possible to use this converter when clicking the upload button? example: I choose a pdf file for upload and click the button. When i click it this would convert the pdf to text and store it in a database field.

          Comment

          • DreamOn
            New Member
            • Jun 2015
            • 6

            #6
            Basically it would be:

            -Choose file (done)
            -Click upload (done)
            -Onclick run program and convert selected file (trickiest part)
            -Save text to a variable (tricky part)
            -Save variable content to a field in DB (easy enough)

            Comment

            • Luuk
              Recognized Expert Top Contributor
              • Mar 2012
              • 1043

              #7
              I see a lot 'trickiest part'

              1) Break down your problem in to small parts (you've already done that)
              2) Do not start to give the first task the status 'trickiest part', end the last 'easy enough'. You do not seem to have a clue how to solve the 'Onclick run program and convert selected file', so stop claiming that 'Save variable content to a field in DB ' is easy. It can be easy, but what help is this if you do not know how to do the earlier part?

              What program did you find to convert the selected file?

              Please share the code you have, someone reading this will give tips on how to proceed....

              Comment

              Working...