The first step would be to extract the content.
This can be done via TET: http://www.pdflib.com/download/tet/
Or you can convert to a Word document->change the file extension to zip->unzip->find the right xml files->gather data
With PDF2TXT you can convert PDF file to text. Convert PDF to plain text PDF2TXT converts Adobe Acrobat PDF to plain text. It provides batch converting of Adobe Acrobat PDF files to plain text. PDF2TXT is easy to use for converting Adobe Acrobat PDF files into plain text. Read more... Batch converting of PDFs You
It can convert a PDF to text. (Other tools like this exist too!, search for 'PDF to TEXT conversion')
Is it possible to use this converter when clicking the upload button? example: I choose a pdf file for upload and click the button. When i click it this would convert the pdf to text and store it in a database field.
-Choose file (done)
-Click upload (done)
-Onclick run program and convert selected file (trickiest part)
-Save text to a variable (tricky part)
-Save variable content to a field in DB (easy enough)
1) Break down your problem in to small parts (you've already done that)
2) Do not start to give the first task the status 'trickiest part', end the last 'easy enough'. You do not seem to have a clue how to solve the 'Onclick run program and convert selected file', so stop claiming that 'Save variable content to a field in DB ' is easy. It can be easy, but what help is this if you do not know how to do the earlier part?
What program did you find to convert the selected file?
Please share the code you have, someone reading this will give tips on how to proceed....
Comment