Is there any library that can extract titles from PDFs?

Collapse
This topic is closed.
X
X
 
  • Time
  • Show
Clear All
new posts
  • ZelluX

    Is there any library that can extract titles from PDFs?

    Hi, all

    I want to write a script which will rename PDFs according to their
    titles. I want to know if there is any library that can extract
    titles(the first line of the PDF) from PDFs.

    Many thanks for your reply ;-)
  • Fredrik Lundh

    #2
    Re: Is there any library that can extract titles from PDFs?

    ZelluX wrote:
    I want to write a script which will rename PDFs according to their
    titles. I want to know if there is any library that can extract
    titles(the first line of the PDF) from PDFs.
    Mathieu Fenniak's PyPdf should be able to do this:



    (but note that "the first line of the PDF" may not be the first thing
    that's rendered on the page, so don't expect such an approach to work
    for all files)

    </F>

    Comment

    Working...