regex and dates

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • DevInCode
    New Member
    • Apr 2008
    • 55

    regex and dates

    Hello,
    I'm working on a project that involves parsing dates out of HTML files (text files, for all intents and purposes).

    The problem is the date format and placement in the page isn't consistent. I managed to work around this for the first batch of files, but the second batch is in French and has even more formats.
    For example
    Le 5 Janvier 2006
    1er Fevrier 2009
    10 Fevrier, 2009
    Fevrier 10, 2009

    And so on. There are accented characters but I can replace them with their unaccented counterpart.

    Efficiency doesn't matter as once all the files have the dates extracted, the program won't be used again.

    Any suggestions?
  • Stang02GT
    Recognized Expert Top Contributor
    • Jun 2007
    • 1206

    #2
    This appears to be an HTML question, so I am going to move this over to the HTML/CSS forum so it will get the proper attention.

    Comment

    • Markus
      Recognized Expert Expert
      • Jun 2007
      • 6092

      #3
      Originally posted by Stang02GT
      This appears to be an HTML question, so I am going to move this over to the HTML/CSS forum so it will get the proper attention.
      Since when could html parse a webpage? ;]

      Comment

      • gits
        Recognized Expert Moderator Expert
        • May 2007
        • 5390

        #4
        certainly not a HTML/CSS issue ... what language do you want to use/prefer or what should be the output?

        kind regards

        Comment

        • acoder
          Recognized Expert MVP
          • Nov 2006
          • 16032

          #5
          Moved back to Misc. Questions until further details provided.

          Moderator.

          Comment

          Working...