User Profile

s2krish
Last Activity: Jun 8 '10, 05:18 AM
Joined: Jun 7 '10
  • s2krish
    replied to accented characters to unaccented
    Hi,

    Thanks for your reply. Let me elaborate on the problem:

    I used the urllib module to open and read a website. The script looks like this:
    # Python 2: urlopen lives directly in the urllib module
    import urllib
    txt = urllib.urlopen("http://www.terme-catez.si").read()
    txt

    This gives a result like the one below:
    (... some of the output is skipped ...)
    Code:
    <div class="noga">\r\n    <p>\r\n      Vse gradivo\r\n      &copy; 1999-\r\n
    ...
    Last edited by Dormilich; Jun 8 '10, 04:05 PM. Reason: Please use [code] tags when posting code
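
The escaped output in the reply above is what raw, undecoded bytes look like when echoed in the interpreter. A minimal sketch of fetching a page and decoding it to text, assuming Python 3 (where `urlopen` lives in `urllib.request`, unlike the Python 2 script in the post); the helper names `decode_body` and `fetch_text` and the UTF-8 fallback are illustrative assumptions, not part of the original post:

```python
from urllib.request import urlopen


def decode_body(raw, charset=None):
    """Decode raw response bytes to text.

    Falls back to UTF-8 when no charset is given (an assumption);
    undecodable bytes are replaced rather than raising an error.
    """
    return raw.decode(charset or "utf-8", errors="replace")


def fetch_text(url):
    """Fetch a page and return its body as decoded text."""
    with urlopen(url) as response:
        # Charset from the Content-Type header, or None if absent.
        charset = response.headers.get_content_charset()
        return decode_body(response.read(), charset)


# Usage (needs network access):
# text = fetch_text("http://www.terme-catez.si")
```

If the server declares no charset, the page's own meta tag may say something different from UTF-8, so the fallback here is only a guess.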

  • s2krish
    started a topic accented characters to unaccented

    accented characters to unaccented

    Is there a Python library or function to convert accented characters to unaccented ones? For example:

    From 'Terme \xc4\x8cate\xc5\xbe' to 'Terme Čatež'.

    When I read the Terme Čatež website using the urllib.urlopen() function, it gives 'Terme \xc4\x8cate\xc5\xbe'.
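
The question mixes two steps: decoding the UTF-8 bytes into 'Terme Čatež', and then optionally dropping the accents. A minimal sketch of both using only the standard library, assuming Python 3; the function name `strip_accents` is a hypothetical helper, not something from the original post:

```python
import unicodedata


def strip_accents(text):
    """Remove combining marks after canonical decomposition (NFD)."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


raw = b"Terme \xc4\x8cate\xc5\xbe"   # bytes as returned by read()
decoded = raw.decode("utf-8")        # 'Terme Čatež'
plain = strip_accents(decoded)       # 'Terme Catez'
```

This only handles letters that decompose into a base letter plus a combining mark; characters such as 'ø' or 'ß' have no such decomposition and pass through unchanged.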