yEnc implementation in Python, bit slow

**Oren Tirosh** · Jul 18 '05, 01:17 AM

Re: yEnc implementation in Python, bit slow

On Tue, Aug 05, 2003 at 12:50:58AM +1000, Freddie wrote:[color=blue]
> Hi,
>
> I posted a while ago for some help with my word finder program, which is now
> quite a lot faster than I could manage. Thanks to all who helped :)
>
> This time, I've written a basic batch binary usenet poster in Python, but
> encoding the data into yEnc format is fairly slow. Is it possible to improve
> the routine any, WITHOUT using non-standard libraries? I don't want to have
> to rely on something strange ;)[/color]

Python is pretty quick as long as you avoid loops that operate character
by character. Try to use functions that operate on longer strings.

Suggestions:

For the (x+42)%256 build a translation table and use str.translate.
To encode characters as escape sequences use str.replace or re.sub.

Oren

**Freddie** · Jul 18 '05, 01:17 AM

Re: yEnc implementation in Python, bit slow

Oren Tirosh <oren-py-l@hishome.net> wrote in
news:mailman.10 60033689.18067. python-list@python.org :
[color=blue]
> Suggestions:
>
> For the (x+42)%256 build a translation table and use str.translate.
> To encode characters as escape sequences use str.replace or re.sub.
>
> Oren[/color]

Aahh. I couldn't work out how to use translate() at 4am this morning, but I
worked it out now :) This version is a whoooole lot faster, and actually
meets the yEnc line splitting spec. Bonus!

$ python2.3 testyenc.py
yEncode1 407682 1.98
yEncode2 407707 0.18

I'm not sure how to use re.sub to escape the characters, I assume it would
also be 4 seperate replaces? Also, it needs a slightly more random input
string than 'a' * 400000, so here we go.

test = []
for i in xrange(256):
test.append(chr (i))
teststr = ''.join(test*15 62)

def yEncode2(data):
trans = ''
for i in range(256):
trans += chr((i+42)%256)

translated = data.translate( trans)

# escape =, NUL, LF, CR
for i in (61, 0, 10, 13):
j = '=%c' % (i + 64)
translated = translated.repl ace(chr(i), j)

encoded = []
n = 0
for i in range(0, len(translated) , 256):
chunk = translated[n+i:n+i+256]
if chunk[-1] == '=':
chunk += translated[n+i+256+1]
n += 1
encoded.append( chunk)
encoded.append( '\n')

result = ''.join(encoded )

print len(result),
return result

--
-----------------------------------------------------------
Remove the oinks!

**Freddie** · Jul 18 '05, 01:17 AM

Re: yEnc implementation in Python, bit slow

Freddie <oinkfreddie@oi nkshlick.oinkne t> wrote in
news:Xns93CE8D8 1747C5freddieth escaryeleph@218 .100.3.9:

Arr. There's an error here, the [n+i+256+1] shouldn't have a 1. I always get
that wrong :) The posted files actually decode now, and the yEncode()
overhead is a lot lower.

<snip>
[color=blue]
> encoded = []
> n = 0
> for i in range(0, len(translated) , 256):
> chunk = translated[n+i:n+i+256]
> if chunk[-1] == '=':
> chunk += translated[n+i+256] <<< this line
> n += 1
> encoded.append( chunk)
> encoded.append( '\n')[/color]

--
Remove the oinks!

yEnc implementation in Python, bit slow

yEnc implementation in Python, bit slow

Comment

Comment

Comment