Re: python regex character group matches

**Steven D'Aprano** · Sep 17 '08, 03:05 PM

On Wed, 17 Sep 2008 15:56:31 +0200, Fredrik Lundh wrote:

> Assuming that you want to find runs of \uXXXX escapes, simply use
> non-capturing parentheses:
>
>     pat = re.compile(u"(?:\\\u[0-9A-F]{4})")

Doesn't work for me:

    >>> pat = re.compile(u"(?:\\\u[0-9A-F]{4})")
    UnicodeDecodeError: 'unicodeescape' codec can't decode bytes in position 5-7: truncated \uXXXX escape

Assuming that the OP is searching byte strings, I came up with this:

    >>> pat = re.compile('(\\\u[0-9A-F]{4})+')
    >>> pat.search('abcd\\u1234\\uAA99\\u0BC4efg').group(0)
    '\\u1234\\uAA99\\u0BC4'

--
Steven
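For readers on Python 3, the failure and the byte-string workaround above can be reproduced roughly as follows. This is only a sketch: the names `demo_source`, `literal_error`, `byte_pat`, and `run` are made up for the illustration, and the pattern is written as a raw bytes literal (an assumption, not what the post used) because Python 3 warns about `\u` inside a plain bytes literal.

```python
import re

# The literal from the post, held as source text so this file itself parses.
# In a text (unicode) literal, "\\" is one backslash and "\u[0-9" is then
# read as an incomplete \uXXXX escape.
demo_source = r'''pat = u"(?:\\\u[0-9A-F]{4})"'''

try:
    compile(demo_source, "<demo>", "exec")
    literal_error = None
except SyntaxError as exc:
    # Python 3 reports the truncated escape when the source is compiled.
    literal_error = exc

# Byte-string search, as in the post: a capturing group repeated with "+".
byte_pat = re.compile(rb"(\\u[0-9A-F]{4})+")
match = byte_pat.search(rb"abcd\u1234\uAA99\u0BC4efg")

run = match.group(0)    # the whole run of escapes
last = match.group(1)   # a repeated capturing group keeps only its last repetition
```

Here `group(0)` gives the full run, which is why the capturing version still produces the result shown in the post.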

**Fredrik Lundh** · Sep 17 '08, 03:25 PM

Steven D'Aprano wrote:

>> Assuming that you want to find runs of \uXXXX escapes, simply use
>> non-capturing parentheses:
>>
>>     pat = re.compile(u"(?:\\\u[0-9A-F]{4})")
>
> Doesn't work for me:
>
> >>> pat = re.compile(u"(?:\\\u[0-9A-F]{4})")

it helps if you cut and paste the right line... here's a better version:

    pat = re.compile(r"(?:\\u[0-9A-F]{4})+")

</F>
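A short sketch of how the raw-string version behaves, assuming Python 3 and a made-up sample string (`text`, `non_capturing`, `capturing`, `runs`, and `last_only` are names invented for this example). It also shows why the non-capturing form matters when `findall` is used:

```python
import re

# Sample input containing two runs of \uXXXX escapes (raw string, so the
# backslashes are literal characters rather than escapes).
text = r"abcd\u1234\uAA99\u0BC4efg\u00FFxyz"

non_capturing = re.compile(r"(?:\\u[0-9A-F]{4})+")
capturing = re.compile(r"(\\u[0-9A-F]{4})+")

# With no capturing group, findall returns each whole match, i.e. each run.
runs = non_capturing.findall(text)

# With a capturing group, findall returns the group instead, and a repeated
# group holds only its final repetition.
last_only = capturing.findall(text)

print(runs)
print(last_only)
```

With the capturing pattern, `search(...).group(0)` would still return the full run; the difference only shows up in calls such as `findall` that report groups rather than the whole match.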
