Problem with processing XML

**Paul McGuire** · Jan 22 '08, 03:05 PM

Re: Problem with processing XML

On Jan 22, 8:11 am, John Carlyle-Clarke <j...@nowhere.o rgwrote:

Hi.
>
I'm new to Python and trying to use it to solve a specific problem. I
have an XML file in which I need to locate a specific text node and
replace the contents with some other text. The text in question is
actually about 70k of base64 encoded data.
>

Here is a pyparsing hack for your problem. I normally advise against
using literal strings like "<value>" to match XML or HTML tags in a
parser, since this doesn't cover variations in case, embedded
whitespace, or unforeseen attributes, but your example was too simple
to haul in the extra machinery of an expression created by pyparsing's
makeXMLTags.

Also, I don't generally recommend pyparsing for working on XML, since
there are so many better and faster XML-specific modules available.
But if this does the trick for you for your specific base64-removal
task, great.

-- Paul

# requires pyparsing 1.4.8 or later
from pyparsing import makeXMLTags, withAttribute, keepOriginalTex t,
SkipTo

xml = """
... long XML string goes here ...
"""

# define a filter that will key off of the <datatag with the
# attribute 'name="PctShow. Image"', and then use suppress to filter
the
# body of the following <valuetag
dataTag = makeXMLTags("da ta")[0]
dataTag.setPars eAction(withAtt ribute(name="Pc tShow.Image"),
keepOriginalTex t)

filter = dataTag + "<value>" + SkipTo("</value>").suppre ss() + "</
value>"

xmlWithoutBase6 4Block = filter.transfor mString(xml)
print xmlWithoutBase6 4Block

**Alnilam** · Jan 22 '08, 04:15 PM

Re: Problem with processing XML

On Jan 22, 9:11 am, John Carlyle-Clarke <j...@nowhere.o rgwrote:

By the way, is pyxml a live project or not? Should it still be used?
It's odd that if you go tohttp://www.python.org/and click the link
"Using python for..." XML, it leads you tohttp://pyxml.sourcefor ge.net/topics/
>
If you then follow the download links tohttp://sourceforge.net/project/showfiles.php?g roup_id=6473you see that
the latest file is 2004, and there are no versions for newer pythons.
It also says "PyXML is no longer maintained". Shouldn't the link be
removed from python.org?

I was wondering that myself. Any answer yet?

**John Carlyle-Clarke** · Jan 22 '08, 07:55 PM

Re: Problem with processing XML

Paul McGuire wrote:

>
Here is a pyparsing hack for your problem.

Thanks Paul! This looks like an interesting approach, and once I get my
head around the syntax, I'll give it a proper whirl.

Problem with processing XML

Problem with processing XML

Comment

Comment

Comment