User Profile

moconno5 · Aug 9 '07, 01:35 AM

Thanks for the guidance. My question is: why are you creating a second string (s1) and then running the pattern and doing the replacement on that string? I'm still working on understanding regexes.

Mark

moconno5 · Aug 6 '07, 05:28 AM

That has done the trick! Thanks for all of the help, I didn't even know about Python having a regex module. Still wondering why the triple quotes, but I will go to python.org and read up on it.

Mark
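
On the triple quotes: a triple-quoted string may span several lines, and with re.VERBOSE the regex engine ignores whitespace and "#" comments inside the pattern, so the two are often used together. A minimal sketch, assuming the same two name patterns used in this thread (the sample line is made up for illustration and is not from the real data file):

```python
import re

# Triple quotes let the pattern span several lines; re.VERBOSE makes the
# engine skip the whitespace and treat "# ..." inside the pattern as comments.
patt = re.compile(r'''
    \bMusmusculuslet-[0-9a-z]+\b       # let-* names
    |                                  # or
    \bMusmusculusmiR-[0-9a-z\-]+\b     # miR-* names
    ''', re.VERBOSE)

# Hypothetical sample line, used only to exercise the pattern.
sample = "browser details MusmusculusmiR-146b 22 1 22"
found = patt.findall(sample)
print(found)
```

Written on one line, the pattern behaves the same; the multi-line form is only easier to read and annotate.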

moconno5 · Aug 6 '07, 02:57 AM

I re-copied and re-pasted the code, and it is working much better now. The program is no longer splitting the keys, but it is pasting multiple values back-to-back instead of next to the key for multiple matches:

browser details MusmusculusmiR-450b1 AUUGGGAACAUUUUGCAUGCAU AUUGGGAACAUUUUGCAUGCAU 20 1 22 22 95.5% Un.003.104 - 440337 440358 22
browser details MusmusculusmiR-450b1 20 1 22 ...

moconno5 · Aug 6 '07, 01:35 AM

Okay, now I am getting a new error:

Traceback (most recent call last):
  File "<pyshell#28>", line 1, in <module>
    newfile = EditFile ( data, mouse )
  File "BatchEditor.py", line 45, in EditFile
    patt = re.compile(r''' \bMusmusculuslet-[0-9a-z]+\b+|\bMusmusculusmiR-[0-9a-z\-]+\b''', re.VERBOSE)
  File "C:\Python25\lib\re.py", line 180, in compile...
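
The traceback is cut off before the error message, so the cause is not certain. One likely candidate is the `\b+` in the first alternative: `\b` matches a position rather than a character, and Python's re module refuses to put a quantifier on it. A small sketch to check this, using the pattern from the traceback next to a variant without the `+` (the variable names are illustrative):

```python
import re

# Pattern as quoted in the traceback, and a variant with "\b+" changed to "\b".
bad = r'\bMusmusculuslet-[0-9a-z]+\b+|\bMusmusculusmiR-[0-9a-z\-]+\b'
good = r'\bMusmusculuslet-[0-9a-z]+\b|\bMusmusculusmiR-[0-9a-z\-]+\b'

try:
    re.compile(bad)
    bad_error = None
except re.error as exc:
    # A quantifier applied to a zero-width boundary is rejected at compile time.
    bad_error = str(exc)

patt = re.compile(good)
names = patt.findall(">Musmusculuslet-7g and MusmusculusmiR-1")
print(bad_error)
print(names)
```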

moconno5 · Aug 6 '07, 01:27 AM

Here is the code I am currently using; I am still getting the same problem:

Code:

import re

def EditFile(s1, dd):
    print(dd)
    # \b is a zero-width boundary and cannot take a '+' quantifier,
    # so the first alternative ends in \b rather than \b+.
    patt = re.compile(r'''\bMusmusculuslet-[0-9a-z]+\b|\bMusmusculusmiR-[0-9a-z\-]+\b''', re.VERBOSE)
    strList = patt.findall(s1)
    s2 = s1
    for item in strList:
        if item in dd:
            s2List = s2.split(' ')

...

moconno5 · Aug 5 '07, 11:45 AM

I tried your suggestion but received the same result. Is there a statement I could write that checks each line for a capital A, T, C, or G? If I could put that into an 'if' statement, then maybe it wouldn't re-format a line that has already been formatted. Of course, there would then be the problem of whether it replaced the name with Mus..R-1 or with Mus..R-106a, etc. Is there an order, or is it random because I am using a dictionary?

Mark
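
Both parts of the question above can be sketched in a few lines. This is only an illustration under stated assumptions: the threshold of 10 letters, the function name looks_formatted, and the two dictionary entries are all hypothetical, and U is included alongside T because the sample sequences in this thread are RNA.

```python
import re

# A line is treated as "already formatted" if it contains a run of 10 or
# more capital nucleotide letters. The threshold is an arbitrary assumption.
SEQ_RUN = re.compile(r'[ACGTU]{10,}')

def looks_formatted(line):
    return SEQ_RUN.search(line) is not None

# Hypothetical dictionary; the real keys and values come from the flat file.
mouse = {
    'MusmusculusmiR-1': 'UGGAAUGUAAAGAAGUAUGUA',
    'MusmusculusmiR-146b': 'UGAGAACUGAAUUCCAUAGGCU',
}

# Dictionary order should not be relied on for this. Sorting the keys
# longest-first makes the order explicit, so miR-146b is tried before miR-1.
keys_longest_first = sorted(mouse, key=len, reverse=True)
print(keys_longest_first)
```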

moconno5 · Aug 4 '07, 11:23 PM

Thanks for the help, ilikepython and bvdet. I'm running into only one problem: I am getting multiple matches for certain strings. For example, the key MusmusculusmiR-1 also matches MusmusculusmiR-146b, so I get the following output:

browser details MusmusculusmiR-1 UGGAAUGUAAAGAAGUAUGUA46b UGAGAACUGAAUUCCAUAGGCU 22 1 22 22 100.0% 26 - 20924724 20924745 22

when the original string is:
...
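
The miR-1 / miR-146b collision is what tends to happen when a key is matched as a plain substring. One possible way around it, sketched here under assumptions, is to match whole names with a regex and look each match up in the dictionary through a re.sub callback. The function name annotate, the NAME pattern, and the dictionary contents are hypothetical and not taken from the original code.

```python
import re

# Matches a whole name such as MusmusculusmiR-146b, never just a prefix of it.
NAME = re.compile(r'\bMusmusculus(?:let|miR)-[0-9a-z\-]+\b')

# Hypothetical dictionary standing in for the one built from the flat file.
mouse = {
    'MusmusculusmiR-1': 'UGGAAUGUAAAGAAGUAUGUA',
    'MusmusculusmiR-146b': 'UGAGAACUGAAUUCCAUAGGCU',
}

def annotate(line, table):
    """Insert the sequence after each whole name that is a key in table."""
    def repl(match):
        name = match.group(0)
        if name in table:
            return name + ' ' + table[name]
        return name                     # unknown names are left untouched
    return NAME.sub(repl, line)

line = "browser details MusmusculusmiR-146b 22 1 22 22 100.0%"
result = annotate(line, mouse)
print(result)
```

Because the lookup uses the complete matched name, the order of the dictionary keys no longer matters.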

moconno5 · Aug 3 '07, 09:20 PM

Thanks! That has done the trick

Mark

moconno5 · Aug 3 '07, 04:23 PM

Thanks for the replies.
My thinking was that when there is no data left in the string I am reading (which I read in from a file), then start would equal -1.

The sample data I am working with is:

>Musmusculuslet-7g
UGAGGUAGUAGUUUGUACAGU
>Musmusculuslet-7i
UGAGGUAGUAGUUUGUGCUGU
>MusmusculusmiR-1
UGGAAUGUAAAGAAGUAUGUA

I also practiced by using this:
...
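
For records laid out like the sample data in this post, str.find does return -1 once there is no further ">" to find, so it can drive the loop. A minimal sketch, assuming each record is a ">" header line followed by its sequence and that the spaces inside names and sequences are only forum display artifacts (the function name read_records is hypothetical):

```python
# Sample records in the same layout as the post, written without the
# spaces that the forum display appears to have inserted (an assumption).
data = """>Musmusculuslet-7g
UGAGGUAGUAGUUUGUACAGU
>Musmusculuslet-7i
UGAGGUAGUAGUUUGUGCUGU
>MusmusculusmiR-1
UGGAAUGUAAAGAAGUAUGUA
"""

def read_records(text):
    """Walk the text with str.find and stop when find returns -1."""
    records = {}
    start = text.find('>')
    while start != -1:
        name_end = text.find('\n', start)
        if name_end == -1:              # header line with no sequence after it
            break
        nxt = text.find('>', name_end)  # -1 when this is the last record
        seq_end = nxt if nxt != -1 else len(text)
        name = text[start + 1:name_end].strip()
        records[name] = text[name_end + 1:seq_end].strip()
        start = nxt
    return records

records = read_records(data)
print(records)
```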

moconno5 · Jul 29 '07, 12:22 AM

Thanks! That did the trick

Mark

moconno5 · Jul 25 '07, 02:16 AM

I have also modified my code and tried to send the data as a cookie, but I am getting error messages. Here is the code:

Code:

#!/usr/bin/env python
# written 7/22/2007
# by Mark O'Connor

import urllib
import urllib2
import Cookie

def ReadSite():
    # First, encode the data.
    infile = open('mouseguts1', 'r')
    data = infile.read()

...

moconno5 · Jul 24 '07, 10:39 PM

Thanks for the help. I have dropped the cgi module and have been playing with urllib, and I have come up with the following code:

Code:

#!/usr/bin/env python
# written 7/22/2007
# by Mark O'Connor

import urllib

def ReadSite():
    # First, encode the data.
    infile = open('mouseguts1', 'r')
    data = infile.read()
    #print "Here is your

...
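
The posted code is cut off, so what follows is not a reconstruction of it. In Python 3 the urllib pieces live in urllib.parse and urllib.request, and the encoding step might look like the sketch below. The URL is the hgBlat address mentioned in this thread; the form field name 'userSeq' and the function name build_request are assumptions for illustration. The request is only built here, not sent.

```python
from urllib.parse import urlencode
from urllib.request import Request

# hgBlat URL as mentioned in the thread.
url = "http://genome-test.cse.ucsc.edu/cgi-bin/hgBlat?command=start"

def build_request(sequence_text):
    """Encode the form data and wrap it in a POST request (not sent)."""
    form = {'userSeq': sequence_text}        # hypothetical field name
    body = urlencode(form).encode('ascii')   # percent-encode, then to bytes
    return Request(url, data=body)           # supplying data makes it a POST

req = build_request(">Musmusculuslet-7g\nUGAGGUAGUAGUUUGUACAGU\n")
print(req.get_method(), req.data)
# Sending it would be: urllib.request.urlopen(req).read()
```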

moconno5 · Jul 22 '07, 11:29 AM

Hello again, and thanks for the swift replies!

Actually, I am studying for a Master's in Bioinformatics at George Mason University in Virginia, but yes, the website I am using is located in Santa Cruz. I originally had the url as:

url = "http://genome-test.cse.ucsc.edu/cgi-bin/hgBlat?command=start"

But then I tried some modifications with no success and yes, I have been leaving...

String Replacement

Iterating over a string

string loop

Matching strings with a dictionary built from a flat file

Iterating over a file in python

urllib and urllib2 modules in Python