User Profile

haobijam · Nov 21 '10, 12:13 PM

Hello,

The output of this Python code is attached here, but line number 86 of the MINiML.xml file is not printed. This is an error. Please see the output.

Regards,
Haobijam...

haobijam · Nov 21 '10, 12:05 PM

Hello,

Thanks for your help. I have assembled and run the script, but there was an error while running it on the Platform section at line number 86 of the MINiML.xml file. When I remove this line and run the script, it correctly prints what we want in the output. The error in the output looks like this:

>>>

Traceback (most recent call last):
File "C:\Users\haojam\Desktop\GEO\GSE10006\test2.py",...

haobijam · Nov 19 '10, 10:02 AM

Dear,

Could you please tell me how I can parse the attributes and their values from the XML file (MINiML.txt)? I would like to print output like the following:

Contributor iid = "contrib1"
Person Yael Strulovici-Bare
Email yas2003@med.cornell.edu
Phone 646-962-5560
Laboratory Crystal
Department Department of Genetic Medicine
Organization Weill Cornell Medical College
Line 1300...
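
One way to approach the request above is with the standard-library `xml.etree.ElementTree` module. The following is only a minimal sketch: the `SAMPLE` document, its placeholder namespace, and the helper names `local_name` and `describe` are assumptions made for illustration, and a real MINiML file has more elements and deeper nesting than this flat example.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified sample with a placeholder namespace.
SAMPLE = """<MINiML xmlns="http://example.org/placeholder-namespace">
  <Contributor iid="contrib1">
    <Email>user@example.org</Email>
    <Phone>000-000-0000</Phone>
  </Contributor>
</MINiML>"""


def local_name(tag):
    """Drop the '{namespace}' prefix that ElementTree puts before tag names."""
    return tag.rsplit('}', 1)[-1]


def describe(xml_text):
    """Return one line per element: tag name, then attributes, then text."""
    lines = []
    root = ET.fromstring(xml_text)
    for elem in root.iter():
        parts = [local_name(elem.tag)]
        # elem.attrib maps each attribute name to its value.
        for name, value in elem.attrib.items():
            parts.append('%s = "%s"' % (name, value))
        # Elements that only contain child elements have whitespace-only text.
        text = (elem.text or '').strip()
        if text:
            parts.append(text)
        lines.append(' '.join(parts))
    return lines


for line in describe(SAMPLE):
    print(line)
```

For a file on disk, `ET.parse(filename).getroot()` can take the place of `ET.fromstring(xml_text)`.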

haobijam · Oct 28 '10, 04:57 AM

Parsing attributes and extracting unique terms from adf.txt

Dear Sir,

I have a query regarding parsing attributes and extracting unique terms from the adf.txt files from ArrayExpress [ftp://ftp.ebi.ac.uk/pub/databases/mi...y/data/array/]. The Python code written here works for an individual file with a similar starting term, but it is not feasible for running around 2270 adf.txt files at one time. Could you please...

haobijam · Oct 19 '10, 05:28 AM

Parsing attributes from sdrf.txt files and extracting unique terms for all sdrf.txt

Dear Sir,

I would like to extract only the unique terms from all sdrf.txt files, but this Python code outputs the unique terms for every file individually. Terms like Array Data File, Array Design REF ... are repeated in most of the sdrf.txt files, so I do not want to print them again as unique terms. Could you please tell me how to ignore case in Python because...
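
The request above can be read as "print each distinct header term once across all files, ignoring case". Under that assumption, a minimal sketch could look like the following. The function names `header_terms` and `unique_terms` and the two sample headers are made up for illustration; they are not taken from the original script.

```python
def header_terms(lines):
    """Return the tab-separated column names of the first 'Source Name' line."""
    for line in lines:
        if line.startswith('Source Name'):
            return line.rstrip('\r\n').split('\t')
    return []


def unique_terms(headers):
    """Merge several header lists, treating terms that differ only in case as one."""
    seen = set()
    unique = []
    for header in headers:
        for term in header:
            # Compare on a lower-cased key, but keep the first spelling seen.
            key = term.strip().lower()
            if key and key not in seen:
                seen.add(key)
                unique.append(term.strip())
    return unique


# Two made-up headers standing in for the contents of two sdrf.txt files.
file_a = ['Source Name\tArray Data File\tArray Design REF\n', 'row1\tx\ty\n']
file_b = ['Source Name\tarray data file\tFactor Value [dose]\n']

print(unique_terms([header_terms(file_a), header_terms(file_b)]))
```

With real files, each open file object can be passed to `header_terms` in a loop over `glob.glob('*.sdrf.txt')`, and the collected headers handed to `unique_terms` once at the end.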

haobijam · Oct 14 '10, 05:13 AM

Code:

#!/usr/bin/python
import glob

outfile = open('output_att.txt', 'w')
files = glob.glob('*.sdrf.txt')
for file in files:
    infile = open(file)
    for line in infile:
        # Skip every line except the header row starting with 'Source Name'.
        if not line.startswith('Source Name'):
            continue
        # Drop the line ending, then split the header row on tabs.
        lineArray = line.rstrip('\r\n').split('\t')

...

haobijam · Oct 14 '10, 05:11 AM

Parsing tab separated .txt files with distinct or unique attributes

Dear Sir,
I have written a script that extracts the first line, which starts with Source Name and ends with Comment [ArrayExpress Data Retrieval URI]. That part works, but I could not parse the distinct or unique attributes, that is, those which are not repeated in every file. I would like to parse only the attributes in the first line, not the table values. Could you please rectify this script...

haobijam · Oct 13 '10, 02:31 PM

Dear,

What is wrong with this script? I could not print any output.

Code:

#!/usr/bin/python
import glob

outFile = open('output.txt', 'w')
fileNameList = glob.glob('*.adf.txt')
for file in fileNameList:
    f = open(file)
    output = []
    for line in f:
        line = line.strip().split("\t")
        #lineArray = line.split('\t')
        if line:

...

haobijam · Oct 13 '10, 01:56 PM

Dear,
Yes, in all the files there is always a blank line separating the header information I want from the text data I do not want to extract.

Regards,
Haobijam

haobijam · Oct 13 '10, 12:41 PM

Parsing headers separated by \n\n

I would like to extract only the headers from the parsed file. The header in every file starts with Array Design Name but ends with an attribute that is not fixed. So I would like to extract the headers up to the blank-line gap (\n\n); the files are attached in zip format. I would like to extract only the headers circled in red. I have attached the output of the script I wrote. I would be glad of your support and cooperation...
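
Since the header is described as everything before the first blank line, one possible sketch is to read lines until a blank one appears. The function name `header_block` and the sample lines are made up for illustration, and the sketch assumes the file begins directly with the header.

```python
def header_block(lines):
    """Return the lines before the first blank line, without line endings."""
    header = []
    for line in lines:
        # Test the stripped string itself: ''.split('\t') returns [''],
        # a non-empty list, so testing the split result never detects a blank line.
        if not line.strip():
            break
        header.append(line.rstrip('\r\n'))
    return header


# Made-up lines standing in for the contents of one adf.txt file.
sample = [
    'Array Design Name\tExample array\n',
    'Version\t1.0\n',
    '\n',
    'Block Column\tBlock Row\n',
    '1\t1\n',
]

for row in header_block(sample):
    print(row.split('\t'))
```

Because the loop stops at the first blank line, the table data after the gap is never read, whatever the last header attribute happens to be.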

haobijam · Oct 13 '10, 12:20 PM

Parsing headers separated by \n\n

Dear,

Please find the attached zip file. I would like to extract only the headers from the parsed file. The header in every file starts with Array Design Name but does not end with a fixed attribute. So I would like to extract the headers up to the blank-line gap (\n\n); the files are attached in zip format. I would like to extract only the headers circled in red. I would be glad of your support and cooperation...
