Hi,
I am new to Python. I have to map a FASTA file into a dictionary. There are around 1000 sequences in my FASTA file. The problem is that some identical sequences appear under different sequence IDs. I can sort them out by accession number, which is unique. The first line of each record in my FASTA file looks like this:
>seqId|GeneName|AccessionNumber|taxaNumber|OrganizmName|AdditionalInfo
The following lines contain the amino acid sequence.
I need to make a non-redundant FASTA file from these sequences, keeping one record per unique AccessionNumber. It was suggested that I create a dictionary, but I am not sure how to do that for this problem. Can someone help me, please?
Many thanks,
E.
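For what it's worth, here is a rough sketch of how the dictionary idea could look in plain Python. It assumes the accession number is always the third "|"-separated field of the header, that the first record seen for each accession is the one to keep, and that stray spaces around the fields can be stripped. The function names (read_fasta, dedupe_by_accession, write_fasta) and the sample records are made up for illustration.

```python
def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)  # sequence may span several lines
    if header is not None:
        yield header, "".join(chunks)


def dedupe_by_accession(records):
    """Map accession number -> (header, sequence), keeping the first record seen."""
    unique = {}
    for header, seq in records:
        fields = [f.strip() for f in header.split("|")]
        if len(fields) < 3:
            raise ValueError(f"unexpected header format: {header!r}")
        accession = fields[2]  # assumption: third field is the accession number
        if accession not in unique:
            unique[accession] = (header, seq)
    return unique


def write_fasta(unique, handle, width=60):
    """Write the kept records to an open text handle, wrapping sequence lines."""
    for header, seq in unique.values():
        handle.write(f">{header}\n")
        for i in range(0, len(seq), width):
            handle.write(seq[i:i + width] + "\n")


# Small in-memory demo with invented records (s1 and s2 share accession ACC1).
sample = [
    ">s1|geneA|ACC1|9606|Homo sapiens|x",
    "MKV",
    "LLA",
    ">s2|geneA|ACC1|9606|Homo sapiens|y",
    "MKVLLA",
    ">s3|geneB|ACC2|10090|Mus musculus|z",
    "MAA",
]
kept = dedupe_by_accession(read_fasta(sample))
print(len(kept), "unique accession numbers")
```

With a real file you would pass the open file object instead of the sample list, for example `with open("input.fasta") as fh: kept = dedupe_by_accession(read_fasta(fh))`, and then call write_fasta with a second file opened for writing (both file names are placeholders). Dictionaries keep insertion order in Python 3.7 and later, so the output preserves the order in which accessions first appear.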