Getting errors parsing TIGR XML in SeqIO

by Moore, Jonathan-2

I'm trying to parse the TAIR9 Arabidopsis release from the TIGR XML files at the TAIR FTP site.

I've tried SeqIO with both the tigr and tigrxml formats, but both give errors in BioPerl 1.6.0.  Does anyone have advice on whether this is likely to be doable, or should I wait until the .gb files are available?

Jay Moore

_______________________________________________
Bioperl-l mailing list
Bioperl-l@...
http://lists.open-bio.org/mailman/listinfo/bioperl-l

Re: Getting errors parsing TIGR XML in SeqIO

by Chris Fields-5

If there are errors, this should be submitted as a bug.  Attach
example data to the report after submitting it (i.e. don't copy and
paste the data into the text box).

http://www.bioperl.org/wiki/Bugs

chris

On Jun 26, 2009, at 4:55 AM, Moore, Jonathan wrote:

> I'm trying to parse the TAIR9 Arabidopsis release from the TIGR XML  
> files at the TAIR FTP site.
>
> I've tried SeqIO with both tigr and tigrxml formats but both are  
> giving errors in 1.6.0.  Has anyone advice on whether it's likely to  
> be doable, or should I wait til the .gb files are available?
>
> Jay Moore

Re: Getting errors parsing TIGR XML in SeqIO

by Jason Stajich-3

There are several flavors of TIGR XML for rice, Arabidopsis, and
other projects. Unfortunately I don't know which one the current
tigrxml parser tracks, but you can compare the test files in t/data
with the files you downloaded to see what is currently supported.
The GenBank (gbk) files will usually be more consistently parseable,
but we can try to work it out if it is a sensible transformation.
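
One rough way to make that comparison is to list the element paths in each file and diff the two sets. The sketch below is a standalone Python illustration using only the standard library, not part of BioPerl; the function names and the tiny inline documents are made up for the example, and with real data you would pass the path of a t/data test file and the path of the downloaded file instead.

```python
import io
import xml.etree.ElementTree as ET
from collections import Counter


def element_paths(source, max_depth=3):
    """Count element paths such as 'A/B/C' down to max_depth levels."""
    counts = Counter()
    stack = []
    # iterparse streams the file, so large chromosome-scale XML is not
    # loaded into memory in one go.
    for event, elem in ET.iterparse(source, events=("start", "end")):
        if event == "start":
            stack.append(elem.tag)
            if len(stack) <= max_depth:
                counts["/".join(stack)] += 1
        else:
            stack.pop()
            elem.clear()  # drop already-processed content
    return counts


def compare_layouts(supported, downloaded, max_depth=3):
    """Return (paths only in downloaded, paths only in supported)."""
    a = element_paths(supported, max_depth)
    b = element_paths(downloaded, max_depth)
    return sorted(set(b) - set(a)), sorted(set(a) - set(b))


if __name__ == "__main__":
    # Illustrative stand-ins only; real files would be given by filename.
    supported = io.BytesIO(b"<ASSEMBLY><CHILD/></ASSEMBLY>")
    downloaded = io.BytesIO(b"<TIGR><ASSEMBLY><CHILD/></ASSEMBLY></TIGR>")
    only_new, only_old = compare_layouts(supported, downloaded)
    print("only in downloaded:", only_new)
    print("only in supported: ", only_old)
```

Paths that appear only in the downloaded file are the parts of the layout that the test data, and so presumably the parser, does not cover.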

On Jun 26, 2009, at 2:55 AM, Moore, Jonathan wrote:

> I'm trying to parse the TAIR9 Arabidopsis release from the TIGR XML  
> files at the TAIR FTP site.
>
> I've tried SeqIO with both tigr and tigrxml formats but both are  
> giving errors in 1.6.0.  Has anyone advice on whether it's likely to  
> be doable, or should I wait til the .gb files are available?
>
> Jay Moore

--
Jason Stajich
jason@...





Re: Getting errors parsing TIGR XML in SeqIO

by Moore, Jonathan-2

Thanks for the suggestion, Jason.

There is a bit of a gulf between the tigrxml test file and the TAIR9 Arabidopsis release in TIGR XML format.  The top-level object in BioPerl's tigrxml test file is ASSEMBLY, whereas in the TAIR file ASSEMBLY is already two levels deep in the object hierarchy, nested inside TIGR and PSEUDOCHROMOSOME.  In addition, the two main objects within the TAIR ASSEMBLY object, GENE_LIST and ASSEMBLY_SEQUENCE, do not appear in our test file at all.  It looks like a bit of work would be needed to map this.
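
For anyone wanting to check the same thing on another release, here is a minimal Python sketch (standard library only, not BioPerl code) that reports which elements enclose the first ASSEMBLY and which element types sit directly beneath it. The function name `locate` is hypothetical, and the inline document is a toy built from the element names mentioned above plus a made-up PLACEHOLDER tag; it is not real TAIR data.

```python
import io
import xml.etree.ElementTree as ET


def locate(source, tag="ASSEMBLY"):
    """Return (ancestor_tags, direct_child_tags) for the first `tag` element.

    Returns (None, []) if no element with that name is found.
    """
    stack = []        # tags of the currently open elements
    ancestors = None  # set once the first `tag` element opens
    children = []
    for event, elem in ET.iterparse(source, events=("start", "end")):
        if event == "start":
            if ancestors is None and elem.tag == tag:
                ancestors = list(stack)
            elif ancestors is not None and len(stack) == len(ancestors) + 1:
                # Opened directly under the target element.
                if elem.tag not in children:
                    children.append(elem.tag)
            stack.append(elem.tag)
        else:
            stack.pop()
            elem.clear()  # keep memory use low on large files
            if ancestors is not None and len(stack) == len(ancestors):
                break  # the target element has just closed
    return ancestors, children


if __name__ == "__main__":
    # Toy document for illustration; pass a filename for a real file.
    toy = io.BytesIO(
        b"<TIGR><PSEUDOCHROMOSOME><ASSEMBLY>"
        b"<GENE_LIST><PLACEHOLDER/></GENE_LIST><ASSEMBLY_SEQUENCE/>"
        b"</ASSEMBLY></PSEUDOCHROMOSOME></TIGR>"
    )
    print(locate(toy))
```

An empty ancestor list means ASSEMBLY is the top-level element, which is the layout described above for the test file.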

Jay






