Bio::SeqIO::genbank.pm

View: New views
1 Messages — Rating Filter:   Alert me  

Bio::SeqIO::genbank.pm

by Gabriel Valiente :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message



There is a line in Bio::SeqIO::genbank.pm to convert data in classification lines into a classification array by splitting only
on ';' or '.' so that a classification that is 2
or more words will still get
matched,my @class = map { s/^\s+//; s/\s+$//; s/\s{2,}/ /g; $_; } split /(?<!subgen)[;\.]+/, $class_lines;but this
will break organism names that have a dot inside, such as "Salmonella
enterica subsp. enterica serovar Typhimurium", which is now
being broken into "Salmonella enterica subsp" and "enterica serovar
Typhimurium".Changing [;\.]
to [;] solves this issue,my @class = map { s/^\s+//; s/\s+$//; s/\s{2,}/ /g; $_; } split /(?<!subgen)[;]+/,
$class_lines;Does anybody want to further
test it before I commit this change? Thanks,Gabriel
_______________________________________________
Bioperl-l mailing list
Bioperl-l@...
http://lists.open-bio.org/mailman/listinfo/bioperl-l