[gt-users] gff3 parser

Brent Pedersen bpederse at gmail.com
Thu Jan 29 18:20:35 CET 2009


hi, so i cant see that this is a bug according to the spec, but i
think it's unexpected.
when i have a fasta like:

##gff-version 3
1   ucb gene    2234602 2234702 .   -   .
ID=grape_1_2234602_2234702;match=EVM_prediction_supercontig_1.248,EVM_prediction_supercontig_1.248.mRNA
1   ucb gene    2300292 2302123 .   +   .
ID=grape_1_2300292_2302123;match=EVM_prediction_supercontig_244.8
1   ucb gene    2303615 2303967 .   +   .
ID=grape_1_2303615_2303967;match=EVM_prediction_supercontig_244.8
1   ucb gene    3596400 3596503 .   -   .
ID=grape_1_3596400_3596503;match=evm.TU.supercontig_167.27
1   ucb gene    3600651 3600977 .   -   .
ID=grape_1_3600651_3600977;match=evm.model.supercontig_1217.1,evm.model.supercontig_1217.1.mRNA


and i run gt gff3 -sort on it, it gives:

##gff-version   3
##sequence-region   1 2234602 3600977
1       ucb     gene    2234602 2234702 .       -       .
match=EVM_prediction_supercontig_1.248,EVM_prediction_supercontig_1.248.mRNA
1       ucb     gene    2300292 2302123 .       +       .
match=EVM_prediction_supercontig_244.8
1       ucb     gene    2303615 2303967 .       +       .
match=EVM_prediction_supercontig_244.8
1       ucb     gene    3596400 3596503 .       -       .
match=evm.TU.supercontig_167.27
1       ucb     gene    3600651 3600977 .       -       .
match=evm.model.supercontig_1217.1,evm.model.supercontig_1217.1.mRNA


which is annoying because it removes the ID's. i think it's since they
have no child features,
but then when merged with another gff, it has lost vital info for the pipeline.

is this intentional? or a can it be filed as a bug?
-b


More information about the gt-users mailing list