[gt-users] gff3 parser
Brent Pedersen
bpederse at gmail.com
Thu Jan 29 18:20:35 CET 2009
hi, so i cant see that this is a bug according to the spec, but i
think it's unexpected.
when i have a fasta like:
##gff-version 3
1 ucb gene 2234602 2234702 . - .
ID=grape_1_2234602_2234702;match=EVM_prediction_supercontig_1.248,EVM_prediction_supercontig_1.248.mRNA
1 ucb gene 2300292 2302123 . + .
ID=grape_1_2300292_2302123;match=EVM_prediction_supercontig_244.8
1 ucb gene 2303615 2303967 . + .
ID=grape_1_2303615_2303967;match=EVM_prediction_supercontig_244.8
1 ucb gene 3596400 3596503 . - .
ID=grape_1_3596400_3596503;match=evm.TU.supercontig_167.27
1 ucb gene 3600651 3600977 . - .
ID=grape_1_3600651_3600977;match=evm.model.supercontig_1217.1,evm.model.supercontig_1217.1.mRNA
and i run gt gff3 -sort on it, it gives:
##gff-version 3
##sequence-region 1 2234602 3600977
1 ucb gene 2234602 2234702 . - .
match=EVM_prediction_supercontig_1.248,EVM_prediction_supercontig_1.248.mRNA
1 ucb gene 2300292 2302123 . + .
match=EVM_prediction_supercontig_244.8
1 ucb gene 2303615 2303967 . + .
match=EVM_prediction_supercontig_244.8
1 ucb gene 3596400 3596503 . - .
match=evm.TU.supercontig_167.27
1 ucb gene 3600651 3600977 . - .
match=evm.model.supercontig_1217.1,evm.model.supercontig_1217.1.mRNA
which is annoying because it removes the ID's. i think it's since they
have no child features,
but then when merged with another gff, it has lost vital info for the pipeline.
is this intentional? or a can it be filed as a bug?
-b
More information about the gt-users
mailing list