|
Many of the CDS records in the data are built by joining small fragments, for example this one in CP002688.1.txt:
- >lcl|CP002688.1_cdsid_AED90282.1 [gene=AT5G01010] [protein=uncharacterized protein] [protein_id=AED90282.1] [location=complement(join(1388..1459,1572..1646,1745..1780,1914..1961,2435..2509,2748..2799,2872..2934,3303..3383,3543..3659,3762..3802,3927..4005,4102..4258,4335..4467,4552..4679,4765..4924))]
So do we only need to find these small fragments, or do we have to join them into a single CDS? |
|
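To make the two options concrete, here is a minimal Python sketch of what "joining" means for a location string like the one above. It assumes the usual GenBank/INSDC reading: the `join(...)` ranges are 1-based and inclusive, they are concatenated in the order listed, and `complement(...)` reverse-complements the joined result. The function names (`parse_location`, `reverse_complement`, `assemble_cds`) are made up for illustration, and the sketch only handles plain `start..end` ranges, not partial (`<`, `>`) or single-base positions. It does not say which of the two the task actually expects.

```python
import re


def parse_location(location):
    """Return (is_complement, [(start, end), ...]); coordinates are 1-based, inclusive."""
    is_complement = location.startswith("complement(")
    # Assumption: every segment is a plain "start..end" range.
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return is_complement, segments


def reverse_complement(seq):
    """Reverse-complement a DNA string made of A, C, G, T (either case)."""
    return seq.translate(str.maketrans("ACGTacgt", "TGCAtgca"))[::-1]


def assemble_cds(genome, location):
    """Concatenate the listed fragments, then reverse-complement if needed."""
    is_complement, segments = parse_location(location)
    # Convert 1-based inclusive ranges to Python slices.
    joined = "".join(genome[start - 1:end] for start, end in segments)
    return reverse_complement(joined) if is_complement else joined


# Toy genome, not real data.
toy_genome = "AAACCCGGGTTT"
print(parse_location("complement(join(1..3,7..9))"))            # (True, [(1, 3), (7, 9)])
print(assemble_cds(toy_genome, "join(1..3,10..12)"))            # AAATTT
print(assemble_cds(toy_genome, "complement(join(1..3,7..9))"))  # CCCTTT
```

`parse_location` alone corresponds to "just find the fragments"; `assemble_cds` corresponds to "connect them into one CDS".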