Protein ID | AgabiH97|076160 |
Gene name | |
Location | scaffold_4:2548291..2550784 |
Strand | + |
Gene length (bp) | 2493 |
Transcript length (bp) | 2493 |
Coding sequence length (bp) | 2493 |
Protein length (aa) | 831 |
PFAM Domain ID | Short name | Long name | E-value | Start | End |
---|---|---|---|---|---|
PF00665 | rve | Integrase core domain | 6.1E-11 | 731 | 824 |
PF13976 | gag_pre-integrs | GAG-pre-integrase domain | 7.9E-09 | 672 | 715 |
PF14223 | Retrotran_gag_2 | gag-polypeptide of LTR copia-type | 4.3E-08 | 2 | 113 |
Swissprot ID | Swissprot Description | Start | End | E-value |
---|---|---|---|---|
sp|P10978|POLX_TOBAC | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1 | 521 | 822 | 8.0E-16 |
GO Term | Description | Terminal node |
---|---|---|
GO:0015074 | DNA integration | Yes |
GO:0008150 | biological_process | No |
GO:0006259 | DNA metabolic process | No |
GO:0044238 | primary metabolic process | No |
GO:0090304 | nucleic acid metabolic process | No |
GO:0006725 | cellular aromatic compound metabolic process | No |
GO:0009987 | cellular process | No |
GO:0071704 | organic substance metabolic process | No |
GO:0043170 | macromolecule metabolic process | No |
GO:0008152 | metabolic process | No |
GO:0046483 | heterocycle metabolic process | No |
GO:0044237 | cellular metabolic process | No |
GO:0006807 | nitrogen compound metabolic process | No |
GO:0006139 | nucleobase-containing compound metabolic process | No |
GO:0044260 | cellular macromolecule metabolic process | No |
GO:0034641 | cellular nitrogen compound metabolic process | No |
GO:1901360 | organic cyclic compound metabolic process | No |
SignalP signal predicted | Location (based on Ymax) |
D score (significance: > 0.45) |
---|---|---|
No | 1 - 13 | 0.45 |
Type of sequence | Sequence |
---|---|
Locus | Download genbank file of locus
The gene with 5 kb flanks (if sufficient flanking sequence is available). For use in cloning design programs. NOTE: features (genes or exons) that are only partially contained within the sequence are completely excluded. |
Protein | >AgabiH97|076160 MLSQAAAQTDARALWNWLEGQYGQKGPTFAYERFITAQNFKINDSADPSSAIADLYALFAAVESTGALIHESIRT MMLLNALPNWFEHVAANILAEKRSVADLKWEETCARIIAVWRNPHTVANAACFRQQNQPKPRWDNKKPNQGQGKS QGSGSGSGSGQNQNKQQQPQQKKEGEEQSNDKKKKRTRSRKKKQGNEASTTSIEEVKDTTPDYSASSAIASMAQL ASTSTCTKVNEMKSRFVPLPSRSDPNLTRNRSPKLGNHTISGEEFFFFPDGNMIDLSPLDTDIDSSTLPRLCPAT RNLLLDESDVDLDPSDSGLIPRDPRKRLSPRHSPMPYERAKSKVLSLWDQRLNAVTVTAPPIEDGAGQVGGNAPA VKMPATKGKLVEKMDVDKADEISLSDEDGEFEYEYDVECEPDSFAQVPRFSPISQINSWILQGSIVSRDKGLVEP AYLRERSGRSLWLQLGSSSTATPVLYNMIANETTSLSSLLDRMNPICEVLHACHSNCEGCKTRKNLEKHSWLLDS GASCHFTPNLEDFAHLQCGNFGIVHTANKDSVLKIQGRGHVLIEHTVKDISTGKEFKSVSKIWPVFYVNGMNHRL LSTAQLLKSGLKLESTKDGSVFKNEAGQAILSARPEGLFGGMHIVECTFLKHSKTEPYLLNASGILQLPDYTIWH CRLGHPSDNVLKKFFEETLGVPKINIPQQKPVCDGCVRGKLTQQSFPQSEKCATSVLELVHLDLFELPVPSYHKY KWVMTLLDDYSGLAQIVMITKKSDAALQLINILKQLATQSDQKIKRLRTDQGGEYVNDTLSTYLKSQGIVHELSH QEKVK* |
Coding | >AgabiH97|076160 ATGCTCAGTCAAGCGGCCGCTCAAACGGATGCTAGAGCTCTTTGGAATTGGTTGGAAGGCCAATATGGCCAAAAA GGACCAACGTTCGCCTATGAACGTTTTATTACAGCTCAAAATTTCAAGATAAATGACTCCGCCGATCCATCATCT GCGATTGCAGATCTCTACGCATTATTTGCCGCTGTCGAAAGCACTGGTGCGTTGATACATGAGTCGATCCGCACT ATGATGCTCCTCAACGCTCTCCCCAACTGGTTCGAGCATGTTGCTGCAAACATTCTTGCTGAGAAAAGGTCGGTT GCCGATCTTAAATGGGAGGAAACTTGTGCCCGCATTATTGCTGTGTGGCGCAACCCCCATACAGTTGCCAATGCT GCGTGCTTCCGCCAGCAGAACCAGCCTAAACCCAGATGGGACAATAAAAAGCCCAATCAAGGGCAAGGCAAGTCC CAAGGCTCTGGTTCCGGTTCTGGCTCCGGCCAGAACCAAAACAAGCAACAACAACCTCAGCAGAAGAAGGAAGGT GAAGAGCAGAGTAACGACAAAAAGAAGAAGAGAACCCGCTCCCGTAAAAAGAAGCAGGGTAACGAGGCTTCTACC ACCTCTATCGAAGAAGTCAAGGATACGACTCCTGACTACTCCGCTTCTTCTGCCATTGCTTCAATGGCGCAGCTC GCATCCACCTCTACATGCACAAAAGTCAATGAGATGAAGAGCAGGTTCGTCCCGCTTCCATCTAGATCAGATCCT AATCTAACAAGGAATAGATCGCCCAAGTTGGGCAACCATACCATTTCGGGTGAGGAATTCTTTTTCTTCCCTGAT GGAAATATGATTGATCTTTCCCCACTGGATACTGACATTGACTCCTCGACTTTGCCTCGACTTTGTCCTGCGACG CGAAATCTCCTTCTGGACGAATCTGATGTTGATCTTGATCCATCAGACTCCGGGTTGATCCCTCGGGACCCTCGG AAACGTCTTTCCCCTCGCCATTCCCCCATGCCCTACGAAAGGGCCAAATCGAAAGTGTTATCCCTCTGGGATCAA CGTCTTAACGCAGTGACTGTTACAGCCCCTCCAATTGAGGATGGGGCCGGTCAAGTGGGTGGTAATGCTCCGGCT GTAAAAATGCCGGCTACGAAGGGAAAACTAGTTGAGAAGATGGATGTTGACAAAGCAGATGAGATTTCTCTCAGC GATGAAGATGGAGAATTTGAATATGAGTATGATGTGGAATGCGAACCGGATAGCTTTGCGCAAGTCCCTCGCTTC TCTCCCATATCTCAGATTAACTCATGGATTCTACAGGGCAGTATTGTCTCCAGAGACAAAGGCTTGGTTGAACCA GCCTATCTCCGAGAAAGATCGGGCCGCTCTTTGTGGCTTCAATTAGGATCGTCGAGTACTGCTACACCTGTATTA TATAATATGATCGCAAATGAGACTACTTCTTTATCCTCTTTGCTTGATCGAATGAATCCAATATGTGAAGTTCTC CATGCCTGTCATTCAAATTGTGAGGGTTGCAAAACCCGAAAGAATCTAGAAAAACATAGTTGGTTGCTCGATAGT GGTGCTTCATGCCACTTCACACCGAACCTTGAGGATTTTGCACATTTACAATGTGGAAACTTCGGTATCGTGCAT ACTGCAAATAAAGACTCTGTACTCAAAATCCAGGGACGCGGACATGTCTTAATCGAGCATACCGTAAAAGATATC TCCACGGGAAAAGAATTTAAATCAGTATCAAAGATCTGGCCCGTGTTCTATGTCAACGGCATGAATCATAGACTT CTTTCAACTGCTCAGTTGTTGAAGTCTGGACTTAAACTAGAGTCTACAAAAGATGGATCTGTATTCAAAAACGAA GCAGGTCAAGCGATTCTTAGTGCCCGTCCGGAAGGTCTTTTTGGCGGCATGCACATTGTTGAGTGTACCTTCTTA AAACACTCCAAAACAGAACCATACTTGCTTAACGCAAGCGGAATACTCCAGCTCCCTGACTACACCATTTGGCAT TGTAGATTAGGTCATCCTTCTGATAATGTGTTGAAGAAATTCTTCGAGGAAACCCTAGGAGTACCGAAGATTAAT ATTCCACAACAGAAACCGGTTTGTGATGGTTGTGTTCGTGGCAAACTTACGCAACAATCATTTCCACAATCGGAA AAATGTGCTACTTCAGTACTCGAACTGGTCCACTTGGATCTATTCGAACTCCCTGTCCCCTCTTATCACAAATAC AAATGGGTGATGACTTTGCTTGACGACTATTCCGGTCTCGCTCAGATTGTCATGATAACCAAGAAAAGTGATGCC GCGTTACAGTTGATTAATATCCTAAAACAGTTGGCTACTCAATCTGATCAAAAGATCAAAAGATTGCGTACAGAT CAAGGAGGAGAGTATGTCAATGATACACTCTCGACATACCTGAAGTCACAAGGCATTGTGCATGAACTCTCACAT CAGGAAAAGGTAAAGTGA |
Transcript | >AgabiH97|076160 ATGCTCAGTCAAGCGGCCGCTCAAACGGATGCTAGAGCTCTTTGGAATTGGTTGGAAGGCCAATATGGCCAAAAA GGACCAACGTTCGCCTATGAACGTTTTATTACAGCTCAAAATTTCAAGATAAATGACTCCGCCGATCCATCATCT GCGATTGCAGATCTCTACGCATTATTTGCCGCTGTCGAAAGCACTGGTGCGTTGATACATGAGTCGATCCGCACT ATGATGCTCCTCAACGCTCTCCCCAACTGGTTCGAGCATGTTGCTGCAAACATTCTTGCTGAGAAAAGGTCGGTT GCCGATCTTAAATGGGAGGAAACTTGTGCCCGCATTATTGCTGTGTGGCGCAACCCCCATACAGTTGCCAATGCT GCGTGCTTCCGCCAGCAGAACCAGCCTAAACCCAGATGGGACAATAAAAAGCCCAATCAAGGGCAAGGCAAGTCC CAAGGCTCTGGTTCCGGTTCTGGCTCCGGCCAGAACCAAAACAAGCAACAACAACCTCAGCAGAAGAAGGAAGGT GAAGAGCAGAGTAACGACAAAAAGAAGAAGAGAACCCGCTCCCGTAAAAAGAAGCAGGGTAACGAGGCTTCTACC ACCTCTATCGAAGAAGTCAAGGATACGACTCCTGACTACTCCGCTTCTTCTGCCATTGCTTCAATGGCGCAGCTC GCATCCACCTCTACATGCACAAAAGTCAATGAGATGAAGAGCAGGTTCGTCCCGCTTCCATCTAGATCAGATCCT AATCTAACAAGGAATAGATCGCCCAAGTTGGGCAACCATACCATTTCGGGTGAGGAATTCTTTTTCTTCCCTGAT GGAAATATGATTGATCTTTCCCCACTGGATACTGACATTGACTCCTCGACTTTGCCTCGACTTTGTCCTGCGACG CGAAATCTCCTTCTGGACGAATCTGATGTTGATCTTGATCCATCAGACTCCGGGTTGATCCCTCGGGACCCTCGG AAACGTCTTTCCCCTCGCCATTCCCCCATGCCCTACGAAAGGGCCAAATCGAAAGTGTTATCCCTCTGGGATCAA CGTCTTAACGCAGTGACTGTTACAGCCCCTCCAATTGAGGATGGGGCCGGTCAAGTGGGTGGTAATGCTCCGGCT GTAAAAATGCCGGCTACGAAGGGAAAACTAGTTGAGAAGATGGATGTTGACAAAGCAGATGAGATTTCTCTCAGC GATGAAGATGGAGAATTTGAATATGAGTATGATGTGGAATGCGAACCGGATAGCTTTGCGCAAGTCCCTCGCTTC TCTCCCATATCTCAGATTAACTCATGGATTCTACAGGGCAGTATTGTCTCCAGAGACAAAGGCTTGGTTGAACCA GCCTATCTCCGAGAAAGATCGGGCCGCTCTTTGTGGCTTCAATTAGGATCGTCGAGTACTGCTACACCTGTATTA TATAATATGATCGCAAATGAGACTACTTCTTTATCCTCTTTGCTTGATCGAATGAATCCAATATGTGAAGTTCTC CATGCCTGTCATTCAAATTGTGAGGGTTGCAAAACCCGAAAGAATCTAGAAAAACATAGTTGGTTGCTCGATAGT GGTGCTTCATGCCACTTCACACCGAACCTTGAGGATTTTGCACATTTACAATGTGGAAACTTCGGTATCGTGCAT ACTGCAAATAAAGACTCTGTACTCAAAATCCAGGGACGCGGACATGTCTTAATCGAGCATACCGTAAAAGATATC TCCACGGGAAAAGAATTTAAATCAGTATCAAAGATCTGGCCCGTGTTCTATGTCAACGGCATGAATCATAGACTT CTTTCAACTGCTCAGTTGTTGAAGTCTGGACTTAAACTAGAGTCTACAAAAGATGGATCTGTATTCAAAAACGAA GCAGGTCAAGCGATTCTTAGTGCCCGTCCGGAAGGTCTTTTTGGCGGCATGCACATTGTTGAGTGTACCTTCTTA AAACACTCCAAAACAGAACCATACTTGCTTAACGCAAGCGGAATACTCCAGCTCCCTGACTACACCATTTGGCAT TGTAGATTAGGTCATCCTTCTGATAATGTGTTGAAGAAATTCTTCGAGGAAACCCTAGGAGTACCGAAGATTAAT ATTCCACAACAGAAACCGGTTTGTGATGGTTGTGTTCGTGGCAAACTTACGCAACAATCATTTCCACAATCGGAA AAATGTGCTACTTCAGTACTCGAACTGGTCCACTTGGATCTATTCGAACTCCCTGTCCCCTCTTATCACAAATAC AAATGGGTGATGACTTTGCTTGACGACTATTCCGGTCTCGCTCAGATTGTCATGATAACCAAGAAAAGTGATGCC GCGTTACAGTTGATTAATATCCTAAAACAGTTGGCTACTCAATCTGATCAAAAGATCAAAAGATTGCGTACAGAT CAAGGAGGAGAGTATGTCAATGATACACTCTCGACATACCTGAAGTCACAAGGCATTGTGCATGAACTCTCACAT CAGGAAAAGGTAAAGTGA |
Gene | >AgabiH97|076160 ATGCTCAGTCAAGCGGCCGCTCAAACGGATGCTAGAGCTCTTTGGAATTGGTTGGAAGGCCAATATGGCCAAAAA GGACCAACGTTCGCCTATGAACGTTTTATTACAGCTCAAAATTTCAAGATAAATGACTCCGCCGATCCATCATCT GCGATTGCAGATCTCTACGCATTATTTGCCGCTGTCGAAAGCACTGGTGCGTTGATACATGAGTCGATCCGCACT ATGATGCTCCTCAACGCTCTCCCCAACTGGTTCGAGCATGTTGCTGCAAACATTCTTGCTGAGAAAAGGTCGGTT GCCGATCTTAAATGGGAGGAAACTTGTGCCCGCATTATTGCTGTGTGGCGCAACCCCCATACAGTTGCCAATGCT GCGTGCTTCCGCCAGCAGAACCAGCCTAAACCCAGATGGGACAATAAAAAGCCCAATCAAGGGCAAGGCAAGTCC CAAGGCTCTGGTTCCGGTTCTGGCTCCGGCCAGAACCAAAACAAGCAACAACAACCTCAGCAGAAGAAGGAAGGT GAAGAGCAGAGTAACGACAAAAAGAAGAAGAGAACCCGCTCCCGTAAAAAGAAGCAGGGTAACGAGGCTTCTACC ACCTCTATCGAAGAAGTCAAGGATACGACTCCTGACTACTCCGCTTCTTCTGCCATTGCTTCAATGGCGCAGCTC GCATCCACCTCTACATGCACAAAAGTCAATGAGATGAAGAGCAGGTTCGTCCCGCTTCCATCTAGATCAGATCCT AATCTAACAAGGAATAGATCGCCCAAGTTGGGCAACCATACCATTTCGGGTGAGGAATTCTTTTTCTTCCCTGAT GGAAATATGATTGATCTTTCCCCACTGGATACTGACATTGACTCCTCGACTTTGCCTCGACTTTGTCCTGCGACG CGAAATCTCCTTCTGGACGAATCTGATGTTGATCTTGATCCATCAGACTCCGGGTTGATCCCTCGGGACCCTCGG AAACGTCTTTCCCCTCGCCATTCCCCCATGCCCTACGAAAGGGCCAAATCGAAAGTGTTATCCCTCTGGGATCAA CGTCTTAACGCAGTGACTGTTACAGCCCCTCCAATTGAGGATGGGGCCGGTCAAGTGGGTGGTAATGCTCCGGCT GTAAAAATGCCGGCTACGAAGGGAAAACTAGTTGAGAAGATGGATGTTGACAAAGCAGATGAGATTTCTCTCAGC GATGAAGATGGAGAATTTGAATATGAGTATGATGTGGAATGCGAACCGGATAGCTTTGCGCAAGTCCCTCGCTTC TCTCCCATATCTCAGATTAACTCATGGATTCTACAGGGCAGTATTGTCTCCAGAGACAAAGGCTTGGTTGAACCA GCCTATCTCCGAGAAAGATCGGGCCGCTCTTTGTGGCTTCAATTAGGATCGTCGAGTACTGCTACACCTGTATTA TATAATATGATCGCAAATGAGACTACTTCTTTATCCTCTTTGCTTGATCGAATGAATCCAATATGTGAAGTTCTC CATGCCTGTCATTCAAATTGTGAGGGTTGCAAAACCCGAAAGAATCTAGAAAAACATAGTTGGTTGCTCGATAGT GGTGCTTCATGCCACTTCACACCGAACCTTGAGGATTTTGCACATTTACAATGTGGAAACTTCGGTATCGTGCAT ACTGCAAATAAAGACTCTGTACTCAAAATCCAGGGACGCGGACATGTCTTAATCGAGCATACCGTAAAAGATATC TCCACGGGAAAAGAATTTAAATCAGTATCAAAGATCTGGCCCGTGTTCTATGTCAACGGCATGAATCATAGACTT CTTTCAACTGCTCAGTTGTTGAAGTCTGGACTTAAACTAGAGTCTACAAAAGATGGATCTGTATTCAAAAACGAA GCAGGTCAAGCGATTCTTAGTGCCCGTCCGGAAGGTCTTTTTGGCGGCATGCACATTGTTGAGTGTACCTTCTTA AAACACTCCAAAACAGAACCATACTTGCTTAACGCAAGCGGAATACTCCAGCTCCCTGACTACACCATTTGGCAT TGTAGATTAGGTCATCCTTCTGATAATGTGTTGAAGAAATTCTTCGAGGAAACCCTAGGAGTACCGAAGATTAAT ATTCCACAACAGAAACCGGTTTGTGATGGTTGTGTTCGTGGCAAACTTACGCAACAATCATTTCCACAATCGGAA AAATGTGCTACTTCAGTACTCGAACTGGTCCACTTGGATCTATTCGAACTCCCTGTCCCCTCTTATCACAAATAC AAATGGGTGATGACTTTGCTTGACGACTATTCCGGTCTCGCTCAGATTGTCATGATAACCAAGAAAAGTGATGCC GCGTTACAGTTGATTAATATCCTAAAACAGTTGGCTACTCAATCTGATCAAAAGATCAAAAGATTGCGTACAGAT CAAGGAGGAGAGTATGTCAATGATACACTCTCGACATACCTGAAGTCACAAGGCATTGTGCATGAACTCTCACAT CAGGAAAAGGTAAAGTGA |