Protein ID | Agabi119p4|719100 |
Gene name | |
Location | scaffold_09:1369832..1371665 |
Strand | - |
Gene length (bp) | 1833 |
Transcript length (bp) | 1833 |
Coding sequence length (bp) | 1833 |
Protein length (aa) | 611 |
PFAM Domain ID | Short name | Long name | E-value | Start | End |
---|---|---|---|---|---|
PF00665 | rve | Integrase core domain | 5.8E-12 | 477 | 574 |
PF13976 | gag_pre-integrs | GAG-pre-integrase domain | 1.3E-08 | 413 | 463 |
Swissprot ID | Swissprot Description | Start | End | E-value |
---|---|---|---|---|
sp|P10978|POLX_TOBAC | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1 | 276 | 607 | 2.0E-28 |
sp|P04146|COPIA_DROME | Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3 | 354 | 607 | 3.0E-12 |
GO Term | Description | Terminal node |
---|---|---|
GO:0015074 | DNA integration | Yes |
GO:0044260 | cellular macromolecule metabolic process | No |
GO:0006139 | nucleobase-containing compound metabolic process | No |
GO:0009987 | cellular process | No |
GO:0034641 | cellular nitrogen compound metabolic process | No |
GO:0008152 | metabolic process | No |
GO:0008150 | biological_process | No |
GO:0044238 | primary metabolic process | No |
GO:0071704 | organic substance metabolic process | No |
GO:0046483 | heterocycle metabolic process | No |
GO:0043170 | macromolecule metabolic process | No |
GO:1901360 | organic cyclic compound metabolic process | No |
GO:0090304 | nucleic acid metabolic process | No |
GO:0006725 | cellular aromatic compound metabolic process | No |
GO:0044237 | cellular metabolic process | No |
GO:0006807 | nitrogen compound metabolic process | No |
GO:0006259 | DNA metabolic process | No |
SignalP signal predicted | Location (based on Ymax) |
D score (significance: > 0.45) |
---|---|---|
No | 1 - 61 | 0.45 |
Type of sequence | Sequence |
---|---|
Locus | Download genbank file of locus
The gene with 5 kb flanks (if sufficient flanking sequence is available). For use in cloning design programs. NOTE: features (genes or exons) that are only partially contained within the sequence are completely excluded. |
Protein | >Agabi119p4|719100 MFSDNDELTHSPPSHYSYTWSFDLRHHYDLPFSLDDTIMENELPHGRTYTWIRGRYTDITYMELDIDRIAPTGAQ RTRDPSLNVARTGRVTPYQTAKLNALLAWDKRLGARPIDRIALTTHNEEATRPDETPSEYHTVAEEPERRSGSPI DYAIEEFGNSVIEEKEDGEISIHSGQDLMDNEEELDYDAENDGDSYPQVLPLYSQSITNSSFQCNDVDRSGSRVE QTSSLSSSCNGSDINYAMELSVRNALLDNITPEFKSFIMSIRSNSDSYPIKWMLDSGASAHFTGSLSDFSTVDRG FFGMVQTASGQLKIQGRGTVHIQHLVVDTNTGTKEVQRTKLWPVFYINGMHMRLISVGQLLRSGLRLEANAKHLT FRDENNHAVLSGISGHFPSISAVLSRIIQEIPKAGAYATTNTDYSTWHRRLGHPSDLVLCKFSKESLGVPPISIP QDKPVCKGCAEGKLAQKPFPTSGSRGTQVLELIHSDLFELLVISYHRHKWVLTILDDYSSTAFTVMLAHKSDAPR EMTKVMTLLANSTDQKIKRLRIDRGEEYTNSALQEYLSTNSIKHELSAPNVHQQNGRAERLNRMLHEKSQAMRKH ACLPDSWLLR* |
Coding | >Agabi119p4|719100 ATGTTTTCAGATAACGACGAACTTACACATTCCCCTCCCTCGCACTATTCGTACACATGGTCGTTCGACCTCCGT CATCACTACGATCTACCGTTCTCTTTGGACGATACGATCATGGAAAACGAACTTCCTCATGGACGAACGTACACT TGGATCCGCGGCCGCTATACCGACATTACCTATATGGAACTCGATATAGATCGGATCGCTCCAACTGGTGCACAA CGGACACGAGACCCTAGTCTCAACGTCGCTCGGACTGGTAGGGTCACACCTTACCAAACGGCTAAACTCAATGCG CTCCTCGCCTGGGACAAACGTCTCGGTGCTCGACCTATTGATAGAATCGCTCTGACGACTCACAATGAGGAAGCA ACGAGGCCCGATGAAACTCCATCCGAGTATCACACGGTAGCCGAAGAACCGGAAAGACGATCCGGATCGCCTATT GATTATGCGATCGAGGAATTTGGGAATTCTGTGATAGAAGAGAAGGAAGACGGTGAGATATCGATCCATTCTGGG CAAGACCTGATGGATAACGAAGAAGAACTCGATTATGATGCTGAGAATGATGGGGATTCATACCCGCAAGTATTA CCCCTCTACTCGCAATCAATTACTAATTCTTCTTTTCAGTGCAATGATGTAGATCGCTCCGGATCAAGAGTAGAA CAGACAAGTAGTCTTAGTTCCTCATGTAACGGTTCGGATATAAATTATGCGATGGAATTGAGCGTCAGGAACGCT CTTCTCGATAACATCACGCCTGAATTTAAGTCATTCATCATGAGCATAAGATCAAACTCTGATTCTTATCCGATT AAGTGGATGCTAGATAGTGGTGCTTCTGCGCACTTTACTGGCAGTCTCTCAGACTTTTCTACAGTTGATCGAGGC TTCTTTGGAATGGTACAAACTGCCTCTGGCCAATTAAAAATCCAAGGCCGAGGAACAGTACACATCCAACACTTA GTTGTTGATACAAACACAGGAACCAAAGAAGTTCAAAGGACCAAACTTTGGCCTGTCTTCTACATAAACGGTATG CACATGCGGCTCATCTCTGTCGGACAACTTTTACGGTCCGGACTAAGATTAGAAGCCAATGCGAAGCATCTCACC TTTCGAGATGAAAATAATCATGCTGTACTTTCAGGTATCTCGGGACACTTTCCCAGTATTTCCGCTGTCCTTTCC AGGATCATTCAGGAAATCCCCAAAGCTGGTGCTTATGCAACCACAAACACTGATTATTCTACCTGGCATCGTCGA TTAGGACATCCGTCCGATCTTGTGTTGTGTAAATTCTCAAAAGAATCTTTAGGTGTTCCACCTATTAGTATTCCA CAAGATAAACCAGTTTGTAAGGGATGTGCTGAAGGTAAACTCGCACAAAAACCCTTTCCCACTTCAGGAAGCCGA GGGACACAAGTTCTTGAGTTAATCCATTCGGATCTATTCGAACTTCTGGTAATCTCTTATCACCGCCATAAATGG GTATTAACCATATTGGATGATTATTCCAGTACGGCATTCACCGTAATGCTTGCCCACAAAAGTGATGCACCTCGA GAAATGACGAAAGTCATGACGTTGCTCGCTAACTCCACTGATCAAAAGATCAAGAGACTGCGCATTGATCGAGGA GAAGAATATACCAATAGTGCTCTACAAGAGTACTTATCTACCAACAGCATTAAACATGAGCTTTCTGCCCCTAAT GTTCACCAACAAAATGGACGCGCTGAAAGACTCAATAGAATGTTGCACGAGAAATCACAAGCCATGCGTAAACAT GCATGTCTTCCCGACTCATGGTTGTTAAGATAG |
Transcript | >Agabi119p4|719100 ATGTTTTCAGATAACGACGAACTTACACATTCCCCTCCCTCGCACTATTCGTACACATGGTCGTTCGACCTCCGT CATCACTACGATCTACCGTTCTCTTTGGACGATACGATCATGGAAAACGAACTTCCTCATGGACGAACGTACACT TGGATCCGCGGCCGCTATACCGACATTACCTATATGGAACTCGATATAGATCGGATCGCTCCAACTGGTGCACAA CGGACACGAGACCCTAGTCTCAACGTCGCTCGGACTGGTAGGGTCACACCTTACCAAACGGCTAAACTCAATGCG CTCCTCGCCTGGGACAAACGTCTCGGTGCTCGACCTATTGATAGAATCGCTCTGACGACTCACAATGAGGAAGCA ACGAGGCCCGATGAAACTCCATCCGAGTATCACACGGTAGCCGAAGAACCGGAAAGACGATCCGGATCGCCTATT GATTATGCGATCGAGGAATTTGGGAATTCTGTGATAGAAGAGAAGGAAGACGGTGAGATATCGATCCATTCTGGG CAAGACCTGATGGATAACGAAGAAGAACTCGATTATGATGCTGAGAATGATGGGGATTCATACCCGCAAGTATTA CCCCTCTACTCGCAATCAATTACTAATTCTTCTTTTCAGTGCAATGATGTAGATCGCTCCGGATCAAGAGTAGAA CAGACAAGTAGTCTTAGTTCCTCATGTAACGGTTCGGATATAAATTATGCGATGGAATTGAGCGTCAGGAACGCT CTTCTCGATAACATCACGCCTGAATTTAAGTCATTCATCATGAGCATAAGATCAAACTCTGATTCTTATCCGATT AAGTGGATGCTAGATAGTGGTGCTTCTGCGCACTTTACTGGCAGTCTCTCAGACTTTTCTACAGTTGATCGAGGC TTCTTTGGAATGGTACAAACTGCCTCTGGCCAATTAAAAATCCAAGGCCGAGGAACAGTACACATCCAACACTTA GTTGTTGATACAAACACAGGAACCAAAGAAGTTCAAAGGACCAAACTTTGGCCTGTCTTCTACATAAACGGTATG CACATGCGGCTCATCTCTGTCGGACAACTTTTACGGTCCGGACTAAGATTAGAAGCCAATGCGAAGCATCTCACC TTTCGAGATGAAAATAATCATGCTGTACTTTCAGGTATCTCGGGACACTTTCCCAGTATTTCCGCTGTCCTTTCC AGGATCATTCAGGAAATCCCCAAAGCTGGTGCTTATGCAACCACAAACACTGATTATTCTACCTGGCATCGTCGA TTAGGACATCCGTCCGATCTTGTGTTGTGTAAATTCTCAAAAGAATCTTTAGGTGTTCCACCTATTAGTATTCCA CAAGATAAACCAGTTTGTAAGGGATGTGCTGAAGGTAAACTCGCACAAAAACCCTTTCCCACTTCAGGAAGCCGA GGGACACAAGTTCTTGAGTTAATCCATTCGGATCTATTCGAACTTCTGGTAATCTCTTATCACCGCCATAAATGG GTATTAACCATATTGGATGATTATTCCAGTACGGCATTCACCGTAATGCTTGCCCACAAAAGTGATGCACCTCGA GAAATGACGAAAGTCATGACGTTGCTCGCTAACTCCACTGATCAAAAGATCAAGAGACTGCGCATTGATCGAGGA GAAGAATATACCAATAGTGCTCTACAAGAGTACTTATCTACCAACAGCATTAAACATGAGCTTTCTGCCCCTAAT GTTCACCAACAAAATGGACGCGCTGAAAGACTCAATAGAATGTTGCACGAGAAATCACAAGCCATGCGTAAACAT GCATGTCTTCCCGACTCATGGTTGTTAAGATAG |
Gene | >Agabi119p4|719100 ATGTTTTCAGATAACGACGAACTTACACATTCCCCTCCCTCGCACTATTCGTACACATGGTCGTTCGACCTCCGT CATCACTACGATCTACCGTTCTCTTTGGACGATACGATCATGGAAAACGAACTTCCTCATGGACGAACGTACACT TGGATCCGCGGCCGCTATACCGACATTACCTATATGGAACTCGATATAGATCGGATCGCTCCAACTGGTGCACAA CGGACACGAGACCCTAGTCTCAACGTCGCTCGGACTGGTAGGGTCACACCTTACCAAACGGCTAAACTCAATGCG CTCCTCGCCTGGGACAAACGTCTCGGTGCTCGACCTATTGATAGAATCGCTCTGACGACTCACAATGAGGAAGCA ACGAGGCCCGATGAAACTCCATCCGAGTATCACACGGTAGCCGAAGAACCGGAAAGACGATCCGGATCGCCTATT GATTATGCGATCGAGGAATTTGGGAATTCTGTGATAGAAGAGAAGGAAGACGGTGAGATATCGATCCATTCTGGG CAAGACCTGATGGATAACGAAGAAGAACTCGATTATGATGCTGAGAATGATGGGGATTCATACCCGCAAGTATTA CCCCTCTACTCGCAATCAATTACTAATTCTTCTTTTCAGTGCAATGATGTAGATCGCTCCGGATCAAGAGTAGAA CAGACAAGTAGTCTTAGTTCCTCATGTAACGGTTCGGATATAAATTATGCGATGGAATTGAGCGTCAGGAACGCT CTTCTCGATAACATCACGCCTGAATTTAAGTCATTCATCATGAGCATAAGATCAAACTCTGATTCTTATCCGATT AAGTGGATGCTAGATAGTGGTGCTTCTGCGCACTTTACTGGCAGTCTCTCAGACTTTTCTACAGTTGATCGAGGC TTCTTTGGAATGGTACAAACTGCCTCTGGCCAATTAAAAATCCAAGGCCGAGGAACAGTACACATCCAACACTTA GTTGTTGATACAAACACAGGAACCAAAGAAGTTCAAAGGACCAAACTTTGGCCTGTCTTCTACATAAACGGTATG CACATGCGGCTCATCTCTGTCGGACAACTTTTACGGTCCGGACTAAGATTAGAAGCCAATGCGAAGCATCTCACC TTTCGAGATGAAAATAATCATGCTGTACTTTCAGGTATCTCGGGACACTTTCCCAGTATTTCCGCTGTCCTTTCC AGGATCATTCAGGAAATCCCCAAAGCTGGTGCTTATGCAACCACAAACACTGATTATTCTACCTGGCATCGTCGA TTAGGACATCCGTCCGATCTTGTGTTGTGTAAATTCTCAAAAGAATCTTTAGGTGTTCCACCTATTAGTATTCCA CAAGATAAACCAGTTTGTAAGGGATGTGCTGAAGGTAAACTCGCACAAAAACCCTTTCCCACTTCAGGAAGCCGA GGGACACAAGTTCTTGAGTTAATCCATTCGGATCTATTCGAACTTCTGGTAATCTCTTATCACCGCCATAAATGG GTATTAACCATATTGGATGATTATTCCAGTACGGCATTCACCGTAATGCTTGCCCACAAAAGTGATGCACCTCGA GAAATGACGAAAGTCATGACGTTGCTCGCTAACTCCACTGATCAAAAGATCAAGAGACTGCGCATTGATCGAGGA GAAGAATATACCAATAGTGCTCTACAAGAGTACTTATCTACCAACAGCATTAAACATGAGCTTTCTGCCCCTAAT GTTCACCAACAAAATGGACGCGCTGAAAGACTCAATAGAATGTTGCACGAGAAATCACAAGCCATGCGTAAACAT GCATGTCTTCCCGACTCATGGTTGTTAAGATAG |