Gene search


Sequence information


Gene Cds Cds_length GC_content Pep Pep_length
Cec10g1975 ATGCGGGCCTTCTCTTCCTTGATCTTGGCCTTAGCATTCGCCGTTCTGCCGGCGACCGTGGTTAACTGTGGCTTCCCCGTCCCACTTCTCTCTCTGCACAGGGCCTTCCCTTCTTCTCTCCCTTTCCAACTCGAAACCCTCAGGGCTCGGGACCGACTCAGGCATGCCAGAATCTTGCAAGGTGTTGTCGACTTCTCAGTCGAAGGTTCTTCCGATCCATTACTCGTCGGGCTGTACTTTACCAAAGTCAAATTGGGCACTCCCCCAATGGAATTTACGGTACAGATCGACACTGGAAGTGATATATTGTGGGTTAACTGCAATTCCTGCAATGGCTGCCCCAGATCAAGTGGACTTGGAATTCAACTAAATTTCTTTGATGCTTCTAGCTCATCAAGTTCTTCACTCGTCTCCTGTTCTGACCCAATATGCAATTCTGCATTCCAAACGACTGCAACACAATGTTTAACTCAGAGTAATCAATGCAGTTACACCTTTCAGTATGGCGATGGAAGTGGAACATCAGGTTATTATGTATCTGAGTCCATGTACTTTGACATGGTCATGGGACATTCTATGATTGCTAACTCTTCAGCTAGTGTTGTTTTTGGTTGTAGCACCTACCAGTCTGGAGACTTGACTAAATCAGACCATGCAATAGATGGCATTTTTGGATTTGGCCCCGGGGATTTGTCTGTCATATCACAGTTGTCAGCTCGAGGGATAACCCCTAAAGTATTTTCCCATTGTTTAAAGGGAGAAGGAAATGGTGGTGGTATATTGGTTCTTGGTGAGGTTTTGGAGCCAAGCATTGTGTATAGCCCACTTGCGCCCTCTCAGCCTCACTATAACTTATATCTGCAAAGCATCGCTGTCAATGGTCAGACACTGCCAATTGATCCATCTGTGTTTGCAACATCAGTAAATCGAGGAACTATTGTAGACTCTGGGACAACATTGGCTTACCTTGTTGAAGAAGCTTATACTCCATTTGTCAGTGCTATAACTGCTGCTGTTTCCCAATCTGTAACTCCTACTATTTCAAAGGGCAACCAGTGTTATCTAGTATCCACTAGTGTAGGCGAGATATTTCCTCTGGTTAGCTTAAACTTCGCGGGCAGTGCATCTATGGTGTTAAAACCCGAAGAATATCTTATGCATCTTGGCTTTTATGATGGTGCTGCATTGTGGTGCATTGGTTTTCAGAAAGTTCAGGAAGGAGTAACGATCTTAGGAGATCTTGTTATGAAAGATAAGATATTTGTATATGATTTGGCTCGCCAGCGAATCGGGTGGGCAAACTATGATTGTTCTCAAGCTGTAAATGTATCAGTCACATCTGGGAAGAACGAGTTCGTCAATGCAGGGCAGTTGAGCGTGAGCGGCTCAACCCGAGATGAGCTCCTTCAATCACTAACTATGGTAGCATTAGCAATGTTAATGAGTCTCATTTTGTTCATCCACTCCCAACTTCTGTGTTTGAGCGATTTCCTTTTCGCAATGGCGGCAGTTTCTCAGACGACGGTTCATGCCTCTCCGGCTTCCCTCTACGTTGGCGACCTTCATCCTGATGTCACTGATGGCCAGCTCTTTGATGCTTTCTCTGGATTCAAGAGCCTCGCCTCTGTTCGTATCTGCAGAGATTCCTCCACTGGGCGCTCTCTCTCTTATGGCTATGTCAATTTCATTTCTCCTCAAGATGCAACCAATGCCATGGAGGTAATGAATCACAGTATGCTGAATGGAAGAGCGATTCGAGTCATGTGGTCACGTCGCGATGCTGATGCAAGAAAAAGTGGAATTGGAAACGTGTTTGTTAAGAACTTGAGCGATTCAATCAATAGTTTAGGACTTCAAGAGCTATTTAAGAAATTTGGAAATGTTCTGTCCAGCAAAGTTGCAACATCCGATGATGGGAAGAGCAAAGGATATGGCTTTGTTCAATTTGAGTCGGAGGATTCTGCAAATGCTGCCATAGAGTCAATGAAT
GGTTTTACCATTGGTGATAAGCAGATATATGTTGGAAAGTTTGTCAGGAAGAGTGATCGAGTTTTGGCCAATGCTGATGTTAAATATACCAATTTGTATGTGAAAAATCTTGACCCAGAGATTGGGGAAGAGCATTTGCAGGAGAAGTTCTCTGAGTTTGGAAAGATTTCCAGCATGATTATTTCACGGGATGAGAATGGGGTGTCAAGGGGTTTTGGCTTTATAAACTTTGACAACTCTGATGATGCCAAACGGGCTTTGGAAACGCTTAATGGATCGCAACTTGGTTCTAAGGTTATCTACATTGCGAGGGCACAAAAGAAGACCGAACGTGAGGAAGTATTACGTCGACATTATGAAGAGAAATGTAAGGAGCAAGTGTTAAAATACAAGGGATCAAATGTGTATGTGAAGAACATTGATGATGATGTTACTGATGAAGAACTAAGAGAACGTTTTAGTCAGTTTGGCACCATCACTTCATCAAAACTTATGCGAGATGACAAGGGAATAAACAAAGGGTTTGGATTTGTCTGCTTCTCCAATCCTGATGAAGCTAAAAGAGCCGTGAACACTTTGCAAGGATGTATGTTTCACGGGAAGCCACTCTATTTGGCCATAGCACAAAGAAAAGAGGATAGACAAATGCAATTAAAGCTTCAGTTTGCACAACGGCTGGCAGGGATTCCTGGACCGTCAACTACAATTTTTCCTGGTGGATACCCACCTTATTACTACCCAGCACCAGGTGTTGTTCCACCAGTACCATCTCGTCCAGGTCTGATGTTTCCGCCTTTAGGAATGAGGCCTGGGTGGAGAGCTAATACTTATACATCTCCTGCAAGACCTGGCTTTCAACCTTCACCTGTACCTATTATTCCAAATGCTTCGAGGCAACCCAGGCAAAACAGAGGCAAAATGAATGGACCAATGCTTTCTCATCAAAATGGAGTCCAAGCCGTCTCTTATATGCAAAATTCACAAGATGCCAATCAATCAGTTGTTACTACAAAATCATCCGGTAATCAACAGAATGGTTTCTGTGGTTATTTATTTTTCTATCTGTTGGGATCTCTTCAATTTTCTTGTGTTGAACTGAAAGGTGCTGATGGTACTAATTTGGACAAAGTTTTAGTTAAGTATGTCCCAAATGCTCGGCCATGCGAAACAAACAAAGCTTCTGGCCCCGCCTCAGCTGCTGCTTTTAATTCTGTTGGAGATGTCTCTCAGGGATCACAAATACTGAGCAGCATGCTTGCTTCCTCTCCCCCAGACCAACAGAAACAGATCCTTGGGGAGCATCTTTACCCCTTAGTCCAAAAACGCAAGCCGGACCTTGCTGCGAAAATTACCGGTATGCTTCTGGAGATGGACAACTCGGAGTTGCTGCTCCTGTTAGAGTCACCTGAATCTCTGGCTGCCAAGGTTGAAGAAGCAGTGCAGGGTCAGGATGGAATGCTTGTTTGGAACTTGGATTTGTCGAATGCTGGTAAAACGTACATCTCTACCATTTGCCCCTTTCGAGCTGTCTTATGTGATGGGTTCATTGTCTCTTATAATTCGTTGGCAGATAATTTTGTTGTCCTTATAAACATGGATTCTTTCCTCCACAACATTGTATTGGTCTCTTACAGGTCTCCTCTTCTTGTGGTTGATCATGGATTGCCAACACTAGAAAGATCATTGTGA 3678 43.88 
MRAFSSLILALAFAVLPATVVNCGFPVPLLSLHRAFPSSLPFQLETLRARDRLRHARILQGVVDFSVEGSSDPLLVGLYFTKVKLGTPPMEFTVQIDTGSDILWVNCNSCNGCPRSSGLGIQLNFFDASSSSSSSLVSCSDPICNSAFQTTATQCLTQSNQCSYTFQYGDGSGTSGYYVSESMYFDMVMGHSMIANSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCLKGEGNGGGILVLGEVLEPSIVYSPLAPSQPHYNLYLQSIAVNGQTLPIDPSVFATSVNRGTIVDSGTTLAYLVEEAYTPFVSAITAAVSQSVTPTISKGNQCYLVSTSVGEIFPLVSLNFAGSASMVLKPEEYLMHLGFYDGAALWCIGFQKVQEGVTILGDLVMKDKIFVYDLARQRIGWANYDCSQAVNVSVTSGKNEFVNAGQLSVSGSTRDELLQSLTMVALAMLMSLILFIHSQLLCLSDFLFAMAAVSQTTVHASPASLYVGDLHPDVTDGQLFDAFSGFKSLASVRICRDSSTGRSLSYGYVNFISPQDATNAMEVMNHSMLNGRAIRVMWSRRDADARKSGIGNVFVKNLSDSINSLGLQELFKKFGNVLSSKVATSDDGKSKGYGFVQFESEDSANAAIESMNGFTIGDKQIYVGKFVRKSDRVLANADVKYTNLYVKNLDPEIGEEHLQEKFSEFGKISSMIISRDENGVSRGFGFINFDNSDDAKRALETLNGSQLGSKVIYIARAQKKTEREEVLRRHYEEKCKEQVLKYKGSNVYVKNIDDDVTDEELRERFSQFGTITSSKLMRDDKGINKGFGFVCFSNPDEAKRAVNTLQGCMFHGKPLYLAIAQRKEDRQMQLKLQFAQRLAGIPGPSTTIFPGGYPPYYYPAPGVVPPVPSRPGLMFPPLGMRPGWRANTYTSPARPGFQPSPVPIIPNASRQPRQNRGKMNGPMLSHQNGVQAVSYMQNSQDANQSVVTTKSSGNQQNGFCGYLFFYLLGSLQFSCVELKGADGTNLDKVLVKYVPNARPCETNKASGPASAAAFNSVGDVSQGSQILSSMLASSPPDQQKQILGEHLYPLVQKRKPDLAAKITGMLLEMDNSELLLLLESPESLAAKVEEAVQGQDGMLVWNLDLSNAGKTYISTICPFRAVLCDGFIVSYNSLADNFVVLINMDSFLHNIVLVSYRSPLLVVDHGLPTLERSL 1225
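The Cds_length, GC_content and Pep_length columns can be cross-checked against the sequences themselves. The following is a minimal Python sketch, not the database's own method. It assumes the CDS includes the terminal stop codon (the sequence above ends in TGA) and that GC_content is the percentage of G and C over all bases. The function names are illustrative and do not come from the database.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def cds_matches_peptide(cds_length: int, pep_length: int) -> bool:
    """True if the CDS is three bases per residue plus one stop codon."""
    return cds_length == 3 * (pep_length + 1)


# Values from the table above: Cds_length 3678, Pep_length 1225.
print(cds_matches_peptide(3678, 1225))

# Toy sequence for illustration only (not taken from the record).
print(gc_content("ATGC"))
```

To check the reported GC_content of 43.88, pass the full Cds string from the table to gc_content.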
       

Gff information


Chromosome Start End Strand Old_gene Gene Num
10 34522892 34535834 - CePI673135_10g019750.1 Cec10g1975 214890
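The genomic span of the gene follows from the Start and End columns. This is a small sketch assuming the coordinates are 1-based and inclusive, as is the convention in GFF files. The helper name is hypothetical.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Start and End from the table above (chromosome 10, minus strand).
span = feature_length(34522892, 34535834)
print(span)  # 12943
```

The span covers the whole gene locus, so it is expected to be larger than the 3678 bp Cds_length, which counts coding bases only.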
       

Annotation information


Seq ID Length Analysis Description Start End IPR GO
Cec10g1975 1225 Gene3D - 1071 1147 - -
Cec10g1975 1225 Gene3D Acid Proteases 63 265 IPR021109 -
Cec10g1975 1225 Gene3D - 678 779 IPR012677 -
Cec10g1975 1225 ProSiteProfiles Eukaryotic RNA Recognition Motif (RRM) profile. 796 873 IPR000504 GO:0003723(InterPro)
Cec10g1975 1225 FunFam Polyadenylate-binding protein 1072 1150 - -
Cec10g1975 1225 FunFam Polyadenylate-binding protein 684 778 - -
Cec10g1975 1225 Gene3D Acid Proteases 266 444 IPR021109 -
Cec10g1975 1225 Pfam Poly-adenylate binding protein, unique domain 1081 1144 IPR002004 GO:0003723(InterPro)
Cec10g1975 1225 Gene3D - 502 602 IPR012677 -
Cec10g1975 1225 Gene3D - 603 675 IPR012677 -
Cec10g1975 1225 FunFam Polyadenylate-binding protein 779 887 - -
Cec10g1975 1225 CDD RRM4_I_PABPs 795 873 - -
Cec10g1975 1225 Gene3D - 780 883 IPR012677 -
Cec10g1975 1225 CDD pepsin_A_like_plant 79 437 IPR034161 -
Cec10g1975 1225 SUPERFAMILY RNA-binding domain, RBD 794 887 IPR035979 GO:0003676(InterPro)
Cec10g1975 1225 SUPERFAMILY Acid proteases 75 441 IPR021109 -
Cec10g1975 1225 Coils Coil 765 785 - -
Cec10g1975 1225 SMART rrm1_1 797 869 IPR000504 GO:0003723(InterPro)
Cec10g1975 1225 SMART rrm1_1 694 766 IPR000504 GO:0003723(InterPro)
Cec10g1975 1225 SMART rrm1_1 515 588 IPR000504 GO:0003723(InterPro)
Cec10g1975 1225 SMART rrm1_1 603 675 IPR000504 GO:0003723(InterPro)
Cec10g1975 1225 NCBIfam polyadenylate binding protein, human types 1, 2, 3, 4 family 514 1144 IPR006515 GO:0003723(InterPro)
Cec10g1975 1225 ProSiteProfiles Poly(A)-binding protein C-terminal (PABC) domain profile. 1074 1151 IPR002004 GO:0003723(InterPro)
Cec10g1975 1225 ProSiteProfiles Eukaryotic RNA Recognition Motif (RRM) profile. 514 592 IPR000504 GO:0003723(InterPro)
Cec10g1975 1225 SUPERFAMILY PABC (PABP) domain 1071 1143 IPR036053 GO:0003723(InterPro)
Cec10g1975 1225 ProSiteProfiles Eukaryotic RNA Recognition Motif (RRM) profile. 693 770 IPR000504 GO:0003723(InterPro)
Cec10g1975 1225 PANTHER RNA BINDING PROTEIN 693 950 - GO:0003723(PANTHER)|GO:0003730(PANTHER)|GO:0005634(PANTHER)|GO:0005829(PANTHER)|GO:0008143(PANTHER)|GO:0008266(PANTHER)|GO:1990904(PANTHER)
Cec10g1975 1225 FunFam Aspartic proteinase-like protein 2 266 444 - -
Cec10g1975 1225 Pfam Xylanase inhibitor N-terminal 79 264 IPR032861 -
Cec10g1975 1225 FunFam Polyadenylate-binding protein 506 598 - -
Cec10g1975 1225 ProSiteProfiles Eukaryotic RNA Recognition Motif (RRM) profile. 602 679 IPR000504 GO:0003723(InterPro)
Cec10g1975 1225 SUPERFAMILY RNA-binding domain, RBD 514 681 IPR035979 GO:0003676(InterPro)
Cec10g1975 1225 PRINTS Pepsin (A1) aspartic protease family signature 85 105 IPR001461 GO:0004190(InterPro)|GO:0006508(InterPro)
Cec10g1975 1225 PRINTS Pepsin (A1) aspartic protease family signature 312 323 IPR001461 GO:0004190(InterPro)|GO:0006508(InterPro)
Cec10g1975 1225 PRINTS Pepsin (A1) aspartic protease family signature 409 424 IPR001461 GO:0004190(InterPro)|GO:0006508(InterPro)
Cec10g1975 1225 Pfam Xylanase inhibitor C-terminal 282 433 IPR032799 -
Cec10g1975 1225 SUPERFAMILY RNA-binding domain, RBD 687 780 IPR035979 GO:0003676(InterPro)
Cec10g1975 1225 CDD RRM2_I_PABPs 600 676 IPR045305 -
Cec10g1975 1225 FunFam Aspartic proteinase-like protein 2 62 265 - -
Cec10g1975 1225 FunFam Polyadenylate-binding protein 599 683 - -
Cec10g1975 1225 SMART rrm2_1 797 869 IPR003954 GO:0003676(InterPro)
Cec10g1975 1225 SMART rrm2_1 694 766 IPR003954 GO:0003676(InterPro)
Cec10g1975 1225 SMART rrm2_1 603 675 IPR003954 GO:0003676(InterPro)
Cec10g1975 1225 ProSiteProfiles Peptidase family A1 domain profile. 79 433 IPR033121 -
Cec10g1975 1225 SMART poly_2 1086 1149 IPR002004 GO:0003723(InterPro)
Cec10g1975 1225 Pfam RNA recognition motif 695 763 IPR000504 GO:0003723(InterPro)
Cec10g1975 1225 Pfam RNA recognition motif 798 866 IPR000504 GO:0003723(InterPro)
Cec10g1975 1225 Pfam RNA recognition motif 516 586 IPR000504 GO:0003723(InterPro)
Cec10g1975 1225 Pfam RNA recognition motif 604 672 IPR000504 GO:0003723(InterPro)
Cec10g1975 1225 CDD RRM3_I_PABPs 692 771 - -
       

Duplication type information


Gene1 Location1 Gene2 Location2 E-value Duplicated-type
Cec05g1192 Cec-Chr5:10914748 Cec10g1975 Cec-Chr10:34522892 1.50E-127 dispersed
Cec08g0749 Cec-Chr8:19232701 Cec10g1975 Cec-Chr10:34522892 1.00E-118 dispersed
Cec10g1975 Cec-Chr10:34522892 Cec11g1703 Cec-Chr11:31251670 1.50E-30 dispersed
Cec03g0194 Cec-Chr3:2015054 Cec10g1975 Cec-Chr10:34522892 2.90E-10 transposed
Cec03g1198 Cec-Chr3:26489873 Cec10g1975 Cec-Chr10:34522892 1.30E-91 transposed
Cec10g1199 Cec-Chr10:26077259 Cec10g1975 Cec-Chr10:34522892 1.20E-14 transposed
Cec11g0983 Cec-Chr11:16376611 Cec10g1975 Cec-Chr10:34522892 2.30E-88 transposed
Cec10g1975 Cec-Chr10:34522892 Cec02g2285 Cec-Chr2:40162579 2.50E-166 wgd
Cec10g1975 Cec-Chr10:34522892 Cec04g1102 Cec-Chr4:30577808 1.20E-165 wgd
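The duplication table can be summarised per duplication type with a few lines of Python. The sketch below assumes each row has exactly six whitespace-separated fields, as in the rows above. The function names and the choice to rank partners by E-value are illustrative and not part of the database.

```python
from collections import defaultdict

# Rows copied from the table above.
ROWS = """\
Cec05g1192 Cec-Chr5:10914748 Cec10g1975 Cec-Chr10:34522892 1.50E-127 dispersed
Cec08g0749 Cec-Chr8:19232701 Cec10g1975 Cec-Chr10:34522892 1.00E-118 dispersed
Cec10g1975 Cec-Chr10:34522892 Cec11g1703 Cec-Chr11:31251670 1.50E-30 dispersed
Cec03g0194 Cec-Chr3:2015054 Cec10g1975 Cec-Chr10:34522892 2.90E-10 transposed
Cec03g1198 Cec-Chr3:26489873 Cec10g1975 Cec-Chr10:34522892 1.30E-91 transposed
Cec10g1199 Cec-Chr10:26077259 Cec10g1975 Cec-Chr10:34522892 1.20E-14 transposed
Cec11g0983 Cec-Chr11:16376611 Cec10g1975 Cec-Chr10:34522892 2.30E-88 transposed
Cec10g1975 Cec-Chr10:34522892 Cec02g2285 Cec-Chr2:40162579 2.50E-166 wgd
Cec10g1975 Cec-Chr10:34522892 Cec04g1102 Cec-Chr4:30577808 1.20E-165 wgd
"""


def parse_pairs(text, query="Cec10g1975"):
    """Return (partner gene, E-value, duplication type) for each row."""
    pairs = []
    for line in text.strip().splitlines():
        gene1, _loc1, gene2, _loc2, evalue, dup_type = line.split()
        # The query gene may appear in either column; keep the other one.
        partner = gene2 if gene1 == query else gene1
        pairs.append((partner, float(evalue), dup_type))
    return pairs


def group_by_type(pairs):
    """Group (partner, E-value) tuples by duplication type."""
    groups = defaultdict(list)
    for partner, evalue, dup_type in pairs:
        groups[dup_type].append((partner, evalue))
    return dict(groups)


pairs = parse_pairs(ROWS)
groups = group_by_type(pairs)
for dup_type, members in groups.items():
    print(dup_type, len(members))

# Partner with the smallest E-value across all rows.
best = min(pairs, key=lambda p: p[1])
print(best[0], best[2])
```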
       

Pathway information


Query KO Definition Second KO KEGG Genes ID GHOSTX Score
Cec10g1975 K13126 - csv:101203793 1087.4