Gene search


Sequence information


Gene Cds Cds_length GC_content Pep Pep_length
Cec09g2121 ATGGAGATTACATACTCATCACCTAGCGCCACCAGTCTTGCTTTCTCTAATTCTGCTCTTTCCGCCATGCCTGCCTCCGGCCGGCCTGTCAAGGTCATTCCATTGCAGCATCCTAGTACGACGTCCGCCTCCTCGGCCGGTGGTTTCTACGCCGGCACTCTGGTGAAGTCATGGACGACGAAGGTTAAGCGGATGACTTGGATTCACTGGATGGAGCTTCTGTTACCTTGCTCCCGTTGGATCAGAACGTATAAATGGCGGGAGTATTTACAGAGCGATATCTTGTCCGGGATTACTATTGGCATCATGCTCGTTCCGCAGGCAATGTCTTATGCAAAACTAGCCGGGCTTCAACCTATATACGGACTTTATTCCGGTTTCCTCCCTTTGTTTGTCTATGCAATTTTTGGTTCTTCTCGTCAGCTTGCAGTTGGTCCAGTAGCATTGGTTTCTCTACTGGTTTCCAATGTCCTGGGTGGAATTGTTAACTCATCTGAGGAATTATATACTGAACTCGCAATATTATTGGCATTCATGGTTGGAATATTGGAATGCGTGATGGGTCTCTTGAGGCTTGGATGGCTTATTCGCTTTATCAGCCACTCTGTAATCTCTGGCTTTACTACAGCTTCTGCCTTTGTGATTGGATTATCCCAAGTGAAATACTTTCTGGGGTATGATGTATCAAGAAGTAGCAGAATCGTGCCTCTAATTGAGAGCATAATAGCTGGAGCAGATGGGTTCTTATGGGCACCTTTTATAATGGGATCAGTCATCCTGGCGGTACTTCAAATCATGAAGCATTTGGGAAAAACAAGGAAGCACTTACGGTTTCTCAGAGTTGCTGGTCCCCTTACAGCCGTTGTTTTGGGCACAACTTTGACGAAAGTATTAAATCTACCTTCCATTACTTTGGTTGGAGACATTCCCCAAGGCCTTCCAACGTTTTCTATTCCTAAAAGATTTGAGCACGTGAAGTCGTTGATTCCAACTGCCTTTCTGATCACTGGAGTGGCTATATTGGAATCTGTTGGGATTGCAAAAGCATTAGCAGCCAAGAATGGGTACGAGTTAGATTCAAATCAGGAGTTATTTGGCCTTGGAGTAGCCAATGTTGTCGGCTCATTTTTTTCTGCATATCCCACAACAGGTTCTTTCTCAAGATCAGCTGTGAACCATGAAAGTGGAGCAAAAACTAGCTTATCTCAGATTGTTACAGGAATCATTATGGGTGGTGCCCTTCTTTTCTTGACTCCGCTGTTTGAGCACATTCCTCAGTGTGCTTTGGCTGCCATTGTCATCTCTGCGGTTTTAACTTTGGTGGATTACGAGGAAGCTATTTTTTTGTGGCGTATAGATAAGAAAGATTTTCTTCTTTGGGTGATTACTGCCATTACTACGTTGTTCCTTGGTATTGAGATTGGTGTCTTAATTGGGGTGGGTGTTTCACTGGTCTTTGTCATTCACGAATCTGCAAATCCACATATGGCTGTATTGGGGCGTCTTCCTGGCACCACTGTGTATAGAAATATTCAACAGTATCCTGAGGCATATACCTATAATGGGATCGTGATTGTTCGAATTGATGCACCAATTTATTTTGCAAACACAAGTTACATCAAAGACAGATTACGTGATTATGAAGTTGAAGTGGATCGATCTACTGGTCGTGGACCGGACGTCGAAAGAGTCTATTTTGTGATTATAGAGATGGCACCTGTTACTTACATAGATTCAAGTGCTGTCCAAGCTCTAAAAGAATTGTATCAAGAGTACAAACTTCGCGATATTCAGATTGCCATTTCCAATCCGAATCGAGATGTTCTGCTTACATTTTCAAGATCTGGCGTCGTCGAGCTTATTGGCAAGGAGTGGTTTTTTGTGAGAGTTCATGATGCAGTTCAAGTTTGTCTTCAACATGTGGAGAGCTTAAATGAAACTAACAAAATATCAGATCCATCACCTAAAGATGGATCATCAAGCTTTCTC
CAAAGTTTACTGAAGTCTAGAAGTGAGGATTTATCAGTTTCTTCGGAGTTCGTTTTACATCAACTCCTTCTTCTTCCCCTTCCTCAGCTGCCTACACGTTTTCTCCTCCTGCTGTTTTGTCACTGCTTTCCTCAGCCATTGTTATTACACTCCGAAGTTGCAGTTCAAGGCCGACGGATGCGATTAAGGATTCTTAGTGGGGTCTCTGCCGGAATGAGTATGGGTAGAAAACCTGATCTCATTTGTGCAATCATTTTGGATTCACACAGTCAAATCACATTAATTCACAATGAAGATAACTGTGGAGATCTCTTAAATGCTTTTGGGAATTCGATTCAACAATATATGGCGATGGCGGACACCTCCAGTGATTTCTACTTATCATGCTTTCCACTTCAATGGCCACTTCTCAACCTCTTCTCCATATATGGATCACTCTGCACTTATCAGGAACTACATTCCTTAGCTGAGACTGAGAGTAATGATGGCACATTTCTACCATGGTTGGAACGAAAAGCAGAGACGAAGATCTCGTCAGTGCTTTCTATTGGGAAATCTTCCATTGGAAGGTCTCTGTTTGCTTCTGAGACTATACGGGCTGGAGATTGTATTTTAAAGGTTCCTTTCAATGTGCAAATTTCACCTGATAGTCTTCCTTTACCCATTAGAGATCTTTTAGGCAATGAGATTGGAAATGTTGCCAAGCTTGCTGTTGTGATTCTTCTTGAACAGAAATTGGGCCTGGGCTCTGAATGGGCGCCTTACATTACCCGGCTTCCTCAACCATGGGAGATGCATAACACAATATTTTGGAATGAAAGTGAGTTGGAGATGATTCGTAAAAGCCCTTTGTATGAGGAATCACTTAATCAAAGATCACAGATTGGAAGGGAATTTCTGGCAATCAGGAACGCTCTGGAAACCTTCCCTGAAATTATTGATCGGATCAATTGTGATGATTTCATGCATGCATATTCCCTTGTTACTTCTAGAGCATGGAGAAGCCCAAAGAGTGTCTCTCTGATTCCATTTGCAGATTTTTTAAATCACAATGGCGTTTCAGAAGCAATGGTATTGAATGATGATGAAAAACAGCTCTCTGAGGTCATTGCTGATCGTGATTATGCCCCTGGTGAACATGTACTAATAAGGTATGGAAAATATTCAAATGCTACGCTGATGTTGGACTTTGGGTTTGCGCTTCCATACAACATTCATGATCAGGTACAGGTCCAGGTTAAAACAGTTAAAGATGATCCTTTGGCAGGCGTAAAGTTGGAACTTTGGCAAAGAAGTTGCACACCAGCTACTGAGTATGTTAATGGCGTTTACTCCCTTGGGAATTCTTTCACCATCAGGGAAGTGAGATGTGCCACTGGGAAAGGGAGAGGTCTTCCCCAATCACTTCGTGCATTTGCTCGTATTTTGTCTTGCACTAATCCTCAGGAATTAAATGAATTAAGTTCTGAAGCTGTTAATGGTGATGGTCGGTTGGCTCGAATTCCACTGAAGAATGTCAATAAAGAGGTTGAAGCACATCGGATTCTGCTTTCTCAATTCAAACAATTAGTTGAAGAGTATAATGCATCTATTGAGGCACTGGGGCCTGTTGATTCTCCCTGTTTGTGCAACAAGTTGGCACGGCGAAGGCTGATGGCCCAACATCTTCTCACTGGTGAGCTTCGCATCCTCAACTCCGCTATTGCTTGGCTGGAGAATTATTGTGATGCCATTTAG 3726 42.83 
MEITYSSPSATSLAFSNSALSAMPASGRPVKVIPLQHPSTTSASSAGGFYAGTLVKSWTTKVKRMTWIHWMELLLPCSRWIRTYKWREYLQSDILSGITIGIMLVPQAMSYAKLAGLQPIYGLYSGFLPLFVYAIFGSSRQLAVGPVALVSLLVSNVLGGIVNSSEELYTELAILLAFMVGILECVMGLLRLGWLIRFISHSVISGFTTASAFVIGLSQVKYFLGYDVSRSSRIVPLIESIIAGADGFLWAPFIMGSVILAVLQIMKHLGKTRKHLRFLRVAGPLTAVVLGTTLTKVLNLPSITLVGDIPQGLPTFSIPKRFEHVKSLIPTAFLITGVAILESVGIAKALAAKNGYELDSNQELFGLGVANVVGSFFSAYPTTGSFSRSAVNHESGAKTSLSQIVTGIIMGGALLFLTPLFEHIPQCALAAIVISAVLTLVDYEEAIFLWRIDKKDFLLWVITAITTLFLGIEIGVLIGVGVSLVFVIHESANPHMAVLGRLPGTTVYRNIQQYPEAYTYNGIVIVRIDAPIYFANTSYIKDRLRDYEVEVDRSTGRGPDVERVYFVIIEMAPVTYIDSSAVQALKELYQEYKLRDIQIAISNPNRDVLLTFSRSGVVELIGKEWFFVRVHDAVQVCLQHVESLNETNKISDPSPKDGSSSFLQSLLKSRSEDLSVSSEFVLHQLLLLPLPQLPTRFLLLLFCHCFPQPLLLHSEVAVQGRRMRLRILSGVSAGMSMGRKPDLICAIILDSHSQITLIHNEDNCGDLLNAFGNSIQQYMAMADTSSDFYLSCFPLQWPLLNLFSIYGSLCTYQELHSLAETESNDGTFLPWLERKAETKISSVLSIGKSSIGRSLFASETIRAGDCILKVPFNVQISPDSLPLPIRDLLGNEIGNVAKLAVVILLEQKLGLGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSPLYEESLNQRSQIGREFLAIRNALETFPEIIDRINCDDFMHAYSLVTSRAWRSPKSVSLIPFADFLNHNGVSEAMVLNDDEKQLSEVIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQVQVQVKTVKDDPLAGVKLELWQRSCTPATEYVNGVYSLGNSFTIREVRCATGKGRGLPQSLRAFARILSCTNPQELNELSSEAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEALGPVDSPCLCNKLARRRLMAQHLLTGELRILNSAIAWLENYCDAI 1241
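The lengths in this row are mutually consistent: a 1241-residue peptide plus one stop codon gives 3 x (1241 + 1) = 3726 nucleotides, and the CDS shown ends in TAG. A minimal Python sketch of that check follows, assuming the CDS includes the stop codon and that GC_content is a percentage of G and C over the full CDS. The function names are hypothetical and not part of the database.

```python
def expected_cds_length(pep_length: int, has_stop: bool = True) -> int:
    """Nucleotide length implied by a peptide length (assumes one stop codon)."""
    codons = pep_length + (1 if has_stop else 0)
    return 3 * codons


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string, rounded to 2 decimals."""
    seq = seq.upper()
    if not seq:
        return 0.0
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Values taken from the Cds_length and Pep_length columns above.
print(expected_cds_length(1241) == 3726)
```

Passing the full CDS string to `gc_content` would allow the GC_content column to be compared in the same way.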
       

Gff information


Chromosome Start End Strand Old_gene Gene Num
9 38189550 38211206 - CePI673135_09g021210.1 Cec09g2121 212631
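The genomic span of the locus can be derived from the Start and End columns. The sketch below assumes the coordinates are 1-based and inclusive, as in the GFF format, so the span is End - Start + 1. The `GffRow` type and helper names are hypothetical.

```python
from typing import NamedTuple


class GffRow(NamedTuple):
    chromosome: str
    start: int
    end: int
    strand: str
    old_gene: str
    gene: str
    num: int


def parse_gff_row(line: str) -> GffRow:
    """Split one whitespace-separated row of the Gff information table."""
    chrom, start, end, strand, old_gene, gene, num = line.split()
    return GffRow(chrom, int(start), int(end), strand, old_gene, gene, int(num))


def span_length(row: GffRow) -> int:
    """Locus length in bp, assuming 1-based inclusive coordinates."""
    return row.end - row.start + 1


row = parse_gff_row("9 38189550 38211206 - CePI673135_09g021210.1 Cec09g2121 212631")
print(row.gene, row.strand, span_length(row))  # Cec09g2121 - 21657
```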
       

Annotation information


Seq ID Length Analysis Description Start End IPR GO
Cec09g2121 1241 Gene3D STAS domain 493 640 IPR036513 -
Cec09g2121 1241 Gene3D set domain protein methyltransferase, domain 1 807 1062 - -
Cec09g2121 1241 FunFam Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit N-methyltransferase, chloroplastic 821 1062 - -
Cec09g2121 1241 SUPERFAMILY SET domain 827 1067 IPR046341 -
Cec09g2121 1241 ProSiteProfiles STAS domain profile. 513 637 IPR002645 -
Cec09g2121 1241 PANTHER SULFATE TRANSPORTER 63 639 IPR001902 GO:0005887(PANTHER)|GO:0016020(InterPro)|GO:0055085(InterPro)
Cec09g2121 1241 FunFam Sulfate transporter 31 493 640 - -
Cec09g2121 1241 Pfam Sulfate permease family 90 463 IPR011547 GO:0016020(InterPro)
Cec09g2121 1241 NCBIfam sulfate permease 76 633 IPR001902 GO:0016020(InterPro)|GO:0055085(InterPro)
Cec09g2121 1241 SUPERFAMILY SpoIIaa-like 515 637 IPR036513 -
Cec09g2121 1241 Gene3D - 1065 1238 IPR036464 -
Cec09g2121 1241 SUPERFAMILY RuBisCo LSMT C-terminal, substrate-binding domain 1060 1239 IPR036464 -
Cec09g2121 1241 CDD STAS_SulP_like_sulfate_transporter 514 630 - -
Cec09g2121 1241 ProSitePatterns SLC26A transporters signature. 119 140 IPR018045 GO:0008271(InterPro)|GO:0008272(InterPro)
Cec09g2121 1241 Pfam Rubisco LSMT substrate-binding 1080 1227 IPR015353 -
Cec09g2121 1241 Pfam STAS domain 514 633 IPR002645 -
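The GO column packs several terms into one field, separated by "|", each followed by its source in parentheses, with "-" meaning no term. A minimal Python sketch for unpacking that field follows; the function name `parse_go` is hypothetical, and the pattern assumes every entry follows the `GO:NNNNNNN(Source)` shape seen in the rows above.

```python
import re

# One entry looks like "GO:0016020(InterPro)".
_GO_ENTRY = re.compile(r"(GO:\d{7})\((\w+)\)")


def parse_go(field: str) -> list[tuple[str, str]]:
    """Return (GO id, source) pairs from one GO cell; '-' yields an empty list."""
    field = field.strip()
    if field == "-":
        return []
    pairs = []
    for item in field.split("|"):
        match = _GO_ENTRY.fullmatch(item.strip())
        if match:  # silently skip entries that do not fit the assumed shape
            pairs.append((match.group(1), match.group(2)))
    return pairs


print(parse_go("GO:0005887(PANTHER)|GO:0016020(InterPro)|GO:0055085(InterPro)"))
```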
       

Duplication type information


Gene1 Location1 Gene2 Location2 E-value Duplicated-type
Cec01g0658 Cec-Chr1:7180070 Cec09g2121 Cec-Chr9:38189550 1.60E-14 dispersed
Cec09g2121 Cec-Chr9:38189550 Cec11g0134 Cec-Chr11:1561254 1.80E-92 dispersed
Cec09g2121 Cec-Chr9:38189550 Cec11g1587 Cec-Chr11:30181370 4.80E-109 transposed
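Because the query gene appears as either Gene1 or Gene2, listing its duplication partners takes a small normalisation step. The sketch below, with the hypothetical helper `partners`, collects the partner of a given gene from each row and sorts by E-value (smallest first). It assumes the six whitespace-separated columns shown above.

```python
ROWS = """\
Cec01g0658 Cec-Chr1:7180070 Cec09g2121 Cec-Chr9:38189550 1.60E-14 dispersed
Cec09g2121 Cec-Chr9:38189550 Cec11g0134 Cec-Chr11:1561254 1.80E-92 dispersed
Cec09g2121 Cec-Chr9:38189550 Cec11g1587 Cec-Chr11:30181370 4.80E-109 transposed
"""


def partners(rows: str, gene: str) -> list[tuple[str, float, str]]:
    """Return (partner gene, E-value, duplication type), sorted by E-value."""
    found = []
    for line in rows.strip().splitlines():
        gene1, _loc1, gene2, _loc2, evalue, dup_type = line.split()
        if gene not in (gene1, gene2):
            continue
        other = gene2 if gene1 == gene else gene1
        found.append((other, float(evalue), dup_type))
    return sorted(found, key=lambda item: item[1])


for partner, evalue, dup_type in partners(ROWS, "Cec09g2121"):
    print(partner, evalue, dup_type)
```

With the three rows above, the transposed pair with Cec11g1587 sorts first because it has the smallest E-value.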
       

Transcription factors information


Select Gene Hmm_acc Hmm_name Score E-value Regulatory Factors Family
30281 PF00916 Sulfate_transp 2.20E-119 CL0062 Cec TR
       

Pathway information


Query KO Definition Second KO KEGG Genes ID GHOSTX Score
Cec09g2121 K18059 - csv:101202870 1220.68