Gene search


Sequence information


Select Gene Cds Cds_length GC_content Pep Pep_length
Csa04g01897 ATGGGGAAAAACGACGGAGAACAGCCACTGCCCTCCGCCATCGACTCCAGGCCCTCCGGTCTGGTTGCCGATGGTCGATGCTGTTGTGGGTGTGTTTCGATTCGAAGACTCATTGGCTTCAGATGCATCTTCATTCTCCTATTATCCGTTGCCTTGTTCGTTTCTGCTGTTTTTTGGCTGCCCCCTTTTCTCCATTATGCAGATCAAAAGGATCTGGATCTCAATCCGTCGTATCGAGGTCATGATATTGTTGCAACATTTAATGTTGAGAGATCGGTGTCTTTGCTGGAAGACAATTTTGATCAACTCCGAACCGACATTTTTGAAGAGTTCCCTATACCTTCTATCAAAGTGAATATACTGTCTCTAGAACCGTTGTCTGGATCAAACCGTACAAAAGTTGTTTTCAGCCTCGATCCAGATACTGATGATTCGGAAATCTCGTCAACGTATCTAAGTTTAATCAGGTCAATCATTACAAGTCTAGTAACAAATCAGTTCCTCAGCATTACCAAATCCACATTCGGGGAGGCCTATTCGTTTGAAGTGCTGAAATTCCCCGGAGGAATAACGATAATCCCGCCACAGAGTGCATTTCTTTTGCAAAAAGTGCAAATTCTTTTCAACTTTACATTGAACTTCTCTATTCATCAGATTCAAGTACATTTCAGCGAACTGACCAGCCAACTGGAGGCGGGATTAAGACTAGCTCCATATGAGATTTTATATATTAAACTTTGGAATGCGGAAGGTTCAACCGTGACAGATCCTACAATTGTCCAGACGTCTGTACTTCTTGAAGTTGGAAATACTCCGTCGATGCGACGGCTGAAGCAGCTAGCTCAAACAATCTCAGGTTCTAATTCTAGCAACCTCGGCCTGAATAATACGGAGTTTGGTAAGGTGAAACAAGTTCGCCTTTCTTCTATTCTTAAACACTCCCTCAATGGCAGTGATGGGAATGGCCCCGTAAGGTCACCTTCTCCTGCTCCTACACCCCAGCCCCATAACCAACATCATCCTCCAACTCACCACCACCACCACCATCACACCCCTCTAACCCCTGCAATTTCACCTGCTCCTGCAACCGAGAAGGGTGCACCAGAATATGGTTCGCCTGCCCCTGAAAGAAATGCAGCATCACCTAAGAGAAGTTATACGGCTAAGCCGCCTGGTTGTCAATATAGATACAAGAGGAAGTCTGGTAGGAAAGAAGGAAAGCAATCTCATTTAACCCCGCTTGCGTCACCCAATATATCTCCTGATCATTCTGCTGCATCGCCATCACCACAACATCAAATCAACCCACCAGCAGCACCCGTCTCACCAGCTCCGGCATTAACTCCATTGCCAAACGTCATTTACGCTCATGTTCAACCACCTTCCAAAAGCGACTCCAATCACCCCGCAAATCCATCAATTGCACCATCTCCATCTGGCGCAGATCGTTGTCATATGATCACTCAATGGGGATTCACACTGTTTCTTATTCTCGCATGCCATATGCTTTTTGTACTCAAAATGGCCTTACTCTTGTATTGCTCAAAACTGAAGCTTAAAACTTGTTCTGGGTCTGTTTCAATTCTGAATAAGATGGTGAATTTGGGATTGTTTTCTGGAGCCAAACGTGCTAAGTCGGTGAATGTGAGATTCAAATTGCCATACTATACTCATTGGGGTCAAAGCTTGGTAGTTTGTGGCTCTGACTCCTTGGTTGGTTCGTGGAATGTGAAGAAGGGGTTGTTACTGAGTCCTGTTCACCAAGGAGATCAGCTTATATGGTGTGGAAGTATTGCGGTGTCTGATGGATTTGAATGTGAGTACAATTATTATGTTGTTGATGACAATAGAAACGTCTTGAGATGGGAGAAGGGAAACAGGAGAAAGGTGTTGTTACCTCAAGGATTACAGGGTGCGGAGGTCATCGAGCTCCGTGATCTTTGGCAGACTGGTGGAGATGCGATTCCTTTCAAGAGTGCCTTTAAAGATGTCATCTTTGGAAGAAGTTCAACTTTGAGCATAGAAAGACCACTAGGAAATTTCGTTCATAGTTTGGATGAAGATGACTCTGTTCTTGTCCATTTCAAAATTTGCTGTCCAAATATAGAAGAAGATACAACAATCTATGTCATTGGTAGTTCTTCGAAATTAGGACAGTGGAAGGTACAGAATGGAATTAAACTTAGCCATGCGGGAGTTATTTCTTCAGAATTTGGGCAAAACCGGGACCTTCTCCTTGATGCTTCAAATTTTCCACCAAGATATATTTTACTTTCAGATGGCATGTTGCGAGATCTTCCTTGGCGTGGTTCTGGTGTTGCAATCCCCATGTTCTCTGTTAGATCAGACGATGATCTTGGAGTTGGCGAGTTTCTTGACTTGAAATTACTTGTTGACTGGGCTGTGGAGTCTGGTCTCCATTTAGTTCAACTTCTACCTGTTAATGATACATCCGTCCATGGAATGTGGTGGGACTCGTATCCTTACAGTTCACTTTCCGTCTTTGCCCTGCATCCATTGTACTTGAGAGTCCAAGCACTATCAGATAATATTCCAGAGGATATCAAGCTGGAGATTCAAAAGGCAAAAGTTGAACTGGATGGAAAGGATGTGGATTATGAGGCGACTATGGCTGCTAAGCTTACGCTTGCACAGAAAATTTTTGCTCGAGAGAAAGATTCAGTATTGAATTCCAGTTCCTTTCAGAAGTATCTTTCTGAAAACGAGGAATGGCTAAAACCCTATGCCGCGTTCTGTTTTCTGCGAGACTTCTTCGAAACATCAGATCACAGCCAATGGGGTCGTTTCTCTCAATTTTCGAAAGACAAGCTTGAGAAGCTTATCTCAAAGGACAGCTTGCATTATGAAGTCATCTGCTTCCACTATTATATTCAATATCATTTGCACCAACAGTTGTCGGAGGCAGCAAATTATGGAAGAAAGAAAGGAGTGATATTGAAAGGCGATCTTCCGATTGGTGTCGATAAGAATAGTGTAGACACCTGGGTTTACCCTACTTTATTCCGTATGAACACGTCAACCGGAGCACCTCCAGATTACTTTGACAAAAATGGACAAAATTGGGGATTTCCAACATACAACTGGGAGGAGATGTCAAAAGATAACTATGCTTGGTGGCGCGCTCGTTTAACTCAGATGTCAAACTACTTCACAGCTTACAGAATTGATCATATATTGGGTTTCTTTCGAATTTGGGAGCTTCCAGAGCATGCTATGACCGGTTTAGTTGGAAAATTCCGCCCTTCTATTCCGTTAAGCCAGGAAGAGCTCGAAAGAGAGGGAATATGGGACTTTGATCGCTTGAGTCGTCCATATATCAAGGCTGAATTTTTACAGGATAAATTTGGAGCTGCATGGGGGTTTATTGCGTCTCACTTTCTGAATGAATATCAGAAGAACTTTTATGAGTTCAAGGAGGAATGTAACACAGAGAAAAAAATTGCCTCCAAACTGAAGTCACTTATTGAAGAGACACAGTTGCAAAACCCAGACCAGATACGACGCAGTCTTTTTGATCTTATACAGAACATAGTTCTTATGAGAGACCTAGAGAATCCAAGAAGTTTCTATCCACGCTTCAATCTTGAAGATACTTCTAGCTTTAATGATTTGGATGATCACAGTAAGGATGTTCTGAAAAGATTATACTATGACTACTACTTCCACCGGCAAGAGGATCTTTGGCGGAAAAATGCTTTAAAGACCTTGCCTGTTCTCCTTGACTCTTCTGACATGCTGGCCTGTGGAGAAGATCTTGGTCTCATTCCTTCATGTGTTCATCCTGTTATGGAGGAGTTGGGATTAATTGGTTTACGGATTCAACGTATGCCAAATGAACCTGATCTAGAGTTTGGGATCCCTTCTCAATATAGCTACATGACTGTGTGTGCTCCATCCTGCCATGACTGCTCGACTTTGCGTGCTTGGTGGGATGAAGACGAAGAAAGAAGACAACGGTTCATGAAGAACGTTATAGAATCTGATATATTGCCACCTAGCCAATGTATTCCAGAGATAGCACATTTCATCATAAAGCAGCATTTTGAAGCTCCATCTATGTGGGCTATCTTCCCACTTCAGGACCTATTAGCACTGAAAGAGGAATACACAACACGACCTGCAAAAGAGGAGACGATCAATGACCCTACAAATCCAAAGCACTATTGGAGATTCCGTAGTCATGTGACATTAGAATCTCTAATGAAGGATAAAGAACTCCAAGCAACCATCAAAGGTCTTTCCCTTGAGAGTGGGAGATCGGTTCCTCATGATGAAGCCAAACCAGCATCAAAACCAACATCTGTAGATGTTGAAGCTAACGAAGAAAAGATATCTCTAGCAACGAAGTCCAATGGAAAACCACAAAAGGAGACACTCGCAGTCACATGA 4410 42.99 MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDNFDQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPANPSIAPSPSGADRCHMITQWGFTLFLILACHMLFVLKMALLLYCSKLKLKTCSGSVSILNKMVNLGLFSGAKRAKSVNVRFKLPYYTHWGQSLVVCGSDSLVGSWNVKKGLLLSPVHQGDQLIWCGSIAVSDGFECEYNYYVVDDNRNVLRWEKGNRRKVLLPQGLQGAEVIELRDLWQTGGDAIPFKSAFKDVIFGRSSTLSIERPLGNFVHSLDEDDSVLVHFKICCPNIEEDTTIYVIGSSSKLGQWKVQNGIKLSHAGVISSEFGQNRDLLLDASNFPPRYILLSDGMLRDLPWRGSGVAIPMFSVRSDDDLGVGEFLDLKLLVDWAVESGLHLVQLLPVNDTSVHGMWWDSYPYSSLSVFALHPLYLRVQALSDNIPEDIKLEIQKAKVELDGKDVDYEATMAAKLTLAQKIFAREKDSVLNSSSFQKYLSENEEWLKPYAAFCFLRDFFETSDHSQWGRFSQFSKDKLEKLISKDSLHYEVICFHYYIQYHLHQQLSEAANYGRKKGVILKGDLPIGVDKNSVDTWVYPTLFRMNTSTGAPPDYFDKNGQNWGFPTYNWEEMSKDNYAWWRARLTQMSNYFTAYRIDHILGFFRIWELPEHAMTGLVGKFRPSIPLSQEELEREGIWDFDRLSRPYIKAEFLQDKFGAAWGFIASHFLNEYQKNFYEFKEECNTEKKIASKLKSLIEETQLQNPDQIRRSLFDLIQNIVLMRDLENPRSFYPRFNLEDTSSFNDLDDHSKDVLKRLYYDYYFHRQEDLWRKNALKTLPVLLDSSDMLACGEDLGLIPSCVHPVMEELGLIGLRIQRMPNEPDLEFGIPSQYSYMTVCAPSCHDCSTLRAWWDEDEERRQRFMKNVIESDILPPSQCIPEIAHFIIKQHFEAPSMWAIFPLQDLLALKEEYTTRPAKEETINDPTNPKHYWRFRSHVTLESLMKDKELQATIKGLSLESGRSVPHDEAKPASKPTSVDVEANEEKISLATKSNGKPQKETLAVT* 1470
       

Gff information


Chromosome Start End Strand Old_gene Gene Num
4 18711972 18731459 + CsaV3_4G029230.1 Csa04g01897 527507
       

Annotation information


Select Seq ID Length Analysis Description Start End IPR GO
Csa04g01897 1469 Pfam Starch binding domain 691 731 IPR002044 GO:2001070
Csa04g01897 1469 Pfam Starch binding domain 547 638 IPR002044 GO:2001070
Csa04g01897 1469 PANTHER - 538 1427 - -
Csa04g01897 1469 SUPERFAMILY Starch-binding domain-like 546 639 IPR013784 GO:0030246
Csa04g01897 1469 Gene3D Glycosidases 1222 1426 - -
Csa04g01897 1469 Gene3D Glycosidases 768 1098 - -
Csa04g01897 1469 CDD CBM20_DPE2_repeat1 548 648 IPR034840 GO:2001070
Csa04g01897 1469 ProSiteProfiles CBM20 (carbohydrate binding type-20) domain profile. 541 650 IPR002044 GO:2001070
Csa04g01897 1469 MobiDBLite consensus disorder prediction 459 474 - -
Csa04g01897 1469 MobiDBLite consensus disorder prediction 1451 1469 - -
Csa04g01897 1469 MobiDBLite consensus disorder prediction 457 478 - -
Csa04g01897 1469 Pfam 4-alpha-glucanotransferase 775 1402 IPR003385 GO:0004134|GO:0005975
Csa04g01897 1469 SUPERFAMILY (Trans)glycosidases 768 1419 IPR017853 -
Csa04g01897 1469 MobiDBLite consensus disorder prediction 1421 1469 - -
Csa04g01897 1469 SMART CBM_20_2 546 642 IPR002044 GO:2001070
Csa04g01897 1469 SMART CBM_20_2 690 822 IPR002044 GO:2001070
Csa04g01897 1469 SUPERFAMILY Starch-binding domain-like 688 738 IPR013784 GO:0030246
Csa04g01897 1469 MobiDBLite consensus disorder prediction 411 432 - -
Csa04g01897 1469 Gene3D Immunoglobulins 539 650 IPR013783 -
Csa04g01897 1469 MobiDBLite consensus disorder prediction 313 443 - -
       

Event-related genes


Select Gene_1 Gene_2 Event_name
Csa03g03701 Csa04g01897 CCT
       

Duplication type information


Select Gene1 Location1 Gene2 Location2 E-value Duplicated-type
Csa02g01539 Csa-Chr2:14595979 Csa04g01897 Csa-Chr4:18711972 1.16E-24 dispersed
Csa03g03701 Csa-Chr3:32895202 Csa04g01897 Csa-Chr4:18711972 3.54E-122 dispersed
       

Pathway information


Select Query KO Definition Second KO KEGG Genes ID GHOSTX Score
Csa04g01897 K00705 - csv:101216247 1918.66