>AT2G07050.1 | CAS1 (cycloartenol synthase 1) cycloartenol synthase
MWKLKIAEGGSPWLRTTNNHVGRQFWEFDPNLGTPEDLAAVEEARKSFSDNRFVQKHSAD
LLMRLQFSRENLISPVLPQVKIEDTDDVTEEMVETTLKRGLDFYSTIQAHDGHWPGDYGG
PMFLLPGLIITLSITGALNTVLSEQHKQEMRRYLYNHQNEDGGWGLHIEGPSTMFGSVLN
YVTLRLLGEGPNDGDGDMEKGRDWILNHGGATNITSWGKMWLSVLGAFEWSGNNPLPPEI
WLLPYFLPIHPGRMWCHCRMVYLPMSYLYGKRFVGPITSTVLSLRKELFTVPYHEVNWNE
ARNLCAKEDLYYPHPLVQDILWASLHKIVEPVLMRWPGANLREKAIRTAIEHIHYEDENT
RYICIGPVNKVLNMLCCWVEDPNSEAFKLHLPRIHDFLWLAEDGMKMQGYNGSQLWDTGF
AIQAILATNLVEEYGPVLEKAHSFVKNSQVLEDCPGDLNYWYRHISKGAWPFSTADHGWP
ISDCTAEGLKAALLLSKVPKAIVGEPIDAKRLYEAVNVIISLQNADGGLATYELTRSYPW
LELINPAETFGDIVIDYPYVECTSAAIQALISFRKLYPGHRKKEVDECIEKAVKFIESIQ
AADGSWYGSWAVCFTYGTWFGVKGLVAVGKTLKNSPHVAKACEFLLSKQQPSGGWGESYL
SCQDKVYSNLDGNRSHVVNTAWAMLALIGAGQAEVDRKPLHRAARYLINAQMENGDFPQQ
EIMGVFNRNCMITYAAYRNIFPIWALGEYRCQVLLQQGE*
>AT5G48010.1 | THAS1 (THALIANOL SYNTHASE 1) catalytic/ thalianol synthase
MWRLRTGPKAGEDTHLFTTNNYAGRQIWEFDANAGSPQEIAEVEDARHKFSDNTSRFKTT
ADLLWRMQFLREKKFEQKIPRVIIEDARKIKYEDAKTALKRGLLYFTALQADDGHWPAEN
SGPNFYTPPFLICLYITGHLEKIFTPEHVKELLRHIYNMQNEDGGWGLHVESHSVMFCTV
INYVCLRIVGEEVGHDDQRNGCAKAHKWIMDHGGATYTPLIGKALLSVLGVYDWSGCNPI
PPEFWLLPSSFPVNGGTLWIYLRDTFMGLSYLYGKKFVAPPTPLILQLREELYPEPYAKI
NWTQTRNRCGKEDLYYPRSFLQDLFWKSVHMFSESILDRWPLNKLIRQRALQSTMALIHY
HDESTRYITGGCLPKAFHMLACWIEDPKSDYFKKHLARVREYIWIGEDGLKIQSFGSQLW
DTALSLHALLDGIDDHDVDDEIKTTLVKGYDYLKKSQITENPRGDHFKMFRHKTKGGWTF
SDQDQGWPVSDCTAESLECCLFFESMPSELIGKKMDVEKLYDAVDYLLYLQSDNGGIAAW
QPVEGKAWLELLNIMIFRYVECTGSAIAALTQFNKQFPGYKNVEVKRFITKAAKYIEDMQ
TVDGSWYGNWGVCFIYGTFFAVRGLVAAGKTYSNCEAIRKAVRFLLDTQNPEGGWGESFL
SCPSKKYTPLKGNSTNVVQTAQALMVLIMGDQMERDPLPVHRAAQVLINSQLDNGDFPQQ
EIMGTFMRTVMLHFPTYRNTFSLWALTHYTHALRRLLP*
>AT5G48010.1 | THAS1 (THALIANOL SYNTHASE 1) catalytic/ thalianol synthase
MWRLRTGPKAGEDTHLFTTNNYAGRQIWEFDANAGSPQEIAEVEDARHKFSDNTSRFKTT
ADLLWRMQFLREKKFEQKIPRVIIEDARKIKYEDAKTALKRGLLYFTALQADDGHWPAEN
SGPNFYTPPFLICLYITGHLEKIFTPEHVKELLRHIYNMQNEDGGWGLHVESHSVMFCTV
INYVCLRIVGEEVGHDDQRNGCAKAHKWIMDHGGATYTPLIGKALLSVLGVYDWSGCNPI
PPEFWLLPSSFPVNGGTLWIYLRDTFMGLSYLYGKKFVAPPTPLILQLREELYPEPYAKI
NWTQTRNRCGKEDLYYPRSFLQDLFWKSVHMFSESILDRWPLNKLIRQRALQSTMALIHY
HDESTRYITGGCLPKAFHMLACWIEDPKSDYFKKHLARVREYIWIGEDGLKIQSFGSQLW
DTALSLHALLDGIDDHDVDDEIKTTLVKGYDYLKKSQITENPRGDHFKMFRHKTKGGWTF
SDQDQGWPVSDCTAESLECCLFFESMPSELIGKKMDVEKLYDAVDYLLYLQSDNGGIAAW
QPVEGKAWLELLNIMIFRYVECTGSAIAALTQFNKQFPGYKNVEVKRFITKAAKYIEDMQ
TVDGSWYGNWGVCFIYGTFFAVRGLVAAGKTYSNCEAIRKAVRFLLDTQNPEGGWGESFL
SCPSKKYTPLKGNSTNVVQTAQALMVLIMGDQMERDPLPVHRAAQVLINSQLDNGDFPQQ
EIMGTFMRTVMLHFPTYRNTFSLWALTHYTHALRRLLP*
>AT5G48010.2 | THAS1 (THALIANOL SYNTHASE 1) catalytic/ thalianol synthase
MWRLRTGPKAGEDTHLFTTNNYAGRQIWEFDANAGSPQEIAEVEDARHKFSDNTSRFKTT
ADLLWRMQFLREKKFEQKIPRVIIEDARKIKYEDAKTALKRGLLYFTALQADDGHWPAEN
SGPNFYTPPFLICLYITGHLEKIFTPEHVKELLRHIYNMQNEDGGWGLHVESHSVMFCTV
INYVCLRIVGEEVGHDDQRNGCAKAHKWIMDHGGATYTPLIGKALLSVLGVYDWSGCNPI
PPEFWLLPSSFPVNGGTLWIYLRDTFMGLSYLYGKKFVAPPTPLILQLREELYPEPYAKI
NWTQTRNRCGKEDLYYPRSFLQDLFWKSVHMFSESILDRWPLNKLIRQRALQSTMALIHY
HDESTRYITGGCLPKAFHMLACWIEDPKSDYFKKHLARVREYIWIGEDGLKIQSFGSQLW
DTALSLHALLDGIDDHDVDDEIKTTLVKGYDYLKKSQITENPRGDHFKMFRHKTKGGWTF
SDQDQGWPVSDCTAESLECCLFFESMPSELIGKKMDVEKLYDAVDYLLYLQSDNGGIAAW
QPVEGKAWLEWLSPVEFLEDTIVEYEYVECTGSAIAALTQFNKQFPGYKNVEVKRFITKA
AKYIEDMQTVDGSWYGNWGVCFIYGTFFAVRGLVAAGKTYSNCEAIRKAVRFLLDTQNPE
GGWGESFLSCPSKKYTPLKGNSTNVVQTAQALMVLIMGDQMERDPLPVHRAAQVLINSQL
DNGDFPQQEIMGTFMRTVMLHFPTYRNTFSLWALTHYTHALRRLLP*
>AT5G48010.2 | THAS1 (THALIANOL SYNTHASE 1) catalytic/ thalianol synthase
MWRLRTGPKAGEDTHLFTTNNYAGRQIWEFDANAGSPQEIAEVEDARHKFSDNTSRFKTT
ADLLWRMQFLREKKFEQKIPRVIIEDARKIKYEDAKTALKRGLLYFTALQADDGHWPAEN
SGPNFYTPPFLICLYITGHLEKIFTPEHVKELLRHIYNMQNEDGGWGLHVESHSVMFCTV
INYVCLRIVGEEVGHDDQRNGCAKAHKWIMDHGGATYTPLIGKALLSVLGVYDWSGCNPI
PPEFWLLPSSFPVNGGTLWIYLRDTFMGLSYLYGKKFVAPPTPLILQLREELYPEPYAKI
NWTQTRNRCGKEDLYYPRSFLQDLFWKSVHMFSESILDRWPLNKLIRQRALQSTMALIHY
HDESTRYITGGCLPKAFHMLACWIEDPKSDYFKKHLARVREYIWIGEDGLKIQSFGSQLW
DTALSLHALLDGIDDHDVDDEIKTTLVKGYDYLKKSQITENPRGDHFKMFRHKTKGGWTF
SDQDQGWPVSDCTAESLECCLFFESMPSELIGKKMDVEKLYDAVDYLLYLQSDNGGIAAW
QPVEGKAWLEWLSPVEFLEDTIVEYEYVECTGSAIAALTQFNKQFPGYKNVEVKRFITKA
AKYIEDMQTVDGSWYGNWGVCFIYGTFFAVRGLVAAGKTYSNCEAIRKAVRFLLDTQNPE
GGWGESFLSCPSKKYTPLKGNSTNVVQTAQALMVLIMGDQMERDPLPVHRAAQVLINSQL
DNGDFPQQEIMGTFMRTVMLHFPTYRNTFSLWALTHYTHALRRLLP*
>AT3G62870.1 | 60S ribosomal protein L7A (RPL7aB)
MAPKKGVKVASKKKPEKVTNPLFERRPKQFGIGGALPPKKDLSRYIKWPKSIRLQRQKRI
LKQRLKVPPALNQFTKTLDKNLATSLFKILLKYRPEDKAAKKERLLNKAQAEAEGKPAES
KKPIVVKYGLNHVTYLIEQNKAQLVVIAHDVDPIELVVWLPALCRKMEVPYCIVKGKSRL
GAVVHQKTAAALCLTTVKNEDKLEFSKILEAIKANFNDKYEEYRKKWGGGIMGSKSQAKT
KAKERVIAKEAAQRMN*
>AT3G07100.1 | protein transport protein Sec24 putative
MGTENQGYPNFPARPASSPFASAPPPGIPPQSGGPPTGSEAVGFRPFTPSASQPTRPFTA
SGPPPAPPVGTMRPGQPSPFVSQIPGSRPPPPSSNSFPSPAYGPPGGAPFQRFPSPPFPT
TQNPPQGPPPPQTLAGHLSPPMSLRPQQPMAPVAMGPPPQSTTSGLPGANAYPPATDYHM
PARPGFQQSMPPVTPSYPGVGGSQPSFPGYPSKQVLQAPTPFQTSQGPPGPPPVSSYPPH
TGGFAQRPNMAAQQNLHPNYAPPPSNVQGLTEDFNSLSLSSIPGSLEPGLDHKSFPRPLD
GDVEPNSFAEMYPMNCHSRYLRLTTSAIPNSQSLASRWHLPLGAVVCPLAETPEGEEVPL
IDFGSTGIIRCRRCRTYVNPFVTFTDSGRKWRCNICSMLNDVPGEYFSHLDATGRRMDMD
QRPELTKGSVEIIAPTEYMVRPPMPPIYFFLIDVSISATKSGMLEVVAQTIKSCLDNLPG
YPRTQIGFITYDSTLHFYNMKSSLSQPQMMVVSDLDDIFVPLPDDLLVNLSESRTVVDAF
LDSLPLMFQDNFNVESAFGPALRAAFMVMNQLGGKLLIFQNSLPSLGAGRLKLRGDDPRV
YGTDKEYALRVAEDPFYKQMAADCTKFQIGINVYAFSDKYTDIASLGTLAKYTGGQVYYY
PGFQSSVHGDKLRHELARDLTRETAWEAVMRIRCGKGIRFSSYHGNFMLRSTDLLALPAV
DCDKAYAMQLSLEETLLTSQTVYFQVALLYTASCGERRIRVHTSVAPVVTDLGEMYRQAD
TGSIVSLYARLAIEKSLSAKLDDARNAIQQKIVKALKEYRNLHAVQHRLGSRLVYPESLK
FLPLYGLAITKSTPLLGGPADTSLDERCAAGFTMMALPVKKLLKLLYPNLFRVDEWLLKP
SAAHDDFKDVLRRLPLAAESLDSRGLYIYDDGFRLVLWFGRMLSPDIAKNLLGVDFAADL
SRVTFQEQENGMSKKLMRLVKKLRESDPSYHPMCFLVRQGEQPREGFLLLRNLIEDQMGG
SSGYVDWILQLHRQVQQN*
>AT5G01410.1 | RSR4 (REDUCED SUGAR RESPONSE 4) protein heterodimerization/ protein homodimerization
MEGTGVVAVYGNGAITEAKKSPFSVKVGLAQMLRGGVIMDVVNAEQARIAEEAGACAVMA
LERVPADIRAQGGVARMSDPQMIKEIKQAVTIPVMAKARIGHFVEAQILEAIGIDYIDES
EVLTLADEDHHINKHNFRIPFVCGCRNLGEALRRIREGAAMIRTKGEAGTGNIIEAVRHV
RSVNGDIRVLRNMDDDEVFTFAKKLAAPYDLVMQTKQLGRLPVVQFAAGGVATPADAALM
MQLGCDGVFVGSGIFKSGDPARRARAIVQAVTHYSDPEMLVEVSCGLGEAMVGINLNDEK
VERFANRSE*
>AT1G02500.1 | SAM1 (S-ADENOSYLMETHIONINE SYNTHETASE 1) methionine adenosyltransferase
METFLFTSESVNEGHPDKLCDQISDAVLDACLEQDPDSKVACETCTKTNMVMVFGEITTK
ATVDYEKIVRDTCRAIGFVSDDVGLDADKCKVLVNIEQQSPDIAQGVHGHFTKCPEEIGA
GDQGHMFGYATDETPELMPLSHVLATKLGARLTEVRKNGTCAWLRPDGKTQVTVEYYNDK
GAMVPIRVHTVLISTQHDETVTNDEIARDLKEHVIKPVIPEKYLDEKTIFHLNPSGRFVI
GGPHGDAGLTGRKIIIDTYGGWGAHGGGAFSGKDPTKVDRSGAYIVRQAAKSVVANGMAR
RALVQVSYAIGVPEPLSVFVDTYETGLIPDKEILKIVKESFDFRPGMMTINLDLKRGGNG
RFLKTAAYGHFGRDDPDFTWEVVKPLKWDKPQA*
>AT1G02500.1 | SAM1 (S-ADENOSYLMETHIONINE SYNTHETASE 1) methionine adenosyltransferase
METFLFTSESVNEGHPDKLCDQISDAVLDACLEQDPDSKVACETCTKTNMVMVFGEITTK
ATVDYEKIVRDTCRAIGFVSDDVGLDADKCKVLVNIEQQSPDIAQGVHGHFTKCPEEIGA
GDQGHMFGYATDETPELMPLSHVLATKLGARLTEVRKNGTCAWLRPDGKTQVTVEYYNDK
GAMVPIRVHTVLISTQHDETVTNDEIARDLKEHVIKPVIPEKYLDEKTIFHLNPSGRFVI
GGPHGDAGLTGRKIIIDTYGGWGAHGGGAFSGKDPTKVDRSGAYIVRQAAKSVVANGMAR
RALVQVSYAIGVPEPLSVFVDTYETGLIPDKEILKIVKESFDFRPGMMTINLDLKRGGNG
RFLKTAAYGHFGRDDPDFTWEVVKPLKWDKPQA*
>AT1G02500.2 | SAM1 (S-ADENOSYLMETHIONINE SYNTHETASE 1) methionine adenosyltransferase
METFLFTSESVNEGHPDKLCDQISDAVLDACLEQDPDSKVACETCTKTNMVMVFGEITTK
ATVDYEKIVRDTCRAIGFVSDDVGLDADKCKVLVNIEQQSPDIAQGVHGHFTKCPEEIGA
GDQGHMFGYATDETPELMPLSHVLATKLGARLTEVRKNGTCAWLRPDGKTQVTVEYYNDK
GAMVPIRVHTVLISTQHDETVTNDEIARDLKEHVIKPVIPEKYLDEKTIFHLNPSGRFVI
GGPHGDAGLTGRKIIIDTYGGWGAHGGGAFSGKDPTKVDRSGAYIVRQAAKSVVANGMAR
RALVQVSYAIGVPEPLSVFVDTYETGLIPDKEILKIVKESFDFRPGMMTINLDLKRGGNG
RFLKTAAYGHFGRDDPDFTWEVVKPLKWDKPQA*
>AT1G02500.2 | SAM1 (S-ADENOSYLMETHIONINE SYNTHETASE 1) methionine adenosyltransferase
METFLFTSESVNEGHPDKLCDQISDAVLDACLEQDPDSKVACETCTKTNMVMVFGEITTK
ATVDYEKIVRDTCRAIGFVSDDVGLDADKCKVLVNIEQQSPDIAQGVHGHFTKCPEEIGA
GDQGHMFGYATDETPELMPLSHVLATKLGARLTEVRKNGTCAWLRPDGKTQVTVEYYNDK
GAMVPIRVHTVLISTQHDETVTNDEIARDLKEHVIKPVIPEKYLDEKTIFHLNPSGRFVI
GGPHGDAGLTGRKIIIDTYGGWGAHGGGAFSGKDPTKVDRSGAYIVRQAAKSVVANGMAR
RALVQVSYAIGVPEPLSVFVDTYETGLIPDKEILKIVKESFDFRPGMMTINLDLKRGGNG
RFLKTAAYGHFGRDDPDFTWEVVKPLKWDKPQA*
>AT5G26340.1 | MSS1 carbohydrate transmembrane transporter/ hexosehydrogen symporter/ high-affinity hydrogenglucose symporter/ sugarhydrogen symporter
MTGGGFATSANGVEFEAKITPIVIISCIMAATGGLMFGYDVGVSGGVTSMPDFLEKFFPV
VYRKVVAGADKDSNYCKYDNQGLQLFTSSLYLAGLTATFFASYTTRTLGRRLTMLIAGVF
FIIGVALNAGAQDLAMLIAGRILLGCGVGFANQAVPLFLSEIAPTRIRGGLNILFQLNVT
IGILFANLVNYGTAKIKGGWGWRLSLGLAGIPALLLTVGALLVTETPNSLVERGRLDEGK
AVLRRIRGTDNVEPEFADLLEASRLAKEVKHPFRNLLQRRNRPQLVIAVALQIFQQCTGI
NAIMFYAPVLFSTLGFGSDASLYSAVVTGAVNVLSTLVSIYSVDKVGRRVLLLEAGVQMF
FSQVVIAIILGVKVTDTSTNLSKGFAILVVVMICTYVAAFAWSWGPLGWLIPSETFPLET
RSAGQSVTVCVNLLFTFIIAQAFLSMLCHFKFGIFIFFSAWVLIMSVFVMFLLPETKNIP
IEEMTERVWKKHWFWARFMDDHNDHEFVNGEKSNGKSNGFDPSTRL*
>AT1G29960.1 | peptidase/ serine-type peptidase
MATPSSSFWNTASREAMKSGVLLAKLYCFLHVTTNYLGFMAYAYGPSMTPTLHPSGNVLL
AERISKRYQKPSRGDIVVIRSPENPNKTPIKRVIGIEGDCISFVIDSRKSDESQTIVVPK
GHVFVQGDYTHNSRDSRNFGTVPYGLIQGRVLWRVWPFQDFGPLGPTPT*
>AT4G16830.1 | nuclear RNA-binding protein (RGGA)
MATLNPFDLLDDDAEDPSQLAVAIEKIDKSKKSGQVSSLPAKSAPKLPSKPLPPAQAVRE
ARSDAPRGGGGRGGFNRGRGGYNRDDGNNGYSGGYTKPSGEGDVSKSSYERRGGGGAPRG
SFRGEGGGPGGGRRGGFSNEGGDGERPRRAFERRSGTGRGSDFKRDGSGRGNWGTPGEEI
AAETEAVAGVETEKDVGEKPAVDDVAADANKEDTVVEEKEPEDKEMTLDEYEKILEEKKK
ALQSLTTSERKVDTKVFESMQQLSNKKSNDEIFIKLGSDKDKRKDDKEEKAKKAVSINEF
LKPAEGGNYYRGGRGGRGRGGRGRGGVSSGESGGYRNEAAPAIGDAAQFPSLGGK*
>AT4G16830.1 | nuclear RNA-binding protein (RGGA)
MATLNPFDLLDDDAEDPSQLAVAIEKIDKSKKSGQVSSLPAKSAPKLPSKPLPPAQAVRE
ARSDAPRGGGGRGGFNRGRGGYNRDDGNNGYSGGYTKPSGEGDVSKSSYERRGGGGAPRG
SFRGEGGGPGGGRRGGFSNEGGDGERPRRAFERRSGTGRGSDFKRDGSGRGNWGTPGEEI
AAETEAVAGVETEKDVGEKPAVDDVAADANKEDTVVEEKEPEDKEMTLDEYEKILEEKKK
ALQSLTTSERKVDTKVFESMQQLSNKKSNDEIFIKLGSDKDKRKDDKEEKAKKAVSINEF
LKPAEGGNYYRGGRGGRGRGGRGRGGVSSGESGGYRNEAAPAIGDAAQFPSLGGK*
>AT4G16830.1 | nuclear RNA-binding protein (RGGA)
MATLNPFDLLDDDAEDPSQLAVAIEKIDKSKKSGQVSSLPAKSAPKLPSKPLPPAQAVRE
ARSDAPRGGGGRGGFNRGRGGYNRDDGNNGYSGGYTKPSGEGDVSKSSYERRGGGGAPRG
SFRGEGGGPGGGRRGGFSNEGGDGERPRRAFERRSGTGRGSDFKRDGSGRGNWGTPGEEI
AAETEAVAGVETEKDVGEKPAVDDVAADANKEDTVVEEKEPEDKEMTLDEYEKILEEKKK
ALQSLTTSERKVDTKVFESMQQLSNKKSNDEIFIKLGSDKDKRKDDKEEKAKKAVSINEF
LKPAEGGNYYRGGRGGRGRGGRGRGGVSSGESGGYRNEAAPAIGDAAQFPSLGGK*
>AT4G16830.2 | nuclear RNA-binding protein (RGGA)
MLHVVVEAVEDLTVVVVVTTVMMVTMDIQGDTLNPQVKEMFQSLLTRGVAGGDGERPRRA
FERRSGTGRGSDFKRDGSGRGNWGTPGEEIAAETEAVAGVETEKDVGEKPAVDDVAADAN
KEDTVVEEKEPEDKEMTLDEYEKILEEKKKALQSLTTSERKVDTKVFESMQQLSNKKSND
EIFIKLGSDKDKRKDDKEEKAKKAVSINEFLKPAEGGNYYRGGRGGRGRGGRGRGGVSSG
ESGGYRNEAAPAIGDAAQFPSLGGK*
>AT4G16830.2 | nuclear RNA-binding protein (RGGA)
MLHVVVEAVEDLTVVVVVTTVMMVTMDIQGDTLNPQVKEMFQSLLTRGVAGGDGERPRRA
FERRSGTGRGSDFKRDGSGRGNWGTPGEEIAAETEAVAGVETEKDVGEKPAVDDVAADAN
KEDTVVEEKEPEDKEMTLDEYEKILEEKKKALQSLTTSERKVDTKVFESMQQLSNKKSND
EIFIKLGSDKDKRKDDKEEKAKKAVSINEFLKPAEGGNYYRGGRGGRGRGGRGRGGVSSG
ESGGYRNEAAPAIGDAAQFPSLGGK*
>AT4G16830.2 | nuclear RNA-binding protein (RGGA)
MLHVVVEAVEDLTVVVVVTTVMMVTMDIQGDTLNPQVKEMFQSLLTRGVAGGDGERPRRA
FERRSGTGRGSDFKRDGSGRGNWGTPGEEIAAETEAVAGVETEKDVGEKPAVDDVAADAN
KEDTVVEEKEPEDKEMTLDEYEKILEEKKKALQSLTTSERKVDTKVFESMQQLSNKKSND
EIFIKLGSDKDKRKDDKEEKAKKAVSINEFLKPAEGGNYYRGGRGGRGRGGRGRGGVSSG
ESGGYRNEAAPAIGDAAQFPSLGGK*
>AT4G16830.3 | nuclear RNA-binding protein (RGGA)
MMMLRIQASSLLPSRRLISPRNLDRFRACLLSQLLSFHRSHFLLLKPVREARSDAPRGGG
GRGGFNRGRGGYNRDDGNNGYSGGYTKPSGEGDVSKSSYERRGGGGAPRGSFRGEGGGPG
GGRRGGFSNEGGDGERPRRAFERRSGTGRGSDFKRDGSGRGNWGTPGEEIAAETEAVAGV
ETEKDVGEKPAVDDVAADANKEDTVVEEKEPEDKEMTLDEYEKILEEKKKALQSLTTSER
KVDTKVFESMQQLSNKKSNDEIFIKLGSDKDKRKDDKEEKAKKAVSINEFLKPAEGGNYY
RGGRGGRGRGGRGRGGVSSGESGGYRNEAAPAIGDAAQFPSLGGK*
>AT4G16830.3 | nuclear RNA-binding protein (RGGA)
MMMLRIQASSLLPSRRLISPRNLDRFRACLLSQLLSFHRSHFLLLKPVREARSDAPRGGG
GRGGFNRGRGGYNRDDGNNGYSGGYTKPSGEGDVSKSSYERRGGGGAPRGSFRGEGGGPG
GGRRGGFSNEGGDGERPRRAFERRSGTGRGSDFKRDGSGRGNWGTPGEEIAAETEAVAGV
ETEKDVGEKPAVDDVAADANKEDTVVEEKEPEDKEMTLDEYEKILEEKKKALQSLTTSER
KVDTKVFESMQQLSNKKSNDEIFIKLGSDKDKRKDDKEEKAKKAVSINEFLKPAEGGNYY
RGGRGGRGRGGRGRGGVSSGESGGYRNEAAPAIGDAAQFPSLGGK*
>AT4G16830.3 | nuclear RNA-binding protein (RGGA)
MMMLRIQASSLLPSRRLISPRNLDRFRACLLSQLLSFHRSHFLLLKPVREARSDAPRGGG
GRGGFNRGRGGYNRDDGNNGYSGGYTKPSGEGDVSKSSYERRGGGGAPRGSFRGEGGGPG
GGRRGGFSNEGGDGERPRRAFERRSGTGRGSDFKRDGSGRGNWGTPGEEIAAETEAVAGV
ETEKDVGEKPAVDDVAADANKEDTVVEEKEPEDKEMTLDEYEKILEEKKKALQSLTTSER
KVDTKVFESMQQLSNKKSNDEIFIKLGSDKDKRKDDKEEKAKKAVSINEFLKPAEGGNYY
RGGRGGRGRGGRGRGGVSSGESGGYRNEAAPAIGDAAQFPSLGGK*
>AT4G39160.1 | DNA binding / transcription factor
MDLDFDDQPSDHAAPAVRAGARFKPKGRPQPKKKQVSLSTTQTTLSPDVAQEKLSTQSED
LVPLDGSSEIPSNALPSETNVPDSGSINKSTIGTLSEENEDAFPRGVHWSVKPSILRACN
NVNLVGNRRDDGIEATTSFPDDPRTQDSAIFGDYVTPETGADEGRVDMETLDIVQEEGTT
SSYVQHTGKLQPKPRLLETVVEEPEPHYSAGDTGYFPMGTNESEFMANVESRNGFSTYED
LQEEELNIPEAPRETVGEMEAQNASGGWEQEEQGVSPCINNTVTGEEENCMGNTVEEQSK
RESKTGKSKRATSRKRKKTSEEPNKSSEKTEQKKFKHSSRRQKRTLEKELLETPDHEIRS
LPLRDMLRLVEYKEWMQKKEAKGAGVQPSQESNNMNGSGSQYHSQGFDEEDEFGDFGIES
SEYQENNVVKPDSPVNYQTYMNKTSRTRWSKEDTELFYEGIQEFGSNLSMIQQLFPERTR
EQMKLKFKLEERRNPLKLNDALSSRSKHFTHFKNVIKKLQQEAAAAKEGEEEEEAGAEAE
TTDVPENEEPEKSEETERASDGVAAGVKESDGGDVENGVRSDGGDECDDDEDFWNSYKSD
M*
>AT1G53530.1 | signal peptidase I family protein
MRMTFLSYLKQWRGTAKEAFENVSIVAKFLCLLHVTDRYIISTTHVHGPSMLPTLNLTGD
VILAEHLSHRFGKIGLGDVVLVRSPRDPKRMVTKRILGLEGDRLTFSADPLVGDASVSVL
VPKGHVWIQGDNLYASTDSRHFGPVPYSLIEGKALLRVWPPEYFGSLR*
>AT1G53530.1 | signal peptidase I family protein
MRMTFLSYLKQWRGTAKEAFENVSIVAKFLCLLHVTDRYIISTTHVHGPSMLPTLNLTGD
VILAEHLSHRFGKIGLGDVVLVRSPRDPKRMVTKRILGLEGDRLTFSADPLVGDASVSVL
VPKGHVWIQGDNLYASTDSRHFGPVPYSLIEGKALLRVWPPEYFGSLR*
>AT1G53530.2 | signal peptidase I family protein
MLPTLNLTGDVILAEHLSHRFGKIGLGDVVLVRSPRDPKRMVTKRILGLEGDRLTFSADP
LVGDASVSVLVPKGHVWIQGDNLYASTDSRHFGPVPYSLIEGKALLRVWPPEYFGSLR*
>AT1G53530.2 | signal peptidase I family protein
MLPTLNLTGDVILAEHLSHRFGKIGLGDVVLVRSPRDPKRMVTKRILGLEGDRLTFSADP
LVGDASVSVLVPKGHVWIQGDNLYASTDSRHFGPVPYSLIEGKALLRVWPPEYFGSLR*
>AT2G05170.1 | ATVPS11 binding / protein binding / transporter/ zinc ion binding
MYQLRKFDFFEEKYGGKIPEDVTGDIQCCSSGRGKVVIGSNDGSVSFLDRGVKFDSGFQA
HSSSVLFLQHLKQRNFLVTVGEDEQISPQQSGMCLKVFDLDKVQEEGTSSSAPECIGILR
IFTNQFPEAKITSFLVLEEVPPILLIAIGLDNGCIYCVKGDIARERITRFKLQVDGRSAI
TGLGFRMDGQALLLFAVTPESVNLFSMQAQPPKLQTLDHIGGSVNTVTMSDRSELIVGRP
EAVYFYEVDGRGPCWAFEGEKKFMGWFRGYLLCVIDDSKTGNTVFNVYDLRNRLIAYSIV
VDKVSNMLCEWGNIILIKADKSLLCITEKDMESKLDMLFKKNLYTVAINLVQSQHADAAA
TANVMRKYGDHLYGKQDFDEAMLQYINTIGYLEPSFVIQKFLDAQRIYNLTNYLEKLHEK
GLASKDHTTLLLNCYTKLKDVEKLNTFIRKEDGIGELKFDVETAIRVCRAANYHEHAMYV
AKKAGKHEWYLKILLEDLGNYDEALQYVSSLEPSQAGVTIEQYGKILIEHKPKETIDILM
RLCTEQGIPNGVFLSMLPSPVDFITVFVQHPHSLMHFLERYAEIVQDSPAQAEINNTLLE
LYLSRDLNFPSISLSENGLDKDLIDHSVAAAVSKADPEKKTNADSKDAMEKDCTERQQKG
LELLKMAWPSDLEQPLYDVDLAVILCEMNSFKDGLLYLYEKMKFYKEVIACYMQNHDHEG
LIACCKRLGDSSKGGDPSLWADLLKYFGEIGEDCTKEVKEVLTYIERDDILPPIIVLQTL
AKNPCLTLSVIKDYIARKLEQESKIIEEDRRAVEKYQETTKNMRKEIEDLRTNARIFQLS
KCTACTFTLDIPAVHFMCMHSFHQRCLGDNEKECPECAPEYRSVMEMKRSLEQNSKDQDL
FFQQVKGSKDGFSVIAEYFGKGIISKTRDATS*
>AT2G28060.1 | protein kinase-related
MNSQNPDDHEDTTVVGFEVPVSPVSSYNNVYSSTEDETRDPPAVPPHLQHSLLGNQGSME
LAYAPQNVVLNHLYIENRDAPRSVVALGFSHRFRTKFVTVVIYKPVQRRGSANV*
>AT2G36930.1 | zinc finger (C2H2 type) family protein
MGRCPTRKVKKRRLSHKTARRDKFEVKGDDLVYTELRKPETEIKPLQLDEDLPGMGQFYC
LHCDRYFSNVSVRDDHFKTKKHKKRVNMMMGQAPHSQLDADLAGGMGMPDNGPKLMSNLV
FTELRKPETEDLPGMGQFNCLLCHRNFSNASVMDYHFKTKKHKKRVKKIERPAPHSQLDA
DLAGGMGMPDNGPKLMSA*
>AT4G12620.1 | ORC1B (ORIGIN OF REPLICATION COMPLEX 1B) DNA binding / double-stranded methylated DNA binding / protein binding
MASTPRAKTFKSPTKTPSNIYRKSYLSPSSTSHTPQTPETHTPLRRSARHVSRKIDLGND
PIDAPGNDPIEGMNLIRKRERAPRKPTTDVVPSKSKKTETPKKKKKIDSFTPVSPIRSET
IKKTKKKKRVYYNKVEFDETEFEIGDDVYVKRREDSNSDEEEDPEIEDCQICFKSDTNIM
IECDDCLGGFHLKCLKPPLKEVPEGDWICQFCEVKKSGQSQTLDLPKPPEGKKLARTMRE
KLLSGDLWAARIDKLWKEVDDGVYWIRARWYMIPEETVSGRQPHNLKRELYLTNDFADIE
MECILRHCSVKCPKEFSKASNDGDDVFLCEYEYDVHWRSFKRLAELADGDSDSDQEWNGR
KEEEVDDSDEEMELDDEVLKSKRGGLTSARGGANSRKGRFFGVEKVGMKLIPEHVRCHKQ
SELEKAKATLLLATRPKSLPCRSKEMEEITSFIKGSISDDQCLGRCMYIHGVPGTGKTIS
VLSVMKNLKAEVEEGSVSPYCFVEINGLKLASPENIYSVIYEALSGHRVGWKKALQCLNE
RFAEGKRIGKEDEKPCILLIDELDLLVTRNQSVLYNILDWPTKPNSKLVVLGIANTMDLP
EKLLPRISSRMGIQRLCFGPYNHTQLQEIISTRLNGIDAFEKTAIEFASRKVAAISGDAR
RALEICRRAAEVADHRLNTNKSAKNQLVIMADVEAAIQEMFQAPHIQVMKSVSKLSKIFL
TAMVHELYKTGMAETTFDRVATTVSSICLTNGEAFPGWDILLKIGCDLGECRIILCEPGE
KHRLQKLQLNFPSDDVAFALKDNKDLPWLANYL*
>AT4G21480.1 | carbohydrate transmembrane transporter/ sugarhydrogen symporter
MPSVGIVIGDGKKEYPGKLTLYVTVTCIVAAMGGLIFGYDIGISGGVTTMDSFQQKFFPS
VYEKQKKDHDSNQYCRFDSVSLTLFTSSLYLAALCSSLVASYVTRQFGRKISMLLGGVLF
CAGALLNGFATAVWMLIVGRLLLGFGIGFTNQSVPLYLSEMAPYKYRGALNIGFQLSITI
GILVANVLNFFFSKISWGWRLSLGGAVVPALIITVGSLILPDTPNSMIERGQFRLAEAKL
RKIRGVDDIDDEINDLIIASEASKLVEHPWRNLLQRKYRPHLTMAILIPAFQQLTGINVI
MFYAPVLFQTIGFGSDAALISAVVTGLVNVGATVVSIYGVDKWGRRFLFLEGGFQMLISQ
VAVAAAIGAKFGVDGTPGVLPKWYAIVVVLFICIYVAAFAWSWGPLGWLVPSEIFPLEIR
SAAQSITVSVNMIFTFLIAQVFLMMLCHLKFGLFIFFAFFVVVMSIFVYLFLPETRGVPI
EEMNRVWRSHWYWSKFVDARRI*
>AT4G22330.1 | ATCES1 catalytic/ hydrolase acting on carbon-nitrogen (but not peptide) bonds in linear amides
MADGISSFWGPVTSTIECCEMNYAYSSYIAEFYNTISNVPGILLALIGLVNALRQRFEKR
FSILHISNMILAIGSMLYHATLQHVQQQSDETPMVWEILLYMYILYSPDWHYRSTMPTFL
FLYGAAFAIVHAYLRFGIGFKVHYVILCLLCIPRMYKYYIHTEDTAAKRIAKWYVATILV
GSICWFCDRVFCKTISQWPVNPQGHALWHVFMSFNSYCANTFLMFCRAQQRGWNPKVKYF
LGVLPYVKIEKPKTQ*
>AT4G29140.1 | MATE efflux protein-related
MCNPSTTTTTTGSENQESRTGLFLDLFSINSFEPTKRNLRHCENRGSPLMAEAVTEAKSL
FTLAFPIAVTALVLYLRSAVSMFFLGQLGDLELAAGSLAIAFANITGYSVLSGLALGMEP
LCSQAFGAHRFKLLSLTLHRTVVFLLVCCVPISVLWFNVGKISVYLHQDPDIAKLAQTYL
IFSLPDLLTNTLLHPIRIYLRAQGIIHPVTLASLSGAVFHLPANLFLVSYLRLGLTGVAV
ASSITNIFVVAFLVCYVWASGLHAPTWTDPTRDCFRGWAPLLRLAGPSCVSVCLEWWWYE
IMIVLCGLLVNPRSTVAAMGVLIQTTSFLYVFPSSLSFAVSTRVGNELGANRPKTAKLTA
TVAIVFAAVTGIIAAAFAYSVRNAWGRIFTGDKEILQLTAAALPILGLCEIGNCPQTVGC
GVVRGTARPSTAANVNLGAFYLVGMPVAVGLGFWAGIGFNGLWVGLLAAQISCAGLMMYV
VGTTDWESEAKKAQTLTCAETVENDIIKAVVASTIDGECDEAEPLIRITVLY*
>AT4G33070.1 | pyruvate decarboxylase putative
MDTKIGSIDDCKPTNGDVCSPTNGTVATIHNSVPSSAITINYCDATLGRHLARRLVQAGV
TDVFSVPGDFNLTLLDHLMAEPDLNLIGCCNELNAGYAADGYARSRGVGACVVTFTVGGL
SVLNAIAGAYSENLPLICIVGGPNSNDYGTNRILHHTIGLPDFSQELRCFQTVTCYQAVV
NNLDDAHEQIDKAISTALKESKPVYISVSCNLAAIPHHTFSRDPVPFSLAPRLSNKMGLE
AAVEATLEFLNKAVKPVMVGGPKLRVAKACDAFVELADASGYALAMMPSAKGFVPEHHPH
FIGTYWGAVSTPFCSEIVESADAYIFAGPIFNDYSSVGYSLLLKKEKAIVVQPDRITVAN
GPTFGCILMSDFFRELSKRVKRNETAYENYHRIFVPEGKPLKCESREPLRVNTMFQHIQK
MLSSETAVIAETGDSWFNCQKLKLPKGCGYEFQMQYGSIGWSVGATLGYAQASPEKRVLA
FIGDGSFQVTVQDISTMLRNGQKTIIFLINNGGYTIEVEIHDGPYNVIKNWNYTGLVDAI
HNGEGNCWTAKVRYEEELVEAITTATTEKKDCLCFIEVILHKDDTSKELLEWGSRVSAAN
SRPPNPQ*
>AT4G37880.1 | protein binding / zinc ion binding
MELKSIKDAFDRVATKQKLSYSKTNEIVHMLSQEIDKALSILEETPSSDTMLLDHRSILA
DVKKVFMEIAPITQLEATEKELHAALTKYPKVLEKQLNPDISKAYRHNVEFDTHIVNQII
ANFFYRQGMFDIGDCFVAETGESECSTRQSFVEMYRILEAMKRRDLEPALNWAVSNSDKL
KEARSDLEMKLHSLHFLEIARGKNSKEAIDYARKHIATFADSCLPEIQKLMCSLLWNRKL
DKSPYSEFLSPALWNNAVKELTRQYCNLLGESSESPLSITVTAGTQALPVLLKYMNVVMA
NKKLDWQTMEQLPVDAQLSEEFQFHSVFVCPVSKEQSSDDNPPMMMSCGHVLCKQTINKM
SKNGSKSSFKCPYCPTDVDISRCRQLHF*
>AT1G13950.1 | ELF5A-1 (EUKARYOTIC ELONGATION FACTOR 5A-1) translation initiation factor
MSDEEHHFESSDAGASKTYPQQAGTIRKNGYIVIKNRPCKVVEVSTSKTGKHGHAKCHFV
AIDIFTSKKLEDIVPSSHNCDVPHVNRTDYQLIDISEDGYVSLLTDNGSTKDDLKLPNDD
TLLQQIKSGFDDGKDLVVSVMSAMGEEQINALKDIGPK*
>AT1G33040.1 | NACA5 (NASCENT POLYPEPTIDE-ASSOCIATED COMPLEX SUBUNIT ALPHA-LIKE PROTEIN 5)
MPGAIVEEEKSQIESIKEQLKLEKEDDVVVEDVKDGEEEDDDEDDEDVEVEGEGGNENAK
QSRSEKKSRKAVLKLGMKPVSDVSRVTIKRAKNVLFVISKPDVYKSPNAETYVIFGEAKV
DDLSSQLQTQAAQRFKMPDVTSMLPNAGSEATMAPLAEEEDEDDVDDTGVEARDIDLVMT
QAGVSKAKAVSALKANDGDIVSAIMELTT*
>AT1G52500.2 | ATMMH-1 (ARABIDOPSIS THALIANA MUTM HOMOLOG-1) DNA N-glycosylase
MPELPEVEAARRAIEENCLGKKIKRVIIADDNKVIHGISPSDFQTSILGKTIISARRKGK
NLWLELDSPPFPSFQFGMAGAIYIKGVAVTKYKRSAVKDSEEWPSKYSKFFVELDDGLEL
SFTDKRRFAKVRLLANPTSVSPISELGPDALLEPMTVDEFAESLAKKKITIKPLLLDQGY
ISGIGNWIADEVLYQARIHPLQTASSLSKEQCEALHTSIKEVIEKAVEVDADSSQFPSYW
IFHNREKKPGKAFVDGKKIDFITAGGRTTAYVPELQKLYGKDAEKAAKVRPAKRGVKPKE
DDGDGEEDEQETEKEDESAKSKKGQKPRGGRGKKPASKTKTEESDDDGDDSEAEEEVVKP
KGRGTKPAIKRKSEEKATSQAGKKPKGRKS*
>AT1G52500.2 | ATMMH-1 (ARABIDOPSIS THALIANA MUTM HOMOLOG-1) DNA N-glycosylase
MPELPEVEAARRAIEENCLGKKIKRVIIADDNKVIHGISPSDFQTSILGKTIISARRKGK
NLWLELDSPPFPSFQFGMAGAIYIKGVAVTKYKRSAVKDSEEWPSKYSKFFVELDDGLEL
SFTDKRRFAKVRLLANPTSVSPISELGPDALLEPMTVDEFAESLAKKKITIKPLLLDQGY
ISGIGNWIADEVLYQARIHPLQTASSLSKEQCEALHTSIKEVIEKAVEVDADSSQFPSYW
IFHNREKKPGKAFVDGKKIDFITAGGRTTAYVPELQKLYGKDAEKAAKVRPAKRGVKPKE
DDGDGEEDEQETEKEDESAKSKKGQKPRGGRGKKPASKTKTEESDDDGDDSEAEEEVVKP
KGRGTKPAIKRKSEEKATSQAGKKPKGRKS*
>AT1G52500.1 | ATMMH-1 (ARABIDOPSIS THALIANA MUTM HOMOLOG-1) DNA N-glycosylase
MPELPEVEAARRAIEENCLGKKIKRVIIADDNKVIHGISPSDFQTSILGKTIISARRKGK
NLWLELDSPPFPSFQFGMAGAIYIKGVAVTKYKRSAVKDSEEWPSKYSKFFVELDDGLEL
SFTDKRRFAKVRLLANPTSVSPISELGPDALLEPMTVDEFAESLAKKKITIKPLLLDQGY
ISGIGNWIADEVLYQARIHPLQTASSLSKEQCEALHTSIKEVIQHAVQVNADSKEFPVEW
LFHFRWGKKAGKVNGKLSHHLSINLMKQNLGFCR*
>AT1G52500.1 | ATMMH-1 (ARABIDOPSIS THALIANA MUTM HOMOLOG-1) DNA N-glycosylase
MPELPEVEAARRAIEENCLGKKIKRVIIADDNKVIHGISPSDFQTSILGKTIISARRKGK
NLWLELDSPPFPSFQFGMAGAIYIKGVAVTKYKRSAVKDSEEWPSKYSKFFVELDDGLEL
SFTDKRRFAKVRLLANPTSVSPISELGPDALLEPMTVDEFAESLAKKKITIKPLLLDQGY
ISGIGNWIADEVLYQARIHPLQTASSLSKEQCEALHTSIKEVIQHAVQVNADSKEFPVEW
LFHFRWGKKAGKVNGKLSHHLSINLMKQNLGFCR*
>AT3G24010.1 | ING1 (INHIBITOR OF GROWTH 1) DNA binding / methylated histone residue binding
MSFAEEFEANLVSLAHVLQKKYALLRDLDKSLQENQRQNEQRCEKEIEDIRRGRAGNITP
NTSLTKFSEEALDEQKHSVRIADEKVTLAMQAYDLVDMHVQQLDQYMKKSDEVIRKEKEA
AAATLELENNGKAGNAGEGGRGGRKKTRLATAASTAAASTGMTSSNMDLDLPVDPNEPTY
CICNQVSFGEMVACDNNACKIEWFHFGCVGLKEQPKGKWYCPECATVKKSRKGR*
>AT3G53030.1 | SRPK4 (Ser/Arg-rich protein kinase 4) kinase/ protein kinase
MEAEKWNSDGGEYTSEDEGTEDYRRGGYHAVRIGDSFKTGRYVVQSKLGWGHFSTVWLSW
DTQSSRYVALKVQKSAQHYTEAAMDEITILQQIAEGDTDDTKCVVKLLDHFKHSGPNGQH
VCMVFEYLGDNLLTLIKYSDYRGLPIPMVKEICYHMLVGLDYLHKQLSIIHTDLKPENVL
LPSTIDPSKDPRKSGAPLVLPTDKDNTVVDSNGDFVKNQKTGSHRKAKLSAQGHAENKGN
TESDKVRGVGSPVNGKQCAAEKSVEEDCPSTSDAIELDGSEKGKQGGKKGSRSSRRHLVA
SADLKCKLVDFGNACWTYKQFTSDIQTRQYRCPEVILGSKYSTSADLWSFACICFELVTG
DVLFDPHSGDNYDRDEDHLALMMELLGMMPRKIALGGRYSRDFFNRHGDLRHIRRLRFWP
MNKVLTEKYEFSEQDANDLSDFLVSILDFVPEKRPTAAQCLLHPWINSGPRSIKPSLKDE
NSDKLDTEKNKRENEEQEAVEVKMGNVVISSLDSKPGMSQSSSTLKLAI*
>AT4G19880.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal (InterProIPR004046) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1635 Blast hits to 1635 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 158 Plants - 57 Viruses - 0 Other Eukaryotes - 480 (source NCBI BLink)
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVYAVHFKCNKKLIREYPNLFN
YTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPFGIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1621 Blast hits to 1621 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 155 Plants - 57 Viruses - 0 Other Eukaryotes - 469 (source NCBI BLink)
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVYAVHFKCNKKLIREYPNLFN
YTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPFGIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal (InterProIPR004046) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1635 Blast hits to 1635 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 158 Plants - 57 Viruses - 0 Other Eukaryotes - 480 (source NCBI BLink)
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSSYFQSKKKKYTICERISNV
ETLIQVYAVHFKCNKKLIREYPNLFNYTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPF
GIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1621 Blast hits to 1621 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 155 Plants - 57 Viruses - 0 Other Eukaryotes - 469 (source NCBI BLink)
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSSYFQSKKKKYTICERISNV
ETLIQVYAVHFKCNKKLIREYPNLFNYTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPF
GIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal (InterProIPR004046) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1635 Blast hits to 1635 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 158 Plants - 57 Viruses - 0 Other Eukaryotes - 480 (source NCBI BLink)
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSLCSPLQMQQETHKGVSEFV
QLHERHFPDPRYE*
>AT4G19880.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1621 Blast hits to 1621 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 155 Plants - 57 Viruses - 0 Other Eukaryotes - 469 (source NCBI BLink)
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSLCSPLQMQQETHKGVSEFV
QLHERHFPDPRYE*
>AT4G19880.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1567 Blast hits to 1567 proteins in 487 species Archae - 12 Bacteria - 899 Metazoa - 23 Fungi - 155 Plants - 55 Viruses - 0 Other Eukaryotes - 423 (source NCBI BLink)
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVYAVHFKCNKKLIREYPNLFN
YTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPFGIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1567 Blast hits to 1567 proteins in 487 species Archae - 12 Bacteria - 899 Metazoa - 23 Fungi - 155 Plants - 55 Viruses - 0 Other Eukaryotes - 423 (source NCBI BLink)
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSSYFQSKKKKYTICERISNV
ETLIQVYAVHFKCNKKLIREYPNLFNYTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPF
GIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1567 Blast hits to 1567 proteins in 487 species Archae - 12 Bacteria - 899 Metazoa - 23 Fungi - 155 Plants - 55 Viruses - 0 Other Eukaryotes - 423 (source NCBI BLink)
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSLCSPLQMQQETHKGVSEFV
QLHERHFPDPRYE*
>AT3G45130.1 | LAS1 lanosterol synthase
MWRLKLSEGDEESVNQHVGRQFWEYDNQFGTSEERHHINHLRSNFTLNRFSSKHSSDLLY
RFQCWKEKGKGMERLPQVKVKEGEERLINEEVVNVTLRRSLRFYSILQSQDGFWPGDYGG
PLFLLPALVIGLYVTEVLDGTLTAQHQIEIRRYLYNHQNKDGGWGLHVEGNSTMFCTVLS
YVALRLMGEELDGGDGAMESARSWIHHHGGATFIPSWGKFWLSVLGAYEWSGNNPLPPEL
WLLPYSLPFHPGRMWCHCRMVYLPMSYLYGRRFVCRTNGTILSLRRELYTIPYHHIDWDT
ARNQCAKEDLYYPHPKIQDVLWSCLNKFGEPLLERWPLNNLRNHALQTVMQHIHYEDQNS
HYICIGPVNKVLNMLCCWVESSNSEAFKSHLSRIKDYLWVAEDGMKMQGYNGSQLWDVTL
AVQAILATNLVDDYGLMLKKAHNYIKNTQIRKDTSGDPGLWYRHPCKGGWGFSTGDNPWP
VSDCTAEALKAALLLSQMPVNLVGEPMPEEHLVDAVNFILSLQNKNGGFASYELTRSYPE
LEVINPSETFGDIIIDYQYVECTSAAIQGLVLFTTLNSSYKRKEIVGSINKAVEFIEKTQ
LPDGSWYGSWGVCFTYATWFGIKGMLASGKTYESSLCIRKACGFLLSKQLCCGGWGESYL
SCQNKVYTNLPGNKSHIVNTSWALLALIEAGQASRDPMPLHRGAKSLINSQMEDGDYPQQ
EILGVFNRNCMISYSAYRNIFPIWALGEYRKLMLSL*
>AT5G42600.1 | MRN1 (MARNERAL SYNTHASE) catalytic/ marneral synthase
MWRLRIGAEARQDPHLFTTNNFAGRQIWEFDANGGSPEELAEVEEARLNFANNKSRFKAS
PDLFWRRQFLREKKFEQKIPRVRIEDAEKITYEDAKTALRRGVLYYAACQANDGHWPSEV
SGSMFLDAPFVICLYITGHLEKIFTLEHVKELLRYMYNTQNEDGGWGLDVESHSVMFCTV
LNYICLRILGVEPDHDGQKSACARARKWILDHGGATYAPMVAKAWLSVLGVYDWSGCKPL
PPEIWMLPSFSPINGGTLWIYIRDLLMGMSYLYGKKFVATPTALILQLREELYPQPYSKI
IWSKARNRCAKEDLLYPKSFGQDLFWEGVHMLSENIINRWPLNKFVRQRALRTTMELVHY
HDETTHYITGACVAKPFHMLACWVEDPDGDYFKKHLARVPDFIWIAEDGLKFQLMGMQSW
NAALSLQVMLAANMDDEIRSTLIKGYDFLKQSQISENPQGDHLKMFRDITKGGWTFQDRE
QGLPISDGTAESIECCIHFHRMPSEFIGEKMDVEKLYDAVNFLIYLQSDNGGMPVWEPAP
GKKWLEWLSPVEHVENTVVEQEYLECTGSVIAGLVCFKKEFPDHRPKEIEKLIKKGLKYI
EDLQMPDGSWYGNWGVCFTYGTLFAVRGLAAAGKTFGNSEAIRRAVQFILNTQNAEGGWG
ESALSCPNKKYIPSKGNVTNVVNTGQAMMVLLIGGQMERDPSPVHRAAKVLINSQLDIGD
FPQQERRGIYMNMLLHYPTYRNMFSLWALALYTNALRLLVS*
>AT4G15370.1 | BARS1 (BARUOL SYNTHASE 1) baruol synthase/ catalytic
MWRLRIGAKAKDNTHLFTTNNYVGRQIWEFDANAGSPEELAEVEEARRNFSNNRSRFKAS
ADLLWRMQFLREKKFEQKIPRVIVEDAEKITYEDAKTALRRGLLYFTALQADDGHWPAEN
AGSIFFNAPFVICLYITGHLEKIFTHEHRVELLRYMYNHQNEDGGWGLHVESPSNMFCSV
INYICLRILGVEAGHDDKGSACARARKWILDHGGATYSPLIGKAWLSVLGVYDWSGCKPI
PPEFWFLPSFFPVNGGTLWIYLRDIFMGLSYLYGKNFVATSTPLILQLREEIYPEPYTNI
SWRQARNRCAKEDLYYPQSFLQDLFWKGVHVFSENILNRWPFNNLIRQRALRTTMELVHY
HDEATRYITGGSVPKVIAVFHMLACWVEDPESDYFKKHLARVPDFIWIGEDGLKIQSFGS
QVWDTALSLHVFIDGFDDDVDEEIRSTLLKGYDYLEKSQVTENPPGDYMKMFRHMAKGGW
TFSDQDQGWPVSDCTAESLECCLFFESMSSEFIGKKMDVEKLYDAVDFLLYLQSDNGGIT
AWQPADGKLVEFIEDAVVEHEYVECTGSAIVALAQFNKQFPGYKKEEVERFITKGVKYIE
DLQMVDGSWYGNWGVCFIYGTFFAVRGLVAAGKCYNNCEAIRRAVRFILDTQNTEGGWGE
SYLSCPRKKYIPLIGNKTNVVNTGQALMVLIMGNQMKRDPLPVHRAAKVLINSQMDNGDF
PQQEIMGVFKMNVMLHFPTYRNMFTLWALTHYTKALRGL*
>AT1G66960.1 | lupeol synthase putative / 23-oxidosqualene-triterpenoid cyclase putative
MWRLKVGEGKGKDPYLFSSNNFVGRQTWEFDPKAGTREERTAVEEARRSFFDNRSRVKPS
SDLLWKMQFLKEAKFEQVIPPVKIDGGEAITYEKATNALRRGVAFLSALQASDGHWPGEF
TGPLCMLPPLVFCLYITGHLEEVFDAEHRKEMLRYIYCHQNEDGGWGFHIESKSIMFTTT
LNYICLRILGVGPDGGLENACKRARQWILSHGGVIYIPCWGKVWLSVLGIYDWSGVNPMP
PEIWLLPYFLPIHLGKAFSYTRITYMPISYLYGKKFVGQITPLIMQLREELHLQPYEEIN
WNKARHLCAKEDKYYPHPLVQDLIWDALHTFVEPLLASWPINKLVRKKALQVAMKHIHYE
DENSHYITIGCIEKNLCMLACWIDNPDGNHFKKHLSRIPDMMWVAEDGMKMQCFGSQLWM
TGFAVQALLASDPRDETYDVLRRAHDYIKKSQVRDNPSGDFKSMYRHISKGGWTLSDRDH
GWQVSDCTAEAAKCCMLLSTMPTDITGEKINLEQLYDSVNLMLSLQSENGGFTAWEPVRA
YKWMELMNPTDLFANAMTEREYTECTSAVLQALVIFNQLYPDHRTKEITKSIEKAVQFIE
SKQLRDGSWYGSWGICFTYGTWFALCGLAAIGKTYNNCLSMRDGVHFLLNIQNEDGGWGE
SYMSCPEQRYIPLEGNRSNVVQTAWAMMALIHAGQAKRDLIPLHSAAKFIITSQLENGDF
PQQELLGASMSTCMLHYSTYKDIFPPWALAEYRKAAFIHHADL*
>AT1G78500.1 | pentacyclic triterpene synthase putative
MWRLKIGAKGGDETHLFTTNNYTGRQTWEFDADACSPEELAEVDEARQNFSINRSRFKIS
ADLLWRMQFLREKKFEQKIPRVEIGDAENITYKDAKTALRRGILYFKALQAEDGHWPAEN
SGCLFFEAPFVICLYITGHLEKILTLEHRKELLRYMYNHQNEDGGWGIHVEGQSAMFCTV
INYICLRILGVEADLDDIKGSGCARARKWILDHGGATYTPLIGKAWLSILGVYDWSGCKP
IPPEVWMLPTFSPFNGGTLWIYFRDIFMGVSYLYGKKFVATPTPLILQLREELYPQPYDK
ILWSQARNQCAKEDLYYPQSFLQEMFWKCVHILSENILNRWPCNKLIRQKALRTTMELLH
YQDEASRYFTGGCVPKPFHMLACWVEDPDGDYFKKHLARVPDYIWIGEDGLKIQSFGSQL
WDTAFSLQVMLAYQDVDDDDDEIRSTLIKGYSFLNKSQLTQNPPGDHRKMLKDIAKGGWT
FSDQDQGWPVSDCTAESLECCLVFGSMPSELIGEKMDVERLYDAVNLLLYFQSKNGGITV
WEAARGRTWLEWLSPVEFMEDTIVEHEYVECTGSAIVALARFLKEFPEHRREEVEKFIKN
AVKYIESFQMPDGSWYGNWGVCFMYGTFFAVRGLVAAGKTYQNCEPIRKAVQFILETQNV
EGGWGESYLSCPNKKYTLLEGNRTNVVNTGQALMVLIMGGQMERDPLPVHRAAKVLINSQ
LDNGDFPQEEIMGVFKMNVMVHYATYRNIFTLWALTYYTKALRVPLC*
>AT1G78970.2 | LUP1 (LUPEOL SYNTHASE 1) beta-amyrin synthase/ lupeol synthase
MWKLKIGKGNGEDPHLFSSNNFVGRQTWKFDHKAGSPEERAAVEEARRGFLDNRFRVKGC
SDLLWRMQFLREKKFEQGIPQLKATNIEEITYETTTNALRRGVRYFTALQASDGHWPGEI
TGPLFFLPPLIFCLYITGHLEEVFDAEHRKEMLRHIYCHQNEDGGWGLHIESKSVMFCTV
LNYICLRMLGENPEQDACKRARQWILDRGGVIFIPSWGKFWLSILGVYDWSGTNPTPPEL
LMLPSFLPIHPGKILCYSRMVSIPMSYLYGKRFVGPITPLILLLREELYLEPYEEINWKK
SRRLYAKEDMYYAHPLVQDLLSDTLQNFVEPLLTRWPLNKLVREKALQLTMKHIHYEDEN
SHYITIGCVEKVLCMLACWVENPNGDYFKKHLARIPDYMWVAEDGMKMQSFGCQLWDTGF
AIQALLASNLPDETDDALKRGHNYIKASQVRENPSGDFRSMYRHISKGAWTFSDRDHGWQ
VSDCTAEALKCCLLLSMMSADIVGQKIDDEQLYDSVNLLLSLQSGNGGVNAWEPSRAYKW
LELLNPTEFMANTMVEREFVECTSSVIQALDLFRKLYPDHRKKEINRSIEKAVQFIQDNQ
TPDGSWYGNWGVCFIYATWFALGGLAAAGETYNDCLAMRNGVHFLLTTQRDDGGWGESYL
SCSEQRYIPSEGERSNLVQTSWAMMALIHTGQAERDLIPLHRAAKLIINSQLENGDFPQQ
EIVGAFMNTCMLHYATYRNTFPLWALAEYRKVVFIVN*
>AT1G78970.2 | LUP1 (LUPEOL SYNTHASE 1) beta-amyrin synthase/ lupeol synthase
MWKLKIGKGNGEDPHLFSSNNFVGRQTWKFDHKAGSPEERAAVEEARRGFLDNRFRVKGC
SDLLWRMQFLREKKFEQGIPQLKATNIEEITYETTTNALRRGVRYFTALQASDGHWPGEI
TGPLFFLPPLIFCLYITGHLEEVFDAEHRKEMLRHIYCHQNEDGGWGLHIESKSVMFCTV
LNYICLRMLGENPEQDACKRARQWILDRGGVIFIPSWGKFWLSILGVYDWSGTNPTPPEL
LMLPSFLPIHPGKILCYSRMVSIPMSYLYGKRFVGPITPLILLLREELYLEPYEEINWKK
SRRLYAKEDMYYAHPLVQDLLSDTLQNFVEPLLTRWPLNKLVREKALQLTMKHIHYEDEN
SHYITIGCVEKVLCMLACWVENPNGDYFKKHLARIPDYMWVAEDGMKMQSFGCQLWDTGF
AIQALLASNLPDETDDALKRGHNYIKASQVRENPSGDFRSMYRHISKGAWTFSDRDHGWQ
VSDCTAEALKCCLLLSMMSADIVGQKIDDEQLYDSVNLLLSLQSGNGGVNAWEPSRAYKW
LELLNPTEFMANTMVEREFVECTSSVIQALDLFRKLYPDHRKKEINRSIEKAVQFIQDNQ
TPDGSWYGNWGVCFIYATWFALGGLAAAGETYNDCLAMRNGVHFLLTTQRDDGGWGESYL
SCSEQRYIPSEGERSNLVQTSWAMMALIHTGQAERDLIPLHRAAKLIINSQLENGDFPQQ
EIVGAFMNTCMLHYATYRNTFPLWALAEYRKVVFIVN*
>AT1G78970.1 | LUP1 (LUPEOL SYNTHASE 1) beta-amyrin synthase/ lupeol synthase
MWKLKIGKGNGEDPHLFSSNNFVGRQTWKFDHKAGSPEERAAVEEARRGFLDNRFRVKGC
SDLLWRMQFLREKKFEQGIPQLKATNIEEITYETTTNALRRGVRYFTALQASDGHWPGEI
TGPLFFLPPLIFCLYITGHLEEVFDAEHRKEMLRHIYCHQNEDGGWGLHIESKSVMFCTV
LNYICLRMLGENPEQDACKRARQWILDRGGVIFIPSWGKFWLSILGVYDWSGTNPTPPEL
LMLPSFLPIHPGKILCYSRMVSIPMSYLYGKRFVGPITPLILLLREELYLEPYEEINWKK
SRRLYAKEDMYYAHPLVQDLLSDTLQNFVEPLLTRWPLNKLVREKALQLTMKHIHYEDEN
SHYITIGCVEKVLCMLACWVENPNGDYFKKHLARIPDYMWVAEDGMKMQSFGCQLWDTGF
AIQALLASNLPDETDDALKRGHNYIKASQVRENPSGDFRSMYRHISKGAWTFSDRDHGWQ
VSDCTAEALKCCLLLSMMSADIVGQKIDDEQLYDSVNLLLSLQSGNGGVNAWEPSRAYKW
LELLNPTEFMANTMVEREFVECTSSVIQALDLFRKLYPDHRKKEINRSIEKAVQFIQDNQ
TPDGSWYGNWGVCFIYATWFALGGLAAAGETYNDCLAMRNGVHFLLTTQRDDGGWGESYL
SCSEQRYIPSEGERSNLVQTSWAMMALIHTGQAERDLIPLHRAAKLIINSQLENGDFPQQ
EIVGAFMNTCMLHYATYRNTFPLWALAEYRKVVFIVN*
>AT1G78970.1 | LUP1 (LUPEOL SYNTHASE 1) beta-amyrin synthase/ lupeol synthase
MWKLKIGKGNGEDPHLFSSNNFVGRQTWKFDHKAGSPEERAAVEEARRGFLDNRFRVKGC
SDLLWRMQFLREKKFEQGIPQLKATNIEEITYETTTNALRRGVRYFTALQASDGHWPGEI
TGPLFFLPPLIFCLYITGHLEEVFDAEHRKEMLRHIYCHQNEDGGWGLHIESKSVMFCTV
LNYICLRMLGENPEQDACKRARQWILDRGGVIFIPSWGKFWLSILGVYDWSGTNPTPPEL
LMLPSFLPIHPGKILCYSRMVSIPMSYLYGKRFVGPITPLILLLREELYLEPYEEINWKK
SRRLYAKEDMYYAHPLVQDLLSDTLQNFVEPLLTRWPLNKLVREKALQLTMKHIHYEDEN
SHYITIGCVEKVLCMLACWVENPNGDYFKKHLARIPDYMWVAEDGMKMQSFGCQLWDTGF
AIQALLASNLPDETDDALKRGHNYIKASQVRENPSGDFRSMYRHISKGAWTFSDRDHGWQ
VSDCTAEALKCCLLLSMMSADIVGQKIDDEQLYDSVNLLLSLQSGNGGVNAWEPSRAYKW
LELLNPTEFMANTMVEREFVECTSSVIQALDLFRKLYPDHRKKEINRSIEKAVQFIQDNQ
TPDGSWYGNWGVCFIYATWFALGGLAAAGETYNDCLAMRNGVHFLLTTQRDDGGWGESYL
SCSEQRYIPSEGERSNLVQTSWAMMALIHTGQAERDLIPLHRAAKLIINSQLENGDFPQQ
EIVGAFMNTCMLHYATYRNTFPLWALAEYRKVVFIVN*
>AT4G15340.1 | ATPEN1 (ARABIDOPSIS THALIANA PENTACYCLIC TRITERPENE SYNTHASE 1) arabidiol synthase
MWRLRIGAKAGNDTHLFTTNNYVGRQIWEFDANAGSPQELAEVEEARRNFSNNRSHYKAS
ADLLWRMQFLREKGFEQKIPRVRVEDAAKIRYEDAKTALKRGLHYFTALQADDGHWPADN
SGPNFFIAPLVICLYITGHLEKIFTVEHRIELIRYMYNHQNEDGGWGLHVESPSIMFCTV
INYICLRIVGVEAGHDDDQGSTCTKARKWILDHGGATYTPLIGKACLSVLGVYDWSGCKP
MPPEFWFLPSSFPINGGTLWIYLRDIFMGLSYLYGKKFVATPTPLILQLQEELYPEPYTK
INWRLTRNRCAKEDLCYPSSFLQDLFWKGVHIFSESILNRWPFNKLIRQAALRTTMKLLH
YQDEANRYITGGSVPKAFHMLACWVEDPEGEYFKKHLARVSDFIWIGEDGLKIQSFGSQL
WDTVMSLHFLLDGVEDDVDDEIRSTLVKGYDYLKKSQVTENPPSDHIKMFRHISKGGWTF
SDKDQGWPVSDCTAESLKCCLLFERMPSEFVGQKMDVEKLFDAVDFLLYLQSDNGGITAW
EPADGKTWLEWFSPVEFVQDTVIEHEYVECTGSAIVALTQFSKQFPEFRKKEVERFITNG
VKYIEDLQMKDGSWCGNWGVCFIYGTLFAVRGLVAAGKTFHNCEPIRRAVRFLLDTQNQE
GGWGESYLSCLRKKYTPLAGNKTNIVSTGQALMVLIMGGQMERDPLPVHRAAKVVINLQL
DNGDFPQQEVMGVFNMNVLLHYPTYRNIYSLWALTLYTQALRRLQP*
>AT5G36150.1 | ATPEN3 (putative pentacyclic triterpene synthase 3) catalytic/ lupeol synthase
MWRLRIGAKAGDDPHLCTTNNFLGRQIWEFDANAGSPAELSEVDQARQNFSNNRSQYKAC
ADLLWRMQFLREKNFEQKIPRVRIEDAKKITFEDAKNTLRRGIHYMAALQSDDGHWPSEN
AGCIFFNAPFVICLYITGHLDKVFSEEHRKEMLRYMYNHQNDDGGWGIDVESHSFMFCTV
INYICLRIFGVDPDHDGESACARARKWIIDHGGATYTPLFGKAWLSVLGVYEWSGCKPIP
PEFWFFPSYFPINGGTLWIYLRDTFMAMSYLYGKKFVAKPTPLILQLREELYPQPYAEIV
WSQARSRCAKEDLYYPQSLVQDLFWKLVHMFSENILNRWPFNKLIREKAIRTAMELIHYH
DEATRYITGGAVPKVFHMLACWVEDPESDYFKKHLARVSHFIWIAEDGLKIQTFGSQIWD
TAFVLQVMLAADVDDEIRPTLIKGYSYLRKSQFTENPPGDYINMFRDISKGGWGYSDKDQ
GWPVSDCISESLECCLIFESMSSEFIGEKMEVERLYDAVNMLLYMQSRNGGISIWEAASG
KKWLEWLSPIEFIEDTILEHEYLECTGSAIVVLARFMKQFPGHRTEEVKKFITKGVKYIE
SLQIADGSWYGNWGICFIYGTFFAVRGLVAAGNTYDNCEAIRRAVRFLLDIQNGEGGWGE
SFLSCPNKNYIPLEGNKTDVVNTGQALMVLIMGGQMDRDPLPVHRAAKVLINSQMDNGDF
PQQEIRGVYKMNVMLNFPTFRNSFTLWALTHYTKAIRLLL*
>AT1G78960.1 | ATLUP2 beta-amyrin synthase/ lupeol synthase
MWKLKIGEGNGEDPYLFSSNNFVGRQTWEFDPKAGTPEERAAVEDARRNYLDNRPRVKGC
SDLLWRMQFLKEAKFEQVIPPVKIDDGEGITYKNATDALRRAVSFYSALQSSDGHWPAEI
TGTLFFLPPLVFCFYITGHLEKIFDAEHRKEMLRHIYCHQNEDGGWGLHIEGKSVMFCTV
LNYICLRMLGEGPNGGRNNACKRARQWILDHGGVTYIPSWGKIWLSILGIYDWSGTNPMP
PEIWLLPSFFPIHLGKTLCYTRMVYMPMSYLYGKRFVGPLTPLIMLLRKELHLQPYEEIN
WNKARRLCAKEDMIYPHPLVQDLLWDTLHNFVEPILTNWPLKKLVREKALRVAMEHIHYE
DENSHYITIGCVEKVLCMLACWIENPNGDHFKKHLARIPDFMWVAEDGLKMQSFGSQLWD
TVFAIQALLACDLSDETDDVLRKGHSFIKKSQVRENPSGDFKSMYRHISKGAWTLSDRDH
GWQVSDCTAEALKCCMLLSMMPAEVVGQKIDPEQLYDSVNLLLSLQGEKGGLTAWEPVRA
QEWLELLNPTDFFTCVMAEREYVECTSAVIQALVLFKQLYPDHRTKEIIKSIEKGVQFIE
SKQTPDGSWHGNWGICFIYATWFALSGLAAAGKTYKSCLAVRKGVDFLLAIQEEDGGWGE
SHLSCPEQRYIPLEGNRSNLVQTAWAMMGLIHAGQAERDPTPLHRAAKLIITSQLENGDF
PQQEILGVFMNTCMLHYATYRNIFPLWALAEYRKAAFATHQDL*
>AT1G78950.1 | beta-amyrin synthase putative
MWRLKIGEGNGDDPYLFTTNNFAGRQTWEFDPDGGSPEERHSVVEARRIFYDNRFHVKAS
SDLLWRMQFLREKKFEQRIAPVKVEDSEKVTFETATSALRRGIHFFSALQASDGHWPAEN
AGPLFFLPPLVFCLYITGHLDEVFTSEHRKEILRYIYCHQKEDGGWGLHIEGHSTMFCTT
LNYICMRILGESPDGGHDNACGRAREWILSHGGVTYIPSWGKTWLSILGVFDWSGSNPMP
PEFWILPSFFPVHPAKMWSYCRMVYLPMSYLYGKRFVGPITSLILQLRKELYLQPYEEIN
WMKVRHLCAKEDTYYPRPLVQELVWDSLYIFAEPFLARWPFNKLLREKALQLAMKHIHYE
DENSRYITIGCVEKVLCMLACWVEDPNGDYFKKHLSRISDYLWMAEDGMKMQSFGSQLWD
TGFAMQALLASNLSSEISDVLRRGHEFIKNSQVGENPSGDYKSMYRHISKGAWTFSDRDH
GWQVSDCTAHGLKCCLLFSMLAPDIVGPKQDPERLHDSVNILLSLQSKNGGMTAWEPAGA
PKWLELLNPTEMFSDIVIEHEYSECTSSAIQALSLFKQLYPDHRTTEITAFIKKAAEYLE
NMQTRDGSWYGNWGICFTYGTWFALAGLAAAGKTFNDCEAIRKGVQFLLAAQKDNGGWGE
SYLSCSKKIYIAQVGEISNVVQTAWALMGLIHSGQAERDPIPLHRAAKLIINSQLESGDF
PQQQATGVFLKNCTLHYAAYRNIHPLWALAEYRARVSLP*
>AT1G78955.1 | CAMS1 (Camelliol C synthase 1) beta-amyrin synthase
MWKLKIANGNKEEPYLFSTNNFLGRQTWEFDPDAGTVEELAAVEEARRKFYDDRFRVKAS
SDLIWRMQFLKEKKFEQVIPPAKVEDANNITSEIATNALRKGVNFLSALQASDGHWPAEN
AGPLFFLPPLVFCLYVTGHLHEIFTQDHRREVLRYIYCHQNEDGGWGLHIEGNSTMFCTT
LNYICMRILGEGPNGGPGNACKRARDWILDHGGATYIPSWGKTWLSILGVFDWSGSNPMP
PEFWILPSFLPIHPAKMWCYCRLVYMPMSYLYGKRFVGPISPLILQLREEIYLQPYAKIN
WNRARHLCAKEDAYCPHPQIQDVIWNCLYIFTEPFLACWPFNKLLREKALGVAMKHIHYE
DENSRYITIGCVEKALCMLACWVEDPNGIHFKKHLLRISDYLWIAEDGMKMQSFGSQLWD
SGFALQALVASNLVNEIPDVLRRGYDFLKNSQVRENPSGDFTNMYRHISKGSWTFSDRDH
GWQASDCTAESFKCCLLLSMIPPDIVGPKMDPEQLYEAVTILLSLQSKNGGVTAWEPARG
QEWLELLNPTEVFADIVVEHEYNECTSSAIQALILFKQLYPNHRTEEINTSIKKAVQYIE
SIQMLDGSWYGSWGVCFTYSTWFGLGGLAAAGKTYNNCLAMRKGVHFLLTTQKDNGGWGE
SYLSCPKKRYIPSEGERSNLVQTSWAMMGLLHAGQAERDPSPLHRAAKLLINSQLENGDF
PQQEITGAFMKNCLLHYAAYRNIFPVWALAEYRRRVPLPYEKPSTERRS*