>AT2G07050.1 |  CAS1 (cycloartenol synthase 1) cycloartenol synthase 
MWKLKIAEGGSPWLRTTNNHVGRQFWEFDPNLGTPEDLAAVEEARKSFSDNRFVQKHSAD 
LLMRLQFSRENLISPVLPQVKIEDTDDVTEEMVETTLKRGLDFYSTIQAHDGHWPGDYGG 
PMFLLPGLIITLSITGALNTVLSEQHKQEMRRYLYNHQNEDGGWGLHIEGPSTMFGSVLN 
YVTLRLLGEGPNDGDGDMEKGRDWILNHGGATNITSWGKMWLSVLGAFEWSGNNPLPPEI 
WLLPYFLPIHPGRMWCHCRMVYLPMSYLYGKRFVGPITSTVLSLRKELFTVPYHEVNWNE 
ARNLCAKEDLYYPHPLVQDILWASLHKIVEPVLMRWPGANLREKAIRTAIEHIHYEDENT 
RYICIGPVNKVLNMLCCWVEDPNSEAFKLHLPRIHDFLWLAEDGMKMQGYNGSQLWDTGF 
AIQAILATNLVEEYGPVLEKAHSFVKNSQVLEDCPGDLNYWYRHISKGAWPFSTADHGWP 
ISDCTAEGLKAALLLSKVPKAIVGEPIDAKRLYEAVNVIISLQNADGGLATYELTRSYPW 
LELINPAETFGDIVIDYPYVECTSAAIQALISFRKLYPGHRKKEVDECIEKAVKFIESIQ 
AADGSWYGSWAVCFTYGTWFGVKGLVAVGKTLKNSPHVAKACEFLLSKQQPSGGWGESYL 
SCQDKVYSNLDGNRSHVVNTAWAMLALIGAGQAEVDRKPLHRAARYLINAQMENGDFPQQ 
EIMGVFNRNCMITYAAYRNIFPIWALGEYRCQVLLQQGE*
>AT5G48010.1 |  THAS1 (THALIANOL SYNTHASE 1) catalytic/ thalianol synthase 
MWRLRTGPKAGEDTHLFTTNNYAGRQIWEFDANAGSPQEIAEVEDARHKFSDNTSRFKTT 
ADLLWRMQFLREKKFEQKIPRVIIEDARKIKYEDAKTALKRGLLYFTALQADDGHWPAEN 
SGPNFYTPPFLICLYITGHLEKIFTPEHVKELLRHIYNMQNEDGGWGLHVESHSVMFCTV 
INYVCLRIVGEEVGHDDQRNGCAKAHKWIMDHGGATYTPLIGKALLSVLGVYDWSGCNPI 
PPEFWLLPSSFPVNGGTLWIYLRDTFMGLSYLYGKKFVAPPTPLILQLREELYPEPYAKI 
NWTQTRNRCGKEDLYYPRSFLQDLFWKSVHMFSESILDRWPLNKLIRQRALQSTMALIHY 
HDESTRYITGGCLPKAFHMLACWIEDPKSDYFKKHLARVREYIWIGEDGLKIQSFGSQLW 
DTALSLHALLDGIDDHDVDDEIKTTLVKGYDYLKKSQITENPRGDHFKMFRHKTKGGWTF 
SDQDQGWPVSDCTAESLECCLFFESMPSELIGKKMDVEKLYDAVDYLLYLQSDNGGIAAW 
QPVEGKAWLELLNIMIFRYVECTGSAIAALTQFNKQFPGYKNVEVKRFITKAAKYIEDMQ 
TVDGSWYGNWGVCFIYGTFFAVRGLVAAGKTYSNCEAIRKAVRFLLDTQNPEGGWGESFL 
SCPSKKYTPLKGNSTNVVQTAQALMVLIMGDQMERDPLPVHRAAQVLINSQLDNGDFPQQ 
EIMGTFMRTVMLHFPTYRNTFSLWALTHYTHALRRLLP*
>AT5G48010.1 |  THAS1 (THALIANOL SYNTHASE 1) catalytic/ thalianol synthase 
MWRLRTGPKAGEDTHLFTTNNYAGRQIWEFDANAGSPQEIAEVEDARHKFSDNTSRFKTT 
ADLLWRMQFLREKKFEQKIPRVIIEDARKIKYEDAKTALKRGLLYFTALQADDGHWPAEN 
SGPNFYTPPFLICLYITGHLEKIFTPEHVKELLRHIYNMQNEDGGWGLHVESHSVMFCTV 
INYVCLRIVGEEVGHDDQRNGCAKAHKWIMDHGGATYTPLIGKALLSVLGVYDWSGCNPI 
PPEFWLLPSSFPVNGGTLWIYLRDTFMGLSYLYGKKFVAPPTPLILQLREELYPEPYAKI 
NWTQTRNRCGKEDLYYPRSFLQDLFWKSVHMFSESILDRWPLNKLIRQRALQSTMALIHY 
HDESTRYITGGCLPKAFHMLACWIEDPKSDYFKKHLARVREYIWIGEDGLKIQSFGSQLW 
DTALSLHALLDGIDDHDVDDEIKTTLVKGYDYLKKSQITENPRGDHFKMFRHKTKGGWTF 
SDQDQGWPVSDCTAESLECCLFFESMPSELIGKKMDVEKLYDAVDYLLYLQSDNGGIAAW 
QPVEGKAWLELLNIMIFRYVECTGSAIAALTQFNKQFPGYKNVEVKRFITKAAKYIEDMQ 
TVDGSWYGNWGVCFIYGTFFAVRGLVAAGKTYSNCEAIRKAVRFLLDTQNPEGGWGESFL 
SCPSKKYTPLKGNSTNVVQTAQALMVLIMGDQMERDPLPVHRAAQVLINSQLDNGDFPQQ 
EIMGTFMRTVMLHFPTYRNTFSLWALTHYTHALRRLLP*
>AT5G48010.2 |  THAS1 (THALIANOL SYNTHASE 1) catalytic/ thalianol synthase 
MWRLRTGPKAGEDTHLFTTNNYAGRQIWEFDANAGSPQEIAEVEDARHKFSDNTSRFKTT 
ADLLWRMQFLREKKFEQKIPRVIIEDARKIKYEDAKTALKRGLLYFTALQADDGHWPAEN 
SGPNFYTPPFLICLYITGHLEKIFTPEHVKELLRHIYNMQNEDGGWGLHVESHSVMFCTV 
INYVCLRIVGEEVGHDDQRNGCAKAHKWIMDHGGATYTPLIGKALLSVLGVYDWSGCNPI 
PPEFWLLPSSFPVNGGTLWIYLRDTFMGLSYLYGKKFVAPPTPLILQLREELYPEPYAKI 
NWTQTRNRCGKEDLYYPRSFLQDLFWKSVHMFSESILDRWPLNKLIRQRALQSTMALIHY 
HDESTRYITGGCLPKAFHMLACWIEDPKSDYFKKHLARVREYIWIGEDGLKIQSFGSQLW 
DTALSLHALLDGIDDHDVDDEIKTTLVKGYDYLKKSQITENPRGDHFKMFRHKTKGGWTF 
SDQDQGWPVSDCTAESLECCLFFESMPSELIGKKMDVEKLYDAVDYLLYLQSDNGGIAAW 
QPVEGKAWLEWLSPVEFLEDTIVEYEYVECTGSAIAALTQFNKQFPGYKNVEVKRFITKA 
AKYIEDMQTVDGSWYGNWGVCFIYGTFFAVRGLVAAGKTYSNCEAIRKAVRFLLDTQNPE 
GGWGESFLSCPSKKYTPLKGNSTNVVQTAQALMVLIMGDQMERDPLPVHRAAQVLINSQL 
DNGDFPQQEIMGTFMRTVMLHFPTYRNTFSLWALTHYTHALRRLLP*
>AT5G48010.2 |  THAS1 (THALIANOL SYNTHASE 1) catalytic/ thalianol synthase 
MWRLRTGPKAGEDTHLFTTNNYAGRQIWEFDANAGSPQEIAEVEDARHKFSDNTSRFKTT 
ADLLWRMQFLREKKFEQKIPRVIIEDARKIKYEDAKTALKRGLLYFTALQADDGHWPAEN 
SGPNFYTPPFLICLYITGHLEKIFTPEHVKELLRHIYNMQNEDGGWGLHVESHSVMFCTV 
INYVCLRIVGEEVGHDDQRNGCAKAHKWIMDHGGATYTPLIGKALLSVLGVYDWSGCNPI 
PPEFWLLPSSFPVNGGTLWIYLRDTFMGLSYLYGKKFVAPPTPLILQLREELYPEPYAKI 
NWTQTRNRCGKEDLYYPRSFLQDLFWKSVHMFSESILDRWPLNKLIRQRALQSTMALIHY 
HDESTRYITGGCLPKAFHMLACWIEDPKSDYFKKHLARVREYIWIGEDGLKIQSFGSQLW 
DTALSLHALLDGIDDHDVDDEIKTTLVKGYDYLKKSQITENPRGDHFKMFRHKTKGGWTF 
SDQDQGWPVSDCTAESLECCLFFESMPSELIGKKMDVEKLYDAVDYLLYLQSDNGGIAAW 
QPVEGKAWLEWLSPVEFLEDTIVEYEYVECTGSAIAALTQFNKQFPGYKNVEVKRFITKA 
AKYIEDMQTVDGSWYGNWGVCFIYGTFFAVRGLVAAGKTYSNCEAIRKAVRFLLDTQNPE 
GGWGESFLSCPSKKYTPLKGNSTNVVQTAQALMVLIMGDQMERDPLPVHRAAQVLINSQL 
DNGDFPQQEIMGTFMRTVMLHFPTYRNTFSLWALTHYTHALRRLLP*
>AT3G62870.1 |  60S ribosomal protein L7A (RPL7aB) 
MAPKKGVKVASKKKPEKVTNPLFERRPKQFGIGGALPPKKDLSRYIKWPKSIRLQRQKRI 
LKQRLKVPPALNQFTKTLDKNLATSLFKILLKYRPEDKAAKKERLLNKAQAEAEGKPAES 
KKPIVVKYGLNHVTYLIEQNKAQLVVIAHDVDPIELVVWLPALCRKMEVPYCIVKGKSRL 
GAVVHQKTAAALCLTTVKNEDKLEFSKILEAIKANFNDKYEEYRKKWGGGIMGSKSQAKT 
KAKERVIAKEAAQRMN*
>AT3G07100.1 |  protein transport protein Sec24 putative 
MGTENQGYPNFPARPASSPFASAPPPGIPPQSGGPPTGSEAVGFRPFTPSASQPTRPFTA 
SGPPPAPPVGTMRPGQPSPFVSQIPGSRPPPPSSNSFPSPAYGPPGGAPFQRFPSPPFPT 
TQNPPQGPPPPQTLAGHLSPPMSLRPQQPMAPVAMGPPPQSTTSGLPGANAYPPATDYHM 
PARPGFQQSMPPVTPSYPGVGGSQPSFPGYPSKQVLQAPTPFQTSQGPPGPPPVSSYPPH 
TGGFAQRPNMAAQQNLHPNYAPPPSNVQGLTEDFNSLSLSSIPGSLEPGLDHKSFPRPLD 
GDVEPNSFAEMYPMNCHSRYLRLTTSAIPNSQSLASRWHLPLGAVVCPLAETPEGEEVPL 
IDFGSTGIIRCRRCRTYVNPFVTFTDSGRKWRCNICSMLNDVPGEYFSHLDATGRRMDMD 
QRPELTKGSVEIIAPTEYMVRPPMPPIYFFLIDVSISATKSGMLEVVAQTIKSCLDNLPG 
YPRTQIGFITYDSTLHFYNMKSSLSQPQMMVVSDLDDIFVPLPDDLLVNLSESRTVVDAF 
LDSLPLMFQDNFNVESAFGPALRAAFMVMNQLGGKLLIFQNSLPSLGAGRLKLRGDDPRV 
YGTDKEYALRVAEDPFYKQMAADCTKFQIGINVYAFSDKYTDIASLGTLAKYTGGQVYYY 
PGFQSSVHGDKLRHELARDLTRETAWEAVMRIRCGKGIRFSSYHGNFMLRSTDLLALPAV 
DCDKAYAMQLSLEETLLTSQTVYFQVALLYTASCGERRIRVHTSVAPVVTDLGEMYRQAD 
TGSIVSLYARLAIEKSLSAKLDDARNAIQQKIVKALKEYRNLHAVQHRLGSRLVYPESLK 
FLPLYGLAITKSTPLLGGPADTSLDERCAAGFTMMALPVKKLLKLLYPNLFRVDEWLLKP 
SAAHDDFKDVLRRLPLAAESLDSRGLYIYDDGFRLVLWFGRMLSPDIAKNLLGVDFAADL 
SRVTFQEQENGMSKKLMRLVKKLRESDPSYHPMCFLVRQGEQPREGFLLLRNLIEDQMGG 
SSGYVDWILQLHRQVQQN*
>AT5G01410.1 |  RSR4 (REDUCED SUGAR RESPONSE 4) protein heterodimerization/ protein homodimerization 
MEGTGVVAVYGNGAITEAKKSPFSVKVGLAQMLRGGVIMDVVNAEQARIAEEAGACAVMA 
LERVPADIRAQGGVARMSDPQMIKEIKQAVTIPVMAKARIGHFVEAQILEAIGIDYIDES 
EVLTLADEDHHINKHNFRIPFVCGCRNLGEALRRIREGAAMIRTKGEAGTGNIIEAVRHV 
RSVNGDIRVLRNMDDDEVFTFAKKLAAPYDLVMQTKQLGRLPVVQFAAGGVATPADAALM 
MQLGCDGVFVGSGIFKSGDPARRARAIVQAVTHYSDPEMLVEVSCGLGEAMVGINLNDEK 
VERFANRSE*
>AT1G02500.1 |  SAM1 (S-ADENOSYLMETHIONINE SYNTHETASE 1) methionine adenosyltransferase 
METFLFTSESVNEGHPDKLCDQISDAVLDACLEQDPDSKVACETCTKTNMVMVFGEITTK 
ATVDYEKIVRDTCRAIGFVSDDVGLDADKCKVLVNIEQQSPDIAQGVHGHFTKCPEEIGA 
GDQGHMFGYATDETPELMPLSHVLATKLGARLTEVRKNGTCAWLRPDGKTQVTVEYYNDK 
GAMVPIRVHTVLISTQHDETVTNDEIARDLKEHVIKPVIPEKYLDEKTIFHLNPSGRFVI 
GGPHGDAGLTGRKIIIDTYGGWGAHGGGAFSGKDPTKVDRSGAYIVRQAAKSVVANGMAR 
RALVQVSYAIGVPEPLSVFVDTYETGLIPDKEILKIVKESFDFRPGMMTINLDLKRGGNG 
RFLKTAAYGHFGRDDPDFTWEVVKPLKWDKPQA*
>AT1G02500.1 |  SAM1 (S-ADENOSYLMETHIONINE SYNTHETASE 1) methionine adenosyltransferase 
METFLFTSESVNEGHPDKLCDQISDAVLDACLEQDPDSKVACETCTKTNMVMVFGEITTK 
ATVDYEKIVRDTCRAIGFVSDDVGLDADKCKVLVNIEQQSPDIAQGVHGHFTKCPEEIGA 
GDQGHMFGYATDETPELMPLSHVLATKLGARLTEVRKNGTCAWLRPDGKTQVTVEYYNDK 
GAMVPIRVHTVLISTQHDETVTNDEIARDLKEHVIKPVIPEKYLDEKTIFHLNPSGRFVI 
GGPHGDAGLTGRKIIIDTYGGWGAHGGGAFSGKDPTKVDRSGAYIVRQAAKSVVANGMAR 
RALVQVSYAIGVPEPLSVFVDTYETGLIPDKEILKIVKESFDFRPGMMTINLDLKRGGNG 
RFLKTAAYGHFGRDDPDFTWEVVKPLKWDKPQA*
>AT1G02500.2 |  SAM1 (S-ADENOSYLMETHIONINE SYNTHETASE 1) methionine adenosyltransferase 
METFLFTSESVNEGHPDKLCDQISDAVLDACLEQDPDSKVACETCTKTNMVMVFGEITTK 
ATVDYEKIVRDTCRAIGFVSDDVGLDADKCKVLVNIEQQSPDIAQGVHGHFTKCPEEIGA 
GDQGHMFGYATDETPELMPLSHVLATKLGARLTEVRKNGTCAWLRPDGKTQVTVEYYNDK 
GAMVPIRVHTVLISTQHDETVTNDEIARDLKEHVIKPVIPEKYLDEKTIFHLNPSGRFVI 
GGPHGDAGLTGRKIIIDTYGGWGAHGGGAFSGKDPTKVDRSGAYIVRQAAKSVVANGMAR 
RALVQVSYAIGVPEPLSVFVDTYETGLIPDKEILKIVKESFDFRPGMMTINLDLKRGGNG 
RFLKTAAYGHFGRDDPDFTWEVVKPLKWDKPQA*
>AT1G02500.2 |  SAM1 (S-ADENOSYLMETHIONINE SYNTHETASE 1) methionine adenosyltransferase 
METFLFTSESVNEGHPDKLCDQISDAVLDACLEQDPDSKVACETCTKTNMVMVFGEITTK 
ATVDYEKIVRDTCRAIGFVSDDVGLDADKCKVLVNIEQQSPDIAQGVHGHFTKCPEEIGA 
GDQGHMFGYATDETPELMPLSHVLATKLGARLTEVRKNGTCAWLRPDGKTQVTVEYYNDK 
GAMVPIRVHTVLISTQHDETVTNDEIARDLKEHVIKPVIPEKYLDEKTIFHLNPSGRFVI 
GGPHGDAGLTGRKIIIDTYGGWGAHGGGAFSGKDPTKVDRSGAYIVRQAAKSVVANGMAR 
RALVQVSYAIGVPEPLSVFVDTYETGLIPDKEILKIVKESFDFRPGMMTINLDLKRGGNG 
RFLKTAAYGHFGRDDPDFTWEVVKPLKWDKPQA*
>AT5G26340.1 |  MSS1 carbohydrate transmembrane transporter/ hexosehydrogen symporter/ high-affinity hydrogenglucose symporter/ sugarhydrogen symporter 
MTGGGFATSANGVEFEAKITPIVIISCIMAATGGLMFGYDVGVSGGVTSMPDFLEKFFPV 
VYRKVVAGADKDSNYCKYDNQGLQLFTSSLYLAGLTATFFASYTTRTLGRRLTMLIAGVF 
FIIGVALNAGAQDLAMLIAGRILLGCGVGFANQAVPLFLSEIAPTRIRGGLNILFQLNVT 
IGILFANLVNYGTAKIKGGWGWRLSLGLAGIPALLLTVGALLVTETPNSLVERGRLDEGK 
AVLRRIRGTDNVEPEFADLLEASRLAKEVKHPFRNLLQRRNRPQLVIAVALQIFQQCTGI 
NAIMFYAPVLFSTLGFGSDASLYSAVVTGAVNVLSTLVSIYSVDKVGRRVLLLEAGVQMF 
FSQVVIAIILGVKVTDTSTNLSKGFAILVVVMICTYVAAFAWSWGPLGWLIPSETFPLET 
RSAGQSVTVCVNLLFTFIIAQAFLSMLCHFKFGIFIFFSAWVLIMSVFVMFLLPETKNIP 
IEEMTERVWKKHWFWARFMDDHNDHEFVNGEKSNGKSNGFDPSTRL*
>AT1G29960.1 |  peptidase/ serine-type peptidase 
MATPSSSFWNTASREAMKSGVLLAKLYCFLHVTTNYLGFMAYAYGPSMTPTLHPSGNVLL 
AERISKRYQKPSRGDIVVIRSPENPNKTPIKRVIGIEGDCISFVIDSRKSDESQTIVVPK 
GHVFVQGDYTHNSRDSRNFGTVPYGLIQGRVLWRVWPFQDFGPLGPTPT*
>AT4G16830.1 |  nuclear RNA-binding protein (RGGA) 
MATLNPFDLLDDDAEDPSQLAVAIEKIDKSKKSGQVSSLPAKSAPKLPSKPLPPAQAVRE 
ARSDAPRGGGGRGGFNRGRGGYNRDDGNNGYSGGYTKPSGEGDVSKSSYERRGGGGAPRG 
SFRGEGGGPGGGRRGGFSNEGGDGERPRRAFERRSGTGRGSDFKRDGSGRGNWGTPGEEI 
AAETEAVAGVETEKDVGEKPAVDDVAADANKEDTVVEEKEPEDKEMTLDEYEKILEEKKK 
ALQSLTTSERKVDTKVFESMQQLSNKKSNDEIFIKLGSDKDKRKDDKEEKAKKAVSINEF 
LKPAEGGNYYRGGRGGRGRGGRGRGGVSSGESGGYRNEAAPAIGDAAQFPSLGGK*
>AT4G16830.1 |  nuclear RNA-binding protein (RGGA) 
MATLNPFDLLDDDAEDPSQLAVAIEKIDKSKKSGQVSSLPAKSAPKLPSKPLPPAQAVRE 
ARSDAPRGGGGRGGFNRGRGGYNRDDGNNGYSGGYTKPSGEGDVSKSSYERRGGGGAPRG 
SFRGEGGGPGGGRRGGFSNEGGDGERPRRAFERRSGTGRGSDFKRDGSGRGNWGTPGEEI 
AAETEAVAGVETEKDVGEKPAVDDVAADANKEDTVVEEKEPEDKEMTLDEYEKILEEKKK 
ALQSLTTSERKVDTKVFESMQQLSNKKSNDEIFIKLGSDKDKRKDDKEEKAKKAVSINEF 
LKPAEGGNYYRGGRGGRGRGGRGRGGVSSGESGGYRNEAAPAIGDAAQFPSLGGK*
>AT4G16830.1 |  nuclear RNA-binding protein (RGGA) 
MATLNPFDLLDDDAEDPSQLAVAIEKIDKSKKSGQVSSLPAKSAPKLPSKPLPPAQAVRE 
ARSDAPRGGGGRGGFNRGRGGYNRDDGNNGYSGGYTKPSGEGDVSKSSYERRGGGGAPRG 
SFRGEGGGPGGGRRGGFSNEGGDGERPRRAFERRSGTGRGSDFKRDGSGRGNWGTPGEEI 
AAETEAVAGVETEKDVGEKPAVDDVAADANKEDTVVEEKEPEDKEMTLDEYEKILEEKKK 
ALQSLTTSERKVDTKVFESMQQLSNKKSNDEIFIKLGSDKDKRKDDKEEKAKKAVSINEF 
LKPAEGGNYYRGGRGGRGRGGRGRGGVSSGESGGYRNEAAPAIGDAAQFPSLGGK*
>AT4G16830.2 |  nuclear RNA-binding protein (RGGA) 
MLHVVVEAVEDLTVVVVVTTVMMVTMDIQGDTLNPQVKEMFQSLLTRGVAGGDGERPRRA 
FERRSGTGRGSDFKRDGSGRGNWGTPGEEIAAETEAVAGVETEKDVGEKPAVDDVAADAN 
KEDTVVEEKEPEDKEMTLDEYEKILEEKKKALQSLTTSERKVDTKVFESMQQLSNKKSND 
EIFIKLGSDKDKRKDDKEEKAKKAVSINEFLKPAEGGNYYRGGRGGRGRGGRGRGGVSSG 
ESGGYRNEAAPAIGDAAQFPSLGGK*
>AT4G16830.2 |  nuclear RNA-binding protein (RGGA) 
MLHVVVEAVEDLTVVVVVTTVMMVTMDIQGDTLNPQVKEMFQSLLTRGVAGGDGERPRRA 
FERRSGTGRGSDFKRDGSGRGNWGTPGEEIAAETEAVAGVETEKDVGEKPAVDDVAADAN 
KEDTVVEEKEPEDKEMTLDEYEKILEEKKKALQSLTTSERKVDTKVFESMQQLSNKKSND 
EIFIKLGSDKDKRKDDKEEKAKKAVSINEFLKPAEGGNYYRGGRGGRGRGGRGRGGVSSG 
ESGGYRNEAAPAIGDAAQFPSLGGK*
>AT4G16830.2 |  nuclear RNA-binding protein (RGGA) 
MLHVVVEAVEDLTVVVVVTTVMMVTMDIQGDTLNPQVKEMFQSLLTRGVAGGDGERPRRA 
FERRSGTGRGSDFKRDGSGRGNWGTPGEEIAAETEAVAGVETEKDVGEKPAVDDVAADAN 
KEDTVVEEKEPEDKEMTLDEYEKILEEKKKALQSLTTSERKVDTKVFESMQQLSNKKSND 
EIFIKLGSDKDKRKDDKEEKAKKAVSINEFLKPAEGGNYYRGGRGGRGRGGRGRGGVSSG 
ESGGYRNEAAPAIGDAAQFPSLGGK*
>AT4G16830.3 |  nuclear RNA-binding protein (RGGA) 
MMMLRIQASSLLPSRRLISPRNLDRFRACLLSQLLSFHRSHFLLLKPVREARSDAPRGGG 
GRGGFNRGRGGYNRDDGNNGYSGGYTKPSGEGDVSKSSYERRGGGGAPRGSFRGEGGGPG 
GGRRGGFSNEGGDGERPRRAFERRSGTGRGSDFKRDGSGRGNWGTPGEEIAAETEAVAGV 
ETEKDVGEKPAVDDVAADANKEDTVVEEKEPEDKEMTLDEYEKILEEKKKALQSLTTSER 
KVDTKVFESMQQLSNKKSNDEIFIKLGSDKDKRKDDKEEKAKKAVSINEFLKPAEGGNYY 
RGGRGGRGRGGRGRGGVSSGESGGYRNEAAPAIGDAAQFPSLGGK*
>AT4G16830.3 |  nuclear RNA-binding protein (RGGA) 
MMMLRIQASSLLPSRRLISPRNLDRFRACLLSQLLSFHRSHFLLLKPVREARSDAPRGGG 
GRGGFNRGRGGYNRDDGNNGYSGGYTKPSGEGDVSKSSYERRGGGGAPRGSFRGEGGGPG 
GGRRGGFSNEGGDGERPRRAFERRSGTGRGSDFKRDGSGRGNWGTPGEEIAAETEAVAGV 
ETEKDVGEKPAVDDVAADANKEDTVVEEKEPEDKEMTLDEYEKILEEKKKALQSLTTSER 
KVDTKVFESMQQLSNKKSNDEIFIKLGSDKDKRKDDKEEKAKKAVSINEFLKPAEGGNYY 
RGGRGGRGRGGRGRGGVSSGESGGYRNEAAPAIGDAAQFPSLGGK*
>AT4G16830.3 |  nuclear RNA-binding protein (RGGA) 
MMMLRIQASSLLPSRRLISPRNLDRFRACLLSQLLSFHRSHFLLLKPVREARSDAPRGGG 
GRGGFNRGRGGYNRDDGNNGYSGGYTKPSGEGDVSKSSYERRGGGGAPRGSFRGEGGGPG 
GGRRGGFSNEGGDGERPRRAFERRSGTGRGSDFKRDGSGRGNWGTPGEEIAAETEAVAGV 
ETEKDVGEKPAVDDVAADANKEDTVVEEKEPEDKEMTLDEYEKILEEKKKALQSLTTSER 
KVDTKVFESMQQLSNKKSNDEIFIKLGSDKDKRKDDKEEKAKKAVSINEFLKPAEGGNYY 
RGGRGGRGRGGRGRGGVSSGESGGYRNEAAPAIGDAAQFPSLGGK*
>AT4G39160.1 |  DNA binding / transcription factor 
MDLDFDDQPSDHAAPAVRAGARFKPKGRPQPKKKQVSLSTTQTTLSPDVAQEKLSTQSED 
LVPLDGSSEIPSNALPSETNVPDSGSINKSTIGTLSEENEDAFPRGVHWSVKPSILRACN 
NVNLVGNRRDDGIEATTSFPDDPRTQDSAIFGDYVTPETGADEGRVDMETLDIVQEEGTT 
SSYVQHTGKLQPKPRLLETVVEEPEPHYSAGDTGYFPMGTNESEFMANVESRNGFSTYED 
LQEEELNIPEAPRETVGEMEAQNASGGWEQEEQGVSPCINNTVTGEEENCMGNTVEEQSK 
RESKTGKSKRATSRKRKKTSEEPNKSSEKTEQKKFKHSSRRQKRTLEKELLETPDHEIRS 
LPLRDMLRLVEYKEWMQKKEAKGAGVQPSQESNNMNGSGSQYHSQGFDEEDEFGDFGIES 
SEYQENNVVKPDSPVNYQTYMNKTSRTRWSKEDTELFYEGIQEFGSNLSMIQQLFPERTR 
EQMKLKFKLEERRNPLKLNDALSSRSKHFTHFKNVIKKLQQEAAAAKEGEEEEEAGAEAE 
TTDVPENEEPEKSEETERASDGVAAGVKESDGGDVENGVRSDGGDECDDDEDFWNSYKSD 
M*
>AT1G53530.1 |  signal peptidase I family protein 
MRMTFLSYLKQWRGTAKEAFENVSIVAKFLCLLHVTDRYIISTTHVHGPSMLPTLNLTGD 
VILAEHLSHRFGKIGLGDVVLVRSPRDPKRMVTKRILGLEGDRLTFSADPLVGDASVSVL 
VPKGHVWIQGDNLYASTDSRHFGPVPYSLIEGKALLRVWPPEYFGSLR*
>AT1G53530.1 |  signal peptidase I family protein 
MRMTFLSYLKQWRGTAKEAFENVSIVAKFLCLLHVTDRYIISTTHVHGPSMLPTLNLTGD 
VILAEHLSHRFGKIGLGDVVLVRSPRDPKRMVTKRILGLEGDRLTFSADPLVGDASVSVL 
VPKGHVWIQGDNLYASTDSRHFGPVPYSLIEGKALLRVWPPEYFGSLR*
>AT1G53530.2 |  signal peptidase I family protein 
MLPTLNLTGDVILAEHLSHRFGKIGLGDVVLVRSPRDPKRMVTKRILGLEGDRLTFSADP 
LVGDASVSVLVPKGHVWIQGDNLYASTDSRHFGPVPYSLIEGKALLRVWPPEYFGSLR*
>AT1G53530.2 |  signal peptidase I family protein 
MLPTLNLTGDVILAEHLSHRFGKIGLGDVVLVRSPRDPKRMVTKRILGLEGDRLTFSADP 
LVGDASVSVLVPKGHVWIQGDNLYASTDSRHFGPVPYSLIEGKALLRVWPPEYFGSLR*
>AT2G05170.1 |  ATVPS11 binding / protein binding / transporter/ zinc ion binding 
MYQLRKFDFFEEKYGGKIPEDVTGDIQCCSSGRGKVVIGSNDGSVSFLDRGVKFDSGFQA 
HSSSVLFLQHLKQRNFLVTVGEDEQISPQQSGMCLKVFDLDKVQEEGTSSSAPECIGILR 
IFTNQFPEAKITSFLVLEEVPPILLIAIGLDNGCIYCVKGDIARERITRFKLQVDGRSAI 
TGLGFRMDGQALLLFAVTPESVNLFSMQAQPPKLQTLDHIGGSVNTVTMSDRSELIVGRP 
EAVYFYEVDGRGPCWAFEGEKKFMGWFRGYLLCVIDDSKTGNTVFNVYDLRNRLIAYSIV 
VDKVSNMLCEWGNIILIKADKSLLCITEKDMESKLDMLFKKNLYTVAINLVQSQHADAAA 
TANVMRKYGDHLYGKQDFDEAMLQYINTIGYLEPSFVIQKFLDAQRIYNLTNYLEKLHEK 
GLASKDHTTLLLNCYTKLKDVEKLNTFIRKEDGIGELKFDVETAIRVCRAANYHEHAMYV 
AKKAGKHEWYLKILLEDLGNYDEALQYVSSLEPSQAGVTIEQYGKILIEHKPKETIDILM 
RLCTEQGIPNGVFLSMLPSPVDFITVFVQHPHSLMHFLERYAEIVQDSPAQAEINNTLLE 
LYLSRDLNFPSISLSENGLDKDLIDHSVAAAVSKADPEKKTNADSKDAMEKDCTERQQKG 
LELLKMAWPSDLEQPLYDVDLAVILCEMNSFKDGLLYLYEKMKFYKEVIACYMQNHDHEG 
LIACCKRLGDSSKGGDPSLWADLLKYFGEIGEDCTKEVKEVLTYIERDDILPPIIVLQTL 
AKNPCLTLSVIKDYIARKLEQESKIIEEDRRAVEKYQETTKNMRKEIEDLRTNARIFQLS 
KCTACTFTLDIPAVHFMCMHSFHQRCLGDNEKECPECAPEYRSVMEMKRSLEQNSKDQDL 
FFQQVKGSKDGFSVIAEYFGKGIISKTRDATS*
>AT2G28060.1 |  protein kinase-related 
MNSQNPDDHEDTTVVGFEVPVSPVSSYNNVYSSTEDETRDPPAVPPHLQHSLLGNQGSME 
LAYAPQNVVLNHLYIENRDAPRSVVALGFSHRFRTKFVTVVIYKPVQRRGSANV*
>AT2G36930.1 |  zinc finger (C2H2 type) family protein 
MGRCPTRKVKKRRLSHKTARRDKFEVKGDDLVYTELRKPETEIKPLQLDEDLPGMGQFYC 
LHCDRYFSNVSVRDDHFKTKKHKKRVNMMMGQAPHSQLDADLAGGMGMPDNGPKLMSNLV 
FTELRKPETEDLPGMGQFNCLLCHRNFSNASVMDYHFKTKKHKKRVKKIERPAPHSQLDA 
DLAGGMGMPDNGPKLMSA*
>AT4G12620.1 |  ORC1B (ORIGIN OF REPLICATION COMPLEX 1B) DNA binding / double-stranded methylated DNA binding / protein binding 
MASTPRAKTFKSPTKTPSNIYRKSYLSPSSTSHTPQTPETHTPLRRSARHVSRKIDLGND 
PIDAPGNDPIEGMNLIRKRERAPRKPTTDVVPSKSKKTETPKKKKKIDSFTPVSPIRSET 
IKKTKKKKRVYYNKVEFDETEFEIGDDVYVKRREDSNSDEEEDPEIEDCQICFKSDTNIM 
IECDDCLGGFHLKCLKPPLKEVPEGDWICQFCEVKKSGQSQTLDLPKPPEGKKLARTMRE 
KLLSGDLWAARIDKLWKEVDDGVYWIRARWYMIPEETVSGRQPHNLKRELYLTNDFADIE 
MECILRHCSVKCPKEFSKASNDGDDVFLCEYEYDVHWRSFKRLAELADGDSDSDQEWNGR 
KEEEVDDSDEEMELDDEVLKSKRGGLTSARGGANSRKGRFFGVEKVGMKLIPEHVRCHKQ 
SELEKAKATLLLATRPKSLPCRSKEMEEITSFIKGSISDDQCLGRCMYIHGVPGTGKTIS 
VLSVMKNLKAEVEEGSVSPYCFVEINGLKLASPENIYSVIYEALSGHRVGWKKALQCLNE 
RFAEGKRIGKEDEKPCILLIDELDLLVTRNQSVLYNILDWPTKPNSKLVVLGIANTMDLP 
EKLLPRISSRMGIQRLCFGPYNHTQLQEIISTRLNGIDAFEKTAIEFASRKVAAISGDAR 
RALEICRRAAEVADHRLNTNKSAKNQLVIMADVEAAIQEMFQAPHIQVMKSVSKLSKIFL 
TAMVHELYKTGMAETTFDRVATTVSSICLTNGEAFPGWDILLKIGCDLGECRIILCEPGE 
KHRLQKLQLNFPSDDVAFALKDNKDLPWLANYL*
>AT4G21480.1 |  carbohydrate transmembrane transporter/ sugarhydrogen symporter 
MPSVGIVIGDGKKEYPGKLTLYVTVTCIVAAMGGLIFGYDIGISGGVTTMDSFQQKFFPS 
VYEKQKKDHDSNQYCRFDSVSLTLFTSSLYLAALCSSLVASYVTRQFGRKISMLLGGVLF 
CAGALLNGFATAVWMLIVGRLLLGFGIGFTNQSVPLYLSEMAPYKYRGALNIGFQLSITI 
GILVANVLNFFFSKISWGWRLSLGGAVVPALIITVGSLILPDTPNSMIERGQFRLAEAKL 
RKIRGVDDIDDEINDLIIASEASKLVEHPWRNLLQRKYRPHLTMAILIPAFQQLTGINVI 
MFYAPVLFQTIGFGSDAALISAVVTGLVNVGATVVSIYGVDKWGRRFLFLEGGFQMLISQ 
VAVAAAIGAKFGVDGTPGVLPKWYAIVVVLFICIYVAAFAWSWGPLGWLVPSEIFPLEIR 
SAAQSITVSVNMIFTFLIAQVFLMMLCHLKFGLFIFFAFFVVVMSIFVYLFLPETRGVPI 
EEMNRVWRSHWYWSKFVDARRI*
>AT4G22330.1 |  ATCES1 catalytic/ hydrolase acting on carbon-nitrogen (but not peptide) bonds in linear amides 
MADGISSFWGPVTSTIECCEMNYAYSSYIAEFYNTISNVPGILLALIGLVNALRQRFEKR 
FSILHISNMILAIGSMLYHATLQHVQQQSDETPMVWEILLYMYILYSPDWHYRSTMPTFL 
FLYGAAFAIVHAYLRFGIGFKVHYVILCLLCIPRMYKYYIHTEDTAAKRIAKWYVATILV 
GSICWFCDRVFCKTISQWPVNPQGHALWHVFMSFNSYCANTFLMFCRAQQRGWNPKVKYF 
LGVLPYVKIEKPKTQ*
>AT4G29140.1 |  MATE efflux protein-related 
MCNPSTTTTTTGSENQESRTGLFLDLFSINSFEPTKRNLRHCENRGSPLMAEAVTEAKSL 
FTLAFPIAVTALVLYLRSAVSMFFLGQLGDLELAAGSLAIAFANITGYSVLSGLALGMEP 
LCSQAFGAHRFKLLSLTLHRTVVFLLVCCVPISVLWFNVGKISVYLHQDPDIAKLAQTYL 
IFSLPDLLTNTLLHPIRIYLRAQGIIHPVTLASLSGAVFHLPANLFLVSYLRLGLTGVAV 
ASSITNIFVVAFLVCYVWASGLHAPTWTDPTRDCFRGWAPLLRLAGPSCVSVCLEWWWYE 
IMIVLCGLLVNPRSTVAAMGVLIQTTSFLYVFPSSLSFAVSTRVGNELGANRPKTAKLTA 
TVAIVFAAVTGIIAAAFAYSVRNAWGRIFTGDKEILQLTAAALPILGLCEIGNCPQTVGC 
GVVRGTARPSTAANVNLGAFYLVGMPVAVGLGFWAGIGFNGLWVGLLAAQISCAGLMMYV 
VGTTDWESEAKKAQTLTCAETVENDIIKAVVASTIDGECDEAEPLIRITVLY*
>AT4G33070.1 |  pyruvate decarboxylase putative 
MDTKIGSIDDCKPTNGDVCSPTNGTVATIHNSVPSSAITINYCDATLGRHLARRLVQAGV 
TDVFSVPGDFNLTLLDHLMAEPDLNLIGCCNELNAGYAADGYARSRGVGACVVTFTVGGL 
SVLNAIAGAYSENLPLICIVGGPNSNDYGTNRILHHTIGLPDFSQELRCFQTVTCYQAVV 
NNLDDAHEQIDKAISTALKESKPVYISVSCNLAAIPHHTFSRDPVPFSLAPRLSNKMGLE 
AAVEATLEFLNKAVKPVMVGGPKLRVAKACDAFVELADASGYALAMMPSAKGFVPEHHPH 
FIGTYWGAVSTPFCSEIVESADAYIFAGPIFNDYSSVGYSLLLKKEKAIVVQPDRITVAN 
GPTFGCILMSDFFRELSKRVKRNETAYENYHRIFVPEGKPLKCESREPLRVNTMFQHIQK 
MLSSETAVIAETGDSWFNCQKLKLPKGCGYEFQMQYGSIGWSVGATLGYAQASPEKRVLA 
FIGDGSFQVTVQDISTMLRNGQKTIIFLINNGGYTIEVEIHDGPYNVIKNWNYTGLVDAI 
HNGEGNCWTAKVRYEEELVEAITTATTEKKDCLCFIEVILHKDDTSKELLEWGSRVSAAN 
SRPPNPQ*
>AT4G37880.1 |  protein binding / zinc ion binding 
MELKSIKDAFDRVATKQKLSYSKTNEIVHMLSQEIDKALSILEETPSSDTMLLDHRSILA 
DVKKVFMEIAPITQLEATEKELHAALTKYPKVLEKQLNPDISKAYRHNVEFDTHIVNQII 
ANFFYRQGMFDIGDCFVAETGESECSTRQSFVEMYRILEAMKRRDLEPALNWAVSNSDKL 
KEARSDLEMKLHSLHFLEIARGKNSKEAIDYARKHIATFADSCLPEIQKLMCSLLWNRKL 
DKSPYSEFLSPALWNNAVKELTRQYCNLLGESSESPLSITVTAGTQALPVLLKYMNVVMA 
NKKLDWQTMEQLPVDAQLSEEFQFHSVFVCPVSKEQSSDDNPPMMMSCGHVLCKQTINKM 
SKNGSKSSFKCPYCPTDVDISRCRQLHF*
>AT1G13950.1 |  ELF5A-1 (EUKARYOTIC ELONGATION FACTOR 5A-1) translation initiation factor 
MSDEEHHFESSDAGASKTYPQQAGTIRKNGYIVIKNRPCKVVEVSTSKTGKHGHAKCHFV 
AIDIFTSKKLEDIVPSSHNCDVPHVNRTDYQLIDISEDGYVSLLTDNGSTKDDLKLPNDD 
TLLQQIKSGFDDGKDLVVSVMSAMGEEQINALKDIGPK*
>AT1G33040.1 |  NACA5 (NASCENT POLYPEPTIDE-ASSOCIATED COMPLEX SUBUNIT ALPHA-LIKE PROTEIN 5) 
MPGAIVEEEKSQIESIKEQLKLEKEDDVVVEDVKDGEEEDDDEDDEDVEVEGEGGNENAK 
QSRSEKKSRKAVLKLGMKPVSDVSRVTIKRAKNVLFVISKPDVYKSPNAETYVIFGEAKV 
DDLSSQLQTQAAQRFKMPDVTSMLPNAGSEATMAPLAEEEDEDDVDDTGVEARDIDLVMT 
QAGVSKAKAVSALKANDGDIVSAIMELTT*
>AT1G52500.2 |  ATMMH-1 (ARABIDOPSIS THALIANA MUTM HOMOLOG-1) DNA N-glycosylase 
MPELPEVEAARRAIEENCLGKKIKRVIIADDNKVIHGISPSDFQTSILGKTIISARRKGK 
NLWLELDSPPFPSFQFGMAGAIYIKGVAVTKYKRSAVKDSEEWPSKYSKFFVELDDGLEL 
SFTDKRRFAKVRLLANPTSVSPISELGPDALLEPMTVDEFAESLAKKKITIKPLLLDQGY 
ISGIGNWIADEVLYQARIHPLQTASSLSKEQCEALHTSIKEVIEKAVEVDADSSQFPSYW 
IFHNREKKPGKAFVDGKKIDFITAGGRTTAYVPELQKLYGKDAEKAAKVRPAKRGVKPKE 
DDGDGEEDEQETEKEDESAKSKKGQKPRGGRGKKPASKTKTEESDDDGDDSEAEEEVVKP 
KGRGTKPAIKRKSEEKATSQAGKKPKGRKS*
>AT1G52500.2 |  ATMMH-1 (ARABIDOPSIS THALIANA MUTM HOMOLOG-1) DNA N-glycosylase 
MPELPEVEAARRAIEENCLGKKIKRVIIADDNKVIHGISPSDFQTSILGKTIISARRKGK 
NLWLELDSPPFPSFQFGMAGAIYIKGVAVTKYKRSAVKDSEEWPSKYSKFFVELDDGLEL 
SFTDKRRFAKVRLLANPTSVSPISELGPDALLEPMTVDEFAESLAKKKITIKPLLLDQGY 
ISGIGNWIADEVLYQARIHPLQTASSLSKEQCEALHTSIKEVIEKAVEVDADSSQFPSYW 
IFHNREKKPGKAFVDGKKIDFITAGGRTTAYVPELQKLYGKDAEKAAKVRPAKRGVKPKE 
DDGDGEEDEQETEKEDESAKSKKGQKPRGGRGKKPASKTKTEESDDDGDDSEAEEEVVKP 
KGRGTKPAIKRKSEEKATSQAGKKPKGRKS*
>AT1G52500.1 |  ATMMH-1 (ARABIDOPSIS THALIANA MUTM HOMOLOG-1) DNA N-glycosylase 
MPELPEVEAARRAIEENCLGKKIKRVIIADDNKVIHGISPSDFQTSILGKTIISARRKGK 
NLWLELDSPPFPSFQFGMAGAIYIKGVAVTKYKRSAVKDSEEWPSKYSKFFVELDDGLEL 
SFTDKRRFAKVRLLANPTSVSPISELGPDALLEPMTVDEFAESLAKKKITIKPLLLDQGY 
ISGIGNWIADEVLYQARIHPLQTASSLSKEQCEALHTSIKEVIQHAVQVNADSKEFPVEW 
LFHFRWGKKAGKVNGKLSHHLSINLMKQNLGFCR*
>AT1G52500.1 |  ATMMH-1 (ARABIDOPSIS THALIANA MUTM HOMOLOG-1) DNA N-glycosylase 
MPELPEVEAARRAIEENCLGKKIKRVIIADDNKVIHGISPSDFQTSILGKTIISARRKGK 
NLWLELDSPPFPSFQFGMAGAIYIKGVAVTKYKRSAVKDSEEWPSKYSKFFVELDDGLEL 
SFTDKRRFAKVRLLANPTSVSPISELGPDALLEPMTVDEFAESLAKKKITIKPLLLDQGY 
ISGIGNWIADEVLYQARIHPLQTASSLSKEQCEALHTSIKEVIQHAVQVNADSKEFPVEW 
LFHFRWGKKAGKVNGKLSHHLSINLMKQNLGFCR*
>AT3G24010.1 |  ING1 (INHIBITOR OF GROWTH 1) DNA binding / methylated histone residue binding 
MSFAEEFEANLVSLAHVLQKKYALLRDLDKSLQENQRQNEQRCEKEIEDIRRGRAGNITP 
NTSLTKFSEEALDEQKHSVRIADEKVTLAMQAYDLVDMHVQQLDQYMKKSDEVIRKEKEA 
AAATLELENNGKAGNAGEGGRGGRKKTRLATAASTAAASTGMTSSNMDLDLPVDPNEPTY 
CICNQVSFGEMVACDNNACKIEWFHFGCVGLKEQPKGKWYCPECATVKKSRKGR*
>AT3G53030.1 |  SRPK4 (Ser/Arg-rich protein kinase 4) kinase/ protein kinase 
MEAEKWNSDGGEYTSEDEGTEDYRRGGYHAVRIGDSFKTGRYVVQSKLGWGHFSTVWLSW 
DTQSSRYVALKVQKSAQHYTEAAMDEITILQQIAEGDTDDTKCVVKLLDHFKHSGPNGQH 
VCMVFEYLGDNLLTLIKYSDYRGLPIPMVKEICYHMLVGLDYLHKQLSIIHTDLKPENVL 
LPSTIDPSKDPRKSGAPLVLPTDKDNTVVDSNGDFVKNQKTGSHRKAKLSAQGHAENKGN 
TESDKVRGVGSPVNGKQCAAEKSVEEDCPSTSDAIELDGSEKGKQGGKKGSRSSRRHLVA 
SADLKCKLVDFGNACWTYKQFTSDIQTRQYRCPEVILGSKYSTSADLWSFACICFELVTG 
DVLFDPHSGDNYDRDEDHLALMMELLGMMPRKIALGGRYSRDFFNRHGDLRHIRRLRFWP 
MNKVLTEKYEFSEQDANDLSDFLVSILDFVPEKRPTAAQCLLHPWINSGPRSIKPSLKDE 
NSDKLDTEKNKRENEEQEAVEVKMGNVVISSLDSKPGMSQSSSTLKLAI*
>AT4G19880.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal (InterProIPR004046) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1635 Blast hits to 1635 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 158 Plants - 57 Viruses - 0 Other Eukaryotes - 480 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVYAVHFKCNKKLIREYPNLFN 
YTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPFGIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1621 Blast hits to 1621 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 155 Plants - 57 Viruses - 0 Other Eukaryotes - 469 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVYAVHFKCNKKLIREYPNLFN 
YTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPFGIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal (InterProIPR004046) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1635 Blast hits to 1635 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 158 Plants - 57 Viruses - 0 Other Eukaryotes - 480 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSSYFQSKKKKYTICERISNV 
ETLIQVYAVHFKCNKKLIREYPNLFNYTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPF 
GIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1621 Blast hits to 1621 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 155 Plants - 57 Viruses - 0 Other Eukaryotes - 469 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSSYFQSKKKKYTICERISNV 
ETLIQVYAVHFKCNKKLIREYPNLFNYTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPF 
GIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal (InterProIPR004046) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1635 Blast hits to 1635 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 158 Plants - 57 Viruses - 0 Other Eukaryotes - 480 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSLCSPLQMQQETHKGVSEFV 
QLHERHFPDPRYE*
>AT4G19880.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1621 Blast hits to 1621 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 155 Plants - 57 Viruses - 0 Other Eukaryotes - 469 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSLCSPLQMQQETHKGVSEFV 
QLHERHFPDPRYE*
>AT4G19880.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1567 Blast hits to 1567 proteins in 487 species Archae - 12 Bacteria - 899 Metazoa - 23 Fungi - 155 Plants - 55 Viruses - 0 Other Eukaryotes - 423 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVYAVHFKCNKKLIREYPNLFN 
YTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPFGIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1567 Blast hits to 1567 proteins in 487 species Archae - 12 Bacteria - 899 Metazoa - 23 Fungi - 155 Plants - 55 Viruses - 0 Other Eukaryotes - 423 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSSYFQSKKKKYTICERISNV 
ETLIQVYAVHFKCNKKLIREYPNLFNYTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPF 
GIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1567 Blast hits to 1567 proteins in 487 species Archae - 12 Bacteria - 899 Metazoa - 23 Fungi - 155 Plants - 55 Viruses - 0 Other Eukaryotes - 423 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSLCSPLQMQQETHKGVSEFV 
QLHERHFPDPRYE*
>AT3G45130.1 |  LAS1 lanosterol synthase 
MWRLKLSEGDEESVNQHVGRQFWEYDNQFGTSEERHHINHLRSNFTLNRFSSKHSSDLLY 
RFQCWKEKGKGMERLPQVKVKEGEERLINEEVVNVTLRRSLRFYSILQSQDGFWPGDYGG 
PLFLLPALVIGLYVTEVLDGTLTAQHQIEIRRYLYNHQNKDGGWGLHVEGNSTMFCTVLS 
YVALRLMGEELDGGDGAMESARSWIHHHGGATFIPSWGKFWLSVLGAYEWSGNNPLPPEL 
WLLPYSLPFHPGRMWCHCRMVYLPMSYLYGRRFVCRTNGTILSLRRELYTIPYHHIDWDT 
ARNQCAKEDLYYPHPKIQDVLWSCLNKFGEPLLERWPLNNLRNHALQTVMQHIHYEDQNS 
HYICIGPVNKVLNMLCCWVESSNSEAFKSHLSRIKDYLWVAEDGMKMQGYNGSQLWDVTL 
AVQAILATNLVDDYGLMLKKAHNYIKNTQIRKDTSGDPGLWYRHPCKGGWGFSTGDNPWP 
VSDCTAEALKAALLLSQMPVNLVGEPMPEEHLVDAVNFILSLQNKNGGFASYELTRSYPE 
LEVINPSETFGDIIIDYQYVECTSAAIQGLVLFTTLNSSYKRKEIVGSINKAVEFIEKTQ 
LPDGSWYGSWGVCFTYATWFGIKGMLASGKTYESSLCIRKACGFLLSKQLCCGGWGESYL 
SCQNKVYTNLPGNKSHIVNTSWALLALIEAGQASRDPMPLHRGAKSLINSQMEDGDYPQQ 
EILGVFNRNCMISYSAYRNIFPIWALGEYRKLMLSL*
>AT5G42600.1 |  MRN1 (MARNERAL SYNTHASE) catalytic/ marneral synthase 
MWRLRIGAEARQDPHLFTTNNFAGRQIWEFDANGGSPEELAEVEEARLNFANNKSRFKAS 
PDLFWRRQFLREKKFEQKIPRVRIEDAEKITYEDAKTALRRGVLYYAACQANDGHWPSEV 
SGSMFLDAPFVICLYITGHLEKIFTLEHVKELLRYMYNTQNEDGGWGLDVESHSVMFCTV 
LNYICLRILGVEPDHDGQKSACARARKWILDHGGATYAPMVAKAWLSVLGVYDWSGCKPL 
PPEIWMLPSFSPINGGTLWIYIRDLLMGMSYLYGKKFVATPTALILQLREELYPQPYSKI 
IWSKARNRCAKEDLLYPKSFGQDLFWEGVHMLSENIINRWPLNKFVRQRALRTTMELVHY 
HDETTHYITGACVAKPFHMLACWVEDPDGDYFKKHLARVPDFIWIAEDGLKFQLMGMQSW 
NAALSLQVMLAANMDDEIRSTLIKGYDFLKQSQISENPQGDHLKMFRDITKGGWTFQDRE 
QGLPISDGTAESIECCIHFHRMPSEFIGEKMDVEKLYDAVNFLIYLQSDNGGMPVWEPAP 
GKKWLEWLSPVEHVENTVVEQEYLECTGSVIAGLVCFKKEFPDHRPKEIEKLIKKGLKYI 
EDLQMPDGSWYGNWGVCFTYGTLFAVRGLAAAGKTFGNSEAIRRAVQFILNTQNAEGGWG 
ESALSCPNKKYIPSKGNVTNVVNTGQAMMVLLIGGQMERDPSPVHRAAKVLINSQLDIGD 
FPQQERRGIYMNMLLHYPTYRNMFSLWALALYTNALRLLVS*
>AT4G15370.1 |  BARS1 (BARUOL SYNTHASE 1) baruol synthase/ catalytic 
MWRLRIGAKAKDNTHLFTTNNYVGRQIWEFDANAGSPEELAEVEEARRNFSNNRSRFKAS 
ADLLWRMQFLREKKFEQKIPRVIVEDAEKITYEDAKTALRRGLLYFTALQADDGHWPAEN 
AGSIFFNAPFVICLYITGHLEKIFTHEHRVELLRYMYNHQNEDGGWGLHVESPSNMFCSV 
INYICLRILGVEAGHDDKGSACARARKWILDHGGATYSPLIGKAWLSVLGVYDWSGCKPI 
PPEFWFLPSFFPVNGGTLWIYLRDIFMGLSYLYGKNFVATSTPLILQLREEIYPEPYTNI 
SWRQARNRCAKEDLYYPQSFLQDLFWKGVHVFSENILNRWPFNNLIRQRALRTTMELVHY 
HDEATRYITGGSVPKVIAVFHMLACWVEDPESDYFKKHLARVPDFIWIGEDGLKIQSFGS 
QVWDTALSLHVFIDGFDDDVDEEIRSTLLKGYDYLEKSQVTENPPGDYMKMFRHMAKGGW 
TFSDQDQGWPVSDCTAESLECCLFFESMSSEFIGKKMDVEKLYDAVDFLLYLQSDNGGIT 
AWQPADGKLVEFIEDAVVEHEYVECTGSAIVALAQFNKQFPGYKKEEVERFITKGVKYIE 
DLQMVDGSWYGNWGVCFIYGTFFAVRGLVAAGKCYNNCEAIRRAVRFILDTQNTEGGWGE 
SYLSCPRKKYIPLIGNKTNVVNTGQALMVLIMGNQMKRDPLPVHRAAKVLINSQMDNGDF 
PQQEIMGVFKMNVMLHFPTYRNMFTLWALTHYTKALRGL*
>AT1G66960.1 |  lupeol synthase putative / 23-oxidosqualene-triterpenoid cyclase putative 
MWRLKVGEGKGKDPYLFSSNNFVGRQTWEFDPKAGTREERTAVEEARRSFFDNRSRVKPS 
SDLLWKMQFLKEAKFEQVIPPVKIDGGEAITYEKATNALRRGVAFLSALQASDGHWPGEF 
TGPLCMLPPLVFCLYITGHLEEVFDAEHRKEMLRYIYCHQNEDGGWGFHIESKSIMFTTT 
LNYICLRILGVGPDGGLENACKRARQWILSHGGVIYIPCWGKVWLSVLGIYDWSGVNPMP 
PEIWLLPYFLPIHLGKAFSYTRITYMPISYLYGKKFVGQITPLIMQLREELHLQPYEEIN 
WNKARHLCAKEDKYYPHPLVQDLIWDALHTFVEPLLASWPINKLVRKKALQVAMKHIHYE 
DENSHYITIGCIEKNLCMLACWIDNPDGNHFKKHLSRIPDMMWVAEDGMKMQCFGSQLWM 
TGFAVQALLASDPRDETYDVLRRAHDYIKKSQVRDNPSGDFKSMYRHISKGGWTLSDRDH 
GWQVSDCTAEAAKCCMLLSTMPTDITGEKINLEQLYDSVNLMLSLQSENGGFTAWEPVRA 
YKWMELMNPTDLFANAMTEREYTECTSAVLQALVIFNQLYPDHRTKEITKSIEKAVQFIE 
SKQLRDGSWYGSWGICFTYGTWFALCGLAAIGKTYNNCLSMRDGVHFLLNIQNEDGGWGE 
SYMSCPEQRYIPLEGNRSNVVQTAWAMMALIHAGQAKRDLIPLHSAAKFIITSQLENGDF 
PQQELLGASMSTCMLHYSTYKDIFPPWALAEYRKAAFIHHADL*
>AT1G78500.1 |  pentacyclic triterpene synthase putative 
MWRLKIGAKGGDETHLFTTNNYTGRQTWEFDADACSPEELAEVDEARQNFSINRSRFKIS 
ADLLWRMQFLREKKFEQKIPRVEIGDAENITYKDAKTALRRGILYFKALQAEDGHWPAEN 
SGCLFFEAPFVICLYITGHLEKILTLEHRKELLRYMYNHQNEDGGWGIHVEGQSAMFCTV 
INYICLRILGVEADLDDIKGSGCARARKWILDHGGATYTPLIGKAWLSILGVYDWSGCKP 
IPPEVWMLPTFSPFNGGTLWIYFRDIFMGVSYLYGKKFVATPTPLILQLREELYPQPYDK 
ILWSQARNQCAKEDLYYPQSFLQEMFWKCVHILSENILNRWPCNKLIRQKALRTTMELLH 
YQDEASRYFTGGCVPKPFHMLACWVEDPDGDYFKKHLARVPDYIWIGEDGLKIQSFGSQL 
WDTAFSLQVMLAYQDVDDDDDEIRSTLIKGYSFLNKSQLTQNPPGDHRKMLKDIAKGGWT 
FSDQDQGWPVSDCTAESLECCLVFGSMPSELIGEKMDVERLYDAVNLLLYFQSKNGGITV 
WEAARGRTWLEWLSPVEFMEDTIVEHEYVECTGSAIVALARFLKEFPEHRREEVEKFIKN 
AVKYIESFQMPDGSWYGNWGVCFMYGTFFAVRGLVAAGKTYQNCEPIRKAVQFILETQNV 
EGGWGESYLSCPNKKYTLLEGNRTNVVNTGQALMVLIMGGQMERDPLPVHRAAKVLINSQ 
LDNGDFPQEEIMGVFKMNVMVHYATYRNIFTLWALTYYTKALRVPLC*
>AT1G78970.2 |  LUP1 (LUPEOL SYNTHASE 1) beta-amyrin synthase/ lupeol synthase 
MWKLKIGKGNGEDPHLFSSNNFVGRQTWKFDHKAGSPEERAAVEEARRGFLDNRFRVKGC 
SDLLWRMQFLREKKFEQGIPQLKATNIEEITYETTTNALRRGVRYFTALQASDGHWPGEI 
TGPLFFLPPLIFCLYITGHLEEVFDAEHRKEMLRHIYCHQNEDGGWGLHIESKSVMFCTV 
LNYICLRMLGENPEQDACKRARQWILDRGGVIFIPSWGKFWLSILGVYDWSGTNPTPPEL 
LMLPSFLPIHPGKILCYSRMVSIPMSYLYGKRFVGPITPLILLLREELYLEPYEEINWKK 
SRRLYAKEDMYYAHPLVQDLLSDTLQNFVEPLLTRWPLNKLVREKALQLTMKHIHYEDEN 
SHYITIGCVEKVLCMLACWVENPNGDYFKKHLARIPDYMWVAEDGMKMQSFGCQLWDTGF 
AIQALLASNLPDETDDALKRGHNYIKASQVRENPSGDFRSMYRHISKGAWTFSDRDHGWQ 
VSDCTAEALKCCLLLSMMSADIVGQKIDDEQLYDSVNLLLSLQSGNGGVNAWEPSRAYKW 
LELLNPTEFMANTMVEREFVECTSSVIQALDLFRKLYPDHRKKEINRSIEKAVQFIQDNQ 
TPDGSWYGNWGVCFIYATWFALGGLAAAGETYNDCLAMRNGVHFLLTTQRDDGGWGESYL 
SCSEQRYIPSEGERSNLVQTSWAMMALIHTGQAERDLIPLHRAAKLIINSQLENGDFPQQ 
EIVGAFMNTCMLHYATYRNTFPLWALAEYRKVVFIVN*
>AT1G78970.2 |  LUP1 (LUPEOL SYNTHASE 1) beta-amyrin synthase/ lupeol synthase 
MWKLKIGKGNGEDPHLFSSNNFVGRQTWKFDHKAGSPEERAAVEEARRGFLDNRFRVKGC 
SDLLWRMQFLREKKFEQGIPQLKATNIEEITYETTTNALRRGVRYFTALQASDGHWPGEI 
TGPLFFLPPLIFCLYITGHLEEVFDAEHRKEMLRHIYCHQNEDGGWGLHIESKSVMFCTV 
LNYICLRMLGENPEQDACKRARQWILDRGGVIFIPSWGKFWLSILGVYDWSGTNPTPPEL 
LMLPSFLPIHPGKILCYSRMVSIPMSYLYGKRFVGPITPLILLLREELYLEPYEEINWKK 
SRRLYAKEDMYYAHPLVQDLLSDTLQNFVEPLLTRWPLNKLVREKALQLTMKHIHYEDEN 
SHYITIGCVEKVLCMLACWVENPNGDYFKKHLARIPDYMWVAEDGMKMQSFGCQLWDTGF 
AIQALLASNLPDETDDALKRGHNYIKASQVRENPSGDFRSMYRHISKGAWTFSDRDHGWQ 
VSDCTAEALKCCLLLSMMSADIVGQKIDDEQLYDSVNLLLSLQSGNGGVNAWEPSRAYKW 
LELLNPTEFMANTMVEREFVECTSSVIQALDLFRKLYPDHRKKEINRSIEKAVQFIQDNQ 
TPDGSWYGNWGVCFIYATWFALGGLAAAGETYNDCLAMRNGVHFLLTTQRDDGGWGESYL 
SCSEQRYIPSEGERSNLVQTSWAMMALIHTGQAERDLIPLHRAAKLIINSQLENGDFPQQ 
EIVGAFMNTCMLHYATYRNTFPLWALAEYRKVVFIVN*
>AT1G78970.1 |  LUP1 (LUPEOL SYNTHASE 1) beta-amyrin synthase/ lupeol synthase 
MWKLKIGKGNGEDPHLFSSNNFVGRQTWKFDHKAGSPEERAAVEEARRGFLDNRFRVKGC 
SDLLWRMQFLREKKFEQGIPQLKATNIEEITYETTTNALRRGVRYFTALQASDGHWPGEI 
TGPLFFLPPLIFCLYITGHLEEVFDAEHRKEMLRHIYCHQNEDGGWGLHIESKSVMFCTV 
LNYICLRMLGENPEQDACKRARQWILDRGGVIFIPSWGKFWLSILGVYDWSGTNPTPPEL 
LMLPSFLPIHPGKILCYSRMVSIPMSYLYGKRFVGPITPLILLLREELYLEPYEEINWKK 
SRRLYAKEDMYYAHPLVQDLLSDTLQNFVEPLLTRWPLNKLVREKALQLTMKHIHYEDEN 
SHYITIGCVEKVLCMLACWVENPNGDYFKKHLARIPDYMWVAEDGMKMQSFGCQLWDTGF 
AIQALLASNLPDETDDALKRGHNYIKASQVRENPSGDFRSMYRHISKGAWTFSDRDHGWQ 
VSDCTAEALKCCLLLSMMSADIVGQKIDDEQLYDSVNLLLSLQSGNGGVNAWEPSRAYKW 
LELLNPTEFMANTMVEREFVECTSSVIQALDLFRKLYPDHRKKEINRSIEKAVQFIQDNQ 
TPDGSWYGNWGVCFIYATWFALGGLAAAGETYNDCLAMRNGVHFLLTTQRDDGGWGESYL 
SCSEQRYIPSEGERSNLVQTSWAMMALIHTGQAERDLIPLHRAAKLIINSQLENGDFPQQ 
EIVGAFMNTCMLHYATYRNTFPLWALAEYRKVVFIVN*
>AT1G78970.1 |  LUP1 (LUPEOL SYNTHASE 1) beta-amyrin synthase/ lupeol synthase 
MWKLKIGKGNGEDPHLFSSNNFVGRQTWKFDHKAGSPEERAAVEEARRGFLDNRFRVKGC 
SDLLWRMQFLREKKFEQGIPQLKATNIEEITYETTTNALRRGVRYFTALQASDGHWPGEI 
TGPLFFLPPLIFCLYITGHLEEVFDAEHRKEMLRHIYCHQNEDGGWGLHIESKSVMFCTV 
LNYICLRMLGENPEQDACKRARQWILDRGGVIFIPSWGKFWLSILGVYDWSGTNPTPPEL 
LMLPSFLPIHPGKILCYSRMVSIPMSYLYGKRFVGPITPLILLLREELYLEPYEEINWKK 
SRRLYAKEDMYYAHPLVQDLLSDTLQNFVEPLLTRWPLNKLVREKALQLTMKHIHYEDEN 
SHYITIGCVEKVLCMLACWVENPNGDYFKKHLARIPDYMWVAEDGMKMQSFGCQLWDTGF 
AIQALLASNLPDETDDALKRGHNYIKASQVRENPSGDFRSMYRHISKGAWTFSDRDHGWQ 
VSDCTAEALKCCLLLSMMSADIVGQKIDDEQLYDSVNLLLSLQSGNGGVNAWEPSRAYKW 
LELLNPTEFMANTMVEREFVECTSSVIQALDLFRKLYPDHRKKEINRSIEKAVQFIQDNQ 
TPDGSWYGNWGVCFIYATWFALGGLAAAGETYNDCLAMRNGVHFLLTTQRDDGGWGESYL 
SCSEQRYIPSEGERSNLVQTSWAMMALIHTGQAERDLIPLHRAAKLIINSQLENGDFPQQ 
EIVGAFMNTCMLHYATYRNTFPLWALAEYRKVVFIVN*
>AT4G15340.1 |  ATPEN1 (ARABIDOPSIS THALIANA PENTACYCLIC TRITERPENE SYNTHASE 1) arabidiol synthase 
MWRLRIGAKAGNDTHLFTTNNYVGRQIWEFDANAGSPQELAEVEEARRNFSNNRSHYKAS 
ADLLWRMQFLREKGFEQKIPRVRVEDAAKIRYEDAKTALKRGLHYFTALQADDGHWPADN 
SGPNFFIAPLVICLYITGHLEKIFTVEHRIELIRYMYNHQNEDGGWGLHVESPSIMFCTV 
INYICLRIVGVEAGHDDDQGSTCTKARKWILDHGGATYTPLIGKACLSVLGVYDWSGCKP 
MPPEFWFLPSSFPINGGTLWIYLRDIFMGLSYLYGKKFVATPTPLILQLQEELYPEPYTK 
INWRLTRNRCAKEDLCYPSSFLQDLFWKGVHIFSESILNRWPFNKLIRQAALRTTMKLLH 
YQDEANRYITGGSVPKAFHMLACWVEDPEGEYFKKHLARVSDFIWIGEDGLKIQSFGSQL 
WDTVMSLHFLLDGVEDDVDDEIRSTLVKGYDYLKKSQVTENPPSDHIKMFRHISKGGWTF 
SDKDQGWPVSDCTAESLKCCLLFERMPSEFVGQKMDVEKLFDAVDFLLYLQSDNGGITAW 
EPADGKTWLEWFSPVEFVQDTVIEHEYVECTGSAIVALTQFSKQFPEFRKKEVERFITNG 
VKYIEDLQMKDGSWCGNWGVCFIYGTLFAVRGLVAAGKTFHNCEPIRRAVRFLLDTQNQE 
GGWGESYLSCLRKKYTPLAGNKTNIVSTGQALMVLIMGGQMERDPLPVHRAAKVVINLQL 
DNGDFPQQEVMGVFNMNVLLHYPTYRNIYSLWALTLYTQALRRLQP*
>AT5G36150.1 |  ATPEN3 (putative pentacyclic triterpene synthase 3) catalytic/ lupeol synthase 
MWRLRIGAKAGDDPHLCTTNNFLGRQIWEFDANAGSPAELSEVDQARQNFSNNRSQYKAC 
ADLLWRMQFLREKNFEQKIPRVRIEDAKKITFEDAKNTLRRGIHYMAALQSDDGHWPSEN 
AGCIFFNAPFVICLYITGHLDKVFSEEHRKEMLRYMYNHQNDDGGWGIDVESHSFMFCTV 
INYICLRIFGVDPDHDGESACARARKWIIDHGGATYTPLFGKAWLSVLGVYEWSGCKPIP 
PEFWFFPSYFPINGGTLWIYLRDTFMAMSYLYGKKFVAKPTPLILQLREELYPQPYAEIV 
WSQARSRCAKEDLYYPQSLVQDLFWKLVHMFSENILNRWPFNKLIREKAIRTAMELIHYH 
DEATRYITGGAVPKVFHMLACWVEDPESDYFKKHLARVSHFIWIAEDGLKIQTFGSQIWD 
TAFVLQVMLAADVDDEIRPTLIKGYSYLRKSQFTENPPGDYINMFRDISKGGWGYSDKDQ 
GWPVSDCISESLECCLIFESMSSEFIGEKMEVERLYDAVNMLLYMQSRNGGISIWEAASG 
KKWLEWLSPIEFIEDTILEHEYLECTGSAIVVLARFMKQFPGHRTEEVKKFITKGVKYIE 
SLQIADGSWYGNWGICFIYGTFFAVRGLVAAGNTYDNCEAIRRAVRFLLDIQNGEGGWGE 
SFLSCPNKNYIPLEGNKTDVVNTGQALMVLIMGGQMDRDPLPVHRAAKVLINSQMDNGDF 
PQQEIRGVYKMNVMLNFPTFRNSFTLWALTHYTKAIRLLL*
>AT1G78960.1 |  ATLUP2 beta-amyrin synthase/ lupeol synthase 
MWKLKIGEGNGEDPYLFSSNNFVGRQTWEFDPKAGTPEERAAVEDARRNYLDNRPRVKGC 
SDLLWRMQFLKEAKFEQVIPPVKIDDGEGITYKNATDALRRAVSFYSALQSSDGHWPAEI 
TGTLFFLPPLVFCFYITGHLEKIFDAEHRKEMLRHIYCHQNEDGGWGLHIEGKSVMFCTV 
LNYICLRMLGEGPNGGRNNACKRARQWILDHGGVTYIPSWGKIWLSILGIYDWSGTNPMP 
PEIWLLPSFFPIHLGKTLCYTRMVYMPMSYLYGKRFVGPLTPLIMLLRKELHLQPYEEIN 
WNKARRLCAKEDMIYPHPLVQDLLWDTLHNFVEPILTNWPLKKLVREKALRVAMEHIHYE 
DENSHYITIGCVEKVLCMLACWIENPNGDHFKKHLARIPDFMWVAEDGLKMQSFGSQLWD 
TVFAIQALLACDLSDETDDVLRKGHSFIKKSQVRENPSGDFKSMYRHISKGAWTLSDRDH 
GWQVSDCTAEALKCCMLLSMMPAEVVGQKIDPEQLYDSVNLLLSLQGEKGGLTAWEPVRA 
QEWLELLNPTDFFTCVMAEREYVECTSAVIQALVLFKQLYPDHRTKEIIKSIEKGVQFIE 
SKQTPDGSWHGNWGICFIYATWFALSGLAAAGKTYKSCLAVRKGVDFLLAIQEEDGGWGE 
SHLSCPEQRYIPLEGNRSNLVQTAWAMMGLIHAGQAERDPTPLHRAAKLIITSQLENGDF 
PQQEILGVFMNTCMLHYATYRNIFPLWALAEYRKAAFATHQDL*
>AT1G78950.1 |  beta-amyrin synthase putative 
MWRLKIGEGNGDDPYLFTTNNFAGRQTWEFDPDGGSPEERHSVVEARRIFYDNRFHVKAS 
SDLLWRMQFLREKKFEQRIAPVKVEDSEKVTFETATSALRRGIHFFSALQASDGHWPAEN 
AGPLFFLPPLVFCLYITGHLDEVFTSEHRKEILRYIYCHQKEDGGWGLHIEGHSTMFCTT 
LNYICMRILGESPDGGHDNACGRAREWILSHGGVTYIPSWGKTWLSILGVFDWSGSNPMP 
PEFWILPSFFPVHPAKMWSYCRMVYLPMSYLYGKRFVGPITSLILQLRKELYLQPYEEIN 
WMKVRHLCAKEDTYYPRPLVQELVWDSLYIFAEPFLARWPFNKLLREKALQLAMKHIHYE 
DENSRYITIGCVEKVLCMLACWVEDPNGDYFKKHLSRISDYLWMAEDGMKMQSFGSQLWD 
TGFAMQALLASNLSSEISDVLRRGHEFIKNSQVGENPSGDYKSMYRHISKGAWTFSDRDH 
GWQVSDCTAHGLKCCLLFSMLAPDIVGPKQDPERLHDSVNILLSLQSKNGGMTAWEPAGA 
PKWLELLNPTEMFSDIVIEHEYSECTSSAIQALSLFKQLYPDHRTTEITAFIKKAAEYLE 
NMQTRDGSWYGNWGICFTYGTWFALAGLAAAGKTFNDCEAIRKGVQFLLAAQKDNGGWGE 
SYLSCSKKIYIAQVGEISNVVQTAWALMGLIHSGQAERDPIPLHRAAKLIINSQLESGDF 
PQQQATGVFLKNCTLHYAAYRNIHPLWALAEYRARVSLP*
>AT1G78955.1 |  CAMS1 (Camelliol C synthase 1) beta-amyrin synthase 
MWKLKIANGNKEEPYLFSTNNFLGRQTWEFDPDAGTVEELAAVEEARRKFYDDRFRVKAS 
SDLIWRMQFLKEKKFEQVIPPAKVEDANNITSEIATNALRKGVNFLSALQASDGHWPAEN 
AGPLFFLPPLVFCLYVTGHLHEIFTQDHRREVLRYIYCHQNEDGGWGLHIEGNSTMFCTT 
LNYICMRILGEGPNGGPGNACKRARDWILDHGGATYIPSWGKTWLSILGVFDWSGSNPMP 
PEFWILPSFLPIHPAKMWCYCRLVYMPMSYLYGKRFVGPISPLILQLREEIYLQPYAKIN 
WNRARHLCAKEDAYCPHPQIQDVIWNCLYIFTEPFLACWPFNKLLREKALGVAMKHIHYE 
DENSRYITIGCVEKALCMLACWVEDPNGIHFKKHLLRISDYLWIAEDGMKMQSFGSQLWD 
SGFALQALVASNLVNEIPDVLRRGYDFLKNSQVRENPSGDFTNMYRHISKGSWTFSDRDH 
GWQASDCTAESFKCCLLLSMIPPDIVGPKMDPEQLYEAVTILLSLQSKNGGVTAWEPARG 
QEWLELLNPTEVFADIVVEHEYNECTSSAIQALILFKQLYPNHRTEEINTSIKKAVQYIE 
SIQMLDGSWYGSWGVCFTYSTWFGLGGLAAAGKTYNNCLAMRKGVHFLLTTQKDNGGWGE 
SYLSCPKKRYIPSEGERSNLVQTSWAMMGLLHAGQAERDPSPLHRAAKLLINSQLENGDF 
PQQEITGAFMKNCLLHYAAYRNIFPVWALAEYRRRVPLPYEKPSTERRS*