>AT2G07050.1 |  CAS1 (cycloartenol synthase 1) cycloartenol synthase 
MWKLKIAEGGSPWLRTTNNHVGRQFWEFDPNLGTPEDLAAVEEARKSFSDNRFVQKHSAD 
LLMRLQFSRENLISPVLPQVKIEDTDDVTEEMVETTLKRGLDFYSTIQAHDGHWPGDYGG 
PMFLLPGLIITLSITGALNTVLSEQHKQEMRRYLYNHQNEDGGWGLHIEGPSTMFGSVLN 
YVTLRLLGEGPNDGDGDMEKGRDWILNHGGATNITSWGKMWLSVLGAFEWSGNNPLPPEI 
WLLPYFLPIHPGRMWCHCRMVYLPMSYLYGKRFVGPITSTVLSLRKELFTVPYHEVNWNE 
ARNLCAKEDLYYPHPLVQDILWASLHKIVEPVLMRWPGANLREKAIRTAIEHIHYEDENT 
RYICIGPVNKVLNMLCCWVEDPNSEAFKLHLPRIHDFLWLAEDGMKMQGYNGSQLWDTGF 
AIQAILATNLVEEYGPVLEKAHSFVKNSQVLEDCPGDLNYWYRHISKGAWPFSTADHGWP 
ISDCTAEGLKAALLLSKVPKAIVGEPIDAKRLYEAVNVIISLQNADGGLATYELTRSYPW 
LELINPAETFGDIVIDYPYVECTSAAIQALISFRKLYPGHRKKEVDECIEKAVKFIESIQ 
AADGSWYGSWAVCFTYGTWFGVKGLVAVGKTLKNSPHVAKACEFLLSKQQPSGGWGESYL 
SCQDKVYSNLDGNRSHVVNTAWAMLALIGAGQAEVDRKPLHRAARYLINAQMENGDFPQQ 
EIMGVFNRNCMITYAAYRNIFPIWALGEYRCQVLLQQGE*
>AT4G15370.1 |  BARS1 (BARUOL SYNTHASE 1) baruol synthase/ catalytic 
MWRLRIGAKAKDNTHLFTTNNYVGRQIWEFDANAGSPEELAEVEEARRNFSNNRSRFKAS 
ADLLWRMQFLREKKFEQKIPRVIVEDAEKITYEDAKTALRRGLLYFTALQADDGHWPAEN 
AGSIFFNAPFVICLYITGHLEKIFTHEHRVELLRYMYNHQNEDGGWGLHVESPSNMFCSV 
INYICLRILGVEAGHDDKGSACARARKWILDHGGATYSPLIGKAWLSVLGVYDWSGCKPI 
PPEFWFLPSFFPVNGGTLWIYLRDIFMGLSYLYGKNFVATSTPLILQLREEIYPEPYTNI 
SWRQARNRCAKEDLYYPQSFLQDLFWKGVHVFSENILNRWPFNNLIRQRALRTTMELVHY 
HDEATRYITGGSVPKVIAVFHMLACWVEDPESDYFKKHLARVPDFIWIGEDGLKIQSFGS 
QVWDTALSLHVFIDGFDDDVDEEIRSTLLKGYDYLEKSQVTENPPGDYMKMFRHMAKGGW 
TFSDQDQGWPVSDCTAESLECCLFFESMSSEFIGKKMDVEKLYDAVDFLLYLQSDNGGIT 
AWQPADGKLVEFIEDAVVEHEYVECTGSAIVALAQFNKQFPGYKKEEVERFITKGVKYIE 
DLQMVDGSWYGNWGVCFIYGTFFAVRGLVAAGKCYNNCEAIRRAVRFILDTQNTEGGWGE 
SYLSCPRKKYIPLIGNKTNVVNTGQALMVLIMGNQMKRDPLPVHRAAKVLINSQMDNGDF 
PQQEIMGVFKMNVMLHFPTYRNMFTLWALTHYTKALRGL*
>AT1G15520.1 |  PDR12 (PLEIOTROPIC DRUG RESISTANCE 12) ATPase coupled to transmembrane movement of substances 
MEGTSFHQASNSMRRNSSVWKKDSGREIFSRSSREEDDEEALRWAALEKLPTFDRLRKGI 
LTASHAGGPINEIDIQKLGFQDTKKLLERLIKVGDDEHEKLLWKLKKRIDRVGIDLPTIE 
VRFDHLKVEAEVHVGGRALPTFVNFISNFADKFLNTLHLVPNRKKKFTILNDVSGIVKPG 
RMALLLGPPSSGKTTLLLALAGKLDQELKQTGRVTYNGHGMNEFVPQRTAAYIGQNDVHI 
GEMTVRETFAYAARFQGVGSRYDMLTELARREKEANIKPDPDIDIFMKAMSTAGEKTNVM 
TDYILKILGLEVCADTMVGDDMLRGISGGQKKRVTTGEMLVGPSRALFMDEISTGLDSST 
TYQIVNSLRNYVHIFNGTALISLLQPAPETFNLFDDIILIAEGEIIYEGPRDHVVEFFET 
MGFKCPPRKGVADFLQEVTSKKDQMQYWARRDEPYRFIRVREFAEAFQSFHVGRRIGDEL 
ALPFDKTKSHPAALTTKKYGVGIKELVKTSFSREYLLMKRNSFVYYFKFGQLLVMAFLTM 
TLFFRTEMQKKTEVDGSLYTGALFFILMMLMFNGMSELSMTIAKLPVFYKQRDLLFYPAW 
VYSLPPWLLKIPISFMEAALTTFITYYVIGFDPNVGRLFKQYILLVLMNQMASALFKMVA 
ALGRNMIVANTFGAFAMLVFFALGGVVLSRDDIKKWWIWGYWISPIMYGQNAILANEFFG 
HSWSRAVENSSETLGVTFLKSRGFLPHAYWYWIGTGALLGFVVLFNFGFTLALTFLNSLG 
KPQAVIAEEPASDETELQSARSEGVVEAGANKKRGMVLPFEPHSITFDNVVYSVDMPQEM 
IEQGTQEDRLVLLKGVNGAFRPGVLTALMGVSGAGKTTLMDVLAGRKTGGYIDGNITISG 
YPKNQQTFARISGYCEQTDIHSPHVTVYESLVYSAWLRLPKEVDKNKRKIFIEEVMELVE 
LTPLRQALVGLPGESGLSTEQRKRLTIAVELVANPSIIFMDEPTSGLDARAAAIVMRTVR 
NTVDTGRTVVCTIHQPSIDIFEAFDELFLLKRGGEEIYVGPLGHESTHLINYFESIQGIN 
KITEGYNPATWMLEVSTTSQEAALGVDFAQVYKNSELYKRNKELIKELSQPAPGSKDLYF 
PTQYSQSFLTQCMASLWKQHWSYWRNPPYTAVRFLFTIGIALMFGTMFWDLGGKTKTRQD 
LSNAMGSMYTAVLFLGLQNAASVQPVVNVERTVFYREQAAGMYSAMPYAFAQVFIEIPYV 
LVQAIVYGLIVYAMIGFEWTAVKFFWYLFFMYGSFLTFTFYGMMAVAMTPNHHIASVVSS 
AFYGIWNLFSGFLIPRPSMPVWWEWYYWLCPVAWTLYGLIASQFGDITEPMADSNMSVKQ 
FIREFYGYREGFLGVVAAMNVIFPLLFAVIFAIGIKSFNFQKR*
>AT2G47620.1 |  ATSWI3A (SWITCH/SUCROSE NONFERMENTING 3A) DNA binding 
MEATDPSAEIELYTIPAQSSWFLWDDIHEIERREFAEFFTESSITRTPKVYKEYRDFIIN 
KFREDTCRRLTFTSVRKFLVGDVNLLQKVFLFLEKWGLINFSSSLKKNDHLLSVDNAKIE 
QGTPAGIRVTATPNSLRPITAPPLVEERVETGIKVPPLTSYSDVFSDLKKPDHVLVCAHC 
GERCDSPFYQHNKGIVNICEKCFKNGNYGENNTADDFKLIGNSAAAVWTEEEILLLLESV 
LKHGDDWELISQSVSTKSRLDCISKLIELPFGEFLMGSASGRLNPSILTEDENTEQVQTD 
GQEHEETETREEKEDRVNEDEPPAKRKRVALISEGDSSLMKQVAAMASKVGPSVATAAAK 
AALAALCDEASCPKEIFDTDDYSNFTVDRANGEKDTDMEEQQEEKDGPQGLPVALRIRAS 
VATALGAAAAQAKILADQEEREMEQLAATVIEQQLKKLQSKLKFLDDLESIMDEEEKVIE 
GVKETIIQERVSVLQCAFRSGITKRWDHTYVK*
>AT4G13090.1 |  xyloglucanxyloglucosyl transferase putative / xyloglucan endotransglycosylase putative / endo-xyloglucan transferase putative 
MNRIRYCFELVSVLFLMFTANARARGRGAIDFDVNYVVTWGQDHILKLNQGKEVQLSMDY 
SSGSGFESKSHYGSGFFQMRIKLPPRDSAGVVTAFYLTSKGDTHDEVDFEFLGNRQGKPI 
AIQTNVFSNGQGGREQKFVPWFDPTTSFHTYGILWNPYQIVFYVDKVPIRVFKNIKKSGV 
NYPSKPMQLVASLWNGENWATSGGKEKINWAYAPFKAQYQGFSDHGCHVNGQSNNANVCG 
STRYWWNTRTYSQLSANEQKVMENVRAKYMTYDYCSDRPRYPVPPSECRWNQ*
>AT2G29540.1 |  ATRPC14 (RNA POLYMERASE 14 KDA SUBUNIT) DNA binding / DNA-directed RNA polymerase/ protein dimerization 
MEHGSFTNVSHASFTLSEEDHTLANAVRFVLNQDPRVTVAAYTIPHPSLEQVNIRVQTTG 
DPAREVFKDACQELMQMNRHVRSVFDKAVAEYKDEQKRKEEAEEEELKRQRDLFGSMDIE 
NN*
>AT2G29540.1 |  ATRPC14 (RNA POLYMERASE 14 KDA SUBUNIT) DNA binding / DNA-directed RNA polymerase/ protein dimerization 
MEHGSFTNVSHASFTLSEEDHTLANAVRFVLNQDPRVTVAAYTIPHPSLEQVNIRVQTTG 
DPAREVFKDACQELMQMNRHVRSVFDKAVAEYKDEQKRKEEAEEEELKRQRDLFGSMDIE 
NN*
>AT2G29540.1 |  ATRPC14 (RNA POLYMERASE 14 KDA SUBUNIT) DNA binding / DNA-directed RNA polymerase/ protein dimerization 
MEHGSFTNVSHASFTLSEEDHTLANAVRFVLNQDPRVTVAAYTIPHPSLEQVNIRVQTTG 
DPAREVFKDACQELMQMNRHVRSVFDKAVAEYKDEQKRKEEAEEEELKRQRDLFGSMDIE 
NN*
>AT2G29540.2 |  ATRPC14 (RNA POLYMERASE 14 KDA SUBUNIT) DNA binding / DNA-directed RNA polymerase/ protein dimerization 
MEHGSFTNVSHASFTLSEEDHTLANAVRFVLNQDPRVTVAAYTIPHPSLEQVNIRVQTTG 
DPAREVFKDACQELMQMNRHVRSVFDKAVAEYKDEQKRKEEAEEEELKRQRDLFGSMDIE 
NN*
>AT2G29540.2 |  ATRPC14 (RNA POLYMERASE 14 KDA SUBUNIT) DNA binding / DNA-directed RNA polymerase/ protein dimerization 
MEHGSFTNVSHASFTLSEEDHTLANAVRFVLNQDPRVTVAAYTIPHPSLEQVNIRVQTTG 
DPAREVFKDACQELMQMNRHVRSVFDKAVAEYKDEQKRKEEAEEEELKRQRDLFGSMDIE 
NN*
>AT2G29540.2 |  ATRPC14 (RNA POLYMERASE 14 KDA SUBUNIT) DNA binding / DNA-directed RNA polymerase/ protein dimerization 
MEHGSFTNVSHASFTLSEEDHTLANAVRFVLNQDPRVTVAAYTIPHPSLEQVNIRVQTTG 
DPAREVFKDACQELMQMNRHVRSVFDKAVAEYKDEQKRKEEAEEEELKRQRDLFGSMDIE 
NN*
>AT2G29540.3 |  ATRPC14 (RNA POLYMERASE 14 KDA SUBUNIT) DNA binding / DNA-directed RNA polymerase/ protein dimerization 
MLVYIHVMLLEILIVFNECHIILSLRLMELICWCVDNDDYVNNQYCFQFCSPRVTVAAYT 
IPHPSLEQVNIRVQTTGDPAREVFKDACQELMQMNRHVRSVFDKAVAEYKDEQKRKEEAE 
EEELKRQRDLFGSMDIENN*
>AT2G29540.3 |  ATRPC14 (RNA POLYMERASE 14 KDA SUBUNIT) DNA binding / DNA-directed RNA polymerase/ protein dimerization 
MLVYIHVMLLEILIVFNECHIILSLRLMELICWCVDNDDYVNNQYCFQFCSPRVTVAAYT 
IPHPSLEQVNIRVQTTGDPAREVFKDACQELMQMNRHVRSVFDKAVAEYKDEQKRKEEAE 
EEELKRQRDLFGSMDIENN*
>AT2G29540.3 |  ATRPC14 (RNA POLYMERASE 14 KDA SUBUNIT) DNA binding / DNA-directed RNA polymerase/ protein dimerization 
MLVYIHVMLLEILIVFNECHIILSLRLMELICWCVDNDDYVNNQYCFQFCSPRVTVAAYT 
IPHPSLEQVNIRVQTTGDPAREVFKDACQELMQMNRHVRSVFDKAVAEYKDEQKRKEEAE 
EEELKRQRDLFGSMDIENN*
>AT1G10600.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN ubiquitin-dependent protein catabolic process LOCATED IN chloroplast EXPRESSED IN cultured cell CONTAINS InterPro DOMAIN/s Mov34/MPN/PAD-1 (InterProIPR000555) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G161441) Has 753 Blast hits to 752 proteins in 167 species Archae - 0 Bacteria - 0 Metazoa - 359 Fungi - 199 Plants - 115 Viruses - 0 Other Eukaryotes - 80 (source NCBI BLink) 
MVTLSSPSPSLSCVENVTCKSSHVSRVLISGTDNINHGESSEAKILRDVHISERLLEDFT 
ELARENTEKDLETCGTLAAFLERGIFYVTTLIIPKQESTSNSCQAMNEVEVFSIQNEREL 
YPVGWIHTHPSQGCFMSSVDLHTHYSYQVMVPEAFAIVVAPTDSSKSYGIFKLTDPGGME 
VLRGCSETGFHPHKEPEDGNPVYEHCSNVYKNSNLRFEIFDLR*
>AT1G10600.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN ubiquitin-dependent protein catabolic process LOCATED IN mitochondrion EXPRESSED IN cultured cell CONTAINS InterPro DOMAIN/s Mov34/MPN/PAD-1 (InterProIPR000555) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G161441) Has 601 Blast hits to 601 proteins in 160 species Archae - 0 Bacteria - 0 Metazoa - 286 Fungi - 140 Plants - 102 Viruses - 0 Other Eukaryotes - 73 (source NCBI BLink) 
MVTLSSPSPSLSCVENVTCKSSHVSRVLISGTDNINHGESSEAKILRDVHISERLLEDFT 
ELARENTEKDLETCGTLAAFLERGIFYVTTLIIPKQESTSNSCQAMNEVEVFSIQNEREL 
YPVGWIHTHPSQGCFMSSVDLHTHYSYQVMVPEAFAIVVAPTDSSKSYGIFKLTDPGGME 
VLRGCSETGFHPHKEPEDGNPVYEHCSNVYKNSNLRFEIFDLR*
>AT1G10600.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN ubiquitin-dependent protein catabolic process LOCATED IN chloroplast EXPRESSED IN cultured cell CONTAINS InterPro DOMAIN/s Mov34/MPN/PAD-1 (InterProIPR000555) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G161441) Has 771 Blast hits to 770 proteins in 170 species Archae - 0 Bacteria - 0 Metazoa - 366 Fungi - 201 Plants - 115 Viruses - 0 Other Eukaryotes - 89 (source NCBI BLink) 
MVTLSSPSPSLSCVENVTCKSSHVSRVLISGTDNINHGESSEAKILRDVHISERLLEDFT 
ELARENTEKDLETCGTLAAFLERGIFYVTTLIIPKQESTSNSCQAMNEVEVFSIQNEREL 
YPVGWIHTHPSQGCFMSSVDLHTHYSYQVMVPEAFAIVVAPTDSSKSYGIFKLTDPGGME 
VLRGCSETGFHPHKEPEDGNPVYEHCSNVYKNSNLRFEIFDLR*
>AT1G10600.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN ubiquitin-dependent protein catabolic process LOCATED IN chloroplast EXPRESSED IN cultured cell CONTAINS InterPro DOMAIN/s Mov34/MPN/PAD-1 (InterProIPR000555) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G161441) Has 753 Blast hits to 752 proteins in 167 species Archae - 0 Bacteria - 0 Metazoa - 359 Fungi - 199 Plants - 115 Viruses - 0 Other Eukaryotes - 80 (source NCBI BLink) 
MFISQKGYWRISLSLQERTLRRTSRLVGLSLPFLVLRFSSFMNLMCQAMNEVEVFSIQNE 
RELYPVGWIHTHPSQGCFMSSVDLHTHYSYQVMVPEAFAIVVAPTDSSKSYGIFKLTDPG 
GMEVLRGCSETGFHPHKEPEDGNPVYEHCSNVYKNSNLRFEIFDLR*
>AT1G10600.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN ubiquitin-dependent protein catabolic process LOCATED IN mitochondrion EXPRESSED IN cultured cell CONTAINS InterPro DOMAIN/s Mov34/MPN/PAD-1 (InterProIPR000555) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G161441) Has 601 Blast hits to 601 proteins in 160 species Archae - 0 Bacteria - 0 Metazoa - 286 Fungi - 140 Plants - 102 Viruses - 0 Other Eukaryotes - 73 (source NCBI BLink) 
MFISQKGYWRISLSLQERTLRRTSRLVGLSLPFLVLRFSSFMNLMCQAMNEVEVFSIQNE 
RELYPVGWIHTHPSQGCFMSSVDLHTHYSYQVMVPEAFAIVVAPTDSSKSYGIFKLTDPG 
GMEVLRGCSETGFHPHKEPEDGNPVYEHCSNVYKNSNLRFEIFDLR*
>AT1G10600.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN ubiquitin-dependent protein catabolic process LOCATED IN chloroplast EXPRESSED IN cultured cell CONTAINS InterPro DOMAIN/s Mov34/MPN/PAD-1 (InterProIPR000555) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G161441) Has 771 Blast hits to 770 proteins in 170 species Archae - 0 Bacteria - 0 Metazoa - 366 Fungi - 201 Plants - 115 Viruses - 0 Other Eukaryotes - 89 (source NCBI BLink) 
MFISQKGYWRISLSLQERTLRRTSRLVGLSLPFLVLRFSSFMNLMCQAMNEVEVFSIQNE 
RELYPVGWIHTHPSQGCFMSSVDLHTHYSYQVMVPEAFAIVVAPTDSSKSYGIFKLTDPG 
GMEVLRGCSETGFHPHKEPEDGNPVYEHCSNVYKNSNLRFEIFDLR*
>AT1G10600.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN ubiquitin-dependent protein catabolic process LOCATED IN chloroplast EXPRESSED IN cultured cell CONTAINS InterPro DOMAIN/s Mov34/MPN/PAD-1 (InterProIPR000555) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G161441) Has 753 Blast hits to 752 proteins in 167 species Archae - 0 Bacteria - 0 Metazoa - 359 Fungi - 199 Plants - 115 Viruses - 0 Other Eukaryotes - 80 (source NCBI BLink) 
MVTLSSPSPSLSCVENVTCKSSHVSRVLISGTDNINHGESSEAKILRDVHISERLLEDFT 
ELARENTEKDLETCGTLAAFLERGIFYVTTLIIPKQESTSNSCQAMNEVEVFSIQNEREL 
YPVGWIHTHPSQGCFMSSVDLHTHYSYQVMVPEAFAIVVAPTDSSNYGIFKLTDPGGMEV 
LRGCSETGFHPHKEPEDGNPVYEHCSNVYKNSNLRFEIFDLR*
>AT1G10600.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN ubiquitin-dependent protein catabolic process LOCATED IN mitochondrion EXPRESSED IN cultured cell CONTAINS InterPro DOMAIN/s Mov34/MPN/PAD-1 (InterProIPR000555) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G161441) Has 601 Blast hits to 601 proteins in 160 species Archae - 0 Bacteria - 0 Metazoa - 286 Fungi - 140 Plants - 102 Viruses - 0 Other Eukaryotes - 73 (source NCBI BLink) 
MVTLSSPSPSLSCVENVTCKSSHVSRVLISGTDNINHGESSEAKILRDVHISERLLEDFT 
ELARENTEKDLETCGTLAAFLERGIFYVTTLIIPKQESTSNSCQAMNEVEVFSIQNEREL 
YPVGWIHTHPSQGCFMSSVDLHTHYSYQVMVPEAFAIVVAPTDSSNYGIFKLTDPGGMEV 
LRGCSETGFHPHKEPEDGNPVYEHCSNVYKNSNLRFEIFDLR*
>AT1G10600.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN ubiquitin-dependent protein catabolic process LOCATED IN chloroplast EXPRESSED IN cultured cell CONTAINS InterPro DOMAIN/s Mov34/MPN/PAD-1 (InterProIPR000555) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G161441) Has 771 Blast hits to 770 proteins in 170 species Archae - 0 Bacteria - 0 Metazoa - 366 Fungi - 201 Plants - 115 Viruses - 0 Other Eukaryotes - 89 (source NCBI BLink) 
MVTLSSPSPSLSCVENVTCKSSHVSRVLISGTDNINHGESSEAKILRDVHISERLLEDFT 
ELARENTEKDLETCGTLAAFLERGIFYVTTLIIPKQESTSNSCQAMNEVEVFSIQNEREL 
YPVGWIHTHPSQGCFMSSVDLHTHYSYQVMVPEAFAIVVAPTDSSNYGIFKLTDPGGMEV 
LRGCSETGFHPHKEPEDGNPVYEHCSNVYKNSNLRFEIFDLR*
>AT2G31020.1 |  ORP1A (OSBP(OXYSTEROL BINDING PROTEIN)-RELATED PROTEIN 1A) oxysterol binding 
MYAATPETPFGSARSQPVITRSVSQRYNHPGQSNHHHLLHSLSFNHQNVLALPAAAREPP 
VDVKINDIAGNSIAGILYKWVNYGKGWRPRWFVLQDGVLSYYKIKGPDKIVVIHETEKGS 
RVIGEESTRMISRNKRHAATNNTNHQLRRKPFGEVHLKVSSIRESRSDDKRFSIFTGTKR 
LHLRAETREDREAWIEALQAVKDMFPRMSNCELMAPTNNLDISIEKLRLRLVEEGVSESA 
IQDCEQITRSEFSAIQSQLLLLKQKQWLLIDTLRQLETEKVDLENTVVDETQRQAGNGDS 
EETISESDDDNEQFDEAEEEMDTCDSLSSSSFKSIGSVFRTSSFSSDDDGLTNGFESEND 
DVDPSIKTIGFNYPHVKRRKKLPDPVEKEKSVSLWSMIKDNIGKDLTKVCLPVYFNEPLS 
SLQKCFEDLEYSYLLDQASEWGKRGNNLMRILNVAAFAVSGYASTEGRICKPFNPMLGET 
YEADYPDKGLRFFSEKVSHHPMIVACHCDGTGWKFWGDSNLKSKFWGRSIQLDPIGLLTL 
QFDDGEIVQWSKVTTSIYNLILGKLYCDHYGTMKIEGNGEYSCKLKFKEQSMIDRNPHQV 
QGIVEDKNGKTVARLFGKWDESIHYVMVDQGKVNESHLLWKRNKQPENPTKYNLTRFGIT 
LNELTPGLKEKLPPTDSRLRPDQRYLEKGEYEMGNAEKLRLEQRQRQAREMQERGWKPKW 
FRKEKGSETYRYIGGYWEARDSGSWDDCPDIFGQVHQSIK*
>AT1G78970.2 |  LUP1 (LUPEOL SYNTHASE 1) beta-amyrin synthase/ lupeol synthase 
MWKLKIGKGNGEDPHLFSSNNFVGRQTWKFDHKAGSPEERAAVEEARRGFLDNRFRVKGC 
SDLLWRMQFLREKKFEQGIPQLKATNIEEITYETTTNALRRGVRYFTALQASDGHWPGEI 
TGPLFFLPPLIFCLYITGHLEEVFDAEHRKEMLRHIYCHQNEDGGWGLHIESKSVMFCTV 
LNYICLRMLGENPEQDACKRARQWILDRGGVIFIPSWGKFWLSILGVYDWSGTNPTPPEL 
LMLPSFLPIHPGKILCYSRMVSIPMSYLYGKRFVGPITPLILLLREELYLEPYEEINWKK 
SRRLYAKEDMYYAHPLVQDLLSDTLQNFVEPLLTRWPLNKLVREKALQLTMKHIHYEDEN 
SHYITIGCVEKVLCMLACWVENPNGDYFKKHLARIPDYMWVAEDGMKMQSFGCQLWDTGF 
AIQALLASNLPDETDDALKRGHNYIKASQVRENPSGDFRSMYRHISKGAWTFSDRDHGWQ 
VSDCTAEALKCCLLLSMMSADIVGQKIDDEQLYDSVNLLLSLQSGNGGVNAWEPSRAYKW 
LELLNPTEFMANTMVEREFVECTSSVIQALDLFRKLYPDHRKKEINRSIEKAVQFIQDNQ 
TPDGSWYGNWGVCFIYATWFALGGLAAAGETYNDCLAMRNGVHFLLTTQRDDGGWGESYL 
SCSEQRYIPSEGERSNLVQTSWAMMALIHTGQAERDLIPLHRAAKLIINSQLENGDFPQQ 
EIVGAFMNTCMLHYATYRNTFPLWALAEYRKVVFIVN*
>AT1G78970.2 |  LUP1 (LUPEOL SYNTHASE 1) beta-amyrin synthase/ lupeol synthase 
MWKLKIGKGNGEDPHLFSSNNFVGRQTWKFDHKAGSPEERAAVEEARRGFLDNRFRVKGC 
SDLLWRMQFLREKKFEQGIPQLKATNIEEITYETTTNALRRGVRYFTALQASDGHWPGEI 
TGPLFFLPPLIFCLYITGHLEEVFDAEHRKEMLRHIYCHQNEDGGWGLHIESKSVMFCTV 
LNYICLRMLGENPEQDACKRARQWILDRGGVIFIPSWGKFWLSILGVYDWSGTNPTPPEL 
LMLPSFLPIHPGKILCYSRMVSIPMSYLYGKRFVGPITPLILLLREELYLEPYEEINWKK 
SRRLYAKEDMYYAHPLVQDLLSDTLQNFVEPLLTRWPLNKLVREKALQLTMKHIHYEDEN 
SHYITIGCVEKVLCMLACWVENPNGDYFKKHLARIPDYMWVAEDGMKMQSFGCQLWDTGF 
AIQALLASNLPDETDDALKRGHNYIKASQVRENPSGDFRSMYRHISKGAWTFSDRDHGWQ 
VSDCTAEALKCCLLLSMMSADIVGQKIDDEQLYDSVNLLLSLQSGNGGVNAWEPSRAYKW 
LELLNPTEFMANTMVEREFVECTSSVIQALDLFRKLYPDHRKKEINRSIEKAVQFIQDNQ 
TPDGSWYGNWGVCFIYATWFALGGLAAAGETYNDCLAMRNGVHFLLTTQRDDGGWGESYL 
SCSEQRYIPSEGERSNLVQTSWAMMALIHTGQAERDLIPLHRAAKLIINSQLENGDFPQQ 
EIVGAFMNTCMLHYATYRNTFPLWALAEYRKVVFIVN*
>AT1G78970.1 |  LUP1 (LUPEOL SYNTHASE 1) beta-amyrin synthase/ lupeol synthase 
MWKLKIGKGNGEDPHLFSSNNFVGRQTWKFDHKAGSPEERAAVEEARRGFLDNRFRVKGC 
SDLLWRMQFLREKKFEQGIPQLKATNIEEITYETTTNALRRGVRYFTALQASDGHWPGEI 
TGPLFFLPPLIFCLYITGHLEEVFDAEHRKEMLRHIYCHQNEDGGWGLHIESKSVMFCTV 
LNYICLRMLGENPEQDACKRARQWILDRGGVIFIPSWGKFWLSILGVYDWSGTNPTPPEL 
LMLPSFLPIHPGKILCYSRMVSIPMSYLYGKRFVGPITPLILLLREELYLEPYEEINWKK 
SRRLYAKEDMYYAHPLVQDLLSDTLQNFVEPLLTRWPLNKLVREKALQLTMKHIHYEDEN 
SHYITIGCVEKVLCMLACWVENPNGDYFKKHLARIPDYMWVAEDGMKMQSFGCQLWDTGF 
AIQALLASNLPDETDDALKRGHNYIKASQVRENPSGDFRSMYRHISKGAWTFSDRDHGWQ 
VSDCTAEALKCCLLLSMMSADIVGQKIDDEQLYDSVNLLLSLQSGNGGVNAWEPSRAYKW 
LELLNPTEFMANTMVEREFVECTSSVIQALDLFRKLYPDHRKKEINRSIEKAVQFIQDNQ 
TPDGSWYGNWGVCFIYATWFALGGLAAAGETYNDCLAMRNGVHFLLTTQRDDGGWGESYL 
SCSEQRYIPSEGERSNLVQTSWAMMALIHTGQAERDLIPLHRAAKLIINSQLENGDFPQQ 
EIVGAFMNTCMLHYATYRNTFPLWALAEYRKVVFIVN*
>AT1G78970.1 |  LUP1 (LUPEOL SYNTHASE 1) beta-amyrin synthase/ lupeol synthase 
MWKLKIGKGNGEDPHLFSSNNFVGRQTWKFDHKAGSPEERAAVEEARRGFLDNRFRVKGC 
SDLLWRMQFLREKKFEQGIPQLKATNIEEITYETTTNALRRGVRYFTALQASDGHWPGEI 
TGPLFFLPPLIFCLYITGHLEEVFDAEHRKEMLRHIYCHQNEDGGWGLHIESKSVMFCTV 
LNYICLRMLGENPEQDACKRARQWILDRGGVIFIPSWGKFWLSILGVYDWSGTNPTPPEL 
LMLPSFLPIHPGKILCYSRMVSIPMSYLYGKRFVGPITPLILLLREELYLEPYEEINWKK 
SRRLYAKEDMYYAHPLVQDLLSDTLQNFVEPLLTRWPLNKLVREKALQLTMKHIHYEDEN 
SHYITIGCVEKVLCMLACWVENPNGDYFKKHLARIPDYMWVAEDGMKMQSFGCQLWDTGF 
AIQALLASNLPDETDDALKRGHNYIKASQVRENPSGDFRSMYRHISKGAWTFSDRDHGWQ 
VSDCTAEALKCCLLLSMMSADIVGQKIDDEQLYDSVNLLLSLQSGNGGVNAWEPSRAYKW 
LELLNPTEFMANTMVEREFVECTSSVIQALDLFRKLYPDHRKKEINRSIEKAVQFIQDNQ 
TPDGSWYGNWGVCFIYATWFALGGLAAAGETYNDCLAMRNGVHFLLTTQRDDGGWGESYL 
SCSEQRYIPSEGERSNLVQTSWAMMALIHTGQAERDLIPLHRAAKLIINSQLENGDFPQQ 
EIVGAFMNTCMLHYATYRNTFPLWALAEYRKVVFIVN*
>AT1G78950.1 |  beta-amyrin synthase putative 
MWRLKIGEGNGDDPYLFTTNNFAGRQTWEFDPDGGSPEERHSVVEARRIFYDNRFHVKAS 
SDLLWRMQFLREKKFEQRIAPVKVEDSEKVTFETATSALRRGIHFFSALQASDGHWPAEN 
AGPLFFLPPLVFCLYITGHLDEVFTSEHRKEILRYIYCHQKEDGGWGLHIEGHSTMFCTT 
LNYICMRILGESPDGGHDNACGRAREWILSHGGVTYIPSWGKTWLSILGVFDWSGSNPMP 
PEFWILPSFFPVHPAKMWSYCRMVYLPMSYLYGKRFVGPITSLILQLRKELYLQPYEEIN 
WMKVRHLCAKEDTYYPRPLVQELVWDSLYIFAEPFLARWPFNKLLREKALQLAMKHIHYE 
DENSRYITIGCVEKVLCMLACWVEDPNGDYFKKHLSRISDYLWMAEDGMKMQSFGSQLWD 
TGFAMQALLASNLSSEISDVLRRGHEFIKNSQVGENPSGDYKSMYRHISKGAWTFSDRDH 
GWQVSDCTAHGLKCCLLFSMLAPDIVGPKQDPERLHDSVNILLSLQSKNGGMTAWEPAGA 
PKWLELLNPTEMFSDIVIEHEYSECTSSAIQALSLFKQLYPDHRTTEITAFIKKAAEYLE 
NMQTRDGSWYGNWGICFTYGTWFALAGLAAAGKTFNDCEAIRKGVQFLLAAQKDNGGWGE 
SYLSCSKKIYIAQVGEISNVVQTAWALMGLIHSGQAERDPIPLHRAAKLIINSQLESGDF 
PQQQATGVFLKNCTLHYAAYRNIHPLWALAEYRARVSLP*
>AT5G42600.1 |  MRN1 (MARNERAL SYNTHASE) catalytic/ marneral synthase 
MWRLRIGAEARQDPHLFTTNNFAGRQIWEFDANGGSPEELAEVEEARLNFANNKSRFKAS 
PDLFWRRQFLREKKFEQKIPRVRIEDAEKITYEDAKTALRRGVLYYAACQANDGHWPSEV 
SGSMFLDAPFVICLYITGHLEKIFTLEHVKELLRYMYNTQNEDGGWGLDVESHSVMFCTV 
LNYICLRILGVEPDHDGQKSACARARKWILDHGGATYAPMVAKAWLSVLGVYDWSGCKPL 
PPEIWMLPSFSPINGGTLWIYIRDLLMGMSYLYGKKFVATPTALILQLREELYPQPYSKI 
IWSKARNRCAKEDLLYPKSFGQDLFWEGVHMLSENIINRWPLNKFVRQRALRTTMELVHY 
HDETTHYITGACVAKPFHMLACWVEDPDGDYFKKHLARVPDFIWIAEDGLKFQLMGMQSW 
NAALSLQVMLAANMDDEIRSTLIKGYDFLKQSQISENPQGDHLKMFRDITKGGWTFQDRE 
QGLPISDGTAESIECCIHFHRMPSEFIGEKMDVEKLYDAVNFLIYLQSDNGGMPVWEPAP 
GKKWLEWLSPVEHVENTVVEQEYLECTGSVIAGLVCFKKEFPDHRPKEIEKLIKKGLKYI 
EDLQMPDGSWYGNWGVCFTYGTLFAVRGLAAAGKTFGNSEAIRRAVQFILNTQNAEGGWG 
ESALSCPNKKYIPSKGNVTNVVNTGQAMMVLLIGGQMERDPSPVHRAAKVLINSQLDIGD 
FPQQERRGIYMNMLLHYPTYRNMFSLWALALYTNALRLLVS*
>AT1G66960.1 |  lupeol synthase putative / 23-oxidosqualene-triterpenoid cyclase putative 
MWRLKVGEGKGKDPYLFSSNNFVGRQTWEFDPKAGTREERTAVEEARRSFFDNRSRVKPS 
SDLLWKMQFLKEAKFEQVIPPVKIDGGEAITYEKATNALRRGVAFLSALQASDGHWPGEF 
TGPLCMLPPLVFCLYITGHLEEVFDAEHRKEMLRYIYCHQNEDGGWGFHIESKSIMFTTT 
LNYICLRILGVGPDGGLENACKRARQWILSHGGVIYIPCWGKVWLSVLGIYDWSGVNPMP 
PEIWLLPYFLPIHLGKAFSYTRITYMPISYLYGKKFVGQITPLIMQLREELHLQPYEEIN 
WNKARHLCAKEDKYYPHPLVQDLIWDALHTFVEPLLASWPINKLVRKKALQVAMKHIHYE 
DENSHYITIGCIEKNLCMLACWIDNPDGNHFKKHLSRIPDMMWVAEDGMKMQCFGSQLWM 
TGFAVQALLASDPRDETYDVLRRAHDYIKKSQVRDNPSGDFKSMYRHISKGGWTLSDRDH 
GWQVSDCTAEAAKCCMLLSTMPTDITGEKINLEQLYDSVNLMLSLQSENGGFTAWEPVRA 
YKWMELMNPTDLFANAMTEREYTECTSAVLQALVIFNQLYPDHRTKEITKSIEKAVQFIE 
SKQLRDGSWYGSWGICFTYGTWFALCGLAAIGKTYNNCLSMRDGVHFLLNIQNEDGGWGE 
SYMSCPEQRYIPLEGNRSNVVQTAWAMMALIHAGQAKRDLIPLHSAAKFIITSQLENGDF 
PQQELLGASMSTCMLHYSTYKDIFPPWALAEYRKAAFIHHADL*
>AT5G48010.1 |  THAS1 (THALIANOL SYNTHASE 1) catalytic/ thalianol synthase 
MWRLRTGPKAGEDTHLFTTNNYAGRQIWEFDANAGSPQEIAEVEDARHKFSDNTSRFKTT 
ADLLWRMQFLREKKFEQKIPRVIIEDARKIKYEDAKTALKRGLLYFTALQADDGHWPAEN 
SGPNFYTPPFLICLYITGHLEKIFTPEHVKELLRHIYNMQNEDGGWGLHVESHSVMFCTV 
INYVCLRIVGEEVGHDDQRNGCAKAHKWIMDHGGATYTPLIGKALLSVLGVYDWSGCNPI 
PPEFWLLPSSFPVNGGTLWIYLRDTFMGLSYLYGKKFVAPPTPLILQLREELYPEPYAKI 
NWTQTRNRCGKEDLYYPRSFLQDLFWKSVHMFSESILDRWPLNKLIRQRALQSTMALIHY 
HDESTRYITGGCLPKAFHMLACWIEDPKSDYFKKHLARVREYIWIGEDGLKIQSFGSQLW 
DTALSLHALLDGIDDHDVDDEIKTTLVKGYDYLKKSQITENPRGDHFKMFRHKTKGGWTF 
SDQDQGWPVSDCTAESLECCLFFESMPSELIGKKMDVEKLYDAVDYLLYLQSDNGGIAAW 
QPVEGKAWLELLNIMIFRYVECTGSAIAALTQFNKQFPGYKNVEVKRFITKAAKYIEDMQ 
TVDGSWYGNWGVCFIYGTFFAVRGLVAAGKTYSNCEAIRKAVRFLLDTQNPEGGWGESFL 
SCPSKKYTPLKGNSTNVVQTAQALMVLIMGDQMERDPLPVHRAAQVLINSQLDNGDFPQQ 
EIMGTFMRTVMLHFPTYRNTFSLWALTHYTHALRRLLP*
>AT5G48010.1 |  THAS1 (THALIANOL SYNTHASE 1) catalytic/ thalianol synthase 
MWRLRTGPKAGEDTHLFTTNNYAGRQIWEFDANAGSPQEIAEVEDARHKFSDNTSRFKTT 
ADLLWRMQFLREKKFEQKIPRVIIEDARKIKYEDAKTALKRGLLYFTALQADDGHWPAEN 
SGPNFYTPPFLICLYITGHLEKIFTPEHVKELLRHIYNMQNEDGGWGLHVESHSVMFCTV 
INYVCLRIVGEEVGHDDQRNGCAKAHKWIMDHGGATYTPLIGKALLSVLGVYDWSGCNPI 
PPEFWLLPSSFPVNGGTLWIYLRDTFMGLSYLYGKKFVAPPTPLILQLREELYPEPYAKI 
NWTQTRNRCGKEDLYYPRSFLQDLFWKSVHMFSESILDRWPLNKLIRQRALQSTMALIHY 
HDESTRYITGGCLPKAFHMLACWIEDPKSDYFKKHLARVREYIWIGEDGLKIQSFGSQLW 
DTALSLHALLDGIDDHDVDDEIKTTLVKGYDYLKKSQITENPRGDHFKMFRHKTKGGWTF 
SDQDQGWPVSDCTAESLECCLFFESMPSELIGKKMDVEKLYDAVDYLLYLQSDNGGIAAW 
QPVEGKAWLELLNIMIFRYVECTGSAIAALTQFNKQFPGYKNVEVKRFITKAAKYIEDMQ 
TVDGSWYGNWGVCFIYGTFFAVRGLVAAGKTYSNCEAIRKAVRFLLDTQNPEGGWGESFL 
SCPSKKYTPLKGNSTNVVQTAQALMVLIMGDQMERDPLPVHRAAQVLINSQLDNGDFPQQ 
EIMGTFMRTVMLHFPTYRNTFSLWALTHYTHALRRLLP*
>AT5G48010.2 |  THAS1 (THALIANOL SYNTHASE 1) catalytic/ thalianol synthase 
MWRLRTGPKAGEDTHLFTTNNYAGRQIWEFDANAGSPQEIAEVEDARHKFSDNTSRFKTT 
ADLLWRMQFLREKKFEQKIPRVIIEDARKIKYEDAKTALKRGLLYFTALQADDGHWPAEN 
SGPNFYTPPFLICLYITGHLEKIFTPEHVKELLRHIYNMQNEDGGWGLHVESHSVMFCTV 
INYVCLRIVGEEVGHDDQRNGCAKAHKWIMDHGGATYTPLIGKALLSVLGVYDWSGCNPI 
PPEFWLLPSSFPVNGGTLWIYLRDTFMGLSYLYGKKFVAPPTPLILQLREELYPEPYAKI 
NWTQTRNRCGKEDLYYPRSFLQDLFWKSVHMFSESILDRWPLNKLIRQRALQSTMALIHY 
HDESTRYITGGCLPKAFHMLACWIEDPKSDYFKKHLARVREYIWIGEDGLKIQSFGSQLW 
DTALSLHALLDGIDDHDVDDEIKTTLVKGYDYLKKSQITENPRGDHFKMFRHKTKGGWTF 
SDQDQGWPVSDCTAESLECCLFFESMPSELIGKKMDVEKLYDAVDYLLYLQSDNGGIAAW 
QPVEGKAWLEWLSPVEFLEDTIVEYEYVECTGSAIAALTQFNKQFPGYKNVEVKRFITKA 
AKYIEDMQTVDGSWYGNWGVCFIYGTFFAVRGLVAAGKTYSNCEAIRKAVRFLLDTQNPE 
GGWGESFLSCPSKKYTPLKGNSTNVVQTAQALMVLIMGDQMERDPLPVHRAAQVLINSQL 
DNGDFPQQEIMGTFMRTVMLHFPTYRNTFSLWALTHYTHALRRLLP*
>AT5G48010.2 |  THAS1 (THALIANOL SYNTHASE 1) catalytic/ thalianol synthase 
MWRLRTGPKAGEDTHLFTTNNYAGRQIWEFDANAGSPQEIAEVEDARHKFSDNTSRFKTT 
ADLLWRMQFLREKKFEQKIPRVIIEDARKIKYEDAKTALKRGLLYFTALQADDGHWPAEN 
SGPNFYTPPFLICLYITGHLEKIFTPEHVKELLRHIYNMQNEDGGWGLHVESHSVMFCTV 
INYVCLRIVGEEVGHDDQRNGCAKAHKWIMDHGGATYTPLIGKALLSVLGVYDWSGCNPI 
PPEFWLLPSSFPVNGGTLWIYLRDTFMGLSYLYGKKFVAPPTPLILQLREELYPEPYAKI 
NWTQTRNRCGKEDLYYPRSFLQDLFWKSVHMFSESILDRWPLNKLIRQRALQSTMALIHY 
HDESTRYITGGCLPKAFHMLACWIEDPKSDYFKKHLARVREYIWIGEDGLKIQSFGSQLW 
DTALSLHALLDGIDDHDVDDEIKTTLVKGYDYLKKSQITENPRGDHFKMFRHKTKGGWTF 
SDQDQGWPVSDCTAESLECCLFFESMPSELIGKKMDVEKLYDAVDYLLYLQSDNGGIAAW 
QPVEGKAWLEWLSPVEFLEDTIVEYEYVECTGSAIAALTQFNKQFPGYKNVEVKRFITKA 
AKYIEDMQTVDGSWYGNWGVCFIYGTFFAVRGLVAAGKTYSNCEAIRKAVRFLLDTQNPE 
GGWGESFLSCPSKKYTPLKGNSTNVVQTAQALMVLIMGDQMERDPLPVHRAAQVLINSQL 
DNGDFPQQEIMGTFMRTVMLHFPTYRNTFSLWALTHYTHALRRLLP*
>AT5G36150.1 |  ATPEN3 (putative pentacyclic triterpene synthase 3) catalytic/ lupeol synthase 
MWRLRIGAKAGDDPHLCTTNNFLGRQIWEFDANAGSPAELSEVDQARQNFSNNRSQYKAC 
ADLLWRMQFLREKNFEQKIPRVRIEDAKKITFEDAKNTLRRGIHYMAALQSDDGHWPSEN 
AGCIFFNAPFVICLYITGHLDKVFSEEHRKEMLRYMYNHQNDDGGWGIDVESHSFMFCTV 
INYICLRIFGVDPDHDGESACARARKWIIDHGGATYTPLFGKAWLSVLGVYEWSGCKPIP 
PEFWFFPSYFPINGGTLWIYLRDTFMAMSYLYGKKFVAKPTPLILQLREELYPQPYAEIV 
WSQARSRCAKEDLYYPQSLVQDLFWKLVHMFSENILNRWPFNKLIREKAIRTAMELIHYH 
DEATRYITGGAVPKVFHMLACWVEDPESDYFKKHLARVSHFIWIAEDGLKIQTFGSQIWD 
TAFVLQVMLAADVDDEIRPTLIKGYSYLRKSQFTENPPGDYINMFRDISKGGWGYSDKDQ 
GWPVSDCISESLECCLIFESMSSEFIGEKMEVERLYDAVNMLLYMQSRNGGISIWEAASG 
KKWLEWLSPIEFIEDTILEHEYLECTGSAIVVLARFMKQFPGHRTEEVKKFITKGVKYIE 
SLQIADGSWYGNWGICFIYGTFFAVRGLVAAGNTYDNCEAIRRAVRFLLDIQNGEGGWGE 
SFLSCPNKNYIPLEGNKTDVVNTGQALMVLIMGGQMDRDPLPVHRAAKVLINSQMDNGDF 
PQQEIRGVYKMNVMLNFPTFRNSFTLWALTHYTKAIRLLL*
>AT1G78500.1 |  pentacyclic triterpene synthase putative 
MWRLKIGAKGGDETHLFTTNNYTGRQTWEFDADACSPEELAEVDEARQNFSINRSRFKIS 
ADLLWRMQFLREKKFEQKIPRVEIGDAENITYKDAKTALRRGILYFKALQAEDGHWPAEN 
SGCLFFEAPFVICLYITGHLEKILTLEHRKELLRYMYNHQNEDGGWGIHVEGQSAMFCTV 
INYICLRILGVEADLDDIKGSGCARARKWILDHGGATYTPLIGKAWLSILGVYDWSGCKP 
IPPEVWMLPTFSPFNGGTLWIYFRDIFMGVSYLYGKKFVATPTPLILQLREELYPQPYDK 
ILWSQARNQCAKEDLYYPQSFLQEMFWKCVHILSENILNRWPCNKLIRQKALRTTMELLH 
YQDEASRYFTGGCVPKPFHMLACWVEDPDGDYFKKHLARVPDYIWIGEDGLKIQSFGSQL 
WDTAFSLQVMLAYQDVDDDDDEIRSTLIKGYSFLNKSQLTQNPPGDHRKMLKDIAKGGWT 
FSDQDQGWPVSDCTAESLECCLVFGSMPSELIGEKMDVERLYDAVNLLLYFQSKNGGITV 
WEAARGRTWLEWLSPVEFMEDTIVEHEYVECTGSAIVALARFLKEFPEHRREEVEKFIKN 
AVKYIESFQMPDGSWYGNWGVCFMYGTFFAVRGLVAAGKTYQNCEPIRKAVQFILETQNV 
EGGWGESYLSCPNKKYTLLEGNRTNVVNTGQALMVLIMGGQMERDPLPVHRAAKVLINSQ 
LDNGDFPQEEIMGVFKMNVMVHYATYRNIFTLWALTYYTKALRVPLC*
>AT3G45130.1 |  LAS1 lanosterol synthase 
MWRLKLSEGDEESVNQHVGRQFWEYDNQFGTSEERHHINHLRSNFTLNRFSSKHSSDLLY 
RFQCWKEKGKGMERLPQVKVKEGEERLINEEVVNVTLRRSLRFYSILQSQDGFWPGDYGG 
PLFLLPALVIGLYVTEVLDGTLTAQHQIEIRRYLYNHQNKDGGWGLHVEGNSTMFCTVLS 
YVALRLMGEELDGGDGAMESARSWIHHHGGATFIPSWGKFWLSVLGAYEWSGNNPLPPEL 
WLLPYSLPFHPGRMWCHCRMVYLPMSYLYGRRFVCRTNGTILSLRRELYTIPYHHIDWDT 
ARNQCAKEDLYYPHPKIQDVLWSCLNKFGEPLLERWPLNNLRNHALQTVMQHIHYEDQNS 
HYICIGPVNKVLNMLCCWVESSNSEAFKSHLSRIKDYLWVAEDGMKMQGYNGSQLWDVTL 
AVQAILATNLVDDYGLMLKKAHNYIKNTQIRKDTSGDPGLWYRHPCKGGWGFSTGDNPWP 
VSDCTAEALKAALLLSQMPVNLVGEPMPEEHLVDAVNFILSLQNKNGGFASYELTRSYPE 
LEVINPSETFGDIIIDYQYVECTSAAIQGLVLFTTLNSSYKRKEIVGSINKAVEFIEKTQ 
LPDGSWYGSWGVCFTYATWFGIKGMLASGKTYESSLCIRKACGFLLSKQLCCGGWGESYL 
SCQNKVYTNLPGNKSHIVNTSWALLALIEAGQASRDPMPLHRGAKSLINSQMEDGDYPQQ 
EILGVFNRNCMISYSAYRNIFPIWALGEYRKLMLSL*
>AT4G15340.1 |  ATPEN1 (ARABIDOPSIS THALIANA PENTACYCLIC TRITERPENE SYNTHASE 1) arabidiol synthase 
MWRLRIGAKAGNDTHLFTTNNYVGRQIWEFDANAGSPQELAEVEEARRNFSNNRSHYKAS 
ADLLWRMQFLREKGFEQKIPRVRVEDAAKIRYEDAKTALKRGLHYFTALQADDGHWPADN 
SGPNFFIAPLVICLYITGHLEKIFTVEHRIELIRYMYNHQNEDGGWGLHVESPSIMFCTV 
INYICLRIVGVEAGHDDDQGSTCTKARKWILDHGGATYTPLIGKACLSVLGVYDWSGCKP 
MPPEFWFLPSSFPINGGTLWIYLRDIFMGLSYLYGKKFVATPTPLILQLQEELYPEPYTK 
INWRLTRNRCAKEDLCYPSSFLQDLFWKGVHIFSESILNRWPFNKLIRQAALRTTMKLLH 
YQDEANRYITGGSVPKAFHMLACWVEDPEGEYFKKHLARVSDFIWIGEDGLKIQSFGSQL 
WDTVMSLHFLLDGVEDDVDDEIRSTLVKGYDYLKKSQVTENPPSDHIKMFRHISKGGWTF 
SDKDQGWPVSDCTAESLKCCLLFERMPSEFVGQKMDVEKLFDAVDFLLYLQSDNGGITAW 
EPADGKTWLEWFSPVEFVQDTVIEHEYVECTGSAIVALTQFSKQFPEFRKKEVERFITNG 
VKYIEDLQMKDGSWCGNWGVCFIYGTLFAVRGLVAAGKTFHNCEPIRRAVRFLLDTQNQE 
GGWGESYLSCLRKKYTPLAGNKTNIVSTGQALMVLIMGGQMERDPLPVHRAAKVVINLQL 
DNGDFPQQEVMGVFNMNVLLHYPTYRNIYSLWALTLYTQALRRLQP*
>AT1G78960.1 |  ATLUP2 beta-amyrin synthase/ lupeol synthase 
MWKLKIGEGNGEDPYLFSSNNFVGRQTWEFDPKAGTPEERAAVEDARRNYLDNRPRVKGC 
SDLLWRMQFLKEAKFEQVIPPVKIDDGEGITYKNATDALRRAVSFYSALQSSDGHWPAEI 
TGTLFFLPPLVFCFYITGHLEKIFDAEHRKEMLRHIYCHQNEDGGWGLHIEGKSVMFCTV 
LNYICLRMLGEGPNGGRNNACKRARQWILDHGGVTYIPSWGKIWLSILGIYDWSGTNPMP 
PEIWLLPSFFPIHLGKTLCYTRMVYMPMSYLYGKRFVGPLTPLIMLLRKELHLQPYEEIN 
WNKARRLCAKEDMIYPHPLVQDLLWDTLHNFVEPILTNWPLKKLVREKALRVAMEHIHYE 
DENSHYITIGCVEKVLCMLACWIENPNGDHFKKHLARIPDFMWVAEDGLKMQSFGSQLWD 
TVFAIQALLACDLSDETDDVLRKGHSFIKKSQVRENPSGDFKSMYRHISKGAWTLSDRDH 
GWQVSDCTAEALKCCMLLSMMPAEVVGQKIDPEQLYDSVNLLLSLQGEKGGLTAWEPVRA 
QEWLELLNPTDFFTCVMAEREYVECTSAVIQALVLFKQLYPDHRTKEIIKSIEKGVQFIE 
SKQTPDGSWHGNWGICFIYATWFALSGLAAAGKTYKSCLAVRKGVDFLLAIQEEDGGWGE 
SHLSCPEQRYIPLEGNRSNLVQTAWAMMGLIHAGQAERDPTPLHRAAKLIITSQLENGDF 
PQQEILGVFMNTCMLHYATYRNIFPLWALAEYRKAAFATHQDL*
>AT1G78955.1 |  CAMS1 (Camelliol C synthase 1) beta-amyrin synthase 
MWKLKIANGNKEEPYLFSTNNFLGRQTWEFDPDAGTVEELAAVEEARRKFYDDRFRVKAS 
SDLIWRMQFLKEKKFEQVIPPAKVEDANNITSEIATNALRKGVNFLSALQASDGHWPAEN 
AGPLFFLPPLVFCLYVTGHLHEIFTQDHRREVLRYIYCHQNEDGGWGLHIEGNSTMFCTT 
LNYICMRILGEGPNGGPGNACKRARDWILDHGGATYIPSWGKTWLSILGVFDWSGSNPMP 
PEFWILPSFLPIHPAKMWCYCRLVYMPMSYLYGKRFVGPISPLILQLREEIYLQPYAKIN 
WNRARHLCAKEDAYCPHPQIQDVIWNCLYIFTEPFLACWPFNKLLREKALGVAMKHIHYE 
DENSRYITIGCVEKALCMLACWVEDPNGIHFKKHLLRISDYLWIAEDGMKMQSFGSQLWD 
SGFALQALVASNLVNEIPDVLRRGYDFLKNSQVRENPSGDFTNMYRHISKGSWTFSDRDH 
GWQASDCTAESFKCCLLLSMIPPDIVGPKMDPEQLYEAVTILLSLQSKNGGVTAWEPARG 
QEWLELLNPTEVFADIVVEHEYNECTSSAIQALILFKQLYPNHRTEEINTSIKKAVQYIE 
SIQMLDGSWYGSWGVCFTYSTWFGLGGLAAAGKTYNNCLAMRKGVHFLLTTQKDNGGWGE 
SYLSCPKKRYIPSEGERSNLVQTSWAMMGLLHAGQAERDPSPLHRAAKLLINSQLENGDF 
PQQEITGAFMKNCLLHYAAYRNIFPVWALAEYRRRVPLPYEKPSTERRS*