>AT5G12010.1 |  unknown protein 
MKAAVFRNEDGDEEEEEEEEEEVCNVFDGGEVESKRNNETKNLKGFFTSLLLMEEHEKQD 
QEARNAASRREMSDFQSNYRKRARTMSDYYSDLNDYYADAEESGDINLKKSRVSRAVASV 
AVAAASEIEAESSEITGSGSVRGTGSGQQRRLWVKDRSRAWWEECSRLDYPEEDFKKAFR 
MSKSTFELICDELNSAVAKEDTALRNAIPVRQRVAVCIWRLATGEPLRLVSKKFGLGIST 
CHKLVLEVCKAIKDVLMPKYLQWPDDESLRNIRERFESVSGIPNVVGSMYTTHIPIIAPK 
ISVASYFNKRHTERNQKTSYSITIQAVVNPKGVFTDLCIGWPGSMPDDKVLEKSLLYQRA 
NNGGLLKGMWVAGGPGHPLLDWVLVPYTQQNLTWTQHAFNEKMSEVQGVAKEAFGRLKGR 
WACLQKRTEVKLQDLPTVLGACCVLHNICEMREEKMEPELMVEVIDDEVLPENVLRSVNA 
MKARDTISHNLLHHGLAGTSFL*
>AT4G29140.1 |  MATE efflux protein-related 
MCNPSTTTTTTGSENQESRTGLFLDLFSINSFEPTKRNLRHCENRGSPLMAEAVTEAKSL 
FTLAFPIAVTALVLYLRSAVSMFFLGQLGDLELAAGSLAIAFANITGYSVLSGLALGMEP 
LCSQAFGAHRFKLLSLTLHRTVVFLLVCCVPISVLWFNVGKISVYLHQDPDIAKLAQTYL 
IFSLPDLLTNTLLHPIRIYLRAQGIIHPVTLASLSGAVFHLPANLFLVSYLRLGLTGVAV 
ASSITNIFVVAFLVCYVWASGLHAPTWTDPTRDCFRGWAPLLRLAGPSCVSVCLEWWWYE 
IMIVLCGLLVNPRSTVAAMGVLIQTTSFLYVFPSSLSFAVSTRVGNELGANRPKTAKLTA 
TVAIVFAAVTGIIAAAFAYSVRNAWGRIFTGDKEILQLTAAALPILGLCEIGNCPQTVGC 
GVVRGTARPSTAANVNLGAFYLVGMPVAVGLGFWAGIGFNGLWVGLLAAQISCAGLMMYV 
VGTTDWESEAKKAQTLTCAETVENDIIKAVVASTIDGECDEAEPLIRITVLY*
>AT3G62870.1 |  60S ribosomal protein L7A (RPL7aB) 
MAPKKGVKVASKKKPEKVTNPLFERRPKQFGIGGALPPKKDLSRYIKWPKSIRLQRQKRI 
LKQRLKVPPALNQFTKTLDKNLATSLFKILLKYRPEDKAAKKERLLNKAQAEAEGKPAES 
KKPIVVKYGLNHVTYLIEQNKAQLVVIAHDVDPIELVVWLPALCRKMEVPYCIVKGKSRL 
GAVVHQKTAAALCLTTVKNEDKLEFSKILEAIKANFNDKYEEYRKKWGGGIMGSKSQAKT 
KAKERVIAKEAAQRMN*
>AT1G18540.1 |  60S ribosomal protein L6 (RPL6A) 
MPAAKRTPKVNRNPDLIRGVGKYSRSQMYHKRGLWAIKAKNGGVFPRHDAQPKVDAPVEK 
PAKFYPAEDVKKPLVNRRKPKPTKLKASITPGTVLIILAGRFKGKRVVFLKQLSSGLLLV 
TGPFKINGVPLRRVNQAYVIGTSTKIDISGVNTEKFDDKYFGKVAEKKKKKTEGEFFEAE 
KEEKKEIPQEKKEDQKTVDAALIKSIEAVPELKVYLGARFSLSQGMKPHELVF*
>AT2G39290.1 |  PGP1 (PHOSPHATIDYLGLYCEROLPHOSPHATE SYNTHASE 1) CDP-alcohol phosphatidyltransferase/ CDP-diacylglycerol-glycerol-3-phosphate 3-phosphatidyltransferase 
MLRSGLASLIVDVNLRRTLRPSPTFSFPAHLSRCIITSRYSSRTSLRFPIQISRHQHRLS 
YFSSSSSSEQSRPTSSSRNSFSGHGQLDSDDNSSPPPSQSSSKVLTLPTVLTLGRVAAVP 
LLVATFYVDSWWGTTATTSIFIAAAITDWLDGYLARKMRLGSAFGAFLDPVADKLMVAAT 
LILLCTKPIQVAELGPLPWLLTVPSIAIIGREITMSAVREWAASQNGKLLEAVAVNNLGK 
WKTATQMTALTILLASRDSNVGWLVASGAGLLYVSAGLSVWSLAVYMRKIWKVLMK*
>AT5G45970.1 |  ARAC2 (ARABIDOPSIS RAC-LIKE 2) GTP binding 
MSTARFIKCVTVGDGAVGKTCMLISYTSNTFPTDYVPTVFDNFSANVVVDGSTVNLGLWD 
TAGQEDYNRLRPLSYRGADVFLLAFSLISKASYENIHKKWLPELKHYAPGIPIVLVGTKL 
DLRDDKQFLKDHPGAASITTAQGEELRKMIGAVRYLECSSKTQQNVKAVFDTAIRVALRP 
PKAKKKIKPLKTKRSRICFFL*
>AT2G01680.1 |  ankyrin repeat family protein 
MEMKQMKFLTHQAFFSSVRSGDLSQLQQLVDNLTGDELIDESSPCSAVAELMSVQNDAGE 
TAVYISAAENLEDIFRYLIRFSSLETVKIRSKSDMNAFHVAAKRGHLGIVKELLRLWPEL 
CRICDASNTSPLYAAAVQDHLEIVNAMLDVDPSCAMIVRKNGKTSLHTAGRYGLLRIVKA 
LIEKDAAIVGVKDKKGQTALHMAVKGRSLEVVEEILQADYTILNERDRKGNTALHIATRK 
ARPQITSLLLTFTAIEVNAINNQKETAMDLADKLQYSESALEINEALVEAGAKHGRFIGR 
EDEARALKRAVSDIKHEVQSQLLQNEKTNRRVSGIAKELRKLHREAVQNTTNSITVVAVL 
FASIAFLAIFNLPGQYFTEGSHVGQANIAGRTGFRVFCLLNATSLFISLAVVVVQITLVA 
WDTRAQKKVVSVVNKLMWAACACTFGAFLAIAFAVVGKGNSWMAITITLLGAPILVGTLA 
SMCYFVFRQRFRSGNDSQRRIRRGSSKSFSWSYSHHVSDFEDESDFEKIIAL*
>AT1G59740.1 |  proton-dependent oligopeptide transport (POT) family protein 
MAEINKQSNKWEQEEVSNENNWELAEEESVDWRGRPSNPNKHGGMRAALFVLGLQAFEIM 
GIAAVGNNLITYVINEMHFPLSKAANIVTNFVGTIFIFALLGGYLSDAFLGSFWTIIIFG 
FVELSGFILLSVQAHLPQLKPPKCNPLIDQTCEEAKGFKAMIFFMALYLVALGSGCVKPN 
MIAHGADQFSQSHPKQSKRLSSYFNAAYFAFSMGELIALTLLVWVQTHSGMDIGFGVSAA 
AMTMGIISLVSGTMYFRNKRPRRSIFTPIAHVIVAAILKRKLASPSDPRMLHGDHHVAND 
VVPSSTLPHTPRFRFLDKACIKIQDTNTKESPWRLCTVTQVEQVKTLISLVPIFASTIVF 
NTILAQLQTFSVQQGSSMNTRLSNSFHIPPASLQAIPYIMLIFLVPLYDSFLVPFARKLT 
GHNSGIPPLTRIGIGLFLSTFSMVSAAMLEKKRRDSSVLDGRILSIFWITPQFLIFGISE 
MFTAVGLIEFFYKQSAKGMESFLMALTYCSYSFGFYFSSVLVSVVNKITSTSVDSKGWLG 
ENDLNKDRLDLFYWLLAVLSLLNFLSYLFWSRWNIKSSRRNNTNVVGDENI*
>AT5G12350.1 |  Ran GTPase binding / chromatin binding / zinc ion binding 
MSRNGRMASDLSRAGPVERDIEQAIIALKKGAYLLKYGRRGKPKFCPFRLSNDETVLIWF 
SGNEEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYSERSLDVICKDKDEAEVW 
FTGLKALISHCHQRNRRTESRSDGTPSEANSPRTYTRRSSPLHSPFSSNDSLQKDGSNHL 
RIHSPFESPPKNGLDKAFSDMALYAVPPKGFYPSDSATISVHSGGSDSMHGHMRGMGMDA 
FRVSMSSAVSSSSHGSGHDDGDALGDVFIWGEGIGEGVLGGGNRRVGSSFDIKMDSLLPK 
ALESTIVLDVQNIACGGQHAVLVTKQGESFSWGEESEGRLGHGVDSNIQQPKLIDALNTT 
NIELVACGEFHSCAVTLSGDLYTWGKGDFGVLGHGNEVSHWVPKRVNFLLEGIHVSSIAC 
GPYHTAVVTSAGQLFTFGDGTFGVLGHGDKKSVFIPREVDSLKGLRTVRAACGVWHTAAV 
VEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGNKEPKLVPTCVAALVEPNFCQVACGHSL 
TVALTTSGHVYTMGSPVYGQLGNSHADGKTPNRVEGKLHKSFVEEIACGAYHVAVLTSRT 
EVYTWGKGSNGRLGHGDVDDRNSPTLVESLKDKQVKSIACGTNFTAAVCIHRWASGMDQS 
MCSGCRQPFSFKRKRHNCYNCGLVFCHSCTSKKSLKACMAPNPNKPYRVCDKCFNKLKKT 
METDPSSHSSLSRRGSINQGSDPIDKDDKFDSRSDGQLARFSLMESMRQVDSRHKKNKKY 
EFNSSRVSPIPSGSSQRGALNIAKSFNPVFGASKKFFSASVPGSRIVSRATSPISRRPSP 
PRSTTPTPTLSGLATPKFVVDDTKRTNDNLSQEVVKLRSQVESLTRKAQLQEVELERTTK 
QLKEALAITNEETTRCKAAKEVIKSLTAQLKDMAERLPVGSARTVKSPPSLNSFGSSPGR 
IDPFNILNQANSQESEPNGITTPMFSNGTMTPAFGNGEATNEARNEKEWVEQDEPGVYIT 
LTALAGGARDLKRVRFSRKRFSEIQAEQWWADNRGRVYEQYNVRMVDKASEDLPR*
>AT1G72560.1 |  PSD (PAUSED) nucleobase nucleoside nucleotide and nucleic acid transmembrane transporter/ tRNA binding 
MDDLEQAIVISFETGAVDSALKSQAVTYCQQIKETPSICSICIEKLWFSKLVQVQFWCLQ 
TLQDVLRVKYGSMSLDEQSYVRKSVFSMACLEVIDNENAGRVVEGPPFVKNKLAQVLATL 
IYYEYPLIWSSVFLDFMLHLCKGAVVIDMFCRVLNALDDELISLDYPRTPEEISVAARVK 
DAMRQQCVPQIARAWYDIVSMYKNSDPDLSATVLDCMRRFVSWIDIGLVANDAFVPLLFE 
LILSDGLSEQVRGAAAGCVLAMVSKRMDPQSKLPLLQTLQISRVFGLVSGDVDSDLVSKV 
SALLTGYAVEVLECHKRLNSEDTKAVSMDLLNEVLPSVFYVMQKCEVDSTFSIVQFLLGY 
VSTLKGLPALKEKQLLHITQILEVIRIQICYDPMYRNNLNSLDKTGLEEEDRMSEFRKDL 
FVLLRTVGRVAPEVTQHFIRNSLANAVESSSESNVEEVEAALSLLYSFGESMTEEAMKTG 
SGCLSELIPMLLTTQFPGHSHRLVALVYLENITRYMKFIQENSQYIPNVLGAFLDDRGLH 
HQNFYVSRRAGYLFMRVVKLLKSKLVPFIDKILQNLQDTLSQLTTMNFASRELTGTEDGS 
HIFEAIGIIIGLEDVPAEKQSDYLSLLLTPLCQQIEAGLVQAKVASSEDFPVKIANIQFA 
IVAINALSKGFNERLVTASRPGIGLMFKQTLDVLLRVLIEFPKVEPLRSKVTSFIHRMVD 
TLGSAVFPYLPKALEQLLADSEPKEMVGFMVLLNQLICKFNSALHDILEEVYPVVAVRIF 
NVIPRDGLPSRPGAVTEEMRELIELQRMLYTFLHVIATHDLSSVFLTPKSRAYLDPMMQL 
VLNTSCNHKDITVRKACVQIFIKLIKDWCAEPYSEEKVPGFQNFVIEAFATNCCLYSVLD 
KSFNFSDANTHALFGEIITAQKVMYEKFGNTFLMHLMSKSFPSAHIPQDLAEQYCQKLQG 
NDIRSLKSYYQSLIENLRLQQNGSHVFR*
>AT1G72560.2 |  PSD (PAUSED) nucleobase nucleoside nucleotide and nucleic acid transmembrane transporter/ tRNA binding 
MDDLEQAIVISFETGAVDSALKSQAVTYCQQIKETPSICSICIEKLWFSKLVQVQFWCLQ 
TLQDVLRVKYGSMSLDEQSYVRKSVFSMACLEVIDNENAGRVVEGPPFVKNKLAQVLATL 
IYYEYPLIWSSVFLDFMLHLCKGAVVIDMFCRVLNALDDELISLDYPRTPEEISVAARVK 
DAMRQQCVPQIARAWYDIVSMYKNSDPDLSATVLDCMRRFVSWIDIGLVANDAFVPLLFE 
LILSDGLSEQVRGAAAGCVLAMVSKRMDPQSKLPLLQTLQISRVFGLVSGDVDSDLVSKV 
SALLTGYAVEVLECHKRLNSEDTKAVSMDLLNEVLPSVFYVMQKCEVDSTFSIVQFLLGY 
VSTLKGLPALKEKQLLHITQILEVIRIQICYDPMYRNNLNSLDKTGLEEEDRMSEFRKDL 
FVLLRTVGRVAPEVTQHFIRNSLANAVESSSESNVEEVEAALSLLYSFGESMTEEAMKTG 
SGCLSELIPMLLTTQFPGHSHRLVALVYLENITRYMKFIQENSQYIPNVLGAFLDDRGLH 
HQNFYVSRRAGYLFMRVVKLLKSKLVPFIDKILQNLQDTLSQLTTMNFASRELTGTEDGS 
HIFEAIGIIIGLEDVPAEKQSDYLSLLLTPLCQQIEAGLVQAKVASSEDFPVKIANIQFA 
IVAINALSKGFNERLVTASRPGIGLMFKQTLDVLLRVLIEFPKVEPLRSKVTSFIHRMVD 
TLGSAVFPYLPKALEQLLADSEPKEMVGFMVLLNQLICKFNSALHDILEEVYPVVAVRIF 
NVIPRDGLPSRPGAVTEEMRELIELQRMLYTFLHVIATHDLSSVFLTPKSRAYLDPMMQL 
VLNTSCNHKDITVRKACVQIFIKLIKDWCAEPYSEEKVPGFQNFVIEAFATNCCLYSVLD 
KSFNFSDANTHALFGEIITAQKVMYEKFGNTFLMHLMSKSFPSAHIPQDLAEQYCQKLQG 
NDIRSLKSYYQSLIENLRLQQNGSHVFR*
>AT3G16050.1 |  A37 protein heterodimerization 
MADQAMTDQDQGAVTLYSGTAITDAKKNHPFSVKVGLAQVLRGGAIVEVSSVNQAKLAES 
AGACSVIVSDPVRSRGGVRRMPDPVLIKEVKRAVSVPVMARARVGHFVEAQILESLAVDY 
IDESEIISVADDDHFINKHNFRSPFICGCRDTGEALRRIREGAAMIRIQGDLTATGNIAE 
TVKNVRSLMGEVRVLNNMDDDEVFTFAKKISAPYDLVAQTKQMGRVPVVQFASGGITTPA 
DAALMMQLGCDGVFVGSEVFDGPDPFKKLRSIVQAVQHYNDPHVLAEMSSGLENAMESLN 
VRGDRIQDFGQGSV*
>AT4G30660.1 |  hydrophobic protein putative / low temperature and salt responsive protein putative 
MPSNCEILCEIIIAILLPPLGVCFRKGCCTVEFLICLVLTILGYVPGIIYAIYVIVFQHR 
EEYFDEYRRPIYSA*
>AT4G30660.2 |  hydrophobic protein putative / low temperature and salt responsive protein putative 
MPSNCEILCEIIIAILLPPLGVCFRKGCCTVEFLICLVLTILGYVPGIIYAIYVIVFQHR 
EEYFDEYRRPIYSA*
>AT1G19730.1 |  ATTRX4 oxidoreductase acting on sulfur group of donors disulfide as acceptor 
MAAEEGQVIGCHTNDVWTVQLDKAKESNKLIVIDFTASWCPPCRMIAPIFNDLAKKFMSS 
AIFFKVDVDELQSVAKEFGVEAMPTFVFIKAGEVVDKLVGANKEDLQAKIVKHTGVTTA*
>AT2G46230.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF652 (InterProIPR006984) Nucleotide binding protein PINc (InterProIPR006596) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G265301) Has 493 Blast hits to 493 proteins in 165 species Archae - 21 Bacteria - 0 Metazoa - 192 Fungi - 143 Plants - 53 Viruses - 0 Other Eukaryotes - 84 (source NCBI BLink) 
MGKAKKPQKFAVVKKMISHKALKHYKEEVLNPNKKDLTELPRNVPSVPAGLFFSYNSTLV 
PPYRVLVDTNFINFSIQNKIDLEKGMRDCLYANCTPCITDCVMAELEKLGQKYRVALRIA 
KDPHFERLPCIHKGTYADDCLVDRVTQHKCFIVATCDRDLKRRIRKIPGVPIMYVTNRKY 
SIEKLPEATLGGAPRY*
>AT2G46230.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF652 (InterProIPR006984) Nucleotide binding protein PINc (InterProIPR006596) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G265301) Has 441 Blast hits to 441 proteins in 161 species Archae - 15 Bacteria - 0 Metazoa - 185 Fungi - 119 Plants - 49 Viruses - 0 Other Eukaryotes - 73 (source NCBI BLink) 
MGKAKKPQKFAVVKKMISHKALKHYKEEVLNPNKKDLTELPRNVPSVPAGLFFSYNSTLV 
PPYRVLVDTNFINFSIQNKIDLEKGMRDCLYANCTPCITDCVMAELEKLGQKYRVALRIA 
KDPHFERLPCIHKGTYADDCLVDRVTQHKCFIVATCDRDLKRRIRKIPGVPIMYVTNRKY 
SIEKLPEATLGGAPRY*
>AT2G46230.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF652 (InterProIPR006984) Nucleotide binding protein PINc (InterProIPR006596) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G265301) Has 493 Blast hits to 493 proteins in 165 species Archae - 21 Bacteria - 0 Metazoa - 192 Fungi - 143 Plants - 53 Viruses - 0 Other Eukaryotes - 84 (source NCBI BLink) 
MGKAKKPQKFAVVKKMISHKALKHYKEEVLNPNKKDLTELPRNVPSVPAGLFFSYNSTLV 
PPYRVLVDTNFINFSIQNKIDLEKGMRDCLYANCTPCITDCVMAELEKLGQKYRVALRIA 
KDPHFERLPCIHKGTYADDCLVDRVTQHKCFIVATCDRDLKRRIRKVKIVCCISALVYIH 
G*
>AT2G46230.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF652 (InterProIPR006984) Nucleotide binding protein PINc (InterProIPR006596) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G265301) Has 441 Blast hits to 441 proteins in 161 species Archae - 15 Bacteria - 0 Metazoa - 185 Fungi - 119 Plants - 49 Viruses - 0 Other Eukaryotes - 73 (source NCBI BLink) 
MGKAKKPQKFAVVKKMISHKALKHYKEEVLNPNKKDLTELPRNVPSVPAGLFFSYNSTLV 
PPYRVLVDTNFINFSIQNKIDLEKGMRDCLYANCTPCITDCVMAELEKLGQKYRVALRIA 
KDPHFERLPCIHKGTYADDCLVDRVTQHKCFIVATCDRDLKRRIRKVKIVCCISALVYIH 
G*
>AT1G20696.1 |  HMGB3 (HIGH MOBILITY GROUP B 3) DNA binding / chromatin binding / structural constituent of chromatin / transcription factor 
MKGAKSKAETRSTKLSVTKKPAKGAKGAAKDPNKPKRPSSAFFVFMEDFRVTYKEEHPKN 
KSVAAVGKAGGEKWKSLSDSEKAPYVAKADKRKVEYEKNMKAYNKKLEEGPKEDEESDKS 
VSEVNDEDDAEDGSEEEEDDD*
>AT1G20696.2 |  HMGB3 (HIGH MOBILITY GROUP B 3) DNA binding / chromatin binding / structural constituent of chromatin / transcription factor 
MKGAKSKAETRSTKLSVTKKPAKGAKGAAKDPNKPKRPSSAFFVFMEDFRVTYKEEHPKN 
KSVAAVGKAGGEKWKSLSDSEKAPYVAKADKRKVEYEKNMKAYNKKLVIALRKMRNLTSQ 
FQRLTMRMMLRMAVKRRKTMIKKLKLW*
>AT1G20696.3 |  HMGB3 (HIGH MOBILITY GROUP B 3) DNA binding / chromatin binding / structural constituent of chromatin / transcription factor 
MKGAKSKAETRSTKLSVTKKPAKGAKGAAKDPNKPKRPSSAFFVFMEDFRVTYKEEHPKN 
KSVAAVGKAGGEKWKSLSDSEKAPYVAKADKRKVEYEKNMKAYNKKLEEGPKEDEESDKS 
VSEVNDEDDAEDGSEEVRRR*
>AT1G62850.2 |  translation release factor 
MAAIRTMTNMILREFIHHPLLLHSSSKSCQSLLPCLRLTPLISPIHSNSRLVSVRCAAST 
SGGSGGDRKVSSRLSQVQQMLHEAEERASSAGNEPTPQITLDNVTLNFARSGGPGGQNVN 
KLNTKVDMRFNVKNAYWLSDRIREKILLTEKNRINKDGELVISSTKTRTQKGNIDDALEK 
LQAIIDAASYVPPPPSEEQKKKIVKLAAKADNKRLKSKKVLSDKKSARRSRGSYDD*
>AT1G62850.3 |  translation release factor 
MAAIRTMTNMILREFIHHPLLLHSSSKSCQSLLPCLRLTPLISPIHSNSRLVSVRCAAST 
SGGSGGDRKVSSRLSQVQQMLHEAEERASSAGNEPTPQITLDNVTLNFARSGGPGGQNVN 
KLNTKVDMRFNVKNAYWLSDRIREKILLTEKNRINKDGELVISSTKTRTQKGNIDDALEK 
LQAIIDAASYVPPPPSEEQKKKIVKLAAKADNKRLKSKKVLSDKKSARRSRGSYDD*
>AT1G66240.1 |  ATX1 (ARABIDOPSIS HOMOLOG OF ANTI-OXIDANT 1) metal ion binding 
MLKDLFQAVSYQNTASLSLFQALSVVESKAMSQTVVLRVAMTCEGCVGAVKRVLGKMEGV 
ESFDVDIKEQKVTVKGNVQPDAVLQTVTKTGKKTAFWEAEGETAKA*
>AT1G66240.2 |  ATX1 (ARABIDOPSIS HOMOLOG OF ANTI-OXIDANT 1) metal ion binding 
MTCEGCVGAVKRVLGKMEGVESFDVDIKEQKVTVKGNVQPDAVLQTVTKTGKKTAFWEAE 
GETAKA*
>AT1G80710.1 |  transducin family protein / WD-40 repeat family protein 
MATEYERKRLENIRRNDEMLAALNVRAKASSLLSAAKRSRDDSKSFKKKKPKPASTPTVI 
RMSLRTRGLNPDSAGLPDGFSDFRMGSQITHNQPSPQKQSPRLLAPIPFESAYEGYGSYT 
QLVDTLLGIESKSCRGKLVKGEIGVVKDENESPMVRTRSSSRVSKVSVKKEEPEDDSFSD 
YVNKEFSIPVKPEKIEFDLDLLTLEPQNVARVVPGRIFVVQFLPCENVKMVAAGDKLGNV 
GFWNLDCGNEEDNDGIYLFTPHSAPVSSIVFQQNSLSRVISSSYDGLIRLMDVEKSVFDL 
VYSTDEAIFSLSQRPNDEQSLYFGQDYGVFNVWDLRAGKSVFHWELHERRINSIDFNPQN 
PHVMATSSTDGTACLWDLRSMGAKKPKTLSTVNHSRAVHSAYFSPSGLSLATTSLDNYIG 
VLSGANFENTCMIYHNNTSRWISKFKAVWGWDDSYIYVGNLSKKIDVINPKLKRTVMELH 
NPLQRAIPCRIHCHPYNVGTLAGSTAGGQVYVWTTK*
>AT2G27920.2 |  SCPL51 (SERINE CARBOXYPEPTIDASE-LIKE 51) serine-type carboxypeptidase 
MKQGASGVGIGNFQEVGPLDTFLKPRNSTWLKKADLLFVDSPVGAGYSFVEGNQKDLYVK 
SDEEAAQDLTKLLQQLFNKNQTLNQSPLFIVAESYGGKIAVKLGLSVIDAVQSGKLKLHL 
GGVILGDSWISPEDFVFSWGPLLKHVSRLDDNGLDSSNSLAEKIKTQIKNGEYVGATQTW 
MDLENLISSKSNFVDFYNFLLDTGMDPVSLTTSLKIKKEEKIKKYSRYLNDMRSLSDVED 
VEGDLDKLMNGVIKKKLKIIPNDLIWGNNSDDVFTAMEAAFMKPVIEDVDELLATGVDVT 
IYNGQLDVICSTSGTEAWVHKLRWEGLEEFKKMEREPLFCESDRATRGFTKSYKNLHFYW 
ILGAGHFVPVDEPCVALKMVGEITKSPQL*
>AT2G27920.1 |  SCPL51 (SERINE CARBOXYPEPTIDASE-LIKE 51) serine-type carboxypeptidase 
MKTTVVYLVILCLIVSCTNGETKHVRKINSDGSEAWGYVEVRPKAHMFWWHYKSPYRVEN 
PSKPWPIILWLQGGPGASGVGIGNFQEVGPLDTFLKPRNSTWLKKADLLFVDSPVGAGYS 
FVEGNQKDLYVKSDEEAAQDLTKLLQQLFNKNQTLNQSPLFIVAESYGGKIAVKLGLSVI 
DAVQSGKLKLHLGGVILGDSWISPEDFVFSWGPLLKHVSRLDDNGLDSSNSLAEKIKTQI 
KNGEYVGATQTWMDLENLISSKSNFVDFYNFLLDTGMDPVSLTTSLKIKKEEKIKKYSRY 
LNDMRSLSDVEDVEGDLDKLMNGVIKKKLKIIPNDLIWGNNSDDVFTAMEAAFMKPVIED 
VDELLATGVDVTIYNGQLDVICSTSGTEAWVHKLRWEGLEEFKKMEREPLFCESDRATRG 
FTKSYKNLHFYWILGAGHFVPVDEPCVALKMVGEITKSPQL*
>AT2G27920.3 |  SCPL51 (SERINE CARBOXYPEPTIDASE-LIKE 51) serine-type carboxypeptidase 
MAYHPLAPRWTWSFRGWYREFSGGWTIRHISQAQKLNLVEESRSFVCVGAGYSFVEGNQK 
DLYVKSDEEAAQDLTKLLQQLFNKNQTLNQSPLFIVAESYGGKIAVKLGLSVIDAVQSGK 
LKLHLGGVILGDSWISPEDFVFSWGPLLKHVSRLDDNGLDSSNSLAEKIKTQIKNGEYVG 
ATQTWMDLENLISSKSNFVDFYNFLLDTGMDPVSLTTSLKIKKEEKIKKYSRYLNDMRSL 
SDVEDVEGDLDKLMNGVIKKKLKIIPNDLIWGNNSDDVFTAMEAAFMKPVIEDVDELLAT 
GVDVTIYNGQLDVICSTSGTEAWVHKLRWEGLEEFKKMEREPLFCESDRATRGFTKSYKN 
LHFYWILGAGHFVPVDEPCVALKMVGEITKSPQL*
>AT3G54150.1 |  embryo-abundant protein-related 
MAALSEKEAEAYLDARPRYPIDWFKKIAARTQDHKFAWDVGTGNGQAAIGLVEHYENVVA 
TDINEAQLQRAIKHSRISYHHTPTTISEDEMVDLLGGENSVDLIVAAQAVHFFDLNVFYN 
VAKRVLRKEGGLIAVWVYNDIIISHEIDPIMKRLVDSTLPFRTPIMNLAFDGYKTLTFPF 
ETIGMGSEGKPITLDIPHKLSLKGFIGFLRSWQPAMKAKEKGVELINEDLITKFEEAWGD 
ETQVKDVFYKAHMIVGKIPEEKFESDQVLSNKGRLLLETEVGRNQKRPQPSDEGDRQSKK 
QNTSEDDACKDKPSSVFSTHYRF*
>AT4G04700.1 |  CPK27 ATP binding / calcium ion binding / kinase/ protein kinase/ protein serine/threonine kinase/ protein tyrosine kinase 
MGCFSSKELQQSKRTILEKPLVDITKIYILGEELGRGNFGLTRKCVEKSTGKTFACKTIL 
KTKLKDEECEEDVKREIRIMKQLSGEPNIVEFKNAYEDKDSVHIVMEYCGGGELYDKILA 
LYDVGKSYSEKEAAGIIRSIVNVVKNCHYMGVMHRDLKPENFLLTSNDDNATVKVIDFGC 
SVFIEEGKVYQDLAGSDYYIAPEVLQGNYGKEADIWSAGIILYILLCGKSPFVKEPEGQM 
FNEIKSLEIDYSEEPWPLRDSRAIHLVKRMLDRNPKERISAAEVLGHPWMKEGEASDKPI 
DGVVLSRLKRFRDANKFKKVVLKFIAANLSEEEIKGLKTLFTNIDTDKSGNITLEELKTG 
LTRLGSNLSKTEVEQLMEAADMDGNGTIDIDEFISATMHRYKLDRDEHVYKAFQHFDKDN 
DGHITKEELEMAMKEDGAGDEGSIKQIIADADTDNDGKINFEEFRTMMRTESSLQPEGEL 
LPIIN*
>AT4G19880.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal (InterProIPR004046) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1635 Blast hits to 1635 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 158 Plants - 57 Viruses - 0 Other Eukaryotes - 480 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVYAVHFKCNKKLIREYPNLFN 
YTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPFGIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1621 Blast hits to 1621 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 155 Plants - 57 Viruses - 0 Other Eukaryotes - 469 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVYAVHFKCNKKLIREYPNLFN 
YTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPFGIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal (InterProIPR004046) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1635 Blast hits to 1635 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 158 Plants - 57 Viruses - 0 Other Eukaryotes - 480 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSSYFQSKKKKYTICERISNV 
ETLIQVYAVHFKCNKKLIREYPNLFNYTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPF 
GIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1621 Blast hits to 1621 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 155 Plants - 57 Viruses - 0 Other Eukaryotes - 469 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSSYFQSKKKKYTICERISNV 
ETLIQVYAVHFKCNKKLIREYPNLFNYTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPF 
GIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal (InterProIPR004046) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1635 Blast hits to 1635 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 158 Plants - 57 Viruses - 0 Other Eukaryotes - 480 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSLCSPLQMQQETHKGVSEFV 
QLHERHFPDPRYE*
>AT4G19880.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1621 Blast hits to 1621 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 155 Plants - 57 Viruses - 0 Other Eukaryotes - 469 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSLCSPLQMQQETHKGVSEFV 
QLHERHFPDPRYE*
>AT4G19880.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1567 Blast hits to 1567 proteins in 487 species Archae - 12 Bacteria - 899 Metazoa - 23 Fungi - 155 Plants - 55 Viruses - 0 Other Eukaryotes - 423 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVYAVHFKCNKKLIREYPNLFN 
YTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPFGIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1567 Blast hits to 1567 proteins in 487 species Archae - 12 Bacteria - 899 Metazoa - 23 Fungi - 155 Plants - 55 Viruses - 0 Other Eukaryotes - 423 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSSYFQSKKKKYTICERISNV 
ETLIQVYAVHFKCNKKLIREYPNLFNYTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPF 
GIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1567 Blast hits to 1567 proteins in 487 species Archae - 12 Bacteria - 899 Metazoa - 23 Fungi - 155 Plants - 55 Viruses - 0 Other Eukaryotes - 423 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSLCSPLQMQQETHKGVSEFV 
QLHERHFPDPRYE*
>AT4G25950.1 |  VATG3 (vacuolar ATP synthase G3) hydrolase acting on acid anhydrides catalyzing transmembrane movement of substances 
MDSLRGQGGIQMLLTAEQEAGRIVSAARTAKLARMKQAKDEAEKEMEEYRSRLEEEYQTQ 
VSGTDQEADAKRLDDETDVRITNLKESSSKVSKDIVKMLIKYVTTTAA*
>AT4G29580.1 |  cytidine deaminase putative / cytidine aminohydrolase putative 
MAQPPNPYAALTPTEAESSGPFEPETLLPLINRALPLAQALPSQSPLVAVGRGSSGRTFL 
GVNVELPGLSPLHSIHAGQFLVVHLALNNERTLNCLAFSSNGSYFDPPCPHCCQLLQEIR 
NASSTKLLITDPSRQRDMSLSTYLPQKYLSLYNEVPKYFFARLLDENRNNGLTLINPNPI 
RDCLDSEICNHLSCRALKAANRSYAPYSKSPSGVALMDFQGRVYSGWSIESVANPILGAA 
QAALVDFMTNGGGHEFNNIVRGFLVEKRDAKLSHLATAREILNKVAHFSFILRVLHCQ*
>AT4G29580.1 |  cytidine deaminase putative / cytidine aminohydrolase putative 
MAQPPNPYAALTPTEAESSGPFEPETLLPLINRALPLAQALPSQSPLVAVGRGSSGRTFL 
GVNVELPGLSPLHSIHAGQFLVVHLALNNERTLNCLAFSSNGSYFDPPCPHCCQLLQEIR 
NASSTKLLITDPSRQRDMSLSTYLPQKYLSLYNEVPKYFFARLLDENRNNGLTLINPNPI 
RDCLDSEICNHLSCRALKAANRSYAPYSKSPSGVALMDFQGRVYSGWSIESVANPILGAA 
QAALVDFMTNGGGHEFNNIVRGFLVEKRDAKLSHLATAREILNKVAHFSFILRVLHCQ*
>AT4G29580.2 |  cytidine deaminase putative / cytidine aminohydrolase putative 
MAQPPNPYAALTPTEAESSGPFEPETLLPLINRALPLAQALPSQSPLVAVGRGSSGRTFL 
GVNVELPGLSPLHSIHAGQFLVVHLALNNERTLNCLAFSSNGSYFDPPCPHCCQLLQEIR 
NASSTKLLITDPSRQRDMSLSTYLPQKYLSLYNEVPKYFFARLLDENRNNGLTLINPNPI 
RDCLDSEICNHLSCRALKAANRSYAPYSKSPSGVALMDFQGRVYSGWSIESVANPILGAA 
QAALVDFMTNGGGHEFNNIVRGFLVEKRDAKLSHLATAREILNKVAHFSFILRVLHCHKI 
QCFFTHESPSMSEKEDFANILKDINSKRRRNIHHGTIYKTGDEKTLKFILGVNYKDMEAV 
DEPPMRKRKVDELMHAVDPSNDSRFTFQTRVTRATCQKDEDLRILTEVCRIKPKIQGTGE 
QSDGMTKLLMVSTKAIYLSGYRMIF*
>AT4G29580.2 |  cytidine deaminase putative / cytidine aminohydrolase putative 
MAQPPNPYAALTPTEAESSGPFEPETLLPLINRALPLAQALPSQSPLVAVGRGSSGRTFL 
GVNVELPGLSPLHSIHAGQFLVVHLALNNERTLNCLAFSSNGSYFDPPCPHCCQLLQEIR 
NASSTKLLITDPSRQRDMSLSTYLPQKYLSLYNEVPKYFFARLLDENRNNGLTLINPNPI 
RDCLDSEICNHLSCRALKAANRSYAPYSKSPSGVALMDFQGRVYSGWSIESVANPILGAA 
QAALVDFMTNGGGHEFNNIVRGFLVEKRDAKLSHLATAREILNKVAHFSFILRVLHCHKI 
QCFFTHESPSMSEKEDFANILKDINSKRRRNIHHGTIYKTGDEKTLKFILGVNYKDMEAV 
DEPPMRKRKVDELMHAVDPSNDSRFTFQTRVTRATCQKDEDLRILTEVCRIKPKIQGTGE 
QSDGMTKLLMVSTKAIYLSGYRMIF*
>AT4G32350.1 |  unknown protein 
MFDGFLGRGFAPKGKPLIKLTKNRIDVLRRKRNATIKFLKRDLADLIINGHDYNAFSRAG 
GLLDELRYLWSLDFVEQTCDFVYKQLSTMQKTPECPEDCREAISSLMFAASGFSELPELR 
ELRQMFHEKYTDSLALFVNQELVENMSSKPFSMEKKVKLMEDVALEFSIRWDSKDFEKRI 
VRQNSISVMETPKSTNDKYKPVDRNMALPKREEFEGSENGVSLNRKTAEASERRDPLFQS 
DKESYQNGLRGNQRGLTYKERSENVLHASRSESKDNKAERKEFYLHSKQNPAREKHQPIF 
NEGDTIVMKVNYGNLGQGNGHRPGVVDAHKKTEFVASERKEFYSQSKQEPSRERHQPIFN 
DGDTIVMKVKHENHVQGNGHKNGVVDLHKKIEVNASEKLKSSSSKRADKLVIGFKQESFF 
QGYKHEKNEEHAHQKVEDSTSRPPKPNSKSKRAESINPGSRHHNDRESKENAVLVGKSTE 
EDPSGDNVKGGEYEYDHANPARKVEERETERMKSPFYKSLPPPYVKKSIAKARHEKAEAL 
DNPKARFDGEEGNHPDNGKNVYGAERRNGAGHHEVNDIDNASLKRQTNRRKHIVESGGDD 
HISSRRRENSRKGLQVLIDEDEKDSEEKMMDKLLMHYSKKPSSYEKDNVQEESKSRRTHL 
KKGESDEEMMIHQPARSRSLPAEQLAGPSEPAKTFARAASFQPERSSEAKHVHPKLPNYD 
DLAARFAELKGR*
>AT5G13240.1 |  transcription regulator 
MKFLEYTNLDRLNVFLGHLNLGERTIKGCLEAYSCKHAGSDKRLSLSLENEMLDYLGKSS 
DTDSSSPVDLLLSRSSRKALIYLVLTLYQMYPDYDFSAVKAHQFFSEESWDTFKQIFNNY 
MFEASKEWTERNEDGSLLEVIYKALDEVVKLAECEIYVYNPNPNADPFLEEGAIWSFCFL 
FYNRKLKRVAGFRFSCTSNLANDAFLTDSPPYEEDEEIFADMDM*
>AT5G42600.1 |  MRN1 (MARNERAL SYNTHASE) catalytic/ marneral synthase 
MWRLRIGAEARQDPHLFTTNNFAGRQIWEFDANGGSPEELAEVEEARLNFANNKSRFKAS 
PDLFWRRQFLREKKFEQKIPRVRIEDAEKITYEDAKTALRRGVLYYAACQANDGHWPSEV 
SGSMFLDAPFVICLYITGHLEKIFTLEHVKELLRYMYNTQNEDGGWGLDVESHSVMFCTV 
LNYICLRILGVEPDHDGQKSACARARKWILDHGGATYAPMVAKAWLSVLGVYDWSGCKPL 
PPEIWMLPSFSPINGGTLWIYIRDLLMGMSYLYGKKFVATPTALILQLREELYPQPYSKI 
IWSKARNRCAKEDLLYPKSFGQDLFWEGVHMLSENIINRWPLNKFVRQRALRTTMELVHY 
HDETTHYITGACVAKPFHMLACWVEDPDGDYFKKHLARVPDFIWIAEDGLKFQLMGMQSW 
NAALSLQVMLAANMDDEIRSTLIKGYDFLKQSQISENPQGDHLKMFRDITKGGWTFQDRE 
QGLPISDGTAESIECCIHFHRMPSEFIGEKMDVEKLYDAVNFLIYLQSDNGGMPVWEPAP 
GKKWLEWLSPVEHVENTVVEQEYLECTGSVIAGLVCFKKEFPDHRPKEIEKLIKKGLKYI 
EDLQMPDGSWYGNWGVCFTYGTLFAVRGLAAAGKTFGNSEAIRRAVQFILNTQNAEGGWG 
ESALSCPNKKYIPSKGNVTNVVNTGQAMMVLLIGGQMERDPSPVHRAAKVLINSQLDIGD 
FPQQERRGIYMNMLLHYPTYRNMFSLWALALYTNALRLLVS*
>AT5G48010.1 |  THAS1 (THALIANOL SYNTHASE 1) catalytic/ thalianol synthase 
MWRLRTGPKAGEDTHLFTTNNYAGRQIWEFDANAGSPQEIAEVEDARHKFSDNTSRFKTT 
ADLLWRMQFLREKKFEQKIPRVIIEDARKIKYEDAKTALKRGLLYFTALQADDGHWPAEN 
SGPNFYTPPFLICLYITGHLEKIFTPEHVKELLRHIYNMQNEDGGWGLHVESHSVMFCTV 
INYVCLRIVGEEVGHDDQRNGCAKAHKWIMDHGGATYTPLIGKALLSVLGVYDWSGCNPI 
PPEFWLLPSSFPVNGGTLWIYLRDTFMGLSYLYGKKFVAPPTPLILQLREELYPEPYAKI 
NWTQTRNRCGKEDLYYPRSFLQDLFWKSVHMFSESILDRWPLNKLIRQRALQSTMALIHY 
HDESTRYITGGCLPKAFHMLACWIEDPKSDYFKKHLARVREYIWIGEDGLKIQSFGSQLW 
DTALSLHALLDGIDDHDVDDEIKTTLVKGYDYLKKSQITENPRGDHFKMFRHKTKGGWTF 
SDQDQGWPVSDCTAESLECCLFFESMPSELIGKKMDVEKLYDAVDYLLYLQSDNGGIAAW 
QPVEGKAWLELLNIMIFRYVECTGSAIAALTQFNKQFPGYKNVEVKRFITKAAKYIEDMQ 
TVDGSWYGNWGVCFIYGTFFAVRGLVAAGKTYSNCEAIRKAVRFLLDTQNPEGGWGESFL 
SCPSKKYTPLKGNSTNVVQTAQALMVLIMGDQMERDPLPVHRAAQVLINSQLDNGDFPQQ 
EIMGTFMRTVMLHFPTYRNTFSLWALTHYTHALRRLLP*
>AT5G48010.1 |  THAS1 (THALIANOL SYNTHASE 1) catalytic/ thalianol synthase 
MWRLRTGPKAGEDTHLFTTNNYAGRQIWEFDANAGSPQEIAEVEDARHKFSDNTSRFKTT 
ADLLWRMQFLREKKFEQKIPRVIIEDARKIKYEDAKTALKRGLLYFTALQADDGHWPAEN 
SGPNFYTPPFLICLYITGHLEKIFTPEHVKELLRHIYNMQNEDGGWGLHVESHSVMFCTV 
INYVCLRIVGEEVGHDDQRNGCAKAHKWIMDHGGATYTPLIGKALLSVLGVYDWSGCNPI 
PPEFWLLPSSFPVNGGTLWIYLRDTFMGLSYLYGKKFVAPPTPLILQLREELYPEPYAKI 
NWTQTRNRCGKEDLYYPRSFLQDLFWKSVHMFSESILDRWPLNKLIRQRALQSTMALIHY 
HDESTRYITGGCLPKAFHMLACWIEDPKSDYFKKHLARVREYIWIGEDGLKIQSFGSQLW 
DTALSLHALLDGIDDHDVDDEIKTTLVKGYDYLKKSQITENPRGDHFKMFRHKTKGGWTF 
SDQDQGWPVSDCTAESLECCLFFESMPSELIGKKMDVEKLYDAVDYLLYLQSDNGGIAAW 
QPVEGKAWLELLNIMIFRYVECTGSAIAALTQFNKQFPGYKNVEVKRFITKAAKYIEDMQ 
TVDGSWYGNWGVCFIYGTFFAVRGLVAAGKTYSNCEAIRKAVRFLLDTQNPEGGWGESFL 
SCPSKKYTPLKGNSTNVVQTAQALMVLIMGDQMERDPLPVHRAAQVLINSQLDNGDFPQQ 
EIMGTFMRTVMLHFPTYRNTFSLWALTHYTHALRRLLP*
>AT5G48010.2 |  THAS1 (THALIANOL SYNTHASE 1) catalytic/ thalianol synthase 
MWRLRTGPKAGEDTHLFTTNNYAGRQIWEFDANAGSPQEIAEVEDARHKFSDNTSRFKTT 
ADLLWRMQFLREKKFEQKIPRVIIEDARKIKYEDAKTALKRGLLYFTALQADDGHWPAEN 
SGPNFYTPPFLICLYITGHLEKIFTPEHVKELLRHIYNMQNEDGGWGLHVESHSVMFCTV 
INYVCLRIVGEEVGHDDQRNGCAKAHKWIMDHGGATYTPLIGKALLSVLGVYDWSGCNPI 
PPEFWLLPSSFPVNGGTLWIYLRDTFMGLSYLYGKKFVAPPTPLILQLREELYPEPYAKI 
NWTQTRNRCGKEDLYYPRSFLQDLFWKSVHMFSESILDRWPLNKLIRQRALQSTMALIHY 
HDESTRYITGGCLPKAFHMLACWIEDPKSDYFKKHLARVREYIWIGEDGLKIQSFGSQLW 
DTALSLHALLDGIDDHDVDDEIKTTLVKGYDYLKKSQITENPRGDHFKMFRHKTKGGWTF 
SDQDQGWPVSDCTAESLECCLFFESMPSELIGKKMDVEKLYDAVDYLLYLQSDNGGIAAW 
QPVEGKAWLEWLSPVEFLEDTIVEYEYVECTGSAIAALTQFNKQFPGYKNVEVKRFITKA 
AKYIEDMQTVDGSWYGNWGVCFIYGTFFAVRGLVAAGKTYSNCEAIRKAVRFLLDTQNPE 
GGWGESFLSCPSKKYTPLKGNSTNVVQTAQALMVLIMGDQMERDPLPVHRAAQVLINSQL 
DNGDFPQQEIMGTFMRTVMLHFPTYRNTFSLWALTHYTHALRRLLP*
>AT5G48010.2 |  THAS1 (THALIANOL SYNTHASE 1) catalytic/ thalianol synthase 
MWRLRTGPKAGEDTHLFTTNNYAGRQIWEFDANAGSPQEIAEVEDARHKFSDNTSRFKTT 
ADLLWRMQFLREKKFEQKIPRVIIEDARKIKYEDAKTALKRGLLYFTALQADDGHWPAEN 
SGPNFYTPPFLICLYITGHLEKIFTPEHVKELLRHIYNMQNEDGGWGLHVESHSVMFCTV 
INYVCLRIVGEEVGHDDQRNGCAKAHKWIMDHGGATYTPLIGKALLSVLGVYDWSGCNPI 
PPEFWLLPSSFPVNGGTLWIYLRDTFMGLSYLYGKKFVAPPTPLILQLREELYPEPYAKI 
NWTQTRNRCGKEDLYYPRSFLQDLFWKSVHMFSESILDRWPLNKLIRQRALQSTMALIHY 
HDESTRYITGGCLPKAFHMLACWIEDPKSDYFKKHLARVREYIWIGEDGLKIQSFGSQLW 
DTALSLHALLDGIDDHDVDDEIKTTLVKGYDYLKKSQITENPRGDHFKMFRHKTKGGWTF 
SDQDQGWPVSDCTAESLECCLFFESMPSELIGKKMDVEKLYDAVDYLLYLQSDNGGIAAW 
QPVEGKAWLEWLSPVEFLEDTIVEYEYVECTGSAIAALTQFNKQFPGYKNVEVKRFITKA 
AKYIEDMQTVDGSWYGNWGVCFIYGTFFAVRGLVAAGKTYSNCEAIRKAVRFLLDTQNPE 
GGWGESFLSCPSKKYTPLKGNSTNVVQTAQALMVLIMGDQMERDPLPVHRAAQVLINSQL 
DNGDFPQQEIMGTFMRTVMLHFPTYRNTFSLWALTHYTHALRRLLP*
>AT2G04100.1 |  MATE efflux family protein 
MEDPLLLGDNQIITGSLKPTPTWRMNFTAELKNLSRMALPMATVTVAQYLLPVISVMVAG 
HRSELQLSGVALATSFTNVSGFSVMFGLAGALETLCGQAYGAKQYAKIGTYTFSAIVSNV 
PIVVLISILWFYMDKLFVSLGQDPDISKVAGSYAVCLIPALLAQAVQQPLTRFLQTQGLV 
LPLLYCAITTLLFHIPVCLILVYAFGLGSNGAALAIGLSYWFNVLILALYVRFSSSCEKT 
RGFVSDDFVLSVKQFFQYGIPSAAMTTIEWSLFEFLILSSGLLPNPKLETSVLSICLTTS 
SLHYVIPMGIGAAGSIRVSNELGAGNPEVARLAVFAGIFLWFLEATICSTLLFICRDIFG 
YAFSNSKEVVDYVTELSPLLCISFLVDGFSAVLGGVARGSGWQHIGAWANVVAYYLLGAP 
VGLFLGFWCHMNGKGLWIGVVVGSTAQGIILAIVTACMSWNEQAAKARQRIVVRTSSFGN 
GLA*