>AT3G26590.1 |  MATE efflux family protein 
MAKDKDITETLLTAAEERSDLPFLSVDDIPPITTVGGFVREFNVETKKLWYLAGPAIFTS 
VNQYSLGAITQVFAGHISTIALAAVSVENSVVAGFSFGIMLGMGSALETLCGQAFGAGKL 
SMLGVYLQRSWVILNVTALILSLLYIFAAPILASIGQTAAISSAAGIFSIYMIPQIFAYA 
INFPTAKFLQSQSKIMVMAVISAVALVIHVPLTWFVIVKLQWGMPGLAVVLNASWCFIDM 
AQLVYIFSGTCGEAWSGFSWEAFHNLWSFVRLSLASAVMLCLEVWYFMAIILFAGYLKNA 
EISVAALSICMNILGWTAMIAIGMNTAVSVRVSNELGANHPRTAKFSLLVAVITSTLIGF 
IVSMILLIFRDQYPSLFVKDEKVIILVKELTPILALSIVINNVQPVLSGVAVGAGWQAVV 
AYVNIACYYVFGIPFGLLLGYKLNYGVMGIWCGMLTGTVVQTIVLTWMICKTNWDTEASM 
AEDRIREWGGEVSEIKQLIN*
>AT1G23300.1 |  MATE efflux family protein 
METLNVDHEDTISSEQEHRAHTKSDTDMPPISGGRDFIRQFAAESKKLWWLAGPAIFTSF 
CQYSLGAVTQILAGHVNTLALAAVSIQNSVISGFSVGIMLGMGSALATLCGQAYGAGQLE 
MMGIYLQRSWIILNSCALLLCLFYVFATPLLSLLGQSPEISKAAGKFSLWMIPQLFAYAV 
NFATAKFLQAQSKVIAMAVIAATVLLQHTLLSWLLMLKLRWGMAGGAVVLNMSWWLIDVT 
QIVYICGGSSGRAWSGLSWMAFKNLRGFARLSLASAVMVCLEVWYFMALILFAGYLKNPQ 
VSVAALSICMNILGWPIMVAFGFNAAVSVRESNELGAEHPRRAKFLLIVAMITSVSIGIV 
ISVTLIVLRDKYPAMFSDDEEVRVLVKQLTPLLALTIVINNIQPVLSGVAVGAGWQGIVA 
YVNIGCYYLCGIPIGLVLGYKMELGVKGIWTGMLTGTVVQTSVLLFIIYRTNWKKEASLA 
EARIKKWGDQSNKREEIDLCEEDENNSNGENNHRK*
>AT3G62870.1 |  60S ribosomal protein L7A (RPL7aB) 
MAPKKGVKVASKKKPEKVTNPLFERRPKQFGIGGALPPKKDLSRYIKWPKSIRLQRQKRI 
LKQRLKVPPALNQFTKTLDKNLATSLFKILLKYRPEDKAAKKERLLNKAQAEAEGKPAES 
KKPIVVKYGLNHVTYLIEQNKAQLVVIAHDVDPIELVVWLPALCRKMEVPYCIVKGKSRL 
GAVVHQKTAAALCLTTVKNEDKLEFSKILEAIKANFNDKYEEYRKKWGGGIMGSKSQAKT 
KAKERVIAKEAAQRMN*
>AT3G53420.1 |  PIP2A (PLASMA MEMBRANE INTRINSIC PROTEIN 2A) water channel 
MAKDVEAVPGEGFQTRDYQDPPPAPFIDGAELKKWSFYRAVIAEFVATLLFLYITVLTVI 
GYKIQSDTDAGGVDCGGVGILGIAWAFGGMIFILVYCTAGISGGHINPAVTFGLFLARKV 
SLPRALLYIIAQCLGAICGVGFVKAFQSSYYTRYGGGANSLADGYSTGTGLAAEIIGTFV 
LVYTVFSATDPKRSARDSHVPVLAPLPIGFAVFMVHLATIPITGTGINPARSFGAAVIYN 
KSKPWDDHWIFWVGPFIGAAIAAFYHQFVLRASGSKSLGSFRSAANV*
>AT3G53420.1 |  PIP2A (PLASMA MEMBRANE INTRINSIC PROTEIN 2A) water channel 
MAKDVEAVPGEGFQTRDYQDPPPAPFIDGAELKKWSFYRAVIAEFVATLLFLYITVLTVI 
GYKIQSDTDAGGVDCGGVGILGIAWAFGGMIFILVYCTAGISGGHINPAVTFGLFLARKV 
SLPRALLYIIAQCLGAICGVGFVKAFQSSYYTRYGGGANSLADGYSTGTGLAAEIIGTFV 
LVYTVFSATDPKRSARDSHVPVLAPLPIGFAVFMVHLATIPITGTGINPARSFGAAVIYN 
KSKPWDDHWIFWVGPFIGAAIAAFYHQFVLRASGSKSLGSFRSAANV*
>AT3G53420.2 |  PIP2A (PLASMA MEMBRANE INTRINSIC PROTEIN 2A) water channel 
MAKDVEAVPGEGFQTRDYQDPPPAPFIDGAELKKWSFYRAVIAEFVATLLFLYITVLTVI 
GYKIQSDTDAGGVDCGGVGILGIAWAFGGMIFILVYCTAGISGGHINPAVTFGLFLARKV 
SLPRALLYIIAQCLGAICGVGFVKAFQSSYYTRYGGGANSLADGYSTGTGLAAEIIGTFV 
LVYTVFSATDPKRSARDSHVPVLAPLPIGFAVFMVHLATIPITGTGINPARSFGAAVIYN 
KSKPWDDHWIFWVGPFIGAAIAAFYHQFVLRASGSKSLGSFRSAANV*
>AT3G53420.2 |  PIP2A (PLASMA MEMBRANE INTRINSIC PROTEIN 2A) water channel 
MAKDVEAVPGEGFQTRDYQDPPPAPFIDGAELKKWSFYRAVIAEFVATLLFLYITVLTVI 
GYKIQSDTDAGGVDCGGVGILGIAWAFGGMIFILVYCTAGISGGHINPAVTFGLFLARKV 
SLPRALLYIIAQCLGAICGVGFVKAFQSSYYTRYGGGANSLADGYSTGTGLAAEIIGTFV 
LVYTVFSATDPKRSARDSHVPVLAPLPIGFAVFMVHLATIPITGTGINPARSFGAAVIYN 
KSKPWDDHWIFWVGPFIGAAIAAFYHQFVLRASGSKSLGSFRSAANV*
>AT3G09820.1 |  ADK1 (adenosine kinase 1) adenosine kinase/ copper ion binding 
MASSDFDGILLGMGNPLLDVSAVVDQQFLDKYDIKLNNAILAEDKHLPMYDEMSQKFNVE 
YIAGGATQNSIKVAQWMLQVPGATSYMGSIGKDKYGEAMKKDATAAGVYVHYYEDEATPT 
GTCGVCVLGGERSLIANLSAANCYKVEHLKKPENWALVEKAKFYYIAGFFLTVSPESIQL 
VREHAAANNKVFTMNLSAPFICEFFKDVQEKCLPYMDYIFGNETEARTFSRVHGWETDDV 
EQIAIKMSQLPKASGTYKRTTVITQGADPVVVAEDGKVKKYPVIPLPKEKLVDTNGAGDA 
FVGGFLSQLVHGKGIEECVRAGCYASNVVIQRSGCTYPEKPDFN*
>AT3G09820.1 |  ADK1 (adenosine kinase 1) adenosine kinase/ copper ion binding 
MASSDFDGILLGMGNPLLDVSAVVDQQFLDKYDIKLNNAILAEDKHLPMYDEMSQKFNVE 
YIAGGATQNSIKVAQWMLQVPGATSYMGSIGKDKYGEAMKKDATAAGVYVHYYEDEATPT 
GTCGVCVLGGERSLIANLSAANCYKVEHLKKPENWALVEKAKFYYIAGFFLTVSPESIQL 
VREHAAANNKVFTMNLSAPFICEFFKDVQEKCLPYMDYIFGNETEARTFSRVHGWETDDV 
EQIAIKMSQLPKASGTYKRTTVITQGADPVVVAEDGKVKKYPVIPLPKEKLVDTNGAGDA 
FVGGFLSQLVHGKGIEECVRAGCYASNVVIQRSGCTYPEKPDFN*
>AT3G09820.2 |  ADK1 (adenosine kinase 1) adenosine kinase/ copper ion binding 
MIIGMFRYDEMSQKFNVEYIAGGATQNSIKVAQWMLQVPGATSYMGSIGKDKYGEAMKKD 
ATAAGVYVHYYEDEATPTGTCGVCVLGGERSLIANLSAANCYKVEHLKKPENWALVEKAK 
FYYIAGFFLTVSPESIQLVREHAAANNKVFTMNLSAPFICEFFKDVQEKCLPYMDYIFGN 
ETEARTFSRVHGWETDDVEQIAIKMSQLPKASGTYKRTTVITQGADPVVVAEDGKVKKYP 
VIPLPKEKLVDTNGAGDAFVGGFLSQLVHGKGIEECVRAGCYASNVVIQRSGCTYPEKPD 
FN*
>AT3G09820.2 |  ADK1 (adenosine kinase 1) adenosine kinase/ copper ion binding 
MIIGMFRYDEMSQKFNVEYIAGGATQNSIKVAQWMLQVPGATSYMGSIGKDKYGEAMKKD 
ATAAGVYVHYYEDEATPTGTCGVCVLGGERSLIANLSAANCYKVEHLKKPENWALVEKAK 
FYYIAGFFLTVSPESIQLVREHAAANNKVFTMNLSAPFICEFFKDVQEKCLPYMDYIFGN 
ETEARTFSRVHGWETDDVEQIAIKMSQLPKASGTYKRTTVITQGADPVVVAEDGKVKKYP 
VIPLPKEKLVDTNGAGDAFVGGFLSQLVHGKGIEECVRAGCYASNVVIQRSGCTYPEKPD 
FN*
>AT5G06290.1 |  2-Cys Prx B (2-Cysteine peroxiredoxin B) antioxidant/ peroxiredoxin 
MSMASIASSSSTTLLSSSRVLLPSKSSLLSPTVSFPRIIPSSSASSSSLCSGFSSLGSLT 
TNRSASRRNFAVKAQADDLPLVGNKAPDFEAEAVFDQEFIKVKLSEYIGKKYVILFFYPL 
DFTFVCPTEITAFSDRYEEFEKLNTEVLGVSVDSVFSHLAWVQTDRKSGGLGDLNYPLVS 
DITKSISKSFGVLIPDQGIALRGLFIIDKEGVIQHSTINNLGIGRSVDETMRTLQALQYV 
QENPDEVCPAGWKPGEKSMKPDPKLSKEYFSAI*
>AT3G52990.1 |  pyruvate kinase putative 
MHSSHLLLEEPIRMASILEPSKSSFFPALTKIVGTLGPKSRSVEALSGCLKAGMSVARFD 
FSWGDADYHQETLDNLKVAVRSTKKLCAVMLDTVGPELQVINKSEKAITLKADGLVTLTP 
NQDQEASSEVLPINFNGLAKAVKKGDTIFVGQYLFTGSETTSVWLEVDEVKGDDVICLSR 
NAATLAGSLFTLHSSQVHIDLPTLTEKDKEVISTWGVQNKIDFLSLSYCRHAEDVRQTRE 
MLKKLGDLSQTQIFAKIENVEGLTHFDEILQEADGIILSRGNLGIDLPPEKVFLFQKAAL 
YKCNMAGKPAVLTRVVDSMTDNLRPTRAEATDVANAVLDGSDAILLGAETLRGLYPVETI 
STVGRICAEAEKVFNQDLYFKKTVKYVGEPMTHLESIASSAVRAAIKVKASVIICFTSSG 
RAARLIAKYRPTMPVISVVIPRVKTNQLKWSFSGAFEARQSLIVRGLFPMLADPRHPAES 
TSATNESVLKVALDHGKHAGVIKSHDRVVVCQKVGDASVVKIIELED*
>AT3G52990.1 |  pyruvate kinase putative 
MHSSHLLLEEPIRMASILEPSKSSFFPALTKIVGTLGPKSRSVEALSGCLKAGMSVARFD 
FSWGDADYHQETLDNLKVAVRSTKKLCAVMLDTVGPELQVINKSEKAITLKADGLVTLTP 
NQDQEASSEVLPINFNGLAKAVKKGDTIFVGQYLFTGSETTSVWLEVDEVKGDDVICLSR 
NAATLAGSLFTLHSSQVHIDLPTLTEKDKEVISTWGVQNKIDFLSLSYCRHAEDVRQTRE 
MLKKLGDLSQTQIFAKIENVEGLTHFDEILQEADGIILSRGNLGIDLPPEKVFLFQKAAL 
YKCNMAGKPAVLTRVVDSMTDNLRPTRAEATDVANAVLDGSDAILLGAETLRGLYPVETI 
STVGRICAEAEKVFNQDLYFKKTVKYVGEPMTHLESIASSAVRAAIKVKASVIICFTSSG 
RAARLIAKYRPTMPVISVVIPRVKTNQLKWSFSGAFEARQSLIVRGLFPMLADPRHPAES 
TSATNESVLKVALDHGKHAGVIKSHDRVVVCQKVGDASVVKIIELED*
>AT3G52990.2 |  pyruvate kinase putative 
MSVARFDFSWGDADYHQETLDNLKVAVRSTKKLCAVMLDTVGPELQVINKSEKAITLKAD 
GLVTLTPNQDQEASSEVLPINFNGLAKAVKKGDTIFVGQYLFTGSETTSVWLEVDEVKGD 
DVICLSRNAATLAGSLFTLHSSQVHIDLPTLTEKDKEVISTWGVQNKIDFLSLSYCRHAE 
DVRQTREMLKKLGDLSQTQIFAKIENVEGLTHFDEILQEADGIILSRGNLGIDLPPEKVF 
LFQKAALYKCNMAGKPAVLTRVVDSMTDNLRPTRAEATDVANAVLDGSDAILLGAETLRG 
LYPVETISTVGRICAEAEKVFNQDLYFKKTVKYVGEPMTHLESIASSAVRAAIKVKASVI 
ICFTSSGRAARLIAKYRPTMPVISVVIPRVKTNQLKWSFSGAFEARQSLIVRGLFPMLAD 
PRHPAESTSATNESVLKVALDHGKHAGVIKSHDRVVVCQKVGDASVVKIIELED*
>AT3G52990.2 |  pyruvate kinase putative 
MSVARFDFSWGDADYHQETLDNLKVAVRSTKKLCAVMLDTVGPELQVINKSEKAITLKAD 
GLVTLTPNQDQEASSEVLPINFNGLAKAVKKGDTIFVGQYLFTGSETTSVWLEVDEVKGD 
DVICLSRNAATLAGSLFTLHSSQVHIDLPTLTEKDKEVISTWGVQNKIDFLSLSYCRHAE 
DVRQTREMLKKLGDLSQTQIFAKIENVEGLTHFDEILQEADGIILSRGNLGIDLPPEKVF 
LFQKAALYKCNMAGKPAVLTRVVDSMTDNLRPTRAEATDVANAVLDGSDAILLGAETLRG 
LYPVETISTVGRICAEAEKVFNQDLYFKKTVKYVGEPMTHLESIASSAVRAAIKVKASVI 
ICFTSSGRAARLIAKYRPTMPVISVVIPRVKTNQLKWSFSGAFEARQSLIVRGLFPMLAD 
PRHPAESTSATNESVLKVALDHGKHAGVIKSHDRVVVCQKVGDASVVKIIELED*
>AT1G50310.1 |  STP9 (SUGAR TRANSPORTER 9) carbohydrate transmembrane transporter/ sugarhydrogen symporter 
MAGGAFVSEGGGGGNSYEGGVTVFVIMTCIVAAMGGLLFGYDLGISGGVTSMEEFLSKFF 
PEVDKQMHEARRETAYCKFDNQLLQLFTSSLYLAALASSFVASAVTRKYGRKISMFVGGV 
AFLIGSLFNAFATNVAMLIVGRLLLGVGVGFANQSTPVYLSEMAPAKIRGALNIGFQMAI 
TIGILIANLINYGTSQMAKNGWRVSLGLAAVPAVIMVIGSFVLPDTPNSMLERGKYEQAR 
EMLQKIRGADNVDEEFQDLCDACEAAKKVDNPWKNIFQQAKYRPALVFCSAIPFFQQITG 
INVIMFYAPVLFKTLGFADDASLISAVITGAVNVVSTLVSIYAVDRYGRRILFLEGGIQM 
IVSQIVVGTLIGMKFGTTGSGTLTPATADWILAFICLYVAGFAWSWGPLGWLVPSEICPL 
EIRPAGQAINVSVNMFFTFLIGQFFLTMLCHMKFGLFYFFGGMVAVMTVFIYFLLPETKG 
VPIEEMGRVWKQHPFWKRYMPDDAVIGGGEENYVKEV*
>AT4G13090.1 |  xyloglucanxyloglucosyl transferase putative / xyloglucan endotransglycosylase putative / endo-xyloglucan transferase putative 
MNRIRYCFELVSVLFLMFTANARARGRGAIDFDVNYVVTWGQDHILKLNQGKEVQLSMDY 
SSGSGFESKSHYGSGFFQMRIKLPPRDSAGVVTAFYLTSKGDTHDEVDFEFLGNRQGKPI 
AIQTNVFSNGQGGREQKFVPWFDPTTSFHTYGILWNPYQIVFYVDKVPIRVFKNIKKSGV 
NYPSKPMQLVASLWNGENWATSGGKEKINWAYAPFKAQYQGFSDHGCHVNGQSNNANVCG 
STRYWWNTRTYSQLSANEQKVMENVRAKYMTYDYCSDRPRYPVPPSECRWNQ*
>AT1G07820.1 |  histone H4 
MSGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRRLARRGGVKRISGLIYEETRGVLK 
IFLENVIRDAVTYTEHARRKTVTAMDVVYALKRQGRTLYGFGG*
>AT1G07820.1 |  histone H4 
MSGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRRLARRGGVKRISGLIYEETRGVLK 
IFLENVIRDAVTYTEHARRKTVTAMDVVYALKRQGRTLYGFGG*
>AT1G07820.2 |  histone H4 
MSGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRRLARRGGVKRISGLIYEETRGVLK 
IFLENVIRDAVTYTEHARRKTVTAMDVVYALKRQGRTLYGFGG*
>AT1G07820.2 |  histone H4 
MSGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRRLARRGGVKRISGLIYEETRGVLK 
IFLENVIRDAVTYTEHARRKTVTAMDVVYALKRQGRTLYGFGG*
>AT2G20410.1 |  activating signal cointegrator-related 
MSRGNYKNPCLTMHQPWASLLVHGIKRIEGRSWPSPIRGRLWIHAASKVPDEATIKAMEE 
FYQQIYAVDGITDIQFPQHYPVSRLIGCVEVVGCVTSDELQNWDALPQGVRLEGQTNFCW 
LCEKPQKLIIPFEMRGYQGVYNLENKIYVAAARGLMPSQNSFKVKFPLPDPKDPFSLKPG 
SIPCTMQEKKELDSKQVTSLTAAIKQVTSLTAAIAGAKAAATQFSKKGQSLQTNNIFDYT 
TRSKSKVIEDEAAESLDNPVLGSGGTSDRTYRTRSKNRGTQMGEEVCSESSNSSSKVESS 
QRSAVTKSEDRNTRIGERRFDPGSARIMAAAIRNLKPPS*
>AT4G14240.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF21 (InterProIPR002550) Cystathionine beta-synthase core (InterProIPR000644) BEST Arabidopsis thaliana protein match is CBS domain-containing protein-related (TAIRAT4G142301) Has 6770 Blast hits to 6657 proteins in 1347 species Archae - 62 Bacteria - 4461 Metazoa - 254 Fungi - 179 Plants - 121 Viruses - 0 Other Eukaryotes - 1693 (source NCBI BLink) 
MHLINAVAAARILSGIGQSNGNNGGEAIPFGSFEWITYAGISCFLVLFAGIMSGLTLGLM 
SLGLVELEILQRSGTPNEKKQAAAIFPVVQKQHQLLVTLLLCNAMAMEGLPIYLDKLFNE 
YVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTLCYPIAFPIGKILDLVL 
GHNDALFRRAQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAQEAMTPIESTFS 
LDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLVSAVCIRR 
IPRVPADMPLYDILNEFQKGSSHMAAVVKVKGKSKVPPSTLLEEHTDESNDSDLTAPLLL 
KREGNHDNVIVTIDKANGQSFFQNNESGPHGFSHTSEAIEDGEVIGIITLEDVFEELLQE 
EIVDETDEYVDVHKRIRVAAAAAASSIARAPSSRKLLAQKGTGGQNKQGQTNKVPGQEQD 
KMLGTITEPIRRNN*
>AT4G14240.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF21 (InterProIPR002550) Cystathionine beta-synthase core (InterProIPR000644) BEST Arabidopsis thaliana protein match is CBS domain-containing protein-related (TAIRAT4G142301) Has 6735 Blast hits to 6622 proteins in 1349 species Archae - 62 Bacteria - 4446 Metazoa - 254 Fungi - 179 Plants - 121 Viruses - 0 Other Eukaryotes - 1673 (source NCBI BLink) 
MHLINAVAAARILSGIGQSNGNNGGEAIPFGSFEWITYAGISCFLVLFAGIMSGLTLGLM 
SLGLVELEILQRSGTPNEKKQAAAIFPVVQKQHQLLVTLLLCNAMAMEGLPIYLDKLFNE 
YVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTLCYPIAFPIGKILDLVL 
GHNDALFRRAQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAQEAMTPIESTFS 
LDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLVSAVCIRR 
IPRVPADMPLYDILNEFQKGSSHMAAVVKVKGKSKVPPSTLLEEHTDESNDSDLTAPLLL 
KREGNHDNVIVTIDKANGQSFFQNNESGPHGFSHTSEAIEDGEVIGIITLEDVFEELLQE 
EIVDETDEYVDVHKRIRVAAAAAASSIARAPSSRKLLAQKGTGGQNKQGQTNKVPGQEQD 
KMLGTITEPIRRNN*
>AT4G14240.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF21 (InterProIPR002550) Cystathionine beta-synthase core (InterProIPR000644) BEST Arabidopsis thaliana protein match is CBS domain-containing protein-related (TAIRAT4G142301) Has 6770 Blast hits to 6657 proteins in 1347 species Archae - 62 Bacteria - 4461 Metazoa - 254 Fungi - 179 Plants - 121 Viruses - 0 Other Eukaryotes - 1693 (source NCBI BLink) 
MHLINAVAAARILSGIGQSNGNNGGEAIPFGSFEWITYAGISCFLVLFAGIMSGLTLGLM 
SLGLVELEILQRSAAIFPVVQKQHQLLVTLLLCNAMAMEGLPIYLDKLFNEYVAIILSVT 
FVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTLCYPIAFPIGKILDLVLGHNDALFRR 
AQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAQEAMTPIESTFSLDVNSKLDW 
EAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLVSAVCIRRIPRVPADMP 
LYDILNEFQKGSSHMAAVVKVKGKSKVPPSTLLEEHTDESNDSDLTAPLLLKREGNHDNV 
IVTIDKANGQSFFQNNESGPHGFSHTSEAIEDGEVIGIITLEDVFEELLQEEIVDETDEY 
VDVHKRIRVAAAAAASSIARAPSSRKLLAQKGTGGQNKQGQTNKVPGQEQDKMLGTITEP 
IRRNN*
>AT4G14240.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF21 (InterProIPR002550) Cystathionine beta-synthase core (InterProIPR000644) BEST Arabidopsis thaliana protein match is CBS domain-containing protein-related (TAIRAT4G142301) Has 6735 Blast hits to 6622 proteins in 1349 species Archae - 62 Bacteria - 4446 Metazoa - 254 Fungi - 179 Plants - 121 Viruses - 0 Other Eukaryotes - 1673 (source NCBI BLink) 
MHLINAVAAARILSGIGQSNGNNGGEAIPFGSFEWITYAGISCFLVLFAGIMSGLTLGLM 
SLGLVELEILQRSAAIFPVVQKQHQLLVTLLLCNAMAMEGLPIYLDKLFNEYVAIILSVT 
FVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTLCYPIAFPIGKILDLVLGHNDALFRR 
AQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAQEAMTPIESTFSLDVNSKLDW 
EAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLVSAVCIRRIPRVPADMP 
LYDILNEFQKGSSHMAAVVKVKGKSKVPPSTLLEEHTDESNDSDLTAPLLLKREGNHDNV 
IVTIDKANGQSFFQNNESGPHGFSHTSEAIEDGEVIGIITLEDVFEELLQEEIVDETDEY 
VDVHKRIRVAAAAAASSIARAPSSRKLLAQKGTGGQNKQGQTNKVPGQEQDKMLGTITEP 
IRRNN*
>AT4G34370.1 |  ARI1 (ARIADNE) protein binding / zinc ion binding 
MDDYFSAEEEACYYSSDQDSLDGIDNEESELQPLSSKRSNTQVITQESLLAAQREDLLRV 
MELLSIKEHHARTLLIHYQWDVEKLFAVFVEKGKDSLFSGAGVTVFDYQYGNSSFPQSSQ 
MSCDVCMEDLPGDHMTRMDCGHCFCNNCWTEHFTVQINEGQSKRIRCMAHQCNAICDEDI 
VRSLVSKKRPDLAAKFDRYLLESYIEDNRMVKWCPSTPHCGNAIRAEDDKLCEVECSCGL 
QFCFSCLCQAHSPCSCLMWELWRKKCRDESETINWITVHTKLCPKCYKPVEKNGGCNLVR 
CICGQCFCWLCGGATGSDHTYRSIAGHSCGRYQDDKEKQMERAKRDLNRYTHYHHRYKAH 
TDSSKLEDKLRDTIHEKVSKSEKRELKLKDFSWVTNGLDRLFRSRRVLSYSYAFAYYMFG 
EEMFKDEMTPEEREIKKNLFEDQQQQLESNVEKLSQFLEEPFDEFSNDKVMAIRIQIINL 
SVAVDTLCKKMYECIENDLLGSLQLGIHNISPYRSKGIEQAAQFYASWNSKDADKFQPLD 
SGTSGVTSRPEQASGSRSSEDTICSSSQKRPKKEGSFLNNKVTLLDLNLPADFVDQN*
>AT5G38030.1 |  MATE efflux family protein 
MEEDKILTETLLSAAEEPPALPFSSVEDIPPITTVGGFVKEFNVEVKKLWYLAGPAIFMS 
ITQYSLGAATQVFAGHISTIALAAVSVENSVIAGFSFGVMLGMGSALETLCGQAFGAGKL 
SMLGVYLQRSWVILNVTAVILSLLYIFAAPILAFIGQTPAISSATGIFSIYMIPQIFAYA 
VNYPTAKFLQSQSKIMVMAAISAVALVLHVLLTWFVIEGLQWGTAGLAVVLNASWWFIVV 
AQLVYIFSGTCGEAWSGFSWEAFHNLWSFVRLSLASAVMLCLEVWYLMAVILFAGYLKNA 
EISVAALSICMNILGWTAMIAIGMNAAVSVRVSNELGAKHPRTAKFSLLVAVITSTVIGL 
AISIALLIFRDKYPSLFVGDEEVIIVVKDLTPILAVSIVINNVQPVLSGVAVGAGWQAVV 
AYVNIVCYYVFGIPFGLLLGYKLNFGVMGIWCGMLTGTVVQTIVLTWMICRTNWDTEAAM 
AEGRIREWGGEVSDQLLN*
>AT1G12950.1 |  MATE efflux family protein 
MEKDNDFKDPFLASTEEEELDPATQKALMEYLGVGSRASSLVSFSSTAVDIPPISGVGDF 
VREFRIESRKLWKLAGPAIFTTMSQYSLGAVTQVFAGHISTLALAAVSIENSVIAGFSFG 
IMLGMGSALETLCGQAFGAGKVSMLGVYLQRSWVILSVTALFLSLIYIFAAPILTFIGQT 
AAISAMAGIFSIYMIPQIFAYAINFPTAKFLQSQSKIMVMAGISGVVLVIHSFFTWLVMS 
RLHWGLPGLALVLNTSWWVIVVAQLVYIFNCTCGEAWSGFTWEAFHNLWGFVKLSLASAA 
MLCLEIWYFMALVLFAGYLKNAEVSVAALSICMNILGWAAMVAFGTNAAVSVRVSNELGA 
SHPRTAKFSLVVAVILSTAIGMFIAAGLLFFRNEYPVLFVEDEEVRNVVRELTPMLAFCI 
VINNVQPVLSGVAVGAGWQAVVAYVNIACYYLFGVPFGLLLGFKLEYGVMGIWWGMVTGT 
FVQSIVLTWMICKTNWEKEASMAEERIKEWGGVPAEKETLLN*
>AT1G51340.2 |  MATE efflux family protein 
MMSEDGYNTDFPRNPLYIFFSDFRSVLKFDELGLEIARIALPAALALTADPIASLVDTAF 
IGQIGPVELAAVGVSIALFNQVSRIAIFPLVSITTSFVAEEDACSSQQDTVRDHKECIEI 
GINNPTEETIELIPEKHKDSLSDEFKTSSSIFSISKPPAKKRNIPSASSALIIGGVLGLF 
QAVFLISAAKPLLSFMGVKHDSPMMRPSQRYLSLRSLGAPAVLLSLAAQGVFRGFKDTTT 
PLFATVIGDVTNIILDPIFIFVFRLGVTGAATAHVISQYLMCGILLWKLMGQVDIFNMST 
KHLQFCRFMKNGFLLLMRVIAVTFCVTLSASLAAREGSTSMAAFQVCLQVWLATSLLADG 
YAVAGQAILASAFAKKDYKRAAATASRVLQLGLVLGFVLAVILGAGLHFGARVFTKDDKV 
LHLISIGLPFVAGTQPINALAFVFDGVNFGASDFGYAAASLVMVAIVSILCLLFLSSTHG 
FIGLWFGLTIYMSLRAAVGFWRIGTGTGPWSFLRS*
>AT1G51340.2 |  MATE efflux family protein 
MMSEDGYNTDFPRNPLYIFFSDFRSVLKFDELGLEIARIALPAALALTADPIASLVDTAF 
IGQIGPVELAAVGVSIALFNQVSRIAIFPLVSITTSFVAEEDACSSQQDTVRDHKECIEI 
GINNPTEETIELIPEKHKDSLSDEFKTSSSIFSISKPPAKKRNIPSASSALIIGGVLGLF 
QAVFLISAAKPLLSFMGVKHDSPMMRPSQRYLSLRSLGAPAVLLSLAAQGVFRGFKDTTT 
PLFATVIGDVTNIILDPIFIFVFRLGVTGAATAHVISQYLMCGILLWKLMGQVDIFNMST 
KHLQFCRFMKNGFLLLMRVIAVTFCVTLSASLAAREGSTSMAAFQVCLQVWLATSLLADG 
YAVAGQAILASAFAKKDYKRAAATASRVLQLGLVLGFVLAVILGAGLHFGARVFTKDDKV 
LHLISIGLPFVAGTQPINALAFVFDGVNFGASDFGYAAASLVMVAIVSILCLLFLSSTHG 
FIGLWFGLTIYMSLRAAVGFWRIGTGTGPWSFLRS*
>AT1G51340.1 |  MATE efflux family protein 
MATTQIFQETLYTFSLVISVLKFDELGLEIARIALPAALALTADPIASLVDTAFIGQIGP 
VELAAVGVSIALFNQVSRIAIFPLVSITTSFVAEEDACSSQQDTVRDHKECIEIGINNPT 
EETIELIPEKHKDSLSDEFKTSSSIFSISKPPAKKRNIPSASSALIIGGVLGLFQAVFLI 
SAAKPLLSFMGVKHDSPMMRPSQRYLSLRSLGAPAVLLSLAAQGVFRGFKDTTTPLFATV 
IGDVTNIILDPIFIFVFRLGVTGAATAHVISQYLMCGILLWKLMGQVDIFNMSTKHLQFC 
RFMKNGFLLLMRVIAVTFCVTLSASLAAREGSTSMAAFQVCLQVWLATSLLADGYAVAGQ 
AILASAFAKKDYKRAAATASRVLQLGLVLGFVLAVILGAGLHFGARVFTKDDKVLHLISI 
GLPFVAGTQPINALAFVFDGVNFGASDFGYAAASLVMVAIVSILCLLFLSSTHGFIGLWF 
GLTIYMSLRAAVGFWRIGTGTGPWSFLRS*
>AT1G51340.1 |  MATE efflux family protein 
MATTQIFQETLYTFSLVISVLKFDELGLEIARIALPAALALTADPIASLVDTAFIGQIGP 
VELAAVGVSIALFNQVSRIAIFPLVSITTSFVAEEDACSSQQDTVRDHKECIEIGINNPT 
EETIELIPEKHKDSLSDEFKTSSSIFSISKPPAKKRNIPSASSALIIGGVLGLFQAVFLI 
SAAKPLLSFMGVKHDSPMMRPSQRYLSLRSLGAPAVLLSLAAQGVFRGFKDTTTPLFATV 
IGDVTNIILDPIFIFVFRLGVTGAATAHVISQYLMCGILLWKLMGQVDIFNMSTKHLQFC 
RFMKNGFLLLMRVIAVTFCVTLSASLAAREGSTSMAAFQVCLQVWLATSLLADGYAVAGQ 
AILASAFAKKDYKRAAATASRVLQLGLVLGFVLAVILGAGLHFGARVFTKDDKVLHLISI 
GLPFVAGTQPINALAFVFDGVNFGASDFGYAAASLVMVAIVSILCLLFLSSTHGFIGLWF 
GLTIYMSLRAAVGFWRIGTGTGPWSFLRS*