>AT5G45020.1 |  LOCATED IN cellular_component unknown EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal (InterProIPR004046) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G198801) Has 1632 Blast hits to 1632 proteins in 489 species Archae - 12 Bacteria - 907 Metazoa - 22 Fungi - 156 Plants - 57 Viruses - 0 Other Eukaryotes - 478 (source NCBI BLink) 
MARSGVDETSESGAFVRTASTFRNFVSQDPDSQFPAESGRYHLYISYACPWACRCLSYLK 
IKGLDEAITFSSVHAIWGRTKETDDHRGWVFPDSDTELPGAEPDYLNGAKSVRELYEIAS 
PNYEGKYTVPVLWDKKLKTVVNNESSEIIRMFNTEFNGIAKTPSLDLYPSHLRDVINETN 
GWVFNGINNGVYKCGFARKQEPYNEAVNQLYEAVDRCEEVLGKQRYICGNTFTEADIRLF 
VTLIRFDEVYAVHFKCNKRLLREYPNIFNYIKDIYQIHGMSSTVNMEHIKQHYYGSHPTI 
NPFGIIPHGPNIDYSSPHDRDRFSS*
>AT3G26590.1 |  MATE efflux family protein 
MAKDKDITETLLTAAEERSDLPFLSVDDIPPITTVGGFVREFNVETKKLWYLAGPAIFTS 
VNQYSLGAITQVFAGHISTIALAAVSVENSVVAGFSFGIMLGMGSALETLCGQAFGAGKL 
SMLGVYLQRSWVILNVTALILSLLYIFAAPILASIGQTAAISSAAGIFSIYMIPQIFAYA 
INFPTAKFLQSQSKIMVMAVISAVALVIHVPLTWFVIVKLQWGMPGLAVVLNASWCFIDM 
AQLVYIFSGTCGEAWSGFSWEAFHNLWSFVRLSLASAVMLCLEVWYFMAIILFAGYLKNA 
EISVAALSICMNILGWTAMIAIGMNTAVSVRVSNELGANHPRTAKFSLLVAVITSTLIGF 
IVSMILLIFRDQYPSLFVKDEKVIILVKELTPILALSIVINNVQPVLSGVAVGAGWQAVV 
AYVNIACYYVFGIPFGLLLGYKLNYGVMGIWCGMLTGTVVQTIVLTWMICKTNWDTEASM 
AEDRIREWGGEVSEIKQLIN*
>AT3G62870.1 |  60S ribosomal protein L7A (RPL7aB) 
MAPKKGVKVASKKKPEKVTNPLFERRPKQFGIGGALPPKKDLSRYIKWPKSIRLQRQKRI 
LKQRLKVPPALNQFTKTLDKNLATSLFKILLKYRPEDKAAKKERLLNKAQAEAEGKPAES 
KKPIVVKYGLNHVTYLIEQNKAQLVVIAHDVDPIELVVWLPALCRKMEVPYCIVKGKSRL 
GAVVHQKTAAALCLTTVKNEDKLEFSKILEAIKANFNDKYEEYRKKWGGGIMGSKSQAKT 
KAKERVIAKEAAQRMN*
>AT1G21640.1 |  NADK2 NAD+ kinase/ calmodulin binding 
MFLCFCPCHVPIMSRLSPATGISSRLRFSIGLSSDGRLIPFGFRFRRNDVPFKRRLRFVI 
RAQLSEAFSPDLGLDSQAVKSRDTSNLPWIGPVPGDIAEVEAYCRIFRSAERLHGALMET 
LCNPVTGECRVPYDFSPEEKPLLEDKIVSVLGCILSLLNKGRKEILSGRSSSMNSFNLDD 
VGVAEESLPPLAVFRGEMKRCCESLHIALENYLTPDDERSGIVWRKLQKLKNVCYDAGFP 
RSDNYPCQTLFANWDPIYSSNTKEDIDSYESEIAFWRGGQVTQEGLKWLIENGFKTIVDL 
RAEIVKDTFYQTALDDAISLGKITVVQIPIDVRMAPKAEQVELFASIVSDSSKRPIYVHS 
KEGVWRTSAMVSRWKQYMTRPITKEIPVSEESKRREVSETKLGSNAVVSGKGVPDEQTDK 
VSEINEVDSRSASSQSKESGRFEGDTSASEFNMVSDPLKSQVPPGNIFSRKEMSKFLKSK 
SIAPAGYLTNPSKILGTVPTPQFSYTGVTNGNQIVDKDSIRRLAETGNSNGTLLPTSSQS 
LDFGNGKFSNGNVHASDNTNKSISDNRGNGFSAAPIAVPPSDNLSRAVGSHSVRESQTQR 
NNSGSSSDSSDDEAGAIEGNMCASATGVVRVQSRKKAEMFLVRTDGVSCTREKVTESSLA 
FTHPSTQQQMLLWKTTPKTVLLLKKLGQELMEEAKEAASFLYHQENMNVLVEPEVHDVFA 
RIPGFGFVQTFYIQDTSDLHERVDFVACLGGDGVILHASNLFKGAVPPVVSFNLGSLGFL 
TSHPFEDFRQDLKRVIHGNNTLDGVYITLRMRLRCEIYRKGKAMPGKVFDVLNEIVVDRG 
SNPYLSKIECYEHDRLITKVQGDGVIVATPTGSTAYSTAAGGSMVHPNVPCMLFTPICPH 
SLSFRPVILPDSAKLELKIPDDARSNAWVSFDGKRRQQLSRGDSVRIYMSQHPLPTVNKS 
DQTGDWFRSLIRCLNWNERLDQKAL*
>AT1G10070.1 |  ATBCAT-2 (ARABIDOPSIS THALIANA BRANCHED-CHAIN AMINO ACID TRANSAMINASE 2) branched-chain-amino-acid transaminase/ catalytic 
MIKTITSLRKTLVLPLHLHIRTLQTFAKYNAQAASALREERKKPLYQNGDDVYADLDWDN 
LGFGLNPADYMYVMKCSKDGEFTQGELSPYGNIQLSPSAGVLNYGQAIYEGTKAYRKENG 
KLLLFRPDHNAIRMKLGAERMLMPSPSVDQFVNAVKQTALANKRWVPPAGKGTLYIRPLL 
MGSGPILGLGPAPEYTFIVYASPVGNYFKEGMAALNLYVEEEYVRAAPGGAGGVKSITNY 
APVLKALSRAKSRGFSDVLYLDSVKKKYLEEASSCNVFVVKGRTISTPATNGTILEGITR 
KSVMEIASDQGYQVVEKAVHVDEVMDADEVFCTGTAVVVAPVGTITYQEKRVEYKTGDES 
VCQKLRSVLVGIQTGLIEDNKGWVTDIN*
>AT1G10070.2 |  ATBCAT-2 (ARABIDOPSIS THALIANA BRANCHED-CHAIN AMINO ACID TRANSAMINASE 2) branched-chain-amino-acid transaminase/ catalytic 
MIKTITSLRKTLVLPLHLHIRTLQTFAKYNAQAASALREERKKPLYQNGDDVYADLDWDN 
LGFGLNPADYMYVMKCSKDGEFTQGELSPYGNIQLSPSAGVLNYGQAIYEGTKAYRKENG 
KLLLFRPDHNAIRMKLGAERMLMPSPSVDQFVNAVKQTALANKRWVPPAGKGTLYIRPLL 
MGSGPILGLGPAPEYTFIVYASPVGNYFKEGMAALNLYVEEEYVRAAPGGAGGVKSITNY 
APVLKALSRAKSRGFSDVLYLDSVKKKYLEEASSCNVFVVKGRTISTPATNGTILEGITR 
KSVMEIASDQGYQVVEKAVHVDEVMDADEVFCTGTAVVVAPVGTITYQEKRVEYKTGDES 
VCQKLRSVLVGIQTGLIEDNKGWVTDIN*
>AT1G10070.3 |  ATBCAT-2 (ARABIDOPSIS THALIANA BRANCHED-CHAIN AMINO ACID TRANSAMINASE 2) branched-chain-amino-acid transaminase/ catalytic 
MYVMKCSKDGEFTQGELSPYGNIQLSPSAGVLNYGQAIYEGTKAYRKENGKLLLFRPDHN 
AIRMKLGAERMLMPSPSVDQFVNAVKQTALANKRWVPPAGKGTLYIRPLLMGSGPILGLG 
PAPEYTFIVYASPVGNYFKEGMAALNLYVEEEYVRAAPGGAGGVKSITNYAPVLKALSRA 
KSRGFSDVLYLDSVKKKYLEEASSCNVFVVKGRTISTPATNGTILEGITRKSVMEIASDQ 
GYQVVEKAVHVDEVMDADEVFCTGTAVVVAPVGTITYQEKRVEYKTGDESVCQKLRSVLV 
GIQTGLIEDNKGWVTDIN*
>AT5G52640.1 |  ATHSP901 (HEAT SHOCK PROTEIN 901) ATP binding / unfolded protein binding 
MADVQMADAETFAFQAEINQLLSLIINTFYSNKEIFLRELISNSSDALDKIRFESLTDKS 
KLDGQPELFIRLVPDKSNKTLSIIDSGIGMTKADLVNNLGTIARSGTKEFMEALQAGADV 
SMIGQFGVGFYSAYLVAEKVVVTTKHNDDEQYVWESQAGGSFTVTRDVDGEPLGRGTKIT 
LFLKDDQLEYLEERRLKDLVKKHSEFISYPIYLWTEKTTEKEISDDEDEDEPKKENEGEV 
EEVDEEKEKDGKKKKKIKEVSHEWELINKQKPIWLRKPEEITKEEYAAFYKSLTNDWEDH 
LAVKHFSVEGQLEFKAILFVPKRAPFDLFDTRKKLNNIKLYVRRVFIMDNCEELIPEYLS 
FVKGVVDSDDLPLNISRETLQQNKILKVIRKNLVKKCIEMFNEIAENKEDYTKFYEAFSK 
NLKLGIHEDSQNRGKIADLLRYHSTKSGDEMTSFKDYVTRMKEGQKDIFYITGESKKAVE 
NSPFLERLKKRGYEVLYMVDAIDEYAVGQLKEYDGKKLVSATKEGLKLEDETEEEKKKRE 
EKKKSFENLCKTIKEILGDKVEKVVVSDRIVDSPCCLVTGEYGWTANMERIMKAQALRDS 
SMSGYMSSKKTMEINPDNGIMEELRKRAEADKNDKSVKDLVMLLYETALLTSGFSLDEPN 
TFAARIHRMLKLGLSIDEDENVEEDGDMPELEEDAAEESKMEEVD*
>AT1G70580.1 |  AOAT2 (ALANINE-2-OXOGLUTARATE AMINOTRANSFERASE 2) L-alanine2-oxoglutarate aminotransferase/ glycine2-oxoglutarate aminotransferase 
MSLKALDYESLNENVKNCQYAVRGELYLRASELQKEGKKIIFTNVGNPHALGQKPLTFPR 
QVVSLCQAPFLLDDPNVGMIFPADAIARAKHYLSLTSGGLGAYSDSRGLPGVRKEVAEFI 
ERRDGYPSDPELIFLTDGASKGVMQILNCVIRGQKDGILVPVPQYPLYSATISLLGGTLV 
PYYLEESENWGLDVNNLRQSVAQARSQGITVRAMVIINPGNPTGQCLSEANIREILRFCC 
DERLVLLGDEVYQQNIYQDERPFISSKKVLMDMGAPISKEVQLISFHTVSKGYWGECGQR 
GGYFEMTNIPPRTVEEIYKVASIALSPNVSAQIFMGLMVSPPKPGDISYDQFVRESKGIL 
ESLRRRARMMTDGFNSCKNVVCNFTEGAMYSFPQIKLPSKAIQAAKQAGKVPDVFYCLKL 
LEATGISTVPGSGFGQKEGVFHLRTTILPAEEEMPEIMDSFKKFNDEFMSQYADNFGYSR 
M*
>AT1G70580.2 |  AOAT2 (ALANINE-2-OXOGLUTARATE AMINOTRANSFERASE 2) L-alanine2-oxoglutarate aminotransferase/ glycine2-oxoglutarate aminotransferase 
MSLKALDYESLNENVKNCQYAVRGELYLRASELQKEGKKIIFTNVGNPHALGQKPLTFPR 
QVVSLCQAPFLLDDPNVGMIFPADAIARAKHYLSLTSGGLGAYSDSRGLPGVRKEVAEFI 
ERRDGYPSDPELIFLTDGASKGVMQILNCVIRGQKDGILVPVPQYPLYSATISLLGGTLV 
PYYLEESENWGLDVNNLRQSVAQARSQGITVRAMVIINPGNPTGQCLSEANIREILRFCC 
DERLVLLGDEVYQQNIYQDERPFISSKKVLMDMGAPISKEVQLISFHTVSKGYWGECGQR 
GGYFEMTNIPPRTVEEIYKVASIALSPNVSAQIFMGLMVSPPKPGDISYDQFVRESKGIL 
ESLRRRARMMTDGFNSCKNVVCNFTEGAMYSFPQIKLPSKAIQAAKQAGKVPDVFYCLKL 
LEATGISTVPGSGFGQKEGVFHLRTTILPAEEEMPEIMDSFKKFNDEFMSQYADNFGYSR 
M*
>AT1G70580.3 |  AOAT2 (ALANINE-2-OXOGLUTARATE AMINOTRANSFERASE 2) L-alanine2-oxoglutarate aminotransferase/ glycine2-oxoglutarate aminotransferase 
MSLKALDYESLNENVKNCQYAVRGELYLRASELQKEGKKIIFTNVGNPHALGQKPLTFPR 
QVVSLCQAPFLLDDPNVGMIFPADAIARAKHYLSLTSGGLGAYSDSRGLPGVRKEVAEFI 
ERRDGYPSDPELIFLTDGASKGVMQILNCVIRGQKDGILVPVPQYPLYSATISLLGGTLV 
PYYLEESENWGLDVNNLRQSVAQARSQGITVRAMVIINPGNPTGQCLSEANIREILRFCC 
DERLVLLGDEVYQQNIYQDERPFISSKKVLMDMGAPISKEVQLISFHTVSKGYWGECGQR 
GGYFEMTNIPPRTVEEIYKVASIALSPNVSAQIFMGLMVSPPKPGDISYDQFVRESKGIL 
ESLRRRARMMTDGFNSCKNVVCNFTEGAMYSFPQIKLPSKAIQAAKQAGKVPDVFYCLKL 
LEATGISTVPGSGFGQKEGVFHLRTTILPAEEEMPEIMDSFKKFNDEFMSQYADNFGYSR 
M*
>AT1G70580.4 |  AOAT2 (ALANINE-2-OXOGLUTARATE AMINOTRANSFERASE 2) L-alanine2-oxoglutarate aminotransferase/ glycine2-oxoglutarate aminotransferase 
MSLKALDYESLNENVKNCQYAVRGELYLRASELQKEGKKIIFTNVGNPHALGQKPLTFPR 
QVVSLCQAPFLLDDPNVGMIFPADAIARAKHYLSLTSGGLGAYSDSRGLPGVRKEVAEFI 
ERRDGYPSDPELIFLTDGASKGVMQILNCVIRGQKDGILVPVPQYPLYSATISLLGGTLV 
PYYLEESENWGLDVNNLRQSVAQARSQGITVRAMVIINPGNPTGQCLSEANIREILRFCC 
DERLVLLGDEVYQQNIYQDERPFISSKKVLMDMGAPISKEVQLISFHTVSKGYWGECGQR 
GGYFEMTNIPPRTVEEIYKVASIALSPNVSAQIFMGLMVSPPKPGDISYDQFVRESKGIL 
ESLRRRARMMTDGFNSCKNVVCNFTEGAMYSFPQIKLPSKAIQAAKQAGKVPDVFYCLKL 
LEATGISTVPGSGFGQKEGVFHLRTTILPAEEEMPEIMDSFKKFNDEFMSQYADNFGYSR 
M*
>AT2G38960.2 |  AERO2 (Arabidopsis endoplasmic reticulum oxidoreductins 2) FAD binding / electron carrier/ oxidoreductase acting on sulfur group of donors disulfide as acceptor / protein binding 
MAETDVGSVKGKEKGSGKRWILLIGAIAAVLLAVVVAVFLNTQNSSISEFTGKICNCRQA 
EQQKYIGIVEDCCCDYETVNRLNTEVLNPLLQDLVKTPFYRYFKVKLWCDCPFWPDDGMC 
RLRDCSVCECPESEFPEVFKKPLSQYNPVCQEGKPQATVDRTLDTRAFRGWTVTDNPWTS 
DDETDNDEMTYVNLRLNPERYTGYIGPSARRIWEAIYSENCPKHTSEGSCQEEKILYKLV 
SGLHSSISVHIASDYLLDEATNLWGQNLTLLYDRVLRYPDRVQNLYFTFLFVLRAVTKVK 
DYLGEAEYETGNVIEDLKTKSLVKQVVSDPKTKAACPVPFDEAKLWKGQRGPELKQQLEK 
QFRNISAIMDCVGCEKCRLWGKLQILGLGTALKILFTVNGEDNLPIFVVLFQLELQRNEV 
IALMNLLHRLSESVKYVHDMSPAAERIAGGHASSGNSFWQRIVTSIAQSKGKKALKNL*
>AT2G38960.1 |  AERO2 (Arabidopsis endoplasmic reticulum oxidoreductins 2) FAD binding / electron carrier/ oxidoreductase acting on sulfur group of donors disulfide as acceptor / protein binding 
MAETDVGSVKGKEKGSGKRWILLIGAIAAVLLAVVVAVFLNTQNSSISEFTGKICNCRQA 
EQQKYIGIVEDCCCDYETVNRLNTEVLNPLLQDLVKTPFYRYFKVKLWCDCPFWPDDGMC 
RLRDCSVCECPESEFPEVFKKPLSQYNPVCQEGKPQATVDRTLDTRAFRGWTVTDNPWTS 
DDETDNDEMTYVNLRLNPERYTGYIGPSARRIWEAIYSENCPKHTSEGSCQEEKILYKLV 
SGLHSSISVHIASDYLLDEATNLWGQNLTLLYDRVLRYPDRVQNLYFTFLFVLRAVTKAE 
DYLGEAEYETGNVIEDLKTKSLVKQVVSDPKTKAACPVPFDEAKLWKGQRGPELKQQLEK 
QFRNISAIMDCVGCEKCRLWGKLQILGLGTALKILFTVNGEDNLRHNLELQRNEVIALMN 
LLHRLSESVKYVHDMSPAAERIAGGHASSGNSFWQRIVTSIAQSKAVSGKRS*
>AT5G10330.1 |  HPA1 (HISTIDINOL PHOSPHATE AMINOTRANSFERASE 1) histidinol-phosphate transaminase 
MGVINVQGSPSFSIHSSESNLRKSRALKKPFCSIRNRVYCAQSSSAAVDESKNITMGDSF 
IRPHLRQLAAYQPILPFEVLSAQLGRKPEDIVKLDANENPYGPPPEVFEALGNMKFPYVY 
PDPQSRRLRDALAQDSGLESEYILVGCGADELIDLIMRCVLDPGEKIIDCPPTFSMYVFD 
AAVNGAGVIKVPRNPDFSLNVDRIAEVVELEKPKCIFLTSPNNPDGSIISEDDLLKILEM 
PILVVLDEAYIEFSGVESRMKWVKKYENLIVLRTFSKRAGLAGLRVGYGAFPLSIIEYLW 
RAKQPYNVSVAGEVAALAALSNGKYLEDVRDALVRERERLFGLLKEVPFLNPYPSYSNFI 
LCEVTSGMDAKKLKEDLAKMGVMVRHYNSQELKGYVRVSAGKPEHTDVLMECLKQFY*
>AT5G10330.2 |  HPA1 (HISTIDINOL PHOSPHATE AMINOTRANSFERASE 1) histidinol-phosphate transaminase 
MGVINVQGSPSFSIHSSESNLRKSRALKKPFCSIRNRVYCAQSSSAAVDESKNITMGDSF 
IRPHLRQLAAYQPILPFEVLSAQLGRKPEDIVKLDANENPYGPPPEVFEALGNMKFPYVY 
PDPQSRRLRDALAQDSGLESEYILVGCGADELIDLIMRCVLDPGEKIIDCPPTFSMYVFD 
AAVNGAGVIKVPRNPDFSLNVDRIAEVVELEKPKCIFLTSPNNPDGSIISEDDLLKILEM 
PILVVLDEAYIEFSGVESRMKWVKKYENLIVLRTFSKRAGLAGLRVGYGAFPLSIIEYLW 
RAKQPYNVSVAGEVAALAALSNGKYLEDVRDALVRERERLFGLLKEVPFLNPYPSYSNFI 
LCEVTSGMDAKKLKEDLAKMGVMVRHYNSQELKGYVRVSAGKPEHTDVLMECLKQFY*
>AT5G10330.3 |  HPA1 (HISTIDINOL PHOSPHATE AMINOTRANSFERASE 1) histidinol-phosphate transaminase 
MGDSFIRPHLRQLAAYQPILPFEVLSAQLGRKPEDIVKLDANENPYGPPPEVFEALGNMK 
FPYVYPDPQSRRLRDALAQDSGLESEYILVGCGADELIDLIMRCVLDPGEKIIDCPPTFS 
MYVFDAAVNGAGVIKVPRNPDFSLNVDRIAEVVELEKPKCIFLTSPNNPDGSIISEDDLL 
KILEMPILVVLDEAYIEFSGVESRMKWVKKYENLIVLRTFSKRAGLAGLRVGYGAFPLSI 
IEYLWRAKQPYNVSVAGEVAALAALSNGKYLEDVRDALVRERERLFGLLKEVPFLNPYPS 
YSNFILCEVTSGMDAKKLKEDLAKMGVMVRHYNSQELKGYVRVSAGKPEHTDVLMECLKQ 
FY*
>AT1G10210.1 |  ATMPK1 (MITOGEN-ACTIVATED PROTEIN KINASE 1) MAP kinase/ kinase 
MATLVDPPNGIRNEGKHYFSMWQTLFEIDTKYMPIKPIGRGAYGVVCSSVNSDTNEKVAI 
KKIHNVYENRIDALRTLRELKLLRHLRHENVIALKDVMMPIHKMSFKDVYLVYELMDTDL 
HQIIKSSQVLSNDHCQYFLFQLLRGLKYIHSANILHRDLKPGNLLVNANCDLKICDFGLA 
RASNTKGQFMTEYVVTRWYRAPELLLCCDNYGTSIDVWSVGCIFAELLGRKPIFQGTECL 
NQLKLIVNILGSQREEDLEFIDNPKAKRYIRSLPYSPGMSLSRLYPGAHVLAIDLLQKML 
VFDPSKRISVSEALQHPYMAPLYDPNANPPAQVPIDLDVDEDLREEMIREMMWNEMLHYH 
PQASTLNTEL*
>AT1G10210.2 |  ATMPK1 (MITOGEN-ACTIVATED PROTEIN KINASE 1) MAP kinase/ kinase 
MATLVDPPNGIRNEGKHYFSMWQTLFEIDTKYMPIKPIGRGAYGVVCSSVNSDTNEKVAI 
KKIHNVYENRIDALRTLRELKLLRHLRHENVIALKDVMMPIHKMSFKDVYLVYELMDTDL 
HQIIKSSQVLSNDHCQYFLFQLLRGLKYIHSANILHRDLKPGNLLVNANCDLKICDFGLA 
RASNTKGQFMTEYVVTRWYRAPELLLCCDNYGTSIDVWSVGCIFAELLGRKPIFQGTECL 
NQLKLIVNILGSQREEDLEFIDNPKAKRYIRSLPYSPGMSLSRLYPGAHVLAIDLLQKML 
VFDPSKRISVSEALQHPYMAPLYDPNANPPAQVPIDLDVDEDLREEMIREMMWNEMLHYH 
PQASTLNTEL*
>AT1G15440.1 |  transducin family protein / WD-40 repeat family protein 
MEFRFENLLGAPYRGGNAVITKNTQLISPVGNRVSVTDLSKNHSVTLPLETSTNICRLAS 
SPDGTFLLAVDEQNRCLFINLPRRVVLHRITFKDKVGALKFSPNGKFIAVGIGKLVEIWR 
SPGFRRAVLPFERVRTFANSDDKVVSLEWSLDSDYLLVGSRDLAARLFCVRKLKGVLNKP 
FLFLGHRDSVVGCFFGVDKMTNKVNRAFTIARDGYIFSWGYTEKDVKMDESEDGHSEPPS 
PVTPDRADEVMVENGGGVGTELKKRKEYDGKGLESDEEGDDDDEEYMHRGKWVLLRKDGC 
NQASAKVTACDYHQGLDMVVVGFSNGVFGLYQMPDFICIHLLSISRQKLTTAVFNERGNW 
LTFGCAKLGQLLVWDWRTETYILKQQGHYFDVNCVTYSPDSQLLATGADDNKVKVWNVMS 
GTCFITFTEHTNAVTALHFMADNHSLLSASLDGTVRAWDFKRYKNYKTYTTPTPRQFVSL 
TADPSGDVVCAGTLDSFEIFVWSKKTGQIKDILSGHEAPVHGLMFSPLTQLLASSSWDYT 
VRLWDVFASKGTVETFRHNHDVLTVAFRPDGKQLASSTLDGQINFWDTIEGVLMYTIEGR 
RDIAGGRVMTDRRSAANSSSGKCFTTLCYSADGGYILAAGTSRYICMYDIADQVLLRRFQ 
ISHNLSLDGVLDFLHSKKMTEAGPIDLIDDDNSDEEGGIDKQSRGNLGYDLPGSRPNRGR 
PIIRTKSLSIAPTGRSFAAATTEGVLIFSIDDTFIFDPTDLDIDVTPEAVEAAIEEDEVS 
RALALSMRLNEDSLIKKCIFAVAPADIKAVAISVRQKYLERLMEALVDLLENCPHLEFIL 
HWCQEICKAHGSSIQRNYRTLLPALRSLQKAITRAHQDLADMCSSNEYTLRYLCSVPNNH 
*
>AT1G15440.2 |  transducin family protein / WD-40 repeat family protein 
MEFRFENLLGAPYRGGNAVITKNTQLISPVGNRVSVTDLSKNHSVTLPLETSTNICRLAS 
SPDGTFLLAVDEQNRCLFINLPRRVVLHRITFKDKVGALKFSPNGKFIAVGIGKLVEIWR 
SPGFRRAVLPFERVRTFANSDDKVVSLEWSLDSDYLLVGSRDLAARAFTIARDGYIFSWG 
YTEKDVKMDESEDGHSEPPSPVTPDRADEVMVENGGGVGTELKKRKEYDGKGLESDEEGD 
DDDEEYMHRGKWVLLRKDGCNQASAKVTACDYHQGLDMVVVGFSNGVFGLYQMPDFICIH 
LLSISRQKLTTAVFNERGNWLTFGCAKLGQLLVWDWRTETYILKQQGHYFDVNCVTYSPD 
SQLLATGADDNKVKVWNVMSGTCFITFTEHTNAVTALHFMADNHSLLSASLDGTVRAWDF 
KRYKNYKTYTTPTPRQFVSLTADPSGDVVCAGTLDSFEIFVWSKKTGQIKDILSGHEAPV 
HGLMFSPLTQLLASSSWDYTVRLWDVFASKGTVETFRHNHDVLTVAFRPDGKQLASSTLD 
GQINFWDTIEGVLMYTIEGRRDIAGGRVMTDRRSAANSSSGKCFTTLCYSADGGYILAAG 
TSRYICMYDIADQVLLRRFQISHNLSLDGVLDFLHSKKMTEAGPIDLIDDDNSDEEGGID 
KQSRGNLGYDLPGSRPNRGRPIIRTKSLSIAPTGRSFAAATTEGVLIFSIDDTFIFDPTD 
LDIDVTPEAVEAAIEEDEVSRALALSMRLNEDSLIKKCIFAVAPADIKAVAISVRQKYLE 
RLMEALVDLLENCPHLEFILHWCQEICKAHGSSIQRNYRTLLPALRSLQKAITRAHQDLA 
DMCSSNEYTLRYLCSVPNNH*
>AT1G34580.1 |  monosaccharide transporter putative 
MAGGGLALDVSSAGNIDAKITAAVVMSCIVAASCGLIFGYDIGISGGVTTMKPFLEKFFP 
SVLKKASEAKTNVYCVYDSQLLTAFTSSLYVAGLVASLVASRLTAAYGRRTTMILGGFTF 
LFGALINGLAANIAMLISGRILLGFGVGFTNQAAPVYLSEVAPPRWRGAFNIGFSCFISM 
GVVAANLINYGTDSHRNGWRISLGLAAVPAAIMTVGCLFISDTPSSLLARGKHDEAHTSL 
LKLRGVENIADVETELAELVRSSQLAIEARAELFMKTILQRRYRPHLVVAVVIPCFQQLT 
GITVNAFYAPVLFRSVGFGSGPALIATFILGFVNLGSLLLSTMVIDRFGRRFLFIAGGIL 
MLLCQIAVAVLLAVTVGATGDGEMKKGYAVTVVVLLCIYAAGFGWSWGPLSWLVPSEIFP 
LKIRPAGQSLSVAVNFAATFALSQTFLATLCDFKYGAFLFYGGWIFTMTIFVIMFLPETK 
GIPVDSMYQVWEKHWYWQRFTKPTST*
>AT2G47090.1 |  nucleic acid binding / protein binding / zinc ion binding 
MDDSCAVCAENLEWVGYGSCGHREVCSTCVVRLRFILNDRRCCICKTECPVVFVTKALGD 
YTKTISDFSTTFPSVPKEGRVGSFWYHEETNVYFDDLNHYTRIKAMCRLSCNLCNDTNKT 
RPKKEPNHCVRFKSVEHLKDHLNHQHKLHMCSLCLVGRKVFICEQKLFTKGQLNQHISSG 
DSEVDGSESERGGFTGHPMCEFCKRPFYGDNELYTHMSREHYTCHICQRLKPGQYEYYGN 
YDDLEVHFRSDHFLCEDETCLAKKFIVFQIEAELKRHNAIDHGGRMSRSQQNASLQIQAS 
FQYPNSRRGRRRSSLREPNLVLLESQASYAFNDDNNLPQHVGRSGNSRLGESSFPPLSVQ 
ANQGQSRFGQNSESLVSNTTTTRQRHRANQGQSRFGQNSESLVSNTTTTRQRHQTNRSAT 
SGSSQAWPALNRGPAEISITSRVQSSGASAQSQSRHHDRVESTRTLASAVPQDARTTVGG 
CSSGSSLSSANATKRNNHHSSSTPKMSETRSLAQPSHSDSPQISAVKNRRSSSTSANAGN 
IQVAQGVSDVQSDNKSLVEKIHASLGHDEELFMAFKNTSGKYRHGSIDARTYLEYVKGYG 
LSHLVLDMARLCPDPQRQKELIDTHNACLKGGNKGKAVKVESSSDSKGDRFVDTVRKLQF 
SDKSQDKDKDKDAYRSDKGKTKVTTLVNSSSAGVGLGDTGKQPKKTSKFLRTRLGEKSMA 
AVLDLRNSNPEPEPEPKNDNSKRSQNSPGGLPLRGAWKRGSAKLFV*
>AT3G06470.1 |  GNS1/SUR4 membrane family protein 
MASIYSSLTYWLVNHPYISNFTWIEGETLGSTVFFVSVVVSVYLSATFLLRSAIDSLPSL 
SPRILKPITAVHSLILCLLSLVMAVGCTLSITSSHASSDPMARFLHAICFPVDVKPNGPL 
FFWAQVFYLSKILEFGDTILIILGKSIQRLSFLHVYHHATVVVMCYLWLRTRQSMFPIAL 
VTNSTVHVIMYGYYFLCAVGSRPKWKRLVTDCQIVQFVFSFGLSGWMLREHLFGSGCTGI 
WGWCFNAAFNASLLALFSNFHSKNYVKKPTREDGKKSD*
>AT3G45630.1 |  RNA recognition motif (RRM)-containing protein 
MSDYGEKTCPLCAEEMDLTDQQLKPCKCGYQICVWCWHHIMDMAEKDQSEGRCPACRTPY 
DKEKIVGMTVDQERLASEGNMDRKKIQKSKPKSSDGRKPLTSVRVVQRNLVYIVGLPLNL 
ADEDLLQRKEYFGQYGKVLKVSMSRTATGLIQQFPNNTCSVYITYGKEEEAIRCIQSVHG 
FILDGKALKACFGTTKYCHAWLRNVACNNQDCLYLHEVGSQEDSFTKDEIISAHTRVQQI 
TGATNTMQYRSGSMLPPPLDAYTSDSSTGNPIAKVPSSTSVSAPKSSPPSGSSGKSTALP 
AAASWGARLTNQHSLATSALSNGSLDNQRSTSENGTLATSTVVTKAANGPVSSSNSLQKA 
PLKEEIQSLAEKSKPGVLKPLQQKIVLDPESKRTTSPNRDPSSNQISCLVESSYNSRVID 
KPSAVENSLEHTSEIAEDVFDVGKLSADVAWMGITTNSRDETPGVPVVIGTHCDLGSITQ 
SDNDVQNLEQCRKQSPTNTYAEADISLNGIHGSRPEWDWRSGLQSQIDVKEPLEVNDFSS 
FNNNRRGIAEAVSHSTSKFSSSISILDSNHLASRSFQNRETSCGMDSKTGSSFEIGSDRL 
HLPNGFSEKAMSNMEHSLFANEGRSNIQNTEDDIISNILDFDPWDESLTSQHNFAKLLGQ 
SDHRASTLESSNLLKQHNDQSRFSFARHEESNSQAYDNRSYSIYGQLSRDQPLQEFGANR 
DMYQDKLGSQNGFASNYSGGYEQFATSPGLSSYKSPVARTQVSAPPGFSAPNRLPPPGFS 
SHQRGDLSSDIASGTRLLDSANLLRNAYHVPPPSGNLNAAGDIEFIDPAILAVGRGRLHN 
GMETADFDLRSGFSSQLNSFDNDARLQLLAQRSLAAQQVNGFHDPRNVNNFSSSFSDPYG 
ISSRPTDQTQGTGLSPFTQLPRQASANPLLSNGHWDNKWNEPQSGNNLGITQLLRNERMG 
FNDNVYSGFEEPKFRRPGPGDPYNRTYGI*
>AT4G14240.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF21 (InterProIPR002550) Cystathionine beta-synthase core (InterProIPR000644) BEST Arabidopsis thaliana protein match is CBS domain-containing protein-related (TAIRAT4G142301) Has 6770 Blast hits to 6657 proteins in 1347 species Archae - 62 Bacteria - 4461 Metazoa - 254 Fungi - 179 Plants - 121 Viruses - 0 Other Eukaryotes - 1693 (source NCBI BLink) 
MHLINAVAAARILSGIGQSNGNNGGEAIPFGSFEWITYAGISCFLVLFAGIMSGLTLGLM 
SLGLVELEILQRSGTPNEKKQAAAIFPVVQKQHQLLVTLLLCNAMAMEGLPIYLDKLFNE 
YVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTLCYPIAFPIGKILDLVL 
GHNDALFRRAQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAQEAMTPIESTFS 
LDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLVSAVCIRR 
IPRVPADMPLYDILNEFQKGSSHMAAVVKVKGKSKVPPSTLLEEHTDESNDSDLTAPLLL 
KREGNHDNVIVTIDKANGQSFFQNNESGPHGFSHTSEAIEDGEVIGIITLEDVFEELLQE 
EIVDETDEYVDVHKRIRVAAAAAASSIARAPSSRKLLAQKGTGGQNKQGQTNKVPGQEQD 
KMLGTITEPIRRNN*
>AT4G14240.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF21 (InterProIPR002550) Cystathionine beta-synthase core (InterProIPR000644) BEST Arabidopsis thaliana protein match is CBS domain-containing protein-related (TAIRAT4G142301) Has 6735 Blast hits to 6622 proteins in 1349 species Archae - 62 Bacteria - 4446 Metazoa - 254 Fungi - 179 Plants - 121 Viruses - 0 Other Eukaryotes - 1673 (source NCBI BLink) 
MHLINAVAAARILSGIGQSNGNNGGEAIPFGSFEWITYAGISCFLVLFAGIMSGLTLGLM 
SLGLVELEILQRSGTPNEKKQAAAIFPVVQKQHQLLVTLLLCNAMAMEGLPIYLDKLFNE 
YVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTLCYPIAFPIGKILDLVL 
GHNDALFRRAQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAQEAMTPIESTFS 
LDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLVSAVCIRR 
IPRVPADMPLYDILNEFQKGSSHMAAVVKVKGKSKVPPSTLLEEHTDESNDSDLTAPLLL 
KREGNHDNVIVTIDKANGQSFFQNNESGPHGFSHTSEAIEDGEVIGIITLEDVFEELLQE 
EIVDETDEYVDVHKRIRVAAAAAASSIARAPSSRKLLAQKGTGGQNKQGQTNKVPGQEQD 
KMLGTITEPIRRNN*
>AT4G14240.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF21 (InterProIPR002550) Cystathionine beta-synthase core (InterProIPR000644) BEST Arabidopsis thaliana protein match is CBS domain-containing protein-related (TAIRAT4G142301) Has 6770 Blast hits to 6657 proteins in 1347 species Archae - 62 Bacteria - 4461 Metazoa - 254 Fungi - 179 Plants - 121 Viruses - 0 Other Eukaryotes - 1693 (source NCBI BLink) 
MHLINAVAAARILSGIGQSNGNNGGEAIPFGSFEWITYAGISCFLVLFAGIMSGLTLGLM 
SLGLVELEILQRSAAIFPVVQKQHQLLVTLLLCNAMAMEGLPIYLDKLFNEYVAIILSVT 
FVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTLCYPIAFPIGKILDLVLGHNDALFRR 
AQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAQEAMTPIESTFSLDVNSKLDW 
EAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLVSAVCIRRIPRVPADMP 
LYDILNEFQKGSSHMAAVVKVKGKSKVPPSTLLEEHTDESNDSDLTAPLLLKREGNHDNV 
IVTIDKANGQSFFQNNESGPHGFSHTSEAIEDGEVIGIITLEDVFEELLQEEIVDETDEY 
VDVHKRIRVAAAAAASSIARAPSSRKLLAQKGTGGQNKQGQTNKVPGQEQDKMLGTITEP 
IRRNN*
>AT4G14240.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF21 (InterProIPR002550) Cystathionine beta-synthase core (InterProIPR000644) BEST Arabidopsis thaliana protein match is CBS domain-containing protein-related (TAIRAT4G142301) Has 6735 Blast hits to 6622 proteins in 1349 species Archae - 62 Bacteria - 4446 Metazoa - 254 Fungi - 179 Plants - 121 Viruses - 0 Other Eukaryotes - 1673 (source NCBI BLink) 
MHLINAVAAARILSGIGQSNGNNGGEAIPFGSFEWITYAGISCFLVLFAGIMSGLTLGLM 
SLGLVELEILQRSAAIFPVVQKQHQLLVTLLLCNAMAMEGLPIYLDKLFNEYVAIILSVT 
FVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTLCYPIAFPIGKILDLVLGHNDALFRR 
AQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAQEAMTPIESTFSLDVNSKLDW 
EAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLVSAVCIRRIPRVPADMP 
LYDILNEFQKGSSHMAAVVKVKGKSKVPPSTLLEEHTDESNDSDLTAPLLLKREGNHDNV 
IVTIDKANGQSFFQNNESGPHGFSHTSEAIEDGEVIGIITLEDVFEELLQEEIVDETDEY 
VDVHKRIRVAAAAAASSIARAPSSRKLLAQKGTGGQNKQGQTNKVPGQEQDKMLGTITEP 
IRRNN*
>AT4G18593.1 |  dual specificity protein phosphatase-related 
MTDLQMEVEVDTNSSLQESLPKPQVMYRCKKCRRIVAIEENIVPHEPGKGEECFAWKKRS 
GNSEQVQCSSIFVEPMKWMQTIHDGMVEEKLLCFGCNGRLGYFNWAGMQCSCGAWVNPAF 
QLNKSRIDECKSEPNPNLNMET*
>AT5G66640.1 |  DAR3 (DA1-RELATED PROTEIN 3) 
MVRRKRQEEDEKIEIERVKEESLKLAKQAEEKRRLEESKEQGKRIQVDDDQLAKTTSKDK 
GQINHSKDVVEEDVNPPPSIDGKSEIGDGTSVNPRCLCCFHCHRPFVMHEILKKGKFHID 
CYKEYYRNRNCYVCQQKIPVNAEGIRKFSEHPFWKEKYCPIHDEDGTAKCCSCERLEPRG 
TNYVMLGDFRWLCIECMGSAVMDTNEVQPLHFEIREFFEGLFLKVDKEFALLLVEKQALN 
KAEEEEKIDYHRAAVTRGLCMSEEQIVPSIIKGPRMGPDNQLITDIVTESQRVSGFEVTG 
ILIIYGLPRLLTGYILAHEMMHAWLRLNGYKNLKLELEEGLCQALGLRWLESQTFASTDA 
AAAAAVASSSSFSSSTAPPAAITSKKSDDWSIFEKKLVEFCMNQIKEDDSPVYGLGFKQV 
YEMMVSNNYNIKDTLKDIVSASNATPDSTV*
>AT5G44990.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN stem CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G198801) Has 1632 Blast hits to 1632 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 156 Plants - 57 Viruses - 0 Other Eukaryotes - 479 (source NCBI BLink) 
MATPMENENPNFARTATSFRNFVSKDPDSQFPAESGRYHLYISYACPWASRCLAILKLKG 
LDKAISFSSVQPLWRNTKENDEHMGWVFPDSDTEVLGAERDHINGAKSVRELYDIASSNY 
TGKYTVPVLWDKKLKTIVNNESSEILRMFNTEFNHVAENPSLDLYPPNLRAIIDETNEWI 
HDGINNGVYKCGFATNQETYDVEVKRLYEALDRCEDILRKQRFLCGNTLTESDIRLFVTV 
IRFDEAYAVIFKCDKRLVREYYHLFNYTKDIYQIAGMSSTVKMDHIKQNYYGSFPSINPL 
EIIAHGPNIDYSLPHDRHRFSLESDYTRLELFESASFVCELKLIEIFDSL*
>AT5G44990.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN stem CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G198801) Has 1632 Blast hits to 1632 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 156 Plants - 57 Viruses - 0 Other Eukaryotes - 479 (source NCBI BLink) 
MATPMENENPNFARTATSFRNFVSKDPDSQFPAESGRYHLYISYACPWASRCLAILKLKG 
LDKAISFSSVQPLWRNTKENDEHMGWVFPDSDTEVLGAERDHINGAKSVRELYDIASSNY 
TGKYTVPVLWDKKLKTIVNNESSEILRMFNTEFNHVAENPSLDLYPPNLRAIIDETNEWI 
HDGINNGVYKCGFATNQETYDVEVKRLYEALDRCEDILRKQRFLCGNTLTESDIRLFVTV 
IRFDEAYAVIFKCDKRLVREYYHLFNYTKDIYQIAGMSSTVKMDHIKQNYYGSFPSINPL 
EIIAHGPNIDYSLPHDRHRFSLESDYTRLELFESASFVCELKLIEIFDSL*
>AT5G44990.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN stem CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G198801) Has 1632 Blast hits to 1632 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 156 Plants - 57 Viruses - 0 Other Eukaryotes - 479 (source NCBI BLink) 
MGWVFPDSDTEVLGAERDHINGAKSVRELYDIASSNYTGKYTVPVLWDKKLKTIVNNESS 
EILRMFNTEFNHVAENPSLDLYPPNLRAIIDETNEWIHDGINNGVYKCGFATNQETYDVE 
VKRLYEALDRCEDILRKQRFLCGNTLTESDIRLFVTVIRFDEAYAVIFKCDKRLVREYYH 
LFNYTKDIYQIAGMSSTVKMDHIKQNYYGSFPSINPLEIIAHGPNIDYSLPHDRHRFSLE 
SDYTRLELFESASFVCELKLIEIFDSL*
>AT5G44990.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN stem CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G198801) Has 1562 Blast hits to 1562 proteins in 487 species Archae - 12 Bacteria - 896 Metazoa - 23 Fungi - 156 Plants - 57 Viruses - 0 Other Eukaryotes - 418 (source NCBI BLink) 
MATPMENENPNFARTATSFRNFVSKDPDSQFPAESGRYHLYISYACPWASRCLAILKLKG 
LDKAISFSSVQPLWRNTKENDEHMGWVFPDSDTEVLGAERDHINGAKSVRELYDIASSNY 
TGKYTVPVLWDKKLKTIVNNESSEILRMFNTEFNHVAENPSLDLYPPNLRAIIDETNEWI 
HDGINNGVYKCGFATNQETYDVEVKRLYEALDRCEDILRKQRFLCGNTLTESDIRLFVTV 
IRFDEAYAVIFKCDKRLVREYYHLFNYTKDIYQIAGMSSTVKMDHIKQNYYGSFPSINPL 
EIIAHGPNIDYSLPHDRHRFSLESDYTRLELFESASFVCELKLIEIFDSL*
>AT5G44990.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN stem CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G198801) Has 1562 Blast hits to 1562 proteins in 487 species Archae - 12 Bacteria - 896 Metazoa - 23 Fungi - 156 Plants - 57 Viruses - 0 Other Eukaryotes - 418 (source NCBI BLink) 
MATPMENENPNFARTATSFRNFVSKDPDSQFPAESGRYHLYISYACPWASRCLAILKLKG 
LDKAISFSSVQPLWRNTKENDEHMGWVFPDSDTEVLGAERDHINGAKSVRELYDIASSNY 
TGKYTVPVLWDKKLKTIVNNESSEILRMFNTEFNHVAENPSLDLYPPNLRAIIDETNEWI 
HDGINNGVYKCGFATNQETYDVEVKRLYEALDRCEDILRKQRFLCGNTLTESDIRLFVTV 
IRFDEAYAVIFKCDKRLVREYYHLFNYTKDIYQIAGMSSTVKMDHIKQNYYGSFPSINPL 
EIIAHGPNIDYSLPHDRHRFSLESDYTRLELFESASFVCELKLIEIFDSL*
>AT5G44990.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN stem CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G198801) Has 1562 Blast hits to 1562 proteins in 487 species Archae - 12 Bacteria - 896 Metazoa - 23 Fungi - 156 Plants - 57 Viruses - 0 Other Eukaryotes - 418 (source NCBI BLink) 
MGWVFPDSDTEVLGAERDHINGAKSVRELYDIASSNYTGKYTVPVLWDKKLKTIVNNESS 
EILRMFNTEFNHVAENPSLDLYPPNLRAIIDETNEWIHDGINNGVYKCGFATNQETYDVE 
VKRLYEALDRCEDILRKQRFLCGNTLTESDIRLFVTVIRFDEAYAVIFKCDKRLVREYYH 
LFNYTKDIYQIAGMSSTVKMDHIKQNYYGSFPSINPLEIIAHGPNIDYSLPHDRHRFSLE 
SDYTRLELFESASFVCELKLIEIFDSL*
>AT5G44000.1 |  glutathione S-transferase C-terminal domain-containing protein 
MANCFAPQLTFPSFSPRHFSPRMSHQSPKPSTSTTTSIFTSATKLLWGPSLPPGLLISTA 
RTAWTTVWQLMMTQLAPSDSSGSYTRPTSKFRLDPTQFTSAASSELHLYVGLPCPWAHRT 
LIVRALKGLNDAVPVSIASPGQDGSWEFKNNNIPIKDKDKLIPSLDKANRCRNLKEVYKS 
RSGGYDGRCTVPMLWDLRKKDVVCNESYDIIEFFNSGLNKLARNDNLDLSPPELKEMIQG 
WNQIVYPKVNNGVYRCGFAQSQEAYDGAVNELFSTLDEIEDHLGSNRYLCGERLTLADVC 
LFTTLIRFDSVYNILFKCTKKKLVEYPNLYGYLREIYQIPGVAATCDISAIMDGYYKTLF 
PLNASGIQPAISSSGDQDSLWRPHNRDLVGKAIEAQLSV*
>AT4G19880.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal (InterProIPR004046) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1635 Blast hits to 1635 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 158 Plants - 57 Viruses - 0 Other Eukaryotes - 480 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVYAVHFKCNKKLIREYPNLFN 
YTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPFGIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1621 Blast hits to 1621 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 155 Plants - 57 Viruses - 0 Other Eukaryotes - 469 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVYAVHFKCNKKLIREYPNLFN 
YTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPFGIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal (InterProIPR004046) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1635 Blast hits to 1635 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 158 Plants - 57 Viruses - 0 Other Eukaryotes - 480 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSSYFQSKKKKYTICERISNV 
ETLIQVYAVHFKCNKKLIREYPNLFNYTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPF 
GIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1621 Blast hits to 1621 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 155 Plants - 57 Viruses - 0 Other Eukaryotes - 469 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSSYFQSKKKKYTICERISNV 
ETLIQVYAVHFKCNKKLIREYPNLFNYTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPF 
GIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal (InterProIPR004046) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1635 Blast hits to 1635 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 158 Plants - 57 Viruses - 0 Other Eukaryotes - 480 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSLCSPLQMQQETHKGVSEFV 
QLHERHFPDPRYE*
>AT4G19880.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1621 Blast hits to 1621 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 155 Plants - 57 Viruses - 0 Other Eukaryotes - 469 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSLCSPLQMQQETHKGVSEFV 
QLHERHFPDPRYE*
>AT4G19880.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1567 Blast hits to 1567 proteins in 487 species Archae - 12 Bacteria - 899 Metazoa - 23 Fungi - 155 Plants - 55 Viruses - 0 Other Eukaryotes - 423 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVYAVHFKCNKKLIREYPNLFN 
YTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPFGIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1567 Blast hits to 1567 proteins in 487 species Archae - 12 Bacteria - 899 Metazoa - 23 Fungi - 155 Plants - 55 Viruses - 0 Other Eukaryotes - 423 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSSYFQSKKKKYTICERISNV 
ETLIQVYAVHFKCNKKLIREYPNLFNYTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPF 
GIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1567 Blast hits to 1567 proteins in 487 species Archae - 12 Bacteria - 899 Metazoa - 23 Fungi - 155 Plants - 55 Viruses - 0 Other Eukaryotes - 423 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSLCSPLQMQQETHKGVSEFV 
QLHERHFPDPRYE*