>AT3G26590.1 |  MATE efflux family protein 
MAKDKDITETLLTAAEERSDLPFLSVDDIPPITTVGGFVREFNVETKKLWYLAGPAIFTS 
VNQYSLGAITQVFAGHISTIALAAVSVENSVVAGFSFGIMLGMGSALETLCGQAFGAGKL 
SMLGVYLQRSWVILNVTALILSLLYIFAAPILASIGQTAAISSAAGIFSIYMIPQIFAYA 
INFPTAKFLQSQSKIMVMAVISAVALVIHVPLTWFVIVKLQWGMPGLAVVLNASWCFIDM 
AQLVYIFSGTCGEAWSGFSWEAFHNLWSFVRLSLASAVMLCLEVWYFMAIILFAGYLKNA 
EISVAALSICMNILGWTAMIAIGMNTAVSVRVSNELGANHPRTAKFSLLVAVITSTLIGF 
IVSMILLIFRDQYPSLFVKDEKVIILVKELTPILALSIVINNVQPVLSGVAVGAGWQAVV 
AYVNIACYYVFGIPFGLLLGYKLNYGVMGIWCGMLTGTVVQTIVLTWMICKTNWDTEASM 
AEDRIREWGGEVSEIKQLIN*
>AT2G45330.1 |  emb1067 (embryo defective 1067) tRNA 2-phosphotransferase/ transferase transferring phosphorus-containing groups 
MLLRLCRHRLPLPLTLSSSVFFSKSSPVSSAFMDASNPNSSRKSNVSSFAQSSRSGGRGG 
GYERDNDRRRPQGRGDGGGGKDRIDALGRLLTRILRHMATELRLNMRGDGFVKVEDLLNL 
NLKTSANIQLKSHTIDEIREAVRRDNKQRFSLIDENGELLIRANQGHSITTVESEKLLKP 
ILSPEEAPVCVHGTYRKNLESILASGLKRMNRMHVHFSCGLPTDGEVISGMRRNVNVIIF 
LDIKKALEDGIAFYISDNKVILTEGIDGVLPVDYFQKIESWPDRQSIPF*
>AT2G45330.1 |  emb1067 (embryo defective 1067) tRNA 2-phosphotransferase/ transferase transferring phosphorus-containing groups 
MLLRLCRHRLPLPLTLSSSVFFSKSSPVSSAFMDASNPNSSRKSNVSSFAQSSRSGGRGG 
GYERDNDRRRPQGRGDGGGGKDRIDALGRLLTRILRHMATELRLNMRGDGFVKVEDLLNL 
NLKTSANIQLKSHTIDEIREAVRRDNKQRFSLIDENGELLIRANQGHSITTVESEKLLKP 
ILSPEEAPVCVHGTYRKNLESILASGLKRMNRMHVHFSCGLPTDGEVISGMRRNVNVIIF 
LDIKKALEDGIAFYISDNKVILTEGIDGVLPVDYFQKIESWPDRQSIPF*
>AT2G45330.2 |  emb1067 (embryo defective 1067) tRNA 2-phosphotransferase/ transferase transferring phosphorus-containing groups 
MLLRLCRHRLPLPLTLSSSVFFSKSSPVSSAFMDASNPNSSRKSNVSSFAQSSRRGGGYE 
RDNDRRRPQGRGDGGGGKDRIDALGRLLTRILRHMATELRLNMRGDGFVKVEDLLNLNLK 
TSANIQLKSHTIDEIREAVRRDNKQRFSLIDENGELLIRANQGHSITTVESEKLLKPILS 
PEEAPVCVHGTYRKNLESILASGLKRMNRMHVHFSCGLPTDGEVISGMRRNVNVIIFLDI 
KKALEDGIAFYISDNKVILTEGIDGVLPVDYFQKIESWPDRQSIPF*
>AT2G45330.2 |  emb1067 (embryo defective 1067) tRNA 2-phosphotransferase/ transferase transferring phosphorus-containing groups 
MLLRLCRHRLPLPLTLSSSVFFSKSSPVSSAFMDASNPNSSRKSNVSSFAQSSRRGGGYE 
RDNDRRRPQGRGDGGGGKDRIDALGRLLTRILRHMATELRLNMRGDGFVKVEDLLNLNLK 
TSANIQLKSHTIDEIREAVRRDNKQRFSLIDENGELLIRANQGHSITTVESEKLLKPILS 
PEEAPVCVHGTYRKNLESILASGLKRMNRMHVHFSCGLPTDGEVISGMRRNVNVIIFLDI 
KKALEDGIAFYISDNKVILTEGIDGVLPVDYFQKIESWPDRQSIPF*
>AT4G26900.1 |  AT-HF (HIS HF) imidazoleglycerol-phosphate synthase 
MEATAAPFSSIVSSRQNFSSSSSIRASSPASLFLSQKSIGNVNRKFKSPRSLSVRASSTS 
DSVVTLLDYGAGNVRSIRNALRHLGFSIKDVQTPGDILNADRLIFPGVGAFAPAMDVLNR 
TGMAEALCKYIENDRPFLGICLGLQLLFDSSEENGPVKGLGVIPGIVGRFDASAGIRVPH 
IGWNALQVGKDSEILDDVGNRHVYFVHSYRAIPSDENKDWISSTCNYGESFISSIRRGNV 
HAVQFHPEKSGEVGLSVLRRFLHPKLPATQKPMEGKASKLAKRVIACLDVRTNDKGDLVV 
TKGDQYDVREQSNENEVRNLGKPVDLAGQYYKDGADEISFLNITGFRDFPLGDLPMIQVL 
RQTSKNVFVPLTVGGGIRDFTDASGRYYSSLEVAAEYFRSGADKISIGSDAVSAAEEFIK 
SGVKTGKSSLEQISRVYGNQAVVVSIDPRRVYVNHPDDVPYKVIRVTNPGPNGEEYAWYQ 
CTVSGGREGRPIGAFELAKAVEELGAGEILLNCIDCDGQGKGFDIDLVKLISDSVGIPVI 
ASSGAGTPDHFSEVFEKTNASAALAAGIFHRKEVPIQSVKEHLQEERIEVRI*
>AT3G53870.1 |  40S ribosomal protein S3 (RPS3B) 
MTTQISKKRKFVADGVFYAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLG 
EKGRRIRELTSLVQKRFKFPVDSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACY 
GVLRFVMESGAKGCEVIVSGKLRAARAKSMKFKDGYMVSSGQPTKEYIDSAVRHVLLRQG 
VLGIKVKVMLDWDPKGISGPKTPLPDVVIIHSPKEEEAIYAPAQVAAPAALVADAPLTAV 
DYPAMIPVA*
>AT4G25630.1 |  FIB2 (FIBRILLARIN 2) snoRNA binding 
MRPPLTGSGGGFSGGRGRGGYSGGRGDGGFSGGRGGGGRGGGRGFSDRGGRGRGRGPPRG 
GARGGRGPAGRGGMKGGSKVIVEPHRHAGVFIAKGKEDALVTKNLVPGEAVYNEKRISVQ 
NEDGTKTEYRVWNPFRSKLAAAILGGVDNIWIKPGAKVLYLGAASGTTVSHVSDLVGPEG 
CVYAVEFSHRSGRDLVNMAKKRTNVIPIIEDARHPAKYRMLVGMVDVIFSDVAQPDQARI 
LALNASYFLKSGGHFVISIKANCIDSTVPAEAVFQTEVKKLQQEQFKPAEQVTLEPFERD 
HACVVGGYRMPKKPKAATAA*
>AT3G23620.1 |  brix domain-containing protein 
MMEIRTPKTGKAKRVLESRAPKLVETGKKTLILHGTKTSATLSSVMTELYRLKKGGAIRY 
SRRNENIRPFESGGETSLEFFSQKTDCSIFVYGSHTKKRPDNLVLGRMYDHQVYDLIEVG 
IENFKSLRAFSYDKKFAPHEGTKPFICFIGEGFENVSELKHLKEVLTDLFRGEVVDNLNL 
TGLDRAYVCSAISPTKVFLTHCALKLKKSGSIVPRMELVEVGPSMDLVIRRNRLPNDSLM 
KEAMRTSKDKPKKKEKNVDQDAVLGKTGKIYMPDQKLKEMKLFDKSKGSKRERKDAKLKH 
KEETVAKKMKVSSE*
>AT1G03330.1 |  small nuclear ribonucleoprotein D putative / snRNP core SM-like protein putative / U6 snRNA-associated Sm-like protein putative 
MLFFSYFKDLVGQEVTVELKNDLAIRGTLHSVDQYLNIKLENTRVVDQDKYPHMLSVRNC 
FIRGSVVRYVQLPKDGVDVDLLHDAARREARGG*
>AT1G10490.1 |  unknown protein 
MRKKVDERIRTLIENGVKLRHRSMFVIIGDKARDQIVNLHHILSKSVVKSNPSVLWCYKN 
RLDISSHNKKRAKQLKKMKERGQLDPEKLDAFSLFLDVVDVTHCLYKDSERILGNTFGIC 
ILQDFEALTPNLLARTIETVEGGGLVVLLLQSLASLTSLCTMVMDVHDRFRTESHSEASG 
RFNERFLLSLASCKACVVMDDELNLLPLSSHIKSITKVPTKEDSEALSEAERDLKSLKDA 
LNDDFPVGPLINKCCTLDQGKAVVTFFDAILDKTLRSIVALIASRGRGKSAALGLAVAGA 
VAAGYSNIYVTAPSPDNLKTVFEFVCKGFDALEYKEHLEYDVVRSVNPEFNKAIVRINIF 
KQHRQTIQYIQPHEHEKLSQVELLVIDEAAAIPLPVVKSLLGPYLVFLSSTVSGYEGTGR 
SLSLKLLQQLEEQSRAPVTGVEGSLSGCLFKKIELSESIRYASGDPIESWLNGLLCLDVA 
NCLPNPACHPLPSQCDLYYVNRDTLFSYHKDSELFLQRMMALCVSSHYKNSPNDLQLLSD 
APAHHLFVLLGPVDESKNQLPDILCVIQVCLEGQISRKSAEKSLREGHSPHGDQIPWKFC 
EQFRDVVFPKLSGARIVRIAVHPNAMKMGYGSAAVELLTRYFEGQLASISEGDDELEVEP 
SPVRVTEAAAKVSLLEEQIKPRANLPPLLVPLRDRRPERLHYIGVSFGLTLDLFRFWRKH 
KFAPFYISQIPSAVTGEHTCMLLKPLTLSNDEFEVDESDELGFFAPFYKDFRIRFSKLLS 
DKFKKMDYKLAMSVLNPKINFPEVDLTGNSPDGFLKKLDGVLSPYDMERFRAYTANLVDF 
NLVYDICKTLAHHYFQEKLPVSLSYVQASVLLCLGLQESDFSSIERQMQLERGQIYSLLL 
KVGKKLYKYLNGIATKELESTLPRLKDRVLEPHKVSVDEDLREGAKEVEEQMRARIEELL 
DPELLDQFAIGDKEAEALQKSKISSSGLISIESTKTDNKKEKPSGFDKSAKKRGNDKHSS 
TSNKKRRA*
>AT1G07360.1 |  zinc finger (CCCH-type) family protein / RNA recognition motif (RRM)-containing protein 
MAHRILRDHEADGWERSDFPIICESCLGDNPYVRMTKANYDKECKICTRPFTVFRWRPGR 
DARYKKTEICQTCCKLKNVCQVCLLDLEYGLPVQVRDTALNISTHDSIPKSDVNREYFAE 
EHDRKARAGLDYESSFGKMRPNDTILKLQRTTPYYKRNRAHVCSFFIRGECTRGAECPYR 
HEMPETGELSQQNIKDRYYGVNDPVAMKLLGKAGEMGTLESPDDESIKTLYVGGLNSRIL 
EQDIRDQFYAHGEIESIRILADKACAFVTYTSREGAEKAAQELSNRLVINGQRLKLTWGR 
PKPDQDGANQQGGVAHSGLLPRAVISQQHNQPPPMQQYYMHPPPANQDKPYYPSMDPQRM 
GAVISTQEAGGSSTENNGASSSSYMMPPHQSYPPPPYGYMPSPYQQQYPPNHHHQPSPMQ 
HYAPPPAAYPYPQQPGPGSRPAPSPTAVSAISPDSAPAGSGAPSGSSQQAPDVSTATGSS 
Q*
>AT4G14240.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF21 (InterProIPR002550) Cystathionine beta-synthase core (InterProIPR000644) BEST Arabidopsis thaliana protein match is CBS domain-containing protein-related (TAIRAT4G142301) Has 6770 Blast hits to 6657 proteins in 1347 species Archae - 62 Bacteria - 4461 Metazoa - 254 Fungi - 179 Plants - 121 Viruses - 0 Other Eukaryotes - 1693 (source NCBI BLink) 
MHLINAVAAARILSGIGQSNGNNGGEAIPFGSFEWITYAGISCFLVLFAGIMSGLTLGLM 
SLGLVELEILQRSGTPNEKKQAAAIFPVVQKQHQLLVTLLLCNAMAMEGLPIYLDKLFNE 
YVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTLCYPIAFPIGKILDLVL 
GHNDALFRRAQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAQEAMTPIESTFS 
LDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLVSAVCIRR 
IPRVPADMPLYDILNEFQKGSSHMAAVVKVKGKSKVPPSTLLEEHTDESNDSDLTAPLLL 
KREGNHDNVIVTIDKANGQSFFQNNESGPHGFSHTSEAIEDGEVIGIITLEDVFEELLQE 
EIVDETDEYVDVHKRIRVAAAAAASSIARAPSSRKLLAQKGTGGQNKQGQTNKVPGQEQD 
KMLGTITEPIRRNN*
>AT4G14240.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF21 (InterProIPR002550) Cystathionine beta-synthase core (InterProIPR000644) BEST Arabidopsis thaliana protein match is CBS domain-containing protein-related (TAIRAT4G142301) Has 6735 Blast hits to 6622 proteins in 1349 species Archae - 62 Bacteria - 4446 Metazoa - 254 Fungi - 179 Plants - 121 Viruses - 0 Other Eukaryotes - 1673 (source NCBI BLink) 
MHLINAVAAARILSGIGQSNGNNGGEAIPFGSFEWITYAGISCFLVLFAGIMSGLTLGLM 
SLGLVELEILQRSGTPNEKKQAAAIFPVVQKQHQLLVTLLLCNAMAMEGLPIYLDKLFNE 
YVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTLCYPIAFPIGKILDLVL 
GHNDALFRRAQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAQEAMTPIESTFS 
LDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLVSAVCIRR 
IPRVPADMPLYDILNEFQKGSSHMAAVVKVKGKSKVPPSTLLEEHTDESNDSDLTAPLLL 
KREGNHDNVIVTIDKANGQSFFQNNESGPHGFSHTSEAIEDGEVIGIITLEDVFEELLQE 
EIVDETDEYVDVHKRIRVAAAAAASSIARAPSSRKLLAQKGTGGQNKQGQTNKVPGQEQD 
KMLGTITEPIRRNN*
>AT4G14240.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF21 (InterProIPR002550) Cystathionine beta-synthase core (InterProIPR000644) BEST Arabidopsis thaliana protein match is CBS domain-containing protein-related (TAIRAT4G142301) Has 6770 Blast hits to 6657 proteins in 1347 species Archae - 62 Bacteria - 4461 Metazoa - 254 Fungi - 179 Plants - 121 Viruses - 0 Other Eukaryotes - 1693 (source NCBI BLink) 
MHLINAVAAARILSGIGQSNGNNGGEAIPFGSFEWITYAGISCFLVLFAGIMSGLTLGLM 
SLGLVELEILQRSAAIFPVVQKQHQLLVTLLLCNAMAMEGLPIYLDKLFNEYVAIILSVT 
FVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTLCYPIAFPIGKILDLVLGHNDALFRR 
AQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAQEAMTPIESTFSLDVNSKLDW 
EAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLVSAVCIRRIPRVPADMP 
LYDILNEFQKGSSHMAAVVKVKGKSKVPPSTLLEEHTDESNDSDLTAPLLLKREGNHDNV 
IVTIDKANGQSFFQNNESGPHGFSHTSEAIEDGEVIGIITLEDVFEELLQEEIVDETDEY 
VDVHKRIRVAAAAAASSIARAPSSRKLLAQKGTGGQNKQGQTNKVPGQEQDKMLGTITEP 
IRRNN*
>AT4G14240.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF21 (InterProIPR002550) Cystathionine beta-synthase core (InterProIPR000644) BEST Arabidopsis thaliana protein match is CBS domain-containing protein-related (TAIRAT4G142301) Has 6735 Blast hits to 6622 proteins in 1349 species Archae - 62 Bacteria - 4446 Metazoa - 254 Fungi - 179 Plants - 121 Viruses - 0 Other Eukaryotes - 1673 (source NCBI BLink) 
MHLINAVAAARILSGIGQSNGNNGGEAIPFGSFEWITYAGISCFLVLFAGIMSGLTLGLM 
SLGLVELEILQRSAAIFPVVQKQHQLLVTLLLCNAMAMEGLPIYLDKLFNEYVAIILSVT 
FVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTLCYPIAFPIGKILDLVLGHNDALFRR 
AQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAQEAMTPIESTFSLDVNSKLDW 
EAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLVSAVCIRRIPRVPADMP 
LYDILNEFQKGSSHMAAVVKVKGKSKVPPSTLLEEHTDESNDSDLTAPLLLKREGNHDNV 
IVTIDKANGQSFFQNNESGPHGFSHTSEAIEDGEVIGIITLEDVFEELLQEEIVDETDEY 
VDVHKRIRVAAAAAASSIARAPSSRKLLAQKGTGGQNKQGQTNKVPGQEQDKMLGTITEP 
IRRNN*