>AT5G49130.1 | MATE efflux family protein
MVVEEDSRLINLQHKYNPTMPEVVEELKRIWDISFPVAAMSILNYLKNMTSVVCMGRLGS
LELAGGALAIGFTNITGYSVLSGLATGMEPLCGQAIGSKNPSLASLTLKRTIFLLLLASL
PISLLWLNLAPLMLMLRQQHDITRVASLYCSFSLPDLLANSFLHPLRIYLRCKGTTWPLM
WCTLVSVLLHLPITAFFTFYISLGVPGVAVSSFLTNFISLSLLLCYIYLENNNNDKTTSK
SLCLDTPLMLYGSRDSGENDVWSTLVKFAVPSCIAVCLEWWWYEFMTVLAGYLPEPKVAL
AAAAIVIQTTSLMYTIPTALSAAVSTRVSNELGAGRPEKAKTAATVAVGAAVAVSVFGLV
GTTVGREAWGKVFTADKVVLELTAAVIPVIGACELANCPQTISCGILRGSARPGIGAKIN
FYAFYVVGAPVAVVLAFVWGLGFMGLCYGLLGAQLACAISILTVVYNTDWNKESLKAHDL
VGKNVISPNVDQIIVKCEEGLH*
>AT3G26590.1 | MATE efflux family protein
MAKDKDITETLLTAAEERSDLPFLSVDDIPPITTVGGFVREFNVETKKLWYLAGPAIFTS
VNQYSLGAITQVFAGHISTIALAAVSVENSVVAGFSFGIMLGMGSALETLCGQAFGAGKL
SMLGVYLQRSWVILNVTALILSLLYIFAAPILASIGQTAAISSAAGIFSIYMIPQIFAYA
INFPTAKFLQSQSKIMVMAVISAVALVIHVPLTWFVIVKLQWGMPGLAVVLNASWCFIDM
AQLVYIFSGTCGEAWSGFSWEAFHNLWSFVRLSLASAVMLCLEVWYFMAIILFAGYLKNA
EISVAALSICMNILGWTAMIAIGMNTAVSVRVSNELGANHPRTAKFSLLVAVITSTLIGF
IVSMILLIFRDQYPSLFVKDEKVIILVKELTPILALSIVINNVQPVLSGVAVGAGWQAVV
AYVNIACYYVFGIPFGLLLGYKLNYGVMGIWCGMLTGTVVQTIVLTWMICKTNWDTEASM
AEDRIREWGGEVSEIKQLIN*
>AT3G62870.1 | 60S ribosomal protein L7A (RPL7aB)
MAPKKGVKVASKKKPEKVTNPLFERRPKQFGIGGALPPKKDLSRYIKWPKSIRLQRQKRI
LKQRLKVPPALNQFTKTLDKNLATSLFKILLKYRPEDKAAKKERLLNKAQAEAEGKPAES
KKPIVVKYGLNHVTYLIEQNKAQLVVIAHDVDPIELVVWLPALCRKMEVPYCIVKGKSRL
GAVVHQKTAAALCLTTVKNEDKLEFSKILEAIKANFNDKYEEYRKKWGGGIMGSKSQAKT
KAKERVIAKEAAQRMN*
>AT5G16820.1 | HSF3 (HEAT SHOCK FACTOR 3) DNA binding / transcription factor
MESVPESVPSPNSNTPSIPPPVNSVPPFLSKTYDMVDDPLTNEVVSWSSGNNSFVVWSAP
EFSKVLLPKYFKHNNFSSFVRQLNTYGFRKVDPDRWEFANEGFLRGRKQLLKSIVRRKPS
HVQQNQQQTQVQSSSVGACVEVGKFGIEEEVERLKRDKNVLMQELVRLRQQQQATENQLQ
NVGQKVQVMEQRQQQMMSFLAKAVQSPGFLNQLVQQNNNDGNRQIPGSNKKRRLPVDEQE
NRGDNVANGLNRQIVRYQPSINEAAQNMLRQFLNTSTSPRYESVSNNPDSFLLGDVPSST
SVDNGNPSSRVSGVTLAEFSPNTVQSATNQVPEASLAHHPQAGLVQPNIGQSPAQGAAPA
DSWSPEFDLVGCETDSGECFDPIMAVLDESEGDAISPEGEGKMNELLEGVPKLPGIQDPF
WEQFFSVELPAIADTDDILSGSVENNDLVLEQEPNEWTRNEQQMKYLTEQMGLLSSEAQR
K*
>AT5G16820.1 | HSF3 (HEAT SHOCK FACTOR 3) DNA binding / transcription factor
MESVPESVPSPNSNTPSIPPPVNSVPPFLSKTYDMVDDPLTNEVVSWSSGNNSFVVWSAP
EFSKVLLPKYFKHNNFSSFVRQLNTYGFRKVDPDRWEFANEGFLRGRKQLLKSIVRRKPS
HVQQNQQQTQVQSSSVGACVEVGKFGIEEEVERLKRDKNVLMQELVRLRQQQQATENQLQ
NVGQKVQVMEQRQQQMMSFLAKAVQSPGFLNQLVQQNNNDGNRQIPGSNKKRRLPVDEQE
NRGDNVANGLNRQIVRYQPSINEAAQNMLRQFLNTSTSPRYESVSNNPDSFLLGDVPSST
SVDNGNPSSRVSGVTLAEFSPNTVQSATNQVPEASLAHHPQAGLVQPNIGQSPAQGAAPA
DSWSPEFDLVGCETDSGECFDPIMAVLDESEGDAISPEGEGKMNELLEGVPKLPGIQDPF
WEQFFSVELPAIADTDDILSGSVENNDLVLEQEPNEWTRNEQQMKYLTEQMGLLSSEAQR
K*
>AT5G16820.2 | HSF3 (HEAT SHOCK FACTOR 3) DNA binding / transcription factor
MESVPESVPSPNSNTPSIPPPVNSVPPFLSKTYDMVDDPLTNEVVSWSSGNNSFVVWSAP
EFSKVLLPKYFKHNNFSSFVRQLNTYGFRKVDPDRWEFANEGFLRGRKQLLKSIVRRKPS
HVQQNQQQTQVQSSSVGACVEVGKFGIEEEVERLKRDKNVLMQELVRLRQQQQATENQLQ
NVGQKVQVMEQRQQQMMSFLAKAVQSPGFLNQLVQQNNNDGNRQIPGSNKKRRLPVDEQE
NRGDNVANGLNRQIVRYQPSINEAAQNMLRQFLNTSTSPRYESVSNNPDSFLLGDVPSST
SVDNGNPSSRVSGVTLAEFSPNTVQSATNQVPEASLAHHPQAGLVQPNIGQSPAQGAAPA
DSWSPEFDLVGCETDSGECFDPIMAVLDESEGDAISPEGEGKMNELLEGVPKLPGIQDPF
WEQFFSVELPAIADTDDILSGSVENNDLVLEQEPNEWTRNEQQMKYLTEQMGLLSSEAQR
K*
>AT5G16820.2 | HSF3 (HEAT SHOCK FACTOR 3) DNA binding / transcription factor
MESVPESVPSPNSNTPSIPPPVNSVPPFLSKTYDMVDDPLTNEVVSWSSGNNSFVVWSAP
EFSKVLLPKYFKHNNFSSFVRQLNTYGFRKVDPDRWEFANEGFLRGRKQLLKSIVRRKPS
HVQQNQQQTQVQSSSVGACVEVGKFGIEEEVERLKRDKNVLMQELVRLRQQQQATENQLQ
NVGQKVQVMEQRQQQMMSFLAKAVQSPGFLNQLVQQNNNDGNRQIPGSNKKRRLPVDEQE
NRGDNVANGLNRQIVRYQPSINEAAQNMLRQFLNTSTSPRYESVSNNPDSFLLGDVPSST
SVDNGNPSSRVSGVTLAEFSPNTVQSATNQVPEASLAHHPQAGLVQPNIGQSPAQGAAPA
DSWSPEFDLVGCETDSGECFDPIMAVLDESEGDAISPEGEGKMNELLEGVPKLPGIQDPF
WEQFFSVELPAIADTDDILSGSVENNDLVLEQEPNEWTRNEQQMKYLTEQMGLLSSEAQR
K*
>AT1G54560.1 | XIE motor/ protein binding
MRNSGTPVNIIVGSHVWIEDSDVAWIDGLVEKINGQDVEVQATNGKKITAKLSKIYPKDM
EAPAGGVDDMTKLSYLHEPGVLQNLKIRYELNEIYTYTGNILIAINPFQRLPHIYDAHMM
QQYKGAPFGELSPHVFAVADVAYRAMINEGKSNSILVSGESGAGKTETTKMLMRYLAYLG
GRAVTEGRTVEQQVLESNPVLEAFGNAKTVRNNNSSRFGKFVEIQFDKQGRISGAAVRTY
LLERSRVCQISDPERNYHCFYLLCAAPQEELEKYKLGHPKTFHYLNQSKCFELVGISDAH
DYIATRRAMDIVGMSEKEQEAIFRVVAAILHLGNVEFTKGKEVDSSVPKDDKSKFHLNTV
AELLMCDVKALEDALCKRVMVTPEEVIKRSLDPQSALISRDGLAKTIYSRLFDWLVEKIN
VSIGQDATSRSLIGVLDIYGFESFKTNSFEQFCINFTNEKLQQHFNQHVFKMEQEEYTKE
AIDWSYIEFVDNQDVLDLIEKKPGGIVALLDEACMFPKSTHETFANKLYQTFKTHKRFIK
PKLSRTDFAVAHYAGEVQYQSDLFLDKNKDYVIPEHQDLLGASKCPFVVGLFPPLPEETS
KSSKFSSIGSRFKLQLQQLMETLNSTEPHYIRCVKPNNLLKPAVFENVNIMQQLRCGGVL
EAIRISCAGYPTRKPFFEFINRFGLLYPRALEGNYEEKAAAQKILDNIGLKGYQVGKTKV
FLRAGQMAELDARRTMVLSAAAKKIQRRIRTHQAQRRFILLRKATISLQALCRGRLSSKI
FDNLRRQAAAVKIQKNARRLHSRKSYKNLHVAALVVQTGLRAMAAHKQFRFRKQTKAATT
IQAQFRCHRATLYFKKLKKGVILSQTRWRGKLARRELRQLKMASRETGALKEAKDMLEKK
VEELTYRAQLEKRSRVDLEEEKNQEIKKLQSSLEEMRKKVDETNGLLVKEREAAKKAIEE
APPVVTETQVLVEDTQKIEALTEEVEGLKANLEQEKQRADDATRKFDEAQESSEDRKKKL
EDTEKKAQQLQESVTRLEEKCNNLESENKVLRQQAVSIAPNKFLSGRSRSILQRGSESGH
LSVDARPSLDLHSHSINRRDLSEVDDKPQKSLNEKQQENQELLIRCIVQHLGFQGKRPVT
ACIIYKCLLQWRSFEVERTSVFDRIIQTIGQAIETQDNNNILAYWLSNASTLLLLLQRTL
KASGAAGMAPQRRRSSSATLFGRMTQSFRGTPQGVNLAMINGGVDTLRQVEAKYPALLFK
QQLTAYVEKIYGMIRDNLKKEISPLLGLCIQAPRTSRASLVKGASRSVGNTAAQQALIAH
WQGIVKSLTNFLNNLKSNHVPPFLVRKVFTQIFSFINVQLFNSLLLRRECCSFSNGEYVK
AGLAELEHWCYNATDEYAGSSWDELKHIRQAIGFLVIHQKPKKTLDEISHELCPVLSIQQ
LYRISTMYWDDKYGTHSVSPDVIANMRVLMTEDSNNAVSNSFLLDDDSSIPFSVDDLSKS
MERIEIGDVEPPPLIRENSGFSFLLPCSD*
>AT3G06470.1 | GNS1/SUR4 membrane family protein
MASIYSSLTYWLVNHPYISNFTWIEGETLGSTVFFVSVVVSVYLSATFLLRSAIDSLPSL
SPRILKPITAVHSLILCLLSLVMAVGCTLSITSSHASSDPMARFLHAICFPVDVKPNGPL
FFWAQVFYLSKILEFGDTILIILGKSIQRLSFLHVYHHATVVVMCYLWLRTRQSMFPIAL
VTNSTVHVIMYGYYFLCAVGSRPKWKRLVTDCQIVQFVFSFGLSGWMLREHLFGSGCTGI
WGWCFNAAFNASLLALFSNFHSKNYVKKPTREDGKKSD*
>AT4G14240.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF21 (InterProIPR002550) Cystathionine beta-synthase core (InterProIPR000644) BEST Arabidopsis thaliana protein match is CBS domain-containing protein-related (TAIRAT4G142301) Has 6770 Blast hits to 6657 proteins in 1347 species Archae - 62 Bacteria - 4461 Metazoa - 254 Fungi - 179 Plants - 121 Viruses - 0 Other Eukaryotes - 1693 (source NCBI BLink)
MHLINAVAAARILSGIGQSNGNNGGEAIPFGSFEWITYAGISCFLVLFAGIMSGLTLGLM
SLGLVELEILQRSGTPNEKKQAAAIFPVVQKQHQLLVTLLLCNAMAMEGLPIYLDKLFNE
YVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTLCYPIAFPIGKILDLVL
GHNDALFRRAQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAQEAMTPIESTFS
LDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLVSAVCIRR
IPRVPADMPLYDILNEFQKGSSHMAAVVKVKGKSKVPPSTLLEEHTDESNDSDLTAPLLL
KREGNHDNVIVTIDKANGQSFFQNNESGPHGFSHTSEAIEDGEVIGIITLEDVFEELLQE
EIVDETDEYVDVHKRIRVAAAAAASSIARAPSSRKLLAQKGTGGQNKQGQTNKVPGQEQD
KMLGTITEPIRRNN*
>AT4G14240.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF21 (InterProIPR002550) Cystathionine beta-synthase core (InterProIPR000644) BEST Arabidopsis thaliana protein match is CBS domain-containing protein-related (TAIRAT4G142301) Has 6735 Blast hits to 6622 proteins in 1349 species Archae - 62 Bacteria - 4446 Metazoa - 254 Fungi - 179 Plants - 121 Viruses - 0 Other Eukaryotes - 1673 (source NCBI BLink)
MHLINAVAAARILSGIGQSNGNNGGEAIPFGSFEWITYAGISCFLVLFAGIMSGLTLGLM
SLGLVELEILQRSGTPNEKKQAAAIFPVVQKQHQLLVTLLLCNAMAMEGLPIYLDKLFNE
YVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTLCYPIAFPIGKILDLVL
GHNDALFRRAQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAQEAMTPIESTFS
LDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLVSAVCIRR
IPRVPADMPLYDILNEFQKGSSHMAAVVKVKGKSKVPPSTLLEEHTDESNDSDLTAPLLL
KREGNHDNVIVTIDKANGQSFFQNNESGPHGFSHTSEAIEDGEVIGIITLEDVFEELLQE
EIVDETDEYVDVHKRIRVAAAAAASSIARAPSSRKLLAQKGTGGQNKQGQTNKVPGQEQD
KMLGTITEPIRRNN*
>AT4G14240.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF21 (InterProIPR002550) Cystathionine beta-synthase core (InterProIPR000644) BEST Arabidopsis thaliana protein match is CBS domain-containing protein-related (TAIRAT4G142301) Has 6770 Blast hits to 6657 proteins in 1347 species Archae - 62 Bacteria - 4461 Metazoa - 254 Fungi - 179 Plants - 121 Viruses - 0 Other Eukaryotes - 1693 (source NCBI BLink)
MHLINAVAAARILSGIGQSNGNNGGEAIPFGSFEWITYAGISCFLVLFAGIMSGLTLGLM
SLGLVELEILQRSAAIFPVVQKQHQLLVTLLLCNAMAMEGLPIYLDKLFNEYVAIILSVT
FVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTLCYPIAFPIGKILDLVLGHNDALFRR
AQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAQEAMTPIESTFSLDVNSKLDW
EAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLVSAVCIRRIPRVPADMP
LYDILNEFQKGSSHMAAVVKVKGKSKVPPSTLLEEHTDESNDSDLTAPLLLKREGNHDNV
IVTIDKANGQSFFQNNESGPHGFSHTSEAIEDGEVIGIITLEDVFEELLQEEIVDETDEY
VDVHKRIRVAAAAAASSIARAPSSRKLLAQKGTGGQNKQGQTNKVPGQEQDKMLGTITEP
IRRNN*
>AT4G14240.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF21 (InterProIPR002550) Cystathionine beta-synthase core (InterProIPR000644) BEST Arabidopsis thaliana protein match is CBS domain-containing protein-related (TAIRAT4G142301) Has 6735 Blast hits to 6622 proteins in 1349 species Archae - 62 Bacteria - 4446 Metazoa - 254 Fungi - 179 Plants - 121 Viruses - 0 Other Eukaryotes - 1673 (source NCBI BLink)
MHLINAVAAARILSGIGQSNGNNGGEAIPFGSFEWITYAGISCFLVLFAGIMSGLTLGLM
SLGLVELEILQRSAAIFPVVQKQHQLLVTLLLCNAMAMEGLPIYLDKLFNEYVAIILSVT
FVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTLCYPIAFPIGKILDLVLGHNDALFRR
AQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAQEAMTPIESTFSLDVNSKLDW
EAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLVSAVCIRRIPRVPADMP
LYDILNEFQKGSSHMAAVVKVKGKSKVPPSTLLEEHTDESNDSDLTAPLLLKREGNHDNV
IVTIDKANGQSFFQNNESGPHGFSHTSEAIEDGEVIGIITLEDVFEELLQEEIVDETDEY
VDVHKRIRVAAAAAASSIARAPSSRKLLAQKGTGGQNKQGQTNKVPGQEQDKMLGTITEP
IRRNN*
>AT5G66640.1 | DAR3 (DA1-RELATED PROTEIN 3)
MVRRKRQEEDEKIEIERVKEESLKLAKQAEEKRRLEESKEQGKRIQVDDDQLAKTTSKDK
GQINHSKDVVEEDVNPPPSIDGKSEIGDGTSVNPRCLCCFHCHRPFVMHEILKKGKFHID
CYKEYYRNRNCYVCQQKIPVNAEGIRKFSEHPFWKEKYCPIHDEDGTAKCCSCERLEPRG
TNYVMLGDFRWLCIECMGSAVMDTNEVQPLHFEIREFFEGLFLKVDKEFALLLVEKQALN
KAEEEEKIDYHRAAVTRGLCMSEEQIVPSIIKGPRMGPDNQLITDIVTESQRVSGFEVTG
ILIIYGLPRLLTGYILAHEMMHAWLRLNGYKNLKLELEEGLCQALGLRWLESQTFASTDA
AAAAAVASSSSFSSSTAPPAAITSKKSDDWSIFEKKLVEFCMNQIKEDDSPVYGLGFKQV
YEMMVSNNYNIKDTLKDIVSASNATPDSTV*
>AT1G64820.1 | MATE efflux family protein
METDFSLVRKEEEEEEDNRNGMSYLSMEMMKKVSSMAAPMVAVSVSQFLLQVISMVMAGH
LDELSLSAVAIATSLTNVTGFSLIVGFAGALDTLCGQAFGAEQFGKIGAYTYSSMLCLLV
FCFSISIVWFFMDKLLEIFHQDPLISQLACRYSIWLIPALFGFTLLQPMTRYFQSQGITL
PLFVSSLGALCFHIPFCWLLVYKLKFGIVGAALSIGFSYWLNVFLLWIFMRYSALHREMK
NLGLQELISSMKQFIALAIPSAMMICLEWWSFEILLLMSGLLPNSKLETSVISICLTTSA
VHFVLVNAIGASASTHVSNELGAGNHRAARAAVNSAIFLGGVGALITTITLYSYRKSWGY
VFSNEREVVRYATQITPILCLSIFVNSFLAVLSGVARGSGWQRIGGYASLGSYYLVGIPL
GWFLCFVMKLRGKGLWIGILIASTIQLIVFALVTFFTNWEQEATKARDRVFEMTPQVKGN
QKTQIIVEEDTQVLLNHIAETV*
>AT4G23030.1 | MATE efflux protein-related
MAAPLLMIIKNQTDHRQDPNPNPTHLSSSIQEAKSIAKISLPLILTGLLLYSRSMISMLF
LGRLNDLSALSGGSLALGFANITGYSLLSGLSIGMEPICVQAFGAKRFKLLGLALQRTTL
LLLLCSLPISILWLNIKKILLFFGQDEEISNQAEIFILFSLPDLILQSFLHPIRIYLRSQ
SITLPLTYSAFFAVLLHIPINYLLVSSLGLGLKGVALGAIWTNVNLLGFLIIYIVFSGVY
QKTWGGFSMDCFKGWRSLMKLAIPSCVSVCLEWWWYEIMILLCGLLLNPQATVASMGILI
QTTALIYIFPSSLSISVSTRVGNELGANQPDKARIAARTGLSLSLGLGLLAMFFALMVRN
CWARLFTDEEEIVKLTSMVLPIIGLCELGNCPQTTLCGVLRGSARPKLGANINLCCFYFV
GMPVAVWLSFFSGFDFKGLWLGLFAAQGSCLISMLVVLARTDWEVEVHRAKELMTRSCDG
DEDDGNTPFLLDSLDIEENLVF*