>AT5G12010.1 | unknown protein
MKAAVFRNEDGDEEEEEEEEEEVCNVFDGGEVESKRNNETKNLKGFFTSLLLMEEHEKQD
QEARNAASRREMSDFQSNYRKRARTMSDYYSDLNDYYADAEESGDINLKKSRVSRAVASV
AVAAASEIEAESSEITGSGSVRGTGSGQQRRLWVKDRSRAWWEECSRLDYPEEDFKKAFR
MSKSTFELICDELNSAVAKEDTALRNAIPVRQRVAVCIWRLATGEPLRLVSKKFGLGIST
CHKLVLEVCKAIKDVLMPKYLQWPDDESLRNIRERFESVSGIPNVVGSMYTTHIPIIAPK
ISVASYFNKRHTERNQKTSYSITIQAVVNPKGVFTDLCIGWPGSMPDDKVLEKSLLYQRA
NNGGLLKGMWVAGGPGHPLLDWVLVPYTQQNLTWTQHAFNEKMSEVQGVAKEAFGRLKGR
WACLQKRTEVKLQDLPTVLGACCVLHNICEMREEKMEPELMVEVIDDEVLPENVLRSVNA
MKARDTISHNLLHHGLAGTSFL*
>AT4G29140.1 | MATE efflux protein-related
MCNPSTTTTTTGSENQESRTGLFLDLFSINSFEPTKRNLRHCENRGSPLMAEAVTEAKSL
FTLAFPIAVTALVLYLRSAVSMFFLGQLGDLELAAGSLAIAFANITGYSVLSGLALGMEP
LCSQAFGAHRFKLLSLTLHRTVVFLLVCCVPISVLWFNVGKISVYLHQDPDIAKLAQTYL
IFSLPDLLTNTLLHPIRIYLRAQGIIHPVTLASLSGAVFHLPANLFLVSYLRLGLTGVAV
ASSITNIFVVAFLVCYVWASGLHAPTWTDPTRDCFRGWAPLLRLAGPSCVSVCLEWWWYE
IMIVLCGLLVNPRSTVAAMGVLIQTTSFLYVFPSSLSFAVSTRVGNELGANRPKTAKLTA
TVAIVFAAVTGIIAAAFAYSVRNAWGRIFTGDKEILQLTAAALPILGLCEIGNCPQTVGC
GVVRGTARPSTAANVNLGAFYLVGMPVAVGLGFWAGIGFNGLWVGLLAAQISCAGLMMYV
VGTTDWESEAKKAQTLTCAETVENDIIKAVVASTIDGECDEAEPLIRITVLY*
>AT3G62870.1 | 60S ribosomal protein L7A (RPL7aB)
MAPKKGVKVASKKKPEKVTNPLFERRPKQFGIGGALPPKKDLSRYIKWPKSIRLQRQKRI
LKQRLKVPPALNQFTKTLDKNLATSLFKILLKYRPEDKAAKKERLLNKAQAEAEGKPAES
KKPIVVKYGLNHVTYLIEQNKAQLVVIAHDVDPIELVVWLPALCRKMEVPYCIVKGKSRL
GAVVHQKTAAALCLTTVKNEDKLEFSKILEAIKANFNDKYEEYRKKWGGGIMGSKSQAKT
KAKERVIAKEAAQRMN*
>AT1G18540.1 | 60S ribosomal protein L6 (RPL6A)
MPAAKRTPKVNRNPDLIRGVGKYSRSQMYHKRGLWAIKAKNGGVFPRHDAQPKVDAPVEK
PAKFYPAEDVKKPLVNRRKPKPTKLKASITPGTVLIILAGRFKGKRVVFLKQLSSGLLLV
TGPFKINGVPLRRVNQAYVIGTSTKIDISGVNTEKFDDKYFGKVAEKKKKKTEGEFFEAE
KEEKKEIPQEKKEDQKTVDAALIKSIEAVPELKVYLGARFSLSQGMKPHELVF*
>AT2G39290.1 | PGP1 (PHOSPHATIDYLGLYCEROLPHOSPHATE SYNTHASE 1) CDP-alcohol phosphatidyltransferase/ CDP-diacylglycerol-glycerol-3-phosphate 3-phosphatidyltransferase
MLRSGLASLIVDVNLRRTLRPSPTFSFPAHLSRCIITSRYSSRTSLRFPIQISRHQHRLS
YFSSSSSSEQSRPTSSSRNSFSGHGQLDSDDNSSPPPSQSSSKVLTLPTVLTLGRVAAVP
LLVATFYVDSWWGTTATTSIFIAAAITDWLDGYLARKMRLGSAFGAFLDPVADKLMVAAT
LILLCTKPIQVAELGPLPWLLTVPSIAIIGREITMSAVREWAASQNGKLLEAVAVNNLGK
WKTATQMTALTILLASRDSNVGWLVASGAGLLYVSAGLSVWSLAVYMRKIWKVLMK*
>AT5G45970.1 | ARAC2 (ARABIDOPSIS RAC-LIKE 2) GTP binding
MSTARFIKCVTVGDGAVGKTCMLISYTSNTFPTDYVPTVFDNFSANVVVDGSTVNLGLWD
TAGQEDYNRLRPLSYRGADVFLLAFSLISKASYENIHKKWLPELKHYAPGIPIVLVGTKL
DLRDDKQFLKDHPGAASITTAQGEELRKMIGAVRYLECSSKTQQNVKAVFDTAIRVALRP
PKAKKKIKPLKTKRSRICFFL*
>AT2G01680.1 | ankyrin repeat family protein
MEMKQMKFLTHQAFFSSVRSGDLSQLQQLVDNLTGDELIDESSPCSAVAELMSVQNDAGE
TAVYISAAENLEDIFRYLIRFSSLETVKIRSKSDMNAFHVAAKRGHLGIVKELLRLWPEL
CRICDASNTSPLYAAAVQDHLEIVNAMLDVDPSCAMIVRKNGKTSLHTAGRYGLLRIVKA
LIEKDAAIVGVKDKKGQTALHMAVKGRSLEVVEEILQADYTILNERDRKGNTALHIATRK
ARPQITSLLLTFTAIEVNAINNQKETAMDLADKLQYSESALEINEALVEAGAKHGRFIGR
EDEARALKRAVSDIKHEVQSQLLQNEKTNRRVSGIAKELRKLHREAVQNTTNSITVVAVL
FASIAFLAIFNLPGQYFTEGSHVGQANIAGRTGFRVFCLLNATSLFISLAVVVVQITLVA
WDTRAQKKVVSVVNKLMWAACACTFGAFLAIAFAVVGKGNSWMAITITLLGAPILVGTLA
SMCYFVFRQRFRSGNDSQRRIRRGSSKSFSWSYSHHVSDFEDESDFEKIIAL*
>AT1G59740.1 | proton-dependent oligopeptide transport (POT) family protein
MAEINKQSNKWEQEEVSNENNWELAEEESVDWRGRPSNPNKHGGMRAALFVLGLQAFEIM
GIAAVGNNLITYVINEMHFPLSKAANIVTNFVGTIFIFALLGGYLSDAFLGSFWTIIIFG
FVELSGFILLSVQAHLPQLKPPKCNPLIDQTCEEAKGFKAMIFFMALYLVALGSGCVKPN
MIAHGADQFSQSHPKQSKRLSSYFNAAYFAFSMGELIALTLLVWVQTHSGMDIGFGVSAA
AMTMGIISLVSGTMYFRNKRPRRSIFTPIAHVIVAAILKRKLASPSDPRMLHGDHHVAND
VVPSSTLPHTPRFRFLDKACIKIQDTNTKESPWRLCTVTQVEQVKTLISLVPIFASTIVF
NTILAQLQTFSVQQGSSMNTRLSNSFHIPPASLQAIPYIMLIFLVPLYDSFLVPFARKLT
GHNSGIPPLTRIGIGLFLSTFSMVSAAMLEKKRRDSSVLDGRILSIFWITPQFLIFGISE
MFTAVGLIEFFYKQSAKGMESFLMALTYCSYSFGFYFSSVLVSVVNKITSTSVDSKGWLG
ENDLNKDRLDLFYWLLAVLSLLNFLSYLFWSRWNIKSSRRNNTNVVGDENI*
>AT5G12350.1 | Ran GTPase binding / chromatin binding / zinc ion binding
MSRNGRMASDLSRAGPVERDIEQAIIALKKGAYLLKYGRRGKPKFCPFRLSNDETVLIWF
SGNEEKHLKLSHVSRIISGQRTPIFQRYPRPEKEYQSFSLIYSERSLDVICKDKDEAEVW
FTGLKALISHCHQRNRRTESRSDGTPSEANSPRTYTRRSSPLHSPFSSNDSLQKDGSNHL
RIHSPFESPPKNGLDKAFSDMALYAVPPKGFYPSDSATISVHSGGSDSMHGHMRGMGMDA
FRVSMSSAVSSSSHGSGHDDGDALGDVFIWGEGIGEGVLGGGNRRVGSSFDIKMDSLLPK
ALESTIVLDVQNIACGGQHAVLVTKQGESFSWGEESEGRLGHGVDSNIQQPKLIDALNTT
NIELVACGEFHSCAVTLSGDLYTWGKGDFGVLGHGNEVSHWVPKRVNFLLEGIHVSSIAC
GPYHTAVVTSAGQLFTFGDGTFGVLGHGDKKSVFIPREVDSLKGLRTVRAACGVWHTAAV
VEVMVGSSSSSNCSSGKLFTWGDGDKGRLGHGNKEPKLVPTCVAALVEPNFCQVACGHSL
TVALTTSGHVYTMGSPVYGQLGNSHADGKTPNRVEGKLHKSFVEEIACGAYHVAVLTSRT
EVYTWGKGSNGRLGHGDVDDRNSPTLVESLKDKQVKSIACGTNFTAAVCIHRWASGMDQS
MCSGCRQPFSFKRKRHNCYNCGLVFCHSCTSKKSLKACMAPNPNKPYRVCDKCFNKLKKT
METDPSSHSSLSRRGSINQGSDPIDKDDKFDSRSDGQLARFSLMESMRQVDSRHKKNKKY
EFNSSRVSPIPSGSSQRGALNIAKSFNPVFGASKKFFSASVPGSRIVSRATSPISRRPSP
PRSTTPTPTLSGLATPKFVVDDTKRTNDNLSQEVVKLRSQVESLTRKAQLQEVELERTTK
QLKEALAITNEETTRCKAAKEVIKSLTAQLKDMAERLPVGSARTVKSPPSLNSFGSSPGR
IDPFNILNQANSQESEPNGITTPMFSNGTMTPAFGNGEATNEARNEKEWVEQDEPGVYIT
LTALAGGARDLKRVRFSRKRFSEIQAEQWWADNRGRVYEQYNVRMVDKASEDLPR*
>AT1G72560.1 | PSD (PAUSED) nucleobase nucleoside nucleotide and nucleic acid transmembrane transporter/ tRNA binding
MDDLEQAIVISFETGAVDSALKSQAVTYCQQIKETPSICSICIEKLWFSKLVQVQFWCLQ
TLQDVLRVKYGSMSLDEQSYVRKSVFSMACLEVIDNENAGRVVEGPPFVKNKLAQVLATL
IYYEYPLIWSSVFLDFMLHLCKGAVVIDMFCRVLNALDDELISLDYPRTPEEISVAARVK
DAMRQQCVPQIARAWYDIVSMYKNSDPDLSATVLDCMRRFVSWIDIGLVANDAFVPLLFE
LILSDGLSEQVRGAAAGCVLAMVSKRMDPQSKLPLLQTLQISRVFGLVSGDVDSDLVSKV
SALLTGYAVEVLECHKRLNSEDTKAVSMDLLNEVLPSVFYVMQKCEVDSTFSIVQFLLGY
VSTLKGLPALKEKQLLHITQILEVIRIQICYDPMYRNNLNSLDKTGLEEEDRMSEFRKDL
FVLLRTVGRVAPEVTQHFIRNSLANAVESSSESNVEEVEAALSLLYSFGESMTEEAMKTG
SGCLSELIPMLLTTQFPGHSHRLVALVYLENITRYMKFIQENSQYIPNVLGAFLDDRGLH
HQNFYVSRRAGYLFMRVVKLLKSKLVPFIDKILQNLQDTLSQLTTMNFASRELTGTEDGS
HIFEAIGIIIGLEDVPAEKQSDYLSLLLTPLCQQIEAGLVQAKVASSEDFPVKIANIQFA
IVAINALSKGFNERLVTASRPGIGLMFKQTLDVLLRVLIEFPKVEPLRSKVTSFIHRMVD
TLGSAVFPYLPKALEQLLADSEPKEMVGFMVLLNQLICKFNSALHDILEEVYPVVAVRIF
NVIPRDGLPSRPGAVTEEMRELIELQRMLYTFLHVIATHDLSSVFLTPKSRAYLDPMMQL
VLNTSCNHKDITVRKACVQIFIKLIKDWCAEPYSEEKVPGFQNFVIEAFATNCCLYSVLD
KSFNFSDANTHALFGEIITAQKVMYEKFGNTFLMHLMSKSFPSAHIPQDLAEQYCQKLQG
NDIRSLKSYYQSLIENLRLQQNGSHVFR*
>AT1G72560.1 | PSD (PAUSED) nucleobase nucleoside nucleotide and nucleic acid transmembrane transporter/ tRNA binding
MDDLEQAIVISFETGAVDSALKSQAVTYCQQIKETPSICSICIEKLWFSKLVQVQFWCLQ
TLQDVLRVKYGSMSLDEQSYVRKSVFSMACLEVIDNENAGRVVEGPPFVKNKLAQVLATL
IYYEYPLIWSSVFLDFMLHLCKGAVVIDMFCRVLNALDDELISLDYPRTPEEISVAARVK
DAMRQQCVPQIARAWYDIVSMYKNSDPDLSATVLDCMRRFVSWIDIGLVANDAFVPLLFE
LILSDGLSEQVRGAAAGCVLAMVSKRMDPQSKLPLLQTLQISRVFGLVSGDVDSDLVSKV
SALLTGYAVEVLECHKRLNSEDTKAVSMDLLNEVLPSVFYVMQKCEVDSTFSIVQFLLGY
VSTLKGLPALKEKQLLHITQILEVIRIQICYDPMYRNNLNSLDKTGLEEEDRMSEFRKDL
FVLLRTVGRVAPEVTQHFIRNSLANAVESSSESNVEEVEAALSLLYSFGESMTEEAMKTG
SGCLSELIPMLLTTQFPGHSHRLVALVYLENITRYMKFIQENSQYIPNVLGAFLDDRGLH
HQNFYVSRRAGYLFMRVVKLLKSKLVPFIDKILQNLQDTLSQLTTMNFASRELTGTEDGS
HIFEAIGIIIGLEDVPAEKQSDYLSLLLTPLCQQIEAGLVQAKVASSEDFPVKIANIQFA
IVAINALSKGFNERLVTASRPGIGLMFKQTLDVLLRVLIEFPKVEPLRSKVTSFIHRMVD
TLGSAVFPYLPKALEQLLADSEPKEMVGFMVLLNQLICKFNSALHDILEEVYPVVAVRIF
NVIPRDGLPSRPGAVTEEMRELIELQRMLYTFLHVIATHDLSSVFLTPKSRAYLDPMMQL
VLNTSCNHKDITVRKACVQIFIKLIKDWCAEPYSEEKVPGFQNFVIEAFATNCCLYSVLD
KSFNFSDANTHALFGEIITAQKVMYEKFGNTFLMHLMSKSFPSAHIPQDLAEQYCQKLQG
NDIRSLKSYYQSLIENLRLQQNGSHVFR*
>AT1G72560.2 | PSD (PAUSED) nucleobase nucleoside nucleotide and nucleic acid transmembrane transporter/ tRNA binding
MDDLEQAIVISFETGAVDSALKSQAVTYCQQIKETPSICSICIEKLWFSKLVQVQFWCLQ
TLQDVLRVKYGSMSLDEQSYVRKSVFSMACLEVIDNENAGRVVEGPPFVKNKLAQVLATL
IYYEYPLIWSSVFLDFMLHLCKGAVVIDMFCRVLNALDDELISLDYPRTPEEISVAARVK
DAMRQQCVPQIARAWYDIVSMYKNSDPDLSATVLDCMRRFVSWIDIGLVANDAFVPLLFE
LILSDGLSEQVRGAAAGCVLAMVSKRMDPQSKLPLLQTLQISRVFGLVSGDVDSDLVSKV
SALLTGYAVEVLECHKRLNSEDTKAVSMDLLNEVLPSVFYVMQKCEVDSTFSIVQFLLGY
VSTLKGLPALKEKQLLHITQILEVIRIQICYDPMYRNNLNSLDKTGLEEEDRMSEFRKDL
FVLLRTVGRVAPEVTQHFIRNSLANAVESSSESNVEEVEAALSLLYSFGESMTEEAMKTG
SGCLSELIPMLLTTQFPGHSHRLVALVYLENITRYMKFIQENSQYIPNVLGAFLDDRGLH
HQNFYVSRRAGYLFMRVVKLLKSKLVPFIDKILQNLQDTLSQLTTMNFASRELTGTEDGS
HIFEAIGIIIGLEDVPAEKQSDYLSLLLTPLCQQIEAGLVQAKVASSEDFPVKIANIQFA
IVAINALSKGFNERLVTASRPGIGLMFKQTLDVLLRVLIEFPKVEPLRSKVTSFIHRMVD
TLGSAVFPYLPKALEQLLADSEPKEMVGFMVLLNQLICKFNSALHDILEEVYPVVAVRIF
NVIPRDGLPSRPGAVTEEMRELIELQRMLYTFLHVIATHDLSSVFLTPKSRAYLDPMMQL
VLNTSCNHKDITVRKACVQIFIKLIKDWCAEPYSEEKVPGFQNFVIEAFATNCCLYSVLD
KSFNFSDANTHALFGEIITAQKVMYEKFGNTFLMHLMSKSFPSAHIPQDLAEQYCQKLQG
NDIRSLKSYYQSLIENLRLQQNGSHVFR*
>AT1G72560.2 | PSD (PAUSED) nucleobase nucleoside nucleotide and nucleic acid transmembrane transporter/ tRNA binding
MDDLEQAIVISFETGAVDSALKSQAVTYCQQIKETPSICSICIEKLWFSKLVQVQFWCLQ
TLQDVLRVKYGSMSLDEQSYVRKSVFSMACLEVIDNENAGRVVEGPPFVKNKLAQVLATL
IYYEYPLIWSSVFLDFMLHLCKGAVVIDMFCRVLNALDDELISLDYPRTPEEISVAARVK
DAMRQQCVPQIARAWYDIVSMYKNSDPDLSATVLDCMRRFVSWIDIGLVANDAFVPLLFE
LILSDGLSEQVRGAAAGCVLAMVSKRMDPQSKLPLLQTLQISRVFGLVSGDVDSDLVSKV
SALLTGYAVEVLECHKRLNSEDTKAVSMDLLNEVLPSVFYVMQKCEVDSTFSIVQFLLGY
VSTLKGLPALKEKQLLHITQILEVIRIQICYDPMYRNNLNSLDKTGLEEEDRMSEFRKDL
FVLLRTVGRVAPEVTQHFIRNSLANAVESSSESNVEEVEAALSLLYSFGESMTEEAMKTG
SGCLSELIPMLLTTQFPGHSHRLVALVYLENITRYMKFIQENSQYIPNVLGAFLDDRGLH
HQNFYVSRRAGYLFMRVVKLLKSKLVPFIDKILQNLQDTLSQLTTMNFASRELTGTEDGS
HIFEAIGIIIGLEDVPAEKQSDYLSLLLTPLCQQIEAGLVQAKVASSEDFPVKIANIQFA
IVAINALSKGFNERLVTASRPGIGLMFKQTLDVLLRVLIEFPKVEPLRSKVTSFIHRMVD
TLGSAVFPYLPKALEQLLADSEPKEMVGFMVLLNQLICKFNSALHDILEEVYPVVAVRIF
NVIPRDGLPSRPGAVTEEMRELIELQRMLYTFLHVIATHDLSSVFLTPKSRAYLDPMMQL
VLNTSCNHKDITVRKACVQIFIKLIKDWCAEPYSEEKVPGFQNFVIEAFATNCCLYSVLD
KSFNFSDANTHALFGEIITAQKVMYEKFGNTFLMHLMSKSFPSAHIPQDLAEQYCQKLQG
NDIRSLKSYYQSLIENLRLQQNGSHVFR*
>AT3G16050.1 | A37 protein heterodimerization
MADQAMTDQDQGAVTLYSGTAITDAKKNHPFSVKVGLAQVLRGGAIVEVSSVNQAKLAES
AGACSVIVSDPVRSRGGVRRMPDPVLIKEVKRAVSVPVMARARVGHFVEAQILESLAVDY
IDESEIISVADDDHFINKHNFRSPFICGCRDTGEALRRIREGAAMIRIQGDLTATGNIAE
TVKNVRSLMGEVRVLNNMDDDEVFTFAKKISAPYDLVAQTKQMGRVPVVQFASGGITTPA
DAALMMQLGCDGVFVGSEVFDGPDPFKKLRSIVQAVQHYNDPHVLAEMSSGLENAMESLN
VRGDRIQDFGQGSV*
>AT4G30660.1 | hydrophobic protein putative / low temperature and salt responsive protein putative
MPSNCEILCEIIIAILLPPLGVCFRKGCCTVEFLICLVLTILGYVPGIIYAIYVIVFQHR
EEYFDEYRRPIYSA*
>AT4G30660.1 | hydrophobic protein putative / low temperature and salt responsive protein putative
MPSNCEILCEIIIAILLPPLGVCFRKGCCTVEFLICLVLTILGYVPGIIYAIYVIVFQHR
EEYFDEYRRPIYSA*
>AT4G30660.2 | hydrophobic protein putative / low temperature and salt responsive protein putative
MPSNCEILCEIIIAILLPPLGVCFRKGCCTVEFLICLVLTILGYVPGIIYAIYVIVFQHR
EEYFDEYRRPIYSA*
>AT4G30660.2 | hydrophobic protein putative / low temperature and salt responsive protein putative
MPSNCEILCEIIIAILLPPLGVCFRKGCCTVEFLICLVLTILGYVPGIIYAIYVIVFQHR
EEYFDEYRRPIYSA*
>AT1G19730.1 | ATTRX4 oxidoreductase acting on sulfur group of donors disulfide as acceptor
MAAEEGQVIGCHTNDVWTVQLDKAKESNKLIVIDFTASWCPPCRMIAPIFNDLAKKFMSS
AIFFKVDVDELQSVAKEFGVEAMPTFVFIKAGEVVDKLVGANKEDLQAKIVKHTGVTTA*
>AT2G46230.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF652 (InterProIPR006984) Nucleotide binding protein PINc (InterProIPR006596) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G265301) Has 493 Blast hits to 493 proteins in 165 species Archae - 21 Bacteria - 0 Metazoa - 192 Fungi - 143 Plants - 53 Viruses - 0 Other Eukaryotes - 84 (source NCBI BLink)
MGKAKKPQKFAVVKKMISHKALKHYKEEVLNPNKKDLTELPRNVPSVPAGLFFSYNSTLV
PPYRVLVDTNFINFSIQNKIDLEKGMRDCLYANCTPCITDCVMAELEKLGQKYRVALRIA
KDPHFERLPCIHKGTYADDCLVDRVTQHKCFIVATCDRDLKRRIRKIPGVPIMYVTNRKY
SIEKLPEATLGGAPRY*
>AT2G46230.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF652 (InterProIPR006984) Nucleotide binding protein PINc (InterProIPR006596) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G265301) Has 441 Blast hits to 441 proteins in 161 species Archae - 15 Bacteria - 0 Metazoa - 185 Fungi - 119 Plants - 49 Viruses - 0 Other Eukaryotes - 73 (source NCBI BLink)
MGKAKKPQKFAVVKKMISHKALKHYKEEVLNPNKKDLTELPRNVPSVPAGLFFSYNSTLV
PPYRVLVDTNFINFSIQNKIDLEKGMRDCLYANCTPCITDCVMAELEKLGQKYRVALRIA
KDPHFERLPCIHKGTYADDCLVDRVTQHKCFIVATCDRDLKRRIRKIPGVPIMYVTNRKY
SIEKLPEATLGGAPRY*
>AT2G46230.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF652 (InterProIPR006984) Nucleotide binding protein PINc (InterProIPR006596) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G265301) Has 493 Blast hits to 493 proteins in 165 species Archae - 21 Bacteria - 0 Metazoa - 192 Fungi - 143 Plants - 53 Viruses - 0 Other Eukaryotes - 84 (source NCBI BLink)
MGKAKKPQKFAVVKKMISHKALKHYKEEVLNPNKKDLTELPRNVPSVPAGLFFSYNSTLV
PPYRVLVDTNFINFSIQNKIDLEKGMRDCLYANCTPCITDCVMAELEKLGQKYRVALRIA
KDPHFERLPCIHKGTYADDCLVDRVTQHKCFIVATCDRDLKRRIRKVKIVCCISALVYIH
G*
>AT2G46230.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF652 (InterProIPR006984) Nucleotide binding protein PINc (InterProIPR006596) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G265301) Has 441 Blast hits to 441 proteins in 161 species Archae - 15 Bacteria - 0 Metazoa - 185 Fungi - 119 Plants - 49 Viruses - 0 Other Eukaryotes - 73 (source NCBI BLink)
MGKAKKPQKFAVVKKMISHKALKHYKEEVLNPNKKDLTELPRNVPSVPAGLFFSYNSTLV
PPYRVLVDTNFINFSIQNKIDLEKGMRDCLYANCTPCITDCVMAELEKLGQKYRVALRIA
KDPHFERLPCIHKGTYADDCLVDRVTQHKCFIVATCDRDLKRRIRKVKIVCCISALVYIH
G*
>AT1G20696.1 | HMGB3 (HIGH MOBILITY GROUP B 3) DNA binding / chromatin binding / structural constituent of chromatin / transcription factor
MKGAKSKAETRSTKLSVTKKPAKGAKGAAKDPNKPKRPSSAFFVFMEDFRVTYKEEHPKN
KSVAAVGKAGGEKWKSLSDSEKAPYVAKADKRKVEYEKNMKAYNKKLEEGPKEDEESDKS
VSEVNDEDDAEDGSEEEEDDD*
>AT1G20696.1 | HMGB3 (HIGH MOBILITY GROUP B 3) DNA binding / chromatin binding / structural constituent of chromatin / transcription factor
MKGAKSKAETRSTKLSVTKKPAKGAKGAAKDPNKPKRPSSAFFVFMEDFRVTYKEEHPKN
KSVAAVGKAGGEKWKSLSDSEKAPYVAKADKRKVEYEKNMKAYNKKLEEGPKEDEESDKS
VSEVNDEDDAEDGSEEEEDDD*
>AT1G20696.1 | HMGB3 (HIGH MOBILITY GROUP B 3) DNA binding / chromatin binding / structural constituent of chromatin / transcription factor
MKGAKSKAETRSTKLSVTKKPAKGAKGAAKDPNKPKRPSSAFFVFMEDFRVTYKEEHPKN
KSVAAVGKAGGEKWKSLSDSEKAPYVAKADKRKVEYEKNMKAYNKKLEEGPKEDEESDKS
VSEVNDEDDAEDGSEEEEDDD*
>AT1G20696.2 | HMGB3 (HIGH MOBILITY GROUP B 3) DNA binding / chromatin binding / structural constituent of chromatin / transcription factor
MKGAKSKAETRSTKLSVTKKPAKGAKGAAKDPNKPKRPSSAFFVFMEDFRVTYKEEHPKN
KSVAAVGKAGGEKWKSLSDSEKAPYVAKADKRKVEYEKNMKAYNKKLVIALRKMRNLTSQ
FQRLTMRMMLRMAVKRRKTMIKKLKLW*
>AT1G20696.2 | HMGB3 (HIGH MOBILITY GROUP B 3) DNA binding / chromatin binding / structural constituent of chromatin / transcription factor
MKGAKSKAETRSTKLSVTKKPAKGAKGAAKDPNKPKRPSSAFFVFMEDFRVTYKEEHPKN
KSVAAVGKAGGEKWKSLSDSEKAPYVAKADKRKVEYEKNMKAYNKKLVIALRKMRNLTSQ
FQRLTMRMMLRMAVKRRKTMIKKLKLW*
>AT1G20696.2 | HMGB3 (HIGH MOBILITY GROUP B 3) DNA binding / chromatin binding / structural constituent of chromatin / transcription factor
MKGAKSKAETRSTKLSVTKKPAKGAKGAAKDPNKPKRPSSAFFVFMEDFRVTYKEEHPKN
KSVAAVGKAGGEKWKSLSDSEKAPYVAKADKRKVEYEKNMKAYNKKLVIALRKMRNLTSQ
FQRLTMRMMLRMAVKRRKTMIKKLKLW*
>AT1G20696.3 | HMGB3 (HIGH MOBILITY GROUP B 3) DNA binding / chromatin binding / structural constituent of chromatin / transcription factor
MKGAKSKAETRSTKLSVTKKPAKGAKGAAKDPNKPKRPSSAFFVFMEDFRVTYKEEHPKN
KSVAAVGKAGGEKWKSLSDSEKAPYVAKADKRKVEYEKNMKAYNKKLEEGPKEDEESDKS
VSEVNDEDDAEDGSEEVRRR*
>AT1G20696.3 | HMGB3 (HIGH MOBILITY GROUP B 3) DNA binding / chromatin binding / structural constituent of chromatin / transcription factor
MKGAKSKAETRSTKLSVTKKPAKGAKGAAKDPNKPKRPSSAFFVFMEDFRVTYKEEHPKN
KSVAAVGKAGGEKWKSLSDSEKAPYVAKADKRKVEYEKNMKAYNKKLEEGPKEDEESDKS
VSEVNDEDDAEDGSEEVRRR*
>AT1G20696.3 | HMGB3 (HIGH MOBILITY GROUP B 3) DNA binding / chromatin binding / structural constituent of chromatin / transcription factor
MKGAKSKAETRSTKLSVTKKPAKGAKGAAKDPNKPKRPSSAFFVFMEDFRVTYKEEHPKN
KSVAAVGKAGGEKWKSLSDSEKAPYVAKADKRKVEYEKNMKAYNKKLEEGPKEDEESDKS
VSEVNDEDDAEDGSEEVRRR*
>AT1G62850.2 | translation release factor
MAAIRTMTNMILREFIHHPLLLHSSSKSCQSLLPCLRLTPLISPIHSNSRLVSVRCAAST
SGGSGGDRKVSSRLSQVQQMLHEAEERASSAGNEPTPQITLDNVTLNFARSGGPGGQNVN
KLNTKVDMRFNVKNAYWLSDRIREKILLTEKNRINKDGELVISSTKTRTQKGNIDDALEK
LQAIIDAASYVPPPPSEEQKKKIVKLAAKADNKRLKSKKVLSDKKSARRSRGSYDD*
>AT1G62850.2 | translation release factor
MAAIRTMTNMILREFIHHPLLLHSSSKSCQSLLPCLRLTPLISPIHSNSRLVSVRCAAST
SGGSGGDRKVSSRLSQVQQMLHEAEERASSAGNEPTPQITLDNVTLNFARSGGPGGQNVN
KLNTKVDMRFNVKNAYWLSDRIREKILLTEKNRINKDGELVISSTKTRTQKGNIDDALEK
LQAIIDAASYVPPPPSEEQKKKIVKLAAKADNKRLKSKKVLSDKKSARRSRGSYDD*
>AT1G62850.3 | translation release factor
MAAIRTMTNMILREFIHHPLLLHSSSKSCQSLLPCLRLTPLISPIHSNSRLVSVRCAAST
SGGSGGDRKVSSRLSQVQQMLHEAEERASSAGNEPTPQITLDNVTLNFARSGGPGGQNVN
KLNTKVDMRFNVKNAYWLSDRIREKILLTEKNRINKDGELVISSTKTRTQKGNIDDALEK
LQAIIDAASYVPPPPSEEQKKKIVKLAAKADNKRLKSKKVLSDKKSARRSRGSYDD*
>AT1G62850.3 | translation release factor
MAAIRTMTNMILREFIHHPLLLHSSSKSCQSLLPCLRLTPLISPIHSNSRLVSVRCAAST
SGGSGGDRKVSSRLSQVQQMLHEAEERASSAGNEPTPQITLDNVTLNFARSGGPGGQNVN
KLNTKVDMRFNVKNAYWLSDRIREKILLTEKNRINKDGELVISSTKTRTQKGNIDDALEK
LQAIIDAASYVPPPPSEEQKKKIVKLAAKADNKRLKSKKVLSDKKSARRSRGSYDD*
>AT1G66240.1 | ATX1 (ARABIDOPSIS HOMOLOG OF ANTI-OXIDANT 1) metal ion binding
MLKDLFQAVSYQNTASLSLFQALSVVESKAMSQTVVLRVAMTCEGCVGAVKRVLGKMEGV
ESFDVDIKEQKVTVKGNVQPDAVLQTVTKTGKKTAFWEAEGETAKA*
>AT1G66240.1 | ATX1 (ARABIDOPSIS HOMOLOG OF ANTI-OXIDANT 1) metal ion binding
MLKDLFQAVSYQNTASLSLFQALSVVESKAMSQTVVLRVAMTCEGCVGAVKRVLGKMEGV
ESFDVDIKEQKVTVKGNVQPDAVLQTVTKTGKKTAFWEAEGETAKA*
>AT1G66240.2 | ATX1 (ARABIDOPSIS HOMOLOG OF ANTI-OXIDANT 1) metal ion binding
MTCEGCVGAVKRVLGKMEGVESFDVDIKEQKVTVKGNVQPDAVLQTVTKTGKKTAFWEAE
GETAKA*
>AT1G66240.2 | ATX1 (ARABIDOPSIS HOMOLOG OF ANTI-OXIDANT 1) metal ion binding
MTCEGCVGAVKRVLGKMEGVESFDVDIKEQKVTVKGNVQPDAVLQTVTKTGKKTAFWEAE
GETAKA*
>AT1G80710.1 | transducin family protein / WD-40 repeat family protein
MATEYERKRLENIRRNDEMLAALNVRAKASSLLSAAKRSRDDSKSFKKKKPKPASTPTVI
RMSLRTRGLNPDSAGLPDGFSDFRMGSQITHNQPSPQKQSPRLLAPIPFESAYEGYGSYT
QLVDTLLGIESKSCRGKLVKGEIGVVKDENESPMVRTRSSSRVSKVSVKKEEPEDDSFSD
YVNKEFSIPVKPEKIEFDLDLLTLEPQNVARVVPGRIFVVQFLPCENVKMVAAGDKLGNV
GFWNLDCGNEEDNDGIYLFTPHSAPVSSIVFQQNSLSRVISSSYDGLIRLMDVEKSVFDL
VYSTDEAIFSLSQRPNDEQSLYFGQDYGVFNVWDLRAGKSVFHWELHERRINSIDFNPQN
PHVMATSSTDGTACLWDLRSMGAKKPKTLSTVNHSRAVHSAYFSPSGLSLATTSLDNYIG
VLSGANFENTCMIYHNNTSRWISKFKAVWGWDDSYIYVGNLSKKIDVINPKLKRTVMELH
NPLQRAIPCRIHCHPYNVGTLAGSTAGGQVYVWTTK*
>AT2G27920.2 | SCPL51 (SERINE CARBOXYPEPTIDASE-LIKE 51) serine-type carboxypeptidase
MKQGASGVGIGNFQEVGPLDTFLKPRNSTWLKKADLLFVDSPVGAGYSFVEGNQKDLYVK
SDEEAAQDLTKLLQQLFNKNQTLNQSPLFIVAESYGGKIAVKLGLSVIDAVQSGKLKLHL
GGVILGDSWISPEDFVFSWGPLLKHVSRLDDNGLDSSNSLAEKIKTQIKNGEYVGATQTW
MDLENLISSKSNFVDFYNFLLDTGMDPVSLTTSLKIKKEEKIKKYSRYLNDMRSLSDVED
VEGDLDKLMNGVIKKKLKIIPNDLIWGNNSDDVFTAMEAAFMKPVIEDVDELLATGVDVT
IYNGQLDVICSTSGTEAWVHKLRWEGLEEFKKMEREPLFCESDRATRGFTKSYKNLHFYW
ILGAGHFVPVDEPCVALKMVGEITKSPQL*
>AT2G27920.2 | SCPL51 (SERINE CARBOXYPEPTIDASE-LIKE 51) serine-type carboxypeptidase
MKQGASGVGIGNFQEVGPLDTFLKPRNSTWLKKADLLFVDSPVGAGYSFVEGNQKDLYVK
SDEEAAQDLTKLLQQLFNKNQTLNQSPLFIVAESYGGKIAVKLGLSVIDAVQSGKLKLHL
GGVILGDSWISPEDFVFSWGPLLKHVSRLDDNGLDSSNSLAEKIKTQIKNGEYVGATQTW
MDLENLISSKSNFVDFYNFLLDTGMDPVSLTTSLKIKKEEKIKKYSRYLNDMRSLSDVED
VEGDLDKLMNGVIKKKLKIIPNDLIWGNNSDDVFTAMEAAFMKPVIEDVDELLATGVDVT
IYNGQLDVICSTSGTEAWVHKLRWEGLEEFKKMEREPLFCESDRATRGFTKSYKNLHFYW
ILGAGHFVPVDEPCVALKMVGEITKSPQL*
>AT2G27920.2 | SCPL51 (SERINE CARBOXYPEPTIDASE-LIKE 51) serine-type carboxypeptidase
MKQGASGVGIGNFQEVGPLDTFLKPRNSTWLKKADLLFVDSPVGAGYSFVEGNQKDLYVK
SDEEAAQDLTKLLQQLFNKNQTLNQSPLFIVAESYGGKIAVKLGLSVIDAVQSGKLKLHL
GGVILGDSWISPEDFVFSWGPLLKHVSRLDDNGLDSSNSLAEKIKTQIKNGEYVGATQTW
MDLENLISSKSNFVDFYNFLLDTGMDPVSLTTSLKIKKEEKIKKYSRYLNDMRSLSDVED
VEGDLDKLMNGVIKKKLKIIPNDLIWGNNSDDVFTAMEAAFMKPVIEDVDELLATGVDVT
IYNGQLDVICSTSGTEAWVHKLRWEGLEEFKKMEREPLFCESDRATRGFTKSYKNLHFYW
ILGAGHFVPVDEPCVALKMVGEITKSPQL*
>AT2G27920.1 | SCPL51 (SERINE CARBOXYPEPTIDASE-LIKE 51) serine-type carboxypeptidase
MKTTVVYLVILCLIVSCTNGETKHVRKINSDGSEAWGYVEVRPKAHMFWWHYKSPYRVEN
PSKPWPIILWLQGGPGASGVGIGNFQEVGPLDTFLKPRNSTWLKKADLLFVDSPVGAGYS
FVEGNQKDLYVKSDEEAAQDLTKLLQQLFNKNQTLNQSPLFIVAESYGGKIAVKLGLSVI
DAVQSGKLKLHLGGVILGDSWISPEDFVFSWGPLLKHVSRLDDNGLDSSNSLAEKIKTQI
KNGEYVGATQTWMDLENLISSKSNFVDFYNFLLDTGMDPVSLTTSLKIKKEEKIKKYSRY
LNDMRSLSDVEDVEGDLDKLMNGVIKKKLKIIPNDLIWGNNSDDVFTAMEAAFMKPVIED
VDELLATGVDVTIYNGQLDVICSTSGTEAWVHKLRWEGLEEFKKMEREPLFCESDRATRG
FTKSYKNLHFYWILGAGHFVPVDEPCVALKMVGEITKSPQL*
>AT2G27920.1 | SCPL51 (SERINE CARBOXYPEPTIDASE-LIKE 51) serine-type carboxypeptidase
MKTTVVYLVILCLIVSCTNGETKHVRKINSDGSEAWGYVEVRPKAHMFWWHYKSPYRVEN
PSKPWPIILWLQGGPGASGVGIGNFQEVGPLDTFLKPRNSTWLKKADLLFVDSPVGAGYS
FVEGNQKDLYVKSDEEAAQDLTKLLQQLFNKNQTLNQSPLFIVAESYGGKIAVKLGLSVI
DAVQSGKLKLHLGGVILGDSWISPEDFVFSWGPLLKHVSRLDDNGLDSSNSLAEKIKTQI
KNGEYVGATQTWMDLENLISSKSNFVDFYNFLLDTGMDPVSLTTSLKIKKEEKIKKYSRY
LNDMRSLSDVEDVEGDLDKLMNGVIKKKLKIIPNDLIWGNNSDDVFTAMEAAFMKPVIED
VDELLATGVDVTIYNGQLDVICSTSGTEAWVHKLRWEGLEEFKKMEREPLFCESDRATRG
FTKSYKNLHFYWILGAGHFVPVDEPCVALKMVGEITKSPQL*
>AT2G27920.1 | SCPL51 (SERINE CARBOXYPEPTIDASE-LIKE 51) serine-type carboxypeptidase
MKTTVVYLVILCLIVSCTNGETKHVRKINSDGSEAWGYVEVRPKAHMFWWHYKSPYRVEN
PSKPWPIILWLQGGPGASGVGIGNFQEVGPLDTFLKPRNSTWLKKADLLFVDSPVGAGYS
FVEGNQKDLYVKSDEEAAQDLTKLLQQLFNKNQTLNQSPLFIVAESYGGKIAVKLGLSVI
DAVQSGKLKLHLGGVILGDSWISPEDFVFSWGPLLKHVSRLDDNGLDSSNSLAEKIKTQI
KNGEYVGATQTWMDLENLISSKSNFVDFYNFLLDTGMDPVSLTTSLKIKKEEKIKKYSRY
LNDMRSLSDVEDVEGDLDKLMNGVIKKKLKIIPNDLIWGNNSDDVFTAMEAAFMKPVIED
VDELLATGVDVTIYNGQLDVICSTSGTEAWVHKLRWEGLEEFKKMEREPLFCESDRATRG
FTKSYKNLHFYWILGAGHFVPVDEPCVALKMVGEITKSPQL*
>AT2G27920.3 | SCPL51 (SERINE CARBOXYPEPTIDASE-LIKE 51) serine-type carboxypeptidase
MAYHPLAPRWTWSFRGWYREFSGGWTIRHISQAQKLNLVEESRSFVCVGAGYSFVEGNQK
DLYVKSDEEAAQDLTKLLQQLFNKNQTLNQSPLFIVAESYGGKIAVKLGLSVIDAVQSGK
LKLHLGGVILGDSWISPEDFVFSWGPLLKHVSRLDDNGLDSSNSLAEKIKTQIKNGEYVG
ATQTWMDLENLISSKSNFVDFYNFLLDTGMDPVSLTTSLKIKKEEKIKKYSRYLNDMRSL
SDVEDVEGDLDKLMNGVIKKKLKIIPNDLIWGNNSDDVFTAMEAAFMKPVIEDVDELLAT
GVDVTIYNGQLDVICSTSGTEAWVHKLRWEGLEEFKKMEREPLFCESDRATRGFTKSYKN
LHFYWILGAGHFVPVDEPCVALKMVGEITKSPQL*
>AT2G27920.3 | SCPL51 (SERINE CARBOXYPEPTIDASE-LIKE 51) serine-type carboxypeptidase
MAYHPLAPRWTWSFRGWYREFSGGWTIRHISQAQKLNLVEESRSFVCVGAGYSFVEGNQK
DLYVKSDEEAAQDLTKLLQQLFNKNQTLNQSPLFIVAESYGGKIAVKLGLSVIDAVQSGK
LKLHLGGVILGDSWISPEDFVFSWGPLLKHVSRLDDNGLDSSNSLAEKIKTQIKNGEYVG
ATQTWMDLENLISSKSNFVDFYNFLLDTGMDPVSLTTSLKIKKEEKIKKYSRYLNDMRSL
SDVEDVEGDLDKLMNGVIKKKLKIIPNDLIWGNNSDDVFTAMEAAFMKPVIEDVDELLAT
GVDVTIYNGQLDVICSTSGTEAWVHKLRWEGLEEFKKMEREPLFCESDRATRGFTKSYKN
LHFYWILGAGHFVPVDEPCVALKMVGEITKSPQL*
>AT2G27920.3 | SCPL51 (SERINE CARBOXYPEPTIDASE-LIKE 51) serine-type carboxypeptidase
MAYHPLAPRWTWSFRGWYREFSGGWTIRHISQAQKLNLVEESRSFVCVGAGYSFVEGNQK
DLYVKSDEEAAQDLTKLLQQLFNKNQTLNQSPLFIVAESYGGKIAVKLGLSVIDAVQSGK
LKLHLGGVILGDSWISPEDFVFSWGPLLKHVSRLDDNGLDSSNSLAEKIKTQIKNGEYVG
ATQTWMDLENLISSKSNFVDFYNFLLDTGMDPVSLTTSLKIKKEEKIKKYSRYLNDMRSL
SDVEDVEGDLDKLMNGVIKKKLKIIPNDLIWGNNSDDVFTAMEAAFMKPVIEDVDELLAT
GVDVTIYNGQLDVICSTSGTEAWVHKLRWEGLEEFKKMEREPLFCESDRATRGFTKSYKN
LHFYWILGAGHFVPVDEPCVALKMVGEITKSPQL*
>AT3G54150.1 | embryo-abundant protein-related
MAALSEKEAEAYLDARPRYPIDWFKKIAARTQDHKFAWDVGTGNGQAAIGLVEHYENVVA
TDINEAQLQRAIKHSRISYHHTPTTISEDEMVDLLGGENSVDLIVAAQAVHFFDLNVFYN
VAKRVLRKEGGLIAVWVYNDIIISHEIDPIMKRLVDSTLPFRTPIMNLAFDGYKTLTFPF
ETIGMGSEGKPITLDIPHKLSLKGFIGFLRSWQPAMKAKEKGVELINEDLITKFEEAWGD
ETQVKDVFYKAHMIVGKIPEEKFESDQVLSNKGRLLLETEVGRNQKRPQPSDEGDRQSKK
QNTSEDDACKDKPSSVFSTHYRF*
>AT4G04700.1 | CPK27 ATP binding / calcium ion binding / kinase/ protein kinase/ protein serine/threonine kinase/ protein tyrosine kinase
MGCFSSKELQQSKRTILEKPLVDITKIYILGEELGRGNFGLTRKCVEKSTGKTFACKTIL
KTKLKDEECEEDVKREIRIMKQLSGEPNIVEFKNAYEDKDSVHIVMEYCGGGELYDKILA
LYDVGKSYSEKEAAGIIRSIVNVVKNCHYMGVMHRDLKPENFLLTSNDDNATVKVIDFGC
SVFIEEGKVYQDLAGSDYYIAPEVLQGNYGKEADIWSAGIILYILLCGKSPFVKEPEGQM
FNEIKSLEIDYSEEPWPLRDSRAIHLVKRMLDRNPKERISAAEVLGHPWMKEGEASDKPI
DGVVLSRLKRFRDANKFKKVVLKFIAANLSEEEIKGLKTLFTNIDTDKSGNITLEELKTG
LTRLGSNLSKTEVEQLMEAADMDGNGTIDIDEFISATMHRYKLDRDEHVYKAFQHFDKDN
DGHITKEELEMAMKEDGAGDEGSIKQIIADADTDNDGKINFEEFRTMMRTESSLQPEGEL
LPIIN*
>AT4G19880.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal (InterProIPR004046) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1635 Blast hits to 1635 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 158 Plants - 57 Viruses - 0 Other Eukaryotes - 480 (source NCBI BLink)
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVYAVHFKCNKKLIREYPNLFN
YTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPFGIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1621 Blast hits to 1621 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 155 Plants - 57 Viruses - 0 Other Eukaryotes - 469 (source NCBI BLink)
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVYAVHFKCNKKLIREYPNLFN
YTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPFGIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal (InterProIPR004046) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1635 Blast hits to 1635 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 158 Plants - 57 Viruses - 0 Other Eukaryotes - 480 (source NCBI BLink)
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSSYFQSKKKKYTICERISNV
ETLIQVYAVHFKCNKKLIREYPNLFNYTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPF
GIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1621 Blast hits to 1621 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 155 Plants - 57 Viruses - 0 Other Eukaryotes - 469 (source NCBI BLink)
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSSYFQSKKKKYTICERISNV
ETLIQVYAVHFKCNKKLIREYPNLFNYTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPF
GIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal (InterProIPR004046) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1635 Blast hits to 1635 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 158 Plants - 57 Viruses - 0 Other Eukaryotes - 480 (source NCBI BLink)
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSLCSPLQMQQETHKGVSEFV
QLHERHFPDPRYE*
>AT4G19880.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1621 Blast hits to 1621 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 155 Plants - 57 Viruses - 0 Other Eukaryotes - 469 (source NCBI BLink)
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSLCSPLQMQQETHKGVSEFV
QLHERHFPDPRYE*
>AT4G19880.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1567 Blast hits to 1567 proteins in 487 species Archae - 12 Bacteria - 899 Metazoa - 23 Fungi - 155 Plants - 55 Viruses - 0 Other Eukaryotes - 423 (source NCBI BLink)
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVYAVHFKCNKKLIREYPNLFN
YTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPFGIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1567 Blast hits to 1567 proteins in 487 species Archae - 12 Bacteria - 899 Metazoa - 23 Fungi - 155 Plants - 55 Viruses - 0 Other Eukaryotes - 423 (source NCBI BLink)
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSSYFQSKKKKYTICERISNV
ETLIQVYAVHFKCNKKLIREYPNLFNYTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPF
GIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1567 Blast hits to 1567 proteins in 487 species Archae - 12 Bacteria - 899 Metazoa - 23 Fungi - 155 Plants - 55 Viruses - 0 Other Eukaryotes - 423 (source NCBI BLink)
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSLCSPLQMQQETHKGVSEFV
QLHERHFPDPRYE*
>AT4G25950.1 | VATG3 (vacuolar ATP synthase G3) hydrolase acting on acid anhydrides catalyzing transmembrane movement of substances
MDSLRGQGGIQMLLTAEQEAGRIVSAARTAKLARMKQAKDEAEKEMEEYRSRLEEEYQTQ
VSGTDQEADAKRLDDETDVRITNLKESSSKVSKDIVKMLIKYVTTTAA*
>AT4G29580.1 | cytidine deaminase putative / cytidine aminohydrolase putative
MAQPPNPYAALTPTEAESSGPFEPETLLPLINRALPLAQALPSQSPLVAVGRGSSGRTFL
GVNVELPGLSPLHSIHAGQFLVVHLALNNERTLNCLAFSSNGSYFDPPCPHCCQLLQEIR
NASSTKLLITDPSRQRDMSLSTYLPQKYLSLYNEVPKYFFARLLDENRNNGLTLINPNPI
RDCLDSEICNHLSCRALKAANRSYAPYSKSPSGVALMDFQGRVYSGWSIESVANPILGAA
QAALVDFMTNGGGHEFNNIVRGFLVEKRDAKLSHLATAREILNKVAHFSFILRVLHCQ*
>AT4G29580.1 | cytidine deaminase putative / cytidine aminohydrolase putative
MAQPPNPYAALTPTEAESSGPFEPETLLPLINRALPLAQALPSQSPLVAVGRGSSGRTFL
GVNVELPGLSPLHSIHAGQFLVVHLALNNERTLNCLAFSSNGSYFDPPCPHCCQLLQEIR
NASSTKLLITDPSRQRDMSLSTYLPQKYLSLYNEVPKYFFARLLDENRNNGLTLINPNPI
RDCLDSEICNHLSCRALKAANRSYAPYSKSPSGVALMDFQGRVYSGWSIESVANPILGAA
QAALVDFMTNGGGHEFNNIVRGFLVEKRDAKLSHLATAREILNKVAHFSFILRVLHCQ*
>AT4G29580.2 | cytidine deaminase putative / cytidine aminohydrolase putative
MAQPPNPYAALTPTEAESSGPFEPETLLPLINRALPLAQALPSQSPLVAVGRGSSGRTFL
GVNVELPGLSPLHSIHAGQFLVVHLALNNERTLNCLAFSSNGSYFDPPCPHCCQLLQEIR
NASSTKLLITDPSRQRDMSLSTYLPQKYLSLYNEVPKYFFARLLDENRNNGLTLINPNPI
RDCLDSEICNHLSCRALKAANRSYAPYSKSPSGVALMDFQGRVYSGWSIESVANPILGAA
QAALVDFMTNGGGHEFNNIVRGFLVEKRDAKLSHLATAREILNKVAHFSFILRVLHCHKI
QCFFTHESPSMSEKEDFANILKDINSKRRRNIHHGTIYKTGDEKTLKFILGVNYKDMEAV
DEPPMRKRKVDELMHAVDPSNDSRFTFQTRVTRATCQKDEDLRILTEVCRIKPKIQGTGE
QSDGMTKLLMVSTKAIYLSGYRMIF*
>AT4G29580.2 | cytidine deaminase putative / cytidine aminohydrolase putative
MAQPPNPYAALTPTEAESSGPFEPETLLPLINRALPLAQALPSQSPLVAVGRGSSGRTFL
GVNVELPGLSPLHSIHAGQFLVVHLALNNERTLNCLAFSSNGSYFDPPCPHCCQLLQEIR
NASSTKLLITDPSRQRDMSLSTYLPQKYLSLYNEVPKYFFARLLDENRNNGLTLINPNPI
RDCLDSEICNHLSCRALKAANRSYAPYSKSPSGVALMDFQGRVYSGWSIESVANPILGAA
QAALVDFMTNGGGHEFNNIVRGFLVEKRDAKLSHLATAREILNKVAHFSFILRVLHCHKI
QCFFTHESPSMSEKEDFANILKDINSKRRRNIHHGTIYKTGDEKTLKFILGVNYKDMEAV
DEPPMRKRKVDELMHAVDPSNDSRFTFQTRVTRATCQKDEDLRILTEVCRIKPKIQGTGE
QSDGMTKLLMVSTKAIYLSGYRMIF*
>AT4G32350.1 | unknown protein
MFDGFLGRGFAPKGKPLIKLTKNRIDVLRRKRNATIKFLKRDLADLIINGHDYNAFSRAG
GLLDELRYLWSLDFVEQTCDFVYKQLSTMQKTPECPEDCREAISSLMFAASGFSELPELR
ELRQMFHEKYTDSLALFVNQELVENMSSKPFSMEKKVKLMEDVALEFSIRWDSKDFEKRI
VRQNSISVMETPKSTNDKYKPVDRNMALPKREEFEGSENGVSLNRKTAEASERRDPLFQS
DKESYQNGLRGNQRGLTYKERSENVLHASRSESKDNKAERKEFYLHSKQNPAREKHQPIF
NEGDTIVMKVNYGNLGQGNGHRPGVVDAHKKTEFVASERKEFYSQSKQEPSRERHQPIFN
DGDTIVMKVKHENHVQGNGHKNGVVDLHKKIEVNASEKLKSSSSKRADKLVIGFKQESFF
QGYKHEKNEEHAHQKVEDSTSRPPKPNSKSKRAESINPGSRHHNDRESKENAVLVGKSTE
EDPSGDNVKGGEYEYDHANPARKVEERETERMKSPFYKSLPPPYVKKSIAKARHEKAEAL
DNPKARFDGEEGNHPDNGKNVYGAERRNGAGHHEVNDIDNASLKRQTNRRKHIVESGGDD
HISSRRRENSRKGLQVLIDEDEKDSEEKMMDKLLMHYSKKPSSYEKDNVQEESKSRRTHL
KKGESDEEMMIHQPARSRSLPAEQLAGPSEPAKTFARAASFQPERSSEAKHVHPKLPNYD
DLAARFAELKGR*
>AT5G13240.1 | transcription regulator
MKFLEYTNLDRLNVFLGHLNLGERTIKGCLEAYSCKHAGSDKRLSLSLENEMLDYLGKSS
DTDSSSPVDLLLSRSSRKALIYLVLTLYQMYPDYDFSAVKAHQFFSEESWDTFKQIFNNY
MFEASKEWTERNEDGSLLEVIYKALDEVVKLAECEIYVYNPNPNADPFLEEGAIWSFCFL
FYNRKLKRVAGFRFSCTSNLANDAFLTDSPPYEEDEEIFADMDM*
>AT5G42600.1 | MRN1 (MARNERAL SYNTHASE) catalytic/ marneral synthase
MWRLRIGAEARQDPHLFTTNNFAGRQIWEFDANGGSPEELAEVEEARLNFANNKSRFKAS
PDLFWRRQFLREKKFEQKIPRVRIEDAEKITYEDAKTALRRGVLYYAACQANDGHWPSEV
SGSMFLDAPFVICLYITGHLEKIFTLEHVKELLRYMYNTQNEDGGWGLDVESHSVMFCTV
LNYICLRILGVEPDHDGQKSACARARKWILDHGGATYAPMVAKAWLSVLGVYDWSGCKPL
PPEIWMLPSFSPINGGTLWIYIRDLLMGMSYLYGKKFVATPTALILQLREELYPQPYSKI
IWSKARNRCAKEDLLYPKSFGQDLFWEGVHMLSENIINRWPLNKFVRQRALRTTMELVHY
HDETTHYITGACVAKPFHMLACWVEDPDGDYFKKHLARVPDFIWIAEDGLKFQLMGMQSW
NAALSLQVMLAANMDDEIRSTLIKGYDFLKQSQISENPQGDHLKMFRDITKGGWTFQDRE
QGLPISDGTAESIECCIHFHRMPSEFIGEKMDVEKLYDAVNFLIYLQSDNGGMPVWEPAP
GKKWLEWLSPVEHVENTVVEQEYLECTGSVIAGLVCFKKEFPDHRPKEIEKLIKKGLKYI
EDLQMPDGSWYGNWGVCFTYGTLFAVRGLAAAGKTFGNSEAIRRAVQFILNTQNAEGGWG
ESALSCPNKKYIPSKGNVTNVVNTGQAMMVLLIGGQMERDPSPVHRAAKVLINSQLDIGD
FPQQERRGIYMNMLLHYPTYRNMFSLWALALYTNALRLLVS*
>AT5G48010.1 | THAS1 (THALIANOL SYNTHASE 1) catalytic/ thalianol synthase
MWRLRTGPKAGEDTHLFTTNNYAGRQIWEFDANAGSPQEIAEVEDARHKFSDNTSRFKTT
ADLLWRMQFLREKKFEQKIPRVIIEDARKIKYEDAKTALKRGLLYFTALQADDGHWPAEN
SGPNFYTPPFLICLYITGHLEKIFTPEHVKELLRHIYNMQNEDGGWGLHVESHSVMFCTV
INYVCLRIVGEEVGHDDQRNGCAKAHKWIMDHGGATYTPLIGKALLSVLGVYDWSGCNPI
PPEFWLLPSSFPVNGGTLWIYLRDTFMGLSYLYGKKFVAPPTPLILQLREELYPEPYAKI
NWTQTRNRCGKEDLYYPRSFLQDLFWKSVHMFSESILDRWPLNKLIRQRALQSTMALIHY
HDESTRYITGGCLPKAFHMLACWIEDPKSDYFKKHLARVREYIWIGEDGLKIQSFGSQLW
DTALSLHALLDGIDDHDVDDEIKTTLVKGYDYLKKSQITENPRGDHFKMFRHKTKGGWTF
SDQDQGWPVSDCTAESLECCLFFESMPSELIGKKMDVEKLYDAVDYLLYLQSDNGGIAAW
QPVEGKAWLELLNIMIFRYVECTGSAIAALTQFNKQFPGYKNVEVKRFITKAAKYIEDMQ
TVDGSWYGNWGVCFIYGTFFAVRGLVAAGKTYSNCEAIRKAVRFLLDTQNPEGGWGESFL
SCPSKKYTPLKGNSTNVVQTAQALMVLIMGDQMERDPLPVHRAAQVLINSQLDNGDFPQQ
EIMGTFMRTVMLHFPTYRNTFSLWALTHYTHALRRLLP*
>AT5G48010.1 | THAS1 (THALIANOL SYNTHASE 1) catalytic/ thalianol synthase
MWRLRTGPKAGEDTHLFTTNNYAGRQIWEFDANAGSPQEIAEVEDARHKFSDNTSRFKTT
ADLLWRMQFLREKKFEQKIPRVIIEDARKIKYEDAKTALKRGLLYFTALQADDGHWPAEN
SGPNFYTPPFLICLYITGHLEKIFTPEHVKELLRHIYNMQNEDGGWGLHVESHSVMFCTV
INYVCLRIVGEEVGHDDQRNGCAKAHKWIMDHGGATYTPLIGKALLSVLGVYDWSGCNPI
PPEFWLLPSSFPVNGGTLWIYLRDTFMGLSYLYGKKFVAPPTPLILQLREELYPEPYAKI
NWTQTRNRCGKEDLYYPRSFLQDLFWKSVHMFSESILDRWPLNKLIRQRALQSTMALIHY
HDESTRYITGGCLPKAFHMLACWIEDPKSDYFKKHLARVREYIWIGEDGLKIQSFGSQLW
DTALSLHALLDGIDDHDVDDEIKTTLVKGYDYLKKSQITENPRGDHFKMFRHKTKGGWTF
SDQDQGWPVSDCTAESLECCLFFESMPSELIGKKMDVEKLYDAVDYLLYLQSDNGGIAAW
QPVEGKAWLELLNIMIFRYVECTGSAIAALTQFNKQFPGYKNVEVKRFITKAAKYIEDMQ
TVDGSWYGNWGVCFIYGTFFAVRGLVAAGKTYSNCEAIRKAVRFLLDTQNPEGGWGESFL
SCPSKKYTPLKGNSTNVVQTAQALMVLIMGDQMERDPLPVHRAAQVLINSQLDNGDFPQQ
EIMGTFMRTVMLHFPTYRNTFSLWALTHYTHALRRLLP*
>AT5G48010.2 | THAS1 (THALIANOL SYNTHASE 1) catalytic/ thalianol synthase
MWRLRTGPKAGEDTHLFTTNNYAGRQIWEFDANAGSPQEIAEVEDARHKFSDNTSRFKTT
ADLLWRMQFLREKKFEQKIPRVIIEDARKIKYEDAKTALKRGLLYFTALQADDGHWPAEN
SGPNFYTPPFLICLYITGHLEKIFTPEHVKELLRHIYNMQNEDGGWGLHVESHSVMFCTV
INYVCLRIVGEEVGHDDQRNGCAKAHKWIMDHGGATYTPLIGKALLSVLGVYDWSGCNPI
PPEFWLLPSSFPVNGGTLWIYLRDTFMGLSYLYGKKFVAPPTPLILQLREELYPEPYAKI
NWTQTRNRCGKEDLYYPRSFLQDLFWKSVHMFSESILDRWPLNKLIRQRALQSTMALIHY
HDESTRYITGGCLPKAFHMLACWIEDPKSDYFKKHLARVREYIWIGEDGLKIQSFGSQLW
DTALSLHALLDGIDDHDVDDEIKTTLVKGYDYLKKSQITENPRGDHFKMFRHKTKGGWTF
SDQDQGWPVSDCTAESLECCLFFESMPSELIGKKMDVEKLYDAVDYLLYLQSDNGGIAAW
QPVEGKAWLEWLSPVEFLEDTIVEYEYVECTGSAIAALTQFNKQFPGYKNVEVKRFITKA
AKYIEDMQTVDGSWYGNWGVCFIYGTFFAVRGLVAAGKTYSNCEAIRKAVRFLLDTQNPE
GGWGESFLSCPSKKYTPLKGNSTNVVQTAQALMVLIMGDQMERDPLPVHRAAQVLINSQL
DNGDFPQQEIMGTFMRTVMLHFPTYRNTFSLWALTHYTHALRRLLP*
>AT5G48010.2 | THAS1 (THALIANOL SYNTHASE 1) catalytic/ thalianol synthase
MWRLRTGPKAGEDTHLFTTNNYAGRQIWEFDANAGSPQEIAEVEDARHKFSDNTSRFKTT
ADLLWRMQFLREKKFEQKIPRVIIEDARKIKYEDAKTALKRGLLYFTALQADDGHWPAEN
SGPNFYTPPFLICLYITGHLEKIFTPEHVKELLRHIYNMQNEDGGWGLHVESHSVMFCTV
INYVCLRIVGEEVGHDDQRNGCAKAHKWIMDHGGATYTPLIGKALLSVLGVYDWSGCNPI
PPEFWLLPSSFPVNGGTLWIYLRDTFMGLSYLYGKKFVAPPTPLILQLREELYPEPYAKI
NWTQTRNRCGKEDLYYPRSFLQDLFWKSVHMFSESILDRWPLNKLIRQRALQSTMALIHY
HDESTRYITGGCLPKAFHMLACWIEDPKSDYFKKHLARVREYIWIGEDGLKIQSFGSQLW
DTALSLHALLDGIDDHDVDDEIKTTLVKGYDYLKKSQITENPRGDHFKMFRHKTKGGWTF
SDQDQGWPVSDCTAESLECCLFFESMPSELIGKKMDVEKLYDAVDYLLYLQSDNGGIAAW
QPVEGKAWLEWLSPVEFLEDTIVEYEYVECTGSAIAALTQFNKQFPGYKNVEVKRFITKA
AKYIEDMQTVDGSWYGNWGVCFIYGTFFAVRGLVAAGKTYSNCEAIRKAVRFLLDTQNPE
GGWGESFLSCPSKKYTPLKGNSTNVVQTAQALMVLIMGDQMERDPLPVHRAAQVLINSQL
DNGDFPQQEIMGTFMRTVMLHFPTYRNTFSLWALTHYTHALRRLLP*
>AT2G04100.1 | MATE efflux family protein
MEDPLLLGDNQIITGSLKPTPTWRMNFTAELKNLSRMALPMATVTVAQYLLPVISVMVAG
HRSELQLSGVALATSFTNVSGFSVMFGLAGALETLCGQAYGAKQYAKIGTYTFSAIVSNV
PIVVLISILWFYMDKLFVSLGQDPDISKVAGSYAVCLIPALLAQAVQQPLTRFLQTQGLV
LPLLYCAITTLLFHIPVCLILVYAFGLGSNGAALAIGLSYWFNVLILALYVRFSSSCEKT
RGFVSDDFVLSVKQFFQYGIPSAAMTTIEWSLFEFLILSSGLLPNPKLETSVLSICLTTS
SLHYVIPMGIGAAGSIRVSNELGAGNPEVARLAVFAGIFLWFLEATICSTLLFICRDIFG
YAFSNSKEVVDYVTELSPLLCISFLVDGFSAVLGGVARGSGWQHIGAWANVVAYYLLGAP
VGLFLGFWCHMNGKGLWIGVVVGSTAQGIILAIVTACMSWNEQAAKARQRIVVRTSSFGN
GLA*