>AT1G62680.1 | LOCATED IN chloroplast EXPRESSED IN shoot apex leaf whorl flower seed EXPRESSED DURING F mature embryo stage petal differentiation and expansion stage CONTAINS InterPro DOMAIN/s Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT1G631301) Has 26421 Blast hits to 6237 proteins in 192 species Archae - 5 Bacteria - 22 Metazoa - 845 Fungi - 755 Plants - 23373 Viruses - 0 Other Eukaryotes - 1421 (source NCBI BLink)
MQRSIAMTAKRFLHRNLLENGKPRTASSPSFSHCSSCRCWVRASSSVSGGDLRERLSKTR
LRDIKLNDAIDLFSDMVKSRPFPSIVDFNRLLSAIVKLKKYDVVISLGKKMEVLGIRNDL
YTFNIVINCFCCCFQVSLALSILGKMLKLGYEPDRVTIGSLVNGFCRRNRVSDAVSLVDK
MVEIGYKPDIVAYNAIIDSLCKTKRVNDAFDFFKEIERKGIRPNVVTYTALVNGLCNSSR
WSDAARLLSDMIKKKITPNVITYSALLDAFVKNGKVLEAKELFEEMVRMSIDPDIVTYSS
LINGLCLHDRIDEANQMFDLMVSKGCLADVVSYNTLINGFCKAKRVEDGMKLFREMSQRG
LVSNTVTYNTLIQGFFQAGDVDKAQEFFSQMDFFGISPDIWTYNILLGGLCDNGELEKAL
VIFEDMQKREMDLDIVTYTTVIRGMCKTGKVEEAWSLFCSLSLKGLKPDIVTYTTMMSGL
CTKGLLHEVEALYTKMKQEGLMKNDCTLSDGDITLSAELIKKMLSCGYAPSLLKDIKSGV
CKKALSLL*
>AT4G30120.1 | HMA3 (HEAVY METAL ATPASE 3) ATPase coupled to transmembrane movement of ions phosphorylative mechanism
MAEGEESKKMNLQTSYFDVVGICCSSEVSIVGNVLRQVDGVKEFSVIVPSRTVIVVHDTF
LISPLQIVKALNQARLEASVRPYGETSLKSQWPSPFAIVSGVLLVLSFFKYFYSPLEWLA
IVAVVAGVFPILAKAVASVTRFRLDINALTLIAVIATLCMQDFTEAATIVFLFSVADWLE
SSAAHKASIVMSSLMSLAPRKAVIADTGLEVDVDEVGINTVVSVKAGESIPIDGVVVDGS
CDVDEKTLTGESFPVSKQRESTVMAATINLNGYIKVKTTALARDCVVAKMTKLVEEAQKS
QTKTQRFIDKCSRYYTPAVVVSAACFAVIPVLLKVQDLSHWFHLALVVLVSGCPCGLILS
TPVATFCALTKAATSGFLIKTGDCLETLAKIKIVAFDKTGTITKAEFMVSDFRSLSPSIN
LHKLLYWVSSIECKSSHPMAAALIDYARSVSVEPKPDIVENFQNFPGEGVYGRIDGQDIY
IGNKRIAQRAGCLTDNVPDIEATMKRGKTIGYIYMGAKLTGSFNLLDGCRYGVAQALKEL
KS*
>AT1G62670.1 | pentatricopeptide (PPR) repeat-containing protein
MRISFAIASTAKRFVHRSLVVRGNAATVSPSFSFFWRRAFSGKTSYDYREKLSRNGLSEL
KLDDAVALFGEMVKSRPFPSIIEFSKLLSAIAKMNKFDVVISLGEQMQNLGIPHNHYTYS
ILINCFCRRSQLPLALAVLGKMMKLGYEPNIVTLSSLLNGYCHSKRISEAVALVDQMFVT
GYQPNTVTFNTLIHGLFLHNKASEAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGDTDLA
FNLLNKMEQGKLEPGVLIYNTIIDGLCKYKHMDDALNLFKEMETKGIRPNVVTYSSLISC
LCNYGRWSDASRLLSDMIERKINPDVFTFSALIDAFVKEGKLVEAEKLYDEMVKRSIDPS
IVTYSSLINGFCMHDRLDEAKQMFEFMVSKHCFPDVVTYNTLIKGFCKYKRVEEGMEVFR
EMSQRGLVGNTVTYNILIQGLFQAGDCDMAQEIFKEMVSDGVPPNIMTYNTLLDGLCKNG
KLEKAMVVFEYLQRSKMEPTIYTYNIMIEGMCKAGKVEDGWDLFCNLSLKGVKPDVVAYN
TMISGFCRKGSKEEADALFKEMKEDGTLPNSGCYNTLIRARLRDGDREASAELIKEMRSC
GFAGDASTIGLVTNMLHDGRLDKSFLDMLS*
>AT1G62930.1 | INVOLVED IN biological_process unknown EXPRESSED IN 21 plant structures EXPRESSED DURING 14 growth stages CONTAINS InterPro DOMAIN/s Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT1G631301) Has 29147 Blast hits to 6292 proteins in 192 species Archae - 8 Bacteria - 22 Metazoa - 1258 Fungi - 858 Plants - 25291 Viruses - 0 Other Eukaryotes - 1710 (source NCBI BLink)
MTSCVHLGIVASQSKKMSLAKRFAQLRKASPLFSLRGVYFSAASYDYREKLSRNVLLDLK
LDDAVDLFGEMVQSRPLPSIVEFNKLLSAIAKMNKFDLVISLGERMQNLRISYDLYSYNI
LINCFCRRSQLPLALAVLGKMMKLGYEPDIVTLSSLLNGYCHGKRISEAVALVDQMFVME
YQPNTVTFNTLIHGLFLHNKASEAVALIDRMVARGCQPDLFTYGTVVNGLCKRGDIDLAL
SLLKKMEKGKIEADVVIYTTIIDALCNYKNVNDALNLFTEMDNKGIRPNVVTYNSLIRCL
CNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEKLYDEMIKRSIDPDI
FTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNTLIKGFCKAKRVEEGMELFRE
MSQRGLVGNTVTYNTLIQGLFQAGDCDMAQKIFKKMVSDGVPPDIITYSILLDGLCKYGK
LEKALVVFEYLQKSKMEPDIYTYNIMIEGMCKAGKVEDGWDLFCSLSLKGVKPNVIIYTT
MISGFCRKGLKEEADALFREMKEDGTLPNSGTYNTLIRARLRDGDKAASAELIKEMRSCG
FVGDASTISMVINMLHDGRLEKSYLEMLS*
>AT2G32830.1 | PHT5 inorganic phosphate transmembrane transporter/ phosphate transmembrane transporter
MAKKGKEVLNALDAAKTQMYHFTAIVIAGMGFFTDAYDLFSISLVTKLLGRIYYHVDSSK
KPGTLPPNVAAAVNGVAFCGTLAGQLFFGWLGDKLGRKKVYGITLMLMVLCSLGSGLSFG
HSANGVMATLCFFRFWLGFGIGGDYPLSATIMSEYANKKTRGAFIAAVFAMQGFGILAGG
IVSLIVSSTFDHAFKAPTYEVDPVGSTVPQADYVWRIVLMFGAIPALLTYYWRMKMPETA
RYTALVARNTKQAASDMSKVLQVDLIAEEEAQSNSNSSNPNFTFGLFTREFARRHGLHLL
GTTTTWFLLDIAYYSSNLFQKDIYTAIGWIPAAETMNAIHEVFTVSKAQTLIALCGTVPG
YWFTVAFIDILGRFFIQLMGFIFMTIFMFALAIPYDHWRHRENRIGFLIMYSLTMFFANF
GPNATTFVVPAEIFPARLRSTCHGISAASGKAGAIVGAFGFLYAAQSSDSEKTDAGYPPG
IGVRNSLLMLACVNFLGIVFTLLVPESKGKSLEEISREDEEQSGGDTVVEMTVANSGRKV
PV*
>AT1G03740.1 | ATP binding / kinase/ protein kinase/ protein serine/threonine kinase
MGCVNSRHRPFRRKSTTLKESSEEKRSSRIDSSRRIDDWIQPEDGFDRLSNSGDAKVRLI
ESEMFSTSRCHDHQIGKILENPATVAHMDRVVHDQELRRASSAVVDSDLDIDPKVVKAKL
DRWNSKDSKVRLIESEKLSSSMFSEHHQIEKGVEKPEVEASVRVVHRELKRGSSIVSPKD
AERKQVAAGWPSWLVSVAGESLVDWAPRRANTFEKLEKIGQGTYSSVYRARDLLHNKIVA
LKKVRFDLNDMESVKFMAREIIVMRRLDHPNVLKLEGLITAPVSSSLYLVFEYMDHDLLG
LSSLPGVKFTEPQVKCYMRQLLSGLEHCHSRGVLHRDIKGSNLLIDSKGVLKIADFGLAT
FFDPAKSVSLTSHVVTLWYRPPELLLGASHYGVGVDLWSTGCILGELYAGKPILPGKTEV
EQLHKIFKLCGSPTENYWRKQKLPSSAGFKTAIPYRRKVSEMFKDFPASVLSLLETLLSI
DPDHRSSADRALESEYFKTKPFACDPSNLPKYPPSKEIDAKMRDEAKRQQPMRAEKQERQ
DSMTRISHERKFVPPVKANNSLSMTMEKQYKDLRSRNDSFKSFKEERTPHGPVPDYQNMQ
HNRNNQTGVRISHSGPLMSNRNMAKSTMHVKENALPRYPPARVNPKMLSGSVSSKTLLER
QDQPVTNQRRRDRRAYNRADTMDSRHMTAPIDPSWYNPSDSKIYMSGPLLAQPSRVDQML
EEHDRQLQEFNRQALKTPQG*
>AT1G03740.1 | ATP binding / kinase/ protein kinase/ protein serine/threonine kinase/ protein tyrosine kinase
MGCVNSRHRPFRRKSTTLKESSEEKRSSRIDSSRRIDDWIQPEDGFDRLSNSGDAKVRLI
ESEMFSTSRCHDHQIGKILENPATVAHMDRVVHDQELRRASSAVVDSDLDIDPKVVKAKL
DRWNSKDSKVRLIESEKLSSSMFSEHHQIEKGVEKPEVEASVRVVHRELKRGSSIVSPKD
AERKQVAAGWPSWLVSVAGESLVDWAPRRANTFEKLEKIGQGTYSSVYRARDLLHNKIVA
LKKVRFDLNDMESVKFMAREIIVMRRLDHPNVLKLEGLITAPVSSSLYLVFEYMDHDLLG
LSSLPGVKFTEPQVKCYMRQLLSGLEHCHSRGVLHRDIKGSNLLIDSKGVLKIADFGLAT
FFDPAKSVSLTSHVVTLWYRPPELLLGASHYGVGVDLWSTGCILGELYAGKPILPGKTEV
EQLHKIFKLCGSPTENYWRKQKLPSSAGFKTAIPYRRKVSEMFKDFPASVLSLLETLLSI
DPDHRSSADRALESEYFKTKPFACDPSNLPKYPPSKEIDAKMRDEAKRQQPMRAEKQERQ
DSMTRISHERKFVPPVKANNSLSMTMEKQYKDLRSRNDSFKSFKEERTPHGPVPDYQNMQ
HNRNNQTGVRISHSGPLMSNRNMAKSTMHVKENALPRYPPARVNPKMLSGSVSSKTLLER
QDQPVTNQRRRDRRAYNRADTMDSRHMTAPIDPSWYNPSDSKIYMSGPLLAQPSRVDQML
EEHDRQLQEFNRQALKTPQG*
>AT1G03740.2 | ATP binding / kinase/ protein kinase/ protein serine/threonine kinase
MGCVNSRHRPFRRKSTTLKESSEEKRSSRIDSSRRIDDWIQPEDGFDRLSNSGDAKVRLI
ESEMFSTSRCHDHQIGKILENPATVAHMDRVVHDQELRRASSAVVDSDLDIDPKVVKAKL
DRWNSKDSKVRLIESEKLSSSMFSEHHQIEKGVEKPEVEASVRVVHRELKRGSSIVSPKD
AERKQVAAGWPSWLVSVAGESLVDWAPRRANTFEKLEKIGQGTYSSVYRARDLLHNKIVA
LKKVRFDLNDMESVKFMAREIIVMRRLDHPNVLKLEGLITAPVSSSLYLVFEYMDHDLLG
LSSLPGVKFTEPQVKCYMRQLLSGLEHCHSRGVLHRDIKGSNLLIDSKGVLKIADFGLAT
FFDPAKSVSLTSHVVTLWYRPPELLLGASHYGVGVDLWSTGCILGELYAGKPILPGKTEV
EQLHKIFKLCGSPTENYWRKQKLPSSAGFKTAIPYRRKVSEMFKDFPASVLSLLETLLSI
DPDHRSSADRALESEYFKTKPFACDPSNLPKYPPSKEIDAKMRDEAKRQQPMRAEKQERQ
DSMTRISHERKFVPPVKANNSLSMTMEKQYKDLRSRNDSFKSFKEERTPHGPVPDYQNMQ
HNRNNQTGVRISHSGPLMSNRNMAKSTMHVKENALPRYPPARVNPKMLSGSVSSKTLLER
QDQPVTNQRRRDRRAYNRADTMDSRHMTAPIDPSWVS*
>AT1G03740.2 | ATP binding / kinase/ protein kinase/ protein serine/threonine kinase/ protein tyrosine kinase
MGCVNSRHRPFRRKSTTLKESSEEKRSSRIDSSRRIDDWIQPEDGFDRLSNSGDAKVRLI
ESEMFSTSRCHDHQIGKILENPATVAHMDRVVHDQELRRASSAVVDSDLDIDPKVVKAKL
DRWNSKDSKVRLIESEKLSSSMFSEHHQIEKGVEKPEVEASVRVVHRELKRGSSIVSPKD
AERKQVAAGWPSWLVSVAGESLVDWAPRRANTFEKLEKIGQGTYSSVYRARDLLHNKIVA
LKKVRFDLNDMESVKFMAREIIVMRRLDHPNVLKLEGLITAPVSSSLYLVFEYMDHDLLG
LSSLPGVKFTEPQVKCYMRQLLSGLEHCHSRGVLHRDIKGSNLLIDSKGVLKIADFGLAT
FFDPAKSVSLTSHVVTLWYRPPELLLGASHYGVGVDLWSTGCILGELYAGKPILPGKTEV
EQLHKIFKLCGSPTENYWRKQKLPSSAGFKTAIPYRRKVSEMFKDFPASVLSLLETLLSI
DPDHRSSADRALESEYFKTKPFACDPSNLPKYPPSKEIDAKMRDEAKRQQPMRAEKQERQ
DSMTRISHERKFVPPVKANNSLSMTMEKQYKDLRSRNDSFKSFKEERTPHGPVPDYQNMQ
HNRNNQTGVRISHSGPLMSNRNMAKSTMHVKENALPRYPPARVNPKMLSGSVSSKTLLER
QDQPVTNQRRRDRRAYNRADTMDSRHMTAPIDPSWVS*
>AT4G00350.1 | MATE efflux family protein
MEIPVREERRSSSSSAGPLQQTISLAADDAIDSGPSSPLVVKVSVFETEHETTKLIHAPS
TLLGETTGDADFPPIQSFRDAKLVCVVETSKLWEIAAPIAFNILCNYGVNSFTSIFVGHI
GDLELSAVAIALSVVSNFSFGFLLGMASALETLCGQAFGAGQMDMLGVYMQRSWLILLGT
SVCLLPLYIYATPLLILLGQEPEIAEISGKFTTQIIPQMFALAINFPTQKFLQSQSKVGI
MAWIGFFALTLHIFILYLFINVFKWGLNGAAAAFDVSAWGIAIAQVVYVVGWCKDGWKGL
SWLAFQDVWPFLKLSFASAVMLCLEIWYFMTIIVLTGHLEDPVIAVGSLSICMNINGWEG
MLFIGINAAISVRVSNELGSGHPRAAKYSVIVTVIESLVIGVVCAIVILITRDDFAVIFT
ESEEMRKAVADLAYLLGITMILNSLQPVISGVAVGGGWQAPVAYINLFCYYAFGLPLGFL
LGYKTSLGVQGIWIGMICGTSLQTLILLYMIYITNWNKEVEQASERMKQWGAGYEKLEKI
AT*
>AT4G23220.1 | kinase
MFEFVYDVLLDSTEIVNESTEILICLLMVHTPHNFFFFFFFDKGDRQKKVCILFILTCCP
NVNIICTILKMELKNLFPIFWFVLVGFAVVSAQECGKTGFFVPQSRYETNRGLLLSSLPS
NVSARGGFYNSSIGQGPDRVYALGMCIEGAEPDVCSDCIEYASNLLLDTCLNQTEGLAWP
EKRILCMVRYSNSSFFGSLKAEPHFYIHNVDDITSNLTEFDQVWEELARRMIASTTSPSS
KRKYYAADVAALTAFQIIYALMQCTPDLSLEDCHICLRQSVGDYETCCNGKQGGIVYRAS
CVFRWELFPFSEAFSRISLAPPPQSPAFPTLPAVTNTATKKGSITISIGIVWAIIIPTVI
VVFLVLLALGFVVYRRRKSYQGSSTDITITHSLQFDFKAIEDATNKFSESNIIGRGGFGE
VFMGVLNGTEVAIKRLSKASRQGAREFKNEVVVVAKLHHRNLVKLLGFCLEGEEKILVYE
FVPNKSLDYFLFDPTKQGQLDWTKRYNIIRGITRGILYLHQDSRLTIIHRDLKASNILLD
ADMNPKIADFGMARIFGIDQSGANTKKIAGTRGYMPPEYVRQGQFSTRSDVYSFGVLVLE
IICGRNNRFIHQSDTTVENLVTYAWRLWRNDSPLELVDPTISENCETEEVTRCIHIALLC
VQHNPTDRPSLSTINMMLINNSYVLPDPQQPGFFFPIISNQERDGLDSMNRSNPQTINDV
TITDFEPR*
>AT1G62940.1 | ACOS5 (ACYL-COA SYNTHETASE 5) 4-coumarate-CoA ligase/ long-chain-fatty-acid-CoA ligase/ medium-chain-fatty-acid-CoA ligase
MESQKQEDNEYIFRSLYPSVPIPDKLTLPEFVLQGVEEYTENVAFVEAVTGKAVTYGDVV
RDTKRLAKALTSLGLRKGQVMVVVLPNVAEYGIIALGIMSAGGVFSGANPTALVSEIKKQ
VEASGARGIITDATNYEKVKSLGLPVIVLGEEKIEGAVNWKDLLEAGDKCGDTDNEEILQ
TDLCALPFSSGTTGLQKGVMLTHRNLIANLCSTLFGVRSEMIGQIVTLGLIPFFHIYGIV
GICCATMKNKGKVVAMSRYDLRIFLNALIAHEVSFAPIVPPIILNLVKNPIVDEFDLSKL
KLQSVMTAAAPLAPELLTAFEAKFPNVQVQEAYGLTEHSCITLTHGDPEKGQGIAKRNSV
GFILPNLEVKFIDPDTGRSLPKNTSGELCVRSQCVMQGYFMNKEETDKTIDEQGWLHTGD
IGYIDDDGDIFIVDRIKELIKYKGFQVAPAELEAILLTHPSVEDVAVVPLPDEEAGEIPA
ACVVINPKATEKEEDILNFVAANVAHYKKVRAVHFVDSIPKSLSGKIMRRLLRDKILSIN
KK*
>AT4G33370.1 | DEAD-box protein abstrakt putative
MEVDDGYVEYVPVEERLAQMKRKVVEEPGKGMMEHLSDKKKLMSVGELARGITYTEPLST
WWKPPLHVRKMSTKQMDLIRKQWHITVNGEDIPPPIKNFMDMKFPSPLLRMLKDKGIMHP
TPIQVQGLPVVLSGRDMIGIAFTGSGKTLVFVLPMIILALQEEIMMPIAAGEGPIALVIC
PSRELAKQTYDVVEQFVASLVEDGYPRLRSLLCIGGVDMRSQLDVVKKGVHIVVATPGRL
KDILAKKKMSLDACRLLTLDEADRLVDLGFEDDIRHVFDHFKSQRQTLLFSATMPAKIQI
FATSALVKPVTVNVGRAGAANLDVIQEVEYVKQEAKIVYLLECLQKTTPPVLIFCENKAD
VDDIHEYLLLKGVEAVAIHGGKDQEDRDYAISLFKAGKKDVLVATDVASKGLDFPDIQHV
INYDMPGEIENYVHRIGRTGRCGKTGIATTFINKNQSEITLLDLKHLLQEAKQRIPPVLA
ELNGPMEETETIANASGVKGCAYCGGLGHRILQCPKFEHQKSVAISSSRKDHFGSDGYRG
EV*