>AT2G03880.1 | pentatricopeptide (PPR) repeat-containing protein
MKSVMSKIKLFRPVVTLRCSYSSTDQTLLLSEFTRLCYQRDLPRAMKAMDSLQSHGLWAD
SATYSELIKCCISNRAVHEGNLICRHLYFNGHRPMMFLVNVLINMYVKFNLLNDAHQLFD
QMPQRNVISWTTMISAYSKCKIHQKALELLVLMLRDNVRPNVYTYSSVLRSCNGMSDVRM
LHCGIIKEGLESDVFVRSALIDVFAKLGEPEDALSVFDEMVTGDAIVWNSIIGGFAQNSR
SDVALELFKRMKRAGFIAEQATLTSVLRACTGLALLELGMQAHVHIVKYDQDLILNNALV
DMYCKCGSLEDALRVFNQMKERDVITWSTMISGLAQNGYSQEALKLFERMKSSGTKPNYI
TIVGVLFACSHAGLLEDGWYYFRSMKKLYGIDPVREHYGCMIDLLGKAGKLDDAVKLLNE
MECEPDAVTWRTLLGACRVQRNMVLAEYAAKKVIALDPEDAGTYTLLSNIYANSQKWDSV
EEIRTRMRDRGIKKEPGCSWIEVNKQIHAFIIGDNSHPQIVEVSKKLNQLIHRLTGIGYV
PETNFVLQDLEGEQMEDSLRHHSEKLALAFGLMTLPIEKVIRIRKNLRICGDCHVFCKLA
SKLEIRSIVIRDPIRYHHFQDGKCSCGDYW*
>AT3G13770.1 | pentatricopeptide (PPR) repeat-containing protein
MFNLMRLIHRSFSSSPTNYVLQTILPISQLCSNGRLQEALLEMAMLGPEMGFHGYDALLN
ACLDKRALRDGQRVHAHMIKTRYLPATYLRTRLLIFYGKCDCLEDARKVLDEMPEKNVVS
WTAMISRYSQTGHSSEALTVFAEMMRSDGKPNEFTFATVLTSCIRASGLGLGKQIHGLIV
KWNYDSHIFVGSSLLDMYAKAGQIKEAREIFECLPERDVVSCTAIIAGYAQLGLDEEALE
MFHRLHSEGMSPNYVTYASLLTALSGLALLDHGKQAHCHVLRRELPFYAVLQNSLIDMYS
KCGNLSYARRLFDNMPERTAISWNAMLVGYSKHGLGREVLELFRLMRDEKRVKPDAVTLL
AVLSGCSHGRMEDTGLNIFDGMVAGEYGTKPGTEHYGCIVDMLGRAGRIDEAFEFIKRMP
SKPTAGVLGSLLGACRVHLSVDIGESVGRRLIEIEPENAGNYVILSNLYASAGRWADVNN
VRAMMMQKAVTKEPGRSWIQHEQTLHYFHANDRTHPRREEVLAKMKEISIKMKQAGYVPD
LSCVLYDVDEEQKEKMLLGHSEKLALTFGLIATGEGIPIRVFKNLRICVDCHNFAKIFSK
VFEREVSLRDKNRFHQIVDGICSCGDYW*
>AT1G34210.1 | SERK2 (SOMATIC EMBRYOGENESIS RECEPTOR-LIKE KINASE 2) kinase
MGRKKFEAFGFVCLISLLLLFNSLWLASSNMEGDALHSLRANLVDPNNVLQSWDPTLVNP
CTWFHVTCNNENSVIRVDLGNADLSGQLVPQLGQLKNLQYLELYSNNITGPVPSDLGNLT
NLVSLDLYLNSFTGPIPDSLGKLFKLRFLRLNNNSLTGPIPMSLTNIMTLQVLDLSNNRL
SGSVPDNGSFSLFTPISFANNLDLCGPVTSRPCPGSPPFSPPPPFIPPPIVPTPGGYSAT
GAIAGGVAAGAALLFAAPALAFAWWRRRKPQEFFFDVPAEEDPEVHLGQLKRFSLRELQV
ATDSFSNKNILGRGGFGKVYKGRLADGTLVAVKRLKEERTPGGELQFQTEVEMISMAVHR
NLLRLRGFCMTPTERLLVYPYMANGSVASCLRERPPSQLPLAWSIRQQIALGSARGLSYL
HDHCDPKIIHRDVKAANILLDEEFEAVVGDFGLARLMDYKDTHVTTAVRGTIGHIAPEYL
STGKSSEKTDVFGYGIMLLELITGQRAFDLARLANDDDVMLLDWVKGLLKEKKLEMLVDP
DLQSNYTEAEVEQLIQVALLCTQSSPMERPKMSEVVRMLEGDGLAEKWDEWQKVEVLRQE
VELSSHPTSDWILDSTDNLHAMELSGPR*
>AT5G27330.1 | LOCATED IN endoplasmic reticulum EXPRESSED IN 22 plant structures EXPRESSED DURING 12 growth stages CONTAINS InterPro DOMAIN/s Prefoldin (InterProIPR009053) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT3G051301) Has 110457 Blast hits to 56483 proteins in 2051 species Archae - 1653 Bacteria - 15111 Metazoa - 52441 Fungi - 7858 Plants - 4309 Viruses - 584 Other Eukaryotes - 28501 (source NCBI BLink)
MAKKKVSRNSNGASNEQQQIQNQSVPVTSQKSTKLSRESSMEDHDSSEEKFQNLKSLNAI
LLKQTMEKRQQIESLFQAKDSLEIELVRSGKEKTLLREELCGSSDENFMLKIEMDLLMGF
VEGRVKEMGVEVDWLFKEKSDRETEIRDLKREANGLIRKLESEREEFSRVCDERDLVKSG
FDLQSEEMNLLKESVVRLEMREVSLGEEVGRLKCENGRLVKERKKREEVIERGNRERSEL
VESLEEKVREIDVLKREIEGVVKEKMEVEMVRRDQREMIVELEKKLGDMNEIVESLTKER
EGLRGQVVGLEKSLDEVTEEAKARAEQINELVKEKTVKESELEGLMVENNSIKKEIEMAM
VQFSDKEKLVEQLLREKNELVQRVVNQEAEIVELSKLAGEQKHAVAQLRKDYNDQIKNGE
KLNCNVSQLKDALALVEVERDNAGKALDEEKRNMVALKEKVVALEKTNEATGKELEKIKA
ERGRLIKEKKELENRSESLRNEKAILQKDIVELKRATGVLKTELESAGTNAKQSLTMLKS
VSSLVCGIENKKDEKKRGKGMDSYSVQLEAIKKAFKNKESMVEEMKKELAKMKHSVEDAH
KKKSFWTLVSSVTSLLMAASVAYAASLK*
>AT2G25490.1 | EBF1 (EIN3-BINDING F BOX PROTEIN 1) protein binding / ubiquitin-protein ligase
MSQIFSFAGENDFYRRGAIYPNPKDASLLLSLGSFADVYFPPSKRSRVVAPTIFSAFEKK
PVSIDVLPDECLFEIFRRLSGPQERSACAFVSKQWLTLVSSIRQKEIDVPSKITEDGDDC
EGCLSRSLDGKKATDVRLAAIAVGTAGRGGLGKLSIRGSNSAKVSDLGLRSIGRSCPSLG
SLSLWNVSTITDNGLLEIAEGCAQLEKLELNRCSTITDKGLVAIAKSCPNLTELTLEACS
RIGDEGLLAIARSCSKLKSVSIKNCPLVRDQGIASLLSNTTCSLAKLKLQMLNVTDVSLA
VVGHYGLSITDLVLAGLSHVSEKGFWVMGNGVGLQKLNSLTITACQGVTDMGLESVGKGC
PNMKKAIISKSPLLSDNGLVSFAKASLSLESLQLEECHRVTQFGFFGSLLNCGEKLKAFS
LVNCLSIRDLTTGLPASSHCSALRSLSIRNCPGFGDANLAAIGKLCPQLEDIDLCGLKGI
TESGFLHLIQSSLVKINFSGCSNLTDRVISAITARNGWTLEVLNIDGCSNITDASLVSIA
ANCQILSDLDISKCAISDSGIQALASSDKLKLQILSVAGCSMVTDKSLPAIVGLGSTLLG
LNLQQCRSISNSTVDFLVERLYKCDILS*
>AT5G40410.1 | INVOLVED IN biological_process unknown CONTAINS InterPro DOMAIN/s Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT1G080701) Has 12213 Blast hits to 4641 proteins in 127 species Archae - 0 Bacteria - 0 Metazoa - 61 Fungi - 12 Plants - 12006 Viruses - 0 Other Eukaryotes - 134 (source NCBI BLink)
MIKANVYSCSKFRFLYRRRFLSQSSFVHSLDANVSSLIAAVKSCVSIELCRLLHCKVVKS
VSYRHGFIGDQLVGCYLRLGHDVCAEKLFDEMPERDLVSWNSLISGYSGRGYLGKCFEVL
SRMMISEVGFRPNEVTFLSMISACVYGGSKEEGRCIHGLVMKFGVLEEVKVVNAFINWYG
KTGDLTSSCKLFEDLSIKNLVSWNTMIVIHLQNGLAEKGLAYFNMSRRVGHEPDQATFLA
VLRSCEDMGVVRLAQGIHGLIMFGGFSGNKCITTALLDLYSKLGRLEDSSTVFHEITSPD
SMAWTAMLAAYATHGFGRDAIKHFELMVHYGISPDHVTFTHLLNACSHSGLVEEGKHYFE
TMSKRYRIDPRLDHYSCMVDLLGRSGLLQDAYGLIKEMPMEPSSGVWGALLGACRVYKDT
QLGTKAAERLFELEPRDGRNYVMLSNIYSASGLWKDASRIRNLMKQKGLVRASGCSYIEH
GNKIHKFVVGDWSHPESEKIQKKLKEIRKKMKSEMGYKSKTEFVLHDVGEDVKEEMINQH
SEKIAMAFGLLVVSPMEPIIIRKNLRICGDCHETAKAISLIEKRRIIIRDSKRFHHFLDG
SCSCSDYW*
>AT2G41080.1 | pentatricopeptide (PPR) repeat-containing protein
MSMYSKLGDFPSAVAVYGRMRKKNYMSSNILINGYVRAGDLVNARKVFDEMPDRKLTTWN
AMIAGLIQFEFNEEGLSLFREMHGLGFSPDEYTLGSVFSGSAGLRSVSIGQQIHGYTIKY
GLELDLVVNSSLAHMYMRNGKLQDGEIVIRSMPVRNLVAWNTLIMGNAQNGCPETVLYLY
KMMKISGCRPNKITFVTVLSSCSDLAIRGQGQQIHAEAIKIGASSVVAVVSSLISMYSKC
GCLGDAAKAFSEREDEDEVMWSSMISAYGFHGQGDEAIELFNTMAEQTNMEINEVAFLNL
LYACSHSGLKDKGLELFDMMVEKYGFKPGLKHYTCVVDLLGRAGCLDQAEAIIRSMPIKT
DIVIWKTLLSACNIHKNAEMAQRVFKEILQIDPNDSACYVLLANVHASAKRWRDVSEVRK
SMRDKNVKKEAGISWFEHKGEVHQFKMGDRSQSKSKEIYSYLKELTLEMKLKGYKPDTAS
VLHDMDEEEKESDLVQHSEKLAVAFALMILPEGAPIRIIKNLRVCSDCHVAFKYISVIKN
REITLRDGSRFHHFINGKCSCGDYW*
>AT1G21810.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 14 plant structures EXPRESSED DURING 6 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF869 plant (InterProIPR008587) BEST Arabidopsis thaliana protein match is myosin heavy chain-related (TAIRAT1G775802) Has 62951 Blast hits to 33517 proteins in 1593 species Archae - 800 Bacteria - 6354 Metazoa - 33836 Fungi - 4765 Plants - 2651 Viruses - 311 Other Eukaryotes - 14234 (source NCBI BLink)
MTGTTLILEPVMDSKDELVKQHAKVAEDAVAGWEKAENEVVELKQKLEDAADKNIVLEDR
VSHLDGALKECVRQLRQFRDEQEKNIQAAVTESTKELHSANTGLEKRVLELQKEAEAAKS
ENMMLRREFLTQREDLEIVMIERDLSTQAAETASKQHLDIIKKLAKLEAECRKLRILAKT
SSSLSSNQSVDSHSDGGRERVEGSCSDSWASSAFISELDQIKNEKGGNRSLQGTTSSTEI
DLMDDFLEMERLVALPTETQAKNSKDGYELSLMEKLEKIQAEKDDLEREVKCCREAEKRL
SLEIEAVVGDKMELEDMLKRVEAEKAELKTSFDVLKDKYQESRVCFQEVDTKLEKLQAEK
DELDSEVICCKEAEKRFSLELEAVVGDKIEMEDELEKMEAEKAELKISFDVIKDQYQESR
VCFQEVEMKLEAMKRELKLANESKTQAESRVTRMEAEVRKERIVSDGLKEKCETFEEELR
REIEEKTMIKREKVEPKIKQEDIATAAGKFADCQKTIASLGKQLQSLATLEEFLIDTASI
PGSARSVHNKEALLGKDPHECIKTINGRSLEFLAIQNSNNKTSPPCSSSSDSTTVSLIMS
SNRGSSEKNRNGFATVFTRSRNSVNLGI*