>AT2G03880.1 |  pentatricopeptide (PPR) repeat-containing protein 
MKSVMSKIKLFRPVVTLRCSYSSTDQTLLLSEFTRLCYQRDLPRAMKAMDSLQSHGLWAD 
SATYSELIKCCISNRAVHEGNLICRHLYFNGHRPMMFLVNVLINMYVKFNLLNDAHQLFD 
QMPQRNVISWTTMISAYSKCKIHQKALELLVLMLRDNVRPNVYTYSSVLRSCNGMSDVRM 
LHCGIIKEGLESDVFVRSALIDVFAKLGEPEDALSVFDEMVTGDAIVWNSIIGGFAQNSR 
SDVALELFKRMKRAGFIAEQATLTSVLRACTGLALLELGMQAHVHIVKYDQDLILNNALV 
DMYCKCGSLEDALRVFNQMKERDVITWSTMISGLAQNGYSQEALKLFERMKSSGTKPNYI 
TIVGVLFACSHAGLLEDGWYYFRSMKKLYGIDPVREHYGCMIDLLGKAGKLDDAVKLLNE 
MECEPDAVTWRTLLGACRVQRNMVLAEYAAKKVIALDPEDAGTYTLLSNIYANSQKWDSV 
EEIRTRMRDRGIKKEPGCSWIEVNKQIHAFIIGDNSHPQIVEVSKKLNQLIHRLTGIGYV 
PETNFVLQDLEGEQMEDSLRHHSEKLALAFGLMTLPIEKVIRIRKNLRICGDCHVFCKLA 
SKLEIRSIVIRDPIRYHHFQDGKCSCGDYW*
>AT3G13770.1 |  pentatricopeptide (PPR) repeat-containing protein 
MFNLMRLIHRSFSSSPTNYVLQTILPISQLCSNGRLQEALLEMAMLGPEMGFHGYDALLN 
ACLDKRALRDGQRVHAHMIKTRYLPATYLRTRLLIFYGKCDCLEDARKVLDEMPEKNVVS 
WTAMISRYSQTGHSSEALTVFAEMMRSDGKPNEFTFATVLTSCIRASGLGLGKQIHGLIV 
KWNYDSHIFVGSSLLDMYAKAGQIKEAREIFECLPERDVVSCTAIIAGYAQLGLDEEALE 
MFHRLHSEGMSPNYVTYASLLTALSGLALLDHGKQAHCHVLRRELPFYAVLQNSLIDMYS 
KCGNLSYARRLFDNMPERTAISWNAMLVGYSKHGLGREVLELFRLMRDEKRVKPDAVTLL 
AVLSGCSHGRMEDTGLNIFDGMVAGEYGTKPGTEHYGCIVDMLGRAGRIDEAFEFIKRMP 
SKPTAGVLGSLLGACRVHLSVDIGESVGRRLIEIEPENAGNYVILSNLYASAGRWADVNN 
VRAMMMQKAVTKEPGRSWIQHEQTLHYFHANDRTHPRREEVLAKMKEISIKMKQAGYVPD 
LSCVLYDVDEEQKEKMLLGHSEKLALTFGLIATGEGIPIRVFKNLRICVDCHNFAKIFSK 
VFEREVSLRDKNRFHQIVDGICSCGDYW*
>AT1G34210.1 |  SERK2 (SOMATIC EMBRYOGENESIS RECEPTOR-LIKE KINASE 2) kinase 
MGRKKFEAFGFVCLISLLLLFNSLWLASSNMEGDALHSLRANLVDPNNVLQSWDPTLVNP 
CTWFHVTCNNENSVIRVDLGNADLSGQLVPQLGQLKNLQYLELYSNNITGPVPSDLGNLT 
NLVSLDLYLNSFTGPIPDSLGKLFKLRFLRLNNNSLTGPIPMSLTNIMTLQVLDLSNNRL 
SGSVPDNGSFSLFTPISFANNLDLCGPVTSRPCPGSPPFSPPPPFIPPPIVPTPGGYSAT 
GAIAGGVAAGAALLFAAPALAFAWWRRRKPQEFFFDVPAEEDPEVHLGQLKRFSLRELQV 
ATDSFSNKNILGRGGFGKVYKGRLADGTLVAVKRLKEERTPGGELQFQTEVEMISMAVHR 
NLLRLRGFCMTPTERLLVYPYMANGSVASCLRERPPSQLPLAWSIRQQIALGSARGLSYL 
HDHCDPKIIHRDVKAANILLDEEFEAVVGDFGLARLMDYKDTHVTTAVRGTIGHIAPEYL 
STGKSSEKTDVFGYGIMLLELITGQRAFDLARLANDDDVMLLDWVKGLLKEKKLEMLVDP 
DLQSNYTEAEVEQLIQVALLCTQSSPMERPKMSEVVRMLEGDGLAEKWDEWQKVEVLRQE 
VELSSHPTSDWILDSTDNLHAMELSGPR*
>AT5G27330.1 |  LOCATED IN: endoplasmic reticulum; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 12 growth stages; CONTAINS InterPro DOMAIN/s: Prefoldin (InterPro:IPR009053); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G05130.1); Has 110457 Blast hits to 56483 proteins in 2051 species: Archae - 1653; Bacteria - 15111; Metazoa - 52441; Fungi - 7858; Plants - 4309; Viruses - 584; Other Eukaryotes - 28501 (source: NCBI BLink)
MAKKKVSRNSNGASNEQQQIQNQSVPVTSQKSTKLSRESSMEDHDSSEEKFQNLKSLNAI 
LLKQTMEKRQQIESLFQAKDSLEIELVRSGKEKTLLREELCGSSDENFMLKIEMDLLMGF 
VEGRVKEMGVEVDWLFKEKSDRETEIRDLKREANGLIRKLESEREEFSRVCDERDLVKSG 
FDLQSEEMNLLKESVVRLEMREVSLGEEVGRLKCENGRLVKERKKREEVIERGNRERSEL 
VESLEEKVREIDVLKREIEGVVKEKMEVEMVRRDQREMIVELEKKLGDMNEIVESLTKER 
EGLRGQVVGLEKSLDEVTEEAKARAEQINELVKEKTVKESELEGLMVENNSIKKEIEMAM 
VQFSDKEKLVEQLLREKNELVQRVVNQEAEIVELSKLAGEQKHAVAQLRKDYNDQIKNGE 
KLNCNVSQLKDALALVEVERDNAGKALDEEKRNMVALKEKVVALEKTNEATGKELEKIKA 
ERGRLIKEKKELENRSESLRNEKAILQKDIVELKRATGVLKTELESAGTNAKQSLTMLKS 
VSSLVCGIENKKDEKKRGKGMDSYSVQLEAIKKAFKNKESMVEEMKKELAKMKHSVEDAH 
KKKSFWTLVSSVTSLLMAASVAYAASLK*
>AT2G25490.1 |  EBF1 (EIN3-BINDING F BOX PROTEIN 1) protein binding / ubiquitin-protein ligase 
MSQIFSFAGENDFYRRGAIYPNPKDASLLLSLGSFADVYFPPSKRSRVVAPTIFSAFEKK 
PVSIDVLPDECLFEIFRRLSGPQERSACAFVSKQWLTLVSSIRQKEIDVPSKITEDGDDC 
EGCLSRSLDGKKATDVRLAAIAVGTAGRGGLGKLSIRGSNSAKVSDLGLRSIGRSCPSLG 
SLSLWNVSTITDNGLLEIAEGCAQLEKLELNRCSTITDKGLVAIAKSCPNLTELTLEACS 
RIGDEGLLAIARSCSKLKSVSIKNCPLVRDQGIASLLSNTTCSLAKLKLQMLNVTDVSLA 
VVGHYGLSITDLVLAGLSHVSEKGFWVMGNGVGLQKLNSLTITACQGVTDMGLESVGKGC 
PNMKKAIISKSPLLSDNGLVSFAKASLSLESLQLEECHRVTQFGFFGSLLNCGEKLKAFS 
LVNCLSIRDLTTGLPASSHCSALRSLSIRNCPGFGDANLAAIGKLCPQLEDIDLCGLKGI 
TESGFLHLIQSSLVKINFSGCSNLTDRVISAITARNGWTLEVLNIDGCSNITDASLVSIA 
ANCQILSDLDISKCAISDSGIQALASSDKLKLQILSVAGCSMVTDKSLPAIVGLGSTLLG 
LNLQQCRSISNSTVDFLVERLYKCDILS*
>AT5G40410.1 |  INVOLVED IN: biological_process unknown; CONTAINS InterPro DOMAIN/s: Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: pentatricopeptide (PPR) repeat-containing protein (TAIR:AT1G08070.1); Has 12213 Blast hits to 4641 proteins in 127 species: Archae - 0; Bacteria - 0; Metazoa - 61; Fungi - 12; Plants - 12006; Viruses - 0; Other Eukaryotes - 134 (source: NCBI BLink)
MIKANVYSCSKFRFLYRRRFLSQSSFVHSLDANVSSLIAAVKSCVSIELCRLLHCKVVKS 
VSYRHGFIGDQLVGCYLRLGHDVCAEKLFDEMPERDLVSWNSLISGYSGRGYLGKCFEVL 
SRMMISEVGFRPNEVTFLSMISACVYGGSKEEGRCIHGLVMKFGVLEEVKVVNAFINWYG 
KTGDLTSSCKLFEDLSIKNLVSWNTMIVIHLQNGLAEKGLAYFNMSRRVGHEPDQATFLA 
VLRSCEDMGVVRLAQGIHGLIMFGGFSGNKCITTALLDLYSKLGRLEDSSTVFHEITSPD 
SMAWTAMLAAYATHGFGRDAIKHFELMVHYGISPDHVTFTHLLNACSHSGLVEEGKHYFE 
TMSKRYRIDPRLDHYSCMVDLLGRSGLLQDAYGLIKEMPMEPSSGVWGALLGACRVYKDT 
QLGTKAAERLFELEPRDGRNYVMLSNIYSASGLWKDASRIRNLMKQKGLVRASGCSYIEH 
GNKIHKFVVGDWSHPESEKIQKKLKEIRKKMKSEMGYKSKTEFVLHDVGEDVKEEMINQH 
SEKIAMAFGLLVVSPMEPIIIRKNLRICGDCHETAKAISLIEKRRIIIRDSKRFHHFLDG 
SCSCSDYW*
>AT2G41080.1 |  pentatricopeptide (PPR) repeat-containing protein 
MSMYSKLGDFPSAVAVYGRMRKKNYMSSNILINGYVRAGDLVNARKVFDEMPDRKLTTWN 
AMIAGLIQFEFNEEGLSLFREMHGLGFSPDEYTLGSVFSGSAGLRSVSIGQQIHGYTIKY 
GLELDLVVNSSLAHMYMRNGKLQDGEIVIRSMPVRNLVAWNTLIMGNAQNGCPETVLYLY 
KMMKISGCRPNKITFVTVLSSCSDLAIRGQGQQIHAEAIKIGASSVVAVVSSLISMYSKC 
GCLGDAAKAFSEREDEDEVMWSSMISAYGFHGQGDEAIELFNTMAEQTNMEINEVAFLNL 
LYACSHSGLKDKGLELFDMMVEKYGFKPGLKHYTCVVDLLGRAGCLDQAEAIIRSMPIKT 
DIVIWKTLLSACNIHKNAEMAQRVFKEILQIDPNDSACYVLLANVHASAKRWRDVSEVRK 
SMRDKNVKKEAGISWFEHKGEVHQFKMGDRSQSKSKEIYSYLKELTLEMKLKGYKPDTAS 
VLHDMDEEEKESDLVQHSEKLAVAFALMILPEGAPIRIIKNLRVCSDCHVAFKYISVIKN 
REITLRDGSRFHHFINGKCSCGDYW*
>AT1G21810.1 |  FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 14 plant structures; EXPRESSED DURING: 6 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF869, plant (InterPro:IPR008587); BEST Arabidopsis thaliana protein match is: myosin heavy chain-related (TAIR:AT1G77580.2); Has 62951 Blast hits to 33517 proteins in 1593 species: Archae - 800; Bacteria - 6354; Metazoa - 33836; Fungi - 4765; Plants - 2651; Viruses - 311; Other Eukaryotes - 14234 (source: NCBI BLink)
MTGTTLILEPVMDSKDELVKQHAKVAEDAVAGWEKAENEVVELKQKLEDAADKNIVLEDR 
VSHLDGALKECVRQLRQFRDEQEKNIQAAVTESTKELHSANTGLEKRVLELQKEAEAAKS 
ENMMLRREFLTQREDLEIVMIERDLSTQAAETASKQHLDIIKKLAKLEAECRKLRILAKT 
SSSLSSNQSVDSHSDGGRERVEGSCSDSWASSAFISELDQIKNEKGGNRSLQGTTSSTEI 
DLMDDFLEMERLVALPTETQAKNSKDGYELSLMEKLEKIQAEKDDLEREVKCCREAEKRL 
SLEIEAVVGDKMELEDMLKRVEAEKAELKTSFDVLKDKYQESRVCFQEVDTKLEKLQAEK 
DELDSEVICCKEAEKRFSLELEAVVGDKIEMEDELEKMEAEKAELKISFDVIKDQYQESR 
VCFQEVEMKLEAMKRELKLANESKTQAESRVTRMEAEVRKERIVSDGLKEKCETFEEELR 
REIEEKTMIKREKVEPKIKQEDIATAAGKFADCQKTIASLGKQLQSLATLEEFLIDTASI 
PGSARSVHNKEALLGKDPHECIKTINGRSLEFLAIQNSNNKTSPPCSSSSDSTTVSLIMS 
SNRGSSEKNRNGFATVFTRSRNSVNLGI*