>AT1G21810.1 |  FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 14 plant structures; EXPRESSED DURING: 6 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF869, plant (InterPro:IPR008587); BEST Arabidopsis thaliana protein match is: myosin heavy chain-related (TAIR:AT1G77580.2); Has 62951 Blast hits to 33517 proteins in 1593 species: Archae - 800; Bacteria - 6354; Metazoa - 33836; Fungi - 4765; Plants - 2651; Viruses - 311; Other Eukaryotes - 14234 (source: NCBI BLink).
MTGTTLILEPVMDSKDELVKQHAKVAEDAVAGWEKAENEVVELKQKLEDAADKNIVLEDR 
VSHLDGALKECVRQLRQFRDEQEKNIQAAVTESTKELHSANTGLEKRVLELQKEAEAAKS 
ENMMLRREFLTQREDLEIVMIERDLSTQAAETASKQHLDIIKKLAKLEAECRKLRILAKT 
SSSLSSNQSVDSHSDGGRERVEGSCSDSWASSAFISELDQIKNEKGGNRSLQGTTSSTEI 
DLMDDFLEMERLVALPTETQAKNSKDGYELSLMEKLEKIQAEKDDLEREVKCCREAEKRL 
SLEIEAVVGDKMELEDMLKRVEAEKAELKTSFDVLKDKYQESRVCFQEVDTKLEKLQAEK 
DELDSEVICCKEAEKRFSLELEAVVGDKIEMEDELEKMEAEKAELKISFDVIKDQYQESR 
VCFQEVEMKLEAMKRELKLANESKTQAESRVTRMEAEVRKERIVSDGLKEKCETFEEELR 
REIEEKTMIKREKVEPKIKQEDIATAAGKFADCQKTIASLGKQLQSLATLEEFLIDTASI 
PGSARSVHNKEALLGKDPHECIKTINGRSLEFLAIQNSNNKTSPPCSSSSDSTTVSLIMS 
SNRGSSEKNRNGFATVFTRSRNSVNLGI*
>AT1G34210.1 |  SERK2 (SOMATIC EMBRYOGENESIS RECEPTOR-LIKE KINASE 2) kinase 
MGRKKFEAFGFVCLISLLLLFNSLWLASSNMEGDALHSLRANLVDPNNVLQSWDPTLVNP 
CTWFHVTCNNENSVIRVDLGNADLSGQLVPQLGQLKNLQYLELYSNNITGPVPSDLGNLT 
NLVSLDLYLNSFTGPIPDSLGKLFKLRFLRLNNNSLTGPIPMSLTNIMTLQVLDLSNNRL 
SGSVPDNGSFSLFTPISFANNLDLCGPVTSRPCPGSPPFSPPPPFIPPPIVPTPGGYSAT 
GAIAGGVAAGAALLFAAPALAFAWWRRRKPQEFFFDVPAEEDPEVHLGQLKRFSLRELQV 
ATDSFSNKNILGRGGFGKVYKGRLADGTLVAVKRLKEERTPGGELQFQTEVEMISMAVHR 
NLLRLRGFCMTPTERLLVYPYMANGSVASCLRERPPSQLPLAWSIRQQIALGSARGLSYL 
HDHCDPKIIHRDVKAANILLDEEFEAVVGDFGLARLMDYKDTHVTTAVRGTIGHIAPEYL 
STGKSSEKTDVFGYGIMLLELITGQRAFDLARLANDDDVMLLDWVKGLLKEKKLEMLVDP 
DLQSNYTEAEVEQLIQVALLCTQSSPMERPKMSEVVRMLEGDGLAEKWDEWQKVEVLRQE 
VELSSHPTSDWILDSTDNLHAMELSGPR*
>AT3G13770.1 |  pentatricopeptide (PPR) repeat-containing protein 
MFNLMRLIHRSFSSSPTNYVLQTILPISQLCSNGRLQEALLEMAMLGPEMGFHGYDALLN 
ACLDKRALRDGQRVHAHMIKTRYLPATYLRTRLLIFYGKCDCLEDARKVLDEMPEKNVVS 
WTAMISRYSQTGHSSEALTVFAEMMRSDGKPNEFTFATVLTSCIRASGLGLGKQIHGLIV 
KWNYDSHIFVGSSLLDMYAKAGQIKEAREIFECLPERDVVSCTAIIAGYAQLGLDEEALE 
MFHRLHSEGMSPNYVTYASLLTALSGLALLDHGKQAHCHVLRRELPFYAVLQNSLIDMYS 
KCGNLSYARRLFDNMPERTAISWNAMLVGYSKHGLGREVLELFRLMRDEKRVKPDAVTLL 
AVLSGCSHGRMEDTGLNIFDGMVAGEYGTKPGTEHYGCIVDMLGRAGRIDEAFEFIKRMP 
SKPTAGVLGSLLGACRVHLSVDIGESVGRRLIEIEPENAGNYVILSNLYASAGRWADVNN 
VRAMMMQKAVTKEPGRSWIQHEQTLHYFHANDRTHPRREEVLAKMKEISIKMKQAGYVPD 
LSCVLYDVDEEQKEKMLLGHSEKLALTFGLIATGEGIPIRVFKNLRICVDCHNFAKIFSK 
VFEREVSLRDKNRFHQIVDGICSCGDYW*
>AT5G27330.1 |  LOCATED IN: endoplasmic reticulum; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 12 growth stages; CONTAINS InterPro DOMAIN/s: Prefoldin (InterPro:IPR009053); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G05130.1); Has 110457 Blast hits to 56483 proteins in 2051 species: Archae - 1653; Bacteria - 15111; Metazoa - 52441; Fungi - 7858; Plants - 4309; Viruses - 584; Other Eukaryotes - 28501 (source: NCBI BLink).
MAKKKVSRNSNGASNEQQQIQNQSVPVTSQKSTKLSRESSMEDHDSSEEKFQNLKSLNAI 
LLKQTMEKRQQIESLFQAKDSLEIELVRSGKEKTLLREELCGSSDENFMLKIEMDLLMGF 
VEGRVKEMGVEVDWLFKEKSDRETEIRDLKREANGLIRKLESEREEFSRVCDERDLVKSG 
FDLQSEEMNLLKESVVRLEMREVSLGEEVGRLKCENGRLVKERKKREEVIERGNRERSEL 
VESLEEKVREIDVLKREIEGVVKEKMEVEMVRRDQREMIVELEKKLGDMNEIVESLTKER 
EGLRGQVVGLEKSLDEVTEEAKARAEQINELVKEKTVKESELEGLMVENNSIKKEIEMAM 
VQFSDKEKLVEQLLREKNELVQRVVNQEAEIVELSKLAGEQKHAVAQLRKDYNDQIKNGE 
KLNCNVSQLKDALALVEVERDNAGKALDEEKRNMVALKEKVVALEKTNEATGKELEKIKA 
ERGRLIKEKKELENRSESLRNEKAILQKDIVELKRATGVLKTELESAGTNAKQSLTMLKS 
VSSLVCGIENKKDEKKRGKGMDSYSVQLEAIKKAFKNKESMVEEMKKELAKMKHSVEDAH 
KKKSFWTLVSSVTSLLMAASVAYAASLK*
>AT2G25490.1 |  EBF1 (EIN3-BINDING F BOX PROTEIN 1) protein binding / ubiquitin-protein ligase 
MSQIFSFAGENDFYRRGAIYPNPKDASLLLSLGSFADVYFPPSKRSRVVAPTIFSAFEKK 
PVSIDVLPDECLFEIFRRLSGPQERSACAFVSKQWLTLVSSIRQKEIDVPSKITEDGDDC 
EGCLSRSLDGKKATDVRLAAIAVGTAGRGGLGKLSIRGSNSAKVSDLGLRSIGRSCPSLG 
SLSLWNVSTITDNGLLEIAEGCAQLEKLELNRCSTITDKGLVAIAKSCPNLTELTLEACS 
RIGDEGLLAIARSCSKLKSVSIKNCPLVRDQGIASLLSNTTCSLAKLKLQMLNVTDVSLA 
VVGHYGLSITDLVLAGLSHVSEKGFWVMGNGVGLQKLNSLTITACQGVTDMGLESVGKGC 
PNMKKAIISKSPLLSDNGLVSFAKASLSLESLQLEECHRVTQFGFFGSLLNCGEKLKAFS 
LVNCLSIRDLTTGLPASSHCSALRSLSIRNCPGFGDANLAAIGKLCPQLEDIDLCGLKGI 
TESGFLHLIQSSLVKINFSGCSNLTDRVISAITARNGWTLEVLNIDGCSNITDASLVSIA 
ANCQILSDLDISKCAISDSGIQALASSDKLKLQILSVAGCSMVTDKSLPAIVGLGSTLLG 
LNLQQCRSISNSTVDFLVERLYKCDILS*
>AT1G77580.2 |  myosin heavy chain-related 
MEKRKRESSERSFGESESVSSLSEKDSEIQPESTMESRDDEIQSPTVSLEVETEKEELKD 
SMKTLAEKLSAALANVSAKDDLVKQHVKVAEEAVAGWEKAENEVVELKEKLEAADDKNRV 
LEDRVSHLDGALKECVRQLRQARDEQEQRIQDAVIERTQELQSSRTSLENQIFETATKSE 
ELSQMAESVAKENVMLRHELLARCEELEIRTIERDLSTQAAETASKQQLDSIKKVAKLEA 
ECRKFRMLAKSSASFNDHRSTDSHSDGGERMDVSCSDSWASSTLIEKRSLQGTSSSIELD 
LMGDFLEMERLVALPETPDGNGKSGPESVTEEVVVPSENSLASEIEVLTSRIKELEEKLE 
KLEAEKHELENEVKCNREEAVVHIENSEVLTSRTKELEEKLEKLEAEKEELKSEVKCNRE 
KAVVHVENSLAAEIEVLTSRTKELEEQLEKLEAEKVELESEVKCNREEAVAQVENSLATE 
IEVLTCRIKQLEEKLEKLEVEKDELKSEVKCNREVESTLRFELEAIACEKMELENKLEKL 
EVEKAELQISFDIIKDKYEESQVCLQEIETKLGEIQTEMKLVNELKAEVESQTIAMEADA 
KTKSAKIESLEEDMRKERFAFDELRRKCEALEEEISLHKENSIKSENKEPKIKQEDIETA 
AGKLANCQKTIASLGKQLQSLATLEDFLTDTPIIPMAANGVSSSSNSESWKVHKNETFMT 
RNHPESIKPTKETSPSSSSSTASAAVSMPVSTNRGSSEKNRNGFATVFTRSKDGIHLAI*
>AT1G77580.1 |  myosin heavy chain-related 
MESRDDEIQSPTVSLEVETEKEELKDSMKTLAEKLSAALANVSAKDDLVKQHVKVAEEAV 
AGWEKAENEVVELKEKLEAADDKNRVLEDRVSHLDGALKECVRQLRQARDEQEQRIQDAV 
IERTQELQSSRTSLENQIFETATKSEELSQMAESVAKENVMLRHELLARCEELEIRTIER 
DLSTQAAETASKQQLDSIKKVAKLEAECRKFRMLAKSSASFNDHRSTDSHSDGGERMDVS 
CSDSWASSTLIEKRSLQGTSSSIELDLMGDFLEMERLVALPETPDGNGKSGPESVTEEVV 
VPSENSLASEIEVLTSRIKELEEKLEKLEAEKHELENEVKCNREEAVVHIENSEVLTSRT 
KELEEKLEKLEAEKEELKSEVKCNREKAVVHVENSLAAEIEVLTSRTKELEEQLEKLEAE 
KVELESEVKCNREEAVAQVENSLATEIEVLTCRIKQLEEKLEKLEVEKDELKSEVKCNRE 
VESTLRFELEAIACEKMELENKLEKLEVEKAELQISFDIIKDKYEESQVCLQEIETKLGE 
IQTEMKLVNELKAEVESQTIAMEADAKTKSAKIESLEEDMRKERFAFDELRRKCEALEEE 
ISLHKENSIKSENKEPKIKQVCLQSSGVS*