>AT1G26900.1 |  pentatricopeptide (PPR) repeat-containing protein 
MTLAITSRLRRNFVFRRRNLESLLSPQCQKLINDLRSCRDTVEVSRIHGYMVKTGLDKDD 
FAVSKLLAFSSVLDIRYASSIFEHVSNTNLFMFNTMIRGYSISDEPERAFSVFNQLRAKG 
LTLDRFSFITTLKSCSRELCVSIGEGLHGIALRSGFMVFTDLRNALIHFYCVCGKISDAR 
KVFDEMPQSVDAVTFSTLMNGYLQVSKKALALDLFRIMRKSEVVVNVSTLLSFLSAISDL 
GDLSGAESAHVLCIKIGLDLDLHLITALIGMYGKTGGISSARRIFDCAIRKDVVTWNCMI 
DQYAKTGLLEECVWLLRQMKYEKMKPNSSTFVGLLSSCAYSEAAFVGRTVADLLEEERIA 
LDAILGTALVDMYAKVGLLEKAVEIFNRMKDKDVKSWTAMISGYGAHGLAREAVTLFNKM 
EEENCKVRPNEITFLVVLNACSHGGLVMEGIRCFKRMVEAYSFTPKVEHYGCVVDLLGRA 
GQLEEAYELIRNLPITSDSTAWRALLAACRVYGNADLGESVMMRLAEMGETHPADAILLA 
GTHAVAGNPEKSLDNELNKGRKEAGYSAIEIE*
>AT2G45350.1 |  CRR4 (CHLORORESPIRATORY REDUCTION 4) 
MECSISSTIHVLGSCKTSDDVNQIHGRLIKTGIIKNSNLTTRIVLAFASSRRPYLADFAR 
CVFHEYHVCSFSFGEVEDPFLWNAVIKSHSHGKDPRQALLLLCLMLENGVSVDKFSLSLV 
LKACSRLGFVKGGMQIHGFLKKTGLWSDLFLQNCLIGLYLKCGCLGLSRQMFDRMPKRDS 
VSYNSMIDGYVKCGLIVSARELFDLMPMEMKNLISWNSMISGYAQTSDGVDIASKLFADM 
PEKDLISWNSMIDGYVKHGRIEDAKGLFDVMPRRDVVTWATMIDGYAKLGFVHHAKTLFD 
QMPHRDVVAYNSMMAGYVQNKYHMEALEIFSDMEKESHLLPDDTTLVIVLPAIAQLGRLS 
KAIDMHLYIVEKQFYLGGKLGVALIDMYSKCGSIQHAMLVFEGIENKSIDHWNAMIGGLA 
IHGLGESAFDMLLQIERLSLKPDDITFVGVLNACSHSGLVKEGLLCFELMRRKHKIEPRL 
QHYGCMVDILSRSGSIELAKNLIEEMPVEPNDVIWRTFLTACSHHKEFETGELVAKHLIL 
QAGYNPSSYVLLSNMYASFGMWKDVRRVRTMMKERKIEKIPGCSWIELDGRVHEFFVDSI 
EVSSTL*
>AT4G38010.1 |  pentatricopeptide (PPR) repeat-containing protein 
MYLPEKSVLLELISRCSSLRVFKQIQTQLITRDLLRDDLIINKVVTFLGKSADFASYSSV 
ILHSIRSVLSSFSYNTLLSSYAVCDKPRVTIFAYKTFVSNGFSPDMFTFPPVFKACGKFS 
GIREGKQIHGIVTKMGFYDDIYVQNSLVHFYGVCGESRNACKVFGEMPVRDVVSWTGIIT 
GFTRTGLYKEALDTFSKMDVEPNLATYVCVLVSSGRVGCLSLGKGIHGLILKRASLISLE 
TGNALIDMYVKCEQLSDAMRVFGELEKKDKVSWNSMISGLVHCERSKEAIDLFSLMQTSS 
GIKPDGHILTSVLSACASLGAVDHGRWVHEYILTAGIKWDTHIGTAIVDMYAKCGYIETA 
LEIFNGIRSKNVFTWNALLGGLAIHGHGLESLRYFEEMVKLGFKPNLVTFLAALNACCHT 
GLVDEGRRYFHKMKSREYNLFPKLEHYGCMIDLLCRAGLLDEALELVKAMPVKPDVRICG 
AILSACKNRGTLMELPKEILDSFLDIEFEDSGVYVLLSNIFAANRRWDDVARIRRLMKVK 
GISKVPGSSYIEKFMTLDQ*
>AT3G05240.1 |  pentatricopeptide (PPR) repeat-containing protein 
MMKKHYKPILSQLENCRSLVELNQLHGLMIKSSVIRNVIPLSRLIDFCTTCPETMNLSYA 
RSVFESIDCPSVYIWNSMIRGYSNSPNPDKALIFYQEMLRKGYSPDYFTFPYVLKACSGL 
RDIQFGSCVHGFVVKTGFEVNMYVSTCLLHMYMCCGEVNYGLRVFEDIPQWNVVAWGSLI 
SGFVNNNRFSDAIEAFREMQSNGVKANETIMVDLLVACGRCFDPYFQSKVGFNVILATSL 
IDMYAKCGDLRTARYLFDGMPERTLVSWNSIITGYSQNGDAEEALCMFLDMLDLGIAPDK 
VTFLSVIRASMIQGCSQLGQSIHAYVSKTGFVKDAAIVCALVNMYAKTGDAESAKKAFED 
LEKKDTIAWTVVIIGLASHGHGNEALSIFQRMQEKGNATPDGITYLGVLYACSHIGLVEE 
GQRYFAEMRDLHGLEPTVEHYGCMVDILSRAGRFEEAERLVKTMPVKPNVNIWGALLNGC 
DIHENLELTDRIRSMVAEPEELGSGIYVLLSNIYAKAGRWADVKLIRESMKSKRVDKVLG 
HSSVETMF*
>AT2G34400.1 |  pentatricopeptide (PPR) repeat-containing protein 
MLIKPEKLAFSIYRQFPKFKPRQFEEARRGDLERDFLFLLKKCISVNQLRQIQAQMLLHS 
VEKPNFLIPKAVELGDFNYSSFLFSVTEEPNHYSFNYMIRGLTNTWNDHEAALSLYRRMK 
FSGLKPDKFTYNFVFIACAKLEEIGVGRSVHSSLFKVGLERDVHINHSLIMMYAKCGQVG 
YARKLFDEITERDTVSWNSMISGYSEAGYAKDAMDLFRKMEEEGFEPDERTLVSMLGACS 
HLGDLRTGRLLEEMAITKKIGLSTFLGSKLISMYGKCGDLDSARRVFNQMIKKDRVAWTA 
MITVYSQNGKSSEAFKLFFEMEKTGVSPDAGTLSTVLSACGSVGALELGKQIETHASELS 
LQHNIYVATGLVDMYGKCGRVEEALRVFEAMPVKNEATWNAMITAYAHQGHAKEALLLFD 
RMSVPPSDITFIGVLSACVHAGLVHQGCRYFHEMSSMFGLVPKIEHYTNIIDLLSRAGML 
DEAWEFMERFPGKPDEIMLAAILGACHKRKDVAIREKAMRMLMEMKEAKNAGNYVISSNV 
LADMKMWDESAKMRALMRDRGVVKTPGCSWIEIEGELMEFLAGSDYLQCGREDSGSLFDL 
LVEEMKRERYEFGYIHL*
>AT1G31430.1 |  pentatricopeptide (PPR) repeat-containing protein 
MNMSLLQTPSLLMYNKMLKSLADGKSFTKVLALFGELRGQGLYPDNFTLPVVLKSIGRLR 
KVIEGEKVHGYAVKAGLEFDSYVSNSLMGMYASLGKIEITHKVFDEMPQRDVVSWNGLIS 
SYVGNGRFEDAIGVFKRMSQESNLKFDEGTIVSTLSACSALKNLEIGERIYRFVVTEFEM 
SVRIGNALVDMFCKCGCLDKARAVFDSMRDKNVKCWTSMVFGYVSTGRIDEARVLFERSP 
VKDVVLWTAMMNGYVQFNRFDEALELFRCMQTAGIRPDNFVLVSLLTGCAQTGALEQGKW 
IHGYINENRVTVDKVVGTALVDMYAKCGCIETALEVFYEIKERDTASWTSLIYGLAMNGM 
SGRALDLYYEMENVGVRLDAITFVAVLTACNHGGFVAEGRKIFHSMTERHNVQPKSEHCS 
CLIDLLCRAGLLDEAEELIDKMRGESDETLVPVYCSLLSAARNYGNVKIAERVAEKLEKV 
EVSDSSAHTLLASVYASANRWEDVTNVRRKMKDLGIRKFPGCSSIEIDGVGHEFIVGDDL 
LSHPKMDEINSMLHQTTNLMLDLEHKEIDS*
>AT1G64310.1 |  pentatricopeptide (PPR) repeat-containing protein 
MASQTQLRLIIYEFTRKIQTRLNTQKLHSFVTKSKLARDPYFATQLARFYALNDDLISAR 
KLFDVFPERSVFLWNSIIRAYAKAHQFTTVLSLFSQILRSDTRPDNFTYACLARGFSESF 
DTKGLRCIHGIAIVSGLGFDQICGSAIVKAYSKAGLIVEASKLFCSIPDPDLALWNVMIL 
GYGCCGFWDKGINLFNLMQHRGHQPNCYTMVALTSGLIDPSLLLVAWSVHAFCLKINLDS 
HSYVGCALVNMYSRCMCIASACSVFNSISEPDLVACSSLITGYSRCGNHKEALHLFAELR 
MSGKKPDCVLVAIVLGSCAELSDSVSGKEVHSYVIRLGLELDIKVCSALIDMYSKCGLLK 
CAMSLFAGIPEKNIVSFNSLILGLGLHGFASTAFEKFTEILEMGLIPDEITFSALLCTCC 
HSGLLNKGQEIFERMKSEFGIEPQTEHYVYMVKLMGMAGKLEEAFEFVMSLQKPIDSGIL 
GALLSCCEVHENTHLAEVVAENIHKNGEERRSVYKVMLSNVYARYGRWDEVERLRDGISE 
SYGGKLPGISWF*
>AT4G05040.1 |  ankyrin repeat family protein 
MDSSEAHLDRIEAQRSTDVSHDQQKKRYFPMNLINKVASKLCSRGGDGATPPMGDNESGL 
EFLNNLKLSDLFHLPGENVQMNTEVFSGLSDGDKECLEMLKGVGTPMACLKSDRGDSVLH 
LAARWGHLELVKNIISECPCLVLELNFKDQLPLHVAAHAGHSAIVEALVASVTFFSDRLA 
EEDRERLNPYVLRDKYGNTALHLAIEGRYMEMAASLVNENQNASFLENNEGISSLYMAVE 
AGDVTLVKEILKTAGNNDLEGRNSNLDSKLEGRKHLVHVALNARSIGVLDVILNEYPSLE 
DERDEEGRTCLSFAASIGFYKGVCNLLDRSTKNVYVCDEDGSFPIHTAAENGHIRIVKEI 
LKRCPHSKHMLNKLGQNVLHIAAKIGEHNLVKSLMRSDDTKHLGVGQDVDGNTPLHLAVL 
NWRYRSIRTLASDVKILQLRNDNGLTARGIAESVLKPNYIFHERLTLAFLLDAHAFRGCG 
SVKSLTKPSEPLDHEKSRDYVNTLLLVAALVATMTFAAGFTIPGGFNSSAPHLGRATLTT 
DPNLFFFLLFDILAMQTSVASICTLIYMGAVG*
>AT4G05040.1 |  ankyrin repeat family protein 
MDSSEAHLDRIEAQRSTDVSHDQQKKRYFPMNLINKVASKLCSRGGDGATPPMGDNESGL 
EFLNNLKLSDLFHLPGENVQMNTEVFSGLSDGDKECLEMLKGVGTPMACLKSDRGDSVLH 
LAARWGHLELVKNIISECPCLVLELNFKDQLPLHVAAHAGHSAIVEALVASVTFFSDRLA 
EEDRERLNPYVLRDKYGNTALHLAIEGRYMEMAASLVNENQNASFLENNEGISSLYMAVE 
AGDVTLVKEILKTAGNNDLEGRNSNLDSKLEGRKHLVHVALNARSIGVLDVILNEYPSLE 
DERDEEGRTCLSFAASIGFYKGVCNLLDRSTKNVYVCDEDGSFPIHTAAENGHIRIVKEI 
LKRCPHSKHMLNKLGQNVLHIAAKIGEHNLVKSLMRSDDTKHLGVGQDVDGNTPLHLAVL 
NWRYRSIRTLASDVKILQLRNDNGLTARGIAESVLKPNYIFHERLTLAFLLDAHAFRGCG 
SVKSLTKPSEPLDHEKSRDYVNTLLLVAALVATMTFAAGFTIPGGFNSSAPHLGRATLTT 
DPNLFFFLLFDILAMQTSVASICTLIYMGAVG*
>AT4G05040.1 |  ankyrin repeat family protein 
MDSSEAHLDRIEAQRSTDVSHDQQKKRYFPMNLINKVASKLCSRGGDGATPPMGDNESGL 
EFLNNLKLSDLFHLPGENVQMNTEVFSGLSDGDKECLEMLKGVGTPMACLKSDRGDSVLH 
LAARWGHLELVKNIISECPCLVLELNFKDQLPLHVAAHAGHSAIVEALVASVTFFSDRLA 
EEDRERLNPYVLRDKYGNTALHLAIEGRYMEMAASLVNENQNASFLENNEGISSLYMAVE 
AGDVTLVKEILKTAGNNDLEGRNSNLDSKLEGRKHLVHVALNARSIGVLDVILNEYPSLE 
DERDEEGRTCLSFAASIGFYKGVCNLLDRSTKNVYVCDEDGSFPIHTAAENGHIRIVKEI 
LKRCPHSKHMLNKLGQNVLHIAAKIGEHNLVKSLMRSDDTKHLGVGQDVDGNTPLHLAVL 
NWRYRSIRTLASDVKILQLRNDNGLTARGIAESVLKPNYIFHERLTLAFLLDAHAFRGCG 
SVKSLTKPSEPLDHEKSRDYVNTLLLVAALVATMTFAAGFTIPGGFNSSAPHLGRATLTT 
DPNLFFFLLFDILAMQTSVASICTLIYMGAVG*
>AT4G05040.2 |  ankyrin repeat family protein 
MDSSEAHLDRIEAQRSTDVSHDQQKKRYFPMNLINKVASKLCSRGGDGATPPMGDNESGL 
EFLNNLKLSDLFHLPGENVQMNTEVFSGLSDGDKECLEMLKGVGTPMACLKSDRGDSVLH 
LAARWGHLELVKNIISECPCLVLELNFKDQLPLHVAAHAGHSAIVEALVASVTFFSDRLA 
EEDRERLNPYVLRDKYGNTALHLAIEGRYMEMAASLVNENQNASFLENNEGISSLYMAVE 
AGDVTLVKEILKTAGNNDLEGRNSNLDSKLEGRKHLVHVALNARSIGVLDVILNEYPSLE 
DERDEEGRTCLSFAASIGFYKGVCNLLDRSTKNVYVCDEDGSFPIHTAAENGHIRIVKEI 
LKRCPHSKHMLNKLGQNVLHIAAKIGEHNLVKSLMRSDDTKHLGVGQDVDGNTPLHLAVL 
NWRYRSIRTLASDVKILQLRNDNGLTARGIAESVLKPNYIFHERLTLAFLLDAHAFRGCG 
SVKSLTKPSEPLDHEKSRDYVNTLLLVAALVATMTFAAGFTIPGGFNSSAPHLGRATLTT 
DPNLFFFLLFDILAMQTSVASICTLIYMGAVG*
>AT4G05040.2 |  ankyrin repeat family protein 
MDSSEAHLDRIEAQRSTDVSHDQQKKRYFPMNLINKVASKLCSRGGDGATPPMGDNESGL 
EFLNNLKLSDLFHLPGENVQMNTEVFSGLSDGDKECLEMLKGVGTPMACLKSDRGDSVLH 
LAARWGHLELVKNIISECPCLVLELNFKDQLPLHVAAHAGHSAIVEALVASVTFFSDRLA 
EEDRERLNPYVLRDKYGNTALHLAIEGRYMEMAASLVNENQNASFLENNEGISSLYMAVE 
AGDVTLVKEILKTAGNNDLEGRNSNLDSKLEGRKHLVHVALNARSIGVLDVILNEYPSLE 
DERDEEGRTCLSFAASIGFYKGVCNLLDRSTKNVYVCDEDGSFPIHTAAENGHIRIVKEI 
LKRCPHSKHMLNKLGQNVLHIAAKIGEHNLVKSLMRSDDTKHLGVGQDVDGNTPLHLAVL 
NWRYRSIRTLASDVKILQLRNDNGLTARGIAESVLKPNYIFHERLTLAFLLDAHAFRGCG 
SVKSLTKPSEPLDHEKSRDYVNTLLLVAALVATMTFAAGFTIPGGFNSSAPHLGRATLTT 
DPNLFFFLLFDILAMQTSVASICTLIYMGAVG*
>AT4G05040.2 |  ankyrin repeat family protein 
MDSSEAHLDRIEAQRSTDVSHDQQKKRYFPMNLINKVASKLCSRGGDGATPPMGDNESGL 
EFLNNLKLSDLFHLPGENVQMNTEVFSGLSDGDKECLEMLKGVGTPMACLKSDRGDSVLH 
LAARWGHLELVKNIISECPCLVLELNFKDQLPLHVAAHAGHSAIVEALVASVTFFSDRLA 
EEDRERLNPYVLRDKYGNTALHLAIEGRYMEMAASLVNENQNASFLENNEGISSLYMAVE 
AGDVTLVKEILKTAGNNDLEGRNSNLDSKLEGRKHLVHVALNARSIGVLDVILNEYPSLE 
DERDEEGRTCLSFAASIGFYKGVCNLLDRSTKNVYVCDEDGSFPIHTAAENGHIRIVKEI 
LKRCPHSKHMLNKLGQNVLHIAAKIGEHNLVKSLMRSDDTKHLGVGQDVDGNTPLHLAVL 
NWRYRSIRTLASDVKILQLRNDNGLTARGIAESVLKPNYIFHERLTLAFLLDAHAFRGCG 
SVKSLTKPSEPLDHEKSRDYVNTLLLVAALVATMTFAAGFTIPGGFNSSAPHLGRATLTT 
DPNLFFFLLFDILAMQTSVASICTLIYMGAVG*
>AT4G05040.3 |  ankyrin repeat family protein 
MDSSEAHLDRIEAQRSTDVSHDQQKKRYFPMNLINKVASKLCSRGGDGATPPMGDNESGL 
EFLNNLKLSDLFHLPGENVQMNTEVFSGLSDGDKECLEMLKGVGTPMACLKSDRGDSVLH 
LAARWGHLELVKNIISECPCLVLELNFKDQLPLHVAAHAGHSAIVEALVASVTFFSDRLA 
EEDRERLNPYVLRDKYGNTALHLAIEGRYMEMAASLVNENQNASFLENNEGISSLYMAVE 
AGDVTLVKEILKTAGNNDLEGRNSNLDSKLEGRKHLVHVALNARSIGVLDVILNEYPSLE 
DERDEEGRTCLSFAASIGFYKGVCNLLDRSTKNVYVCDEDGSFPIHTAAENGHIRIVKEI 
LKRCPHSKHMLNKLGQNVLHIAAKIGEHNLVKSLMRSDDTKHLGVGQDVDGNTPLHLAVL 
NWRYRSIRTLASDVKILQLRNDNGLTARGIAESVLKPNYIFHERLTLAFLLDAHAFRGCG 
SVKSLTKPSEPLDHEKSRDYVNTLLLVAALVATMTFAAGFTIPGGFNSSAPHLGRATLTT 
DPNLFFFLLFDILAMQTSVASICTLIYMGAVG*
>AT4G05040.3 |  ankyrin repeat family protein 
MDSSEAHLDRIEAQRSTDVSHDQQKKRYFPMNLINKVASKLCSRGGDGATPPMGDNESGL 
EFLNNLKLSDLFHLPGENVQMNTEVFSGLSDGDKECLEMLKGVGTPMACLKSDRGDSVLH 
LAARWGHLELVKNIISECPCLVLELNFKDQLPLHVAAHAGHSAIVEALVASVTFFSDRLA 
EEDRERLNPYVLRDKYGNTALHLAIEGRYMEMAASLVNENQNASFLENNEGISSLYMAVE 
AGDVTLVKEILKTAGNNDLEGRNSNLDSKLEGRKHLVHVALNARSIGVLDVILNEYPSLE 
DERDEEGRTCLSFAASIGFYKGVCNLLDRSTKNVYVCDEDGSFPIHTAAENGHIRIVKEI 
LKRCPHSKHMLNKLGQNVLHIAAKIGEHNLVKSLMRSDDTKHLGVGQDVDGNTPLHLAVL 
NWRYRSIRTLASDVKILQLRNDNGLTARGIAESVLKPNYIFHERLTLAFLLDAHAFRGCG 
SVKSLTKPSEPLDHEKSRDYVNTLLLVAALVATMTFAAGFTIPGGFNSSAPHLGRATLTT 
DPNLFFFLLFDILAMQTSVASICTLIYMGAVG*
>AT4G05040.3 |  ankyrin repeat family protein 
MDSSEAHLDRIEAQRSTDVSHDQQKKRYFPMNLINKVASKLCSRGGDGATPPMGDNESGL 
EFLNNLKLSDLFHLPGENVQMNTEVFSGLSDGDKECLEMLKGVGTPMACLKSDRGDSVLH 
LAARWGHLELVKNIISECPCLVLELNFKDQLPLHVAAHAGHSAIVEALVASVTFFSDRLA 
EEDRERLNPYVLRDKYGNTALHLAIEGRYMEMAASLVNENQNASFLENNEGISSLYMAVE 
AGDVTLVKEILKTAGNNDLEGRNSNLDSKLEGRKHLVHVALNARSIGVLDVILNEYPSLE 
DERDEEGRTCLSFAASIGFYKGVCNLLDRSTKNVYVCDEDGSFPIHTAAENGHIRIVKEI 
LKRCPHSKHMLNKLGQNVLHIAAKIGEHNLVKSLMRSDDTKHLGVGQDVDGNTPLHLAVL 
NWRYRSIRTLASDVKILQLRNDNGLTARGIAESVLKPNYIFHERLTLAFLLDAHAFRGCG 
SVKSLTKPSEPLDHEKSRDYVNTLLLVAALVATMTFAAGFTIPGGFNSSAPHLGRATLTT 
DPNLFFFLLFDILAMQTSVASICTLIYMGAVG*
>AT1G12270.1 |  stress-inducible protein putative 
MAEEAKAKGNAAFSSGDFTTAINHFTEAIALAPTNHVLFSNRSAAHASLHQYAEALSDAK 
ETIKLKPYWPKGYSRLGAAHLGLNQFELAVTAYKKGLDVDPTNEALKSGLADAEASVARS 
RAAPNPFGDAFQGPEMWTKLTSDPSTRGFLQQPDFVNMMQEIQKNPSSLNLYLKDQRVMQ 
SLGVLLNVKFRPPPPQGDEAEVPESDMGQSSSNEPEVEKKREPEPEPEPEVTEEKEKKER 
KEKAKKEKELGNAAYKKKDFETAIQHYSTAIEIDDEDISYLTNRAAVYLEMGKYNECIED 
CNKAVERGRELRSDYKMVARALTRKGTALTKMAKCSKDYEPAIEAFQKALTEHRNPDTLK 
RLNDAERAKKEWEQKQYFDPKLGDEEREKGNDFFKEQKYPEAIKHYTEAIKRNPNDHKAY 
SNRAASYTKLGAMPEGLKDAEKCIELDPTFSKGYSRKAAVQFFLKEYDNAMETYQAGLEH 
DPSNQELLDGVKRCVQQINKANRGDLTPEELKERQAKGMQDPEIQNILTDPVMRQVLSDL 
QENPSAAQKHMQNPMVMNKIQKLISAGIVQMK*
>AT1G18610.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Galactose oxidase/kelch beta-propeller (InterProIPR011043) Kelch repeat type 1 (InterProIPR006652) Kelch repeat type 2 (InterProIPR011498) Kelch-type beta propeller (InterProIPR015915) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G741501) Has 8761 Blast hits to 4412 proteins in 271 species Archae - 12 Bacteria - 304 Metazoa - 4209 Fungi - 851 Plants - 1246 Viruses - 11 Other Eukaryotes - 2128 (source NCBI BLink) 
MRWERVRQLQQQVGLGESSSGPGKRWGHTCNAIKGGSFLYVFGGYGRDNCQTNQVHVFDA 
AKQIWTQPMINGTPPPPRDSHSCTTVGDNLFVFGGTDGVNPLKDLYILDTSSHTWKCPSV 
RGEGPEAREGHSATLVGKRLFVFGGCGKSSGINEEIYYNDVYIFNTETFVWKRAVTIGNP 
PSARDSHSCSSWKNKLVVIGGEDGHDYYLSDVHILDTDTLIWKELNTSGQLLTPRAGHVT 
VSLGRNFFVFGGFTDAQNLYDDLYVLDVDTCIWSKVLTMGEGPSARFSSAGACLDPHKAG 
FLVIVGGCNKNLEALDDMFYLQTGLGYDARFDQNVGMLSLKKQLKIKCQEQSHASSLYDK 
SLVRINMDHQGRGNFGLNTCQFNEGKMMFQARITESYPVGYTMETMIDGKVLRGVLFSNK 
RSSILPADQSFSRPAMSNGDQDNRSKISRTLIKDQANAVESKDSQLNGMEAGIDTISNPL 
GVNITTVAVAPHETETSVVTSDAKNQDASQLDMGTVNTVNTAPSSVPQVDEASLESRNAI 
TIDDRANKTGLGES*