>AT4G14960.1 |  TUA6 structural constituent of cytoskeleton 
MRECISIHIGQAGIQVGNACWELYCLEHGIQPDGQMPGDKTVGGGDDAFNTFFSETGAGK 
HVPRAVFVDLEPTVIDEVRTGTYRQLFHPEQLISGKEDAANNFARGHYTIGKEIVDLCLD 
RIRKLADNCTGLQGFLVFNAVGGGTGSGLGSLLLERLSVDYGKKSKLGFTVYPSPQVSTS 
VVEPYNSVLSTHSLLEHTDVSILLDNEAIYDICRRSLNIERPTYTNLNRLVSQVISSLTA 
SLRFDGALNVDVTEFQTNLVPYPRIHFMLSSYAPVISAEKAFHEQLSVAEITNSAFEPAS 
MMAKCDPRHGKYMACCLMYRGDVVPKDVNAAVGTIKTKRTIQFVDWCPTGFKCGINYQPP 
TVVPGGDLAKVQRAVCMISNSTSVAEKENSQRLVRILQHWRRIMKRSVLKVVTMRMMKER 
NTKKNVS*
>AT4G14960.1 |  TUA6 structural constituent of cytoskeleton 
MRECISIHIGQAGIQVGNACWELYCLEHGIQPDGQMPGDKTVGGGDDAFNTFFSETGAGK 
HVPRAVFVDLEPTVIDEVRTGTYRQLFHPEQLISGKEDAANNFARGHYTIGKEIVDLCLD 
RIRKLADNCTGLQGFLVFNAVGGGTGSGLGSLLLERLSVDYGKKSKLGFTVYPSPQVSTS 
VVEPYNSVLSTHSLLEHTDVSILLDNEAIYDICRRSLNIERPTYTNLNRLVSQVISSLTA 
SLRFDGALNVDVTEFQTNLVPYPRIHFMLSSYAPVISAEKAFHEQLSVAEITNSAFEPAS 
MMAKCDPRHGKYMACCLMYRGDVVPKDVNAAVGTIKTKRTIQFVDWCPTGFKCGINYQPP 
TVVPGGDLAKVQRAVCMISNSTSVAEKENSQRLVRILQHWRRIMKRSVLKVVTMRMMKER 
NTKKNVS*
>AT4G14960.2 |  TUA6 structural constituent of cytoskeleton 
MRECISIHIGQAGIQVGNACWELYCLEHGIQPDGQMPGDKTVGGGDDAFNTFFSETGAGK 
HVPRAVFVDLEPTVIDEVRTGTYRQLFHPEQLISGKEDAANNFARGHYTIGKEIVDLCLD 
RIRKLADNCTGLQGFLVFNAVGGGTGSGLGSLLLERLSVDYGKKSKLGFTVYPSPQVSTS 
VVEPYNSVLSTHSLLEHTDVSILLDNEAIYDICRRSLNIERPTYTNLNRLVSQVISSLTA 
SLRFDGALNVDVTEFQTNLVPYPRIHFMLSSYAPVISAEKAFHEQLSVAEITNSAFEPAS 
MMAKCDPRHGKYMACCLMYRGDVVPKDVNAAVGTIKTKRTIQFVDWCPTGFKCGINYQPP 
TVVPGGDLAKVQRAVCMISNSTSVAEVFSRIDHKFDLMYAKRAFVHWYVGEGMEEGEFSE 
AREDLAALEKDYEEVGAEGGDDEDDEGEEY*
>AT4G14960.2 |  TUA6 structural constituent of cytoskeleton 
MRECISIHIGQAGIQVGNACWELYCLEHGIQPDGQMPGDKTVGGGDDAFNTFFSETGAGK 
HVPRAVFVDLEPTVIDEVRTGTYRQLFHPEQLISGKEDAANNFARGHYTIGKEIVDLCLD 
RIRKLADNCTGLQGFLVFNAVGGGTGSGLGSLLLERLSVDYGKKSKLGFTVYPSPQVSTS 
VVEPYNSVLSTHSLLEHTDVSILLDNEAIYDICRRSLNIERPTYTNLNRLVSQVISSLTA 
SLRFDGALNVDVTEFQTNLVPYPRIHFMLSSYAPVISAEKAFHEQLSVAEITNSAFEPAS 
MMAKCDPRHGKYMACCLMYRGDVVPKDVNAAVGTIKTKRTIQFVDWCPTGFKCGINYQPP 
TVVPGGDLAKVQRAVCMISNSTSVAEVFSRIDHKFDLMYAKRAFVHWYVGEGMEEGEFSE 
AREDLAALEKDYEEVGAEGGDDEDDEGEEY*
>AT3G58180.1 |  PBS lyase HEAT-like repeat-containing protein 
MESNGSVSSMVNLEKFLCERLVDQSQPISERFRALFSLRNLKGPGPRNALILASRDSSNL 
LAHEAAFALGQMQDAEAIPALESVLNDMSLHPIVRHEAAEALGAIGLAGNVNILKKSLSS 
DPAQEVRETCELALKRIEDMSNVDAENQSSTTEKSPFMSVDPAGPAASFSSVHQLRQVLL 
DETKGMYERYAALFALRNHGGEEAVSAIVDSLSASSALLRHEVAYVLGQLQSKTALATLS 
KVLRDVNEHPMVRHEAAEALGSIADEQSIALLEEFSKDPEPIVAQSCEVALSMLEFENSG 
KSFEFFFTQDPLVH*
>AT1G20260.1 |  hydrogen ion transporting ATP synthase rotational mechanism / hydrolase acting on acid anhydrides catalyzing transmembrane movement of substances / proton-transporting ATPase rotational mechanism 
MVETSIDMEEGTLEIGMEYRTVSGVAGPLVILDKVKGPKYQEIVNIRLGDGSTRRGQVLE 
VDGEKAVVQVFEGTSGIDNKFTTVQFTGEVLKTPVSLDMLGRIFNGSGKPIDNGPPILPE 
AYLDISGSSINPSERTYPEEMIQTGISTIDVMNSIARGQKIPLFSAAGLPHNEIAAQICR 
QAGLVKRLEKTENLIQEDHGEDNFAIVFAAMGVNMETAQFFKRDFEENGSMERVTLFLNL 
ANDPTIERIITPRIALTTAEYLAYECGKHVLVILTDMSSYADALREVSAAREEVPGRRGY 
PGYMYTDLATIYERAGRIEGRKGSITQIPILTMPNDDITHPTPDLTGYITEGQIYIDRQL 
HNRQIYPPINVLPSLSRLMKSAIGEGMTRKDHSDVSNQLYANYAIGKDVQAMKAVVGEEA 
LSSEDLLYLEFLDKFERKFVMQGAYDTRNIFQSLDLAWTLLRIFPRELLHRIPAKTLDQF 
YSRDSTS*
>AT5G12140.1 |  ATCYS1 (A thaliana cystatin-1) cysteine-type endopeptidase inhibitor 
MADQQAGTIVGGVRDIDANANDLQVESLARFAVDEHNKNENLTLEYKRLLGAKTQVVAGT 
MHHLTVEVADGETNKVYEAKVLEKAWENLKQLESFNHLHDV*
>AT4G33650.1 |  DRP3A (DYNAMIN-RELATED PROTEIN 3A) GTP binding / GTPase/ phosphoinositide binding 
MTIEEVSGETPPSTPPSSSTPSPSSSTTNAAPLGSSVIPIVNKLQDIFAQLGSQSTIALP 
QVVVVGSQSSGKSSVLEALVGRDFLPRGNDICTRRPLVLQLLQTKSRANGGSDDEWGEFR 
HLPETRFYDFSEIRREIEAETNRLVGENKGVADTQIRLKISSPNVLNITLVDLPGITKVP 
VGDQPSDIEARIRTMILSYIKQDTCLILAVTPANTDLANSDALQIASIVDPDGHRTIGVI 
TKLDIMDKGTDARKLLLGNVVPLRLGYVGVVNRCQEDILLNRTVKEALLAEEKFFRSHPV 
YHGLADRLGVPQLAKKLNQILVQHIKVLLPDLKSRISNALVATAKEHQSYGELTESRAGQ 
GALLLNFLSKYCEAYSSLLEGKSEEMSTSELSGGARIHYIFQSIFVKSLEEVDPCEDLTD 
DDIRTAIQNATGPRSALFVPDVPFEVLVRRQISRLLDPSLQCARFIFEELIKISHRCMMN 
ELQRFPVLRKRMDEVIGDFLREGLEPSEAMIGDIIDMEMDYINTSHPNFIGGTKAVEAAM 
HQVKSSRIPHPVARPKDTVEPDRTSSSTSQVKSRSFLGRQANGIVTDQGVVSADAEKAQP 
AANASDTRWGIPSIFRGGDTRAVTKDSLLNKPFSEAVEDMSHNLSMIYLKEPPAVLRPTE 
THSEQEAVEIQITKLLLRSYYDIVRKNIEDSVPKAIMHFLVNHTKRELHNVFIKKLYREN 
LFEEMLQEPDEIAVKRKRTQETLHVLQQAYRTLDELPLEADSVSAGMSKHQELLTSSKYS 
TSSSYSASPSTTRRSRRAGDQHQNGYGF*
>AT5G20890.1 |  chaperonin putative 
MPIDKIFKDDASEEKGERARMASFVGAMAISDLVKSTLGPKGMDKILQSTGRGHAVTVTN 
DGATILKSLHIDNPAAKVLVDISKVQDDEVGDGTTSVVVLAGELLREAEKLVASKIHPMT 
IIAGYRMASECARNALLKRVIDNKDNAEKFRSDLLKIAMTTLCSKILSQDKEHFAEMAVD 
AVFRLKGSTNLEAIQIIKKPGGSLKDSFLDEGFILDKKIGIGQPKRIENANILVANTAMD 
TDKVKIYGARVRVDSMTKVAEIEGAEKEKMKDKVKKIIGHGINCFVNRQLIYNFPEELFA 
DAGILAIEHADFEGIERLGLVTGGEIASTFDNPESVKLGHCKLIEEIMIGEDKLIHFSGC 
EMGQACSIVLRGASHHVLDEAERSLHDALCVLSQTVNDTRVLLGGGWPEMVMAKEVDELA 
RKTAGKKSHAIEAFSRALVAIPTTIADNAGLDSAELVAQLRAEHHTEGCNAGIDVITGAV 
GDMEERGIYEAFKVKQAVLLSATEASEMILRVDEIITCAPRRREDRM*
>AT1G03930.1 |  ADK1 (dual specificity kinase 1) kinase/ protein serine/threonine/tyrosine kinase 
MDLVIGGKFKLGRKIGSGSFGELYLGINVQTGEEVAVKLESVKTKHPQLHYESKLYMLLQ 
GGTGVPNLKWYGVEGDYNVMVIDLLGPSLEDLFNYCNRKLSLKTVLMLADQLINRVEFMH 
TRGFLHRDIKPDNFLMGLGRKANQVYIIDFGLGKKYRDLQTHRHIPYRENKNLTGTARYA 
SVNTHLGVEQSRRDDLEALGYVLMYFLKGSLPWQGLKAGTKKQKYDRISEKKVATPIEVL 
CKNQPSEFVSYFRYCRSLRFDDKPDYSYLKRLFRDLFIREGYQFDYVFDWTVLKYPQIGS 
SSGSSSRTRNHTTANPGLTAGASLEKQERIAGKETRENRFSGAVEAFSRRHPATSTTRDR 
SASRNSVDGPLSKHPPGDSERPRSSSRYGSSSRRAIPSSSRPSSAGGPSDSRSSSRLVTS 
TGGVGTVSNRASTSQRIQAGNESRTSSFSRAARNTREDPLRRSLELLTLRK*
>AT2G19980.1 |  allergen V5/Tpx-1-related family protein 
MSFSGYSFIVLTLLSIVLTQIYGLRSFSRMDDLQPAETLAVHNQIRAADQKLAAHAQRYA 
NVRSQDCAMKYSTDGTYGENIAAGWVQPMDTMSGPIATKFWFTEKPYYNYATNKCSEPCG 
HYTQIVANQSTHLGCGTVRCFKNEYVWVVCNYAPRPMGDANTRPY*
>AT2G43460.1 |  60S ribosomal protein L38 (RPL38A) 
MPKQIHEIKDFLLTARRKDARSVKIKRSKDIVKFKVRCSRYLYTLCVFDQEKADKLKQSL 
PPGLSVQDL*
>AT5G19910.1 |  SOH1 family protein 
MASPEEMGDDASEIPSPPKNTYKDPDGGRQRFLLELEFIQCLANPTYIHYLAQNRYFEDE 
AFIGYLKYLQYWQRPEYIKFIMYPHCLYFLELLQNPNFRTAMAHPANKELAHRQQFYYWK 
NYRNNRLKHILPRPLPEPVPPQPPVAPSTSLPPAPSATAALSPALSPMQYNNMLSKNDTR 
NMGATGIDRRKRKKGI*
>AT1G61040.1 |  VIP5 (vernalization independence 5) DNA binding 
MGDLENLLLEAAGRTNSAGRSRHPPSSRRREGSYSDGSSDSRDDSDEDRGYASRKPSGSQ 
VPLKKRLEAEREDRAARVEGGYGDGPSDREGDSSEESDFGDDLYKNEEDRQKLAGMTEFQ 
REMILSERADKKGDKNFTEKLRSKRESEKTPVSKKETQPLPASRGVRSSARSADRAAAKD 
DALNELRAKRMKQQDPAALRKLRDASKGGSGSRDFSSTKRKPLASSNLSSSSQSDSDSRS 
QSDDEGSNGGMLDSDDDRSDVPTFEDVKEVTIRRSKLAKWLMEPFFEELIVGCFVRVGIG 
RSKSGPIYRLCWVKNVDATDPDKTYKLENKTTHKYLNVVWGNETSAARWQMAMISDGHPL 
EEEYRQWIREVERTNGRMPTKQDISEKKEAIQRTNSFVYSAETVKQMLQEKKSASVRPMN 
VAAEKDRLRKELEIAQSKNDEAGVERIKSKIKQLDASRNKKGVDKKALKLAEMNKKNRAE 
NFKNASEVKSITASLKAGEAGYDPFSRRWTRSSNYYNGKNKGKDGEENEAAVAAAVETNG 
ADAGAGVEATEAALEAAAEAGKLIDTRAPIGQGAEHNQLHNFELSLSLTALQKYGGPQGV 
QKAFMARKQLTEATVGCRVAENDGKRHGLTLTVSDYKRRRGLL*
>AT2G19170.1 |  SLP3 serine-type peptidase 
MDIGLRIFVVFVLLVAVTAEVYIVTMEGDPIISYKGGENGFEATAVESDEKIDTSSELVT 
VYARHLERKHDMILGMLFEEGSYKKLYSYKHLINGFAAHVSPEQAETLRRAPGVRSVDKD 
WKVRRLTTHTPEFLGLPTDVWPTGGGFDRAGEDIVIGFVDSGIYPHHPSFASHHRLPYGP 
LPHYKGKCEEDPHTKKSFCNRKIVGAQHFAEAAKAAGAFNPDIDYASPMDGDGHGSHTAA 
IAAGNNGIPLRMHGYEFGKASGMAPRARIAVYKALYRLFGGFVADVVAAIDQAVHDGVDI 
LSLSVGPNSPPTTTKTTFLNPFDATLLGAVKAGVFVAQAAGNGGPFPKTLVSYSPWITTV 
AAAIDDRRYKNHLTLGNGKMLAGMGLSPPTRPHRLYTLVSANDVLLDSSVSKYNPSDCQR 
PEVFNKKLVEGNILLCGYSFNFVVGTASIKKVVATAKHLGAAGFVLVVENVSPGTKFDPV 
PSAIPGILITDVSKSMDLIDYYNASTSRDWTGRVKSFKAEGSIGDGLAPVLHKSAPQVAL 
FSARGPNTKDFSFQDADLLKPDILAPGYLIWAAWCPNGTDEPNYVGEGFALISGTSMAAP 
HIAGIAALVKQKHPQWSPAAIKSALMTTSTVIDRAGRLLQAQQYSDTEAVTLVKATPFDY 
GSGHVNPSAALDPGLIFDAGYEDYLGFLCTTPGISAHEIRNYTNTACNYDMKHPSNFNAP 
SIAVSHLVGTQTVTRKVTNVAEVEETYTITARMQPSIAIEVNPPAMTLRPGATRTFSVTM 
TVRSVSGVYSFGEVKLKGSRGHKVRIPVVALGHRR*
>AT1G02730.1 |  ATCSLD5 14-beta-D-xylan synthase/ cellulose synthase 
MVKSAASQSPSPVTITVTPCKGSGDRSLGLTSPIPRASVITNQNSPLSSRATRRTSISSG 
NRRSNGDEGRYCSMSVEDLTAETTNSECVLSYTVHIPPTPDHQTVFASQESEEDEMLKGN 
SNQKSFLSGTIFTGGFKSVTRGHVIDCSMDRADPEKKSGQICWLKGCDEKVVHGRCECGF 
RICRDCYFDCITSGGGNCPGCKEPYRDINDDPETEEEDEEDEAKPLPQMGESKLDKRLSV 
VKSFKAQNQAGDFDHTRWLFETKGTYGYGNAVWPKDGYGIGSGGGGNGYETPPEFGERSK 
RPLTRKVSVSAAIISPYRLLIALRLVALGLFLTWRVRHPNREAMWLWGMSTTCELWFALS 
WLLDQLPKLCPVNRLTDLGVLKERFESPNLRNPKGRSDLPGIDVFVSTADPEKEPPLVTA 
NTILSILAVDYPVEKLACYLSDDGGALLTFEALAQTASFASTWVPFCRKHNIEPRNPEAY 
FGQKRNFLKNKVRLDFVRERRRVKREYDEFKVRINSLPEAIRRRSDAYNVHEELRAKKKQ 
MEMMMGNNPQETVIVPKATWMSDGSHWPGTWSSGETDNSRGDHAGIIQAMLAPPNAEPVY 
GAEADAENLIDTTDVDIRLPMLVYVSREKRPGYDHNKKAGAMNALVRTSAIMSNGPFILN 
LDCDHYIYNSMALREGMCFMLDRGGDRICYVQFPQRFEGIDPNDRYANHNTVFFDVSMRA 
LDGLQGPMYVGTGCIFRRTALYGFSPPRATEHHGWLGRRKVKISLRRPKAMMKKDDEVSL 
PINGEYNEEENDDGDIESLLLPKRFGNSNSFVASIPVAEYQGRLIQDLQGKGKNSRPAGS 
LAVPREPLDAATVAEAISVISCFYEDKTEWGKRVGWIYGSVTEDVVTGYRMHNRGWRSIY 
CVTKRDAFRGTAPINLTDRLHQVLRWATGSVEIFFSRNNAIFATRRMKFLQRVAYFNVGM 
YPFTSLFLIVYCILPAISLFSGQFIVQSLDITFLIYLLSITLTLCMLSLLEIKWSGITLH 
EWWRNEQFWVIGGTSAHPAAVLQGLLKVIAGVDISFTLTSKSSAPEDGDDEFADLYVVKW 
SFLMVPPLTIMMVNMIAIAVGLARTLYSPFPQWSKLVGGVFFSFWVLCHLYPFAKGLMGR 
RGRVPTIVFVWSGLLSIIVSLLWVYINPPSGKQDYMQFQFP*
>AT1G15420.1 |  unknown protein 
MAKDKLKPLLSSDAAGDIADTPLREKKHKKKSKKRAEPEPDIPSTRDSGLDEDRDGVLVD 
DTLNEPTIGDKLESLDLLNGEKVNSEESNRDSAPGDDKPPTAASVNVLLRQALHADDRSL 
LLDCLYNRDEQVIANSVAKLNSAEVLKLLNALLPILQSRGAILACTIPWIKSLLLTHSSG 
IMSQESSLLALNTMYQLIESRVSTIHTAVEVSSGLDLIVDDLDEEEDEGPVIYEDKDSDE 
DEEEGIEEAMETDEEADDSADEAADGVNDFEGFDDMSD*
>AT1G31170.1 |  SRX (SULFIREDOXIN) DNA binding / oxidoreductase acting on sulfur group of donors 
MANLMMRLPISLRSFSVSASSSNGSPPVIGGSSGGVGPMIVELPLEKIRRPLMRTRSNDQ 
NKVKELMDSIRQIGLQVPIDVIEVDGTYYGFSGCHRYEAHQKLGLPTIRCKIRKGTKETL 
RHHLR*
>AT1G31170.1 |  SRX (SULFIREDOXIN) DNA binding / oxidoreductase acting on sulfur group of donors 
MANLMMRLPISLRSFSVSASSSNGSPPVIGGSSGGVGPMIVELPLEKIRRPLMRTRSNDQ 
NKVKELMDSIRQIGLQVPIDVIEVDGTYYGFSGCHRYEAHQKLGLPTIRCKIRKGTKETL 
RHHLR*
>AT1G31170.1 |  SRX (SULFIREDOXIN) DNA binding / oxidoreductase acting on sulfur group of donors 
MANLMMRLPISLRSFSVSASSSNGSPPVIGGSSGGVGPMIVELPLEKIRRPLMRTRSNDQ 
NKVKELMDSIRQIGLQVPIDVIEVDGTYYGFSGCHRYEAHQKLGLPTIRCKIRKGTKETL 
RHHLR*
>AT1G31170.2 |  SRX (SULFIREDOXIN) DNA binding / oxidoreductase acting on sulfur group of donors 
MANLMMRLPISLRSFSVSASSSNGSPPVIGGSSGGVGPMIVELPLEKIRRPLMRTRSNDQ 
NKVKELMDSIRQIGLQVPIDVIEVDGTYYGFSGCHRYEAHQKLGLPTIRCKIRKGTKETL 
RHHLR*
>AT1G31170.2 |  SRX (SULFIREDOXIN) DNA binding / oxidoreductase acting on sulfur group of donors 
MANLMMRLPISLRSFSVSASSSNGSPPVIGGSSGGVGPMIVELPLEKIRRPLMRTRSNDQ 
NKVKELMDSIRQIGLQVPIDVIEVDGTYYGFSGCHRYEAHQKLGLPTIRCKIRKGTKETL 
RHHLR*
>AT1G31170.2 |  SRX (SULFIREDOXIN) DNA binding / oxidoreductase acting on sulfur group of donors 
MANLMMRLPISLRSFSVSASSSNGSPPVIGGSSGGVGPMIVELPLEKIRRPLMRTRSNDQ 
NKVKELMDSIRQIGLQVPIDVIEVDGTYYGFSGCHRYEAHQKLGLPTIRCKIRKGTKETL 
RHHLR*
>AT1G31170.3 |  SRX (SULFIREDOXIN) DNA binding / oxidoreductase acting on sulfur group of donors 
MANLMMRLPISLRSFSVSASSSNGSPPVIGGSSGGVGPMIVELPLEKIRRPLMRTRSNDQ 
NKVKELMDSIRQIGLQIDVIEVDGTYYGFSGCHRYEAHQKLGLPTIRCKIRKGTKETLRH 
HLR*
>AT1G31170.3 |  SRX (SULFIREDOXIN) DNA binding / oxidoreductase acting on sulfur group of donors 
MANLMMRLPISLRSFSVSASSSNGSPPVIGGSSGGVGPMIVELPLEKIRRPLMRTRSNDQ 
NKVKELMDSIRQIGLQIDVIEVDGTYYGFSGCHRYEAHQKLGLPTIRCKIRKGTKETLRH 
HLR*
>AT1G31170.3 |  SRX (SULFIREDOXIN) DNA binding / oxidoreductase acting on sulfur group of donors 
MANLMMRLPISLRSFSVSASSSNGSPPVIGGSSGGVGPMIVELPLEKIRRPLMRTRSNDQ 
NKVKELMDSIRQIGLQIDVIEVDGTYYGFSGCHRYEAHQKLGLPTIRCKIRKGTKETLRH 
HLR*
>AT1G43080.1 |  glycoside hydrolase family 28 protein / polygalacturonase (pectinase) family protein 
MVSIASTFKALCLSFLFIAVASRPTIRPKVFNVRRYGSRPDGKTDNANAFTSVWKRACTR 
ISGSSKIYVPKGTFYLGGVEFVGPCKNPIEFVIDGTLLAPANPRDIKQDTWINFRYINNL 
SISGSGTLDGQGKYSWPLNDCHKNTNCPKLAMTMGFAFVNNSRIKDITSLNSKMGHFNFF 
SVHRFNITGVTITAPGDSPNTDGIKMGSCSNIHISNTNIGTGDDCIAILSGTTNLDISNI 
KCGPGHGISVGSLGKNKDEKDVKHLTVRDTVFNGTSDGIRIKTWESSASKIVVSNFIYEN 
IQMIDVGKPINIDQKYCPHPPCEHEKKGESHVQIQDIKLKNIYGTSNNIVAVNLQCSKSF 
PCKNVELIDINLKHTGLEKGHSTAMCENVDGSVRSKMVPQHCLD*
>AT1G79990.2 |  LOCATED IN endomembrane system COPI vesicle coat Golgi membrane EXPRESSED IN 25 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s SYS1 homologue (InterProIPR016973) Has 55556 Blast hits to 24059 proteins in 620 species Archae - 38 Bacteria - 5697 Metazoa - 25539 Fungi - 10898 Plants - 5309 Viruses - 0 Other Eukaryotes - 8075 (source NCBI BLink) 
MFYGTAVWDPWLIVGQIICLQCSYYLTLGLFTMVFLGLRVPRLSLVYFFDYATLTTSTFT 
GWSVIASFLFSSLAGAVYMIFLVERARKCLDFSATLYIIHLFFCIMYGGWPSSMAWWVVN 
GTGLAVMALLAEYLCIKREQREIPMDRFHSRV*
>AT1G79990.2 |  protein binding / structural molecule 
MFYGTAVWDPWLIVGQIICLQCSYYLTLGLFTMVFLGLRVPRLSLVYFFDYATLTTSTFT 
GWSVIASFLFSSLAGAVYMIFLVERARKCLDFSATLYIIHLFFCIMYGGWPSSMAWWVVN 
GTGLAVMALLAEYLCIKREQREIPMDRFHSRV*
>AT1G79990.2 |  protein binding / structural molecule 
MFYGTAVWDPWLIVGQIICLQCSYYLTLGLFTMVFLGLRVPRLSLVYFFDYATLTTSTFT 
GWSVIASFLFSSLAGAVYMIFLVERARKCLDFSATLYIIHLFFCIMYGGWPSSMAWWVVN 
GTGLAVMALLAEYLCIKREQREIPMDRFHSRV*
>AT1G79990.2 |  LOCATED IN endomembrane system COPI vesicle coat Golgi membrane EXPRESSED IN 25 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s SYS1 homologue (InterProIPR016973) 
MFYGTAVWDPWLIVGQIICLQCSYYLTLGLFTMVFLGLRVPRLSLVYFFDYATLTTSTFT 
GWSVIASFLFSSLAGAVYMIFLVERARKCLDFSATLYIIHLFFCIMYGGWPSSMAWWVVN 
GTGLAVMALLAEYLCIKREQREIPMDRFHSRV*
>AT1G79990.2 |  protein binding / structural molecule 
MFYGTAVWDPWLIVGQIICLQCSYYLTLGLFTMVFLGLRVPRLSLVYFFDYATLTTSTFT 
GWSVIASFLFSSLAGAVYMIFLVERARKCLDFSATLYIIHLFFCIMYGGWPSSMAWWVVN 
GTGLAVMALLAEYLCIKREQREIPMDRFHSRV*
>AT1G79990.3 |  LOCATED IN endomembrane system COPI vesicle coat Golgi membrane EXPRESSED IN 25 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s SYS1 homologue (InterProIPR016973) Has 55556 Blast hits to 24059 proteins in 620 species Archae - 38 Bacteria - 5697 Metazoa - 25539 Fungi - 10898 Plants - 5309 Viruses - 0 Other Eukaryotes - 8075 (source NCBI BLink) 
MPLRLEIKRKFAQRSERVKSVDLHPTEPWILASLYSGTLCIWNYQTQTMVKSFDVTELPV 
RSAKFIARKQWVVAGADDMFIRVYNYNTMDKIKVFEAHADYIRCVAVHPTLPYVLSSSDD 
MLIKLWDWEKGWLCTQIFEGHSHYVMQVTFNPKDTNTFASASLDRTIKIWNLGSPDPNFT 
LDAHLKGVNCVDYFTGGDKPYLITGSDDHTAKVWDYQTKSCVQTLEGHTHNVSAVSFHPE 
LPIIITGSEDGTVRIWHATTYRLENTLNYGLERVWAIGHIKGSRRVVIGYDEGSIMVKLG 
REIPVASMDNSGKIIWAKHNEIHTVNIKSVGADEVTDGERLPLAVKELGTCDLYPQSLKH 
NPNGRFVVVCGDGEYIIYTALAWRNRSFGSALEFVWSSDGEHAVRESSTKIKIFSKNFQE 
KKTVRPTFSAEHIFGGTLLTMCSSDFICFYDWAECRLIRRIDVTVKNLYWADSGDLVAIA 
SDTSFYILKFNRDIVSSYFDGGKQIDEEGIEDAFELLNETNERVRTGLWVGDCFIYTNSS 
WRLNYCVGGEVTTMYHLDRPMYLLGYLANQSRVYLIDKEFNVIGYTLLLSLIEYKTLVMR 
GDLEQANEVLPSIPKEHHNSVAHFLESRGMTEDALEVATDPDYRFELAIQLGRLAVAKDI 
AVEAQNESKWKQLGELAMSSGKLDMAEECMRHAMDLSGLLLLYSSLGDADGMMKLAALAK 
EQGKNNVAFLCLFMLGQVEDCLHLLVESNRIPEAALMARSYLPSKVSEIVALWRNDLTKI 
SPKAAESLADPEEYPNLFEEWQVALSLENRAAETRGVHPPAGDYCSHADRDHTTLVDAFR 
IMQIEEEGRLEQGDVLDEVGEEGEDGEEEEEEDRQEESSDGRQQNVEEEAVVVDADSTDG 
AVLVNGNESEEQWVLTPPQE*
>AT1G79990.3 |  protein binding / structural molecule 
MPLRLEIKRKFAQRSERVKSVDLHPTEPWILASLYSGTLCIWNYQTQTMVKSFDVTELPV 
RSAKFIARKQWVVAGADDMFIRVYNYNTMDKIKVFEAHADYIRCVAVHPTLPYVLSSSDD 
MLIKLWDWEKGWLCTQIFEGHSHYVMQVTFNPKDTNTFASASLDRTIKIWNLGSPDPNFT 
LDAHLKGVNCVDYFTGGDKPYLITGSDDHTAKVWDYQTKSCVQTLEGHTHNVSAVSFHPE 
LPIIITGSEDGTVRIWHATTYRLENTLNYGLERVWAIGHIKGSRRVVIGYDEGSIMVKLG 
REIPVASMDNSGKIIWAKHNEIHTVNIKSVGADEVTDGERLPLAVKELGTCDLYPQSLKH 
NPNGRFVVVCGDGEYIIYTALAWRNRSFGSALEFVWSSDGEHAVRESSTKIKIFSKNFQE 
KKTVRPTFSAEHIFGGTLLTMCSSDFICFYDWAECRLIRRIDVTVKNLYWADSGDLVAIA 
SDTSFYILKFNRDIVSSYFDGGKQIDEEGIEDAFELLNETNERVRTGLWVGDCFIYTNSS 
WRLNYCVGGEVTTMYHLDRPMYLLGYLANQSRVYLIDKEFNVIGYTLLLSLIEYKTLVMR 
GDLEQANEVLPSIPKEHHNSVAHFLESRGMTEDALEVATDPDYRFELAIQLGRLAVAKDI 
AVEAQNESKWKQLGELAMSSGKLDMAEECMRHAMDLSGLLLLYSSLGDADGMMKLAALAK 
EQGKNNVAFLCLFMLGQVEDCLHLLVESNRIPEAALMARSYLPSKVSEIVALWRNDLTKI 
SPKAAESLADPEEYPNLFEEWQVALSLENRAAETRGVHPPAGDYCSHADRDHTTLVDAFR 
IMQIEEEGRLEQGDVLDEVGEEGEDGEEEEEEDRQEESSDGRQQNVEEEAVVVDADSTDG 
AVLVNGNESEEQWVLTPPQE*
>AT1G79990.3 |  protein binding / structural molecule 
MPLRLEIKRKFAQRSERVKSVDLHPTEPWILASLYSGTLCIWNYQTQTMVKSFDVTELPV 
RSAKFIARKQWVVAGADDMFIRVYNYNTMDKIKVFEAHADYIRCVAVHPTLPYVLSSSDD 
MLIKLWDWEKGWLCTQIFEGHSHYVMQVTFNPKDTNTFASASLDRTIKIWNLGSPDPNFT 
LDAHLKGVNCVDYFTGGDKPYLITGSDDHTAKVWDYQTKSCVQTLEGHTHNVSAVSFHPE 
LPIIITGSEDGTVRIWHATTYRLENTLNYGLERVWAIGHIKGSRRVVIGYDEGSIMVKLG 
REIPVASMDNSGKIIWAKHNEIHTVNIKSVGADEVTDGERLPLAVKELGTCDLYPQSLKH 
NPNGRFVVVCGDGEYIIYTALAWRNRSFGSALEFVWSSDGEHAVRESSTKIKIFSKNFQE 
KKTVRPTFSAEHIFGGTLLTMCSSDFICFYDWAECRLIRRIDVTVKNLYWADSGDLVAIA 
SDTSFYILKFNRDIVSSYFDGGKQIDEEGIEDAFELLNETNERVRTGLWVGDCFIYTNSS 
WRLNYCVGGEVTTMYHLDRPMYLLGYLANQSRVYLIDKEFNVIGYTLLLSLIEYKTLVMR 
GDLEQANEVLPSIPKEHHNSVAHFLESRGMTEDALEVATDPDYRFELAIQLGRLAVAKDI 
AVEAQNESKWKQLGELAMSSGKLDMAEECMRHAMDLSGLLLLYSSLGDADGMMKLAALAK 
EQGKNNVAFLCLFMLGQVEDCLHLLVESNRIPEAALMARSYLPSKVSEIVALWRNDLTKI 
SPKAAESLADPEEYPNLFEEWQVALSLENRAAETRGVHPPAGDYCSHADRDHTTLVDAFR 
IMQIEEEGRLEQGDVLDEVGEEGEDGEEEEEEDRQEESSDGRQQNVEEEAVVVDADSTDG 
AVLVNGNESEEQWVLTPPQE*
>AT1G79990.3 |  LOCATED IN endomembrane system COPI vesicle coat Golgi membrane EXPRESSED IN 25 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s SYS1 homologue (InterProIPR016973) 
MPLRLEIKRKFAQRSERVKSVDLHPTEPWILASLYSGTLCIWNYQTQTMVKSFDVTELPV 
RSAKFIARKQWVVAGADDMFIRVYNYNTMDKIKVFEAHADYIRCVAVHPTLPYVLSSSDD 
MLIKLWDWEKGWLCTQIFEGHSHYVMQVTFNPKDTNTFASASLDRTIKIWNLGSPDPNFT 
LDAHLKGVNCVDYFTGGDKPYLITGSDDHTAKVWDYQTKSCVQTLEGHTHNVSAVSFHPE 
LPIIITGSEDGTVRIWHATTYRLENTLNYGLERVWAIGHIKGSRRVVIGYDEGSIMVKLG 
REIPVASMDNSGKIIWAKHNEIHTVNIKSVGADEVTDGERLPLAVKELGTCDLYPQSLKH 
NPNGRFVVVCGDGEYIIYTALAWRNRSFGSALEFVWSSDGEHAVRESSTKIKIFSKNFQE 
KKTVRPTFSAEHIFGGTLLTMCSSDFICFYDWAECRLIRRIDVTVKNLYWADSGDLVAIA 
SDTSFYILKFNRDIVSSYFDGGKQIDEEGIEDAFELLNETNERVRTGLWVGDCFIYTNSS 
WRLNYCVGGEVTTMYHLDRPMYLLGYLANQSRVYLIDKEFNVIGYTLLLSLIEYKTLVMR 
GDLEQANEVLPSIPKEHHNSVAHFLESRGMTEDALEVATDPDYRFELAIQLGRLAVAKDI 
AVEAQNESKWKQLGELAMSSGKLDMAEECMRHAMDLSGLLLLYSSLGDADGMMKLAALAK 
EQGKNNVAFLCLFMLGQVEDCLHLLVESNRIPEAALMARSYLPSKVSEIVALWRNDLTKI 
SPKAAESLADPEEYPNLFEEWQVALSLENRAAETRGVHPPAGDYCSHADRDHTTLVDAFR 
IMQIEEEGRLEQGDVLDEVGEEGEDGEEEEEEDRQEESSDGRQQNVEEEAVVVDADSTDG 
AVLVNGNESEEQWVLTPPQE*
>AT1G79990.3 |  protein binding / structural molecule 
MPLRLEIKRKFAQRSERVKSVDLHPTEPWILASLYSGTLCIWNYQTQTMVKSFDVTELPV 
RSAKFIARKQWVVAGADDMFIRVYNYNTMDKIKVFEAHADYIRCVAVHPTLPYVLSSSDD 
MLIKLWDWEKGWLCTQIFEGHSHYVMQVTFNPKDTNTFASASLDRTIKIWNLGSPDPNFT 
LDAHLKGVNCVDYFTGGDKPYLITGSDDHTAKVWDYQTKSCVQTLEGHTHNVSAVSFHPE 
LPIIITGSEDGTVRIWHATTYRLENTLNYGLERVWAIGHIKGSRRVVIGYDEGSIMVKLG 
REIPVASMDNSGKIIWAKHNEIHTVNIKSVGADEVTDGERLPLAVKELGTCDLYPQSLKH 
NPNGRFVVVCGDGEYIIYTALAWRNRSFGSALEFVWSSDGEHAVRESSTKIKIFSKNFQE 
KKTVRPTFSAEHIFGGTLLTMCSSDFICFYDWAECRLIRRIDVTVKNLYWADSGDLVAIA 
SDTSFYILKFNRDIVSSYFDGGKQIDEEGIEDAFELLNETNERVRTGLWVGDCFIYTNSS 
WRLNYCVGGEVTTMYHLDRPMYLLGYLANQSRVYLIDKEFNVIGYTLLLSLIEYKTLVMR 
GDLEQANEVLPSIPKEHHNSVAHFLESRGMTEDALEVATDPDYRFELAIQLGRLAVAKDI 
AVEAQNESKWKQLGELAMSSGKLDMAEECMRHAMDLSGLLLLYSSLGDADGMMKLAALAK 
EQGKNNVAFLCLFMLGQVEDCLHLLVESNRIPEAALMARSYLPSKVSEIVALWRNDLTKI 
SPKAAESLADPEEYPNLFEEWQVALSLENRAAETRGVHPPAGDYCSHADRDHTTLVDAFR 
IMQIEEEGRLEQGDVLDEVGEEGEDGEEEEEEDRQEESSDGRQQNVEEEAVVVDADSTDG 
AVLVNGNESEEQWVLTPPQE*
>AT1G79990.1 |  LOCATED IN endomembrane system COPI vesicle coat Golgi membrane EXPRESSED IN 25 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s SYS1 homologue (InterProIPR016973) Has 55556 Blast hits to 24059 proteins in 620 species Archae - 38 Bacteria - 5697 Metazoa - 25539 Fungi - 10898 Plants - 5309 Viruses - 0 Other Eukaryotes - 8075 (source NCBI BLink) 
MFYGTAVWDPWLIVGQIICLQCSYYLTLGLFTMVFLGLRVPRLSLVYFFDYATLTTSTFT 
GWSVIASFLFSSLAGAVYMIFLVERARKCLDFSATLYIIHLFFCIMYGGWPSSMAWWVVN 
GTGLAVMALLAEYLCIKREQREIPMDRFHSNPDSDPNRQRALEVGSPPLRFFFFVISDPP 
CDQGPFPSIIRSPVQTFVWTCKDVYVRVFLIALKFDPLRLEIKRKFAQRSERVKSVDLHP 
TEPWILASLYSGTLCIWNYQTQTMVKSFDVTELPVRSAKFIARKQWVVAGADDMFIRVYN 
YNTMDKIKVFEAHADYIRCVAVHPTLPYVLSSSDDMLIKLWDWEKGWLCTQIFEGHSHYV 
MQVTFNPKDTNTFASASLDRTIKIWNLGSPDPNFTLDAHLKGVNCVDYFTGGDKPYLITG 
SDDHTAKVWDYQTKSCVQTLEGHTHNVSAVSFHPELPIIITGSEDGTVRIWHATTYRLEN 
TLNYGLERVWAIGHIKGSRRVVIGYDEGSIMVKLGREIPVASMDNSGKIIWAKHNEIHTV 
NIKSVGADEVTDGERLPLAVKELGTCDLYPQSLKHNPNGRFVVVCGDGEYIIYTALAWRN 
RSFGSALEFVWSSDGEHAVRESSTKIKIFSKNFQEKKTVRPTFSAEHIFGGTLLTMCSSD 
FICFYDWAECRLIRRIDVTVKNLYWADSGDLVAIASDTSFYILKFNRDIVSSYFDGGKQI 
DEEGIEDAFELLNETNERVRTGLWVGDCFIYTNSSWRLNYCVGGEVTTMYHLDRPMYLLG 
YLANQSRVYLIDKEFNVIGYTLLLSLIEYKTLVMRGDLEQANEVLPSIPKEHHNSVAHFL 
ESRGMTEDALEVATDPDYRFELAIQLGRLAVAKDIAVEAQNESKWKQLGELAMSSGKLDM 
AEECMRHAMDLSGLLLLYSSLGDADGMMKLAALAKEQGKNNVAFLCLFMLGQVEDCLHLL 
VESNRIPEAALMARSYLPSKVSEIVALWRNDLTKISPKAAESLADPEEYPNLFEEWQVAL 
SLENRAAETRGVHPPAGDYCSHADRDHTTLVDAFRIMQIEEEGRLEQGDVLDEVGEEGED 
GEEEEEEDRQEESSDGRQQNVEEEAVVVDADSTDGAVLVNGNESEEQWVLTPPQE*
>AT1G79990.1 |  protein binding / structural molecule 
MFYGTAVWDPWLIVGQIICLQCSYYLTLGLFTMVFLGLRVPRLSLVYFFDYATLTTSTFT 
GWSVIASFLFSSLAGAVYMIFLVERARKCLDFSATLYIIHLFFCIMYGGWPSSMAWWVVN 
GTGLAVMALLAEYLCIKREQREIPMDRFHSNPDSDPNRQRALEVGSPPLRFFFFVISDPP 
CDQGPFPSIIRSPVQTFVWTCKDVYVRVFLIALKFDPLRLEIKRKFAQRSERVKSVDLHP 
TEPWILASLYSGTLCIWNYQTQTMVKSFDVTELPVRSAKFIARKQWVVAGADDMFIRVYN 
YNTMDKIKVFEAHADYIRCVAVHPTLPYVLSSSDDMLIKLWDWEKGWLCTQIFEGHSHYV 
MQVTFNPKDTNTFASASLDRTIKIWNLGSPDPNFTLDAHLKGVNCVDYFTGGDKPYLITG 
SDDHTAKVWDYQTKSCVQTLEGHTHNVSAVSFHPELPIIITGSEDGTVRIWHATTYRLEN 
TLNYGLERVWAIGHIKGSRRVVIGYDEGSIMVKLGREIPVASMDNSGKIIWAKHNEIHTV 
NIKSVGADEVTDGERLPLAVKELGTCDLYPQSLKHNPNGRFVVVCGDGEYIIYTALAWRN 
RSFGSALEFVWSSDGEHAVRESSTKIKIFSKNFQEKKTVRPTFSAEHIFGGTLLTMCSSD 
FICFYDWAECRLIRRIDVTVKNLYWADSGDLVAIASDTSFYILKFNRDIVSSYFDGGKQI 
DEEGIEDAFELLNETNERVRTGLWVGDCFIYTNSSWRLNYCVGGEVTTMYHLDRPMYLLG 
YLANQSRVYLIDKEFNVIGYTLLLSLIEYKTLVMRGDLEQANEVLPSIPKEHHNSVAHFL 
ESRGMTEDALEVATDPDYRFELAIQLGRLAVAKDIAVEAQNESKWKQLGELAMSSGKLDM 
AEECMRHAMDLSGLLLLYSSLGDADGMMKLAALAKEQGKNNVAFLCLFMLGQVEDCLHLL 
VESNRIPEAALMARSYLPSKVSEIVALWRNDLTKISPKAAESLADPEEYPNLFEEWQVAL 
SLENRAAETRGVHPPAGDYCSHADRDHTTLVDAFRIMQIEEEGRLEQGDVLDEVGEEGED 
GEEEEEEDRQEESSDGRQQNVEEEAVVVDADSTDGAVLVNGNESEEQWVLTPPQE*
>AT1G79990.1 |  protein binding / structural molecule 
MFYGTAVWDPWLIVGQIICLQCSYYLTLGLFTMVFLGLRVPRLSLVYFFDYATLTTSTFT 
GWSVIASFLFSSLAGAVYMIFLVERARKCLDFSATLYIIHLFFCIMYGGWPSSMAWWVVN 
GTGLAVMALLAEYLCIKREQREIPMDRFHSNPDSDPNRQRALEVGSPPLRFFFFVISDPP 
CDQGPFPSIIRSPVQTFVWTCKDVYVRVFLIALKFDPLRLEIKRKFAQRSERVKSVDLHP 
TEPWILASLYSGTLCIWNYQTQTMVKSFDVTELPVRSAKFIARKQWVVAGADDMFIRVYN 
YNTMDKIKVFEAHADYIRCVAVHPTLPYVLSSSDDMLIKLWDWEKGWLCTQIFEGHSHYV 
MQVTFNPKDTNTFASASLDRTIKIWNLGSPDPNFTLDAHLKGVNCVDYFTGGDKPYLITG 
SDDHTAKVWDYQTKSCVQTLEGHTHNVSAVSFHPELPIIITGSEDGTVRIWHATTYRLEN 
TLNYGLERVWAIGHIKGSRRVVIGYDEGSIMVKLGREIPVASMDNSGKIIWAKHNEIHTV 
NIKSVGADEVTDGERLPLAVKELGTCDLYPQSLKHNPNGRFVVVCGDGEYIIYTALAWRN 
RSFGSALEFVWSSDGEHAVRESSTKIKIFSKNFQEKKTVRPTFSAEHIFGGTLLTMCSSD 
FICFYDWAECRLIRRIDVTVKNLYWADSGDLVAIASDTSFYILKFNRDIVSSYFDGGKQI 
DEEGIEDAFELLNETNERVRTGLWVGDCFIYTNSSWRLNYCVGGEVTTMYHLDRPMYLLG 
YLANQSRVYLIDKEFNVIGYTLLLSLIEYKTLVMRGDLEQANEVLPSIPKEHHNSVAHFL 
ESRGMTEDALEVATDPDYRFELAIQLGRLAVAKDIAVEAQNESKWKQLGELAMSSGKLDM 
AEECMRHAMDLSGLLLLYSSLGDADGMMKLAALAKEQGKNNVAFLCLFMLGQVEDCLHLL 
VESNRIPEAALMARSYLPSKVSEIVALWRNDLTKISPKAAESLADPEEYPNLFEEWQVAL 
SLENRAAETRGVHPPAGDYCSHADRDHTTLVDAFRIMQIEEEGRLEQGDVLDEVGEEGED 
GEEEEEEDRQEESSDGRQQNVEEEAVVVDADSTDGAVLVNGNESEEQWVLTPPQE*
>AT1G79990.1 |  LOCATED IN endomembrane system COPI vesicle coat Golgi membrane EXPRESSED IN 25 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s SYS1 homologue (InterProIPR016973) 
MFYGTAVWDPWLIVGQIICLQCSYYLTLGLFTMVFLGLRVPRLSLVYFFDYATLTTSTFT 
GWSVIASFLFSSLAGAVYMIFLVERARKCLDFSATLYIIHLFFCIMYGGWPSSMAWWVVN 
GTGLAVMALLAEYLCIKREQREIPMDRFHSNPDSDPNRQRALEVGSPPLRFFFFVISDPP 
CDQGPFPSIIRSPVQTFVWTCKDVYVRVFLIALKFDPLRLEIKRKFAQRSERVKSVDLHP 
TEPWILASLYSGTLCIWNYQTQTMVKSFDVTELPVRSAKFIARKQWVVAGADDMFIRVYN 
YNTMDKIKVFEAHADYIRCVAVHPTLPYVLSSSDDMLIKLWDWEKGWLCTQIFEGHSHYV 
MQVTFNPKDTNTFASASLDRTIKIWNLGSPDPNFTLDAHLKGVNCVDYFTGGDKPYLITG 
SDDHTAKVWDYQTKSCVQTLEGHTHNVSAVSFHPELPIIITGSEDGTVRIWHATTYRLEN 
TLNYGLERVWAIGHIKGSRRVVIGYDEGSIMVKLGREIPVASMDNSGKIIWAKHNEIHTV 
NIKSVGADEVTDGERLPLAVKELGTCDLYPQSLKHNPNGRFVVVCGDGEYIIYTALAWRN 
RSFGSALEFVWSSDGEHAVRESSTKIKIFSKNFQEKKTVRPTFSAEHIFGGTLLTMCSSD 
FICFYDWAECRLIRRIDVTVKNLYWADSGDLVAIASDTSFYILKFNRDIVSSYFDGGKQI 
DEEGIEDAFELLNETNERVRTGLWVGDCFIYTNSSWRLNYCVGGEVTTMYHLDRPMYLLG 
YLANQSRVYLIDKEFNVIGYTLLLSLIEYKTLVMRGDLEQANEVLPSIPKEHHNSVAHFL 
ESRGMTEDALEVATDPDYRFELAIQLGRLAVAKDIAVEAQNESKWKQLGELAMSSGKLDM 
AEECMRHAMDLSGLLLLYSSLGDADGMMKLAALAKEQGKNNVAFLCLFMLGQVEDCLHLL 
VESNRIPEAALMARSYLPSKVSEIVALWRNDLTKISPKAAESLADPEEYPNLFEEWQVAL 
SLENRAAETRGVHPPAGDYCSHADRDHTTLVDAFRIMQIEEEGRLEQGDVLDEVGEEGED 
GEEEEEEDRQEESSDGRQQNVEEEAVVVDADSTDGAVLVNGNESEEQWVLTPPQE*
>AT1G79990.1 |  protein binding / structural molecule 
MFYGTAVWDPWLIVGQIICLQCSYYLTLGLFTMVFLGLRVPRLSLVYFFDYATLTTSTFT 
GWSVIASFLFSSLAGAVYMIFLVERARKCLDFSATLYIIHLFFCIMYGGWPSSMAWWVVN 
GTGLAVMALLAEYLCIKREQREIPMDRFHSNPDSDPNRQRALEVGSPPLRFFFFVISDPP 
CDQGPFPSIIRSPVQTFVWTCKDVYVRVFLIALKFDPLRLEIKRKFAQRSERVKSVDLHP 
TEPWILASLYSGTLCIWNYQTQTMVKSFDVTELPVRSAKFIARKQWVVAGADDMFIRVYN 
YNTMDKIKVFEAHADYIRCVAVHPTLPYVLSSSDDMLIKLWDWEKGWLCTQIFEGHSHYV 
MQVTFNPKDTNTFASASLDRTIKIWNLGSPDPNFTLDAHLKGVNCVDYFTGGDKPYLITG 
SDDHTAKVWDYQTKSCVQTLEGHTHNVSAVSFHPELPIIITGSEDGTVRIWHATTYRLEN 
TLNYGLERVWAIGHIKGSRRVVIGYDEGSIMVKLGREIPVASMDNSGKIIWAKHNEIHTV 
NIKSVGADEVTDGERLPLAVKELGTCDLYPQSLKHNPNGRFVVVCGDGEYIIYTALAWRN 
RSFGSALEFVWSSDGEHAVRESSTKIKIFSKNFQEKKTVRPTFSAEHIFGGTLLTMCSSD 
FICFYDWAECRLIRRIDVTVKNLYWADSGDLVAIASDTSFYILKFNRDIVSSYFDGGKQI 
DEEGIEDAFELLNETNERVRTGLWVGDCFIYTNSSWRLNYCVGGEVTTMYHLDRPMYLLG 
YLANQSRVYLIDKEFNVIGYTLLLSLIEYKTLVMRGDLEQANEVLPSIPKEHHNSVAHFL 
ESRGMTEDALEVATDPDYRFELAIQLGRLAVAKDIAVEAQNESKWKQLGELAMSSGKLDM 
AEECMRHAMDLSGLLLLYSSLGDADGMMKLAALAKEQGKNNVAFLCLFMLGQVEDCLHLL 
VESNRIPEAALMARSYLPSKVSEIVALWRNDLTKISPKAAESLADPEEYPNLFEEWQVAL 
SLENRAAETRGVHPPAGDYCSHADRDHTTLVDAFRIMQIEEEGRLEQGDVLDEVGEEGED 
GEEEEEEDRQEESSDGRQQNVEEEAVVVDADSTDGAVLVNGNESEEQWVLTPPQE*
>AT1G79990.4 |  LOCATED IN endomembrane system COPI vesicle coat Golgi membrane EXPRESSED IN 25 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s SYS1 homologue (InterProIPR016973) Has 55556 Blast hits to 24059 proteins in 620 species Archae - 38 Bacteria - 5697 Metazoa - 25539 Fungi - 10898 Plants - 5309 Viruses - 0 Other Eukaryotes - 8075 (source NCBI BLink) 
MFYGTAVWDPWLIVGQIICLQCSYYLTLGLFTMVFLGLRVPRLSLVYFFDYATLTTSTFT 
GWSVIASFLFSSLAGAVYMIFLVERARKCLDFSATLYIIHLFFCIMYGGWPSSMAWWVVN 
GTGLAVMALLAEYLCIKREQREIPMDRFHSRV*
>AT1G79990.4 |  protein binding / structural molecule 
MFYGTAVWDPWLIVGQIICLQCSYYLTLGLFTMVFLGLRVPRLSLVYFFDYATLTTSTFT 
GWSVIASFLFSSLAGAVYMIFLVERARKCLDFSATLYIIHLFFCIMYGGWPSSMAWWVVN 
GTGLAVMALLAEYLCIKREQREIPMDRFHSRV*
>AT1G79990.4 |  protein binding / structural molecule 
MFYGTAVWDPWLIVGQIICLQCSYYLTLGLFTMVFLGLRVPRLSLVYFFDYATLTTSTFT 
GWSVIASFLFSSLAGAVYMIFLVERARKCLDFSATLYIIHLFFCIMYGGWPSSMAWWVVN 
GTGLAVMALLAEYLCIKREQREIPMDRFHSRV*
>AT1G79990.4 |  LOCATED IN endomembrane system COPI vesicle coat Golgi membrane EXPRESSED IN 25 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s SYS1 homologue (InterProIPR016973) 
MFYGTAVWDPWLIVGQIICLQCSYYLTLGLFTMVFLGLRVPRLSLVYFFDYATLTTSTFT 
GWSVIASFLFSSLAGAVYMIFLVERARKCLDFSATLYIIHLFFCIMYGGWPSSMAWWVVN 
GTGLAVMALLAEYLCIKREQREIPMDRFHSRV*
>AT1G79990.4 |  protein binding / structural molecule 
MFYGTAVWDPWLIVGQIICLQCSYYLTLGLFTMVFLGLRVPRLSLVYFFDYATLTTSTFT 
GWSVIASFLFSSLAGAVYMIFLVERARKCLDFSATLYIIHLFFCIMYGGWPSSMAWWVVN 
GTGLAVMALLAEYLCIKREQREIPMDRFHSRV*
>AT1G79990.5 |  LOCATED IN endomembrane system COPI vesicle coat Golgi membrane EXPRESSED IN 25 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s SYS1 homologue (InterProIPR016973) Has 55556 Blast hits to 24059 proteins in 620 species Archae - 38 Bacteria - 5697 Metazoa - 25539 Fungi - 10898 Plants - 5309 Viruses - 0 Other Eukaryotes - 8075 (source NCBI BLink) 
MPLRLEIKRKFAQRSERVKSVDLHPTEPWILASLYSGTLCIWNYQTQTMVKSFDVTELPV 
RSAKFIARKQWVVAGADDMFIRVYNYNTMDKIKVFEAHADYIRCVAVHPTLPYVLSSSDD 
MLIKLWDWEKGWLCTQIFEGHSHYVMQVTFNPKDTNTFASASLDRTIKIWNLGSPDPNFT 
LDAHLKGVNCVDYFTGGDKPYLITGSDDHTAKVWDYQTKSCVQTLEGHTHNVSAVSFHPE 
LPIIITGSEDGTVRIWHATTYRLENTLNYGLERVWAIGHIKGSRRVVIGYDEGSIMVKLG 
REIPVASMDNSGKIIWAKHNEIHTVNIKSVGADEVTDGERLPLAVKELGTCDLYPQSLKH 
NPNGRFVVVCGDGEYIIYTALAWRNRSFGSALEFVWSSDGEHAVRESSTKIKIFSKNFQE 
KKTVRPTFSAEHIFGGTLLTMCSSDFICFYDWAECRLIRRIDVTVKNLYWADSGDLVAIA 
SDTSFYILKFNRDIVSSYFDGGKQIDEEGIEDAFELLNETNERVRTGLWVGDCFIYTNSS 
WRLNYCVGGEVTTMYHLDRPMYLLGYLANQSRVYLIDKEFNVIGYTLLLSLIEYKTLVMR 
GDLEQANEVLPSIPKEHHNSVAHFLESRGMTEDALEVATDPDYRFELAIQLGRLAVAKDI 
AVEAQNESKWKQLGELAMSSGKLDMAEECMRHAMDLSGLLLLYSSLGDADGMMKLAALAK 
EQGKNNVAFLCLFMLGQVEDCLHLLVESNRIPEAALMARSYLPSKVSEIVALWRNDLTKI 
SPKAAESLADPEEYPNLFEEWQVALSLENRAAETRGVHPPAGDYCSHADRDHTTLVDAFR 
IMQIEEEGRLEQGDVLDEVGEEGEDGEEEEEEDRQEESSDGRQQNVEEEAVVVDADSTDG 
AVLVNVLTPPQE*
>AT1G79990.5 |  protein binding / structural molecule 
MPLRLEIKRKFAQRSERVKSVDLHPTEPWILASLYSGTLCIWNYQTQTMVKSFDVTELPV 
RSAKFIARKQWVVAGADDMFIRVYNYNTMDKIKVFEAHADYIRCVAVHPTLPYVLSSSDD 
MLIKLWDWEKGWLCTQIFEGHSHYVMQVTFNPKDTNTFASASLDRTIKIWNLGSPDPNFT 
LDAHLKGVNCVDYFTGGDKPYLITGSDDHTAKVWDYQTKSCVQTLEGHTHNVSAVSFHPE 
LPIIITGSEDGTVRIWHATTYRLENTLNYGLERVWAIGHIKGSRRVVIGYDEGSIMVKLG 
REIPVASMDNSGKIIWAKHNEIHTVNIKSVGADEVTDGERLPLAVKELGTCDLYPQSLKH 
NPNGRFVVVCGDGEYIIYTALAWRNRSFGSALEFVWSSDGEHAVRESSTKIKIFSKNFQE 
KKTVRPTFSAEHIFGGTLLTMCSSDFICFYDWAECRLIRRIDVTVKNLYWADSGDLVAIA 
SDTSFYILKFNRDIVSSYFDGGKQIDEEGIEDAFELLNETNERVRTGLWVGDCFIYTNSS 
WRLNYCVGGEVTTMYHLDRPMYLLGYLANQSRVYLIDKEFNVIGYTLLLSLIEYKTLVMR 
GDLEQANEVLPSIPKEHHNSVAHFLESRGMTEDALEVATDPDYRFELAIQLGRLAVAKDI 
AVEAQNESKWKQLGELAMSSGKLDMAEECMRHAMDLSGLLLLYSSLGDADGMMKLAALAK 
EQGKNNVAFLCLFMLGQVEDCLHLLVESNRIPEAALMARSYLPSKVSEIVALWRNDLTKI 
SPKAAESLADPEEYPNLFEEWQVALSLENRAAETRGVHPPAGDYCSHADRDHTTLVDAFR 
IMQIEEEGRLEQGDVLDEVGEEGEDGEEEEEEDRQEESSDGRQQNVEEEAVVVDADSTDG 
AVLVNVLTPPQE*
>AT1G79990.5 |  protein binding / structural molecule 
MPLRLEIKRKFAQRSERVKSVDLHPTEPWILASLYSGTLCIWNYQTQTMVKSFDVTELPV 
RSAKFIARKQWVVAGADDMFIRVYNYNTMDKIKVFEAHADYIRCVAVHPTLPYVLSSSDD 
MLIKLWDWEKGWLCTQIFEGHSHYVMQVTFNPKDTNTFASASLDRTIKIWNLGSPDPNFT 
LDAHLKGVNCVDYFTGGDKPYLITGSDDHTAKVWDYQTKSCVQTLEGHTHNVSAVSFHPE 
LPIIITGSEDGTVRIWHATTYRLENTLNYGLERVWAIGHIKGSRRVVIGYDEGSIMVKLG 
REIPVASMDNSGKIIWAKHNEIHTVNIKSVGADEVTDGERLPLAVKELGTCDLYPQSLKH 
NPNGRFVVVCGDGEYIIYTALAWRNRSFGSALEFVWSSDGEHAVRESSTKIKIFSKNFQE 
KKTVRPTFSAEHIFGGTLLTMCSSDFICFYDWAECRLIRRIDVTVKNLYWADSGDLVAIA 
SDTSFYILKFNRDIVSSYFDGGKQIDEEGIEDAFELLNETNERVRTGLWVGDCFIYTNSS 
WRLNYCVGGEVTTMYHLDRPMYLLGYLANQSRVYLIDKEFNVIGYTLLLSLIEYKTLVMR 
GDLEQANEVLPSIPKEHHNSVAHFLESRGMTEDALEVATDPDYRFELAIQLGRLAVAKDI 
AVEAQNESKWKQLGELAMSSGKLDMAEECMRHAMDLSGLLLLYSSLGDADGMMKLAALAK 
EQGKNNVAFLCLFMLGQVEDCLHLLVESNRIPEAALMARSYLPSKVSEIVALWRNDLTKI 
SPKAAESLADPEEYPNLFEEWQVALSLENRAAETRGVHPPAGDYCSHADRDHTTLVDAFR 
IMQIEEEGRLEQGDVLDEVGEEGEDGEEEEEEDRQEESSDGRQQNVEEEAVVVDADSTDG 
AVLVNVLTPPQE*
>AT1G79990.5 |  LOCATED IN endomembrane system COPI vesicle coat Golgi membrane EXPRESSED IN 25 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s SYS1 homologue (InterProIPR016973) 
MPLRLEIKRKFAQRSERVKSVDLHPTEPWILASLYSGTLCIWNYQTQTMVKSFDVTELPV 
RSAKFIARKQWVVAGADDMFIRVYNYNTMDKIKVFEAHADYIRCVAVHPTLPYVLSSSDD 
MLIKLWDWEKGWLCTQIFEGHSHYVMQVTFNPKDTNTFASASLDRTIKIWNLGSPDPNFT 
LDAHLKGVNCVDYFTGGDKPYLITGSDDHTAKVWDYQTKSCVQTLEGHTHNVSAVSFHPE 
LPIIITGSEDGTVRIWHATTYRLENTLNYGLERVWAIGHIKGSRRVVIGYDEGSIMVKLG 
REIPVASMDNSGKIIWAKHNEIHTVNIKSVGADEVTDGERLPLAVKELGTCDLYPQSLKH 
NPNGRFVVVCGDGEYIIYTALAWRNRSFGSALEFVWSSDGEHAVRESSTKIKIFSKNFQE 
KKTVRPTFSAEHIFGGTLLTMCSSDFICFYDWAECRLIRRIDVTVKNLYWADSGDLVAIA 
SDTSFYILKFNRDIVSSYFDGGKQIDEEGIEDAFELLNETNERVRTGLWVGDCFIYTNSS 
WRLNYCVGGEVTTMYHLDRPMYLLGYLANQSRVYLIDKEFNVIGYTLLLSLIEYKTLVMR 
GDLEQANEVLPSIPKEHHNSVAHFLESRGMTEDALEVATDPDYRFELAIQLGRLAVAKDI 
AVEAQNESKWKQLGELAMSSGKLDMAEECMRHAMDLSGLLLLYSSLGDADGMMKLAALAK 
EQGKNNVAFLCLFMLGQVEDCLHLLVESNRIPEAALMARSYLPSKVSEIVALWRNDLTKI 
SPKAAESLADPEEYPNLFEEWQVALSLENRAAETRGVHPPAGDYCSHADRDHTTLVDAFR 
IMQIEEEGRLEQGDVLDEVGEEGEDGEEEEEEDRQEESSDGRQQNVEEEAVVVDADSTDG 
AVLVNVLTPPQE*
>AT1G79990.5 |  protein binding / structural molecule 
MPLRLEIKRKFAQRSERVKSVDLHPTEPWILASLYSGTLCIWNYQTQTMVKSFDVTELPV 
RSAKFIARKQWVVAGADDMFIRVYNYNTMDKIKVFEAHADYIRCVAVHPTLPYVLSSSDD 
MLIKLWDWEKGWLCTQIFEGHSHYVMQVTFNPKDTNTFASASLDRTIKIWNLGSPDPNFT 
LDAHLKGVNCVDYFTGGDKPYLITGSDDHTAKVWDYQTKSCVQTLEGHTHNVSAVSFHPE 
LPIIITGSEDGTVRIWHATTYRLENTLNYGLERVWAIGHIKGSRRVVIGYDEGSIMVKLG 
REIPVASMDNSGKIIWAKHNEIHTVNIKSVGADEVTDGERLPLAVKELGTCDLYPQSLKH 
NPNGRFVVVCGDGEYIIYTALAWRNRSFGSALEFVWSSDGEHAVRESSTKIKIFSKNFQE 
KKTVRPTFSAEHIFGGTLLTMCSSDFICFYDWAECRLIRRIDVTVKNLYWADSGDLVAIA 
SDTSFYILKFNRDIVSSYFDGGKQIDEEGIEDAFELLNETNERVRTGLWVGDCFIYTNSS 
WRLNYCVGGEVTTMYHLDRPMYLLGYLANQSRVYLIDKEFNVIGYTLLLSLIEYKTLVMR 
GDLEQANEVLPSIPKEHHNSVAHFLESRGMTEDALEVATDPDYRFELAIQLGRLAVAKDI 
AVEAQNESKWKQLGELAMSSGKLDMAEECMRHAMDLSGLLLLYSSLGDADGMMKLAALAK 
EQGKNNVAFLCLFMLGQVEDCLHLLVESNRIPEAALMARSYLPSKVSEIVALWRNDLTKI 
SPKAAESLADPEEYPNLFEEWQVALSLENRAAETRGVHPPAGDYCSHADRDHTTLVDAFR 
IMQIEEEGRLEQGDVLDEVGEEGEDGEEEEEEDRQEESSDGRQQNVEEEAVVVDADSTDG 
AVLVNVLTPPQE*
>AT3G25100.1 |  CDC45 (cell division cycle 45) 
MVRIKKVESFYAKLRESATSLSSQNPLLIFPSTSDVDSLCALKVITHILESDSIQYSCFP 
VSSFLEIHKYAGPAGLCSTSLESPPVTILLINWGCHRDLKLVLKLGPSARVFVVDSHRPI 
HLHNLSDYNEQVVVLHTDDDERQGDLAYDFDVLKLANESFQLRVEDAGEESDEEEEDEEE 
DEEDDDDDDGDRPSKRRKMGDGVKVFKKLKRDYYKMGTFHGKPSGCLLFELSHMLRKNTN 
ELLWLACVSLTDQFVHERLTDERYQAAVMELEQHINSSGNIDKITSVTLKDGTKVRAPDC 
SRISYEEEPRLMLLREWTLFDSMLCSSYIATKLKTWSDNGIKKLKLLLARMGFALIECQQ 
KFPYMSLEVKRKMKQEFDRFLPEYGLNDFYYRSFLRLHGYSSRVSAADVVYGITALLESF 
LGSGGSSASKQFGEAYDALSLNNLDKLRSGMQQAIKVQRAILRQGSAAITKSGCIRSGRK 
FRWVKIEDSMDAKYLGYPQALTKFCYFLMDALREKGARMKPMLCACASQQPGKILVVGVC 
GKPRLGAVRGNAFGNAFRKAAQESRADYFHELFESSWIVLDASAVNSFMIRLTEKL*
>AT4G16970.1 |  ATP binding / kinase/ protein kinase/ protein serine/threonine kinase 
MSENSEPRQLENSTAGRELIPLSPTNSDGNDDLNYHLHAFELSRLLLSSGHPESVIDLSS 
KCTYFQGSPNLVKYLCSIPNSPISLAEDGFTVTLSPESPSAPASFACSLDLQENVVLEQF 
MDPRSLTLKHSRENAEQEELELMPLPKRSRNDGNDVNYSVIDSRPNDIRTVACGTMLGTI 
LALESQASVFNLSASNRGIEAFVQDHQPGPQTSNASVDVNPTHRLEESKNDLPSPQEDGY 
YERPEIGDFQIADNQILIEEGDDKNKKDLFPKGEIQTDSVQSDPVASLMPTENELEPVQI 
VDDTEDLLVDDHTVDIVSTPDRELPLKPSATEANQDKSLVQKTLDQCKLPGNSKTYSCSP 
EIKHTRKSKVIQKRKQNFNTVRLKDQKDQAKHNTIPDFDSYTIVEEEGSGGYGIVYKATR 
KTDGTEFAIKCPHVGAQKYYVNNEIRMLERFGGKNCIIKHEGCLKNGDSDCIILEHLEHD 
RPDSLKREIDVYQLQWYGYCMFKALSSLHKQGVVHRDVKPGNFLFSRKTNKGYLIDFNLA 
MDLHQKYRRADKSKAASGLPTASKKHHTLVKSLDAVNRGTNKPSQKTLAPNSIKKAAGKT 
RARNDMTRWERLNSQGAEGSGLTSAKDVTSTRNNPSGEKRREPLPCHGRKALLDFLQETM 
SVPIPNHEVSSKAPTSMRKRVAALPGKAEKELLYLTPMPLCSNGRPEAGDVIEKKDGPCS 
GTKGFRAPEVCFRSLHQGPKIDVWSAGVTLLYLIMGRTPFTGDPEQNIKDIAQLRGSEEL 
WEVAKLHNRESSFPKELYESRYLKGMELRKWCELNTKRREFLDVIPLSLLDLVDKCLTVN 
PRRRISAEDALKHDFFHPVHETLRNQMLLKQQPTVVADAVSQTLNYLQL*
>AT4G04880.1 |  adenosine/AMP deaminase family protein 
MEWIQSLPKIELHAHLNGSIRDSTLLELARVLGEKGVIVFADVEHVIQKNDRSLVEVFKL 
FDLIHKLTTDHKTVTRITREVVEDFALENVVYLELRTTPKRSDSIGMSKRSYMEAVIQGL 
RSVSEVDIDFVTASDSQKLHNAGDGIGRKKIYVRLLLSIDRRETTESAMETVKLALEMRD 
VGVVGIDLSGNPLVGEWSTFLPALQYAKDNDLHITLHCGEVPNPKEIQAMLDFKPHRIGH 
ACFFKDEDWTKLKSFRIPVEICLTSNIVTKSISSIDIHHFADLYNAKHPLILCTDDFGVF 
STSLSNEYALAVRSLGLSKSETFALARAAIDATFAEDEVKQQLRFIFDSASPEHV*
>AT5G66100.1 |  La domain-containing protein 
MMATTASSAANSASRFSIDSSISRSRHGDSSPWLLPSDSHDHPTLSLSQDDPFSAPSVSP 
PTGNNSSDYDNADKKPPPVWNMPSSNSSSDVGPVMGAAESWPALSLSARSSSIKSPSLDA 
SKPFPDGSSSSIPPPQATSNTSTNANAGSSVSATSSENSAVNNSQRKPFRRNNNTSSSST 
SSNVSNAAPLNTRDQNHSQRGGGSFGSGNFRNSQRNRNSSSYPRGEGLHHGNRRNYEHGN 
QSGFSHRNYSGRDMHLQPQRGVGMIRPQMLMGPPSFPASSAQYMAAPQLGSYGGPIIYPD 
YAQHVFMPHPSPDPMGLVGPFPLQPMYFRNFDAILYNKILTQVEYYFSADNLSRDEHLRD 
QMNDEGWVPVRVIAAFRRLAELTNNIQTILEALRSSEVVEIQGETLRRRGDWDKYLLPRE 
PSRSGPAAGASNNASLVSQIESMTLSERSREGV*
>AT1G52360.1 |  coatomer protein complex subunit beta 2 (beta prime) putative 
MPLRLEIKRKLAQRSERVKSVDLHPTEPWILASLYSGTLCIWNYQTQVMAKSFEVTELPV 
RSAKFVARKQWVVAGADDMYIRVYNYNTMDKVKVFEAHSDYIRCVAVHPTLPYVLSSSDD 
MLIKLWDWEKGWACTQIFEGHSHYVMQVTFNPKDTNTFASASLDRTIKIWNLGSPDPNFT 
LDAHQKGVNCVDYFTGGDKPYLITGSDDHTAKVWDYQTKSCVQTLEGHTHNVSAVCFHPE 
LPIIITGSEDGTVRIWHATTYRLENTLNYGLERVWAIGYIKSSRRVVIGYDEGTIMVKLG 
REIPVASMDNTGKIIWAKHNEIQTANIKSIGADYEVTDGERLPLSVKELGTCDLYPQSLK 
HNPNGRFVVVCGDGEYIIYTALAWRNRSFGSGLEFVWSSEGECAVRESSSKIKIFSKNFQ 
EKRSIRPTFSAEKIFGGTLLAMCSSDFICFYDWAECRLIQRIDVTVKNLYWADSGDLVAI 
ASDTSFYILKFNRDLVTSHFDSGRPTEEEGVEDAFEVLHENDERVRTGIWVGDCFIYNNS 
SWKLNYCVGGEVTTMYHLDRPMYLLGYLASQSRVFLVDKEFNVIGYTLLLSLIEYKTLVM 
RGDLDKASEILPTIPKDQHNSVAHFLESRGMIEDALEIATDPDYRFELAIQLGRLEIAQE 
IAVEVQSESKWKQLGELAMSSGKLQMAEECMKYAMDLSGLLLLYSSLGDAEGVTKLATLA 
KEQGKNNVAFLCLFMLGKLEDCLQLLVESNRIPEAALMARSYLPSKVSEIVALWRKDLSK 
VNSKAAESLADPEEYSNLFEDWQVALSVEAKAVETRGVYTGAKDYPSHADKSSMTLVEAF 
RNLQVEEEESLENGDMDHEEVVAEENGNEQRNEDDVAEHVEEHHEEKEAEEEEGIVDGDS 
TDGAVLVNGSEADEEWGTNNEGNPSA*
>AT4G35890.1 |  La domain-containing protein 
MASATSNNPASSSMSPRRISGNHGSPTASVAQSPRRPSRQVSSPWTQIVRGESEPIAAAA 
AVAGPSSPQSRAPIEPIASVSVAAPTAAVLTVEAAAGDEKSEASGGQDNAGKKPVWKRPS 
NGASEVGPVMGASSWPALSETTKAPSNKSSSDSLKSLGDVPSSSSASSSVPVTQGIANAS 
VPAPKQAGRANPNPTPNHSRQRSFKQRNGASGSANGTVSQPSAQGSFTELPSHNPSPRGQ 
NQKNGFASQNHGGTENPSQRDSYRNQNGNHHQSHGGRRNQEHGNQNWTFQRSFNGREGNA 
QSQRGTPAFVRHPSPTVQPIPQFMAAQPFPSHIPFPTELAQSSYYPRMPYMTPIPHGPQF 
FYHYQDPPLHMKLHKQIQYYFSDENLITDIYLRGFMNNEGFVPLRVVAGFKKVAELTDNI 
QQIVEALQNSPHVEVQGDFIRKRDNWQNWVLRRNPTGSGPQSVDRADAVAKRLGNLSVDQ 
SSADPIGGSSSQLQPTEALSDDQQQSSSTAPVSNHNAPDGANR*
>AT2G32160.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 325 Blast hits to 315 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 70 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV 
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG 
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL 
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET 
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKKQ*
>AT2G32160.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 327 Blast hits to 317 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV 
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG 
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL 
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET 
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKKQ*
>AT2G32160.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 328 Blast hits to 321 proteins in 142 species Archae - 0 Bacteria - 0 Metazoa - 99 Fungi - 132 Plants - 25 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV 
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG 
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL 
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET 
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKKQ*
>AT2G32160.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 325 Blast hits to 315 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 70 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV 
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG 
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL 
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET 
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKEKTIETT 
YSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 327 Blast hits to 317 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV 
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG 
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL 
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET 
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKEKTIETT 
YSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 328 Blast hits to 321 proteins in 142 species Archae - 0 Bacteria - 0 Metazoa - 99 Fungi - 132 Plants - 25 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV 
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG 
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL 
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET 
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKEKTIETT 
YSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 325 Blast hits to 315 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 70 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSVPF 
SSHDWLDSSLQAHVPLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRH 
KESTRPACLVPGAGLGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPW 
IHTNCNSLSDDDQLRPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCF 
FIDTAHNIIEYIETISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASH 
YGFEMEKEKTIETTYSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 327 Blast hits to 317 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSVPF 
SSHDWLDSSLQAHVPLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRH 
KESTRPACLVPGAGLGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPW 
IHTNCNSLSDDDQLRPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCF 
FIDTAHNIIEYIETISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASH 
YGFEMEKEKTIETTYSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 328 Blast hits to 321 proteins in 142 species Archae - 0 Bacteria - 0 Metazoa - 99 Fungi - 132 Plants - 25 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSVPF 
SSHDWLDSSLQAHVPLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRH 
KESTRPACLVPGAGLGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPW 
IHTNCNSLSDDDQLRPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCF 
FIDTAHNIIEYIETISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASH 
YGFEMEKEKTIETTYSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT5G13010.1 |  EMB3011 (embryo defective 3011) ATP binding / RNA helicase/ helicase/ nucleic acid binding 
MGVDPFKTTETLEADKETNGGVPVKDKLTFKAPERKSRLGLDARAIEKKDNAKTEGEFKV 
PKKSAISVTSSLDEEDKSDVSGLDFGTENTRPVHSSRRYREKSSRSQSAQESTVTTENAG 
TSDVVAIGIEKNIGVTEVKLRGQDRETLMMRWITTDGGNLIANLTETITEKSVGDTIAIG 
GLQDEWERSPHGDRGSSYSRRPQPSPSPMLAAASPDARLASPWLDTPRSTMSSASPWDMG 
APSPIPIRASGSSIRSSSSRYGGRSNQLAYSREGDLTNEGHSDEDRSQGAEEFKHEITET 
MRVEMEYQSDRAWYDTDEGNSLFDADSASFFLGDDASLQKKETELAKRLVRRDGSKMSLA 
QSKKYSQLNADNAQWEDRQLLRSGAVRGTEVQTEFDSEEERKAILLVHDTKPPFLDGRVV 
YTKQAEPVMPVKDPTSDMAIISRKGSGLVKEIREKQSANKSRQRFWELAGSNLGNILGIE 
KSAEQIDADTAVVGDDGEVDFKGEAKFAQHMKKGEAVSEFAMSKTMAEQRQYLPIFSVRD 
ELLQVIRENQVIVVVGETGSGKTTQLTQDGYTINGIVGCTQPRRVAAMSVAKRVSEEMET 
ELGDKIGYAIRFEDVTGPNTVIKYMTDGVLLRETLKDSDLDKYRVVVMDEAHERSLNTDV 
LFGILKKVVARRRDFKLIVTSATLNAQKFSNFFGSVPIFNIPGRTFPVNILYSKTPCEDY 
VEAAVKQAMTIHITSPPGDILIFMTGQDEIEAACFSLKERMEQLVSSSSREITNLLILPI 
YSQLPADLQAKIFQKPEDGARKCIVATNIAETSLTVDGIYYVIDTGYGKMKVFNPRMGMD 
ALQVFPISRAASDQRAGRAGRTGPGTCYRLYTESAYLNEMLPSPVPEIQRTNLGNVVLLL 
KSLKIDNLLDFDFMDPPPQENILNSMYQLWVLGALNNVGGLTDLGWKMVEFPLDPPLAKM 
LLMGERLDCIDEVLTIVSMLSVPSVFFRPKERAEESDAAREKFFVPESDHLTLLNVYQQW 
KEHDYRGDWCNDHYLQVKGLRKAREVRSQLLDILKQLKIELRSCGPDWDIVRKAICSAYF 
HNSARLKGVGEYVNCRTGMPCHLHPSSALYGLGYTPDYVVYHELILTTKEYMQCATSVEP 
HWLAELGPMFFSVKDSDTSMLEHKKKQKEEKSGMEEEMEKLRRDQVESELRSKERERKKR 
AKQQQQISGPGLKKGTTFLRPKKLGL*