>AT1G20960.1 |  emb1507 (embryo defective 1507) ATP binding / ATP-dependent helicase/ helicase/ nucleic acid binding / nucleoside-triphosphatase/ nucleotide binding 
MANLGGGAEAHARFKQYEYRANSSLVLTTDNRPRDTHEPTGEPETLWGKIDPRSFGDRVA 
KGRPQELEDKLKKSKKKERDVVDDMVNIRQSKRRRLREESVLTDTDDAVYQPKTKETRAA 
YEAMLGLIQKQLGGQPPSIVSGAADEILAVLKNDAFRNPEKKMEIEKLLNKIENHEFDQL 
VSIGKLITDFQEGGDSGGGRANDDEGLDDDLGVAVEFEENEEDDEESDPDMVEEDDDEED 
DEPTRTGGMQVDAGINDEDAGDANEGTNLNVQDIDAYWLQRKISQAYEQQIDPQQCQVLA 
EELLKILAEGDDRVVEDKLLMHLQYEKFSLVKFLLRNRLKVVWCTRLARAEDQEERNRIE 
EEMRGLGPELTAIVEQLHATRATAKEREENLQKSINEEARRLKDETGGDGGRGRRDVADR 
DSESGWVKGQRQMLDLESLAFDQGGLLMANKKCDLPPGSYRSHGKGYDEVHVPWVSKKVD 
RNEKLVKITEMPDWAQPAFKGMQQLNRVQSKVYDTALFKAENILLCAPTGAGKTNVAMLT 
ILQQLEMNRNTDGTYNHGDYKIVYVAPMKALVAEVVGNLSNRLKDYGVIVRELSGDQSLT 
GREIEETQIIVTTPEKWDIITRKSGDRTYTQLVRLLIIDEIHLLHDNRGPVLESIVARTL 
RQIETTKENIRLVGLSATLPNYEDVALFLRVDLKKGLFKFDRSYRPVPLHQQYIGISVKK 
PLQRFQLMNDLCYQKVLAGAGKHQVLIFVHSRKETSKTARAIRDTAMANDTLSRFLKEDS 
VTRDVLHSHEDIVKNSDLKDILPYGFAIHHAGLSRGDREIVETLFSQGHVQVLVSTATLA 
WGVNLPAHTVIIKGTQVYNPEKGAWMELSPLDVMQMLGRAGRPQYDQHGEGIIITGYSEL 
QYYLSLMNEQLPIESQFISKLADQLNAEIVLGTVQNAREACHWLGYTYLYIRMVRNPTLY 
GLAPDALAKDVVLEERRADLIHSAATILDKNNLVKYDRKSGYFQVTDLGRIASYYYITHG 
TIATYNEHLKPTMGDIDLYRLFSLSDEFKYVTVRQDEKMELAKLLDRVPIPIKETLEEPS 
AKINVLLQAYISQLKLEGLSLTSDMVYITQSAGRLVRALYEIVLKRGWAQLAEKALNLSK 
MVGKRMWSVQTPLRQFHGLSNDILMQLEKKDLVWERYYDLSAQELGELIRSPKMGKPLHK 
FIHQFPKVTLSAHVQPITRTVLNVELTVTPDFLWDEKIHKYVEPFWIIVEDNDGEKILHH 
EYFLLKKQYIDEDHTLHFTVPIFEPLPPQYFVRVVSDKWLGSETVLPVSFRHLILPEKYP 
PPTELLDLQPLPVTALRNPNYEILYQDFKHFNPVQTQVFTVLYNTNDNVLVAAPTGSGKT 
ICAEFAILRNHHEGPDATMRVVYIAPLEAIAKEQFRIWEGKFGKGLGLRVVELTGETALD 
LKLLEKGQIIISTPEKWDALSRRWKQRKYVQQVSLFIVDELHLIGGQHGPVLEVIVSRMR 
YISSQVINKIRIVALSTSLANAKDLGEWIGASSHGLFNFPPGVRPVPLEIHIQGVDISSF 
EARMQAMTKPTYTAIVQHAKNKKPAIVFVPTRKHVRLTAVDLMAYSHMDNPQSPDFLLGK 
LEELDPFVEQIREETLKETLCHGIGYLHEGLSSLDQEIVTQLFEAGRIQVCVMSSSLCWG 
TPLTAHLVVVMGTQYYDGRENSHSDYPVPDLLQMMGRASRPLLDNAGKCVIFCHAPRKEY 
YKKFLYEAFPVESQLQHFLHDNFNAEVVAGVIENKQDAVDYLTWTFMYRRLPQNPNYYNL 
QGVSHRHLSDHLSELVENTLSDLEASKCIEVEDEMELSPLNLGMIASYYYISYTTIERFS 
SLLSSKTKMKGLLEILTSASEYDMIPIRPGEEDTVRRLINHQRFSFENPKCTDPHVKANA 
LLQAHFSRQNIGGNLAMDQRDVLLSATRLLQAMVDVISSNGWLNLALLAMEVSQMVTQGM 
WERDSMLLQLPHFTKDLAKRCQENPGKNIETVFDLVEMEDEERQELLKMSDAQLLDIARF 
CNRFPNIDLTYEIVGSEEVNPGKEVTLQVMLERDMEGRTEVGPVDSLRYPKTKEEGWWLV 
VGDTKTNQLLAIKRVSLQRKVKVKLDFTAPSEPGEKSYTLYFMCDSYLGCDQEYSFSVDV 
KGSGAGDRMEE*
>AT3G07590.1 |  small nuclear ribonucleoprotein D1 putative / snRNP core protein D1 putative / Sm protein D1 putative 
MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDVSMNTHLKTVKMSLKGKNPVTLDHLSL 
RGNNIRYYILPDSLNLETLLVEDTPRVKPKKPVAGKAVGRGRGRGRGRGRGRGR*
>AT1G09760.1 |  U2A (U2 small nuclear ribonucleoprotein A) protein binding 
MVKLTADLIWKSPHFFNAIKERELDLRGNKIPVIENLGATEDQFDTIDLSDNEIVKLENF 
PYLNRLGTLLINNNRITRINPNLGEFLPKLHSLVLTNNRLVNLVEIDPLASIPKLQYLSL 
LDNNITKKANYRLYVIHKLKSLRVLDFIKIKAKERAEAASLFSSKEAEEEVKKVSREEVK 
KVSETAENPETPKVVAPTAEQILAIKAAIINSQTIEEIARLEQALKFGQVPAGLIIPDPA 
TNDSAPMEE*
>AT1G09770.1 |  ATCDC5 (ARABIDOPSIS THALIANA CELL DIVISION CYCLE 5) DNA binding / transcription factor 
MRIMIKGGVWKNTEDEILKAAVMKYGKNQWARISSLLVRKSAKQCKARWYEWLDPSIKKT 
EWTREEDEKLLHLAKLLPTQWRTIAPIVGRTPSQCLERYEKLLDAACTKDENYDAADDPR 
KLRPGEIDPNPEAKPARPDPVDMDEDEKEMLSEARARLANTRGKKAKRKAREKQLEEARR 
LASLQKRRELKAAGIDGRHRKRKRKGIDYNAEIPFEKRAPAGFYDTADEDRPADQVKFPT 
TIEELEGKRRADVEAHLRKQDVARNKIAQRQDAPAAILQANKLNDPEVVRKRSKLMLPPP 
QISDHELEEIAKMGYASDLLAENEELTEGSAATRALLANYSQTPRQGMTPMRTPQRTPAG 
KGDAIMMEAENLARLRDSQTPLLGGENPELHPSDFTGVTPRKKEIQTPNPMLTPSMTPGG 
AGLTPRIGLTPSRDGSSFSMTPKGTPFRDELHINEDMDMHESAKLERQRREEARRSLRSG 
LTGLPQPKNEYQIVAQPPPEESEEPEEKIEEDMSDRIAREKAEEEARQQALLKKRSKVLQ 
RDLPRPPAASLAVIRNSLLSADGDKSSVVPPTPIEVADKMVREELLQLLEHDNAKYPLDD 
KAEKKKGAKNRTNRSASQVLAIDDFDENELQEADKMIKEEGKFLCVSMGHENKTLDDFVE 
AHNTCVNDLMYFPTRSAYELSSVAGNADKVAAFQEEMENVRKKMEEDEKKAEHMKAKYKT 
YTKGHERRAETVWTQIEATLKQAEIGGTEVECFKALKRQEEMAASFRKKNLQEEVIKQKE 
TESKLQTRYGNMLAMVEKAEEIMVGFRAQALKKQEDVEDSHKLKEAKLATGEEEDIAIAM 
EASA*
>AT2G23930.1 |  SNRNP-G (PROBABLE SMALL NUCLEAR RIBONUCLEOPROTEIN G) 
MSRSGQPPDLKKYMDKKLQIKLNANRMVTGTLRGFDQFMNLVVDNTVEVNGNDKTDIGMV 
VIRGNSIVTVEALEPVGRSS*
>AT2G23930.1 |  SNRNP-G (PROBABLE SMALL NUCLEAR RIBONUCLEOPROTEIN G) 
MSRSGQPPDLKKYMDKKLQIKLNANRMVTGTLRGFDQFMNLVVDNTVEVNGNDKTDIGMV 
VIRGNSIVTVEALEPVGRSS*
>AT2G23930.2 |  SNRNP-G (PROBABLE SMALL NUCLEAR RIBONUCLEOPROTEIN G) 
MDKKLQIKLNANRMVTGTLRGFDQFMNLVVDNTVEVNGNDKTDIGMVVIRGNSIVTVEAL 
EPVGRSS*
>AT2G23930.2 |  SNRNP-G (PROBABLE SMALL NUCLEAR RIBONUCLEOPROTEIN G) 
MDKKLQIKLNANRMVTGTLRGFDQFMNLVVDNTVEVNGNDKTDIGMVVIRGNSIVTVEAL 
EPVGRSS*
>AT5G27720.1 |  emb1644 (embryo defective 1644) 
MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECY 
IRGNTIKYLRVPDEVIDKVQEEKTRTDRKPPGVGRGRGRGVDDGGARGRGRGTSMGKMGG 
NRGAGRGRG*
>AT2G47640.1 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM 
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.1 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM 
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.1 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM 
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.1 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM 
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.2 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV 
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.2 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV 
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.2 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV 
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.2 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV 
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.3 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV 
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.3 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV 
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.3 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV 
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.3 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV 
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.4 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM 
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.4 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM 
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.4 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM 
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.4 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM 
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT1G20580.1 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MSRSLGIPVKLLHEASGHIVTVELKSGELYRGSMIECEDNWNCQLEDITYTAKDGKVSQL 
EHVFIRGSKVRFMVIPDILKHAPMFKRLDARIKGKSSSLGVGRGRGAMRGKPAAGPGRGT 
GGRGAVPPVRR*
>AT1G28060.1 |  small nuclear ribonucleoprotein family protein / snRNP family protein 
MDKERYSRSHRDDRDRDSSPDHSPQREGGRRRDRDVDSKRRDSDHYRSSRRGDREDERDR 
TKDRRGRSVERGEREGSRDREKHHHERSHEGSKEKESRSKRKDREEENGARDGKKKSRFA 
DGNGERRSRFEDVAIEVENKDAQVSEGSGATNPTSGVTMGASTYSSIPSEASAAPSQTLL 
TKVSSISTTDENKASVVRSHEVPGKSSTDGRPLSTAGKSSANLPLDSSALAAKARKALQL 
QKGLADRLKNLPLLKKATKPTSEGSPHTRVPPSTTTPAVSTGTSFASTLPHTGLAGFGSI 
ANIEAVKRAQELAANMGFHQDREFAPVINLFPGQAPSDMTVAQRPEKPPVLRVDALGREI 
DEHGNVISVTKPSNLSTLKVNINKKKKDAFQILKPQLEADLKENPYFDTRMGIDEKKILR 
PKRMSFQFVEEGKWTRDAENLKFKSHFGEAKAKELKVKQAQLAKANDDINPNLIEVSERV 
PRKEKPKEPIPDVEWWDANVLTNGEYGEITDGTITESHLKIEKLTHYIEHPRPIEPPAEA 
APPPPQPLKLTKKEQKKLRTQRRLAKEKEKQEMIRQGLLEPPKAKVKMSNLMKVLGSEAT 
QDPTKLEKEIRTAAAEREQAHTDRNAARKLTPAEKREKKERKLFDDPTTVETIVSVYKIK 
KLSHPKTRFKVEMNARENRLTGCSVMTDEMSVVVVEGKSKAIKRYGKLMMKRINWEEAER 
KEGNEDEEEEVNGGNKCWLVWQGSIGKPSFHRFHVHECVTESTAKKVFMDAGVVHYWDLA 
VNYSDD*
>AT1G04510.1 |  transducin family protein / WD-40 repeat family protein 
MNCAISGEVPEEPVVSKKSGLLYEKRLIQTHISDYGKCPVTGEPHTLDDIVPIKTGKIVK 
PKPLHTASIPGLLGTFQTEWDSLMLSNFALEQQLHTARQELSHALYQHDAACRVIARLKK 
ERDESRQLLAEAERQLPAAPEVATSNAALSNGKRGIDDGEQGPNAKKMRLGISAEVITEL 
TDCNAALSQQRKKRQIPKTLASVDALEKFTQLSSHPLHKTNKPGIFSMDILHSKDVIATG 
GIDTTAVLFDRPSGQILSTLTGHSKKVTSIKFVGDTDLVLTASSDKTVRIWGCSEDGNYT 
SRHTLKDHSAEVRAVTVHATNKYFVSASLDSTWCFYDLSSGLCLAQVTDASENDVNYTAA 
AFHPDGLILGTGTAQSIVKIWDVKSQANVAKFGGHNGEITSISFSENGYFLATAALDGVR 
LWDLRKLKNFRTFDFPDANSVEFDHSGSYLGIAASDIRVFQAASVKAEWNPIKTLPDLSG 
TGKATSVKFGLDSKYIAVGSMDRNLRIFGLPDDDNTEDSAQDS*
>AT1G04510.1 |  transducin family protein / WD-40 repeat family protein 
MNCAISGEVPEEPVVSKKSGLLYEKRLIQTHISDYGKCPVTGEPHTLDDIVPIKTGKIVK 
PKPLHTASIPGLLGTFQTEWDSLMLSNFALEQQLHTARQELSHALYQHDAACRVIARLKK 
ERDESRQLLAEAERQLPAAPEVATSNAALSNGKRGIDDGEQGPNAKKMRLGISAEVITEL 
TDCNAALSQQRKKRQIPKTLASVDALEKFTQLSSHPLHKTNKPGIFSMDILHSKDVIATG 
GIDTTAVLFDRPSGQILSTLTGHSKKVTSIKFVGDTDLVLTASSDKTVRIWGCSEDGNYT 
SRHTLKDHSAEVRAVTVHATNKYFVSASLDSTWCFYDLSSGLCLAQVTDASENDVNYTAA 
AFHPDGLILGTGTAQSIVKIWDVKSQANVAKFGGHNGEITSISFSENGYFLATAALDGVR 
LWDLRKLKNFRTFDFPDANSVEFDHSGSYLGIAASDIRVFQAASVKAEWNPIKTLPDLSG 
TGKATSVKFGLDSKYIAVGSMDRNLRIFGLPDDDNTEDSAQDS*
>AT1G04510.2 |  transducin family protein / WD-40 repeat family protein 
MNCAISGEVPEEPVVSKKSGLLYEKRLIQTHISDYGKCPVTGEPHTLDDIVPIKTGKIVK 
PKPLHTASIPGLLGTFQTEWDSLMLSNFALEQQLHTARQELSHALYQHDAACRVIARLKK 
ERDESRQLLAEAERQLPAAPEVATSNAALSNGKRGIDDGEQGPNAKKMRLGISAEVITEL 
TDCNAALSQQRKKRQIPKTLASVDALEKFTQLSSHPLHKTNKPGIFSMDILHSKDVIATG 
GIDTTAVLFDRPSGQILSTLTGHSKKVTSIKFVGDTDLVLTASSDKTVRIWGCSEDGNYT 
SRHTLKDHSAEVRAVTVHATNKYFVSASLDSTWCFYDLSSGLCLAQVTDASENDVNYTAA 
AFHPDGLILGTGTAQSIVKIWDVKSQANVAKFGGHNGEITSISFSENGYFLATAALDGVR 
LWDLRKLKNFRTFDFPDANSVEFDHSGSYLGIAASDIRVFQAASVKAEWNPIKTLPDLSG 
TGKSTSVKFGLDSKYIAVGSMDRNLRIFGLPDDDNTEDSAQDS*
>AT1G04510.2 |  transducin family protein / WD-40 repeat family protein 
MNCAISGEVPEEPVVSKKSGLLYEKRLIQTHISDYGKCPVTGEPHTLDDIVPIKTGKIVK 
PKPLHTASIPGLLGTFQTEWDSLMLSNFALEQQLHTARQELSHALYQHDAACRVIARLKK 
ERDESRQLLAEAERQLPAAPEVATSNAALSNGKRGIDDGEQGPNAKKMRLGISAEVITEL 
TDCNAALSQQRKKRQIPKTLASVDALEKFTQLSSHPLHKTNKPGIFSMDILHSKDVIATG 
GIDTTAVLFDRPSGQILSTLTGHSKKVTSIKFVGDTDLVLTASSDKTVRIWGCSEDGNYT 
SRHTLKDHSAEVRAVTVHATNKYFVSASLDSTWCFYDLSSGLCLAQVTDASENDVNYTAA 
AFHPDGLILGTGTAQSIVKIWDVKSQANVAKFGGHNGEITSISFSENGYFLATAALDGVR 
LWDLRKLKNFRTFDFPDANSVEFDHSGSYLGIAASDIRVFQAASVKAEWNPIKTLPDLSG 
TGKSTSVKFGLDSKYIAVGSMDRNLRIFGLPDDDNTEDSAQDS*
>AT1G72560.1 |  PSD (PAUSED) nucleobase nucleoside nucleotide and nucleic acid transmembrane transporter/ tRNA binding 
MDDLEQAIVISFETGAVDSALKSQAVTYCQQIKETPSICSICIEKLWFSKLVQVQFWCLQ 
TLQDVLRVKYGSMSLDEQSYVRKSVFSMACLEVIDNENAGRVVEGPPFVKNKLAQVLATL 
IYYEYPLIWSSVFLDFMLHLCKGAVVIDMFCRVLNALDDELISLDYPRTPEEISVAARVK 
DAMRQQCVPQIARAWYDIVSMYKNSDPDLSATVLDCMRRFVSWIDIGLVANDAFVPLLFE 
LILSDGLSEQVRGAAAGCVLAMVSKRMDPQSKLPLLQTLQISRVFGLVSGDVDSDLVSKV 
SALLTGYAVEVLECHKRLNSEDTKAVSMDLLNEVLPSVFYVMQKCEVDSTFSIVQFLLGY 
VSTLKGLPALKEKQLLHITQILEVIRIQICYDPMYRNNLNSLDKTGLEEEDRMSEFRKDL 
FVLLRTVGRVAPEVTQHFIRNSLANAVESSSESNVEEVEAALSLLYSFGESMTEEAMKTG 
SGCLSELIPMLLTTQFPGHSHRLVALVYLENITRYMKFIQENSQYIPNVLGAFLDDRGLH 
HQNFYVSRRAGYLFMRVVKLLKSKLVPFIDKILQNLQDTLSQLTTMNFASRELTGTEDGS 
HIFEAIGIIIGLEDVPAEKQSDYLSLLLTPLCQQIEAGLVQAKVASSEDFPVKIANIQFA 
IVAINALSKGFNERLVTASRPGIGLMFKQTLDVLLRVLIEFPKVEPLRSKVTSFIHRMVD 
TLGSAVFPYLPKALEQLLADSEPKEMVGFMVLLNQLICKFNSALHDILEEVYPVVAVRIF 
NVIPRDGLPSRPGAVTEEMRELIELQRMLYTFLHVIATHDLSSVFLTPKSRAYLDPMMQL 
VLNTSCNHKDITVRKACVQIFIKLIKDWCAEPYSEEKVPGFQNFVIEAFATNCCLYSVLD 
KSFNFSDANTHALFGEIITAQKVMYEKFGNTFLMHLMSKSFPSAHIPQDLAEQYCQKLQG 
NDIRSLKSYYQSLIENLRLQQNGSHVFR*
>AT1G72560.1 |  PSD (PAUSED) nucleobase nucleoside nucleotide and nucleic acid transmembrane transporter/ tRNA binding 
MDDLEQAIVISFETGAVDSALKSQAVTYCQQIKETPSICSICIEKLWFSKLVQVQFWCLQ 
TLQDVLRVKYGSMSLDEQSYVRKSVFSMACLEVIDNENAGRVVEGPPFVKNKLAQVLATL 
IYYEYPLIWSSVFLDFMLHLCKGAVVIDMFCRVLNALDDELISLDYPRTPEEISVAARVK 
DAMRQQCVPQIARAWYDIVSMYKNSDPDLSATVLDCMRRFVSWIDIGLVANDAFVPLLFE 
LILSDGLSEQVRGAAAGCVLAMVSKRMDPQSKLPLLQTLQISRVFGLVSGDVDSDLVSKV 
SALLTGYAVEVLECHKRLNSEDTKAVSMDLLNEVLPSVFYVMQKCEVDSTFSIVQFLLGY 
VSTLKGLPALKEKQLLHITQILEVIRIQICYDPMYRNNLNSLDKTGLEEEDRMSEFRKDL 
FVLLRTVGRVAPEVTQHFIRNSLANAVESSSESNVEEVEAALSLLYSFGESMTEEAMKTG 
SGCLSELIPMLLTTQFPGHSHRLVALVYLENITRYMKFIQENSQYIPNVLGAFLDDRGLH 
HQNFYVSRRAGYLFMRVVKLLKSKLVPFIDKILQNLQDTLSQLTTMNFASRELTGTEDGS 
HIFEAIGIIIGLEDVPAEKQSDYLSLLLTPLCQQIEAGLVQAKVASSEDFPVKIANIQFA 
IVAINALSKGFNERLVTASRPGIGLMFKQTLDVLLRVLIEFPKVEPLRSKVTSFIHRMVD 
TLGSAVFPYLPKALEQLLADSEPKEMVGFMVLLNQLICKFNSALHDILEEVYPVVAVRIF 
NVIPRDGLPSRPGAVTEEMRELIELQRMLYTFLHVIATHDLSSVFLTPKSRAYLDPMMQL 
VLNTSCNHKDITVRKACVQIFIKLIKDWCAEPYSEEKVPGFQNFVIEAFATNCCLYSVLD 
KSFNFSDANTHALFGEIITAQKVMYEKFGNTFLMHLMSKSFPSAHIPQDLAEQYCQKLQG 
NDIRSLKSYYQSLIENLRLQQNGSHVFR*
>AT1G72560.2 |  PSD (PAUSED) nucleobase nucleoside nucleotide and nucleic acid transmembrane transporter/ tRNA binding 
MDDLEQAIVISFETGAVDSALKSQAVTYCQQIKETPSICSICIEKLWFSKLVQVQFWCLQ 
TLQDVLRVKYGSMSLDEQSYVRKSVFSMACLEVIDNENAGRVVEGPPFVKNKLAQVLATL 
IYYEYPLIWSSVFLDFMLHLCKGAVVIDMFCRVLNALDDELISLDYPRTPEEISVAARVK 
DAMRQQCVPQIARAWYDIVSMYKNSDPDLSATVLDCMRRFVSWIDIGLVANDAFVPLLFE 
LILSDGLSEQVRGAAAGCVLAMVSKRMDPQSKLPLLQTLQISRVFGLVSGDVDSDLVSKV 
SALLTGYAVEVLECHKRLNSEDTKAVSMDLLNEVLPSVFYVMQKCEVDSTFSIVQFLLGY 
VSTLKGLPALKEKQLLHITQILEVIRIQICYDPMYRNNLNSLDKTGLEEEDRMSEFRKDL 
FVLLRTVGRVAPEVTQHFIRNSLANAVESSSESNVEEVEAALSLLYSFGESMTEEAMKTG 
SGCLSELIPMLLTTQFPGHSHRLVALVYLENITRYMKFIQENSQYIPNVLGAFLDDRGLH 
HQNFYVSRRAGYLFMRVVKLLKSKLVPFIDKILQNLQDTLSQLTTMNFASRELTGTEDGS 
HIFEAIGIIIGLEDVPAEKQSDYLSLLLTPLCQQIEAGLVQAKVASSEDFPVKIANIQFA 
IVAINALSKGFNERLVTASRPGIGLMFKQTLDVLLRVLIEFPKVEPLRSKVTSFIHRMVD 
TLGSAVFPYLPKALEQLLADSEPKEMVGFMVLLNQLICKFNSALHDILEEVYPVVAVRIF 
NVIPRDGLPSRPGAVTEEMRELIELQRMLYTFLHVIATHDLSSVFLTPKSRAYLDPMMQL 
VLNTSCNHKDITVRKACVQIFIKLIKDWCAEPYSEEKVPGFQNFVIEAFATNCCLYSVLD 
KSFNFSDANTHALFGEIITAQKVMYEKFGNTFLMHLMSKSFPSAHIPQDLAEQYCQKLQG 
NDIRSLKSYYQSLIENLRLQQNGSHVFR*
>AT1G72560.2 |  PSD (PAUSED) nucleobase nucleoside nucleotide and nucleic acid transmembrane transporter/ tRNA binding 
MDDLEQAIVISFETGAVDSALKSQAVTYCQQIKETPSICSICIEKLWFSKLVQVQFWCLQ 
TLQDVLRVKYGSMSLDEQSYVRKSVFSMACLEVIDNENAGRVVEGPPFVKNKLAQVLATL 
IYYEYPLIWSSVFLDFMLHLCKGAVVIDMFCRVLNALDDELISLDYPRTPEEISVAARVK 
DAMRQQCVPQIARAWYDIVSMYKNSDPDLSATVLDCMRRFVSWIDIGLVANDAFVPLLFE 
LILSDGLSEQVRGAAAGCVLAMVSKRMDPQSKLPLLQTLQISRVFGLVSGDVDSDLVSKV 
SALLTGYAVEVLECHKRLNSEDTKAVSMDLLNEVLPSVFYVMQKCEVDSTFSIVQFLLGY 
VSTLKGLPALKEKQLLHITQILEVIRIQICYDPMYRNNLNSLDKTGLEEEDRMSEFRKDL 
FVLLRTVGRVAPEVTQHFIRNSLANAVESSSESNVEEVEAALSLLYSFGESMTEEAMKTG 
SGCLSELIPMLLTTQFPGHSHRLVALVYLENITRYMKFIQENSQYIPNVLGAFLDDRGLH 
HQNFYVSRRAGYLFMRVVKLLKSKLVPFIDKILQNLQDTLSQLTTMNFASRELTGTEDGS 
HIFEAIGIIIGLEDVPAEKQSDYLSLLLTPLCQQIEAGLVQAKVASSEDFPVKIANIQFA 
IVAINALSKGFNERLVTASRPGIGLMFKQTLDVLLRVLIEFPKVEPLRSKVTSFIHRMVD 
TLGSAVFPYLPKALEQLLADSEPKEMVGFMVLLNQLICKFNSALHDILEEVYPVVAVRIF 
NVIPRDGLPSRPGAVTEEMRELIELQRMLYTFLHVIATHDLSSVFLTPKSRAYLDPMMQL 
VLNTSCNHKDITVRKACVQIFIKLIKDWCAEPYSEEKVPGFQNFVIEAFATNCCLYSVLD 
KSFNFSDANTHALFGEIITAQKVMYEKFGNTFLMHLMSKSFPSAHIPQDLAEQYCQKLQG 
NDIRSLKSYYQSLIENLRLQQNGSHVFR*
>AT1G03330.1 |  small nuclear ribonucleoprotein D putative / snRNP core SM-like protein putative / U6 snRNA-associated Sm-like protein putative 
MLFFSYFKDLVGQEVTVELKNDLAIRGTLHSVDQYLNIKLENTRVVDQDKYPHMLSVRNC 
FIRGSVVRYVQLPKDGVDVDLLHDAARREARGG*
>AT2G18740.1 |  small nuclear ribonucleoprotein E putative / snRNP-E putative / Sm protein E putative 
MASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRITGFDEYMNLVLDEAEEV 
SIKKNTRKPLGRILLKGDNITLMMNTGK*
>AT2G18740.1 |  small nuclear ribonucleoprotein E putative / snRNP-E putative / Sm protein E putative 
MASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRITGFDEYMNLVLDEAEEV 
SIKKNTRKPLGRILLKGDNITLMMNTGK*
>AT2G18740.2 |  small nuclear ribonucleoprotein E putative / snRNP-E putative / Sm protein E putative 
MASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRITGFDEYMNLVLDEAEEV 
SIKKNTRKPLGFYSKETT*
>AT2G18740.2 |  small nuclear ribonucleoprotein E putative / snRNP-E putative / Sm protein E putative 
MASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRITGFDEYMNLVLDEAEEV 
SIKKNTRKPLGFYSKETT*
>AT4G30220.1 |  RUXF (SMALL NUCLEAR RIBONUCLEOPROTEIN F) 
MATIPVNPKPFLNNLTGKTVIVKLKWGMEYKGFLASVDSYMNLQLGNTEEYIDGQLTGNL 
GEILIRCNNVLYVRGVPEDEELEDADQD*
>AT1G65700.1 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MAATTGLETLVDQIISVITNDGRNIVGVLKGFDQATNIILDESHERVFSTKEGVQQHVLG 
LYIIRGDNIGVIGELDEELDASLDFSKLRAHPLKPVVH*
>AT1G65700.1 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MAATTGLETLVDQIISVITNDGRNIVGVLKGFDQATNIILDESHERVFSTKEGVQQHVLG 
LYIIRGDNIGVIGELDEELDASLDFSKLRAHPLKPVVH*
>AT1G65700.2 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MAATTGLETLVDQIISVITNDGRNIVGVLKGFDQATNIILDESHERVFSTKEGVQQHVLG 
LYIIRGDNIGVIGELDEELDASLDFSKLRAHPLKPVVH*
>AT1G65700.2 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MAATTGLETLVDQIISVITNDGRNIVGVLKGFDQATNIILDESHERVFSTKEGVQQHVLG 
LYIIRGDNIGVIGELDEELDASLDFSKLRAHPLKPVVH*
>AT1G80070.1 |  SUS2 (ABNORMAL SUSPENSOR 2) 
MWNNNDGMPLAPPGTGGSMMPPPPAAHPSYTALPPPSNPTPPVEPTPEEAEAKLEEKARK 
WMQLNSKRYGDKRKFGFVETQKEDMPPEHVRKIIRLVFFSSFSTISKYSLLDNYFLARDH 
GDMSSKKFRHDKRVYLGALKFVPHAVFKLLENMPMPWEQVRDVKVLYHITGAITFVNEIP 
WVVEPIYMAQWGTMWIMMRREKRDRRHFKRMRFPPFDDEEPPLDYADNLLDVDPLEPIQL 
ELDEEEDSAVHTWFYDHKPLVKTKLINGPSYRRWNLSLPIMATLHRLAGQLLSDLIDRNY 
FYLFDMPSFFTAKALNMCIPGGPKFEPLYRDMEKGDEDWNEFNDINKLIIRSPLRTEYRI 
AFPHLYNNRPRKVKLCVYHSPMIMYIKTEDPDLPAFYYDPLIHPISNTNKEKRERKVYDD 
EDDFALPEGVEPLLRDTQLYTDTTAAGISLLFAPRPFNMRSGRTRRAEDIPLVSEWFKEH 
CPPAYPVKVRVSYQKLLKCYVLNELHHRPPKAQKKKHLFRSLAATKFFQSTELDWVEVGL 
QVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVKTLTTKERKKSRFGNAFHLCREILRLT 
KLVVDANVQFRLGNVDAFQLADGLQYIFSHVGQLTGMYRYKYRLMRQIRMCKDLKHLIYY 
RFNTGPVGKGPGCGFWAPMWRVWLFFLRGIVPLLERWLGNLLARQFEGRHSKGVAKTVTK 
QRVESHFDLELRAAVMHDVLDAMPEGIKQNKARTILQHLSEAWRCWKANIPWKVPGLPVP 
IENMILRYVKSKADWWTNVAHYNRERIRRGATVDKTVCRKNLGRLTRLWLKAEQERQHNY 
LKDGPYVTPEEALAIYTTTVHWLESRKFSPIPFPPLSYKHDTKLLILALERLKESYSVAV 
RLNQQQREELGLIEQAYDNPHEALSRIKRHLLTQRGFKEVGIEFMDLYSYLIPVYEIEPL 
EKITDAYLDQYLWYEGDKRHLFPNWIKPADSEPPPLLVYKWCQGINNLQGIWDTGDGQCV 
VMLQTKFEKFFEKIDLTMLNRLLRLVLDHNIADYVSAKNNVVLSYKDMSHTNSYGLIRGL 
QFASFVVQFYGLLLDLLLLGLTRASEIAGPPQMPNEFMTFWDTKVETRHPIRLYSRYIDK 
VHIMFKFTHEEARDLIQRYLTEHPDPNNENMVGYNNKKCWPRDARMRLMKHDVNLGRSVF 
WDMKNRLPRSITTLEWENGFVSVYSKDNPNLLFSMCGFEVRILPKIRMTQEAFSNTKDGV 
WNLQNEQTKERTAVAFLRVDDEHMKVFENRVRQILMSSGSTTFTKIVNKWNTALIGLMTY 
FREATVHTQELLDLLVKCENKIQTRIKIGLNSKMPSRFPPVIFYTPKEIGGLGMLSMGHI 
LIPQSDLRYSKQTDVGVTHFRSGMSHEEDQLIPNLYRYIQPWESEFIDSQRVWAEYALKR 
QEAQAQNRRLTLEDLEDSWDRGIPRINTLFQKDRHTLAYDKGWRVRTDFKQYQVLKQNPF 
WWTHQRHDGKLWNLNNYRTDVIQALGGVEGILEHTLFKGTYFPTWEGLFWEKASGFEESM 
KYKKLTNAQRSGLNQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGIFMHGKIPTLKISL 
IQIFRAHLWQKIHESVVMDLCQVLDQELDALEIETVQKETIHPRKSYKMNSSCADVLLFA 
AHKWPMSKPSLVAESKDMFDQKASNKYWIDVQLRWGDYDSHDIERYTRAKFMDYTTDNMS 
IYPSPTGVMIGLDLAYNLHSAFGNWFPGSKPLLAQAMNKIMKSNPALYVLRERIRKGLQL 
YSSEPTEPYLSSQNYGEIFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPR 
TGQLFLKVIHTSVWAGQKRLGQLAKWKTAEEVAALVRSLPVEEQPKQIIVTRKGMLDPLE 
VHLLDFPNIVIKGSELQLPFQACLKIEKFGDLILKATEPQMVLFNIYDDWLKSISSYTAF 
SRLILILRALHVNNEKAKMLLKPDKSVVTEPHHIWPSLTDDQWMKVEVALRDLILSDYAK 
KNNVNTSALTQSEIRDIILGAEITPPSQQRQQIAEIEKQAKEASQLTAVTTRTTNVHGDE 
LIVTTTSPYEQSAFGSKTDWRVRAISATNLYLRVNHIYVNSDDIKETGYTYIMPKNILKK 
FICVADLRTQIAGYLYGISPPDNPQVKEIRCVVMVPQWGNHQLVHLPSSLPEHDFLNDLE 
PLGWLHTQPNELPQLSPQDVTSHSRILENNKQWDGEKCIILTCSFTPGSCSLTSYKLTQT 
GYEWGRLNKDNGSNPHGYLPTHYEKVQMLLSDRFLGFYMVPESGPWNYSFTGVKHTLSMK 
YSVKLGSPKEFYHEEHRPTHFLEFSNMEEADITEGDREDTFT*
>AT4G31120.1 |  SKB1 (SHK1 BINDING PROTEIN 1) protein methyltransferase 
MPLGERGGWERTESRYCGVETDFSNDVTHLLNFNISTGGFDYVLAPLVDPSYRPSLVEGN 
GVDTQVLPVCGSDLVLSPSQWSSHVVGKISSWIDLDSEDEVLRMDSETTLKQEIAWATHL 
SLQACLLPTPKGKSCANYARCVNQILQGLTTLQLWLRVPLVKSEGDSMDDTSEGLNDSWE 
LWNSFRLLCEHDSKLSVALDVLSTLPSETSLGRWMGESVRAAILSTDAFLTNARGYPCLS 
KRHQKLIAGFFDHAAQVVICGKPVHNLQKPLDSSSEGTEKNPLRIYLDYVAYLFQKMESL 
SEQERIELGYRDFLQAPLQPLMDNLEAQTYETFERDSVKYIQYQRAVEKALVDRVPDEKA 
SELTTVLMVVGAGRGPLVRASLQAAEETDRKLKVYAVEKNPNAVVTLHNLVKMEGWEDVV 
TIISCDMRFWNAPEQADILVSELLGSFGDNELSPECLDGAQRFLKPDGISIPSSYTSFIQ 
PITASKLYNDVKAHKDLAHFETAYVVKLHSVAKLAPSQSVFTFTHPNFSTKVNNQRYKKL 
QFSLPSDAGSALVHGFAGYFDSVLYKDVHLGIEPTTATPNMFSWFPIFFPLRKPVEVHPD 
TPLEVHFWRCCGSSKVWYEWSVSSPTPSPMHNTNGRSYWVGL*
>AT4G31120.1 |  SKB1 (SHK1 BINDING PROTEIN 1) protein methyltransferase 
MPLGERGGWERTESRYCGVETDFSNDVTHLLNFNISTGGFDYVLAPLVDPSYRPSLVEGN 
GVDTQVLPVCGSDLVLSPSQWSSHVVGKISSWIDLDSEDEVLRMDSETTLKQEIAWATHL 
SLQACLLPTPKGKSCANYARCVNQILQGLTTLQLWLRVPLVKSEGDSMDDTSEGLNDSWE 
LWNSFRLLCEHDSKLSVALDVLSTLPSETSLGRWMGESVRAAILSTDAFLTNARGYPCLS 
KRHQKLIAGFFDHAAQVVICGKPVHNLQKPLDSSSEGTEKNPLRIYLDYVAYLFQKMESL 
SEQERIELGYRDFLQAPLQPLMDNLEAQTYETFERDSVKYIQYQRAVEKALVDRVPDEKA 
SELTTVLMVVGAGRGPLVRASLQAAEETDRKLKVYAVEKNPNAVVTLHNLVKMEGWEDVV 
TIISCDMRFWNAPEQADILVSELLGSFGDNELSPECLDGAQRFLKPDGISIPSSYTSFIQ 
PITASKLYNDVKAHKDLAHFETAYVVKLHSVAKLAPSQSVFTFTHPNFSTKVNNQRYKKL 
QFSLPSDAGSALVHGFAGYFDSVLYKDVHLGIEPTTATPNMFSWFPIFFPLRKPVEVHPD 
TPLEVHFWRCCGSSKVWYEWSVSSPTPSPMHNTNGRSYWVGL*
>AT4G31120.2 |  SKB1 (SHK1 BINDING PROTEIN 1) protein methyltransferase 
MPLGERGGWERTESRYCGVETDFSNDVTHLLNFNISTGGFDYVLAPLVDPSYRPSLVEGN 
GVDTQVLPVCGSDLVLSPSQWSSHVVGKISSWIDLDSEDEVLRMDSETTLKQEIAWATHL 
SLQACLLPTPKGKSCANYARCVNQILQGLTTLQLWLRVPLVKSEGDSMDDTSEGLNDSWE 
LWNSFRLLCEHDSKLSVALDVLSTLPSETSLGRWMGESVRAAILSTDAFLTNARGYPCLS 
KRHQKLIAGFFDHAAQVVICGKPVHNLQKPLDSSSEGTEKNPLRIYLDYVAYLFQKMESL 
SEQERIELGYRDFLQAPLQPLMDNLEAQTYETFERDSVKYIQYQRAVEKALVDRVPDEKA 
SELTTVLMVVGAGRGPLVRASLQAAEETDRKLKVYAVEKNPNAVVTLHNLVKMEGWEDVV 
TIISCDMRFWNAPEQADILVSELLGSFGDNELSPECLDGAQRFLKPDGISIPSSYTSFIQ 
PITASKLYNDVKAHKDLAHFETAYVVKLHSVAKLAPSQSVFTFTHPNFSTKVNNQRYKKL 
QFSLPSDAGSALVHGFAGYFDSVLYKDVHLGIEPTTATPNMFSW*
>AT4G31120.2 |  SKB1 (SHK1 BINDING PROTEIN 1) protein methyltransferase 
MPLGERGGWERTESRYCGVETDFSNDVTHLLNFNISTGGFDYVLAPLVDPSYRPSLVEGN 
GVDTQVLPVCGSDLVLSPSQWSSHVVGKISSWIDLDSEDEVLRMDSETTLKQEIAWATHL 
SLQACLLPTPKGKSCANYARCVNQILQGLTTLQLWLRVPLVKSEGDSMDDTSEGLNDSWE 
LWNSFRLLCEHDSKLSVALDVLSTLPSETSLGRWMGESVRAAILSTDAFLTNARGYPCLS 
KRHQKLIAGFFDHAAQVVICGKPVHNLQKPLDSSSEGTEKNPLRIYLDYVAYLFQKMESL 
SEQERIELGYRDFLQAPLQPLMDNLEAQTYETFERDSVKYIQYQRAVEKALVDRVPDEKA 
SELTTVLMVVGAGRGPLVRASLQAAEETDRKLKVYAVEKNPNAVVTLHNLVKMEGWEDVV 
TIISCDMRFWNAPEQADILVSELLGSFGDNELSPECLDGAQRFLKPDGISIPSSYTSFIQ 
PITASKLYNDVKAHKDLAHFETAYVVKLHSVAKLAPSQSVFTFTHPNFSTKVNNQRYKKL 
QFSLPSDAGSALVHGFAGYFDSVLYKDVHLGIEPTTATPNMFSW*
>AT4G21660.1 |  proline-rich spliceosome-associated (PSP) family protein 
MTADSTVALVHSVVSNGDVSNGNTSASSKKSREIDRRRRRRKQKKNNKASQADVDASDVS 
AASESKENTDPQPQVCEQIVIEYVPEQAEFEDGFNDEFKEIFEKFNFREPLASEEDGTKD 
ESEEKEDVKKKVNSDSDSDDDEQDNQNKEKGISNKKKKLQRRMKIAELKQVSARPDVVEV 
WDATSADPKLLVFLKSYRNTVPVPRHWSQKRKYLQGKRGIEKQPFHLPDFIAATGIEKIR 
QAYIEKEDGKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLSALGDLYFEGKEF 
EVKLRETKPGFLSNDLKEALGMPEGAPPPWLINMQRYGPPPSYPHLKIPGLNAPIPIGAS 
FGFHAGGWGKPPVDEYGRPLYGDVFGVQQQDQPNYEEEPIDKSKHWGDLEEEEEEEEEEE 
EEQEEEMDEEELEDGTESVDTLSSTPTGIETPDAIELRKDQRKEPDRALYQVLEEKGESV 
APGTLLGTSHTYVIKTGTQEKTGAKRVDLLRGQKTDRVDVSLQPEELDAMENVLPAKYEE 
AREEEKLRNKPVDLSDMVVEHVQQNSRKRKMHDKEGKKKKDFKF*
>AT1G44910.1 |  protein binding 
MANNPPQSSGTQFRPMVPGQQGQHFVPAASQPFHPYGHVPPNVQSQPPQYSQPIQQQQLF 
PVRPGQPVHITSSSQAVSVPYIQTNKILTSGSTQPQPNAPPMTGFATSGPPFSSPYTFVP 
SSYPQQQPTSLVQPNSQMHVAGVPPAANTWPVPVNQSTSLVSPVQQTGQQTPVAVSTDPG 
NLTPQSASDWQEHTSADGRKYYYNKRTKQSNWEKPLELMTPLERADASTVWKEFTTPEGK 
KYYYNKVTKESKWTIPEDLKLAREQAQLASEKTSLSEAGSTPLSHHAASSSDLAVSTVTS 
VVPSTSSALTGHSSSPIQAGLAVPVTRPPSVAPVTPTSGAISDTEATTIKGDNLSSRGAD 
DSNDGATAQNNEAENKEMSVNGKANLSPAGDKANVEEPMVYATKQEAKAAFKSLLESVNV 
HSDWTWEQTLKEIVHDKRYGALRTLGERKQAFNEYLGQRKKVEAEERRRRQKKAREEFVK 
MLEECEELSSSLKWSKAMSLFENDQRFKAVDRPRDREDLFDNYIVELERKEREKAAEEHR 
QYMADYRKFLETCDYIKAGTQWRKIQDRLEDDDRCSCLEKIDRLIGFEEYILDLEKEEEE 
LKRVEKEHVRRAERKNRDAFRTLLEEHVAAGILTAKTYWLDYCIELKDLPQYQAVASNTS 
GSTPKDLFEDVTEELEKQYHEDKSYVKDAMKSRKISMVSSWLFEDFKSAISEDLSTQQIS 
DINLKLIYDDLVGRVKEKEEKEARKLQRLAEEFTNLLHTFKEITVASNWEDSKQLVEESQ 
EYRSIGDESVSQGLFEEYITSLQEKAKEKERKRDEEKVRKEKERDEKEKRKDKDKERREK 
EREREKEKGKERSKREESDGETAMDVSEGHKDEKRKGKDRDRKHRRRHHNNSDEDVSSDR 
DDRDESKKSSRKHGNDRKKSRKHANSPESESENRHKRQKKESSRRSGNDELEDGEVGE*
>AT1G44910.1 |  protein binding 
MANNPPQSSGTQFRPMVPGQQGQHFVPAASQPFHPYGHVPPNVQSQPPQYSQPIQQQQLF 
PVRPGQPVHITSSSQAVSVPYIQTNKILTSGSTQPQPNAPPMTGFATSGPPFSSPYTFVP 
SSYPQQQPTSLVQPNSQMHVAGVPPAANTWPVPVNQSTSLVSPVQQTGQQTPVAVSTDPG 
NLTPQSASDWQEHTSADGRKYYYNKRTKQSNWEKPLELMTPLERADASTVWKEFTTPEGK 
KYYYNKVTKESKWTIPEDLKLAREQAQLASEKTSLSEAGSTPLSHHAASSSDLAVSTVTS 
VVPSTSSALTGHSSSPIQAGLAVPVTRPPSVAPVTPTSGAISDTEATTIKGDNLSSRGAD 
DSNDGATAQNNEAENKEMSVNGKANLSPAGDKANVEEPMVYATKQEAKAAFKSLLESVNV 
HSDWTWEQTLKEIVHDKRYGALRTLGERKQAFNEYLGQRKKVEAEERRRRQKKAREEFVK 
MLEECEELSSSLKWSKAMSLFENDQRFKAVDRPRDREDLFDNYIVELERKEREKAAEEHR 
QYMADYRKFLETCDYIKAGTQWRKIQDRLEDDDRCSCLEKIDRLIGFEEYILDLEKEEEE 
LKRVEKEHVRRAERKNRDAFRTLLEEHVAAGILTAKTYWLDYCIELKDLPQYQAVASNTS 
GSTPKDLFEDVTEELEKQYHEDKSYVKDAMKSRKISMVSSWLFEDFKSAISEDLSTQQIS 
DINLKLIYDDLVGRVKEKEEKEARKLQRLAEEFTNLLHTFKEITVASNWEDSKQLVEESQ 
EYRSIGDESVSQGLFEEYITSLQEKAKEKERKRDEEKVRKEKERDEKEKRKDKDKERREK 
EREREKEKGKERSKREESDGETAMDVSEGHKDEKRKGKDRDRKHRRRHHNNSDEDVSSDR 
DDRDESKKSSRKHGNDRKKSRKHANSPESESENRHKRQKKESSRRSGNDELEDGEVGE*
>AT1G44910.2 |  protein binding 
MANNPPQSSGTQFRPMVPGQQGQHFVPAASQPFHPYGHVPPNVQSQPPQYSQPIQQQQLF 
PVRPGQPVHITSSSQAVSVPYIQTNKILTSGSTQPQPNAPPMTGFATSGPPFSSPYTFVP 
SSYPQQQPTSLVQPNSQMHVAGVPPAANTWPVPVNQSTSLVSPVQQTGQQTPVAVSTDPG 
NLTPQSASDWQEHTSADGRKYYYNKRTKQSNWEKPLELMTPLERADASTVWKEFTTPEGK 
KYYYNKVTKESKWTIPEDLKLAREQAQLASEKTSLSEAGSTPLSHHAASSSDLAVSTVTS 
VVPSTSSALTGHSSSPIQAGLAVPVTRPPSVAPVTPTSGAISDTEATTIKGDNLSSRGAD 
DSNDGATAQNNEAENKEMSVNGKANLSPAGDKANVEEPMVYATKQEAKAAFKSLLESVNV 
HSDWTWEQTLKEIVHDKRYGALRTLGERKQAFNEYLGQRKKVEAEERRRRQKKAREEFVK 
MLEECEELSSSLKWSKAMSLFENDQRFKAVDRPRDREDLFDNYIVELERKEREKAAEEHR 
QYMADYRKFLETCDYIKAGTQWRKIQDRLEDDDRCSCLEKIDRLIGFEEYILDLEKEEEE 
LKRVEKEHVRRAERKNRDAFRTLLEEHVAAGILTAKTYWLDYCIELKDLPQYQAVASNTS 
GSTPKDLFEDVTEELEKQYHEDKSYVKDAMKSRKISMVSSWLFEDFKSAISEDLSTQQIS 
DINLKLIYDDLVGRVKEKEEKEARKLQRLAEEFTNLLHTFKEITVASNWEDSKQLVEESQ 
EYRSIGDESVSQGLFEEYITSLQEKAKEKERKRDEEKVRKEKERDEKEKRKDKDKERREK 
EREREKEKGKERSKREESDGETAMDVSEGHKDEKRKGKDRDRKHRRRHHNNSDEDVSSDR 
DDRDESKKSSRKHGNDRKKSRKVGTP*
>AT1G44910.2 |  protein binding 
MANNPPQSSGTQFRPMVPGQQGQHFVPAASQPFHPYGHVPPNVQSQPPQYSQPIQQQQLF 
PVRPGQPVHITSSSQAVSVPYIQTNKILTSGSTQPQPNAPPMTGFATSGPPFSSPYTFVP 
SSYPQQQPTSLVQPNSQMHVAGVPPAANTWPVPVNQSTSLVSPVQQTGQQTPVAVSTDPG 
NLTPQSASDWQEHTSADGRKYYYNKRTKQSNWEKPLELMTPLERADASTVWKEFTTPEGK 
KYYYNKVTKESKWTIPEDLKLAREQAQLASEKTSLSEAGSTPLSHHAASSSDLAVSTVTS 
VVPSTSSALTGHSSSPIQAGLAVPVTRPPSVAPVTPTSGAISDTEATTIKGDNLSSRGAD 
DSNDGATAQNNEAENKEMSVNGKANLSPAGDKANVEEPMVYATKQEAKAAFKSLLESVNV 
HSDWTWEQTLKEIVHDKRYGALRTLGERKQAFNEYLGQRKKVEAEERRRRQKKAREEFVK 
MLEECEELSSSLKWSKAMSLFENDQRFKAVDRPRDREDLFDNYIVELERKEREKAAEEHR 
QYMADYRKFLETCDYIKAGTQWRKIQDRLEDDDRCSCLEKIDRLIGFEEYILDLEKEEEE 
LKRVEKEHVRRAERKNRDAFRTLLEEHVAAGILTAKTYWLDYCIELKDLPQYQAVASNTS 
GSTPKDLFEDVTEELEKQYHEDKSYVKDAMKSRKISMVSSWLFEDFKSAISEDLSTQQIS 
DINLKLIYDDLVGRVKEKEEKEARKLQRLAEEFTNLLHTFKEITVASNWEDSKQLVEESQ 
EYRSIGDESVSQGLFEEYITSLQEKAKEKERKRDEEKVRKEKERDEKEKRKDKDKERREK 
EREREKEKGKERSKREESDGETAMDVSEGHKDEKRKGKDRDRKHRRRHHNNSDEDVSSDR 
DDRDESKKSSRKHGNDRKKSRKVGTP*
>AT3G03340.1 |  UNE6 (unfertilized embryo sac 6) 
MDAIRKQLDVLMGANRNGDVQEVNRKYYDRDVCRLYLSGLCPHDLFQLTKMDMGPCPKVH 
SLQLRKEYREARAKGVDNYDRELEDAIDRLIVECDRKIGRALKRLQEEDAKAAIAISVSE 
VTQSPEILELSEKIKEKMKEADIHDLEGKMDLKIRALELVEEMRTKRADQQAVLLLEAFN 
KDRASLPQPVPAQPPSSELPPPDPRTQEMINEKLKKAEDLGEQGMVDEAQKALEEAEALK 
KLTVRREPPADSTKYTAVDVRITDQKLRLCDICGAFLSVYDSDRRLADHFGGKLHLGYML 
VRDKLTELLDEKANIRKERSKERNSKERESSKDREKEQETSREHRRDYDRRSRDRDRHHD 
RDREQDRDYDRSHSRSRRRSRSRSRSRDRPRDYDRHRRHNRY*
>AT2G32600.1 |  hydroxyproline-rich glycoprotein family protein 
MDREWGSKPGSGGAASGQNEAIDRRERLRRLALETIDLAKDPYFMRNHLGSYECKLCLTL 
HNNEGNYLAHTQGKRHQTNLAKRAAREAKDAPTKPQPLKRNVSVRRTVKIGRPGYRVTKQ 
YDPELQQRSLLFQIEYPEIEDNIKPRHRFMSSYEQKVQPYDKSYQYLLFAAEPYEIIAFK 
VPSTEVDKSTPKFFSHWDPDSKMFTLQVYFKPTKPEPNKPQSAVGANGLPPPPPPPPHQA 
QPPPPPPSGLFPPPPPPMANNGFRPMPPAGGFGHPNM*
>AT2G32160.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 325 Blast hits to 315 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 70 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV 
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG 
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL 
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET 
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKKQ*
>AT2G32160.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 327 Blast hits to 317 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV 
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG 
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL 
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET 
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKKQ*
>AT2G32160.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 328 Blast hits to 321 proteins in 142 species Archae - 0 Bacteria - 0 Metazoa - 99 Fungi - 132 Plants - 25 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV 
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG 
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL 
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET 
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKKQ*
>AT2G32160.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 325 Blast hits to 315 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 70 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV 
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG 
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL 
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET 
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKEKTIETT 
YSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 327 Blast hits to 317 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV 
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG 
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL 
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET 
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKEKTIETT 
YSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 328 Blast hits to 321 proteins in 142 species Archae - 0 Bacteria - 0 Metazoa - 99 Fungi - 132 Plants - 25 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV 
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG 
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL 
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET 
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKEKTIETT 
YSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 325 Blast hits to 315 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 70 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSVPF 
SSHDWLDSSLQAHVPLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRH 
KESTRPACLVPGAGLGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPW 
IHTNCNSLSDDDQLRPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCF 
FIDTAHNIIEYIETISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASH 
YGFEMEKEKTIETTYSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 327 Blast hits to 317 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSVPF 
SSHDWLDSSLQAHVPLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRH 
KESTRPACLVPGAGLGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPW 
IHTNCNSLSDDDQLRPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCF 
FIDTAHNIIEYIETISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASH 
YGFEMEKEKTIETTYSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 328 Blast hits to 321 proteins in 142 species Archae - 0 Bacteria - 0 Metazoa - 99 Fungi - 132 Plants - 25 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSVPF 
SSHDWLDSSLQAHVPLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRH 
KESTRPACLVPGAGLGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPW 
IHTNCNSLSDDDQLRPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCF 
FIDTAHNIIEYIETISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASH 
YGFEMEKEKTIETTYSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT1G66510.1 |  AAR2 protein family 
MDSEKALELVKHGATLLFLDVPQYTLVGIDTQIFAVGPAFKGIKMIPPGIHFVFYSSSTR 
DGREFSPTIGFFVDVAPSQVIVRKWNQQDEWLTKVSEEEEERYSQAVRSLEFDKNLGPYN 
LKQYGEWRHLSNYITKDVVEKFEPVGGEITVTYESAILKGGPKTAMEIALDTQMKKSKFT 
TSSTEQPKGNRFYYTSIPRIIKHKGMSGQELTSMNLDKTQLLESVLSKEYKDSEDLLLGE 
LQFSFVAFLMGQSLESFMQWKSIVSLLLGCTSAPFQTRSQLFTKFIKVIYHQLKYGLQKE 
NSGPETGIHALLDDSWLASDSFLHFLCKDFFALVEETSVVDGDLLSWTRKFKELLENRLG 
WEFQKKSAVDGIYFEEDDEYAPVVEMLDESHGEYMDKTT*
>AT1G66510.1 |  AAR2 protein family 
MDSEKALELVKHGATLLFLDVPQYTLVGIDTQIFAVGPAFKGIKMIPPGIHFVFYSSSTR 
DGREFSPTIGFFVDVAPSQVIVRKWNQQDEWLTKVSEEEEERYSQAVRSLEFDKNLGPYN 
LKQYGEWRHLSNYITKDVVEKFEPVGGEITVTYESAILKGGPKTAMEIALDTQMKKSKFT 
TSSTEQPKGNRFYYTSIPRIIKHKGMSGQELTSMNLDKTQLLESVLSKEYKDSEDLLLGE 
LQFSFVAFLMGQSLESFMQWKSIVSLLLGCTSAPFQTRSQLFTKFIKVIYHQLKYGLQKE 
NSGPETGIHALLDDSWLASDSFLHFLCKDFFALVEETSVVDGDLLSWTRKFKELLENRLG 
WEFQKKSAVDGIYFEEDDEYAPVVEMLDESHGEYMDKTT*
>AT1G66510.1 |  AAR2 protein family 
MDSEKALELVKHGATLLFLDVPQYTLVGIDTQIFAVGPAFKGIKMIPPGIHFVFYSSSTR 
DGREFSPTIGFFVDVAPSQVIVRKWNQQDEWLTKVSEEEEERYSQAVRSLEFDKNLGPYN 
LKQYGEWRHLSNYITKDVVEKFEPVGGEITVTYESAILKGGPKTAMEIALDTQMKKSKFT 
TSSTEQPKGNRFYYTSIPRIIKHKGMSGQELTSMNLDKTQLLESVLSKEYKDSEDLLLGE 
LQFSFVAFLMGQSLESFMQWKSIVSLLLGCTSAPFQTRSQLFTKFIKVIYHQLKYGLQKE 
NSGPETGIHALLDDSWLASDSFLHFLCKDFFALVEETSVVDGDLLSWTRKFKELLENRLG 
WEFQKKSAVDGIYFEEDDEYAPVVEMLDESHGEYMDKTT*
>AT1G66510.3 |  AAR2 protein family 
MIPPGIHFVFYSSSTRDGREFSPTIGFFVDVAPSQVIVRKWNQQDEWLTKVSEEEEERYS 
QAVRSLEFDKNLGPYNLKQYGEWRHLSNYITKDVVEKFEPVGGEITVTYESAILKGGPKT 
AMEIALDTQMKKSKFTTSSTEQPKGNRFYYTSIPRIIKHKGMSGQELTSMNLDKTQLLES 
VLSKEYKDSEDLLLGELQFSFVAFLMGQSLESFMQWKSIVSLLLGCTSAPFQTRSQLFTK 
FIKVIYHQLKYGLQKENSGPETGIHALLDDSWLASDSFLHFLCKDFFALVEETSVVDGDL 
LSWTRKFKELLENRLGWEFQKKSAVDGIYFEEDDEYAPVVEMLDESHGEYMDKTT*
>AT1G66510.3 |  AAR2 protein family 
MIPPGIHFVFYSSSTRDGREFSPTIGFFVDVAPSQVIVRKWNQQDEWLTKVSEEEEERYS 
QAVRSLEFDKNLGPYNLKQYGEWRHLSNYITKDVVEKFEPVGGEITVTYESAILKGGPKT 
AMEIALDTQMKKSKFTTSSTEQPKGNRFYYTSIPRIIKHKGMSGQELTSMNLDKTQLLES 
VLSKEYKDSEDLLLGELQFSFVAFLMGQSLESFMQWKSIVSLLLGCTSAPFQTRSQLFTK 
FIKVIYHQLKYGLQKENSGPETGIHALLDDSWLASDSFLHFLCKDFFALVEETSVVDGDL 
LSWTRKFKELLENRLGWEFQKKSAVDGIYFEEDDEYAPVVEMLDESHGEYMDKTT*
>AT1G66510.3 |  AAR2 protein family 
MIPPGIHFVFYSSSTRDGREFSPTIGFFVDVAPSQVIVRKWNQQDEWLTKVSEEEEERYS 
QAVRSLEFDKNLGPYNLKQYGEWRHLSNYITKDVVEKFEPVGGEITVTYESAILKGGPKT 
AMEIALDTQMKKSKFTTSSTEQPKGNRFYYTSIPRIIKHKGMSGQELTSMNLDKTQLLES 
VLSKEYKDSEDLLLGELQFSFVAFLMGQSLESFMQWKSIVSLLLGCTSAPFQTRSQLFTK 
FIKVIYHQLKYGLQKENSGPETGIHALLDDSWLASDSFLHFLCKDFFALVEETSVVDGDL 
LSWTRKFKELLENRLGWEFQKKSAVDGIYFEEDDEYAPVVEMLDESHGEYMDKTT*
>AT1G66510.2 |  AAR2 protein family 
MDSEKALELVKHGATLLFLDVPQYTLVGIDTQIFAVGPAFKGIKMIPPGIHFVFYSSSTR 
DGREFSPTIGFFVDVAPSQVIVRKWNQQDEWLTKVSEEEEERYSQAVRSLEFDKNLGPYN 
LKQYGEWRHLSNYITKDVVEKFEPVGGEITVTYESAILKGGPKTAMEIALDTQMKKSKFT 
TSSTEQPKGNRFYYTSIPRIIKHKGMSGQELTSMNLDKTQLLESVLSKEYKDSEDLLLGE 
LQFSFVAFLMGQSLESFMQWKSIVSLLLGCTSAPFQTRSQLFTKFIKVIYHQLKYGLQKE 
NSGPETGIHALLDDSWLASDSFLHFLCKDFFALVEETSVVDGDLLSWTRKFKELLENRLG 
WEFQKKSAVDGIYFEEDDEYAPVVEMLDESHGEYMDKTT*
>AT1G66510.2 |  AAR2 protein family 
MDSEKALELVKHGATLLFLDVPQYTLVGIDTQIFAVGPAFKGIKMIPPGIHFVFYSSSTR 
DGREFSPTIGFFVDVAPSQVIVRKWNQQDEWLTKVSEEEEERYSQAVRSLEFDKNLGPYN 
LKQYGEWRHLSNYITKDVVEKFEPVGGEITVTYESAILKGGPKTAMEIALDTQMKKSKFT 
TSSTEQPKGNRFYYTSIPRIIKHKGMSGQELTSMNLDKTQLLESVLSKEYKDSEDLLLGE 
LQFSFVAFLMGQSLESFMQWKSIVSLLLGCTSAPFQTRSQLFTKFIKVIYHQLKYGLQKE 
NSGPETGIHALLDDSWLASDSFLHFLCKDFFALVEETSVVDGDLLSWTRKFKELLENRLG 
WEFQKKSAVDGIYFEEDDEYAPVVEMLDESHGEYMDKTT*
>AT1G66510.2 |  AAR2 protein family 
MDSEKALELVKHGATLLFLDVPQYTLVGIDTQIFAVGPAFKGIKMIPPGIHFVFYSSSTR 
DGREFSPTIGFFVDVAPSQVIVRKWNQQDEWLTKVSEEEEERYSQAVRSLEFDKNLGPYN 
LKQYGEWRHLSNYITKDVVEKFEPVGGEITVTYESAILKGGPKTAMEIALDTQMKKSKFT 
TSSTEQPKGNRFYYTSIPRIIKHKGMSGQELTSMNLDKTQLLESVLSKEYKDSEDLLLGE 
LQFSFVAFLMGQSLESFMQWKSIVSLLLGCTSAPFQTRSQLFTKFIKVIYHQLKYGLQKE 
NSGPETGIHALLDDSWLASDSFLHFLCKDFFALVEETSVVDGDLLSWTRKFKELLENRLG 
WEFQKKSAVDGIYFEEDDEYAPVVEMLDESHGEYMDKTT*
>AT1G14640.1 |  SWAP (Suppressor-of-White-APricot)/surp domain-containing protein 
MFNSMKILPLEAPPADGNLGPLPPSQLTDEEIKENEFQGEQNNSIQTPIAVATHTNPIGI 
IYPPPEIRKIVETTAQFVSQNGLAFGNKVKTEKANNANFSFLKSDNPYHGFYRYKVTEYS 
CHIRDGAQGTDVDDTEDPKLDDESDAKPDLQAQFRAPRKILEAPEPEKYTVRLPEGIMEA 
ELDIIKHTAQFVARNGQSFLRELMRREVNNSQFQFMKPTHSMFTFFTSLVDAYSEVLMPP 
RDLKEKLRKSVADLTTVLERCLNRLEWDRFQEEEKNKEEDEKEKERVQMVMIDWKDFAVV 
ESIDFADEEDKDLPMPMTLEEVIRRSKVSAMEEDEIVEPGKEVEMDMDEEEVKLVAEGMR 
AANLEEYVGSVEIEEEAPMRIVKNWKRPEDRFLTERDSSKVVISRITGELIPITEMSEHM 
RISLIDPKFKEQKDRMFAKIRETTLAQDDEIAKNIVGLARLRPDIFGTTEEEVSNAVKAD 
IEKKDEQPKQVIWDGHTGSIGRTANQALTQNSNGEQGDGVYGDPNSFPGPAAFPPPRPGV 
PTVRPLPPPQNLALNLPRPPPSVQYPGAPRPLGVPMMQPMYQQHQLSMSGPHGHPSMMMS 
RPPQMQPVMRVPPPPGSQFSHMQVPQPYGQLPPLSMGMMQPPPMAEMPPPPPPGEAPPPL 
PEEPEPKRQKLDESALVPEDQFLAQHPGPATIRVSKPNENDGQVMEITVQSLSENVGSLK 
EKIAGEMQIPANKQK*
>AT1G60170.1 |  emb1220 (embryo defective 1220) 
MATLEDSFLADLDELSDNEAELDENDGDVGKEEEDVDMDMADLETLNYDDLDNVSKLQKS 
QRYADIMHKVEEALGKDSDGAEKGTVLEDDPEYKLIVDCNQLSVDIENEIVIVHNFIKDK 
YKLKFQELESLVHHPIDYACVVKKIGNETDLALVDLADLLPSAIIMVVSVTALTTKGSAL 
PEDVLQKVLEACDRALDLDSARKKVLEFVESKMGSIAPNLSAIVGSAVAAKLMGTAGGLS 
ALAKMPACNVQVLGHKRKNLAGFSSATSQSRVGYLEQTEIYQSTPPGLQARAGRLVAAKS 
TLAARVDATRGDPLGISGKAFREEIRKKIEKWQEPPPARQPKPLPVPDSEPKKRRGGRRL 
RKMKERYQVTDMRKLANRMAFGTPEESSLGDGLGEGYGMLGQAGSNRLRVSSVPSKLKIN 
AKVAKKLKERQYAGGATTSGLTSSLAFTPVQGIELCNPQQALGLGSGTQSTYFSESGTFS 
KLKKI*
>AT4G03430.1 |  EMB2770 (EMBRYO DEFECTIVE 2770) RNA splicing factor transesterification mechanism 
MVFLSIPNGKTLSIDVNPNSTTISAFEQLAHQRSDVPQSFLRYSLRMRNPSRVFVDSKDS 
DSILLSDLGVSRFSTVIIHVLLLGGMQAAPPKPRLDFLNSKPPSNYVAGLGRGATGFTTR 
SDIGPARAAPDLPDRSALATAAAPGVGRGAGKPSEAEAEDDEEAEEKRYDENQTFDEFEG 
NDVGLFANAEYDEDDKEADAIWESIDQRMDSRRKDRREAKLKEEIEKYRASNPKITEQFA 
DLKRKLHTLSADEWDSIPEIGDYSLRNKKKKFESFVPIPDTLLEKAKKEKELVMALDPKS 
RAAGGSETPWGQTPVTDLTAVGEGRGTVLSLKLDNLSDSVSGQTVVDPKGYLTDLKSMKR 
TTDEEIYDRNRARLLYKSLTQSNPKNPNGWIAAARVEEVDGKIKAARFQIQRGCEECPKN 
EDVWLEACRLANPEDAKGVIAKGVKLIPNSVKLWLEAAKLEHDVENKSRVLRKGLEHIPD 
SVRLWKAVVELANEEDARILLHRAVECCPLHLELWVALARLETYAESKKVLNKAREKLPK 
EPAIWITAAKLEEANGKLDEANDNTAMVGKIIDRGIKTLQREGVVIDRENWMSEAEACER 
VGSVATCQAIIKNTIGIGVEEEDRKRTWVADADECKKRGSIETARAIYAHALSVFLTKKS 
IWLKAAQLEKSHGSRESLDALLRKAVTYVPQAEVLWLMGAKEKWLAGDVPAARAILQEAY 
AAIPNSEEIWLAAFKLEFENKEPERARMLLAKARERGGTERVWMKSAIVERELGNVEEER 
RLLNEGLKQFPTFFKLWLMLGQLEERFKHLEQARKAYDTGLKHCPHCIPLWLSLADLEEK 
VNGLNKARAILTTARKKNPGGAELWLAAIRAELRHDNKREAEHLMSKALQDCPKSGILWA 
ADIEMAPRPRRKTKSIDAMKKCDRDPHVTIAVAKLFWQDKKVEKARAWFERAVTVGPDIG 
DFWALFYKFELQHGSDEDRKEVVAKCVACEPKHGEKWQAISKAVENAHQPIEVILKRVVN 
ALSKEENSA*
>AT5G06160.1 |  ATO (ATROPOS) nucleic acid binding / zinc ion binding 
MSSTLLEQTRSNHEEVERLERLVVEDLQKEPPSSKDRLVQGHRVRHMIESIMLTTEKLVE 
TYEDKDGAWDDEIAALGGQTATGTNVFSEFYDRLKEIREYHKRHPSGRLVDANEDYEARL 
KEEPIIAFSGEEGNGRYLDLHDMYNQYINSKFGERVEYSAYLDVFSQPEKIPRKLKLSRQ 
YMKYMEALLEYLVYFFQRTEPLQDLDRILSKVCSDFEEQYADGIVEGLDNELIPSQHTVI 
DLDYYSTVEELVDVGPEKLKEALGALGLKVGGTPQQRAERLFLTKHTPLEKLDKKHFARP 
PHNGKQNGDAKSTHESENAKEIALTEAKVKKLCNLLDETIERTKQNIVKKQSLTYEEMEG 
EREGEEANTELESDDEDGLIYNPLKLPIGWDGKPIPYWLYKLHGLGQEFKCEICGNYSYW 
GRRAFERHFKEWRHQHGMRCLGIPNTKNFNEITSIEEAKELWKRIQERQGVNKWRPELEE 
EYEDREGNIYNKKTYSDLQRQGLI*
>AT1G11650.2 |  RNA-binding protein 45 (RBP45) putative 
MMQQPPPGGILPHHAPPPSAQQQYGYQQPYGIAGAAPPPPQMWNPQAAAPPSVQPTTADE 
IRTLWIGDLQYWMDENFLYGCFAHTGEMVSAKVIRNKQTGQVEGYGFIEFASHAAAERVL 
QTFNNAPIPSFPDQLFRLNWASLSSGDKRDDSPDYTIFVGDLAADVTDYILLETFRASYP 
SVKGAKVVIDRVTGRTKGYGFVRFSDESEQIRAMTEMNGVPCSTRPMRIGPAASKKGVTG 
QRDSYQSSAAGVTTDNDPNNTTVFVGGLDASVTDDHLKNVFSQYGEIVHVKIPAGKRCGF 
VQFSEKSCAEEALRMLNGVQLGGTTVRLSWGRSPSNKQSGDPSQFYYGGYGQGQEQYGYT 
MPQDPNAYYGGYSGGGYSGGYQQTPQAGQQPPQQPPQQQQVGFSY*
>AT1G11650.2 |  RNA-binding protein 45 (RBP45) putative 
MMQQPPPGGILPHHAPPPSAQQQYGYQQPYGIAGAAPPPPQMWNPQAAAPPSVQPTTADE 
IRTLWIGDLQYWMDENFLYGCFAHTGEMVSAKVIRNKQTGQVEGYGFIEFASHAAAERVL 
QTFNNAPIPSFPDQLFRLNWASLSSGDKRDDSPDYTIFVGDLAADVTDYILLETFRASYP 
SVKGAKVVIDRVTGRTKGYGFVRFSDESEQIRAMTEMNGVPCSTRPMRIGPAASKKGVTG 
QRDSYQSSAAGVTTDNDPNNTTVFVGGLDASVTDDHLKNVFSQYGEIVHVKIPAGKRCGF 
VQFSEKSCAEEALRMLNGVQLGGTTVRLSWGRSPSNKQSGDPSQFYYGGYGQGQEQYGYT 
MPQDPNAYYGGYSGGGYSGGYQQTPQAGQQPPQQPPQQQQVGFSY*
>AT1G11650.1 |  RNA-binding protein 45 (RBP45) putative 
MMQQPPPGGILPHHAPPPSAQQQYGYQQPYGIAGAAPPPPQMWNPQAAAPPSVQPTTADE 
IRTLWIGDLQYWMDENFLYGCFAHTGEMVSAKVIRNKQTGQVEGYGFIEFASHAAAERVL 
QTFNNAPIPSFPDQLFRLNWASLSSGDKRDDSPDYTIFVGDLAADVTDYILLETFRASYP 
SVKGAKVVIDRVTGRTKGYGFVRFSDESEQIRAMTEMNGVPCSTRPMRIGPAASKKGVTG 
QRDSYQSSAAGVTTDNDPNNTTVFVGGLDASVTDDHLKNVFSQYGEIVHVKIPAGKRCGF 
VQFSEK*
>AT1G11650.1 |  RNA-binding protein 45 (RBP45) putative 
MMQQPPPGGILPHHAPPPSAQQQYGYQQPYGIAGAAPPPPQMWNPQAAAPPSVQPTTADE 
IRTLWIGDLQYWMDENFLYGCFAHTGEMVSAKVIRNKQTGQVEGYGFIEFASHAAAERVL 
QTFNNAPIPSFPDQLFRLNWASLSSGDKRDDSPDYTIFVGDLAADVTDYILLETFRASYP 
SVKGAKVVIDRVTGRTKGYGFVRFSDESEQIRAMTEMNGVPCSTRPMRIGPAASKKGVTG 
QRDSYQSSAAGVTTDNDPNNTTVFVGGLDASVTDDHLKNVFSQYGEIVHVKIPAGKRCGF 
VQFSEK*
>AT1G79880.2 |  La domain-containing protein 
MRNLLGLGNINREDIPPRIVEEVANLLRTSDFLKVSNNGQRIGRGTKLSKPEEVLEQVHR 
RTLAASPFEYSIKMEDVSSFFSQYAKVNSVRLPPNIADKRRFCGTALVEFSSEQDTQDIL 
RQSLVYAGADLVLIPKSDFDCQRENMIKQLGKSESHNEFRRGQIVKFALKWIASEEKVTN 
KEKPSALKNKIKEKEDKETGIADREKENGDNSCASLCKDNTDQLVVPPWNNSNSVSSEVL 
KDLFQRFGSVEHIEYSGGLDSGYVWFTDSETAMKARAAVEFVGGLVVKNNFSVALEAING 
EMERELWKRLSSAELEGGKEGHKKEKGKDECFENVQPTKKARKEP*
>AT1G79880.2 |  La domain-containing protein 
MRNLLGLGNINREDIPPRIVEEVANLLRTSDFLKVSNNGQRIGRGTKLSKPEEVLEQVHR 
RTLAASPFEYSIKMEDVSSFFSQYAKVNSVRLPPNIADKRRFCGTALVEFSSEQDTQDIL 
RQSLVYAGADLVLIPKSDFDCQRENMIKQLGKSESHNEFRRGQIVKFALKWIASEEKVTN 
KEKPSALKNKIKEKEDKETGIADREKENGDNSCASLCKDNTDQLVVPPWNNSNSVSSEVL 
KDLFQRFGSVEHIEYSGGLDSGYVWFTDSETAMKARAAVEFVGGLVVKNNFSVALEAING 
EMERELWKRLSSAELEGGKEGHKKEKGKDECFENVQPTKKARKEP*
>AT1G79880.2 |  La domain-containing protein 
MRNLLGLGNINREDIPPRIVEEVANLLRTSDFLKVSNNGQRIGRGTKLSKPEEVLEQVHR 
RTLAASPFEYSIKMEDVSSFFSQYAKVNSVRLPPNIADKRRFCGTALVEFSSEQDTQDIL 
RQSLVYAGADLVLIPKSDFDCQRENMIKQLGKSESHNEFRRGQIVKFALKWIASEEKVTN 
KEKPSALKNKIKEKEDKETGIADREKENGDNSCASLCKDNTDQLVVPPWNNSNSVSSEVL 
KDLFQRFGSVEHIEYSGGLDSGYVWFTDSETAMKARAAVEFVGGLVVKNNFSVALEAING 
EMERELWKRLSSAELEGGKEGHKKEKGKDECFENVQPTKKARKEP*
>AT1G79880.3 |  La domain-containing protein 
MRNLLGLGNINREDIPPRIVEEVANLLRTSDFLKVSNNGQRIGRGTKLSKPEEVLEQVHR 
RTLAASPFEYSIKMEDVSSFFSQYAKVNSVRLPPNIADKRRFCGTALVEFSSEQDTQDIL 
RQSLVYAGADLVLIPKSDFDCQRENMIKQLGKSESHNEFRRGQIVKFALKWIASEEKVTN 
KEKPSALKNKIKEKEDKETGIADREKENGDNSCASLCKDNTDQLVVPPWNNSNSVSSEVL 
KDLFQRFGSVEHIEYSGGLDSGYVWFTDSETAMKARAAVEFVGGLVVKNNFSVALEAING 
EMERELWKRLSSAELEGGYDIKPNLVYSSLKEGHKKEKGKDECFENVQPTKKARKEP*
>AT1G79880.3 |  La domain-containing protein 
MRNLLGLGNINREDIPPRIVEEVANLLRTSDFLKVSNNGQRIGRGTKLSKPEEVLEQVHR 
RTLAASPFEYSIKMEDVSSFFSQYAKVNSVRLPPNIADKRRFCGTALVEFSSEQDTQDIL 
RQSLVYAGADLVLIPKSDFDCQRENMIKQLGKSESHNEFRRGQIVKFALKWIASEEKVTN 
KEKPSALKNKIKEKEDKETGIADREKENGDNSCASLCKDNTDQLVVPPWNNSNSVSSEVL 
KDLFQRFGSVEHIEYSGGLDSGYVWFTDSETAMKARAAVEFVGGLVVKNNFSVALEAING 
EMERELWKRLSSAELEGGYDIKPNLVYSSLKEGHKKEKGKDECFENVQPTKKARKEP*
>AT1G79880.3 |  La domain-containing protein 
MRNLLGLGNINREDIPPRIVEEVANLLRTSDFLKVSNNGQRIGRGTKLSKPEEVLEQVHR 
RTLAASPFEYSIKMEDVSSFFSQYAKVNSVRLPPNIADKRRFCGTALVEFSSEQDTQDIL 
RQSLVYAGADLVLIPKSDFDCQRENMIKQLGKSESHNEFRRGQIVKFALKWIASEEKVTN 
KEKPSALKNKIKEKEDKETGIADREKENGDNSCASLCKDNTDQLVVPPWNNSNSVSSEVL 
KDLFQRFGSVEHIEYSGGLDSGYVWFTDSETAMKARAAVEFVGGLVVKNNFSVALEAING 
EMERELWKRLSSAELEGGYDIKPNLVYSSLKEGHKKEKGKDECFENVQPTKKARKEP*
>AT1G79880.1 |  La domain-containing protein 
MASSFNEETAKKLLTQVEFYFSDSNLPTDGFLNREVTKSKDGLVSLPLVCSFSRMRNLLG 
LGNINREDIPPRIVEEVANLLRTSDFLKVSNNGQRIGRGTKLSKPEEVLEQVHRRTLAAS 
PFEYSIKMEDVSSFFSQYAKVNSVRLPPNIADKRRFCGTALVEFSSEQDTQDILRQSLVY 
AGADLVLIPKSDFDCQRENMIKQLGKSESHNEFRRGQIVKFALKWIASEEKVTNKEKPSA 
LKNKIKEKEDKETGIADREKENGDNSCASLCKDNTDQLVVPPWNNSNSVSSEVLKDLFQR 
FGSVEHIEYSGGLDSGYVWFTDSETAMKARAAVEFVGGLVVKNNFSVALEAINGEMEREL 
WKRLSSAELEGGKEGHKKEKGKDECFENVQPTKKARKEP*
>AT1G79880.1 |  La domain-containing protein 
MASSFNEETAKKLLTQVEFYFSDSNLPTDGFLNREVTKSKDGLVSLPLVCSFSRMRNLLG 
LGNINREDIPPRIVEEVANLLRTSDFLKVSNNGQRIGRGTKLSKPEEVLEQVHRRTLAAS 
PFEYSIKMEDVSSFFSQYAKVNSVRLPPNIADKRRFCGTALVEFSSEQDTQDILRQSLVY 
AGADLVLIPKSDFDCQRENMIKQLGKSESHNEFRRGQIVKFALKWIASEEKVTNKEKPSA 
LKNKIKEKEDKETGIADREKENGDNSCASLCKDNTDQLVVPPWNNSNSVSSEVLKDLFQR 
FGSVEHIEYSGGLDSGYVWFTDSETAMKARAAVEFVGGLVVKNNFSVALEAINGEMEREL 
WKRLSSAELEGGKEGHKKEKGKDECFENVQPTKKARKEP*
>AT1G79880.1 |  La domain-containing protein 
MASSFNEETAKKLLTQVEFYFSDSNLPTDGFLNREVTKSKDGLVSLPLVCSFSRMRNLLG 
LGNINREDIPPRIVEEVANLLRTSDFLKVSNNGQRIGRGTKLSKPEEVLEQVHRRTLAAS 
PFEYSIKMEDVSSFFSQYAKVNSVRLPPNIADKRRFCGTALVEFSSEQDTQDILRQSLVY 
AGADLVLIPKSDFDCQRENMIKQLGKSESHNEFRRGQIVKFALKWIASEEKVTNKEKPSA 
LKNKIKEKEDKETGIADREKENGDNSCASLCKDNTDQLVVPPWNNSNSVSSEVLKDLFQR 
FGSVEHIEYSGGLDSGYVWFTDSETAMKARAAVEFVGGLVVKNNFSVALEAINGEMEREL 
WKRLSSAELEGGKEGHKKEKGKDECFENVQPTKKARKEP*
>AT1G65660.1 |  SMP1 (SWELLMAP 1) nucleic acid binding / single-stranded RNA binding 
MATASVAFKSREDHRKQIELEEARKAGLAPAEVDEDGKEINPHIPQYMSSAPWYLNSEKP 
SLKHQRKWKSDPNYTKSWYDRGAKIFQAEKYRKGACQNCGAMTHTAKACMDRPRKIGAKY 
TNMNIAPDEKIESFELDYDGKRDRWNGYDPSTYHRVIDLYEAKEDARKKYLKEQQLKKLE 
EKNNNEKGDDANSDGEEDEDDLRVDEAKVDESRQMDFAKVEKRVRTTGGGSTGTVRNLRI 
REDTAKYLLNLDVNSAHYDPKTRSMREDPLPDADPNDKFYLGDNQYRNSGQALEFKQLNI 
HSWEAFDKGQDMHMQAAPSQAELLYKSFQVAKEKLKSQTKDTIMDKYGNAATEDEIPMEL 
LLGQSERQVEYDRAGRIIKGQEVILPKSKYEEDVHANNHTSVWGSYWKDHQWGYKCCQQI 
IRNSYCTGSAGIEAAEAALDLMKANIARKEATEESPKKVEEKRMASWGTDIPEDLELNEE 
ALANALKKEDLSRREEKDERKRKYNVKYNNDVTPEEMEAYRMKRVHHEDPMKDFL*