>AT4G02840.1 |  small nuclear ribonucleoprotein D1 putative / snRNP core protein D1 putative / Sm protein D1 putative 
MKLVRFLMKLNNETVSIELKNGTIVHGTITGVDVSMNTHLKAVKLTLKGKNPVTLDHLSV 
RGNNIRYYILPDSLNLETLLVEDTPRIKPKKPTAGKIPAGRGRGRGRGRGRGRGGR*
>AT2G46560.1 |  transducin family protein / WD-40 repeat family protein 
MRYRIQLKPSGLERGSGDPRTDRIDHLPLRQLRSEIVPPAPTRSQSSIDWLPDFANYSWL 
AYGASTLVVISHLPSPLRGEDSTNGPFFRQILEVSGEPVTAVCWSPVTPSVGELAVGSGN 
YIFLFARDLKGSFCWSQNAILVQETIVEAIEWTGSGDGIIVGGTDIVLWKRRNQSWEIAW 
KFSGDHLQDLVSSTWSFEGPFATATSWRKFPAECDDAGKSVLAYYSDGESYHNFELPHPQ 
RISMIQWRPMAAEQSAIGIGKSMRNVLMTCCLDGAVRLWCEVDGGKTKKGMKDVPDHKKS 
FCVAAVIEINQVLDGCLGRDLFLFWGTRTGGIFKTIEGTNQVFSMEKYDNENVGKCEWLV 
GYGPGNFATLWAVHCLDDISPMRFPRVTLWAKQESNEIGAGSLSLASATGSSDRLPLKKV 
SVLRNNLYGTPLICSSIYLSPQNTVYWSSLHTIKSHDSEDSSPNKSSLLKCIDGKVLYLD 
GHGGKILQVASDPFVCEAGYTASLDSNGLIIICSSSVYLNRTIEHPISVASWKPCGRLQN 
QEFRLKYTSLCWAPSSLKDERFLLVGHVGGVDCFSVRNCGKGDDGYLTHYICTIPFTVNS 
PLQSGPTSIFAKPLSNSCGKTFKSNRFLLLSVWMKEKRFDALSWSVTLHHFDTAGSTCDC 
HFHDFDSIGLGKWLFEDTFAGKTNCLAIRSCSSEIPESHREDEVTSFAVVNPSGRDLENG 
VNSESQAYTIATGQADGSLKLWRSSFQESSTPSGLWELVGMLTVGQNPVSAISLTDSGHK 
IAALCTESHSKAARAVSIWEIVHLIDSGVFILEDKVHVDAEVVAVRWSTTGNDQLLLGVC 
TQIEMRVYGIARQPCKSTSFAAYDYSSEAQIWQCFAVTRTFSAIHDLWWGPKAMTCLVHN 
DYISLHGQWLAVVDKKQKIDNYPEIFASNLPNLVNATEEGRDSEFLSDSGTNDINEADTT 
STSRGCIPLPSTSNAIDDGQVNSMSLIGTAYGSNTIDDIMSMGHMVEKLGGALPLYHPHA 
LLVAIRSGNWKRASAALRHLAEYITSSDTSEKGYAVKSVLCPDILLSKYYEGSLSNGPNP 
KDFQWGGTSGSMLQYSQFQSGLQSKFNMESYSPNSPATDLEFSGFCEQLKKLSDEGNISR 
IEILQYFAIVDLLCEISNPHSTSVYASLDEPGRRFWVTLRFKQLFLARSSGKTASLEELD 
IDSSMIGWAFHSESQENLSGSLLPNESSWQQMRSQGFGFWYSNAAQLRSRMEKLARQQYL 
KNKNPKDCALLYIALNRVQVLAGLFKLSKDEKDKPLVVFLSRNFQEEKNKAAALKNAYVL 
MGKHQLELAIGFFLLGGEASSAINVCVKNLQDEQLALVICRLIDGQGGALESNLIKKYIL 
PSAVQRGDFWLASLLKWELGEYHRSILAMAGCLENPATESSTVSSNHVSFVDPSIGLYCL 
MLATKNSVKNALGERTASTLSRWASLMAATAFSRCGLPLEALECLSPSASGHGGTHQTSV 
PSNGQLHTTQGVFDHSVPHSSNWVSSGVSSTVDTHFRLGLAVQFLSMILREATAPLMNSE 
VVSCEKFSRFQHKLQTALEQFHQRFSLSASYLRNMMILSAYNRGLLSMGHNIFQENSSSG 
LSDDKSHTDEDLLQYSALSKLILKATDEKSLVLSRIIAACSVTCLHSVPCFEENKVSSGP 
DPKWSNALRFYFQGILESFSNLRTSIRLCLGSSVEDLKTKLAVVLDLVEYCLRLAMAWVL 
GDVHCLFRMVQPLVISYFNGHMPYEVDLESVKRVYHQEASVSVPDASDVGVNSKFSSVVE 
NHGVGYPVYSIPEDERCLVTQACFWKHVSDFVKLKLVSISINLDDGISNSGSAENFDAQT 
SLDSSDDIVCVTEKIMSVLGKTLISTLAQLSSYHVKQLVLVLKQKLEKRLQVPTLLWLLE 
CQGSQANFLNRDIPDAGVETEKNGDPVVSVRFWKLCVDPHLLHEAFLLENFDIFEWSKSK 
PLEDWSDMYREVIRKNELYVPCNQDGRSSNEVASLANHASNSSPKAAVTANENSAFQNPK 
EIHKRTGELIEALCINAINHRQAALASNRKGIIFFNLEDGDSSQNQSDYIWSDADWPHNG 
WANSESTPVPTCVSLGVGLGDKKGAHLGLGVSGLGWETQEEFEEFVDPPPTVESVITRAF 
SNHPTMPLFLVGSSNTHIYLWEFGNERATATYGVLPAANVSPPYALASISAVQFGPFGHR 
FASAALDGTVCTWQSEVGGRSNIHPVESSLCFNGHASDVGYISSSGSIVAASGYSSSGAN 
VVVWDTLAPPSTSQASINCHEGGARSISVFDNDIGSGSISPMIVTGGKNGDVGLHDFRFI 
ATGKMKKQRNPDGGSSTDGDQNKNGMLWYIPKAHLGSVTKIATIPRTSLFLTGSKDGEVK 
LWDAKAAKLIHHWPKLHERHTFLQPNSRGYGGIIRAGVTDIQVCPNGFITCGGDGTVKFV 
SLVDSSYGDAK*
>AT1G20960.1 |  emb1507 (embryo defective 1507) ATP binding / ATP-dependent helicase/ helicase/ nucleic acid binding / nucleoside-triphosphatase/ nucleotide binding 
MANLGGGAEAHARFKQYEYRANSSLVLTTDNRPRDTHEPTGEPETLWGKIDPRSFGDRVA 
KGRPQELEDKLKKSKKKERDVVDDMVNIRQSKRRRLREESVLTDTDDAVYQPKTKETRAA 
YEAMLGLIQKQLGGQPPSIVSGAADEILAVLKNDAFRNPEKKMEIEKLLNKIENHEFDQL 
VSIGKLITDFQEGGDSGGGRANDDEGLDDDLGVAVEFEENEEDDEESDPDMVEEDDDEED 
DEPTRTGGMQVDAGINDEDAGDANEGTNLNVQDIDAYWLQRKISQAYEQQIDPQQCQVLA 
EELLKILAEGDDRVVEDKLLMHLQYEKFSLVKFLLRNRLKVVWCTRLARAEDQEERNRIE 
EEMRGLGPELTAIVEQLHATRATAKEREENLQKSINEEARRLKDETGGDGGRGRRDVADR 
DSESGWVKGQRQMLDLESLAFDQGGLLMANKKCDLPPGSYRSHGKGYDEVHVPWVSKKVD 
RNEKLVKITEMPDWAQPAFKGMQQLNRVQSKVYDTALFKAENILLCAPTGAGKTNVAMLT 
ILQQLEMNRNTDGTYNHGDYKIVYVAPMKALVAEVVGNLSNRLKDYGVIVRELSGDQSLT 
GREIEETQIIVTTPEKWDIITRKSGDRTYTQLVRLLIIDEIHLLHDNRGPVLESIVARTL 
RQIETTKENIRLVGLSATLPNYEDVALFLRVDLKKGLFKFDRSYRPVPLHQQYIGISVKK 
PLQRFQLMNDLCYQKVLAGAGKHQVLIFVHSRKETSKTARAIRDTAMANDTLSRFLKEDS 
VTRDVLHSHEDIVKNSDLKDILPYGFAIHHAGLSRGDREIVETLFSQGHVQVLVSTATLA 
WGVNLPAHTVIIKGTQVYNPEKGAWMELSPLDVMQMLGRAGRPQYDQHGEGIIITGYSEL 
QYYLSLMNEQLPIESQFISKLADQLNAEIVLGTVQNAREACHWLGYTYLYIRMVRNPTLY 
GLAPDALAKDVVLEERRADLIHSAATILDKNNLVKYDRKSGYFQVTDLGRIASYYYITHG 
TIATYNEHLKPTMGDIDLYRLFSLSDEFKYVTVRQDEKMELAKLLDRVPIPIKETLEEPS 
AKINVLLQAYISQLKLEGLSLTSDMVYITQSAGRLVRALYEIVLKRGWAQLAEKALNLSK 
MVGKRMWSVQTPLRQFHGLSNDILMQLEKKDLVWERYYDLSAQELGELIRSPKMGKPLHK 
FIHQFPKVTLSAHVQPITRTVLNVELTVTPDFLWDEKIHKYVEPFWIIVEDNDGEKILHH 
EYFLLKKQYIDEDHTLHFTVPIFEPLPPQYFVRVVSDKWLGSETVLPVSFRHLILPEKYP 
PPTELLDLQPLPVTALRNPNYEILYQDFKHFNPVQTQVFTVLYNTNDNVLVAAPTGSGKT 
ICAEFAILRNHHEGPDATMRVVYIAPLEAIAKEQFRIWEGKFGKGLGLRVVELTGETALD 
LKLLEKGQIIISTPEKWDALSRRWKQRKYVQQVSLFIVDELHLIGGQHGPVLEVIVSRMR 
YISSQVINKIRIVALSTSLANAKDLGEWIGASSHGLFNFPPGVRPVPLEIHIQGVDISSF 
EARMQAMTKPTYTAIVQHAKNKKPAIVFVPTRKHVRLTAVDLMAYSHMDNPQSPDFLLGK 
LEELDPFVEQIREETLKETLCHGIGYLHEGLSSLDQEIVTQLFEAGRIQVCVMSSSLCWG 
TPLTAHLVVVMGTQYYDGRENSHSDYPVPDLLQMMGRASRPLLDNAGKCVIFCHAPRKEY 
YKKFLYEAFPVESQLQHFLHDNFNAEVVAGVIENKQDAVDYLTWTFMYRRLPQNPNYYNL 
QGVSHRHLSDHLSELVENTLSDLEASKCIEVEDEMELSPLNLGMIASYYYISYTTIERFS 
SLLSSKTKMKGLLEILTSASEYDMIPIRPGEEDTVRRLINHQRFSFENPKCTDPHVKANA 
LLQAHFSRQNIGGNLAMDQRDVLLSATRLLQAMVDVISSNGWLNLALLAMEVSQMVTQGM 
WERDSMLLQLPHFTKDLAKRCQENPGKNIETVFDLVEMEDEERQELLKMSDAQLLDIARF 
CNRFPNIDLTYEIVGSEEVNPGKEVTLQVMLERDMEGRTEVGPVDSLRYPKTKEEGWWLV 
VGDTKTNQLLAIKRVSLQRKVKVKLDFTAPSEPGEKSYTLYFMCDSYLGCDQEYSFSVDV 
KGSGAGDRMEE*
>AT1G09760.1 |  U2A (U2 small nuclear ribonucleoprotein A) protein binding 
MVKLTADLIWKSPHFFNAIKERELDLRGNKIPVIENLGATEDQFDTIDLSDNEIVKLENF 
PYLNRLGTLLINNNRITRINPNLGEFLPKLHSLVLTNNRLVNLVEIDPLASIPKLQYLSL 
LDNNITKKANYRLYVIHKLKSLRVLDFIKIKAKERAEAASLFSSKEAEEEVKKVSREEVK 
KVSETAENPETPKVVAPTAEQILAIKAAIINSQTIEEIARLEQALKFGQVPAGLIIPDPA 
TNDSAPMEE*
>AT1G09770.1 |  ATCDC5 (ARABIDOPSIS THALIANA CELL DIVISION CYCLE 5) DNA binding / transcription factor 
MRIMIKGGVWKNTEDEILKAAVMKYGKNQWARISSLLVRKSAKQCKARWYEWLDPSIKKT 
EWTREEDEKLLHLAKLLPTQWRTIAPIVGRTPSQCLERYEKLLDAACTKDENYDAADDPR 
KLRPGEIDPNPEAKPARPDPVDMDEDEKEMLSEARARLANTRGKKAKRKAREKQLEEARR 
LASLQKRRELKAAGIDGRHRKRKRKGIDYNAEIPFEKRAPAGFYDTADEDRPADQVKFPT 
TIEELEGKRRADVEAHLRKQDVARNKIAQRQDAPAAILQANKLNDPEVVRKRSKLMLPPP 
QISDHELEEIAKMGYASDLLAENEELTEGSAATRALLANYSQTPRQGMTPMRTPQRTPAG 
KGDAIMMEAENLARLRDSQTPLLGGENPELHPSDFTGVTPRKKEIQTPNPMLTPSMTPGG 
AGLTPRIGLTPSRDGSSFSMTPKGTPFRDELHINEDMDMHESAKLERQRREEARRSLRSG 
LTGLPQPKNEYQIVAQPPPEESEEPEEKIEEDMSDRIAREKAEEEARQQALLKKRSKVLQ 
RDLPRPPAASLAVIRNSLLSADGDKSSVVPPTPIEVADKMVREELLQLLEHDNAKYPLDD 
KAEKKKGAKNRTNRSASQVLAIDDFDENELQEADKMIKEEGKFLCVSMGHENKTLDDFVE 
AHNTCVNDLMYFPTRSAYELSSVAGNADKVAAFQEEMENVRKKMEEDEKKAEHMKAKYKT 
YTKGHERRAETVWTQIEATLKQAEIGGTEVECFKALKRQEEMAASFRKKNLQEEVIKQKE 
TESKLQTRYGNMLAMVEKAEEIMVGFRAQALKKQEDVEDSHKLKEAKLATGEEEDIAIAM 
EASA*
>AT5G08290.1 |  YLS8 catalytic 
MSYLLPHLHSGWAVDQSILAEEERLVVIRFGHDWDETCMQMDEVLASVAETIKNFAVIYL 
VDITEVPDFNTMYELYDPSTVMFFFRNKHIMIDLGTGNNNKINWALKDKQEFIDIIETVY 
RGARKGRGLVIAPKDYSTKYRY*
>AT3G62840.1 |  FUNCTIONS IN molecular_function unknown LOCATED IN small nucleolar ribonucleoprotein complex nucleus EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Like-Sm ribonucleoprotein core (InterProIPR001163) Like-Sm ribonucleoprotein eukaryotic and archaea-type core (InterProIPR006649) Like-Sm ribonucleoprotein-related core (InterProIPR010920) BEST Arabidopsis thaliana protein match is small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative (TAIRAT2G476404) Has 535 Blast hits to 535 proteins in 164 species Archae - 2 Bacteria - 0 Metazoa - 239 Fungi - 109 Plants - 78 Viruses - 0 Other Eukaryotes - 107 (source NCBI BLink) 
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM 
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT3G62840.1 |  FUNCTIONS IN molecular_function unknown LOCATED IN small nucleolar ribonucleoprotein complex nucleus EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Like-Sm ribonucleoprotein core (InterProIPR001163) Like-Sm ribonucleoprotein eukaryotic and archaea-type core (InterProIPR006649) Like-Sm ribonucleoprotein-related core (InterProIPR010920) BEST Arabidopsis thaliana protein match is small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative (TAIRAT2G476403) Has 537 Blast hits to 537 proteins in 165 species Archae - 2 Bacteria - 0 Metazoa - 239 Fungi - 109 Plants - 78 Viruses - 0 Other Eukaryotes - 109 (source NCBI BLink) 
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM 
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT3G62840.2 |  FUNCTIONS IN molecular_function unknown LOCATED IN small nucleolar ribonucleoprotein complex nucleus EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Like-Sm ribonucleoprotein core (InterProIPR001163) Like-Sm ribonucleoprotein eukaryotic and archaea-type core (InterProIPR006649) Like-Sm ribonucleoprotein-related core (InterProIPR010920) BEST Arabidopsis thaliana protein match is small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative (TAIRAT2G476404) Has 535 Blast hits to 535 proteins in 164 species Archae - 2 Bacteria - 0 Metazoa - 239 Fungi - 109 Plants - 78 Viruses - 0 Other Eukaryotes - 107 (source NCBI BLink) 
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV 
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT3G62840.2 |  FUNCTIONS IN molecular_function unknown LOCATED IN small nucleolar ribonucleoprotein complex nucleus EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Like-Sm ribonucleoprotein core (InterProIPR001163) Like-Sm ribonucleoprotein eukaryotic and archaea-type core (InterProIPR006649) Like-Sm ribonucleoprotein-related core (InterProIPR010920) BEST Arabidopsis thaliana protein match is small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative (TAIRAT2G476403) Has 537 Blast hits to 537 proteins in 165 species Archae - 2 Bacteria - 0 Metazoa - 239 Fungi - 109 Plants - 78 Viruses - 0 Other Eukaryotes - 109 (source NCBI BLink) 
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV 
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G23930.1 |  SNRNP-G (PROBABLE SMALL NUCLEAR RIBONUCLEOPROTEIN G) 
MSRSGQPPDLKKYMDKKLQIKLNANRMVTGTLRGFDQFMNLVVDNTVEVNGNDKTDIGMV 
VIRGNSIVTVEALEPVGRSS*
>AT2G23930.1 |  SNRNP-G (PROBABLE SMALL NUCLEAR RIBONUCLEOPROTEIN G) 
MSRSGQPPDLKKYMDKKLQIKLNANRMVTGTLRGFDQFMNLVVDNTVEVNGNDKTDIGMV 
VIRGNSIVTVEALEPVGRSS*
>AT2G23930.2 |  SNRNP-G (PROBABLE SMALL NUCLEAR RIBONUCLEOPROTEIN G) 
MDKKLQIKLNANRMVTGTLRGFDQFMNLVVDNTVEVNGNDKTDIGMVVIRGNSIVTVEAL 
EPVGRSS*
>AT2G23930.2 |  SNRNP-G (PROBABLE SMALL NUCLEAR RIBONUCLEOPROTEIN G) 
MDKKLQIKLNANRMVTGTLRGFDQFMNLVVDNTVEVNGNDKTDIGMVVIRGNSIVTVEAL 
EPVGRSS*
>AT5G27720.1 |  emb1644 (embryo defective 1644) 
MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECY 
IRGNTIKYLRVPDEVIDKVQEEKTRTDRKPPGVGRGRGRGVDDGGARGRGRGTSMGKMGG 
NRGAGRGRG*
>AT3G11500.1 |  small nuclear ribonucleoprotein G putative / snRNP-G putative / Sm protein G putative 
MSRSGQPPDLKKYMDKKLQIKLNANRMVVGTLRGFDQFMNLVVDNTVEVNGDDKTDIGMV 
VIRGNSIVTVEALEPVGRS*
>AT1G03330.1 |  small nuclear ribonucleoprotein D putative / snRNP core SM-like protein putative / U6 snRNA-associated Sm-like protein putative 
MLFFSYFKDLVGQEVTVELKNDLAIRGTLHSVDQYLNIKLENTRVVDQDKYPHMLSVRNC 
FIRGSVVRYVQLPKDGVDVDLLHDAARREARGG*
>AT4G30330.1 |  small nuclear ribonucleoprotein E putative / snRNP-E putative / Sm protein E putative 
MASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRITGFDEYMNLVLDEAEEV 
SIKKKTRKPLGRILLKGDNITLMMNAGK*
>AT2G47640.1 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM 
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.1 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM 
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.1 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM 
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.1 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM 
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.2 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV 
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.2 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV 
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.2 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV 
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.2 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV 
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.3 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV 
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.3 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV 
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.3 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV 
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.3 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV 
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.4 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM 
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.4 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM 
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.4 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM 
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.4 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM 
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT3G19840.1 |  FF domain-containing protein / WW domain-containing protein 
MLANAPFGRPGTLAPPGLMTSPPAFPGSNPFSTTPRPGMSAGPAQMNPGIHPHMYPPYHS 
LPGTPQGMWLQPPSMGGIPRAPFLSHPTTFPGSYPFPVRGISPNLPYSGSHPLGASPMGS 
VGNVHALPGRQPDISPGRKTEELSGIAGSQLVGNRLDAWTAHKSEAGVLYYYNSVTGQST 
YEKPPGFGGEPDKVPVQPIPVSILPGTDWALVSTNDGKKYYYNNKTKVSSWQIPAEVKDF 
GKKLEERAMESVASVPSADLTEKGSDLTSLSAPAISNGGRDAASLKTTNFGSSALDLVKK 
KLHDSGMPVSSTITSEANSGKTTEVTPSGESGNSTGKVKDAPGAGALSDSSSDSEDEDSG 
PSKEECSKQFKEMLKERGIAPFSKWEKELPKIIFDPRFKAIPSHSVRRSLFEQYVKTRAE 
EERREKRAAHKAAIEGFRQLLDDASTDIDQHTDYRAFKKKWGNDLRFEAIERKEREGLLN 
ERVLSLKRSAEQKAQEIRAAAASDFKTMLREREISINSHWSKVKDSLRNEPRYRSVAHED 
REVFYYEYIAELKAAQRGDDHEMKARDEEDKLRERERELRKRKEREVQEVERVRQKIRRK 
EASSSYQALLVEKIRDPEASWTESKPILERDPQKRASNPDLEPADKEKLFRDHVKSLYER 
CVHDFKALLAEALSSEAATLQTEDGKTALNSWSTAKQVLKPDIRYSKMPRQDREVVWRRY 
VEDISRKQRHENYQEEKQRDYKT*
>AT3G50670.1 |  U1-70K (U1 SMALL NUCLEAR RIBONUCLEOPROTEIN-70K) RNA binding / nucleic acid binding / nucleotide binding 
MGDSGDPFLRNPNAAVQARAKVQNRANVLQLKLMGQSHPTGLTNNLLKLFEPRPPLEYKP 
PPEKRKCPPYTGMAQFVSNFAEPGDPEYAPPKPEVELPSQKRERIHKLRLEKGVEKAAED 
LKKYDPNNDPNATGDPYKTLFVSRLNYESSESKIKREFESYGPIKRVHLVTDQLTNKPKG 
YAFIEYMHTRDMKAAYKQADGQKIDGRRVLVDVERGRTVPNWRPRRLGGGLGTSRVGGGE 
EIVGEQQPQGRTSQSEEPSRPREEREKSREKGKERERSRELSHEQPRERSRDRPREDKHH 
RDRDQGGRDRDRDSRRDRDRTRDRGDRDRRDRDRGRDRTSRDHDRDRSRKKERDYEGGEY 
EHEGGGRSRERDAEYRGEPEETRGYYEDDQGDTDRYSHRYDKMEEDDFRYEREYKRSKRS 
ESREYVR*
>AT3G50670.1 |  U1-70K (U1 SMALL NUCLEAR RIBONUCLEOPROTEIN-70K) RNA binding / nucleotide binding 
MGDSGDPFLRNPNAAVQARAKVQNRANVLQLKLMGQSHPTGLTNNLLKLFEPRPPLEYKP 
PPEKRKCPPYTGMAQFVSNFAEPGDPEYAPPKPEVELPSQKRERIHKLRLEKGVEKAAED 
LKKYDPNNDPNATGDPYKTLFVSRLNYESSESKIKREFESYGPIKRVHLVTDQLTNKPKG 
YAFIEYMHTRDMKAAYKQADGQKIDGRRVLVDVERGRTVPNWRPRRLGGGLGTSRVGGGE 
EIVGEQQPQGRTSQSEEPSRPREEREKSREKGKERERSRELSHEQPRERSRDRPREDKHH 
RDRDQGGRDRDRDSRRDRDRTRDRGDRDRRDRDRGRDRTSRDHDRDRSRKKERDYEGGEY 
EHEGGGRSRERDAEYRGEPEETRGYYEDDQGDTDRYSHRYDKMEEDDFRYEREYKRSKRS 
ESREYVR*
>AT3G50670.2 |  U1-70K (U1 SMALL NUCLEAR RIBONUCLEOPROTEIN-70K) RNA binding / nucleic acid binding / nucleotide binding 
MGDSGDPFLRNPNAAVQARAKVQNRANVLQLKLMGQSHPTGLTNNLLKLFEPRPPLEYKP 
PPEKRKCPPYTGMAQFVSNFAEPGDPEYAPPKPEVELPSQKRERIHKLRLEKGVEKAAED 
LKKYDPNNDPNATGDPYKTLFVSRLNYESSESKIKREFESYGPIKRVGYSEHSLAGSVRI 
CVMASLSRALCSICFILSTKVFQG*
>AT3G50670.2 |  U1-70K (U1 SMALL NUCLEAR RIBONUCLEOPROTEIN-70K) RNA binding / nucleotide binding 
MGDSGDPFLRNPNAAVQARAKVQNRANVLQLKLMGQSHPTGLTNNLLKLFEPRPPLEYKP 
PPEKRKCPPYTGMAQFVSNFAEPGDPEYAPPKPEVELPSQKRERIHKLRLEKGVEKAAED 
LKKYDPNNDPNATGDPYKTLFVSRLNYESSESKIKREFESYGPIKRVGYSEHSLAGSVRI 
CVMASLSRALCSICFILSTKVFQG*
>AT1G20580.1 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MSRSLGIPVKLLHEASGHIVTVELKSGELYRGSMIECEDNWNCQLEDITYTAKDGKVSQL 
EHVFIRGSKVRFMVIPDILKHAPMFKRLDARIKGKSSSLGVGRGRGAMRGKPAAGPGRGT 
GGRGAVPPVRR*
>AT1G28060.1 |  small nuclear ribonucleoprotein family protein / snRNP family protein 
MDKERYSRSHRDDRDRDSSPDHSPQREGGRRRDRDVDSKRRDSDHYRSSRRGDREDERDR 
TKDRRGRSVERGEREGSRDREKHHHERSHEGSKEKESRSKRKDREEENGARDGKKKSRFA 
DGNGERRSRFEDVAIEVENKDAQVSEGSGATNPTSGVTMGASTYSSIPSEASAAPSQTLL 
TKVSSISTTDENKASVVRSHEVPGKSSTDGRPLSTAGKSSANLPLDSSALAAKARKALQL 
QKGLADRLKNLPLLKKATKPTSEGSPHTRVPPSTTTPAVSTGTSFASTLPHTGLAGFGSI 
ANIEAVKRAQELAANMGFHQDREFAPVINLFPGQAPSDMTVAQRPEKPPVLRVDALGREI 
DEHGNVISVTKPSNLSTLKVNINKKKKDAFQILKPQLEADLKENPYFDTRMGIDEKKILR 
PKRMSFQFVEEGKWTRDAENLKFKSHFGEAKAKELKVKQAQLAKANDDINPNLIEVSERV 
PRKEKPKEPIPDVEWWDANVLTNGEYGEITDGTITESHLKIEKLTHYIEHPRPIEPPAEA 
APPPPQPLKLTKKEQKKLRTQRRLAKEKEKQEMIRQGLLEPPKAKVKMSNLMKVLGSEAT 
QDPTKLEKEIRTAAAEREQAHTDRNAARKLTPAEKREKKERKLFDDPTTVETIVSVYKIK 
KLSHPKTRFKVEMNARENRLTGCSVMTDEMSVVVVEGKSKAIKRYGKLMMKRINWEEAER 
KEGNEDEEEEVNGGNKCWLVWQGSIGKPSFHRFHVHECVTESTAKKVFMDAGVVHYWDLA 
VNYSDD*
>AT4G30220.1 |  RUXF (SMALL NUCLEAR RIBONUCLEOPROTEIN F) 
MATIPVNPKPFLNNLTGKTVIVKLKWGMEYKGFLASVDSYMNLQLGNTEEYIDGQLTGNL 
GEILIRCNNVLYVRGVPEDEELEDADQD*
>AT1G65700.1 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MAATTGLETLVDQIISVITNDGRNIVGVLKGFDQATNIILDESHERVFSTKEGVQQHVLG 
LYIIRGDNIGVIGELDEELDASLDFSKLRAHPLKPVVH*
>AT1G65700.1 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MAATTGLETLVDQIISVITNDGRNIVGVLKGFDQATNIILDESHERVFSTKEGVQQHVLG 
LYIIRGDNIGVIGELDEELDASLDFSKLRAHPLKPVVH*
>AT1G65700.2 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MAATTGLETLVDQIISVITNDGRNIVGVLKGFDQATNIILDESHERVFSTKEGVQQHVLG 
LYIIRGDNIGVIGELDEELDASLDFSKLRAHPLKPVVH*
>AT1G65700.2 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MAATTGLETLVDQIISVITNDGRNIVGVLKGFDQATNIILDESHERVFSTKEGVQQHVLG 
LYIIRGDNIGVIGELDEELDASLDFSKLRAHPLKPVVH*
>AT2G41500.1 |  EMB2776 nucleotide binding 
MEPNKDDNVSLAATAQISAPPVLQDASSLPGFSAIPPVVPPSFPPPMAPIPMMPHPPVAR 
PPTFRPPVSQNGGVKTSDSDSESDDEHIEISEESKQVRERQEKALQDLLVKRRAAAMAVP 
TNDKAVRDRLRRLGEPITLFGEQEMERRARLTQLLTRYDINGQLDKLVKDHEEDVTPKEE 
VDDEVLEYPFFTEGPKELREARIEIAKFSVKRAAVRIQRAKRRRDDPDEDMDAETKWALK 
HAKHMALDCSNFGDDRPLTGCSFSRDGKILATCSLSGVTKLWEMPQVTNTIAVLKDHKER 
ATDVVFSPVDDCLATASADRTAKLWKTDGTLLQTFEGHLDRLARVAFHPSGKYLGTTSYD 
KTWRLWDINTGAELLLQEGHSRSVYGIAFQQDGALAASCGLDSLARVWDLRTGRSILVFQ 
GHIKPVFSVNFSPNGYHLASGGEDNQCRIWDLRMRKSLYIIPAHANLVSQVKYEPQEGYF 
LATASYDMKVNIWSGRDFSLVKSLAGHESKVASLDITADSSCIATVSHDRTIKLWTSSGN 
DDEDEEKETMDIDL*
>AT1G72560.1 |  PSD (PAUSED) nucleobase nucleoside nucleotide and nucleic acid transmembrane transporter/ tRNA binding 
MDDLEQAIVISFETGAVDSALKSQAVTYCQQIKETPSICSICIEKLWFSKLVQVQFWCLQ 
TLQDVLRVKYGSMSLDEQSYVRKSVFSMACLEVIDNENAGRVVEGPPFVKNKLAQVLATL 
IYYEYPLIWSSVFLDFMLHLCKGAVVIDMFCRVLNALDDELISLDYPRTPEEISVAARVK 
DAMRQQCVPQIARAWYDIVSMYKNSDPDLSATVLDCMRRFVSWIDIGLVANDAFVPLLFE 
LILSDGLSEQVRGAAAGCVLAMVSKRMDPQSKLPLLQTLQISRVFGLVSGDVDSDLVSKV 
SALLTGYAVEVLECHKRLNSEDTKAVSMDLLNEVLPSVFYVMQKCEVDSTFSIVQFLLGY 
VSTLKGLPALKEKQLLHITQILEVIRIQICYDPMYRNNLNSLDKTGLEEEDRMSEFRKDL 
FVLLRTVGRVAPEVTQHFIRNSLANAVESSSESNVEEVEAALSLLYSFGESMTEEAMKTG 
SGCLSELIPMLLTTQFPGHSHRLVALVYLENITRYMKFIQENSQYIPNVLGAFLDDRGLH 
HQNFYVSRRAGYLFMRVVKLLKSKLVPFIDKILQNLQDTLSQLTTMNFASRELTGTEDGS 
HIFEAIGIIIGLEDVPAEKQSDYLSLLLTPLCQQIEAGLVQAKVASSEDFPVKIANIQFA 
IVAINALSKGFNERLVTASRPGIGLMFKQTLDVLLRVLIEFPKVEPLRSKVTSFIHRMVD 
TLGSAVFPYLPKALEQLLADSEPKEMVGFMVLLNQLICKFNSALHDILEEVYPVVAVRIF 
NVIPRDGLPSRPGAVTEEMRELIELQRMLYTFLHVIATHDLSSVFLTPKSRAYLDPMMQL 
VLNTSCNHKDITVRKACVQIFIKLIKDWCAEPYSEEKVPGFQNFVIEAFATNCCLYSVLD 
KSFNFSDANTHALFGEIITAQKVMYEKFGNTFLMHLMSKSFPSAHIPQDLAEQYCQKLQG 
NDIRSLKSYYQSLIENLRLQQNGSHVFR*
>AT1G72560.1 |  PSD (PAUSED) nucleobase nucleoside nucleotide and nucleic acid transmembrane transporter/ tRNA binding 
MDDLEQAIVISFETGAVDSALKSQAVTYCQQIKETPSICSICIEKLWFSKLVQVQFWCLQ 
TLQDVLRVKYGSMSLDEQSYVRKSVFSMACLEVIDNENAGRVVEGPPFVKNKLAQVLATL 
IYYEYPLIWSSVFLDFMLHLCKGAVVIDMFCRVLNALDDELISLDYPRTPEEISVAARVK 
DAMRQQCVPQIARAWYDIVSMYKNSDPDLSATVLDCMRRFVSWIDIGLVANDAFVPLLFE 
LILSDGLSEQVRGAAAGCVLAMVSKRMDPQSKLPLLQTLQISRVFGLVSGDVDSDLVSKV 
SALLTGYAVEVLECHKRLNSEDTKAVSMDLLNEVLPSVFYVMQKCEVDSTFSIVQFLLGY 
VSTLKGLPALKEKQLLHITQILEVIRIQICYDPMYRNNLNSLDKTGLEEEDRMSEFRKDL 
FVLLRTVGRVAPEVTQHFIRNSLANAVESSSESNVEEVEAALSLLYSFGESMTEEAMKTG 
SGCLSELIPMLLTTQFPGHSHRLVALVYLENITRYMKFIQENSQYIPNVLGAFLDDRGLH 
HQNFYVSRRAGYLFMRVVKLLKSKLVPFIDKILQNLQDTLSQLTTMNFASRELTGTEDGS 
HIFEAIGIIIGLEDVPAEKQSDYLSLLLTPLCQQIEAGLVQAKVASSEDFPVKIANIQFA 
IVAINALSKGFNERLVTASRPGIGLMFKQTLDVLLRVLIEFPKVEPLRSKVTSFIHRMVD 
TLGSAVFPYLPKALEQLLADSEPKEMVGFMVLLNQLICKFNSALHDILEEVYPVVAVRIF 
NVIPRDGLPSRPGAVTEEMRELIELQRMLYTFLHVIATHDLSSVFLTPKSRAYLDPMMQL 
VLNTSCNHKDITVRKACVQIFIKLIKDWCAEPYSEEKVPGFQNFVIEAFATNCCLYSVLD 
KSFNFSDANTHALFGEIITAQKVMYEKFGNTFLMHLMSKSFPSAHIPQDLAEQYCQKLQG 
NDIRSLKSYYQSLIENLRLQQNGSHVFR*
>AT1G72560.2 |  PSD (PAUSED) nucleobase nucleoside nucleotide and nucleic acid transmembrane transporter/ tRNA binding 
MDDLEQAIVISFETGAVDSALKSQAVTYCQQIKETPSICSICIEKLWFSKLVQVQFWCLQ 
TLQDVLRVKYGSMSLDEQSYVRKSVFSMACLEVIDNENAGRVVEGPPFVKNKLAQVLATL 
IYYEYPLIWSSVFLDFMLHLCKGAVVIDMFCRVLNALDDELISLDYPRTPEEISVAARVK 
DAMRQQCVPQIARAWYDIVSMYKNSDPDLSATVLDCMRRFVSWIDIGLVANDAFVPLLFE 
LILSDGLSEQVRGAAAGCVLAMVSKRMDPQSKLPLLQTLQISRVFGLVSGDVDSDLVSKV 
SALLTGYAVEVLECHKRLNSEDTKAVSMDLLNEVLPSVFYVMQKCEVDSTFSIVQFLLGY 
VSTLKGLPALKEKQLLHITQILEVIRIQICYDPMYRNNLNSLDKTGLEEEDRMSEFRKDL 
FVLLRTVGRVAPEVTQHFIRNSLANAVESSSESNVEEVEAALSLLYSFGESMTEEAMKTG 
SGCLSELIPMLLTTQFPGHSHRLVALVYLENITRYMKFIQENSQYIPNVLGAFLDDRGLH 
HQNFYVSRRAGYLFMRVVKLLKSKLVPFIDKILQNLQDTLSQLTTMNFASRELTGTEDGS 
HIFEAIGIIIGLEDVPAEKQSDYLSLLLTPLCQQIEAGLVQAKVASSEDFPVKIANIQFA 
IVAINALSKGFNERLVTASRPGIGLMFKQTLDVLLRVLIEFPKVEPLRSKVTSFIHRMVD 
TLGSAVFPYLPKALEQLLADSEPKEMVGFMVLLNQLICKFNSALHDILEEVYPVVAVRIF 
NVIPRDGLPSRPGAVTEEMRELIELQRMLYTFLHVIATHDLSSVFLTPKSRAYLDPMMQL 
VLNTSCNHKDITVRKACVQIFIKLIKDWCAEPYSEEKVPGFQNFVIEAFATNCCLYSVLD 
KSFNFSDANTHALFGEIITAQKVMYEKFGNTFLMHLMSKSFPSAHIPQDLAEQYCQKLQG 
NDIRSLKSYYQSLIENLRLQQNGSHVFR*
>AT1G72560.2 |  PSD (PAUSED) nucleobase nucleoside nucleotide and nucleic acid transmembrane transporter/ tRNA binding 
MDDLEQAIVISFETGAVDSALKSQAVTYCQQIKETPSICSICIEKLWFSKLVQVQFWCLQ 
TLQDVLRVKYGSMSLDEQSYVRKSVFSMACLEVIDNENAGRVVEGPPFVKNKLAQVLATL 
IYYEYPLIWSSVFLDFMLHLCKGAVVIDMFCRVLNALDDELISLDYPRTPEEISVAARVK 
DAMRQQCVPQIARAWYDIVSMYKNSDPDLSATVLDCMRRFVSWIDIGLVANDAFVPLLFE 
LILSDGLSEQVRGAAAGCVLAMVSKRMDPQSKLPLLQTLQISRVFGLVSGDVDSDLVSKV 
SALLTGYAVEVLECHKRLNSEDTKAVSMDLLNEVLPSVFYVMQKCEVDSTFSIVQFLLGY 
VSTLKGLPALKEKQLLHITQILEVIRIQICYDPMYRNNLNSLDKTGLEEEDRMSEFRKDL 
FVLLRTVGRVAPEVTQHFIRNSLANAVESSSESNVEEVEAALSLLYSFGESMTEEAMKTG 
SGCLSELIPMLLTTQFPGHSHRLVALVYLENITRYMKFIQENSQYIPNVLGAFLDDRGLH 
HQNFYVSRRAGYLFMRVVKLLKSKLVPFIDKILQNLQDTLSQLTTMNFASRELTGTEDGS 
HIFEAIGIIIGLEDVPAEKQSDYLSLLLTPLCQQIEAGLVQAKVASSEDFPVKIANIQFA 
IVAINALSKGFNERLVTASRPGIGLMFKQTLDVLLRVLIEFPKVEPLRSKVTSFIHRMVD 
TLGSAVFPYLPKALEQLLADSEPKEMVGFMVLLNQLICKFNSALHDILEEVYPVVAVRIF 
NVIPRDGLPSRPGAVTEEMRELIELQRMLYTFLHVIATHDLSSVFLTPKSRAYLDPMMQL 
VLNTSCNHKDITVRKACVQIFIKLIKDWCAEPYSEEKVPGFQNFVIEAFATNCCLYSVLD 
KSFNFSDANTHALFGEIITAQKVMYEKFGNTFLMHLMSKSFPSAHIPQDLAEQYCQKLQG 
NDIRSLKSYYQSLIENLRLQQNGSHVFR*
>AT2G33340.1 |  nucleotide binding / ubiquitin-protein ligase 
MNCAISGEVPVEPVVSTKSGLLFERRLIERHISDYGKCPVTGEPLTIDDIVPIKTGEIIK 
PKTLHTASIPGLLGTFQNEWDGLMLSNFALEQQLHTARQELSHALYQHDSACRVIARLKK 
ERDEARQLLAEVERHIPAAPEAVTANAALSNGKRAAVDEELGPDAKKLCPGISAEIITEL 
TDCNAALSQKRKKRQIPQTLASIDTLERFTQLSSHPLHKTNKPGICSMDILHSKDVIATG 
GVDATAVLFDRPSGQILSTLTGHSKKVTSVKFVGDSDLVLTASADKTVRIWRNPGDGNYA 
CGYTLNDHSAEVRAVTVHPTNKYFVSASLDGTWCFYDLSSGSCLAQVSDDSKNVDYTAAA 
FHPDGLILGTGTSQSVVKIWDVKSQANVAKFDGHTGEVTAISFSENGYFLATAAEDGVRL 
WDLRKLRNFKSFLSADANSVEFDPSGSYLGIAASDIKVYQTASVKAEWNLIKTLPDLSGT 
GKATCVKFGSDAQYVAVGSMDRNLRIFGLPGDEKANVDDDSAQDS*
>AT2G33340.1 |  nucleotide binding / ubiquitin-protein ligase 
MNCAISGEVPVEPVVSTKSGLLFERRLIERHISDYGKCPVTGEPLTIDDIVPIKTGEIIK 
PKTLHTASIPGLLGTFQNEWDGLMLSNFALEQQLHTARQELSHALYQHDSACRVIARLKK 
ERDEARQLLAEVERHIPAAPEAVTANAALSNGKRAAVDEELGPDAKKLCPGISAEIITEL 
TDCNAALSQKRKKRQIPQTLASIDTLERFTQLSSHPLHKTNKPGICSMDILHSKDVIATG 
GVDATAVLFDRPSGQILSTLTGHSKKVTSVKFVGDSDLVLTASADKTVRIWRNPGDGNYA 
CGYTLNDHSAEVRAVTVHPTNKYFVSASLDGTWCFYDLSSGSCLAQVSDDSKNVDYTAAA 
FHPDGLILGTGTSQSVVKIWDVKSQANVAKFDGHTGEVTAISFSENGYFLATAAEDGVRL 
WDLRKLRNFKSFLSADANSVEFDPSGSYLGIAASDIKVYQTASVKAEWNLIKTLPDLSGT 
GKATCVKFGSDAQYVAVGSMDRNLRIFGLPGDEKANVDDDSAQDS*
>AT2G33340.1 |  nucleotide binding / ubiquitin-protein ligase 
MNCAISGEVPVEPVVSTKSGLLFERRLIERHISDYGKCPVTGEPLTIDDIVPIKTGEIIK 
PKTLHTASIPGLLGTFQNEWDGLMLSNFALEQQLHTARQELSHALYQHDSACRVIARLKK 
ERDEARQLLAEVERHIPAAPEAVTANAALSNGKRAAVDEELGPDAKKLCPGISAEIITEL 
TDCNAALSQKRKKRQIPQTLASIDTLERFTQLSSHPLHKTNKPGICSMDILHSKDVIATG 
GVDATAVLFDRPSGQILSTLTGHSKKVTSVKFVGDSDLVLTASADKTVRIWRNPGDGNYA 
CGYTLNDHSAEVRAVTVHPTNKYFVSASLDGTWCFYDLSSGSCLAQVSDDSKNVDYTAAA 
FHPDGLILGTGTSQSVVKIWDVKSQANVAKFDGHTGEVTAISFSENGYFLATAAEDGVRL 
WDLRKLRNFKSFLSADANSVEFDPSGSYLGIAASDIKVYQTASVKAEWNLIKTLPDLSGT 
GKATCVKFGSDAQYVAVGSMDRNLRIFGLPGDEKANVDDDSAQDS*
>AT2G33340.2 |  nucleotide binding / ubiquitin-protein ligase 
MNCAISGEVPVEPVVSTKSGLLFERRLIERHISDYGKCPVTGEPLTIDDIVPIKTGEIIK 
PKTLHTASIPGLLGTFQNEWDGLMLSNFALEQQLHTARQELSHALYQHDSACRVIARLKK 
ERDEARQLLAEVERHIPAAPEAVTANAALSNGKRAAVDEELGPDAKKLCPGISAEIITEL 
TDCNAALSQKRKKRQIPQTLASIDTLERFTQLSSHPLHKTNKPGICSMDILHSKDVIATG 
GVDATAVLFDRPSGQILSTLTGHSKKVTSVKFVGDSDLVLTASADKTVRIWRNPGDGNYA 
CGYTLNDHSAEVRAVTVHPTNKYFVSASLDGTWCFYDLSSGSCLAQVSDDSKNVDYTAAA 
FHPDGLILGTGTSQSVVKIWDVKSQANVAKFDGHTGEVTAISFSENGYFLATAAEDGVRL 
WDLRKLRNFKSFLSADANSVEFDPSGSYLGIAASDIKVYQTASVKAEWNLIKTLPDLSGT 
GKATCVKFGSDAQYVAVGSMDRNLRIFGLPGDEKANVDDDSAQDS*
>AT2G33340.2 |  nucleotide binding / ubiquitin-protein ligase 
MNCAISGEVPVEPVVSTKSGLLFERRLIERHISDYGKCPVTGEPLTIDDIVPIKTGEIIK 
PKTLHTASIPGLLGTFQNEWDGLMLSNFALEQQLHTARQELSHALYQHDSACRVIARLKK 
ERDEARQLLAEVERHIPAAPEAVTANAALSNGKRAAVDEELGPDAKKLCPGISAEIITEL 
TDCNAALSQKRKKRQIPQTLASIDTLERFTQLSSHPLHKTNKPGICSMDILHSKDVIATG 
GVDATAVLFDRPSGQILSTLTGHSKKVTSVKFVGDSDLVLTASADKTVRIWRNPGDGNYA 
CGYTLNDHSAEVRAVTVHPTNKYFVSASLDGTWCFYDLSSGSCLAQVSDDSKNVDYTAAA 
FHPDGLILGTGTSQSVVKIWDVKSQANVAKFDGHTGEVTAISFSENGYFLATAAEDGVRL 
WDLRKLRNFKSFLSADANSVEFDPSGSYLGIAASDIKVYQTASVKAEWNLIKTLPDLSGT 
GKATCVKFGSDAQYVAVGSMDRNLRIFGLPGDEKANVDDDSAQDS*
>AT2G33340.2 |  nucleotide binding / ubiquitin-protein ligase 
MNCAISGEVPVEPVVSTKSGLLFERRLIERHISDYGKCPVTGEPLTIDDIVPIKTGEIIK 
PKTLHTASIPGLLGTFQNEWDGLMLSNFALEQQLHTARQELSHALYQHDSACRVIARLKK 
ERDEARQLLAEVERHIPAAPEAVTANAALSNGKRAAVDEELGPDAKKLCPGISAEIITEL 
TDCNAALSQKRKKRQIPQTLASIDTLERFTQLSSHPLHKTNKPGICSMDILHSKDVIATG 
GVDATAVLFDRPSGQILSTLTGHSKKVTSVKFVGDSDLVLTASADKTVRIWRNPGDGNYA 
CGYTLNDHSAEVRAVTVHPTNKYFVSASLDGTWCFYDLSSGSCLAQVSDDSKNVDYTAAA 
FHPDGLILGTGTSQSVVKIWDVKSQANVAKFDGHTGEVTAISFSENGYFLATAAEDGVRL 
WDLRKLRNFKSFLSADANSVEFDPSGSYLGIAASDIKVYQTASVKAEWNLIKTLPDLSGT 
GKATCVKFGSDAQYVAVGSMDRNLRIFGLPGDEKANVDDDSAQDS*
>AT2G33340.3 |  nucleotide binding / ubiquitin-protein ligase 
MNCAISGEVPVEPVVSTKSGLLFERRLIERHISDYGKCPVTGEPLTIDDIVPIKTGEIIK 
PKTLHTASIPGLLGTFQNEWDGLMLSNFALEQQLHTARQELSHALYQHDSACRVIARLKK 
ERDEARQLLAEVERHIPAAPEAVTANAALSNGKRAAVDEELGPDAKKLCPGISAEIITEL 
TDCNAALSQKRKKRQIPQTLASIDTLERFTQLSSHPLHKTNKPGICSMDILHSKDVIATG 
GVDATAVLFDRPSGQILSTLTGHSKKVTSVKFVGDSDLVLTASADKTVRIWRNPGDGNYA 
CGYTLNDHSAEVRAVTVHPTNKYFVSASLDGTWCFYDLSSGSCLAQVSDDSKNVDYTAAA 
FHPDGLILGTGTSQSVVKIWDVKSQANVAKFDGHTGEVTAISFSENGYFLATAAEDGVRL 
WDLRKLRNFKSFLSADANSVEFDPSGSYLGIAASDIKVYQTASVKAEWNLIKTLPDLSGT 
GKLRV*
>AT2G33340.3 |  nucleotide binding / ubiquitin-protein ligase 
MNCAISGEVPVEPVVSTKSGLLFERRLIERHISDYGKCPVTGEPLTIDDIVPIKTGEIIK 
PKTLHTASIPGLLGTFQNEWDGLMLSNFALEQQLHTARQELSHALYQHDSACRVIARLKK 
ERDEARQLLAEVERHIPAAPEAVTANAALSNGKRAAVDEELGPDAKKLCPGISAEIITEL 
TDCNAALSQKRKKRQIPQTLASIDTLERFTQLSSHPLHKTNKPGICSMDILHSKDVIATG 
GVDATAVLFDRPSGQILSTLTGHSKKVTSVKFVGDSDLVLTASADKTVRIWRNPGDGNYA 
CGYTLNDHSAEVRAVTVHPTNKYFVSASLDGTWCFYDLSSGSCLAQVSDDSKNVDYTAAA 
FHPDGLILGTGTSQSVVKIWDVKSQANVAKFDGHTGEVTAISFSENGYFLATAAEDGVRL 
WDLRKLRNFKSFLSADANSVEFDPSGSYLGIAASDIKVYQTASVKAEWNLIKTLPDLSGT 
GKLRV*
>AT2G33340.3 |  nucleotide binding / ubiquitin-protein ligase 
MNCAISGEVPVEPVVSTKSGLLFERRLIERHISDYGKCPVTGEPLTIDDIVPIKTGEIIK 
PKTLHTASIPGLLGTFQNEWDGLMLSNFALEQQLHTARQELSHALYQHDSACRVIARLKK 
ERDEARQLLAEVERHIPAAPEAVTANAALSNGKRAAVDEELGPDAKKLCPGISAEIITEL 
TDCNAALSQKRKKRQIPQTLASIDTLERFTQLSSHPLHKTNKPGICSMDILHSKDVIATG 
GVDATAVLFDRPSGQILSTLTGHSKKVTSVKFVGDSDLVLTASADKTVRIWRNPGDGNYA 
CGYTLNDHSAEVRAVTVHPTNKYFVSASLDGTWCFYDLSSGSCLAQVSDDSKNVDYTAAA 
FHPDGLILGTGTSQSVVKIWDVKSQANVAKFDGHTGEVTAISFSENGYFLATAAEDGVRL 
WDLRKLRNFKSFLSADANSVEFDPSGSYLGIAASDIKVYQTASVKAEWNLIKTLPDLSGT 
GKLRV*
>AT1G80070.1 |  SUS2 (ABNORMAL SUSPENSOR 2) 
MWNNNDGMPLAPPGTGGSMMPPPPAAHPSYTALPPPSNPTPPVEPTPEEAEAKLEEKARK 
WMQLNSKRYGDKRKFGFVETQKEDMPPEHVRKIIRLVFFSSFSTISKYSLLDNYFLARDH 
GDMSSKKFRHDKRVYLGALKFVPHAVFKLLENMPMPWEQVRDVKVLYHITGAITFVNEIP 
WVVEPIYMAQWGTMWIMMRREKRDRRHFKRMRFPPFDDEEPPLDYADNLLDVDPLEPIQL 
ELDEEEDSAVHTWFYDHKPLVKTKLINGPSYRRWNLSLPIMATLHRLAGQLLSDLIDRNY 
FYLFDMPSFFTAKALNMCIPGGPKFEPLYRDMEKGDEDWNEFNDINKLIIRSPLRTEYRI 
AFPHLYNNRPRKVKLCVYHSPMIMYIKTEDPDLPAFYYDPLIHPISNTNKEKRERKVYDD 
EDDFALPEGVEPLLRDTQLYTDTTAAGISLLFAPRPFNMRSGRTRRAEDIPLVSEWFKEH 
CPPAYPVKVRVSYQKLLKCYVLNELHHRPPKAQKKKHLFRSLAATKFFQSTELDWVEVGL 
QVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVKTLTTKERKKSRFGNAFHLCREILRLT 
KLVVDANVQFRLGNVDAFQLADGLQYIFSHVGQLTGMYRYKYRLMRQIRMCKDLKHLIYY 
RFNTGPVGKGPGCGFWAPMWRVWLFFLRGIVPLLERWLGNLLARQFEGRHSKGVAKTVTK 
QRVESHFDLELRAAVMHDVLDAMPEGIKQNKARTILQHLSEAWRCWKANIPWKVPGLPVP 
IENMILRYVKSKADWWTNVAHYNRERIRRGATVDKTVCRKNLGRLTRLWLKAEQERQHNY 
LKDGPYVTPEEALAIYTTTVHWLESRKFSPIPFPPLSYKHDTKLLILALERLKESYSVAV 
RLNQQQREELGLIEQAYDNPHEALSRIKRHLLTQRGFKEVGIEFMDLYSYLIPVYEIEPL 
EKITDAYLDQYLWYEGDKRHLFPNWIKPADSEPPPLLVYKWCQGINNLQGIWDTGDGQCV 
VMLQTKFEKFFEKIDLTMLNRLLRLVLDHNIADYVSAKNNVVLSYKDMSHTNSYGLIRGL 
QFASFVVQFYGLLLDLLLLGLTRASEIAGPPQMPNEFMTFWDTKVETRHPIRLYSRYIDK 
VHIMFKFTHEEARDLIQRYLTEHPDPNNENMVGYNNKKCWPRDARMRLMKHDVNLGRSVF 
WDMKNRLPRSITTLEWENGFVSVYSKDNPNLLFSMCGFEVRILPKIRMTQEAFSNTKDGV 
WNLQNEQTKERTAVAFLRVDDEHMKVFENRVRQILMSSGSTTFTKIVNKWNTALIGLMTY 
FREATVHTQELLDLLVKCENKIQTRIKIGLNSKMPSRFPPVIFYTPKEIGGLGMLSMGHI 
LIPQSDLRYSKQTDVGVTHFRSGMSHEEDQLIPNLYRYIQPWESEFIDSQRVWAEYALKR 
QEAQAQNRRLTLEDLEDSWDRGIPRINTLFQKDRHTLAYDKGWRVRTDFKQYQVLKQNPF 
WWTHQRHDGKLWNLNNYRTDVIQALGGVEGILEHTLFKGTYFPTWEGLFWEKASGFEESM 
KYKKLTNAQRSGLNQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGIFMHGKIPTLKISL 
IQIFRAHLWQKIHESVVMDLCQVLDQELDALEIETVQKETIHPRKSYKMNSSCADVLLFA 
AHKWPMSKPSLVAESKDMFDQKASNKYWIDVQLRWGDYDSHDIERYTRAKFMDYTTDNMS 
IYPSPTGVMIGLDLAYNLHSAFGNWFPGSKPLLAQAMNKIMKSNPALYVLRERIRKGLQL 
YSSEPTEPYLSSQNYGEIFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPR 
TGQLFLKVIHTSVWAGQKRLGQLAKWKTAEEVAALVRSLPVEEQPKQIIVTRKGMLDPLE 
VHLLDFPNIVIKGSELQLPFQACLKIEKFGDLILKATEPQMVLFNIYDDWLKSISSYTAF 
SRLILILRALHVNNEKAKMLLKPDKSVVTEPHHIWPSLTDDQWMKVEVALRDLILSDYAK 
KNNVNTSALTQSEIRDIILGAEITPPSQQRQQIAEIEKQAKEASQLTAVTTRTTNVHGDE 
LIVTTTSPYEQSAFGSKTDWRVRAISATNLYLRVNHIYVNSDDIKETGYTYIMPKNILKK 
FICVADLRTQIAGYLYGISPPDNPQVKEIRCVVMVPQWGNHQLVHLPSSLPEHDFLNDLE 
PLGWLHTQPNELPQLSPQDVTSHSRILENNKQWDGEKCIILTCSFTPGSCSLTSYKLTQT 
GYEWGRLNKDNGSNPHGYLPTHYEKVQMLLSDRFLGFYMVPESGPWNYSFTGVKHTLSMK 
YSVKLGSPKEFYHEEHRPTHFLEFSNMEEADITEGDREDTFT*
>AT5G62290.1 |  nucleotide-sensitive chloride conductance regulator (ICln) family protein 
MVAGLRDFTLRTEDGSGKPVLDESNGEELMHVQTSVAVALGNRPIESPGTLYITSRKLIW 
LSDVDMAKGYAVDFLSISLHAVSRDPEAYSSPCIYTQIEVEEDEDDESDSESTEVLDLSK 
IREMRLVPSDSTQLETLFDVFCECAELNPEPVQEEEEESGHNWVFSADQMDVRGGDDDAE 
WQISQSPTSVIGHSNGDEGLNQPMLELQINDQRFEDAEEMVHESETKDH*
>AT5G62290.1 |  nucleotide-sensitive chloride conductance regulator (ICln) family protein 
MVAGLRDFTLRTEDGSGKPVLDESNGEELMHVQTSVAVALGNRPIESPGTLYITSRKLIW 
LSDVDMAKGYAVDFLSISLHAVSRDPEAYSSPCIYTQIEVEEDEDDESDSESTEVLDLSK 
IREMRLVPSDSTQLETLFDVFCECAELNPEPVQEEEEESGHNWVFSADQMDVRGGDDDAE 
WQISQSPTSVIGHSNGDEGLNQPMLELQINDQRFEDAEEMVHESETKDH*
>AT5G62290.2 |  nucleotide-sensitive chloride conductance regulator (ICln) family protein 
MVAGLRDFTLRTEDGSGKPVLDESNGEELMHVQTSVAVALGNRPIESPGTLYITSRKLIW 
LSDVDMAKGYAVDFLSISLHAVSRDPEAYSSPCIYTQIEVEEDEDDESDSESTEVLDLSK 
IREMRLVPSDSTQLETLFDVFCECAELNPEPVQEEEESGHNWVFSADQMDVRGGDDDAEW 
QISQSPTSVIGHSNGDEGLNQPMLELQINDQRFEDAEEMVHESETKDH*
>AT5G62290.2 |  nucleotide-sensitive chloride conductance regulator (ICln) family protein 
MVAGLRDFTLRTEDGSGKPVLDESNGEELMHVQTSVAVALGNRPIESPGTLYITSRKLIW 
LSDVDMAKGYAVDFLSISLHAVSRDPEAYSSPCIYTQIEVEEDEDDESDSESTEVLDLSK 
IREMRLVPSDSTQLETLFDVFCECAELNPEPVQEEEESGHNWVFSADQMDVRGGDDDAEW 
QISQSPTSVIGHSNGDEGLNQPMLELQINDQRFEDAEEMVHESETKDH*
>AT4G31120.1 |  SKB1 (SHK1 BINDING PROTEIN 1) protein methyltransferase 
MPLGERGGWERTESRYCGVETDFSNDVTHLLNFNISTGGFDYVLAPLVDPSYRPSLVEGN 
GVDTQVLPVCGSDLVLSPSQWSSHVVGKISSWIDLDSEDEVLRMDSETTLKQEIAWATHL 
SLQACLLPTPKGKSCANYARCVNQILQGLTTLQLWLRVPLVKSEGDSMDDTSEGLNDSWE 
LWNSFRLLCEHDSKLSVALDVLSTLPSETSLGRWMGESVRAAILSTDAFLTNARGYPCLS 
KRHQKLIAGFFDHAAQVVICGKPVHNLQKPLDSSSEGTEKNPLRIYLDYVAYLFQKMESL 
SEQERIELGYRDFLQAPLQPLMDNLEAQTYETFERDSVKYIQYQRAVEKALVDRVPDEKA 
SELTTVLMVVGAGRGPLVRASLQAAEETDRKLKVYAVEKNPNAVVTLHNLVKMEGWEDVV 
TIISCDMRFWNAPEQADILVSELLGSFGDNELSPECLDGAQRFLKPDGISIPSSYTSFIQ 
PITASKLYNDVKAHKDLAHFETAYVVKLHSVAKLAPSQSVFTFTHPNFSTKVNNQRYKKL 
QFSLPSDAGSALVHGFAGYFDSVLYKDVHLGIEPTTATPNMFSWFPIFFPLRKPVEVHPD 
TPLEVHFWRCCGSSKVWYEWSVSSPTPSPMHNTNGRSYWVGL*
>AT4G31120.1 |  SKB1 (SHK1 BINDING PROTEIN 1) protein methyltransferase 
MPLGERGGWERTESRYCGVETDFSNDVTHLLNFNISTGGFDYVLAPLVDPSYRPSLVEGN 
GVDTQVLPVCGSDLVLSPSQWSSHVVGKISSWIDLDSEDEVLRMDSETTLKQEIAWATHL 
SLQACLLPTPKGKSCANYARCVNQILQGLTTLQLWLRVPLVKSEGDSMDDTSEGLNDSWE 
LWNSFRLLCEHDSKLSVALDVLSTLPSETSLGRWMGESVRAAILSTDAFLTNARGYPCLS 
KRHQKLIAGFFDHAAQVVICGKPVHNLQKPLDSSSEGTEKNPLRIYLDYVAYLFQKMESL 
SEQERIELGYRDFLQAPLQPLMDNLEAQTYETFERDSVKYIQYQRAVEKALVDRVPDEKA 
SELTTVLMVVGAGRGPLVRASLQAAEETDRKLKVYAVEKNPNAVVTLHNLVKMEGWEDVV 
TIISCDMRFWNAPEQADILVSELLGSFGDNELSPECLDGAQRFLKPDGISIPSSYTSFIQ 
PITASKLYNDVKAHKDLAHFETAYVVKLHSVAKLAPSQSVFTFTHPNFSTKVNNQRYKKL 
QFSLPSDAGSALVHGFAGYFDSVLYKDVHLGIEPTTATPNMFSWFPIFFPLRKPVEVHPD 
TPLEVHFWRCCGSSKVWYEWSVSSPTPSPMHNTNGRSYWVGL*
>AT4G31120.2 |  SKB1 (SHK1 BINDING PROTEIN 1) protein methyltransferase 
MPLGERGGWERTESRYCGVETDFSNDVTHLLNFNISTGGFDYVLAPLVDPSYRPSLVEGN 
GVDTQVLPVCGSDLVLSPSQWSSHVVGKISSWIDLDSEDEVLRMDSETTLKQEIAWATHL 
SLQACLLPTPKGKSCANYARCVNQILQGLTTLQLWLRVPLVKSEGDSMDDTSEGLNDSWE 
LWNSFRLLCEHDSKLSVALDVLSTLPSETSLGRWMGESVRAAILSTDAFLTNARGYPCLS 
KRHQKLIAGFFDHAAQVVICGKPVHNLQKPLDSSSEGTEKNPLRIYLDYVAYLFQKMESL 
SEQERIELGYRDFLQAPLQPLMDNLEAQTYETFERDSVKYIQYQRAVEKALVDRVPDEKA 
SELTTVLMVVGAGRGPLVRASLQAAEETDRKLKVYAVEKNPNAVVTLHNLVKMEGWEDVV 
TIISCDMRFWNAPEQADILVSELLGSFGDNELSPECLDGAQRFLKPDGISIPSSYTSFIQ 
PITASKLYNDVKAHKDLAHFETAYVVKLHSVAKLAPSQSVFTFTHPNFSTKVNNQRYKKL 
QFSLPSDAGSALVHGFAGYFDSVLYKDVHLGIEPTTATPNMFSW*
>AT4G31120.2 |  SKB1 (SHK1 BINDING PROTEIN 1) protein methyltransferase 
MPLGERGGWERTESRYCGVETDFSNDVTHLLNFNISTGGFDYVLAPLVDPSYRPSLVEGN 
GVDTQVLPVCGSDLVLSPSQWSSHVVGKISSWIDLDSEDEVLRMDSETTLKQEIAWATHL 
SLQACLLPTPKGKSCANYARCVNQILQGLTTLQLWLRVPLVKSEGDSMDDTSEGLNDSWE 
LWNSFRLLCEHDSKLSVALDVLSTLPSETSLGRWMGESVRAAILSTDAFLTNARGYPCLS 
KRHQKLIAGFFDHAAQVVICGKPVHNLQKPLDSSSEGTEKNPLRIYLDYVAYLFQKMESL 
SEQERIELGYRDFLQAPLQPLMDNLEAQTYETFERDSVKYIQYQRAVEKALVDRVPDEKA 
SELTTVLMVVGAGRGPLVRASLQAAEETDRKLKVYAVEKNPNAVVTLHNLVKMEGWEDVV 
TIISCDMRFWNAPEQADILVSELLGSFGDNELSPECLDGAQRFLKPDGISIPSSYTSFIQ 
PITASKLYNDVKAHKDLAHFETAYVVKLHSVAKLAPSQSVFTFTHPNFSTKVNNQRYKKL 
QFSLPSDAGSALVHGFAGYFDSVLYKDVHLGIEPTTATPNMFSW*
>AT1G14640.1 |  SWAP (Suppressor-of-White-APricot)/surp domain-containing protein 
MFNSMKILPLEAPPADGNLGPLPPSQLTDEEIKENEFQGEQNNSIQTPIAVATHTNPIGI 
IYPPPEIRKIVETTAQFVSQNGLAFGNKVKTEKANNANFSFLKSDNPYHGFYRYKVTEYS 
CHIRDGAQGTDVDDTEDPKLDDESDAKPDLQAQFRAPRKILEAPEPEKYTVRLPEGIMEA 
ELDIIKHTAQFVARNGQSFLRELMRREVNNSQFQFMKPTHSMFTFFTSLVDAYSEVLMPP 
RDLKEKLRKSVADLTTVLERCLNRLEWDRFQEEEKNKEEDEKEKERVQMVMIDWKDFAVV 
ESIDFADEEDKDLPMPMTLEEVIRRSKVSAMEEDEIVEPGKEVEMDMDEEEVKLVAEGMR 
AANLEEYVGSVEIEEEAPMRIVKNWKRPEDRFLTERDSSKVVISRITGELIPITEMSEHM 
RISLIDPKFKEQKDRMFAKIRETTLAQDDEIAKNIVGLARLRPDIFGTTEEEVSNAVKAD 
IEKKDEQPKQVIWDGHTGSIGRTANQALTQNSNGEQGDGVYGDPNSFPGPAAFPPPRPGV 
PTVRPLPPPQNLALNLPRPPPSVQYPGAPRPLGVPMMQPMYQQHQLSMSGPHGHPSMMMS 
RPPQMQPVMRVPPPPGSQFSHMQVPQPYGQLPPLSMGMMQPPPMAEMPPPPPPGEAPPPL 
PEEPEPKRQKLDESALVPEDQFLAQHPGPATIRVSKPNENDGQVMEITVQSLSENVGSLK 
EKIAGEMQIPANKQK*
>AT1G60170.1 |  emb1220 (embryo defective 1220) 
MATLEDSFLADLDELSDNEAELDENDGDVGKEEEDVDMDMADLETLNYDDLDNVSKLQKS 
QRYADIMHKVEEALGKDSDGAEKGTVLEDDPEYKLIVDCNQLSVDIENEIVIVHNFIKDK 
YKLKFQELESLVHHPIDYACVVKKIGNETDLALVDLADLLPSAIIMVVSVTALTTKGSAL 
PEDVLQKVLEACDRALDLDSARKKVLEFVESKMGSIAPNLSAIVGSAVAAKLMGTAGGLS 
ALAKMPACNVQVLGHKRKNLAGFSSATSQSRVGYLEQTEIYQSTPPGLQARAGRLVAAKS 
TLAARVDATRGDPLGISGKAFREEIRKKIEKWQEPPPARQPKPLPVPDSEPKKRRGGRRL 
RKMKERYQVTDMRKLANRMAFGTPEESSLGDGLGEGYGMLGQAGSNRLRVSSVPSKLKIN 
AKVAKKLKERQYAGGATTSGLTSSLAFTPVQGIELCNPQQALGLGSGTQSTYFSESGTFS 
KLKKI*
>AT1G65660.1 |  SMP1 (SWELLMAP 1) nucleic acid binding / single-stranded RNA binding 
MATASVAFKSREDHRKQIELEEARKAGLAPAEVDEDGKEINPHIPQYMSSAPWYLNSEKP 
SLKHQRKWKSDPNYTKSWYDRGAKIFQAEKYRKGACQNCGAMTHTAKACMDRPRKIGAKY 
TNMNIAPDEKIESFELDYDGKRDRWNGYDPSTYHRVIDLYEAKEDARKKYLKEQQLKKLE 
EKNNNEKGDDANSDGEEDEDDLRVDEAKVDESRQMDFAKVEKRVRTTGGGSTGTVRNLRI 
REDTAKYLLNLDVNSAHYDPKTRSMREDPLPDADPNDKFYLGDNQYRNSGQALEFKQLNI 
HSWEAFDKGQDMHMQAAPSQAELLYKSFQVAKEKLKSQTKDTIMDKYGNAATEDEIPMEL 
LLGQSERQVEYDRAGRIIKGQEVILPKSKYEEDVHANNHTSVWGSYWKDHQWGYKCCQQI 
IRNSYCTGSAGIEAAEAALDLMKANIARKEATEESPKKVEEKRMASWGTDIPEDLELNEE 
ALANALKKEDLSRREEKDERKRKYNVKYNNDVTPEEMEAYRMKRVHHEDPMKDFL*
>AT3G55220.1 |  splicing factor putative 
MYLYSLTLQQATGIVCAINGNFSGGKTQEIAVARGKILDLLRPDENGKIQTIHSVEVFGA 
IRSLAQFRLTGAQKDYIVVGSDSGRIVILEYNKEKNVFDKVHQETFGKSGCRRIVPGQYV 
AVDPKGRAVMIGACEKQKLVYVLNRDTTARLTISSPLEAHKSHTICYSLCGVDCGFDNPI 
FAAIELDYSEADQDPTGQAASEAQKHLTFYELDLGLNHVSRKWSNPVDNGANMLVTVPGG 
ADGPSGVLVCAENFVIYMNQGHPDVRAVIPRRTDLPAERGVLVVSAAVHKQKTMFFFLIQ 
TEYGDVFKVTLDHNGDHVSELKVKYFDTIPVASSICVLKLGFLFSASEFGNHGLYQFQAI 
GEEPDVESSSSNLMETEEGFQPVFFQPRRLKNLVRIDQVESLMPLMDMKVLNIFEEETPQ 
IFSLCGRGPRSSLRILRPGLAITEMAVSQLPGQPSAVWTVKKNVSDEFDAYIVVSFTNAT 
LVLSIGEQVEEVNDSGFLDTTPSLAVSLIGDDSLMQVHPNGIRHIREDGRINEWRTPGKR 
SIVKVGYNRLQVVIALSGGELIYFEADMTGQLMEVEKHEMSGDVACLDIAPVPEGRKRSR 
FLAVGSYDNTVRILSLDPDDCLQILSVQSVSSAPESLLFLEVQASIGGDDGADHPANLFL 
NSGLQNGVLFRTVVDMVTGQLSDSRSRFLGLKPPKLFSISVRGRSAMLCLSSRPWLGYIH 
RGHFHLTPLSYETLEFAAPFSSDQCAEGVVSVAGDALRIFMIDRLGETFNETVVPLRYTP 
RKFVLHPKRKLLVIIESDQGAFTAEEREAARKECFEAGGVGENGNGNADQMENGADDEDK 
EDPLSDEQYGYPKAESEKWVSCIRVLDPKTATTTCLLELQDNEAAYSVCTVNFHDKEYGT 
LLAVGTVKGMQFWPKKNLVAGFIHIYRFVEDGKSLELLHKTQVEGVPLALCQFQGRLLAG 
IGPVLRLYDLGKKRLLRKCENKLFPNTIISIQTYRDRIYVGDIQESFHYCKYRRDENQLY 
IFADDCVPRWLTASHHVDFDTMAGADKFGNVYFVRLPQDLSEEIEEDPTGGKIKWEQGKL 
NGAPNKVDEIVQFHVGDVVTCLQKASMIPGGSESIMYGTVMGSIGALHAFTSRDDVDFFS 
HLEMHMRQEYPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLPMDLQRKIADELDRTPA 
EILKKLEDARNKII*
>AT1G66510.1 |  AAR2 protein family 
MDSEKALELVKHGATLLFLDVPQYTLVGIDTQIFAVGPAFKGIKMIPPGIHFVFYSSSTR 
DGREFSPTIGFFVDVAPSQVIVRKWNQQDEWLTKVSEEEEERYSQAVRSLEFDKNLGPYN 
LKQYGEWRHLSNYITKDVVEKFEPVGGEITVTYESAILKGGPKTAMEIALDTQMKKSKFT 
TSSTEQPKGNRFYYTSIPRIIKHKGMSGQELTSMNLDKTQLLESVLSKEYKDSEDLLLGE 
LQFSFVAFLMGQSLESFMQWKSIVSLLLGCTSAPFQTRSQLFTKFIKVIYHQLKYGLQKE 
NSGPETGIHALLDDSWLASDSFLHFLCKDFFALVEETSVVDGDLLSWTRKFKELLENRLG 
WEFQKKSAVDGIYFEEDDEYAPVVEMLDESHGEYMDKTT*
>AT1G66510.1 |  AAR2 protein family 
MDSEKALELVKHGATLLFLDVPQYTLVGIDTQIFAVGPAFKGIKMIPPGIHFVFYSSSTR 
DGREFSPTIGFFVDVAPSQVIVRKWNQQDEWLTKVSEEEEERYSQAVRSLEFDKNLGPYN 
LKQYGEWRHLSNYITKDVVEKFEPVGGEITVTYESAILKGGPKTAMEIALDTQMKKSKFT 
TSSTEQPKGNRFYYTSIPRIIKHKGMSGQELTSMNLDKTQLLESVLSKEYKDSEDLLLGE 
LQFSFVAFLMGQSLESFMQWKSIVSLLLGCTSAPFQTRSQLFTKFIKVIYHQLKYGLQKE 
NSGPETGIHALLDDSWLASDSFLHFLCKDFFALVEETSVVDGDLLSWTRKFKELLENRLG 
WEFQKKSAVDGIYFEEDDEYAPVVEMLDESHGEYMDKTT*
>AT1G66510.1 |  AAR2 protein family 
MDSEKALELVKHGATLLFLDVPQYTLVGIDTQIFAVGPAFKGIKMIPPGIHFVFYSSSTR 
DGREFSPTIGFFVDVAPSQVIVRKWNQQDEWLTKVSEEEEERYSQAVRSLEFDKNLGPYN 
LKQYGEWRHLSNYITKDVVEKFEPVGGEITVTYESAILKGGPKTAMEIALDTQMKKSKFT 
TSSTEQPKGNRFYYTSIPRIIKHKGMSGQELTSMNLDKTQLLESVLSKEYKDSEDLLLGE 
LQFSFVAFLMGQSLESFMQWKSIVSLLLGCTSAPFQTRSQLFTKFIKVIYHQLKYGLQKE 
NSGPETGIHALLDDSWLASDSFLHFLCKDFFALVEETSVVDGDLLSWTRKFKELLENRLG 
WEFQKKSAVDGIYFEEDDEYAPVVEMLDESHGEYMDKTT*
>AT1G66510.3 |  AAR2 protein family 
MIPPGIHFVFYSSSTRDGREFSPTIGFFVDVAPSQVIVRKWNQQDEWLTKVSEEEEERYS 
QAVRSLEFDKNLGPYNLKQYGEWRHLSNYITKDVVEKFEPVGGEITVTYESAILKGGPKT 
AMEIALDTQMKKSKFTTSSTEQPKGNRFYYTSIPRIIKHKGMSGQELTSMNLDKTQLLES 
VLSKEYKDSEDLLLGELQFSFVAFLMGQSLESFMQWKSIVSLLLGCTSAPFQTRSQLFTK 
FIKVIYHQLKYGLQKENSGPETGIHALLDDSWLASDSFLHFLCKDFFALVEETSVVDGDL 
LSWTRKFKELLENRLGWEFQKKSAVDGIYFEEDDEYAPVVEMLDESHGEYMDKTT*
>AT1G66510.3 |  AAR2 protein family 
MIPPGIHFVFYSSSTRDGREFSPTIGFFVDVAPSQVIVRKWNQQDEWLTKVSEEEEERYS 
QAVRSLEFDKNLGPYNLKQYGEWRHLSNYITKDVVEKFEPVGGEITVTYESAILKGGPKT 
AMEIALDTQMKKSKFTTSSTEQPKGNRFYYTSIPRIIKHKGMSGQELTSMNLDKTQLLES 
VLSKEYKDSEDLLLGELQFSFVAFLMGQSLESFMQWKSIVSLLLGCTSAPFQTRSQLFTK 
FIKVIYHQLKYGLQKENSGPETGIHALLDDSWLASDSFLHFLCKDFFALVEETSVVDGDL 
LSWTRKFKELLENRLGWEFQKKSAVDGIYFEEDDEYAPVVEMLDESHGEYMDKTT*
>AT1G66510.3 |  AAR2 protein family 
MIPPGIHFVFYSSSTRDGREFSPTIGFFVDVAPSQVIVRKWNQQDEWLTKVSEEEEERYS 
QAVRSLEFDKNLGPYNLKQYGEWRHLSNYITKDVVEKFEPVGGEITVTYESAILKGGPKT 
AMEIALDTQMKKSKFTTSSTEQPKGNRFYYTSIPRIIKHKGMSGQELTSMNLDKTQLLES 
VLSKEYKDSEDLLLGELQFSFVAFLMGQSLESFMQWKSIVSLLLGCTSAPFQTRSQLFTK 
FIKVIYHQLKYGLQKENSGPETGIHALLDDSWLASDSFLHFLCKDFFALVEETSVVDGDL 
LSWTRKFKELLENRLGWEFQKKSAVDGIYFEEDDEYAPVVEMLDESHGEYMDKTT*
>AT1G66510.2 |  AAR2 protein family 
MDSEKALELVKHGATLLFLDVPQYTLVGIDTQIFAVGPAFKGIKMIPPGIHFVFYSSSTR 
DGREFSPTIGFFVDVAPSQVIVRKWNQQDEWLTKVSEEEEERYSQAVRSLEFDKNLGPYN 
LKQYGEWRHLSNYITKDVVEKFEPVGGEITVTYESAILKGGPKTAMEIALDTQMKKSKFT 
TSSTEQPKGNRFYYTSIPRIIKHKGMSGQELTSMNLDKTQLLESVLSKEYKDSEDLLLGE 
LQFSFVAFLMGQSLESFMQWKSIVSLLLGCTSAPFQTRSQLFTKFIKVIYHQLKYGLQKE 
NSGPETGIHALLDDSWLASDSFLHFLCKDFFALVEETSVVDGDLLSWTRKFKELLENRLG 
WEFQKKSAVDGIYFEEDDEYAPVVEMLDESHGEYMDKTT*
>AT1G66510.2 |  AAR2 protein family 
MDSEKALELVKHGATLLFLDVPQYTLVGIDTQIFAVGPAFKGIKMIPPGIHFVFYSSSTR 
DGREFSPTIGFFVDVAPSQVIVRKWNQQDEWLTKVSEEEEERYSQAVRSLEFDKNLGPYN 
LKQYGEWRHLSNYITKDVVEKFEPVGGEITVTYESAILKGGPKTAMEIALDTQMKKSKFT 
TSSTEQPKGNRFYYTSIPRIIKHKGMSGQELTSMNLDKTQLLESVLSKEYKDSEDLLLGE 
LQFSFVAFLMGQSLESFMQWKSIVSLLLGCTSAPFQTRSQLFTKFIKVIYHQLKYGLQKE 
NSGPETGIHALLDDSWLASDSFLHFLCKDFFALVEETSVVDGDLLSWTRKFKELLENRLG 
WEFQKKSAVDGIYFEEDDEYAPVVEMLDESHGEYMDKTT*
>AT1G66510.2 |  AAR2 protein family 
MDSEKALELVKHGATLLFLDVPQYTLVGIDTQIFAVGPAFKGIKMIPPGIHFVFYSSSTR 
DGREFSPTIGFFVDVAPSQVIVRKWNQQDEWLTKVSEEEEERYSQAVRSLEFDKNLGPYN 
LKQYGEWRHLSNYITKDVVEKFEPVGGEITVTYESAILKGGPKTAMEIALDTQMKKSKFT 
TSSTEQPKGNRFYYTSIPRIIKHKGMSGQELTSMNLDKTQLLESVLSKEYKDSEDLLLGE 
LQFSFVAFLMGQSLESFMQWKSIVSLLLGCTSAPFQTRSQLFTKFIKVIYHQLKYGLQKE 
NSGPETGIHALLDDSWLASDSFLHFLCKDFFALVEETSVVDGDLLSWTRKFKELLENRLG 
WEFQKKSAVDGIYFEEDDEYAPVVEMLDESHGEYMDKTT*
>AT2G32170.1 |  LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321603) Has 362 Blast hits to 319 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 132 Fungi - 130 Plants - 25 Viruses - 0 Other Eukaryotes - 75 (source NCBI BLink) 
MVSPSEMTETAPAIMTTESERIESSRELVDNEEEEKIRRQKKLEEALEAKSLRRIISAYL 
NYPEASEEDLKRWERSYRKLSPAHKALVPHYPMKFQRLRRCISANSYFIFNMLQAFEPPI 
DLSQELDGCEDSNLDCAPHERYTLDERHDSSCQPALTNSCTYKEESKHIRDPITGVSIEE 
LQRKEAHDHSPKDDSADTRINDKTCDCHEGQLNHDHGSVSFSSHDWLDSSLQTHVPLVDV 
DKVRCIIRNIVRDWAAEGQRERDQCYKPILEELDSLFPDRLKESTPPACLVPGAGLGRLA 
LEISCLGFISQGNEFSYYMMICSSFILNYTQVPGEWTIYPWIHSNCNSLSDNDQLRPIAI 
PDIHPASAGITEGFSMCGGDFVEVYNESSHAGMWDAVVTCFFIDTAHNVIEYIQTISKIL 
KDGGVWINLGPLLYHFADTYGHENEMSIELSLEDVKRVASHFGFVIEKERTIETTYTTNP 
RAMMQNRYYTAFWTMRKKCAITTT*
>AT2G32170.1 |  LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321603) 
MVSPSEMTETAPAIMTTESERIESSRELVDNEEEEKIRRQKKLEEALEAKSLRRIISAYL 
NYPEASEEDLKRWERSYRKLSPAHKALVPHYPMKFQRLRRCISANSYFIFNMLQAFEPPI 
DLSQELDGCEDSNLDCAPHERYTLDERHDSSCQPALTNSCTYKEESKHIRDPITGVSIEE 
LQRKEAHDHSPKDDSADTRINDKTCDCHEGQLNHDHGSVSFSSHDWLDSSLQTHVPLVDV 
DKVRCIIRNIVRDWAAEGQRERDQCYKPILEELDSLFPDRLKESTPPACLVPGAGLGRLA 
LEISCLGFISQGNEFSYYMMICSSFILNYTQVPGEWTIYPWIHSNCNSLSDNDQLRPIAI 
PDIHPASAGITEGFSMCGGDFVEVYNESSHAGMWDAVVTCFFIDTAHNVIEYIQTISKIL 
KDGGVWINLGPLLYHFADTYGHENEMSIELSLEDVKRVASHFGFVIEKERTIETTYTTNP 
RAMMQNRYYTAFWTMRKKCAITTT*
>AT2G32170.2 |  LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321603) Has 362 Blast hits to 319 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 132 Fungi - 130 Plants - 25 Viruses - 0 Other Eukaryotes - 75 (source NCBI BLink) 
MVSPSEMTETAPAIMTTESERIESSRELVDNEEEEKIRRQKKLEEALEAKSLRRIISAYL 
NYPEASEEDLKRWERSYRKLSPAHKALVPHYPMKFQRLRRCISANSYFIFNMLQAFEPPI 
DLSQELDGCEDSNLDCAPHERYTLDERHDSSCQPALTNSCTYKEESKHIRDPITGVSIEE 
LQRKEAHDHSPKDDSADTRINDKTCDCHEGQLNHDHGSVSFSSHDWLDSSLQTHVPLVDV 
DKRETKIIMCHVRTYSQSLLVPLGSLVSYEGWFAHVALHGEVNPVELKCTFLHVRCIIRN 
IVRDWAAEGQRERDQCYKPILEELDSLFPDRLKESTCCIHSPYKVDYMICSTPPACLVPG 
AGLGRLALEISCLGFISQGNEFSYYMMICSSFILNYTQVPGEWTIYPWIHSNCNSLSDND 
QLRPIAIPDIHPASAGITEGFSMCGGDFVEVYNESSHAGMWDAVVTCFFIDTAHNVIEYI 
QTISKILKDGGVWINLGPLLYHFADTYGHENEMSIELSLEDVKRVASHFGFVIEKERTIE 
TTYTTNPRAMMQNRYYTAFWTMRKKCAITTT*
>AT2G32170.2 |  LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321603) 
MVSPSEMTETAPAIMTTESERIESSRELVDNEEEEKIRRQKKLEEALEAKSLRRIISAYL 
NYPEASEEDLKRWERSYRKLSPAHKALVPHYPMKFQRLRRCISANSYFIFNMLQAFEPPI 
DLSQELDGCEDSNLDCAPHERYTLDERHDSSCQPALTNSCTYKEESKHIRDPITGVSIEE 
LQRKEAHDHSPKDDSADTRINDKTCDCHEGQLNHDHGSVSFSSHDWLDSSLQTHVPLVDV 
DKRETKIIMCHVRTYSQSLLVPLGSLVSYEGWFAHVALHGEVNPVELKCTFLHVRCIIRN 
IVRDWAAEGQRERDQCYKPILEELDSLFPDRLKESTCCIHSPYKVDYMICSTPPACLVPG 
AGLGRLALEISCLGFISQGNEFSYYMMICSSFILNYTQVPGEWTIYPWIHSNCNSLSDND 
QLRPIAIPDIHPASAGITEGFSMCGGDFVEVYNESSHAGMWDAVVTCFFIDTAHNVIEYI 
QTISKILKDGGVWINLGPLLYHFADTYGHENEMSIELSLEDVKRVASHFGFVIEKERTIE 
TTYTTNPRAMMQNRYYTAFWTMRKKCAITTT*
>AT2G32600.1 |  hydroxyproline-rich glycoprotein family protein 
MDREWGSKPGSGGAASGQNEAIDRRERLRRLALETIDLAKDPYFMRNHLGSYECKLCLTL 
HNNEGNYLAHTQGKRHQTNLAKRAAREAKDAPTKPQPLKRNVSVRRTVKIGRPGYRVTKQ 
YDPELQQRSLLFQIEYPEIEDNIKPRHRFMSSYEQKVQPYDKSYQYLLFAAEPYEIIAFK 
VPSTEVDKSTPKFFSHWDPDSKMFTLQVYFKPTKPEPNKPQSAVGANGLPPPPPPPPHQA 
QPPPPPPSGLFPPPPPPMANNGFRPMPPAGGFGHPNM*
>AT3G03340.1 |  UNE6 (unfertilized embryo sac 6) 
MDAIRKQLDVLMGANRNGDVQEVNRKYYDRDVCRLYLSGLCPHDLFQLTKMDMGPCPKVH 
SLQLRKEYREARAKGVDNYDRELEDAIDRLIVECDRKIGRALKRLQEEDAKAAIAISVSE 
VTQSPEILELSEKIKEKMKEADIHDLEGKMDLKIRALELVEEMRTKRADQQAVLLLEAFN 
KDRASLPQPVPAQPPSSELPPPDPRTQEMINEKLKKAEDLGEQGMVDEAQKALEEAEALK 
KLTVRREPPADSTKYTAVDVRITDQKLRLCDICGAFLSVYDSDRRLADHFGGKLHLGYML 
VRDKLTELLDEKANIRKERSKERNSKERESSKDREKEQETSREHRRDYDRRSRDRDRHHD 
RDREQDRDYDRSHSRSRRRSRSRSRSRDRPRDYDRHRRHNRY*
>AT3G05760.1 |  nucleic acid binding / zinc ion binding 
MASSNTTTGVDNTFRKKFDVEEFKERAREREKKESDRSKSRSKGPPVQRAPLKHRDYHVD 
LESRLGKTQVVTPVAPLSQQAGYFCRVCDCVVKDSANYLDHINGKKHQRALGMSMRVERS 
SLEQVQERFEVLKKRKAPGTFTEQDLDERIRKQQEEEEELKRQRREKKKEKKKGKVVEEE 
PEMDPEVAEMMGFGGFGSSKKS*
>AT4G03430.1 |  EMB2770 (EMBRYO DEFECTIVE 2770) RNA splicing factor transesterification mechanism 
MVFLSIPNGKTLSIDVNPNSTTISAFEQLAHQRSDVPQSFLRYSLRMRNPSRVFVDSKDS 
DSILLSDLGVSRFSTVIIHVLLLGGMQAAPPKPRLDFLNSKPPSNYVAGLGRGATGFTTR 
SDIGPARAAPDLPDRSALATAAAPGVGRGAGKPSEAEAEDDEEAEEKRYDENQTFDEFEG 
NDVGLFANAEYDEDDKEADAIWESIDQRMDSRRKDRREAKLKEEIEKYRASNPKITEQFA 
DLKRKLHTLSADEWDSIPEIGDYSLRNKKKKFESFVPIPDTLLEKAKKEKELVMALDPKS 
RAAGGSETPWGQTPVTDLTAVGEGRGTVLSLKLDNLSDSVSGQTVVDPKGYLTDLKSMKR 
TTDEEIYDRNRARLLYKSLTQSNPKNPNGWIAAARVEEVDGKIKAARFQIQRGCEECPKN 
EDVWLEACRLANPEDAKGVIAKGVKLIPNSVKLWLEAAKLEHDVENKSRVLRKGLEHIPD 
SVRLWKAVVELANEEDARILLHRAVECCPLHLELWVALARLETYAESKKVLNKAREKLPK 
EPAIWITAAKLEEANGKLDEANDNTAMVGKIIDRGIKTLQREGVVIDRENWMSEAEACER 
VGSVATCQAIIKNTIGIGVEEEDRKRTWVADADECKKRGSIETARAIYAHALSVFLTKKS 
IWLKAAQLEKSHGSRESLDALLRKAVTYVPQAEVLWLMGAKEKWLAGDVPAARAILQEAY 
AAIPNSEEIWLAAFKLEFENKEPERARMLLAKARERGGTERVWMKSAIVERELGNVEEER 
RLLNEGLKQFPTFFKLWLMLGQLEERFKHLEQARKAYDTGLKHCPHCIPLWLSLADLEEK 
VNGLNKARAILTTARKKNPGGAELWLAAIRAELRHDNKREAEHLMSKALQDCPKSGILWA 
ADIEMAPRPRRKTKSIDAMKKCDRDPHVTIAVAKLFWQDKKVEKARAWFERAVTVGPDIG 
DFWALFYKFELQHGSDEDRKEVVAKCVACEPKHGEKWQAISKAVENAHQPIEVILKRVVN 
ALSKEENSA*
>AT4G21660.1 |  proline-rich spliceosome-associated (PSP) family protein 
MTADSTVALVHSVVSNGDVSNGNTSASSKKSREIDRRRRRRKQKKNNKASQADVDASDVS 
AASESKENTDPQPQVCEQIVIEYVPEQAEFEDGFNDEFKEIFEKFNFREPLASEEDGTKD 
ESEEKEDVKKKVNSDSDSDDDEQDNQNKEKGISNKKKKLQRRMKIAELKQVSARPDVVEV 
WDATSADPKLLVFLKSYRNTVPVPRHWSQKRKYLQGKRGIEKQPFHLPDFIAATGIEKIR 
QAYIEKEDGKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLSALGDLYFEGKEF 
EVKLRETKPGFLSNDLKEALGMPEGAPPPWLINMQRYGPPPSYPHLKIPGLNAPIPIGAS 
FGFHAGGWGKPPVDEYGRPLYGDVFGVQQQDQPNYEEEPIDKSKHWGDLEEEEEEEEEEE 
EEQEEEMDEEELEDGTESVDTLSSTPTGIETPDAIELRKDQRKEPDRALYQVLEEKGESV 
APGTLLGTSHTYVIKTGTQEKTGAKRVDLLRGQKTDRVDVSLQPEELDAMENVLPAKYEE 
AREEEKLRNKPVDLSDMVVEHVQQNSRKRKMHDKEGKKKKDFKF*
>AT5G06160.1 |  ATO (ATROPOS) nucleic acid binding / zinc ion binding 
MSSTLLEQTRSNHEEVERLERLVVEDLQKEPPSSKDRLVQGHRVRHMIESIMLTTEKLVE 
TYEDKDGAWDDEIAALGGQTATGTNVFSEFYDRLKEIREYHKRHPSGRLVDANEDYEARL 
KEEPIIAFSGEEGNGRYLDLHDMYNQYINSKFGERVEYSAYLDVFSQPEKIPRKLKLSRQ 
YMKYMEALLEYLVYFFQRTEPLQDLDRILSKVCSDFEEQYADGIVEGLDNELIPSQHTVI 
DLDYYSTVEELVDVGPEKLKEALGALGLKVGGTPQQRAERLFLTKHTPLEKLDKKHFARP 
PHNGKQNGDAKSTHESENAKEIALTEAKVKKLCNLLDETIERTKQNIVKKQSLTYEEMEG 
EREGEEANTELESDDEDGLIYNPLKLPIGWDGKPIPYWLYKLHGLGQEFKCEICGNYSYW 
GRRAFERHFKEWRHQHGMRCLGIPNTKNFNEITSIEEAKELWKRIQERQGVNKWRPELEE 
EYEDREGNIYNKKTYSDLQRQGLI*
>AT5G41770.1 |  crooked neck protein putative / cell cycle protein putative 
MASGGKDSDRTLGYMTRKDTEVKLPRPTRVKNKTPAPIQITAEQILREARERQEAEIRPP 
KQKITDSTELSDYRLRRRKEFEDQIRRARWNIQVWVKYAQWEESQKDYARARSVWERAIE 
GDYRNHTLWLKYAEFEMKNKFVNSARNVWDRAVTLLPRVDQLWYKYIHMEEILGNIAGAR 
QIFERWMDWSPDQQGWLSFIKFELRYNEIERARTIYERFVLCHPKVSAYIRYAKFEMKGG 
EVARCRSVYERATEKLADDEEAEILFVAFAEFEERCKEVERARFIYKFALDHIPKGRAED 
LYRKFVAFEKQYGDKEGIEDAIVGKRRFQYEDEVRKSPSNYDSWFDYVRLEESVGNKDRI 
REIYERAIANVPPAEEKRYWQRYIYLWINYALFEEIETEDIERTRDVYRECLKLIPHSKF 
SFAKIWLLAAQFEIRQLNLTGARQILGNAIGKAPKDKIFKKYIEIELQLGNMDRCRKLYE 
RYLEWSPENCYAWSKYAELERSLVETERARAIFELAISQPALDMPELLWKAYIDFEISEG 
ELERTRALYERLLDRTKHYKVWVSFAKFEASAAELEEDENEDEDQEEDVIEHKKDCIKRA 
RAIFDRANTYYKDSTPELKEERATLLEDWLNMESSFGNLGDVSIVQSKLPKKLKKRKAIT 
REDGSTEYEEYIDYLYPEESQTTNLKILEAAYKWKKQKVAASEDD*
>AT4G32720.1 |  AtLa1 (Arabidopsis thaliana La protein 1) RNA binding 
MSIPCLTEETAKTVLRQVEFYFSDSNLPIDDFLKKTVTESEDGLVSLALICSFSKMRGYL 
KLGDSKGDDIPEDTIKAVADTLRTSSALKISDDGKKVGRSTELLKLEDLIEQLNARTVAA 
SPFSYDVKREDVESFFSQYGKVNSVRMPRHVAESRIFSGVALVEFPTEEDAQNVMKQNLV 
FAGQELELKPKKEFDNEREKDEVKFANYQPQKGSANQKNGSDHKNNSAYEPDYPKGLIIS 
FTLKRSAEEGTTEQKSSEEPTDKTMEESETKPADTPDADKENTGEVQAEGAEDEDDEKEE 
KGALATHKDNKDVVLREDLKAVFGKFGDVKFVDFKMGSETGYLRFDEPEASQKARAAAVL 
ANEGGLAVKNFIAVLEPVIGEAEKEYWTLLRSKDRFDKGGRGGRGGRRGGRFGRKRGSDS 
PGGRWNKSQKVEA*
>AT4G32720.1 |  AtLa1 (Arabidopsis thaliana La protein 1) RNA binding 
MSIPCLTEETAKTVLRQVEFYFSDSNLPIDDFLKKTVTESEDGLVSLALICSFSKMRGYL 
KLGDSKGDDIPEDTIKAVADTLRTSSALKISDDGKKVGRSTELLKLEDLIEQLNARTVAA 
SPFSYDVKREDVESFFSQYGKVNSVRMPRHVAESRIFSGVALVEFPTEEDAQNVMKQNLV 
FAGQELELKPKKEFDNEREKDEVKFANYQPQKGSANQKNGSDHKNNSAYEPDYPKGLIIS 
FTLKRSAEEGTTEQKSSEEPTDKTMEESETKPADTPDADKENTGEVQAEGAEDEDDEKEE 
KGALATHKDNKDVVLREDLKAVFGKFGDVKFVDFKMGSETGYLRFDEPEASQKARAAAVL 
ANEGGLAVKNFIAVLEPVIGEAEKEYWTLLRSKDRFDKGGRGGRGGRRGGRFGRKRGSDS 
PGGRWNKSQKVEA*
>AT4G32720.2 |  AtLa1 (Arabidopsis thaliana La protein 1) RNA binding 
MSIPCLTEETAKTVLRQVEFYFSDSNLPIDDFLKKTVTESEDGLVSLALICSFSKMRGYL 
KLGDSKGDDIPEDTIKAVADTLRTSSALKISDDGKKVGRSTELLKLEDLIEQLNARTVAA 
SPFSYDVKREDVESFFSQYGKVNSVRMPRHVAESRIFSGVALVEFPTEEDAQNVMKQNLV 
FAGQELELKPKKEFDNEREKDEVKFANYQPQKGSANQKNGSDHKNNSAYEPDYPKGLIIS 
FTLKRSAEEGTTEQKSSEEPTDKTMEESETKPADTPDADKENTGEVQAEGAEDEDDEKEE 
KGALATHKDNKDVVLREDLKAVFGKFGDVKFVDFKMGSETGYLRFDEPEASQKARAAAVL 
ANEGGLAVKNFIAVLEPVIGEAEKEYWTLLRSKDRFDKGGRGGR*
>AT4G32720.2 |  AtLa1 (Arabidopsis thaliana La protein 1) RNA binding 
MSIPCLTEETAKTVLRQVEFYFSDSNLPIDDFLKKTVTESEDGLVSLALICSFSKMRGYL 
KLGDSKGDDIPEDTIKAVADTLRTSSALKISDDGKKVGRSTELLKLEDLIEQLNARTVAA 
SPFSYDVKREDVESFFSQYGKVNSVRMPRHVAESRIFSGVALVEFPTEEDAQNVMKQNLV 
FAGQELELKPKKEFDNEREKDEVKFANYQPQKGSANQKNGSDHKNNSAYEPDYPKGLIIS 
FTLKRSAEEGTTEQKSSEEPTDKTMEESETKPADTPDADKENTGEVQAEGAEDEDDEKEE 
KGALATHKDNKDVVLREDLKAVFGKFGDVKFVDFKMGSETGYLRFDEPEASQKARAAAVL 
ANEGGLAVKNFIAVLEPVIGEAEKEYWTLLRSKDRFDKGGRGGR*
>AT5G46400.1 |  PRP39-2 
MVTTEVRTAVSDKEPLQRSPELDSSTDFLDNDRLKETFSSGALDFDEWTLLISEIETTSF 
PDDIEKLCLVYDAFLLEFPLCHGYWRKYAYHKIKLCTLEDAVEVFERAVQAATYSVAVWL 
DYCAFAVAAYEDPHDVSRLFERGLSFIGKDYSCCTLWDKYIEYLLGQQQWSSLANVYLRT 
LKYPSKKLDLYYKNFRKIAASLKEKIKCRIDVNGDLSSDPMEEDLVHTRHTDEEISIVVR 
ELMGPSSSSAVSKALHTYLSIGEQFYQDSRQLMEKISCFETQIRRPYFHVKPLDTNQLDN 
WHAYLSFGETYGDFDWAINLYERCLIPCANYTEFWFRYVDFVESKGGRELANFALARASQ 
TFVKSASVIHLFNARFKEHVGDASAASVALSRCGEELGFGFVENVTKKANMEKRLGNFEA 
AVTTYREALNKTLIGKENLETTARLYVQFSRLKYVITNSADDAAQILLEGNENVPHCKLL 
LEELMRLLMMHGGSRQVDLLDPIIDKELSHQADSSDGLSAEDKEEISNLYMEFIDLSGTI 
HDVRKALGRHIKLFPHSARAKLRGSRPSGNLFRELIQRREKTRERLNQDLLTNKGISSIV 
DSPPKEKKESSLDSYGTQSKDAVRADYVNTEPNQGCLTSGHLVEGNDNVIERETLCESQS 
DLSMGLKANEGGKRSHEVSLPIQASPEHGFVTKQAHFSSNSVDTVKSDAIVIQPSGSQSP 
QSYQSQESLRQTGRNRYHRRDLNQMHRDSKPRSQERPPQMPYSPVGTGREILGQHMAFTH 
QDNRVALQSSTSQNPQNQFQNSALQMHPVVQTSNAYPQSQIHGQHMIVSPPESQNPQNQC 
QNSTSQVQTSFAYPQTQIPQNPVQSNYQQEGQMQSHEAYNQMWQQYYYSYYYYQQQQQLM 
SEQPQPNQNPQPQLDQNLVQLLSKQYQSQAKTQYLQPQQVEQVNTQQQSQEPQNQQQIQF 
QQQQQQQEWFQQQQQWQQQQYLLYIQQQQLQGEAKGDEQRLSMPQGSTTNSDIQKSQESG 
AVNEANLSSDTSISSI*
>AT5G15770.1 |  AtGNA1 (Arabidopsis thaliana glucose-6-phosphate acetyltransferase 1) N-acetyltransferase/ glucosamine 6-phosphate N-acetyltransferase 
MAETFKIRKLEISDKRKGFIELLGQLTVTGSVTDEEFDRRFEEIRSYGDDHVICVIEEET 
SGKIAATGSVMIEKKFLRNCGKAGHIEDVVVDSRFRGKQLGKKVVEFLMDHCKSMGCYKV 
ILDCSVENKVFYEKCGMSNKSIQMSKYFD*