>AT4G30330.1 |  small nuclear ribonucleoprotein E putative / snRNP-E putative / Sm protein E putative 
MASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRITGFDEYMNLVLDEAEEV 
SIKKKTRKPLGRILLKGDNITLMMNAGK*
>AT4G02840.1 |  small nuclear ribonucleoprotein D1 putative / snRNP core protein D1 putative / Sm protein D1 putative 
MKLVRFLMKLNNETVSIELKNGTIVHGTITGVDVSMNTHLKAVKLTLKGKNPVTLDHLSV 
RGNNIRYYILPDSLNLETLLVEDTPRIKPKKPTAGKIPAGRGRGRGRGRGRGRGGR*
>AT1G20960.1 |  emb1507 (embryo defective 1507) ATP binding / ATP-dependent helicase/ helicase/ nucleic acid binding / nucleoside-triphosphatase/ nucleotide binding 
MANLGGGAEAHARFKQYEYRANSSLVLTTDNRPRDTHEPTGEPETLWGKIDPRSFGDRVA 
KGRPQELEDKLKKSKKKERDVVDDMVNIRQSKRRRLREESVLTDTDDAVYQPKTKETRAA 
YEAMLGLIQKQLGGQPPSIVSGAADEILAVLKNDAFRNPEKKMEIEKLLNKIENHEFDQL 
VSIGKLITDFQEGGDSGGGRANDDEGLDDDLGVAVEFEENEEDDEESDPDMVEEDDDEED 
DEPTRTGGMQVDAGINDEDAGDANEGTNLNVQDIDAYWLQRKISQAYEQQIDPQQCQVLA 
EELLKILAEGDDRVVEDKLLMHLQYEKFSLVKFLLRNRLKVVWCTRLARAEDQEERNRIE 
EEMRGLGPELTAIVEQLHATRATAKEREENLQKSINEEARRLKDETGGDGGRGRRDVADR 
DSESGWVKGQRQMLDLESLAFDQGGLLMANKKCDLPPGSYRSHGKGYDEVHVPWVSKKVD 
RNEKLVKITEMPDWAQPAFKGMQQLNRVQSKVYDTALFKAENILLCAPTGAGKTNVAMLT 
ILQQLEMNRNTDGTYNHGDYKIVYVAPMKALVAEVVGNLSNRLKDYGVIVRELSGDQSLT 
GREIEETQIIVTTPEKWDIITRKSGDRTYTQLVRLLIIDEIHLLHDNRGPVLESIVARTL 
RQIETTKENIRLVGLSATLPNYEDVALFLRVDLKKGLFKFDRSYRPVPLHQQYIGISVKK 
PLQRFQLMNDLCYQKVLAGAGKHQVLIFVHSRKETSKTARAIRDTAMANDTLSRFLKEDS 
VTRDVLHSHEDIVKNSDLKDILPYGFAIHHAGLSRGDREIVETLFSQGHVQVLVSTATLA 
WGVNLPAHTVIIKGTQVYNPEKGAWMELSPLDVMQMLGRAGRPQYDQHGEGIIITGYSEL 
QYYLSLMNEQLPIESQFISKLADQLNAEIVLGTVQNAREACHWLGYTYLYIRMVRNPTLY 
GLAPDALAKDVVLEERRADLIHSAATILDKNNLVKYDRKSGYFQVTDLGRIASYYYITHG 
TIATYNEHLKPTMGDIDLYRLFSLSDEFKYVTVRQDEKMELAKLLDRVPIPIKETLEEPS 
AKINVLLQAYISQLKLEGLSLTSDMVYITQSAGRLVRALYEIVLKRGWAQLAEKALNLSK 
MVGKRMWSVQTPLRQFHGLSNDILMQLEKKDLVWERYYDLSAQELGELIRSPKMGKPLHK 
FIHQFPKVTLSAHVQPITRTVLNVELTVTPDFLWDEKIHKYVEPFWIIVEDNDGEKILHH 
EYFLLKKQYIDEDHTLHFTVPIFEPLPPQYFVRVVSDKWLGSETVLPVSFRHLILPEKYP 
PPTELLDLQPLPVTALRNPNYEILYQDFKHFNPVQTQVFTVLYNTNDNVLVAAPTGSGKT 
ICAEFAILRNHHEGPDATMRVVYIAPLEAIAKEQFRIWEGKFGKGLGLRVVELTGETALD 
LKLLEKGQIIISTPEKWDALSRRWKQRKYVQQVSLFIVDELHLIGGQHGPVLEVIVSRMR 
YISSQVINKIRIVALSTSLANAKDLGEWIGASSHGLFNFPPGVRPVPLEIHIQGVDISSF 
EARMQAMTKPTYTAIVQHAKNKKPAIVFVPTRKHVRLTAVDLMAYSHMDNPQSPDFLLGK 
LEELDPFVEQIREETLKETLCHGIGYLHEGLSSLDQEIVTQLFEAGRIQVCVMSSSLCWG 
TPLTAHLVVVMGTQYYDGRENSHSDYPVPDLLQMMGRASRPLLDNAGKCVIFCHAPRKEY 
YKKFLYEAFPVESQLQHFLHDNFNAEVVAGVIENKQDAVDYLTWTFMYRRLPQNPNYYNL 
QGVSHRHLSDHLSELVENTLSDLEASKCIEVEDEMELSPLNLGMIASYYYISYTTIERFS 
SLLSSKTKMKGLLEILTSASEYDMIPIRPGEEDTVRRLINHQRFSFENPKCTDPHVKANA 
LLQAHFSRQNIGGNLAMDQRDVLLSATRLLQAMVDVISSNGWLNLALLAMEVSQMVTQGM 
WERDSMLLQLPHFTKDLAKRCQENPGKNIETVFDLVEMEDEERQELLKMSDAQLLDIARF 
CNRFPNIDLTYEIVGSEEVNPGKEVTLQVMLERDMEGRTEVGPVDSLRYPKTKEEGWWLV 
VGDTKTNQLLAIKRVSLQRKVKVKLDFTAPSEPGEKSYTLYFMCDSYLGCDQEYSFSVDV 
KGSGAGDRMEE*
>AT1G09760.1 |  U2A (U2 small nuclear ribonucleoprotein A) protein binding 
MVKLTADLIWKSPHFFNAIKERELDLRGNKIPVIENLGATEDQFDTIDLSDNEIVKLENF 
PYLNRLGTLLINNNRITRINPNLGEFLPKLHSLVLTNNRLVNLVEIDPLASIPKLQYLSL 
LDNNITKKANYRLYVIHKLKSLRVLDFIKIKAKERAEAASLFSSKEAEEEVKKVSREEVK 
KVSETAENPETPKVVAPTAEQILAIKAAIINSQTIEEIARLEQALKFGQVPAGLIIPDPA 
TNDSAPMEE*
>AT5G64270.1 |  splicing factor putative 
MADLDPEIAKTQEERRKMEADLASLTSLTFDRDLYGGNDRASYSTSIAPNEEDDANLDTT 
GSLVAQRLASYTAPRSILNDVARPHNEDDDVGFKPRQSIAEREGEYRNRRLNRVLSPDRV 
DAFAMGDKTPDASVRTYTDHMRETALQREKEETMRLIAKKKKEEEEAAAKHQKDSAPPPP 
ASSSSSSSKRRHRWDLPEEDGAAAKKAKAASSDWDLPDAAPGIGRWDAPTPGRVSDATPS 
AGRRNRWDETPTPGRVTDSDATPGGGVTPGATPSGVTWDGLATPTPKRQRSRWDETPATM 
GSATPMGGVTPGAAYTPGVTPIGGIDMATPTPGQLIFRGPMTPEQLNMQRWEKDIEERNR 
PLSDEELDAMFPKDGYKVLDPPATYVPIRTPARKLQQTPTPMATPGYVIPEENRGQQYDV 
PPEVPGGLPFMKPEDYQYFGSLLNEENEEELSPEEQKERKIMKLLLKVKNGTPPQRKTAL 
RQLTDKARELGAGPLFNKILPLLMQPTLEDQERHLLVKVIDRILYKLDEMVRPYVHKILV 
VIEPLLIDEDYYARVEGREIISNLSKAAGLASMIAAMRPDIDNIDEYVRNTTARAFSVVA 
SALGIPALLPFLKAVCQSKRSWQARHTGIKIVQQIAILIGCAVLPHLRSLVEIIEHGLSD 
ENQKVRTITALSLAALAEAAAPYGIESFDSVLKPLWKGIRSHRGKVLAAFLKAIGFIIPL 
MDAIYASYYTKEVMVILIREFQSPDEEMKKIVLKVVKQCVSTEGVEPEYIRSDILPEFFR 
NFWTRKMALERRNYKQLVETTVEVANKVGVADIVGRVVEDLKDESEQYRRMVMETIDKVV 
TNLGASDIDARLEELLIDGILYAFQEQTSDDANVMLNGFGAVVNALGQRVKPYLPQICGT 
IKWRLNNKSAKVRQQAADLISRIAVVMKQCGEEQLMGHLGVVLYEYLGEEYPEVLGSILG 
ALKAIVNVIGMTKMTPPIKDLLPRLTPILKNRHEKVQENCIDLVGRIADRGAEFVPAREW 
MRICFELLEMLKAHKKGIRRATVNTFGYIAKAIGPQDVLATLLNNLKVQERQNRVCTTVA 
IAIVAETCSPFTVLPALMNEYRVPELNVQNGVLKSLSFLFEYIGEMGKDYIYAVTPLLED 
ALMDRDLVHRQTAASAVKHMALGVAGLGCEDALVHLLNFIWPNIFETSPHVINAVMEAIE 
GMRVALGAAVILNYCLQGLFHPARKVREVYWKIYNSLYIGAQDTLVAAYPVLEDEQNNVY 
SRPELTMFV*
>AT1G09770.1 |  ATCDC5 (ARABIDOPSIS THALIANA CELL DIVISION CYCLE 5) DNA binding / transcription factor 
MRIMIKGGVWKNTEDEILKAAVMKYGKNQWARISSLLVRKSAKQCKARWYEWLDPSIKKT 
EWTREEDEKLLHLAKLLPTQWRTIAPIVGRTPSQCLERYEKLLDAACTKDENYDAADDPR 
KLRPGEIDPNPEAKPARPDPVDMDEDEKEMLSEARARLANTRGKKAKRKAREKQLEEARR 
LASLQKRRELKAAGIDGRHRKRKRKGIDYNAEIPFEKRAPAGFYDTADEDRPADQVKFPT 
TIEELEGKRRADVEAHLRKQDVARNKIAQRQDAPAAILQANKLNDPEVVRKRSKLMLPPP 
QISDHELEEIAKMGYASDLLAENEELTEGSAATRALLANYSQTPRQGMTPMRTPQRTPAG 
KGDAIMMEAENLARLRDSQTPLLGGENPELHPSDFTGVTPRKKEIQTPNPMLTPSMTPGG 
AGLTPRIGLTPSRDGSSFSMTPKGTPFRDELHINEDMDMHESAKLERQRREEARRSLRSG 
LTGLPQPKNEYQIVAQPPPEESEEPEEKIEEDMSDRIAREKAEEEARQQALLKKRSKVLQ 
RDLPRPPAASLAVIRNSLLSADGDKSSVVPPTPIEVADKMVREELLQLLEHDNAKYPLDD 
KAEKKKGAKNRTNRSASQVLAIDDFDENELQEADKMIKEEGKFLCVSMGHENKTLDDFVE 
AHNTCVNDLMYFPTRSAYELSSVAGNADKVAAFQEEMENVRKKMEEDEKKAEHMKAKYKT 
YTKGHERRAETVWTQIEATLKQAEIGGTEVECFKALKRQEEMAASFRKKNLQEEVIKQKE 
TESKLQTRYGNMLAMVEKAEEIMVGFRAQALKKQEDVEDSHKLKEAKLATGEEEDIAIAM 
EASA*
>AT2G23930.1 |  SNRNP-G (PROBABLE SMALL NUCLEAR RIBONUCLEOPROTEIN G) 
MSRSGQPPDLKKYMDKKLQIKLNANRMVTGTLRGFDQFMNLVVDNTVEVNGNDKTDIGMV 
VIRGNSIVTVEALEPVGRSS*
>AT2G23930.2 |  SNRNP-G (PROBABLE SMALL NUCLEAR RIBONUCLEOPROTEIN G) 
MDKKLQIKLNANRMVTGTLRGFDQFMNLVVDNTVEVNGNDKTDIGMVVIRGNSIVTVEAL 
EPVGRSS*
>AT3G11500.1 |  small nuclear ribonucleoprotein G putative / snRNP-G putative / Sm protein G putative 
MSRSGQPPDLKKYMDKKLQIKLNANRMVVGTLRGFDQFMNLVVDNTVEVNGDDKTDIGMV 
VIRGNSIVTVEALEPVGRS*
>AT5G27720.1 |  emb1644 (embryo defective 1644) 
MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECY 
IRGNTIKYLRVPDEVIDKVQEEKTRTDRKPPGVGRGRGRGVDDGGARGRGRGTSMGKMGG 
NRGAGRGRG*
>AT1G21190.1 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MSVEEDATVREPLDLIRLSIEERIYVKLRSDRELRGKLHAFDQHLNMILGDVEEVITTIE 
IDDETYEEIVRTTKRTVPFLFVRGDGVILVSPPLRTT*
>AT1G76860.1 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MSGEEEATVREPLDLIRLSLDERIYVKLRSDRELRGKLHAFDQHLNMILGDVEETITTVE 
IDDETYEEIVRTTKRTIEFLFVRGDGVILVSPPLRTAA*
>AT1G03330.1 |  small nuclear ribonucleoprotein D putative / snRNP core SM-like protein putative / U6 snRNA-associated Sm-like protein putative 
MLFFSYFKDLVGQEVTVELKNDLAIRGTLHSVDQYLNIKLENTRVVDQDKYPHMLSVRNC 
FIRGSVVRYVQLPKDGVDVDLLHDAARREARGG*
>AT2G18740.1 |  small nuclear ribonucleoprotein E putative / snRNP-E putative / Sm protein E putative 
MASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRITGFDEYMNLVLDEAEEV 
SIKKNTRKPLGRILLKGDNITLMMNTGK*
>AT2G18740.2 |  small nuclear ribonucleoprotein E putative / snRNP-E putative / Sm protein E putative 
MASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRITGFDEYMNLVLDEAEEV 
SIKKNTRKPLGFYSKETT*
>AT2G47640.1 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM 
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.2 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV 
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.3 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV 
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.4 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM 
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT3G50670.1 |  U1-70K (U1 SMALL NUCLEAR RIBONUCLEOPROTEIN-70K) RNA binding / nucleic acid binding / nucleotide binding 
MGDSGDPFLRNPNAAVQARAKVQNRANVLQLKLMGQSHPTGLTNNLLKLFEPRPPLEYKP 
PPEKRKCPPYTGMAQFVSNFAEPGDPEYAPPKPEVELPSQKRERIHKLRLEKGVEKAAED 
LKKYDPNNDPNATGDPYKTLFVSRLNYESSESKIKREFESYGPIKRVHLVTDQLTNKPKG 
YAFIEYMHTRDMKAAYKQADGQKIDGRRVLVDVERGRTVPNWRPRRLGGGLGTSRVGGGE 
EIVGEQQPQGRTSQSEEPSRPREEREKSREKGKERERSRELSHEQPRERSRDRPREDKHH 
RDRDQGGRDRDRDSRRDRDRTRDRGDRDRRDRDRGRDRTSRDHDRDRSRKKERDYEGGEY 
EHEGGGRSRERDAEYRGEPEETRGYYEDDQGDTDRYSHRYDKMEEDDFRYEREYKRSKRS 
ESREYVR*
>AT3G50670.1 |  U1-70K (U1 SMALL NUCLEAR RIBONUCLEOPROTEIN-70K) RNA binding / nucleotide binding 
MGDSGDPFLRNPNAAVQARAKVQNRANVLQLKLMGQSHPTGLTNNLLKLFEPRPPLEYKP 
PPEKRKCPPYTGMAQFVSNFAEPGDPEYAPPKPEVELPSQKRERIHKLRLEKGVEKAAED 
LKKYDPNNDPNATGDPYKTLFVSRLNYESSESKIKREFESYGPIKRVHLVTDQLTNKPKG 
YAFIEYMHTRDMKAAYKQADGQKIDGRRVLVDVERGRTVPNWRPRRLGGGLGTSRVGGGE 
EIVGEQQPQGRTSQSEEPSRPREEREKSREKGKERERSRELSHEQPRERSRDRPREDKHH 
RDRDQGGRDRDRDSRRDRDRTRDRGDRDRRDRDRGRDRTSRDHDRDRSRKKERDYEGGEY 
EHEGGGRSRERDAEYRGEPEETRGYYEDDQGDTDRYSHRYDKMEEDDFRYEREYKRSKRS 
ESREYVR*
>AT3G50670.2 |  U1-70K (U1 SMALL NUCLEAR RIBONUCLEOPROTEIN-70K) RNA binding / nucleic acid binding / nucleotide binding 
MGDSGDPFLRNPNAAVQARAKVQNRANVLQLKLMGQSHPTGLTNNLLKLFEPRPPLEYKP 
PPEKRKCPPYTGMAQFVSNFAEPGDPEYAPPKPEVELPSQKRERIHKLRLEKGVEKAAED 
LKKYDPNNDPNATGDPYKTLFVSRLNYESSESKIKREFESYGPIKRVGYSEHSLAGSVRI 
CVMASLSRALCSICFILSTKVFQG*
>AT3G50670.2 |  U1-70K (U1 SMALL NUCLEAR RIBONUCLEOPROTEIN-70K) RNA binding / nucleotide binding 
MGDSGDPFLRNPNAAVQARAKVQNRANVLQLKLMGQSHPTGLTNNLLKLFEPRPPLEYKP 
PPEKRKCPPYTGMAQFVSNFAEPGDPEYAPPKPEVELPSQKRERIHKLRLEKGVEKAAED 
LKKYDPNNDPNATGDPYKTLFVSRLNYESSESKIKREFESYGPIKRVGYSEHSLAGSVRI 
CVMASLSRALCSICFILSTKVFQG*
>AT3G19840.1 |  FF domain-containing protein / WW domain-containing protein 
MLANAPFGRPGTLAPPGLMTSPPAFPGSNPFSTTPRPGMSAGPAQMNPGIHPHMYPPYHS 
LPGTPQGMWLQPPSMGGIPRAPFLSHPTTFPGSYPFPVRGISPNLPYSGSHPLGASPMGS 
VGNVHALPGRQPDISPGRKTEELSGIAGSQLVGNRLDAWTAHKSEAGVLYYYNSVTGQST 
YEKPPGFGGEPDKVPVQPIPVSILPGTDWALVSTNDGKKYYYNNKTKVSSWQIPAEVKDF 
GKKLEERAMESVASVPSADLTEKGSDLTSLSAPAISNGGRDAASLKTTNFGSSALDLVKK 
KLHDSGMPVSSTITSEANSGKTTEVTPSGESGNSTGKVKDAPGAGALSDSSSDSEDEDSG 
PSKEECSKQFKEMLKERGIAPFSKWEKELPKIIFDPRFKAIPSHSVRRSLFEQYVKTRAE 
EERREKRAAHKAAIEGFRQLLDDASTDIDQHTDYRAFKKKWGNDLRFEAIERKEREGLLN 
ERVLSLKRSAEQKAQEIRAAAASDFKTMLREREISINSHWSKVKDSLRNEPRYRSVAHED 
REVFYYEYIAELKAAQRGDDHEMKARDEEDKLRERERELRKRKEREVQEVERVRQKIRRK 
EASSSYQALLVEKIRDPEASWTESKPILERDPQKRASNPDLEPADKEKLFRDHVKSLYER 
CVHDFKALLAEALSSEAATLQTEDGKTALNSWSTAKQVLKPDIRYSKMPRQDREVVWRRY 
VEDISRKQRHENYQEEKQRDYKT*
>AT1G20580.1 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MSRSLGIPVKLLHEASGHIVTVELKSGELYRGSMIECEDNWNCQLEDITYTAKDGKVSQL 
EHVFIRGSKVRFMVIPDILKHAPMFKRLDARIKGKSSSLGVGRGRGAMRGKPAAGPGRGT 
GGRGAVPPVRR*
>AT1G28060.1 |  small nuclear ribonucleoprotein family protein / snRNP family protein 
MDKERYSRSHRDDRDRDSSPDHSPQREGGRRRDRDVDSKRRDSDHYRSSRRGDREDERDR 
TKDRRGRSVERGEREGSRDREKHHHERSHEGSKEKESRSKRKDREEENGARDGKKKSRFA 
DGNGERRSRFEDVAIEVENKDAQVSEGSGATNPTSGVTMGASTYSSIPSEASAAPSQTLL 
TKVSSISTTDENKASVVRSHEVPGKSSTDGRPLSTAGKSSANLPLDSSALAAKARKALQL 
QKGLADRLKNLPLLKKATKPTSEGSPHTRVPPSTTTPAVSTGTSFASTLPHTGLAGFGSI 
ANIEAVKRAQELAANMGFHQDREFAPVINLFPGQAPSDMTVAQRPEKPPVLRVDALGREI 
DEHGNVISVTKPSNLSTLKVNINKKKKDAFQILKPQLEADLKENPYFDTRMGIDEKKILR 
PKRMSFQFVEEGKWTRDAENLKFKSHFGEAKAKELKVKQAQLAKANDDINPNLIEVSERV 
PRKEKPKEPIPDVEWWDANVLTNGEYGEITDGTITESHLKIEKLTHYIEHPRPIEPPAEA 
APPPPQPLKLTKKEQKKLRTQRRLAKEKEKQEMIRQGLLEPPKAKVKMSNLMKVLGSEAT 
QDPTKLEKEIRTAAAEREQAHTDRNAARKLTPAEKREKKERKLFDDPTTVETIVSVYKIK 
KLSHPKTRFKVEMNARENRLTGCSVMTDEMSVVVVEGKSKAIKRYGKLMMKRINWEEAER 
KEGNEDEEEEVNGGNKCWLVWQGSIGKPSFHRFHVHECVTESTAKKVFMDAGVVHYWDLA 
VNYSDD*
>AT2G03870.2 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFVRDHDDPLKT 
TDQTRRLGLIVCRGTAVMLVSPTDGTEEIANPFVTAEAV*
>AT2G03870.1 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFVRDHDDPLKT 
TDQTRRLGLIVCRGTAVMLVSPTDGTEEIANPFVTAEAV*
>AT4G30220.1 |  RUXF (SMALL NUCLEAR RIBONUCLEOPROTEIN F) 
MATIPVNPKPFLNNLTGKTVIVKLKWGMEYKGFLASVDSYMNLQLGNTEEYIDGQLTGNL 
GEILIRCNNVLYVRGVPEDEELEDADQD*
>AT5G48870.1 |  SAD1 (SUPERSENSITIVE TO ABA AND DROUGHT 1) RNA binding 
MANNPSQLLPSELIDRCIGSKIWVIMKGDKELVGILKGFDVYVNMVLEDVTEYEITAEGR 
RVTKLDQILLNGNNIAILVPGGSPEDGE*
>AT2G41500.1 |  EMB2776 nucleotide binding 
MEPNKDDNVSLAATAQISAPPVLQDASSLPGFSAIPPVVPPSFPPPMAPIPMMPHPPVAR 
PPTFRPPVSQNGGVKTSDSDSESDDEHIEISEESKQVRERQEKALQDLLVKRRAAAMAVP 
TNDKAVRDRLRRLGEPITLFGEQEMERRARLTQLLTRYDINGQLDKLVKDHEEDVTPKEE 
VDDEVLEYPFFTEGPKELREARIEIAKFSVKRAAVRIQRAKRRRDDPDEDMDAETKWALK 
HAKHMALDCSNFGDDRPLTGCSFSRDGKILATCSLSGVTKLWEMPQVTNTIAVLKDHKER 
ATDVVFSPVDDCLATASADRTAKLWKTDGTLLQTFEGHLDRLARVAFHPSGKYLGTTSYD 
KTWRLWDINTGAELLLQEGHSRSVYGIAFQQDGALAASCGLDSLARVWDLRTGRSILVFQ 
GHIKPVFSVNFSPNGYHLASGGEDNQCRIWDLRMRKSLYIIPAHANLVSQVKYEPQEGYF 
LATASYDMKVNIWSGRDFSLVKSLAGHESKVASLDITADSSCIATVSHDRTIKLWTSSGN 
DDEDEEKETMDIDL*
>AT3G26590.1 |  MATE efflux family protein 
MAKDKDITETLLTAAEERSDLPFLSVDDIPPITTVGGFVREFNVETKKLWYLAGPAIFTS 
VNQYSLGAITQVFAGHISTIALAAVSVENSVVAGFSFGIMLGMGSALETLCGQAFGAGKL 
SMLGVYLQRSWVILNVTALILSLLYIFAAPILASIGQTAAISSAAGIFSIYMIPQIFAYA 
INFPTAKFLQSQSKIMVMAVISAVALVIHVPLTWFVIVKLQWGMPGLAVVLNASWCFIDM 
AQLVYIFSGTCGEAWSGFSWEAFHNLWSFVRLSLASAVMLCLEVWYFMAIILFAGYLKNA 
EISVAALSICMNILGWTAMIAIGMNTAVSVRVSNELGANHPRTAKFSLLVAVITSTLIGF 
IVSMILLIFRDQYPSLFVKDEKVIILVKELTPILALSIVINNVQPVLSGVAVGAGWQAVV 
AYVNIACYYVFGIPFGLLLGYKLNYGVMGIWCGMLTGTVVQTIVLTWMICKTNWDTEASM 
AEDRIREWGGEVSEIKQLIN*
>AT2G33340.1 |  nucleotide binding / ubiquitin-protein ligase 
MNCAISGEVPVEPVVSTKSGLLFERRLIERHISDYGKCPVTGEPLTIDDIVPIKTGEIIK 
PKTLHTASIPGLLGTFQNEWDGLMLSNFALEQQLHTARQELSHALYQHDSACRVIARLKK 
ERDEARQLLAEVERHIPAAPEAVTANAALSNGKRAAVDEELGPDAKKLCPGISAEIITEL 
TDCNAALSQKRKKRQIPQTLASIDTLERFTQLSSHPLHKTNKPGICSMDILHSKDVIATG 
GVDATAVLFDRPSGQILSTLTGHSKKVTSVKFVGDSDLVLTASADKTVRIWRNPGDGNYA 
CGYTLNDHSAEVRAVTVHPTNKYFVSASLDGTWCFYDLSSGSCLAQVSDDSKNVDYTAAA 
FHPDGLILGTGTSQSVVKIWDVKSQANVAKFDGHTGEVTAISFSENGYFLATAAEDGVRL 
WDLRKLRNFKSFLSADANSVEFDPSGSYLGIAASDIKVYQTASVKAEWNLIKTLPDLSGT 
GKATCVKFGSDAQYVAVGSMDRNLRIFGLPGDEKANVDDDSAQDS*
>AT2G33340.2 |  nucleotide binding / ubiquitin-protein ligase 
MNCAISGEVPVEPVVSTKSGLLFERRLIERHISDYGKCPVTGEPLTIDDIVPIKTGEIIK 
PKTLHTASIPGLLGTFQNEWDGLMLSNFALEQQLHTARQELSHALYQHDSACRVIARLKK 
ERDEARQLLAEVERHIPAAPEAVTANAALSNGKRAAVDEELGPDAKKLCPGISAEIITEL 
TDCNAALSQKRKKRQIPQTLASIDTLERFTQLSSHPLHKTNKPGICSMDILHSKDVIATG 
GVDATAVLFDRPSGQILSTLTGHSKKVTSVKFVGDSDLVLTASADKTVRIWRNPGDGNYA 
CGYTLNDHSAEVRAVTVHPTNKYFVSASLDGTWCFYDLSSGSCLAQVSDDSKNVDYTAAA 
FHPDGLILGTGTSQSVVKIWDVKSQANVAKFDGHTGEVTAISFSENGYFLATAAEDGVRL 
WDLRKLRNFKSFLSADANSVEFDPSGSYLGIAASDIKVYQTASVKAEWNLIKTLPDLSGT 
GKATCVKFGSDAQYVAVGSMDRNLRIFGLPGDEKANVDDDSAQDS*
>AT2G33340.3 |  nucleotide binding / ubiquitin-protein ligase 
MNCAISGEVPVEPVVSTKSGLLFERRLIERHISDYGKCPVTGEPLTIDDIVPIKTGEIIK 
PKTLHTASIPGLLGTFQNEWDGLMLSNFALEQQLHTARQELSHALYQHDSACRVIARLKK 
ERDEARQLLAEVERHIPAAPEAVTANAALSNGKRAAVDEELGPDAKKLCPGISAEIITEL 
TDCNAALSQKRKKRQIPQTLASIDTLERFTQLSSHPLHKTNKPGICSMDILHSKDVIATG 
GVDATAVLFDRPSGQILSTLTGHSKKVTSVKFVGDSDLVLTASADKTVRIWRNPGDGNYA 
CGYTLNDHSAEVRAVTVHPTNKYFVSASLDGTWCFYDLSSGSCLAQVSDDSKNVDYTAAA 
FHPDGLILGTGTSQSVVKIWDVKSQANVAKFDGHTGEVTAISFSENGYFLATAAEDGVRL 
WDLRKLRNFKSFLSADANSVEFDPSGSYLGIAASDIKVYQTASVKAEWNLIKTLPDLSGT 
GKLRV*
>AT3G18790.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN chloroplast EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Isy1-like splicing (InterProIPR009360) Has 1075 Blast hits to 879 proteins in 176 species Archae - 8 Bacteria - 11 Metazoa - 379 Fungi - 177 Plants - 27 Viruses - 9 Other Eukaryotes - 464 (source NCBI BLink) 
MARNEEKAQSMLNRFITQKESEKKKPKERRPYLASECRDLAEADKWRQQILREIGSKVAE 
IQNEGLGEHRLRDLNDEINKLLRERYHWERRIVELGGHNYSKHSAKMTDLEGNIIDVPNP 
SGRGPGYRYFGAAKKLPGVRELFEKPPELRKRKTRYDIYKRIDASYYGYRDDEDGILEKL 
ERKSEGGMRKRSVEEWRRLDEVRKEARKGASEVVSVGAAAAAAREVLFEEEEDVVEEERM 
EREKEEEKEREFVVHVPLPDEKEIEKMVLEKKKMDLLSKYASEDLVEQQTEAKSMLNIHR 
*
>AT1G80070.1 |  SUS2 (ABNORMAL SUSPENSOR 2) 
MWNNNDGMPLAPPGTGGSMMPPPPAAHPSYTALPPPSNPTPPVEPTPEEAEAKLEEKARK 
WMQLNSKRYGDKRKFGFVETQKEDMPPEHVRKIIRLVFFSSFSTISKYSLLDNYFLARDH 
GDMSSKKFRHDKRVYLGALKFVPHAVFKLLENMPMPWEQVRDVKVLYHITGAITFVNEIP 
WVVEPIYMAQWGTMWIMMRREKRDRRHFKRMRFPPFDDEEPPLDYADNLLDVDPLEPIQL 
ELDEEEDSAVHTWFYDHKPLVKTKLINGPSYRRWNLSLPIMATLHRLAGQLLSDLIDRNY 
FYLFDMPSFFTAKALNMCIPGGPKFEPLYRDMEKGDEDWNEFNDINKLIIRSPLRTEYRI 
AFPHLYNNRPRKVKLCVYHSPMIMYIKTEDPDLPAFYYDPLIHPISNTNKEKRERKVYDD 
EDDFALPEGVEPLLRDTQLYTDTTAAGISLLFAPRPFNMRSGRTRRAEDIPLVSEWFKEH 
CPPAYPVKVRVSYQKLLKCYVLNELHHRPPKAQKKKHLFRSLAATKFFQSTELDWVEVGL 
QVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVKTLTTKERKKSRFGNAFHLCREILRLT 
KLVVDANVQFRLGNVDAFQLADGLQYIFSHVGQLTGMYRYKYRLMRQIRMCKDLKHLIYY 
RFNTGPVGKGPGCGFWAPMWRVWLFFLRGIVPLLERWLGNLLARQFEGRHSKGVAKTVTK 
QRVESHFDLELRAAVMHDVLDAMPEGIKQNKARTILQHLSEAWRCWKANIPWKVPGLPVP 
IENMILRYVKSKADWWTNVAHYNRERIRRGATVDKTVCRKNLGRLTRLWLKAEQERQHNY 
LKDGPYVTPEEALAIYTTTVHWLESRKFSPIPFPPLSYKHDTKLLILALERLKESYSVAV 
RLNQQQREELGLIEQAYDNPHEALSRIKRHLLTQRGFKEVGIEFMDLYSYLIPVYEIEPL 
EKITDAYLDQYLWYEGDKRHLFPNWIKPADSEPPPLLVYKWCQGINNLQGIWDTGDGQCV 
VMLQTKFEKFFEKIDLTMLNRLLRLVLDHNIADYVSAKNNVVLSYKDMSHTNSYGLIRGL 
QFASFVVQFYGLLLDLLLLGLTRASEIAGPPQMPNEFMTFWDTKVETRHPIRLYSRYIDK 
VHIMFKFTHEEARDLIQRYLTEHPDPNNENMVGYNNKKCWPRDARMRLMKHDVNLGRSVF 
WDMKNRLPRSITTLEWENGFVSVYSKDNPNLLFSMCGFEVRILPKIRMTQEAFSNTKDGV 
WNLQNEQTKERTAVAFLRVDDEHMKVFENRVRQILMSSGSTTFTKIVNKWNTALIGLMTY 
FREATVHTQELLDLLVKCENKIQTRIKIGLNSKMPSRFPPVIFYTPKEIGGLGMLSMGHI 
LIPQSDLRYSKQTDVGVTHFRSGMSHEEDQLIPNLYRYIQPWESEFIDSQRVWAEYALKR 
QEAQAQNRRLTLEDLEDSWDRGIPRINTLFQKDRHTLAYDKGWRVRTDFKQYQVLKQNPF 
WWTHQRHDGKLWNLNNYRTDVIQALGGVEGILEHTLFKGTYFPTWEGLFWEKASGFEESM 
KYKKLTNAQRSGLNQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGIFMHGKIPTLKISL 
IQIFRAHLWQKIHESVVMDLCQVLDQELDALEIETVQKETIHPRKSYKMNSSCADVLLFA 
AHKWPMSKPSLVAESKDMFDQKASNKYWIDVQLRWGDYDSHDIERYTRAKFMDYTTDNMS 
IYPSPTGVMIGLDLAYNLHSAFGNWFPGSKPLLAQAMNKIMKSNPALYVLRERIRKGLQL 
YSSEPTEPYLSSQNYGEIFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPR 
TGQLFLKVIHTSVWAGQKRLGQLAKWKTAEEVAALVRSLPVEEQPKQIIVTRKGMLDPLE 
VHLLDFPNIVIKGSELQLPFQACLKIEKFGDLILKATEPQMVLFNIYDDWLKSISSYTAF 
SRLILILRALHVNNEKAKMLLKPDKSVVTEPHHIWPSLTDDQWMKVEVALRDLILSDYAK 
KNNVNTSALTQSEIRDIILGAEITPPSQQRQQIAEIEKQAKEASQLTAVTTRTTNVHGDE 
LIVTTTSPYEQSAFGSKTDWRVRAISATNLYLRVNHIYVNSDDIKETGYTYIMPKNILKK 
FICVADLRTQIAGYLYGISPPDNPQVKEIRCVVMVPQWGNHQLVHLPSSLPEHDFLNDLE 
PLGWLHTQPNELPQLSPQDVTSHSRILENNKQWDGEKCIILTCSFTPGSCSLTSYKLTQT 
GYEWGRLNKDNGSNPHGYLPTHYEKVQMLLSDRFLGFYMVPESGPWNYSFTGVKHTLSMK 
YSVKLGSPKEFYHEEHRPTHFLEFSNMEEADITEGDREDTFT*
>AT3G55200.1 |  splicing factor putative 
MYLYSLTLQQATGIVCAINGNFSGGKTQEIAVARGKILDLLRPDENGKIQTIHSVEVFGA 
IRSLAQFRLTGAQKDYIVVGSDSGRIVILEYNKEKNVFDKVHQETFGKSGCRRIVPGQYV 
AVDPKGRAVMIGACEKQKLVYVLNRDTTARLTISSPLEAHKSHTICYSLCGVDCGFDNPI 
FAAIELDYSEADQDPTGQAASEAQKHLTFYELDLGLNHVSRKWSNPVDNGANMLVTVPGG 
ADGPSGVLVCAENFVIYMNQGHPDVRAVIPRRTDLPAERGVLVVSAAVHKQKTMFFFLIQ 
TEYGDVFKVTLDHNGDHVSELKVKYFDTIPVASSICVLKLGFLFSASEFGNHGLYQFQAI 
GEEPDVESSSSNLMETEEGFQPVFFQPRRLKNLVRIDQVESLMPLMDMKVLNIFEEETPQ 
IFSLCGRGPRSSLRILRPGLAITEMAVSQLPGQPSAVWTVKKNVSDEFDAYIVVSFTNAT 
LVLSIGEQVEEVNDSGFLDTTPSLAVSLIGDDSLMQVHPNGIRHIREDGRINEWRTPGKR 
SIVKVGYNRLQVVIALSGGELIYFEADMTGQLMEVEKHEMSGDVACLDIAPVPEGRKRSR 
FLAVGSYDNTVRILSLDPDDCLQILSVQSVSSAPESLLFLEVQASIGGDDGADHPANLFL 
NSGLQNGVLFRTVVDMVTGQLSDSRSRFLGLKPPKLFSISVRGRSAMLCLSSRPWLGYIH 
RGHFHLTPLSYETLEFAAPFSSDQCAEGVVSVAGDALRIFMIDRLGETFNETVVPLRYTP 
RKFVLHPKRKLLVIIESDQGAFTAEEREAARKECFEAGGVGENGNGNADQMENGADDEDK 
EDPLSDEQYGYPKAESEKWVSCIRVLDPKTATTTCLLELQDNEAAYSVCTVNFHDKEYGT 
LLAVGTVKGMQFWPKKNLVAGFIHIYRFVEDGKSLELLHKTQVEGVPLALCQFQGRLLAG 
IGPVLRLYDLGKKRLLRKCENKLFPNTIISIQTYRDRIYVGDIQESFHYCKYRRDENQLY 
IFADDCVPRWLTASHHVDFDTMAGADKFGNVYFVRLPQDLSEEIEEDPTGGKIKWEQGKL 
NGAPNKVDEIVQFHVGDVVTCLQKASMIPGGSESIMYGTVMGSIGALHAFTSRDDVDFFS 
HLEMHMRQEYPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLPMDLQRKIADELDRTPA 
EILKKLEDARNKII*
>AT4G38740.1 |  ROC1 (ROTAMASE CYP 1) peptidyl-prolyl cis-trans isomerase 
MAFPKVYFDMTIDGQPAGRIVMELYTDKTPRTAENFRALCTGEKGVGGTGKPLHFKGSKF 
HRVIPNFMCQGGDFTAGNGTGGESIYGSKFEDENFERKHTGPGILSMANAGANTNGSQFF 
ICTVKTDWLDGKHVVFGQVVEGLDVVKAIEKVGSSSGKPTKPVVVADCGQLS*
>AT2G21130.1 |  peptidyl-prolyl cis-trans isomerase / cyclophilin (CYP2) / rotamase 
MASHPKVFFDMTIGGAPAGKIVMELYTDKTPKTAENFRALCTGEKGVGRSGKPLHFKGSS 
FHRVIPNFMCQGGDFTKGNGTGGESIYGAKFEDENFERKHTGPGILSMANAGANTNGSQF 
FICTVKTDWLDGKHVVFGQVVEGLDVVKAIEKIGSSSGKPTKPVVIADCGEISS*
>AT1G50110.1 |  branched-chain amino acid aminotransferase 6 / branched-chain amino acid transaminase 6 (BCAT6) 
MAPSSSPLRTTSETDEKYANVKWEELGFALTPIDYMYVAKCRQGESFTQGKIVPYGDISI 
SPCSPILNYGQGLFEGLKAYRTEDDRIRIFRPDQNALRMQTGAERLCMTPPTLEQFVEAV 
KQTVLANKKWVPPPGKGTLYIRPLLLGSGATLGVAPAPEYTFLIYASPVGDYHKVSSGLN 
LKVDHKYHRAHSGGTGGVKSCTNYSPVVKSLLEAKSAGFSDVLFLDAATGRNIEELTACN 
IFIVKGNIVSTPPTSGTILPGVTRKSISELAHDIGYQVEERDVSVDELLEAEEVFCTGTA 
VVVKAVETVTFHDKKVKYRTGEAALSTKLHSMLTNIQMGVVEDKKGWMVDIDPCQG*
>AT1G61040.1 |  VIP5 (vernalization independence 5) DNA binding 
MGDLENLLLEAAGRTNSAGRSRHPPSSRRREGSYSDGSSDSRDDSDEDRGYASRKPSGSQ 
VPLKKRLEAEREDRAARVEGGYGDGPSDREGDSSEESDFGDDLYKNEEDRQKLAGMTEFQ 
REMILSERADKKGDKNFTEKLRSKRESEKTPVSKKETQPLPASRGVRSSARSADRAAAKD 
DALNELRAKRMKQQDPAALRKLRDASKGGSGSRDFSSTKRKPLASSNLSSSSQSDSDSRS 
QSDDEGSNGGMLDSDDDRSDVPTFEDVKEVTIRRSKLAKWLMEPFFEELIVGCFVRVGIG 
RSKSGPIYRLCWVKNVDATDPDKTYKLENKTTHKYLNVVWGNETSAARWQMAMISDGHPL 
EEEYRQWIREVERTNGRMPTKQDISEKKEAIQRTNSFVYSAETVKQMLQEKKSASVRPMN 
VAAEKDRLRKELEIAQSKNDEAGVERIKSKIKQLDASRNKKGVDKKALKLAEMNKKNRAE 
NFKNASEVKSITASLKAGEAGYDPFSRRWTRSSNYYNGKNKGKDGEENEAAVAAAVETNG 
ADAGAGVEATEAALEAAAEAGKLIDTRAPIGQGAEHNQLHNFELSLSLTALQKYGGPQGV 
QKAFMARKQLTEATVGCRVAENDGKRHGLTLTVSDYKRRRGLL*
>AT5G62290.1 |  nucleotide-sensitive chloride conductance regulator (ICln) family protein 
MVAGLRDFTLRTEDGSGKPVLDESNGEELMHVQTSVAVALGNRPIESPGTLYITSRKLIW 
LSDVDMAKGYAVDFLSISLHAVSRDPEAYSSPCIYTQIEVEEDEDDESDSESTEVLDLSK 
IREMRLVPSDSTQLETLFDVFCECAELNPEPVQEEEEESGHNWVFSADQMDVRGGDDDAE 
WQISQSPTSVIGHSNGDEGLNQPMLELQINDQRFEDAEEMVHESETKDH*
>AT5G62290.2 |  nucleotide-sensitive chloride conductance regulator (ICln) family protein 
MVAGLRDFTLRTEDGSGKPVLDESNGEELMHVQTSVAVALGNRPIESPGTLYITSRKLIW 
LSDVDMAKGYAVDFLSISLHAVSRDPEAYSSPCIYTQIEVEEDEDDESDSESTEVLDLSK 
IREMRLVPSDSTQLETLFDVFCECAELNPEPVQEEEESGHNWVFSADQMDVRGGDDDAEW 
QISQSPTSVIGHSNGDEGLNQPMLELQINDQRFEDAEEMVHESETKDH*
>AT3G55220.1 |  splicing factor putative 
MYLYSLTLQQATGIVCAINGNFSGGKTQEIAVARGKILDLLRPDENGKIQTIHSVEVFGA 
IRSLAQFRLTGAQKDYIVVGSDSGRIVILEYNKEKNVFDKVHQETFGKSGCRRIVPGQYV 
AVDPKGRAVMIGACEKQKLVYVLNRDTTARLTISSPLEAHKSHTICYSLCGVDCGFDNPI 
FAAIELDYSEADQDPTGQAASEAQKHLTFYELDLGLNHVSRKWSNPVDNGANMLVTVPGG 
ADGPSGVLVCAENFVIYMNQGHPDVRAVIPRRTDLPAERGVLVVSAAVHKQKTMFFFLIQ 
TEYGDVFKVTLDHNGDHVSELKVKYFDTIPVASSICVLKLGFLFSASEFGNHGLYQFQAI 
GEEPDVESSSSNLMETEEGFQPVFFQPRRLKNLVRIDQVESLMPLMDMKVLNIFEEETPQ 
IFSLCGRGPRSSLRILRPGLAITEMAVSQLPGQPSAVWTVKKNVSDEFDAYIVVSFTNAT 
LVLSIGEQVEEVNDSGFLDTTPSLAVSLIGDDSLMQVHPNGIRHIREDGRINEWRTPGKR 
SIVKVGYNRLQVVIALSGGELIYFEADMTGQLMEVEKHEMSGDVACLDIAPVPEGRKRSR 
FLAVGSYDNTVRILSLDPDDCLQILSVQSVSSAPESLLFLEVQASIGGDDGADHPANLFL 
NSGLQNGVLFRTVVDMVTGQLSDSRSRFLGLKPPKLFSISVRGRSAMLCLSSRPWLGYIH 
RGHFHLTPLSYETLEFAAPFSSDQCAEGVVSVAGDALRIFMIDRLGETFNETVVPLRYTP 
RKFVLHPKRKLLVIIESDQGAFTAEEREAARKECFEAGGVGENGNGNADQMENGADDEDK 
EDPLSDEQYGYPKAESEKWVSCIRVLDPKTATTTCLLELQDNEAAYSVCTVNFHDKEYGT 
LLAVGTVKGMQFWPKKNLVAGFIHIYRFVEDGKSLELLHKTQVEGVPLALCQFQGRLLAG 
IGPVLRLYDLGKKRLLRKCENKLFPNTIISIQTYRDRIYVGDIQESFHYCKYRRDENQLY 
IFADDCVPRWLTASHHVDFDTMAGADKFGNVYFVRLPQDLSEEIEEDPTGGKIKWEQGKL 
NGAPNKVDEIVQFHVGDVVTCLQKASMIPGGSESIMYGTVMGSIGALHAFTSRDDVDFFS 
HLEMHMRQEYPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLPMDLQRKIADELDRTPA 
EILKKLEDARNKII*
>AT4G03430.1 |  EMB2770 (EMBRYO DEFECTIVE 2770) RNA splicing factor transesterification mechanism 
MVFLSIPNGKTLSIDVNPNSTTISAFEQLAHQRSDVPQSFLRYSLRMRNPSRVFVDSKDS 
DSILLSDLGVSRFSTVIIHVLLLGGMQAAPPKPRLDFLNSKPPSNYVAGLGRGATGFTTR 
SDIGPARAAPDLPDRSALATAAAPGVGRGAGKPSEAEAEDDEEAEEKRYDENQTFDEFEG 
NDVGLFANAEYDEDDKEADAIWESIDQRMDSRRKDRREAKLKEEIEKYRASNPKITEQFA 
DLKRKLHTLSADEWDSIPEIGDYSLRNKKKKFESFVPIPDTLLEKAKKEKELVMALDPKS 
RAAGGSETPWGQTPVTDLTAVGEGRGTVLSLKLDNLSDSVSGQTVVDPKGYLTDLKSMKR 
TTDEEIYDRNRARLLYKSLTQSNPKNPNGWIAAARVEEVDGKIKAARFQIQRGCEECPKN 
EDVWLEACRLANPEDAKGVIAKGVKLIPNSVKLWLEAAKLEHDVENKSRVLRKGLEHIPD 
SVRLWKAVVELANEEDARILLHRAVECCPLHLELWVALARLETYAESKKVLNKAREKLPK 
EPAIWITAAKLEEANGKLDEANDNTAMVGKIIDRGIKTLQREGVVIDRENWMSEAEACER 
VGSVATCQAIIKNTIGIGVEEEDRKRTWVADADECKKRGSIETARAIYAHALSVFLTKKS 
IWLKAAQLEKSHGSRESLDALLRKAVTYVPQAEVLWLMGAKEKWLAGDVPAARAILQEAY 
AAIPNSEEIWLAAFKLEFENKEPERARMLLAKARERGGTERVWMKSAIVERELGNVEEER 
RLLNEGLKQFPTFFKLWLMLGQLEERFKHLEQARKAYDTGLKHCPHCIPLWLSLADLEEK 
VNGLNKARAILTTARKKNPGGAELWLAAIRAELRHDNKREAEHLMSKALQDCPKSGILWA 
ADIEMAPRPRRKTKSIDAMKKCDRDPHVTIAVAKLFWQDKKVEKARAWFERAVTVGPDIG 
DFWALFYKFELQHGSDEDRKEVVAKCVACEPKHGEKWQAISKAVENAHQPIEVILKRVVN 
ALSKEENSA*
>AT1G15440.1 |  transducin family protein / WD-40 repeat family protein 
MEFRFENLLGAPYRGGNAVITKNTQLISPVGNRVSVTDLSKNHSVTLPLETSTNICRLAS 
SPDGTFLLAVDEQNRCLFINLPRRVVLHRITFKDKVGALKFSPNGKFIAVGIGKLVEIWR 
SPGFRRAVLPFERVRTFANSDDKVVSLEWSLDSDYLLVGSRDLAARLFCVRKLKGVLNKP 
FLFLGHRDSVVGCFFGVDKMTNKVNRAFTIARDGYIFSWGYTEKDVKMDESEDGHSEPPS 
PVTPDRADEVMVENGGGVGTELKKRKEYDGKGLESDEEGDDDDEEYMHRGKWVLLRKDGC 
NQASAKVTACDYHQGLDMVVVGFSNGVFGLYQMPDFICIHLLSISRQKLTTAVFNERGNW 
LTFGCAKLGQLLVWDWRTETYILKQQGHYFDVNCVTYSPDSQLLATGADDNKVKVWNVMS 
GTCFITFTEHTNAVTALHFMADNHSLLSASLDGTVRAWDFKRYKNYKTYTTPTPRQFVSL 
TADPSGDVVCAGTLDSFEIFVWSKKTGQIKDILSGHEAPVHGLMFSPLTQLLASSSWDYT 
VRLWDVFASKGTVETFRHNHDVLTVAFRPDGKQLASSTLDGQINFWDTIEGVLMYTIEGR 
RDIAGGRVMTDRRSAANSSSGKCFTTLCYSADGGYILAAGTSRYICMYDIADQVLLRRFQ 
ISHNLSLDGVLDFLHSKKMTEAGPIDLIDDDNSDEEGGIDKQSRGNLGYDLPGSRPNRGR 
PIIRTKSLSIAPTGRSFAAATTEGVLIFSIDDTFIFDPTDLDIDVTPEAVEAAIEEDEVS 
RALALSMRLNEDSLIKKCIFAVAPADIKAVAISVRQKYLERLMEALVDLLENCPHLEFIL 
HWCQEICKAHGSSIQRNYRTLLPALRSLQKAITRAHQDLADMCSSNEYTLRYLCSVPNNH 
*
>AT1G15440.2 |  transducin family protein / WD-40 repeat family protein 
MEFRFENLLGAPYRGGNAVITKNTQLISPVGNRVSVTDLSKNHSVTLPLETSTNICRLAS 
SPDGTFLLAVDEQNRCLFINLPRRVVLHRITFKDKVGALKFSPNGKFIAVGIGKLVEIWR 
SPGFRRAVLPFERVRTFANSDDKVVSLEWSLDSDYLLVGSRDLAARAFTIARDGYIFSWG 
YTEKDVKMDESEDGHSEPPSPVTPDRADEVMVENGGGVGTELKKRKEYDGKGLESDEEGD 
DDDEEYMHRGKWVLLRKDGCNQASAKVTACDYHQGLDMVVVGFSNGVFGLYQMPDFICIH 
LLSISRQKLTTAVFNERGNWLTFGCAKLGQLLVWDWRTETYILKQQGHYFDVNCVTYSPD 
SQLLATGADDNKVKVWNVMSGTCFITFTEHTNAVTALHFMADNHSLLSASLDGTVRAWDF 
KRYKNYKTYTTPTPRQFVSLTADPSGDVVCAGTLDSFEIFVWSKKTGQIKDILSGHEAPV 
HGLMFSPLTQLLASSSWDYTVRLWDVFASKGTVETFRHNHDVLTVAFRPDGKQLASSTLD 
GQINFWDTIEGVLMYTIEGRRDIAGGRVMTDRRSAANSSSGKCFTTLCYSADGGYILAAG 
TSRYICMYDIADQVLLRRFQISHNLSLDGVLDFLHSKKMTEAGPIDLIDDDNSDEEGGID 
KQSRGNLGYDLPGSRPNRGRPIIRTKSLSIAPTGRSFAAATTEGVLIFSIDDTFIFDPTD 
LDIDVTPEAVEAAIEEDEVSRALALSMRLNEDSLIKKCIFAVAPADIKAVAISVRQKYLE 
RLMEALVDLLENCPHLEFILHWCQEICKAHGSSIQRNYRTLLPALRSLQKAITRAHQDLA 
DMCSSNEYTLRYLCSVPNNH*
>AT1G60170.1 |  emb1220 (embryo defective 1220) 
MATLEDSFLADLDELSDNEAELDENDGDVGKEEEDVDMDMADLETLNYDDLDNVSKLQKS 
QRYADIMHKVEEALGKDSDGAEKGTVLEDDPEYKLIVDCNQLSVDIENEIVIVHNFIKDK 
YKLKFQELESLVHHPIDYACVVKKIGNETDLALVDLADLLPSAIIMVVSVTALTTKGSAL 
PEDVLQKVLEACDRALDLDSARKKVLEFVESKMGSIAPNLSAIVGSAVAAKLMGTAGGLS 
ALAKMPACNVQVLGHKRKNLAGFSSATSQSRVGYLEQTEIYQSTPPGLQARAGRLVAAKS 
TLAARVDATRGDPLGISGKAFREEIRKKIEKWQEPPPARQPKPLPVPDSEPKKRRGGRRL 
RKMKERYQVTDMRKLANRMAFGTPEESSLGDGLGEGYGMLGQAGSNRLRVSSVPSKLKIN 
AKVAKKLKERQYAGGATTSGLTSSLAFTPVQGIELCNPQQALGLGSGTQSTYFSESGTFS 
KLKKI*
>AT1G60730.1 |  aldo/keto reductase family protein 
MAEACGVRRIKLGSQGLEVSAQGLGCMGLSAFYGTPKPETEAIALIHHAIHSGVTFLDTS 
DIYGPETNELLLSKALKDGVREKVELATKYGIRYAEGKVEFKGDPAYVRAACEASLMRVD 
VACIDLYYQHRIDTRVPIEITIGELKKLVEEGKIKYIGLSEASASTIRRAHAVHPITALQ 
IEWSLWSRDVEEDIIPTCRELGIGIVAYSPLGRGFFASGPKLVENLDNNDVRKTLPRFQQ 
ENLDHNKILFEKVSAMSEKKGCTPAQLALAWVHHQGDDVCPIPGTTKIENLNQNIGALSV 
KLTPEEMSELESLAQPGFVKGERSISILTTFKNSETPPLSSWKAA*
>AT1G60730.2 |  aldo/keto reductase family protein 
MAEACGVRRIKLGSQGLEVSAQGLGCMGLSAFYGTPKPETEAIALIHHAIHSGVTFLDTS 
DIYGPETNELLLSKALKDGVREKVELATKYGIRYAEGKVEFKGDPAYVRAACEASLMRVD 
VACIDLYYQHRIDTRVPIEITIGELKKLVEEGKIKYIGLSEASASTIRRAHAVHPITALQ 
IEWSLWSRDVEEDIIPTCRELGIGIVAYSPLGRGFFASGPKLVENLDNNDVRKANSTKIP 
TGKLRPQQDSL*
>AT2G31020.1 |  ORP1A (OSBP(OXYSTEROL BINDING PROTEIN)-RELATED PROTEIN 1A) oxysterol binding 
MYAATPETPFGSARSQPVITRSVSQRYNHPGQSNHHHLLHSLSFNHQNVLALPAAAREPP 
VDVKINDIAGNSIAGILYKWVNYGKGWRPRWFVLQDGVLSYYKIKGPDKIVVIHETEKGS 
RVIGEESTRMISRNKRHAATNNTNHQLRRKPFGEVHLKVSSIRESRSDDKRFSIFTGTKR 
LHLRAETREDREAWIEALQAVKDMFPRMSNCELMAPTNNLDISIEKLRLRLVEEGVSESA 
IQDCEQITRSEFSAIQSQLLLLKQKQWLLIDTLRQLETEKVDLENTVVDETQRQAGNGDS 
EETISESDDDNEQFDEAEEEMDTCDSLSSSSFKSIGSVFRTSSFSSDDDGLTNGFESEND 
DVDPSIKTIGFNYPHVKRRKKLPDPVEKEKSVSLWSMIKDNIGKDLTKVCLPVYFNEPLS 
SLQKCFEDLEYSYLLDQASEWGKRGNNLMRILNVAAFAVSGYASTEGRICKPFNPMLGET 
YEADYPDKGLRFFSEKVSHHPMIVACHCDGTGWKFWGDSNLKSKFWGRSIQLDPIGLLTL 
QFDDGEIVQWSKVTTSIYNLILGKLYCDHYGTMKIEGNGEYSCKLKFKEQSMIDRNPHQV 
QGIVEDKNGKTVARLFGKWDESIHYVMVDQGKVNESHLLWKRNKQPENPTKYNLTRFGIT 
LNELTPGLKEKLPPTDSRLRPDQRYLEKGEYEMGNAEKLRLEQRQRQAREMQERGWKPKW 
FRKEKGSETYRYIGGYWEARDSGSWDDCPDIFGQVHQSIK*
>AT2G32170.1 |  LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321603) Has 362 Blast hits to 319 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 132 Fungi - 130 Plants - 25 Viruses - 0 Other Eukaryotes - 75 (source NCBI BLink) 
MVSPSEMTETAPAIMTTESERIESSRELVDNEEEEKIRRQKKLEEALEAKSLRRIISAYL 
NYPEASEEDLKRWERSYRKLSPAHKALVPHYPMKFQRLRRCISANSYFIFNMLQAFEPPI 
DLSQELDGCEDSNLDCAPHERYTLDERHDSSCQPALTNSCTYKEESKHIRDPITGVSIEE 
LQRKEAHDHSPKDDSADTRINDKTCDCHEGQLNHDHGSVSFSSHDWLDSSLQTHVPLVDV 
DKVRCIIRNIVRDWAAEGQRERDQCYKPILEELDSLFPDRLKESTPPACLVPGAGLGRLA 
LEISCLGFISQGNEFSYYMMICSSFILNYTQVPGEWTIYPWIHSNCNSLSDNDQLRPIAI 
PDIHPASAGITEGFSMCGGDFVEVYNESSHAGMWDAVVTCFFIDTAHNVIEYIQTISKIL 
KDGGVWINLGPLLYHFADTYGHENEMSIELSLEDVKRVASHFGFVIEKERTIETTYTTNP 
RAMMQNRYYTAFWTMRKKCAITTT*
>AT2G32170.2 |  LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321603) Has 362 Blast hits to 319 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 132 Fungi - 130 Plants - 25 Viruses - 0 Other Eukaryotes - 75 (source NCBI BLink) 
MVSPSEMTETAPAIMTTESERIESSRELVDNEEEEKIRRQKKLEEALEAKSLRRIISAYL 
NYPEASEEDLKRWERSYRKLSPAHKALVPHYPMKFQRLRRCISANSYFIFNMLQAFEPPI 
DLSQELDGCEDSNLDCAPHERYTLDERHDSSCQPALTNSCTYKEESKHIRDPITGVSIEE 
LQRKEAHDHSPKDDSADTRINDKTCDCHEGQLNHDHGSVSFSSHDWLDSSLQTHVPLVDV 
DKRETKIIMCHVRTYSQSLLVPLGSLVSYEGWFAHVALHGEVNPVELKCTFLHVRCIIRN 
IVRDWAAEGQRERDQCYKPILEELDSLFPDRLKESTCCIHSPYKVDYMICSTPPACLVPG 
AGLGRLALEISCLGFISQGNEFSYYMMICSSFILNYTQVPGEWTIYPWIHSNCNSLSDND 
QLRPIAIPDIHPASAGITEGFSMCGGDFVEVYNESSHAGMWDAVVTCFFIDTAHNVIEYI 
QTISKILKDGGVWINLGPLLYHFADTYGHENEMSIELSLEDVKRVASHFGFVIEKERTIE 
TTYTTNPRAMMQNRYYTAFWTMRKKCAITTT*
>AT2G47090.1 |  nucleic acid binding / protein binding / zinc ion binding 
MDDSCAVCAENLEWVGYGSCGHREVCSTCVVRLRFILNDRRCCICKTECPVVFVTKALGD 
YTKTISDFSTTFPSVPKEGRVGSFWYHEETNVYFDDLNHYTRIKAMCRLSCNLCNDTNKT 
RPKKEPNHCVRFKSVEHLKDHLNHQHKLHMCSLCLVGRKVFICEQKLFTKGQLNQHISSG 
DSEVDGSESERGGFTGHPMCEFCKRPFYGDNELYTHMSREHYTCHICQRLKPGQYEYYGN 
YDDLEVHFRSDHFLCEDETCLAKKFIVFQIEAELKRHNAIDHGGRMSRSQQNASLQIQAS 
FQYPNSRRGRRRSSLREPNLVLLESQASYAFNDDNNLPQHVGRSGNSRLGESSFPPLSVQ 
ANQGQSRFGQNSESLVSNTTTTRQRHRANQGQSRFGQNSESLVSNTTTTRQRHQTNRSAT 
SGSSQAWPALNRGPAEISITSRVQSSGASAQSQSRHHDRVESTRTLASAVPQDARTTVGG 
CSSGSSLSSANATKRNNHHSSSTPKMSETRSLAQPSHSDSPQISAVKNRRSSSTSANAGN 
IQVAQGVSDVQSDNKSLVEKIHASLGHDEELFMAFKNTSGKYRHGSIDARTYLEYVKGYG 
LSHLVLDMARLCPDPQRQKELIDTHNACLKGGNKGKAVKVESSSDSKGDRFVDTVRKLQF 
SDKSQDKDKDKDAYRSDKGKTKVTTLVNSSSAGVGLGDTGKQPKKTSKFLRTRLGEKSMA 
AVLDLRNSNPEPEPEPKNDNSKRSQNSPGGLPLRGAWKRGSAKLFV*
>AT5G12030.1 |  AT-HSP176A (ARABIDOPSIS THALIANA HEAT SHOCK PROTEIN 176A) unfolded protein binding 
MDLEFGRFPIFSILEDMLEAPEEQTEKTRNNPSRAYMRDAKAMAATPADVIEHPDAYVFA 
VDMPGIKGDEIQVQIENENVLVVSGKRQRDNKENEGVKFVRMERRMGKFMRKFQLPDNAD 
LEKISAACNDGVLKVTIPKLPPPEPKKPKTIQVQVA*
>AT5G41770.1 |  crooked neck protein putative / cell cycle protein putative 
MASGGKDSDRTLGYMTRKDTEVKLPRPTRVKNKTPAPIQITAEQILREARERQEAEIRPP 
KQKITDSTELSDYRLRRRKEFEDQIRRARWNIQVWVKYAQWEESQKDYARARSVWERAIE 
GDYRNHTLWLKYAEFEMKNKFVNSARNVWDRAVTLLPRVDQLWYKYIHMEEILGNIAGAR 
QIFERWMDWSPDQQGWLSFIKFELRYNEIERARTIYERFVLCHPKVSAYIRYAKFEMKGG 
EVARCRSVYERATEKLADDEEAEILFVAFAEFEERCKEVERARFIYKFALDHIPKGRAED 
LYRKFVAFEKQYGDKEGIEDAIVGKRRFQYEDEVRKSPSNYDSWFDYVRLEESVGNKDRI 
REIYERAIANVPPAEEKRYWQRYIYLWINYALFEEIETEDIERTRDVYRECLKLIPHSKF 
SFAKIWLLAAQFEIRQLNLTGARQILGNAIGKAPKDKIFKKYIEIELQLGNMDRCRKLYE 
RYLEWSPENCYAWSKYAELERSLVETERARAIFELAISQPALDMPELLWKAYIDFEISEG 
ELERTRALYERLLDRTKHYKVWVSFAKFEASAAELEEDENEDEDQEEDVIEHKKDCIKRA 
RAIFDRANTYYKDSTPELKEERATLLEDWLNMESSFGNLGDVSIVQSKLPKKLKKRKAIT 
REDGSTEYEEYIDYLYPEESQTTNLKILEAAYKWKKQKVAASEDD*
>AT5G06160.1 |  ATO (ATROPOS) nucleic acid binding / zinc ion binding 
MSSTLLEQTRSNHEEVERLERLVVEDLQKEPPSSKDRLVQGHRVRHMIESIMLTTEKLVE 
TYEDKDGAWDDEIAALGGQTATGTNVFSEFYDRLKEIREYHKRHPSGRLVDANEDYEARL 
KEEPIIAFSGEEGNGRYLDLHDMYNQYINSKFGERVEYSAYLDVFSQPEKIPRKLKLSRQ 
YMKYMEALLEYLVYFFQRTEPLQDLDRILSKVCSDFEEQYADGIVEGLDNELIPSQHTVI 
DLDYYSTVEELVDVGPEKLKEALGALGLKVGGTPQQRAERLFLTKHTPLEKLDKKHFARP 
PHNGKQNGDAKSTHESENAKEIALTEAKVKKLCNLLDETIERTKQNIVKKQSLTYEEMEG 
EREGEEANTELESDDEDGLIYNPLKLPIGWDGKPIPYWLYKLHGLGQEFKCEICGNYSYW 
GRRAFERHFKEWRHQHGMRCLGIPNTKNFNEITSIEEAKELWKRIQERQGVNKWRPELEE 
EYEDREGNIYNKKTYSDLQRQGLI*
>AT4G31790.1 |  diphthine synthase putative  (DPH5) 
MLYIIGLGLGDEKDITLRGLEAVKKSQKVYMEAYTSLLSFGLSADGLSNLEKFYGKPIIL 
ADREMVEEKAGDMIDEAIDNDVAFLVVGDPFGATTHSDLVVRAKTLGVKVEVVHNASVMN 
AVGICGLQLYHYGETVSIPFFTETWRPDSFYEKIKKNRSLGLHTLCLLDIRVKEPTFESL 
CRGGKKQYEPPRYMSVNTAIEQLLEVEQKHGDSVYGEDTQCVGFARLGSEDQTIVAGTMK 
QLESVDFGAPLHCLVIVGETHPVEEEMLEFYKYKSGN*
>AT4G31790.2 |  diphthine synthase putative  (DPH5) 
MLYIIGLGLGDEKDITLRGLEAVKKSQKVYMEAYTSLLSFGLSADGLSNLEKFYGKPIIL 
ADREMVEEKAGDMIDEAIDNDVAFLVVGDPFGATTHSDLVVRAKTLGVKVEVVHNASVMN 
AVGICGLQLYHYGETVSIPFFTETWRPDSFYEKIKKNRSLGLHTLCLLDIRVKEPTFESL 
CRGGKKQYEPPRYMSVNTAIEQLLEVEQKHGDSVYGEDTQCVGFARLGSEDQTIVAGTMK 
QLESVDFGAPLHCLVIVGETHPVEEEMLEFYKYKSGN*