>AT2G18740.1 | small nuclear ribonucleoprotein E putative / snRNP-E putative / Sm protein E putative
MASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRITGFDEYMNLVLDEAEEV
SIKKNTRKPLGRILLKGDNITLMMNTGK*
>AT2G18740.1 | small nuclear ribonucleoprotein E putative / snRNP-E putative / Sm protein E putative
MASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRITGFDEYMNLVLDEAEEV
SIKKNTRKPLGRILLKGDNITLMMNTGK*
>AT2G18740.2 | small nuclear ribonucleoprotein E putative / snRNP-E putative / Sm protein E putative
MASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRITGFDEYMNLVLDEAEEV
SIKKNTRKPLGFYSKETT*
>AT2G18740.2 | small nuclear ribonucleoprotein E putative / snRNP-E putative / Sm protein E putative
MASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRITGFDEYMNLVLDEAEEV
SIKKNTRKPLGFYSKETT*
>AT3G07590.1 | small nuclear ribonucleoprotein D1 putative / snRNP core protein D1 putative / Sm protein D1 putative
MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDVSMNTHLKTVKMSLKGKNPVTLDHLSL
RGNNIRYYILPDSLNLETLLVEDTPRVKPKKPVAGKAVGRGRGRGRGRGRGRGR*
>AT1G20960.1 | emb1507 (embryo defective 1507) ATP binding / ATP-dependent helicase/ helicase/ nucleic acid binding / nucleoside-triphosphatase/ nucleotide binding
MANLGGGAEAHARFKQYEYRANSSLVLTTDNRPRDTHEPTGEPETLWGKIDPRSFGDRVA
KGRPQELEDKLKKSKKKERDVVDDMVNIRQSKRRRLREESVLTDTDDAVYQPKTKETRAA
YEAMLGLIQKQLGGQPPSIVSGAADEILAVLKNDAFRNPEKKMEIEKLLNKIENHEFDQL
VSIGKLITDFQEGGDSGGGRANDDEGLDDDLGVAVEFEENEEDDEESDPDMVEEDDDEED
DEPTRTGGMQVDAGINDEDAGDANEGTNLNVQDIDAYWLQRKISQAYEQQIDPQQCQVLA
EELLKILAEGDDRVVEDKLLMHLQYEKFSLVKFLLRNRLKVVWCTRLARAEDQEERNRIE
EEMRGLGPELTAIVEQLHATRATAKEREENLQKSINEEARRLKDETGGDGGRGRRDVADR
DSESGWVKGQRQMLDLESLAFDQGGLLMANKKCDLPPGSYRSHGKGYDEVHVPWVSKKVD
RNEKLVKITEMPDWAQPAFKGMQQLNRVQSKVYDTALFKAENILLCAPTGAGKTNVAMLT
ILQQLEMNRNTDGTYNHGDYKIVYVAPMKALVAEVVGNLSNRLKDYGVIVRELSGDQSLT
GREIEETQIIVTTPEKWDIITRKSGDRTYTQLVRLLIIDEIHLLHDNRGPVLESIVARTL
RQIETTKENIRLVGLSATLPNYEDVALFLRVDLKKGLFKFDRSYRPVPLHQQYIGISVKK
PLQRFQLMNDLCYQKVLAGAGKHQVLIFVHSRKETSKTARAIRDTAMANDTLSRFLKEDS
VTRDVLHSHEDIVKNSDLKDILPYGFAIHHAGLSRGDREIVETLFSQGHVQVLVSTATLA
WGVNLPAHTVIIKGTQVYNPEKGAWMELSPLDVMQMLGRAGRPQYDQHGEGIIITGYSEL
QYYLSLMNEQLPIESQFISKLADQLNAEIVLGTVQNAREACHWLGYTYLYIRMVRNPTLY
GLAPDALAKDVVLEERRADLIHSAATILDKNNLVKYDRKSGYFQVTDLGRIASYYYITHG
TIATYNEHLKPTMGDIDLYRLFSLSDEFKYVTVRQDEKMELAKLLDRVPIPIKETLEEPS
AKINVLLQAYISQLKLEGLSLTSDMVYITQSAGRLVRALYEIVLKRGWAQLAEKALNLSK
MVGKRMWSVQTPLRQFHGLSNDILMQLEKKDLVWERYYDLSAQELGELIRSPKMGKPLHK
FIHQFPKVTLSAHVQPITRTVLNVELTVTPDFLWDEKIHKYVEPFWIIVEDNDGEKILHH
EYFLLKKQYIDEDHTLHFTVPIFEPLPPQYFVRVVSDKWLGSETVLPVSFRHLILPEKYP
PPTELLDLQPLPVTALRNPNYEILYQDFKHFNPVQTQVFTVLYNTNDNVLVAAPTGSGKT
ICAEFAILRNHHEGPDATMRVVYIAPLEAIAKEQFRIWEGKFGKGLGLRVVELTGETALD
LKLLEKGQIIISTPEKWDALSRRWKQRKYVQQVSLFIVDELHLIGGQHGPVLEVIVSRMR
YISSQVINKIRIVALSTSLANAKDLGEWIGASSHGLFNFPPGVRPVPLEIHIQGVDISSF
EARMQAMTKPTYTAIVQHAKNKKPAIVFVPTRKHVRLTAVDLMAYSHMDNPQSPDFLLGK
LEELDPFVEQIREETLKETLCHGIGYLHEGLSSLDQEIVTQLFEAGRIQVCVMSSSLCWG
TPLTAHLVVVMGTQYYDGRENSHSDYPVPDLLQMMGRASRPLLDNAGKCVIFCHAPRKEY
YKKFLYEAFPVESQLQHFLHDNFNAEVVAGVIENKQDAVDYLTWTFMYRRLPQNPNYYNL
QGVSHRHLSDHLSELVENTLSDLEASKCIEVEDEMELSPLNLGMIASYYYISYTTIERFS
SLLSSKTKMKGLLEILTSASEYDMIPIRPGEEDTVRRLINHQRFSFENPKCTDPHVKANA
LLQAHFSRQNIGGNLAMDQRDVLLSATRLLQAMVDVISSNGWLNLALLAMEVSQMVTQGM
WERDSMLLQLPHFTKDLAKRCQENPGKNIETVFDLVEMEDEERQELLKMSDAQLLDIARF
CNRFPNIDLTYEIVGSEEVNPGKEVTLQVMLERDMEGRTEVGPVDSLRYPKTKEEGWWLV
VGDTKTNQLLAIKRVSLQRKVKVKLDFTAPSEPGEKSYTLYFMCDSYLGCDQEYSFSVDV
KGSGAGDRMEE*
>AT1G09760.1 | U2A (U2 small nuclear ribonucleoprotein A) protein binding
MVKLTADLIWKSPHFFNAIKERELDLRGNKIPVIENLGATEDQFDTIDLSDNEIVKLENF
PYLNRLGTLLINNNRITRINPNLGEFLPKLHSLVLTNNRLVNLVEIDPLASIPKLQYLSL
LDNNITKKANYRLYVIHKLKSLRVLDFIKIKAKERAEAASLFSSKEAEEEVKKVSREEVK
KVSETAENPETPKVVAPTAEQILAIKAAIINSQTIEEIARLEQALKFGQVPAGLIIPDPA
TNDSAPMEE*
>AT5G64270.1 | splicing factor putative
MADLDPEIAKTQEERRKMEADLASLTSLTFDRDLYGGNDRASYSTSIAPNEEDDANLDTT
GSLVAQRLASYTAPRSILNDVARPHNEDDDVGFKPRQSIAEREGEYRNRRLNRVLSPDRV
DAFAMGDKTPDASVRTYTDHMRETALQREKEETMRLIAKKKKEEEEAAAKHQKDSAPPPP
ASSSSSSSKRRHRWDLPEEDGAAAKKAKAASSDWDLPDAAPGIGRWDAPTPGRVSDATPS
AGRRNRWDETPTPGRVTDSDATPGGGVTPGATPSGVTWDGLATPTPKRQRSRWDETPATM
GSATPMGGVTPGAAYTPGVTPIGGIDMATPTPGQLIFRGPMTPEQLNMQRWEKDIEERNR
PLSDEELDAMFPKDGYKVLDPPATYVPIRTPARKLQQTPTPMATPGYVIPEENRGQQYDV
PPEVPGGLPFMKPEDYQYFGSLLNEENEEELSPEEQKERKIMKLLLKVKNGTPPQRKTAL
RQLTDKARELGAGPLFNKILPLLMQPTLEDQERHLLVKVIDRILYKLDEMVRPYVHKILV
VIEPLLIDEDYYARVEGREIISNLSKAAGLASMIAAMRPDIDNIDEYVRNTTARAFSVVA
SALGIPALLPFLKAVCQSKRSWQARHTGIKIVQQIAILIGCAVLPHLRSLVEIIEHGLSD
ENQKVRTITALSLAALAEAAAPYGIESFDSVLKPLWKGIRSHRGKVLAAFLKAIGFIIPL
MDAIYASYYTKEVMVILIREFQSPDEEMKKIVLKVVKQCVSTEGVEPEYIRSDILPEFFR
NFWTRKMALERRNYKQLVETTVEVANKVGVADIVGRVVEDLKDESEQYRRMVMETIDKVV
TNLGASDIDARLEELLIDGILYAFQEQTSDDANVMLNGFGAVVNALGQRVKPYLPQICGT
IKWRLNNKSAKVRQQAADLISRIAVVMKQCGEEQLMGHLGVVLYEYLGEEYPEVLGSILG
ALKAIVNVIGMTKMTPPIKDLLPRLTPILKNRHEKVQENCIDLVGRIADRGAEFVPAREW
MRICFELLEMLKAHKKGIRRATVNTFGYIAKAIGPQDVLATLLNNLKVQERQNRVCTTVA
IAIVAETCSPFTVLPALMNEYRVPELNVQNGVLKSLSFLFEYIGEMGKDYIYAVTPLLED
ALMDRDLVHRQTAASAVKHMALGVAGLGCEDALVHLLNFIWPNIFETSPHVINAVMEAIE
GMRVALGAAVILNYCLQGLFHPARKVREVYWKIYNSLYIGAQDTLVAAYPVLEDEQNNVY
SRPELTMFV*
>AT1G09770.1 | ATCDC5 (ARABIDOPSIS THALIANA CELL DIVISION CYCLE 5) DNA binding / transcription factor
MRIMIKGGVWKNTEDEILKAAVMKYGKNQWARISSLLVRKSAKQCKARWYEWLDPSIKKT
EWTREEDEKLLHLAKLLPTQWRTIAPIVGRTPSQCLERYEKLLDAACTKDENYDAADDPR
KLRPGEIDPNPEAKPARPDPVDMDEDEKEMLSEARARLANTRGKKAKRKAREKQLEEARR
LASLQKRRELKAAGIDGRHRKRKRKGIDYNAEIPFEKRAPAGFYDTADEDRPADQVKFPT
TIEELEGKRRADVEAHLRKQDVARNKIAQRQDAPAAILQANKLNDPEVVRKRSKLMLPPP
QISDHELEEIAKMGYASDLLAENEELTEGSAATRALLANYSQTPRQGMTPMRTPQRTPAG
KGDAIMMEAENLARLRDSQTPLLGGENPELHPSDFTGVTPRKKEIQTPNPMLTPSMTPGG
AGLTPRIGLTPSRDGSSFSMTPKGTPFRDELHINEDMDMHESAKLERQRREEARRSLRSG
LTGLPQPKNEYQIVAQPPPEESEEPEEKIEEDMSDRIAREKAEEEARQQALLKKRSKVLQ
RDLPRPPAASLAVIRNSLLSADGDKSSVVPPTPIEVADKMVREELLQLLEHDNAKYPLDD
KAEKKKGAKNRTNRSASQVLAIDDFDENELQEADKMIKEEGKFLCVSMGHENKTLDDFVE
AHNTCVNDLMYFPTRSAYELSSVAGNADKVAAFQEEMENVRKKMEEDEKKAEHMKAKYKT
YTKGHERRAETVWTQIEATLKQAEIGGTEVECFKALKRQEEMAASFRKKNLQEEVIKQKE
TESKLQTRYGNMLAMVEKAEEIMVGFRAQALKKQEDVEDSHKLKEAKLATGEEEDIAIAM
EASA*
>AT2G23930.1 | SNRNP-G (PROBABLE SMALL NUCLEAR RIBONUCLEOPROTEIN G)
MSRSGQPPDLKKYMDKKLQIKLNANRMVTGTLRGFDQFMNLVVDNTVEVNGNDKTDIGMV
VIRGNSIVTVEALEPVGRSS*
>AT2G23930.1 | SNRNP-G (PROBABLE SMALL NUCLEAR RIBONUCLEOPROTEIN G)
MSRSGQPPDLKKYMDKKLQIKLNANRMVTGTLRGFDQFMNLVVDNTVEVNGNDKTDIGMV
VIRGNSIVTVEALEPVGRSS*
>AT2G23930.2 | SNRNP-G (PROBABLE SMALL NUCLEAR RIBONUCLEOPROTEIN G)
MDKKLQIKLNANRMVTGTLRGFDQFMNLVVDNTVEVNGNDKTDIGMVVIRGNSIVTVEAL
EPVGRSS*
>AT2G23930.2 | SNRNP-G (PROBABLE SMALL NUCLEAR RIBONUCLEOPROTEIN G)
MDKKLQIKLNANRMVTGTLRGFDQFMNLVVDNTVEVNGNDKTDIGMVVIRGNSIVTVEAL
EPVGRSS*
>AT1G21190.1 | small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative
MSVEEDATVREPLDLIRLSIEERIYVKLRSDRELRGKLHAFDQHLNMILGDVEEVITTIE
IDDETYEEIVRTTKRTVPFLFVRGDGVILVSPPLRTT*
>AT1G03330.1 | small nuclear ribonucleoprotein D putative / snRNP core SM-like protein putative / U6 snRNA-associated Sm-like protein putative
MLFFSYFKDLVGQEVTVELKNDLAIRGTLHSVDQYLNIKLENTRVVDQDKYPHMLSVRNC
FIRGSVVRYVQLPKDGVDVDLLHDAARREARGG*
>AT4G30330.1 | small nuclear ribonucleoprotein E putative / snRNP-E putative / Sm protein E putative
MASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRITGFDEYMNLVLDEAEEV
SIKKKTRKPLGRILLKGDNITLMMNAGK*
>AT3G50670.1 | U1-70K (U1 SMALL NUCLEAR RIBONUCLEOPROTEIN-70K) RNA binding / nucleic acid binding / nucleotide binding
MGDSGDPFLRNPNAAVQARAKVQNRANVLQLKLMGQSHPTGLTNNLLKLFEPRPPLEYKP
PPEKRKCPPYTGMAQFVSNFAEPGDPEYAPPKPEVELPSQKRERIHKLRLEKGVEKAAED
LKKYDPNNDPNATGDPYKTLFVSRLNYESSESKIKREFESYGPIKRVHLVTDQLTNKPKG
YAFIEYMHTRDMKAAYKQADGQKIDGRRVLVDVERGRTVPNWRPRRLGGGLGTSRVGGGE
EIVGEQQPQGRTSQSEEPSRPREEREKSREKGKERERSRELSHEQPRERSRDRPREDKHH
RDRDQGGRDRDRDSRRDRDRTRDRGDRDRRDRDRGRDRTSRDHDRDRSRKKERDYEGGEY
EHEGGGRSRERDAEYRGEPEETRGYYEDDQGDTDRYSHRYDKMEEDDFRYEREYKRSKRS
ESREYVR*
>AT3G50670.1 | U1-70K (U1 SMALL NUCLEAR RIBONUCLEOPROTEIN-70K) RNA binding / nucleotide binding
MGDSGDPFLRNPNAAVQARAKVQNRANVLQLKLMGQSHPTGLTNNLLKLFEPRPPLEYKP
PPEKRKCPPYTGMAQFVSNFAEPGDPEYAPPKPEVELPSQKRERIHKLRLEKGVEKAAED
LKKYDPNNDPNATGDPYKTLFVSRLNYESSESKIKREFESYGPIKRVHLVTDQLTNKPKG
YAFIEYMHTRDMKAAYKQADGQKIDGRRVLVDVERGRTVPNWRPRRLGGGLGTSRVGGGE
EIVGEQQPQGRTSQSEEPSRPREEREKSREKGKERERSRELSHEQPRERSRDRPREDKHH
RDRDQGGRDRDRDSRRDRDRTRDRGDRDRRDRDRGRDRTSRDHDRDRSRKKERDYEGGEY
EHEGGGRSRERDAEYRGEPEETRGYYEDDQGDTDRYSHRYDKMEEDDFRYEREYKRSKRS
ESREYVR*
>AT3G50670.2 | U1-70K (U1 SMALL NUCLEAR RIBONUCLEOPROTEIN-70K) RNA binding / nucleic acid binding / nucleotide binding
MGDSGDPFLRNPNAAVQARAKVQNRANVLQLKLMGQSHPTGLTNNLLKLFEPRPPLEYKP
PPEKRKCPPYTGMAQFVSNFAEPGDPEYAPPKPEVELPSQKRERIHKLRLEKGVEKAAED
LKKYDPNNDPNATGDPYKTLFVSRLNYESSESKIKREFESYGPIKRVGYSEHSLAGSVRI
CVMASLSRALCSICFILSTKVFQG*
>AT3G50670.2 | U1-70K (U1 SMALL NUCLEAR RIBONUCLEOPROTEIN-70K) RNA binding / nucleotide binding
MGDSGDPFLRNPNAAVQARAKVQNRANVLQLKLMGQSHPTGLTNNLLKLFEPRPPLEYKP
PPEKRKCPPYTGMAQFVSNFAEPGDPEYAPPKPEVELPSQKRERIHKLRLEKGVEKAAED
LKKYDPNNDPNATGDPYKTLFVSRLNYESSESKIKREFESYGPIKRVGYSEHSLAGSVRI
CVMASLSRALCSICFILSTKVFQG*
>AT2G47640.1 | small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.1 | small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.1 | small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.1 | small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.2 | small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.2 | small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.2 | small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.2 | small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.3 | small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.3 | small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.3 | small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.3 | small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.4 | small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.4 | small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.4 | small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.4 | small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT1G20580.1 | small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative
MSRSLGIPVKLLHEASGHIVTVELKSGELYRGSMIECEDNWNCQLEDITYTAKDGKVSQL
EHVFIRGSKVRFMVIPDILKHAPMFKRLDARIKGKSSSLGVGRGRGAMRGKPAAGPGRGT
GGRGAVPPVRR*
>AT2G03870.2 | small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative
MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFVRDHDDPLKT
TDQTRRLGLIVCRGTAVMLVSPTDGTEEIANPFVTAEAV*
>AT2G03870.2 | small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative
MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFVRDHDDPLKT
TDQTRRLGLIVCRGTAVMLVSPTDGTEEIANPFVTAEAV*
>AT2G03870.1 | small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative
MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFVRDHDDPLKT
TDQTRRLGLIVCRGTAVMLVSPTDGTEEIANPFVTAEAV*
>AT2G03870.1 | small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative
MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFVRDHDDPLKT
TDQTRRLGLIVCRGTAVMLVSPTDGTEEIANPFVTAEAV*
>AT4G30220.1 | RUXF (SMALL NUCLEAR RIBONUCLEOPROTEIN F)
MATIPVNPKPFLNNLTGKTVIVKLKWGMEYKGFLASVDSYMNLQLGNTEEYIDGQLTGNL
GEILIRCNNVLYVRGVPEDEELEDADQD*
>AT1G04510.1 | transducin family protein / WD-40 repeat family protein
MNCAISGEVPEEPVVSKKSGLLYEKRLIQTHISDYGKCPVTGEPHTLDDIVPIKTGKIVK
PKPLHTASIPGLLGTFQTEWDSLMLSNFALEQQLHTARQELSHALYQHDAACRVIARLKK
ERDESRQLLAEAERQLPAAPEVATSNAALSNGKRGIDDGEQGPNAKKMRLGISAEVITEL
TDCNAALSQQRKKRQIPKTLASVDALEKFTQLSSHPLHKTNKPGIFSMDILHSKDVIATG
GIDTTAVLFDRPSGQILSTLTGHSKKVTSIKFVGDTDLVLTASSDKTVRIWGCSEDGNYT
SRHTLKDHSAEVRAVTVHATNKYFVSASLDSTWCFYDLSSGLCLAQVTDASENDVNYTAA
AFHPDGLILGTGTAQSIVKIWDVKSQANVAKFGGHNGEITSISFSENGYFLATAALDGVR
LWDLRKLKNFRTFDFPDANSVEFDHSGSYLGIAASDIRVFQAASVKAEWNPIKTLPDLSG
TGKATSVKFGLDSKYIAVGSMDRNLRIFGLPDDDNTEDSAQDS*
>AT1G04510.1 | transducin family protein / WD-40 repeat family protein
MNCAISGEVPEEPVVSKKSGLLYEKRLIQTHISDYGKCPVTGEPHTLDDIVPIKTGKIVK
PKPLHTASIPGLLGTFQTEWDSLMLSNFALEQQLHTARQELSHALYQHDAACRVIARLKK
ERDESRQLLAEAERQLPAAPEVATSNAALSNGKRGIDDGEQGPNAKKMRLGISAEVITEL
TDCNAALSQQRKKRQIPKTLASVDALEKFTQLSSHPLHKTNKPGIFSMDILHSKDVIATG
GIDTTAVLFDRPSGQILSTLTGHSKKVTSIKFVGDTDLVLTASSDKTVRIWGCSEDGNYT
SRHTLKDHSAEVRAVTVHATNKYFVSASLDSTWCFYDLSSGLCLAQVTDASENDVNYTAA
AFHPDGLILGTGTAQSIVKIWDVKSQANVAKFGGHNGEITSISFSENGYFLATAALDGVR
LWDLRKLKNFRTFDFPDANSVEFDHSGSYLGIAASDIRVFQAASVKAEWNPIKTLPDLSG
TGKATSVKFGLDSKYIAVGSMDRNLRIFGLPDDDNTEDSAQDS*
>AT1G04510.2 | transducin family protein / WD-40 repeat family protein
MNCAISGEVPEEPVVSKKSGLLYEKRLIQTHISDYGKCPVTGEPHTLDDIVPIKTGKIVK
PKPLHTASIPGLLGTFQTEWDSLMLSNFALEQQLHTARQELSHALYQHDAACRVIARLKK
ERDESRQLLAEAERQLPAAPEVATSNAALSNGKRGIDDGEQGPNAKKMRLGISAEVITEL
TDCNAALSQQRKKRQIPKTLASVDALEKFTQLSSHPLHKTNKPGIFSMDILHSKDVIATG
GIDTTAVLFDRPSGQILSTLTGHSKKVTSIKFVGDTDLVLTASSDKTVRIWGCSEDGNYT
SRHTLKDHSAEVRAVTVHATNKYFVSASLDSTWCFYDLSSGLCLAQVTDASENDVNYTAA
AFHPDGLILGTGTAQSIVKIWDVKSQANVAKFGGHNGEITSISFSENGYFLATAALDGVR
LWDLRKLKNFRTFDFPDANSVEFDHSGSYLGIAASDIRVFQAASVKAEWNPIKTLPDLSG
TGKSTSVKFGLDSKYIAVGSMDRNLRIFGLPDDDNTEDSAQDS*
>AT1G04510.2 | transducin family protein / WD-40 repeat family protein
MNCAISGEVPEEPVVSKKSGLLYEKRLIQTHISDYGKCPVTGEPHTLDDIVPIKTGKIVK
PKPLHTASIPGLLGTFQTEWDSLMLSNFALEQQLHTARQELSHALYQHDAACRVIARLKK
ERDESRQLLAEAERQLPAAPEVATSNAALSNGKRGIDDGEQGPNAKKMRLGISAEVITEL
TDCNAALSQQRKKRQIPKTLASVDALEKFTQLSSHPLHKTNKPGIFSMDILHSKDVIATG
GIDTTAVLFDRPSGQILSTLTGHSKKVTSIKFVGDTDLVLTASSDKTVRIWGCSEDGNYT
SRHTLKDHSAEVRAVTVHATNKYFVSASLDSTWCFYDLSSGLCLAQVTDASENDVNYTAA
AFHPDGLILGTGTAQSIVKIWDVKSQANVAKFGGHNGEITSISFSENGYFLATAALDGVR
LWDLRKLKNFRTFDFPDANSVEFDHSGSYLGIAASDIRVFQAASVKAEWNPIKTLPDLSG
TGKSTSVKFGLDSKYIAVGSMDRNLRIFGLPDDDNTEDSAQDS*
>AT2G41500.1 | EMB2776 nucleotide binding
MEPNKDDNVSLAATAQISAPPVLQDASSLPGFSAIPPVVPPSFPPPMAPIPMMPHPPVAR
PPTFRPPVSQNGGVKTSDSDSESDDEHIEISEESKQVRERQEKALQDLLVKRRAAAMAVP
TNDKAVRDRLRRLGEPITLFGEQEMERRARLTQLLTRYDINGQLDKLVKDHEEDVTPKEE
VDDEVLEYPFFTEGPKELREARIEIAKFSVKRAAVRIQRAKRRRDDPDEDMDAETKWALK
HAKHMALDCSNFGDDRPLTGCSFSRDGKILATCSLSGVTKLWEMPQVTNTIAVLKDHKER
ATDVVFSPVDDCLATASADRTAKLWKTDGTLLQTFEGHLDRLARVAFHPSGKYLGTTSYD
KTWRLWDINTGAELLLQEGHSRSVYGIAFQQDGALAASCGLDSLARVWDLRTGRSILVFQ
GHIKPVFSVNFSPNGYHLASGGEDNQCRIWDLRMRKSLYIIPAHANLVSQVKYEPQEGYF
LATASYDMKVNIWSGRDFSLVKSLAGHESKVASLDITADSSCIATVSHDRTIKLWTSSGN
DDEDEEKETMDIDL*
>AT1G80070.1 | SUS2 (ABNORMAL SUSPENSOR 2)
MWNNNDGMPLAPPGTGGSMMPPPPAAHPSYTALPPPSNPTPPVEPTPEEAEAKLEEKARK
WMQLNSKRYGDKRKFGFVETQKEDMPPEHVRKIIRLVFFSSFSTISKYSLLDNYFLARDH
GDMSSKKFRHDKRVYLGALKFVPHAVFKLLENMPMPWEQVRDVKVLYHITGAITFVNEIP
WVVEPIYMAQWGTMWIMMRREKRDRRHFKRMRFPPFDDEEPPLDYADNLLDVDPLEPIQL
ELDEEEDSAVHTWFYDHKPLVKTKLINGPSYRRWNLSLPIMATLHRLAGQLLSDLIDRNY
FYLFDMPSFFTAKALNMCIPGGPKFEPLYRDMEKGDEDWNEFNDINKLIIRSPLRTEYRI
AFPHLYNNRPRKVKLCVYHSPMIMYIKTEDPDLPAFYYDPLIHPISNTNKEKRERKVYDD
EDDFALPEGVEPLLRDTQLYTDTTAAGISLLFAPRPFNMRSGRTRRAEDIPLVSEWFKEH
CPPAYPVKVRVSYQKLLKCYVLNELHHRPPKAQKKKHLFRSLAATKFFQSTELDWVEVGL
QVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVKTLTTKERKKSRFGNAFHLCREILRLT
KLVVDANVQFRLGNVDAFQLADGLQYIFSHVGQLTGMYRYKYRLMRQIRMCKDLKHLIYY
RFNTGPVGKGPGCGFWAPMWRVWLFFLRGIVPLLERWLGNLLARQFEGRHSKGVAKTVTK
QRVESHFDLELRAAVMHDVLDAMPEGIKQNKARTILQHLSEAWRCWKANIPWKVPGLPVP
IENMILRYVKSKADWWTNVAHYNRERIRRGATVDKTVCRKNLGRLTRLWLKAEQERQHNY
LKDGPYVTPEEALAIYTTTVHWLESRKFSPIPFPPLSYKHDTKLLILALERLKESYSVAV
RLNQQQREELGLIEQAYDNPHEALSRIKRHLLTQRGFKEVGIEFMDLYSYLIPVYEIEPL
EKITDAYLDQYLWYEGDKRHLFPNWIKPADSEPPPLLVYKWCQGINNLQGIWDTGDGQCV
VMLQTKFEKFFEKIDLTMLNRLLRLVLDHNIADYVSAKNNVVLSYKDMSHTNSYGLIRGL
QFASFVVQFYGLLLDLLLLGLTRASEIAGPPQMPNEFMTFWDTKVETRHPIRLYSRYIDK
VHIMFKFTHEEARDLIQRYLTEHPDPNNENMVGYNNKKCWPRDARMRLMKHDVNLGRSVF
WDMKNRLPRSITTLEWENGFVSVYSKDNPNLLFSMCGFEVRILPKIRMTQEAFSNTKDGV
WNLQNEQTKERTAVAFLRVDDEHMKVFENRVRQILMSSGSTTFTKIVNKWNTALIGLMTY
FREATVHTQELLDLLVKCENKIQTRIKIGLNSKMPSRFPPVIFYTPKEIGGLGMLSMGHI
LIPQSDLRYSKQTDVGVTHFRSGMSHEEDQLIPNLYRYIQPWESEFIDSQRVWAEYALKR
QEAQAQNRRLTLEDLEDSWDRGIPRINTLFQKDRHTLAYDKGWRVRTDFKQYQVLKQNPF
WWTHQRHDGKLWNLNNYRTDVIQALGGVEGILEHTLFKGTYFPTWEGLFWEKASGFEESM
KYKKLTNAQRSGLNQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGIFMHGKIPTLKISL
IQIFRAHLWQKIHESVVMDLCQVLDQELDALEIETVQKETIHPRKSYKMNSSCADVLLFA
AHKWPMSKPSLVAESKDMFDQKASNKYWIDVQLRWGDYDSHDIERYTRAKFMDYTTDNMS
IYPSPTGVMIGLDLAYNLHSAFGNWFPGSKPLLAQAMNKIMKSNPALYVLRERIRKGLQL
YSSEPTEPYLSSQNYGEIFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPR
TGQLFLKVIHTSVWAGQKRLGQLAKWKTAEEVAALVRSLPVEEQPKQIIVTRKGMLDPLE
VHLLDFPNIVIKGSELQLPFQACLKIEKFGDLILKATEPQMVLFNIYDDWLKSISSYTAF
SRLILILRALHVNNEKAKMLLKPDKSVVTEPHHIWPSLTDDQWMKVEVALRDLILSDYAK
KNNVNTSALTQSEIRDIILGAEITPPSQQRQQIAEIEKQAKEASQLTAVTTRTTNVHGDE
LIVTTTSPYEQSAFGSKTDWRVRAISATNLYLRVNHIYVNSDDIKETGYTYIMPKNILKK
FICVADLRTQIAGYLYGISPPDNPQVKEIRCVVMVPQWGNHQLVHLPSSLPEHDFLNDLE
PLGWLHTQPNELPQLSPQDVTSHSRILENNKQWDGEKCIILTCSFTPGSCSLTSYKLTQT
GYEWGRLNKDNGSNPHGYLPTHYEKVQMLLSDRFLGFYMVPESGPWNYSFTGVKHTLSMK
YSVKLGSPKEFYHEEHRPTHFLEFSNMEEADITEGDREDTFT*
>AT3G55200.1 | splicing factor putative
MYLYSLTLQQATGIVCAINGNFSGGKTQEIAVARGKILDLLRPDENGKIQTIHSVEVFGA
IRSLAQFRLTGAQKDYIVVGSDSGRIVILEYNKEKNVFDKVHQETFGKSGCRRIVPGQYV
AVDPKGRAVMIGACEKQKLVYVLNRDTTARLTISSPLEAHKSHTICYSLCGVDCGFDNPI
FAAIELDYSEADQDPTGQAASEAQKHLTFYELDLGLNHVSRKWSNPVDNGANMLVTVPGG
ADGPSGVLVCAENFVIYMNQGHPDVRAVIPRRTDLPAERGVLVVSAAVHKQKTMFFFLIQ
TEYGDVFKVTLDHNGDHVSELKVKYFDTIPVASSICVLKLGFLFSASEFGNHGLYQFQAI
GEEPDVESSSSNLMETEEGFQPVFFQPRRLKNLVRIDQVESLMPLMDMKVLNIFEEETPQ
IFSLCGRGPRSSLRILRPGLAITEMAVSQLPGQPSAVWTVKKNVSDEFDAYIVVSFTNAT
LVLSIGEQVEEVNDSGFLDTTPSLAVSLIGDDSLMQVHPNGIRHIREDGRINEWRTPGKR
SIVKVGYNRLQVVIALSGGELIYFEADMTGQLMEVEKHEMSGDVACLDIAPVPEGRKRSR
FLAVGSYDNTVRILSLDPDDCLQILSVQSVSSAPESLLFLEVQASIGGDDGADHPANLFL
NSGLQNGVLFRTVVDMVTGQLSDSRSRFLGLKPPKLFSISVRGRSAMLCLSSRPWLGYIH
RGHFHLTPLSYETLEFAAPFSSDQCAEGVVSVAGDALRIFMIDRLGETFNETVVPLRYTP
RKFVLHPKRKLLVIIESDQGAFTAEEREAARKECFEAGGVGENGNGNADQMENGADDEDK
EDPLSDEQYGYPKAESEKWVSCIRVLDPKTATTTCLLELQDNEAAYSVCTVNFHDKEYGT
LLAVGTVKGMQFWPKKNLVAGFIHIYRFVEDGKSLELLHKTQVEGVPLALCQFQGRLLAG
IGPVLRLYDLGKKRLLRKCENKLFPNTIISIQTYRDRIYVGDIQESFHYCKYRRDENQLY
IFADDCVPRWLTASHHVDFDTMAGADKFGNVYFVRLPQDLSEEIEEDPTGGKIKWEQGKL
NGAPNKVDEIVQFHVGDVVTCLQKASMIPGGSESIMYGTVMGSIGALHAFTSRDDVDFFS
HLEMHMRQEYPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLPMDLQRKIADELDRTPA
EILKKLEDARNKII*
>AT1G61040.1 | VIP5 (vernalization independence 5) DNA binding
MGDLENLLLEAAGRTNSAGRSRHPPSSRRREGSYSDGSSDSRDDSDEDRGYASRKPSGSQ
VPLKKRLEAEREDRAARVEGGYGDGPSDREGDSSEESDFGDDLYKNEEDRQKLAGMTEFQ
REMILSERADKKGDKNFTEKLRSKRESEKTPVSKKETQPLPASRGVRSSARSADRAAAKD
DALNELRAKRMKQQDPAALRKLRDASKGGSGSRDFSSTKRKPLASSNLSSSSQSDSDSRS
QSDDEGSNGGMLDSDDDRSDVPTFEDVKEVTIRRSKLAKWLMEPFFEELIVGCFVRVGIG
RSKSGPIYRLCWVKNVDATDPDKTYKLENKTTHKYLNVVWGNETSAARWQMAMISDGHPL
EEEYRQWIREVERTNGRMPTKQDISEKKEAIQRTNSFVYSAETVKQMLQEKKSASVRPMN
VAAEKDRLRKELEIAQSKNDEAGVERIKSKIKQLDASRNKKGVDKKALKLAEMNKKNRAE
NFKNASEVKSITASLKAGEAGYDPFSRRWTRSSNYYNGKNKGKDGEENEAAVAAAVETNG
ADAGAGVEATEAALEAAAEAGKLIDTRAPIGQGAEHNQLHNFELSLSLTALQKYGGPQGV
QKAFMARKQLTEATVGCRVAENDGKRHGLTLTVSDYKRRRGLL*
>AT4G03430.1 | EMB2770 (EMBRYO DEFECTIVE 2770) RNA splicing factor transesterification mechanism
MVFLSIPNGKTLSIDVNPNSTTISAFEQLAHQRSDVPQSFLRYSLRMRNPSRVFVDSKDS
DSILLSDLGVSRFSTVIIHVLLLGGMQAAPPKPRLDFLNSKPPSNYVAGLGRGATGFTTR
SDIGPARAAPDLPDRSALATAAAPGVGRGAGKPSEAEAEDDEEAEEKRYDENQTFDEFEG
NDVGLFANAEYDEDDKEADAIWESIDQRMDSRRKDRREAKLKEEIEKYRASNPKITEQFA
DLKRKLHTLSADEWDSIPEIGDYSLRNKKKKFESFVPIPDTLLEKAKKEKELVMALDPKS
RAAGGSETPWGQTPVTDLTAVGEGRGTVLSLKLDNLSDSVSGQTVVDPKGYLTDLKSMKR
TTDEEIYDRNRARLLYKSLTQSNPKNPNGWIAAARVEEVDGKIKAARFQIQRGCEECPKN
EDVWLEACRLANPEDAKGVIAKGVKLIPNSVKLWLEAAKLEHDVENKSRVLRKGLEHIPD
SVRLWKAVVELANEEDARILLHRAVECCPLHLELWVALARLETYAESKKVLNKAREKLPK
EPAIWITAAKLEEANGKLDEANDNTAMVGKIIDRGIKTLQREGVVIDRENWMSEAEACER
VGSVATCQAIIKNTIGIGVEEEDRKRTWVADADECKKRGSIETARAIYAHALSVFLTKKS
IWLKAAQLEKSHGSRESLDALLRKAVTYVPQAEVLWLMGAKEKWLAGDVPAARAILQEAY
AAIPNSEEIWLAAFKLEFENKEPERARMLLAKARERGGTERVWMKSAIVERELGNVEEER
RLLNEGLKQFPTFFKLWLMLGQLEERFKHLEQARKAYDTGLKHCPHCIPLWLSLADLEEK
VNGLNKARAILTTARKKNPGGAELWLAAIRAELRHDNKREAEHLMSKALQDCPKSGILWA
ADIEMAPRPRRKTKSIDAMKKCDRDPHVTIAVAKLFWQDKKVEKARAWFERAVTVGPDIG
DFWALFYKFELQHGSDEDRKEVVAKCVACEPKHGEKWQAISKAVENAHQPIEVILKRVVN
ALSKEENSA*
>AT2G32160.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 325 Blast hits to 315 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 70 (source NCBI BLink)
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKKQ*
>AT2G32160.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 327 Blast hits to 317 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink)
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKKQ*
>AT2G32160.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 328 Blast hits to 321 proteins in 142 species Archae - 0 Bacteria - 0 Metazoa - 99 Fungi - 132 Plants - 25 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink)
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKKQ*
>AT2G32160.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 325 Blast hits to 315 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 70 (source NCBI BLink)
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKEKTIETT
YSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 327 Blast hits to 317 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink)
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKEKTIETT
YSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 328 Blast hits to 321 proteins in 142 species Archae - 0 Bacteria - 0 Metazoa - 99 Fungi - 132 Plants - 25 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink)
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKEKTIETT
YSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 325 Blast hits to 315 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 70 (source NCBI BLink)
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSVPF
SSHDWLDSSLQAHVPLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRH
KESTRPACLVPGAGLGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPW
IHTNCNSLSDDDQLRPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCF
FIDTAHNIIEYIETISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASH
YGFEMEKEKTIETTYSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 327 Blast hits to 317 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink)
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSVPF
SSHDWLDSSLQAHVPLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRH
KESTRPACLVPGAGLGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPW
IHTNCNSLSDDDQLRPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCF
FIDTAHNIIEYIETISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASH
YGFEMEKEKTIETTYSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 328 Blast hits to 321 proteins in 142 species Archae - 0 Bacteria - 0 Metazoa - 99 Fungi - 132 Plants - 25 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink)
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSVPF
SSHDWLDSSLQAHVPLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRH
KESTRPACLVPGAGLGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPW
IHTNCNSLSDDDQLRPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCF
FIDTAHNIIEYIETISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASH
YGFEMEKEKTIETTYSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT1G60170.1 | emb1220 (embryo defective 1220)
MATLEDSFLADLDELSDNEAELDENDGDVGKEEEDVDMDMADLETLNYDDLDNVSKLQKS
QRYADIMHKVEEALGKDSDGAEKGTVLEDDPEYKLIVDCNQLSVDIENEIVIVHNFIKDK
YKLKFQELESLVHHPIDYACVVKKIGNETDLALVDLADLLPSAIIMVVSVTALTTKGSAL
PEDVLQKVLEACDRALDLDSARKKVLEFVESKMGSIAPNLSAIVGSAVAAKLMGTAGGLS
ALAKMPACNVQVLGHKRKNLAGFSSATSQSRVGYLEQTEIYQSTPPGLQARAGRLVAAKS
TLAARVDATRGDPLGISGKAFREEIRKKIEKWQEPPPARQPKPLPVPDSEPKKRRGGRRL
RKMKERYQVTDMRKLANRMAFGTPEESSLGDGLGEGYGMLGQAGSNRLRVSSVPSKLKIN
AKVAKKLKERQYAGGATTSGLTSSLAFTPVQGIELCNPQQALGLGSGTQSTYFSESGTFS
KLKKI*
>AT1G44910.1 | protein binding
MANNPPQSSGTQFRPMVPGQQGQHFVPAASQPFHPYGHVPPNVQSQPPQYSQPIQQQQLF
PVRPGQPVHITSSSQAVSVPYIQTNKILTSGSTQPQPNAPPMTGFATSGPPFSSPYTFVP
SSYPQQQPTSLVQPNSQMHVAGVPPAANTWPVPVNQSTSLVSPVQQTGQQTPVAVSTDPG
NLTPQSASDWQEHTSADGRKYYYNKRTKQSNWEKPLELMTPLERADASTVWKEFTTPEGK
KYYYNKVTKESKWTIPEDLKLAREQAQLASEKTSLSEAGSTPLSHHAASSSDLAVSTVTS
VVPSTSSALTGHSSSPIQAGLAVPVTRPPSVAPVTPTSGAISDTEATTIKGDNLSSRGAD
DSNDGATAQNNEAENKEMSVNGKANLSPAGDKANVEEPMVYATKQEAKAAFKSLLESVNV
HSDWTWEQTLKEIVHDKRYGALRTLGERKQAFNEYLGQRKKVEAEERRRRQKKAREEFVK
MLEECEELSSSLKWSKAMSLFENDQRFKAVDRPRDREDLFDNYIVELERKEREKAAEEHR
QYMADYRKFLETCDYIKAGTQWRKIQDRLEDDDRCSCLEKIDRLIGFEEYILDLEKEEEE
LKRVEKEHVRRAERKNRDAFRTLLEEHVAAGILTAKTYWLDYCIELKDLPQYQAVASNTS
GSTPKDLFEDVTEELEKQYHEDKSYVKDAMKSRKISMVSSWLFEDFKSAISEDLSTQQIS
DINLKLIYDDLVGRVKEKEEKEARKLQRLAEEFTNLLHTFKEITVASNWEDSKQLVEESQ
EYRSIGDESVSQGLFEEYITSLQEKAKEKERKRDEEKVRKEKERDEKEKRKDKDKERREK
EREREKEKGKERSKREESDGETAMDVSEGHKDEKRKGKDRDRKHRRRHHNNSDEDVSSDR
DDRDESKKSSRKHGNDRKKSRKHANSPESESENRHKRQKKESSRRSGNDELEDGEVGE*
>AT1G44910.1 | protein binding
MANNPPQSSGTQFRPMVPGQQGQHFVPAASQPFHPYGHVPPNVQSQPPQYSQPIQQQQLF
PVRPGQPVHITSSSQAVSVPYIQTNKILTSGSTQPQPNAPPMTGFATSGPPFSSPYTFVP
SSYPQQQPTSLVQPNSQMHVAGVPPAANTWPVPVNQSTSLVSPVQQTGQQTPVAVSTDPG
NLTPQSASDWQEHTSADGRKYYYNKRTKQSNWEKPLELMTPLERADASTVWKEFTTPEGK
KYYYNKVTKESKWTIPEDLKLAREQAQLASEKTSLSEAGSTPLSHHAASSSDLAVSTVTS
VVPSTSSALTGHSSSPIQAGLAVPVTRPPSVAPVTPTSGAISDTEATTIKGDNLSSRGAD
DSNDGATAQNNEAENKEMSVNGKANLSPAGDKANVEEPMVYATKQEAKAAFKSLLESVNV
HSDWTWEQTLKEIVHDKRYGALRTLGERKQAFNEYLGQRKKVEAEERRRRQKKAREEFVK
MLEECEELSSSLKWSKAMSLFENDQRFKAVDRPRDREDLFDNYIVELERKEREKAAEEHR
QYMADYRKFLETCDYIKAGTQWRKIQDRLEDDDRCSCLEKIDRLIGFEEYILDLEKEEEE
LKRVEKEHVRRAERKNRDAFRTLLEEHVAAGILTAKTYWLDYCIELKDLPQYQAVASNTS
GSTPKDLFEDVTEELEKQYHEDKSYVKDAMKSRKISMVSSWLFEDFKSAISEDLSTQQIS
DINLKLIYDDLVGRVKEKEEKEARKLQRLAEEFTNLLHTFKEITVASNWEDSKQLVEESQ
EYRSIGDESVSQGLFEEYITSLQEKAKEKERKRDEEKVRKEKERDEKEKRKDKDKERREK
EREREKEKGKERSKREESDGETAMDVSEGHKDEKRKGKDRDRKHRRRHHNNSDEDVSSDR
DDRDESKKSSRKHGNDRKKSRKHANSPESESENRHKRQKKESSRRSGNDELEDGEVGE*
>AT1G44910.2 | protein binding
MANNPPQSSGTQFRPMVPGQQGQHFVPAASQPFHPYGHVPPNVQSQPPQYSQPIQQQQLF
PVRPGQPVHITSSSQAVSVPYIQTNKILTSGSTQPQPNAPPMTGFATSGPPFSSPYTFVP
SSYPQQQPTSLVQPNSQMHVAGVPPAANTWPVPVNQSTSLVSPVQQTGQQTPVAVSTDPG
NLTPQSASDWQEHTSADGRKYYYNKRTKQSNWEKPLELMTPLERADASTVWKEFTTPEGK
KYYYNKVTKESKWTIPEDLKLAREQAQLASEKTSLSEAGSTPLSHHAASSSDLAVSTVTS
VVPSTSSALTGHSSSPIQAGLAVPVTRPPSVAPVTPTSGAISDTEATTIKGDNLSSRGAD
DSNDGATAQNNEAENKEMSVNGKANLSPAGDKANVEEPMVYATKQEAKAAFKSLLESVNV
HSDWTWEQTLKEIVHDKRYGALRTLGERKQAFNEYLGQRKKVEAEERRRRQKKAREEFVK
MLEECEELSSSLKWSKAMSLFENDQRFKAVDRPRDREDLFDNYIVELERKEREKAAEEHR
QYMADYRKFLETCDYIKAGTQWRKIQDRLEDDDRCSCLEKIDRLIGFEEYILDLEKEEEE
LKRVEKEHVRRAERKNRDAFRTLLEEHVAAGILTAKTYWLDYCIELKDLPQYQAVASNTS
GSTPKDLFEDVTEELEKQYHEDKSYVKDAMKSRKISMVSSWLFEDFKSAISEDLSTQQIS
DINLKLIYDDLVGRVKEKEEKEARKLQRLAEEFTNLLHTFKEITVASNWEDSKQLVEESQ
EYRSIGDESVSQGLFEEYITSLQEKAKEKERKRDEEKVRKEKERDEKEKRKDKDKERREK
EREREKEKGKERSKREESDGETAMDVSEGHKDEKRKGKDRDRKHRRRHHNNSDEDVSSDR
DDRDESKKSSRKHGNDRKKSRKVGTP*
>AT1G44910.2 | protein binding
MANNPPQSSGTQFRPMVPGQQGQHFVPAASQPFHPYGHVPPNVQSQPPQYSQPIQQQQLF
PVRPGQPVHITSSSQAVSVPYIQTNKILTSGSTQPQPNAPPMTGFATSGPPFSSPYTFVP
SSYPQQQPTSLVQPNSQMHVAGVPPAANTWPVPVNQSTSLVSPVQQTGQQTPVAVSTDPG
NLTPQSASDWQEHTSADGRKYYYNKRTKQSNWEKPLELMTPLERADASTVWKEFTTPEGK
KYYYNKVTKESKWTIPEDLKLAREQAQLASEKTSLSEAGSTPLSHHAASSSDLAVSTVTS
VVPSTSSALTGHSSSPIQAGLAVPVTRPPSVAPVTPTSGAISDTEATTIKGDNLSSRGAD
DSNDGATAQNNEAENKEMSVNGKANLSPAGDKANVEEPMVYATKQEAKAAFKSLLESVNV
HSDWTWEQTLKEIVHDKRYGALRTLGERKQAFNEYLGQRKKVEAEERRRRQKKAREEFVK
MLEECEELSSSLKWSKAMSLFENDQRFKAVDRPRDREDLFDNYIVELERKEREKAAEEHR
QYMADYRKFLETCDYIKAGTQWRKIQDRLEDDDRCSCLEKIDRLIGFEEYILDLEKEEEE
LKRVEKEHVRRAERKNRDAFRTLLEEHVAAGILTAKTYWLDYCIELKDLPQYQAVASNTS
GSTPKDLFEDVTEELEKQYHEDKSYVKDAMKSRKISMVSSWLFEDFKSAISEDLSTQQIS
DINLKLIYDDLVGRVKEKEEKEARKLQRLAEEFTNLLHTFKEITVASNWEDSKQLVEESQ
EYRSIGDESVSQGLFEEYITSLQEKAKEKERKRDEEKVRKEKERDEKEKRKDKDKERREK
EREREKEKGKERSKREESDGETAMDVSEGHKDEKRKGKDRDRKHRRRHHNNSDEDVSSDR
DDRDESKKSSRKHGNDRKKSRKVGTP*
>AT1G07400.1 | 178 kDa class I heat shock protein (HSP178-CI)
MSLIPSFFGNNRRSNSIFDPFSLDVWDPFKELQFPSSLSGETSAITNARVDWKETAEAHV
FKADLPGMKKEEVKVEIEDDSVLKISGERHVEKEEKQDTWHRVERSSGQFSRKFKLPENV
KMDQVKASMENGVLTVTVPKVEEAKKKAQVKSIDISG*
>AT5G06160.1 | ATO (ATROPOS) nucleic acid binding / zinc ion binding
MSSTLLEQTRSNHEEVERLERLVVEDLQKEPPSSKDRLVQGHRVRHMIESIMLTTEKLVE
TYEDKDGAWDDEIAALGGQTATGTNVFSEFYDRLKEIREYHKRHPSGRLVDANEDYEARL
KEEPIIAFSGEEGNGRYLDLHDMYNQYINSKFGERVEYSAYLDVFSQPEKIPRKLKLSRQ
YMKYMEALLEYLVYFFQRTEPLQDLDRILSKVCSDFEEQYADGIVEGLDNELIPSQHTVI
DLDYYSTVEELVDVGPEKLKEALGALGLKVGGTPQQRAERLFLTKHTPLEKLDKKHFARP
PHNGKQNGDAKSTHESENAKEIALTEAKVKKLCNLLDETIERTKQNIVKKQSLTYEEMEG
EREGEEANTELESDDEDGLIYNPLKLPIGWDGKPIPYWLYKLHGLGQEFKCEICGNYSYW
GRRAFERHFKEWRHQHGMRCLGIPNTKNFNEITSIEEAKELWKRIQERQGVNKWRPELEE
EYEDREGNIYNKKTYSDLQRQGLI*
>AT4G31790.1 | diphthine synthase putative (DPH5)
MLYIIGLGLGDEKDITLRGLEAVKKSQKVYMEAYTSLLSFGLSADGLSNLEKFYGKPIIL
ADREMVEEKAGDMIDEAIDNDVAFLVVGDPFGATTHSDLVVRAKTLGVKVEVVHNASVMN
AVGICGLQLYHYGETVSIPFFTETWRPDSFYEKIKKNRSLGLHTLCLLDIRVKEPTFESL
CRGGKKQYEPPRYMSVNTAIEQLLEVEQKHGDSVYGEDTQCVGFARLGSEDQTIVAGTMK
QLESVDFGAPLHCLVIVGETHPVEEEMLEFYKYKSGN*
>AT4G31790.1 | diphthine synthase putative (DPH5)
MLYIIGLGLGDEKDITLRGLEAVKKSQKVYMEAYTSLLSFGLSADGLSNLEKFYGKPIIL
ADREMVEEKAGDMIDEAIDNDVAFLVVGDPFGATTHSDLVVRAKTLGVKVEVVHNASVMN
AVGICGLQLYHYGETVSIPFFTETWRPDSFYEKIKKNRSLGLHTLCLLDIRVKEPTFESL
CRGGKKQYEPPRYMSVNTAIEQLLEVEQKHGDSVYGEDTQCVGFARLGSEDQTIVAGTMK
QLESVDFGAPLHCLVIVGETHPVEEEMLEFYKYKSGN*
>AT4G31790.2 | diphthine synthase putative (DPH5)
MLYIIGLGLGDEKDITLRGLEAVKKSQKVYMEAYTSLLSFGLSADGLSNLEKFYGKPIIL
ADREMVEEKAGDMIDEAIDNDVAFLVVGDPFGATTHSDLVVRAKTLGVKVEVVHNASVMN
AVGICGLQLYHYGETVSIPFFTETWRPDSFYEKIKKNRSLGLHTLCLLDIRVKEPTFESL
CRGGKKQYEPPRYMSVNTAIEQLLEVEQKHGDSVYGEDTQCVGFARLGSEDQTIVAGTMK
QLESVDFGAPLHCLVIVGETHPVEEEMLEFYKYKSGN*
>AT4G31790.2 | diphthine synthase putative (DPH5)
MLYIIGLGLGDEKDITLRGLEAVKKSQKVYMEAYTSLLSFGLSADGLSNLEKFYGKPIIL
ADREMVEEKAGDMIDEAIDNDVAFLVVGDPFGATTHSDLVVRAKTLGVKVEVVHNASVMN
AVGICGLQLYHYGETVSIPFFTETWRPDSFYEKIKKNRSLGLHTLCLLDIRVKEPTFESL
CRGGKKQYEPPRYMSVNTAIEQLLEVEQKHGDSVYGEDTQCVGFARLGSEDQTIVAGTMK
QLESVDFGAPLHCLVIVGETHPVEEEMLEFYKYKSGN*