>AT5G48870.1 |  SAD1 (SUPERSENSITIVE TO ABA AND DROUGHT 1) RNA binding 
MANNPSQLLPSELIDRCIGSKIWVIMKGDKELVGILKGFDVYVNMVLEDVTEYEITAEGR 
RVTKLDQILLNGNNIAILVPGGSPEDGE*
>AT4G31700.1 |  RPS6 (RIBOSOMAL PROTEIN S6) structural constituent of ribosome 
MKFNVANPTTGCQKKLEIDDDQKLRAFYDKRISQEVSGDALGEEFKGYVFKIKGGCDKQG 
FPMKQGVLTPGRVRLLLHRGTPCFRGHGRRTGERRRKSVRGCIVSPDLSVLNLVIVKKGE 
NDLPGLTDTEKPRMRGPKRASKIRKLFNLKKEDDVRTYVNTYRRKFTNKKGKEVSKAPKI 
QRLVTPLTLQRKRARIADKKKKIAKANSDAADYQKLLASRLKEQRDRRSESLAKKRSRLS 
SAAAKPSVTA*
>AT4G31700.2 |  RPS6 (RIBOSOMAL PROTEIN S6) structural constituent of ribosome 
MKQGVLTPGRVRLLLHRGTPCFRGHGRRTGERRRKSVRGCIVSPDLSVLNLVIVKKGEND 
LPGLTDTEKPRMRGPKRASKIRKLFNLKKEDDVRTYVNTYRRKFTNKKGKEVSKAPKIQR 
LVTPLTLQRKRARIADKKKKIAKANSDAADYQKLLASRLKEQRDRRSESLAKKRSRLSSA 
AAKPSVTA*
>AT5G22440.1 |  60S ribosomal protein L10A (RPL10aC) 
MSKLQSEAVREAISSIITHCKETKPRNFTETIELQIGLKNYDPQKDKRFSGSVKLPHVPR 
PKMKICMLGDAQHVEEAEKIGLESMDVEALKKLNKNKKLVKKLAKKFHAFLASESVIKQI 
PRLLGPGLNKAGKFPTLVSHQESLESKVNETKATVKFQLKKVLCMGVAVGNLSMEEKQIF 
QNVQMSVNFLVSLLKKNWQNVRCLYLKSTMGPPNRVF*
>AT5G22440.2 |  60S ribosomal protein L10A (RPL10aC) 
MSKLQSEAVREAISSIITHCKETKPRNFTETIELQIGLKNYDPQKDKRFSGSVKLPHVPR 
PKMKICMLGDAQHVEEAEKIGLESMDVEALKKLNKNKKLVKKLAKKFHAFLASESVIKQI 
PRLLGPGLNKAGKFPTLVSHQESLESKVNETKATVKFQLKKVLCMGVAVGNLSMEEKQIF 
QNVQMSVNFLVSLLKKNWQNVRCLYLKSTMGPPNRVF*
>AT1G20960.1 |  emb1507 (embryo defective 1507) ATP binding / ATP-dependent helicase/ helicase/ nucleic acid binding / nucleoside-triphosphatase/ nucleotide binding 
MANLGGGAEAHARFKQYEYRANSSLVLTTDNRPRDTHEPTGEPETLWGKIDPRSFGDRVA 
KGRPQELEDKLKKSKKKERDVVDDMVNIRQSKRRRLREESVLTDTDDAVYQPKTKETRAA 
YEAMLGLIQKQLGGQPPSIVSGAADEILAVLKNDAFRNPEKKMEIEKLLNKIENHEFDQL 
VSIGKLITDFQEGGDSGGGRANDDEGLDDDLGVAVEFEENEEDDEESDPDMVEEDDDEED 
DEPTRTGGMQVDAGINDEDAGDANEGTNLNVQDIDAYWLQRKISQAYEQQIDPQQCQVLA 
EELLKILAEGDDRVVEDKLLMHLQYEKFSLVKFLLRNRLKVVWCTRLARAEDQEERNRIE 
EEMRGLGPELTAIVEQLHATRATAKEREENLQKSINEEARRLKDETGGDGGRGRRDVADR 
DSESGWVKGQRQMLDLESLAFDQGGLLMANKKCDLPPGSYRSHGKGYDEVHVPWVSKKVD 
RNEKLVKITEMPDWAQPAFKGMQQLNRVQSKVYDTALFKAENILLCAPTGAGKTNVAMLT 
ILQQLEMNRNTDGTYNHGDYKIVYVAPMKALVAEVVGNLSNRLKDYGVIVRELSGDQSLT 
GREIEETQIIVTTPEKWDIITRKSGDRTYTQLVRLLIIDEIHLLHDNRGPVLESIVARTL 
RQIETTKENIRLVGLSATLPNYEDVALFLRVDLKKGLFKFDRSYRPVPLHQQYIGISVKK 
PLQRFQLMNDLCYQKVLAGAGKHQVLIFVHSRKETSKTARAIRDTAMANDTLSRFLKEDS 
VTRDVLHSHEDIVKNSDLKDILPYGFAIHHAGLSRGDREIVETLFSQGHVQVLVSTATLA 
WGVNLPAHTVIIKGTQVYNPEKGAWMELSPLDVMQMLGRAGRPQYDQHGEGIIITGYSEL 
QYYLSLMNEQLPIESQFISKLADQLNAEIVLGTVQNAREACHWLGYTYLYIRMVRNPTLY 
GLAPDALAKDVVLEERRADLIHSAATILDKNNLVKYDRKSGYFQVTDLGRIASYYYITHG 
TIATYNEHLKPTMGDIDLYRLFSLSDEFKYVTVRQDEKMELAKLLDRVPIPIKETLEEPS 
AKINVLLQAYISQLKLEGLSLTSDMVYITQSAGRLVRALYEIVLKRGWAQLAEKALNLSK 
MVGKRMWSVQTPLRQFHGLSNDILMQLEKKDLVWERYYDLSAQELGELIRSPKMGKPLHK 
FIHQFPKVTLSAHVQPITRTVLNVELTVTPDFLWDEKIHKYVEPFWIIVEDNDGEKILHH 
EYFLLKKQYIDEDHTLHFTVPIFEPLPPQYFVRVVSDKWLGSETVLPVSFRHLILPEKYP 
PPTELLDLQPLPVTALRNPNYEILYQDFKHFNPVQTQVFTVLYNTNDNVLVAAPTGSGKT 
ICAEFAILRNHHEGPDATMRVVYIAPLEAIAKEQFRIWEGKFGKGLGLRVVELTGETALD 
LKLLEKGQIIISTPEKWDALSRRWKQRKYVQQVSLFIVDELHLIGGQHGPVLEVIVSRMR 
YISSQVINKIRIVALSTSLANAKDLGEWIGASSHGLFNFPPGVRPVPLEIHIQGVDISSF 
EARMQAMTKPTYTAIVQHAKNKKPAIVFVPTRKHVRLTAVDLMAYSHMDNPQSPDFLLGK 
LEELDPFVEQIREETLKETLCHGIGYLHEGLSSLDQEIVTQLFEAGRIQVCVMSSSLCWG 
TPLTAHLVVVMGTQYYDGRENSHSDYPVPDLLQMMGRASRPLLDNAGKCVIFCHAPRKEY 
YKKFLYEAFPVESQLQHFLHDNFNAEVVAGVIENKQDAVDYLTWTFMYRRLPQNPNYYNL 
QGVSHRHLSDHLSELVENTLSDLEASKCIEVEDEMELSPLNLGMIASYYYISYTTIERFS 
SLLSSKTKMKGLLEILTSASEYDMIPIRPGEEDTVRRLINHQRFSFENPKCTDPHVKANA 
LLQAHFSRQNIGGNLAMDQRDVLLSATRLLQAMVDVISSNGWLNLALLAMEVSQMVTQGM 
WERDSMLLQLPHFTKDLAKRCQENPGKNIETVFDLVEMEDEERQELLKMSDAQLLDIARF 
CNRFPNIDLTYEIVGSEEVNPGKEVTLQVMLERDMEGRTEVGPVDSLRYPKTKEEGWWLV 
VGDTKTNQLLAIKRVSLQRKVKVKLDFTAPSEPGEKSYTLYFMCDSYLGCDQEYSFSVDV 
KGSGAGDRMEE*
>AT4G37930.1 |  SHM1 (SERINE TRANSHYDROXYMETHYLTRANSFERASE 1) glycine hydroxymethyltransferase/ poly(U) binding 
MAMAMALRRLSSSIDKPIRPLIRSTSCYMSSLPSEAVDEKERSRVTWPKQLNAPLEEVDP 
EIADIIEHEKARQWKGLELIPSENFTSVSVMQAVGSVMTNKYSEGYPGARYYGGNEYIDM 
AETLCQKRALEAFRLDPEKWGVNVQPLSGSPANFHVYTALLKPHERIMALDLPHGGHLSH 
GYQTDTKKISAVSIFFETMPYRLDESTGYIDYDQMEKSATLFRPKLIVAGASAYARLYDY 
ARIRKVCNKQKAVMLADMAHISGLVAANVIPSPFDYADVVTTTTHKSLRGPRGAMIFFRK 
GVKEINKQGKEVLYDFEDKINQAVFPGLQGGPHNHTITGLAVALKQATTSEYKAYQEQVL 
SNSAKFAQTLMERGYELVSGGTDNHLVLVNLKPKGIDGSRVEKVLEAVHIASNKNTVPGD 
VSAMVPGGIRMGTPALTSRGFVEEDFAKVAEYFDKAVTIALKVKSEAQGTKLKDFVSAME 
SSSTIQSEIAKLRHEVEEFAKQFPTIGFEKETMKYKN*
>AT3G12780.1 |  PGK1 (PHOSPHOGLYCERATE KINASE 1) phosphoglycerate kinase 
MASAAASSAFSLLKSTGAVASSAGTRARASLLPIPSTSVSARPLGFSATLDSRRFSLHVA 
SKVESVRGKGSRGVVSMAKKSVGDLTSADLKGKKVFVRADLNVPLDDNQTITDDTRIRAA 
IPTIKYLIENGAKVILSTHLGRPKGVTPKFSLAPLVPRLSELLGIEVTKADDCIGPEVES 
LVASLPEGGVLLLENVRFYKEEEKNDPEFAKKLASLADLYVNDAFGTAHRAHASTEGVTK 
FLKPSVAGFLLQKELDYLVGAVSNPKRPFAAIVGGSKVSSKIGVIESLLEKCDILLLGGG 
MIFTFYKAQGLSVGSSLVEEDKLELATELLAKAKAKGVSLLLPTDVVVADKFAPDANSKI 
VPASGIEDGWMGLDIGPDSIKTFNEALDTTQTVIWNGPMGVFEMEKFAAGTEAIANKLAE 
LSEKGVTTIIGGGDSVAAVEKVGVAGVMSHISTGGGASLELLEGKVLPGVIALDEAIPVT 
V*
>AT2G27710.1 |  60S acidic ribosomal protein P2 (RPP2B) 
MKVVAAYLLAVLSGKASPTSADIKTILGSVGAETEDSQIELLLKEVKGKDLAELIAAGRE 
KLASVPSGGGGGVAVASATSGGGGGGGASAAESKKEEKKEEKEESDDDMGFSLFE*
>AT2G27710.2 |  60S acidic ribosomal protein P2 (RPP2B) 
MKVVAAYLLAVLSGKASPTSADIKTILGSVGAETEDSQIELLLKEVKGKDLAELIAAGRE 
KLASVPSGGGGGVAVASATSGGGGGGGASAAESKKEEKKEEKEESDDDMGFSLFE*
>AT2G27710.3 |  60S acidic ribosomal protein P2 (RPP2B) 
MKVVAAYLLAVLSGKASPTSADIKTILGSVGAETEDSQIELLLKEVKGKDLAELIAAGRE 
KLASVPSGGGGGVAVASATSGGGGGGGASAAESKKEEKKEEKEESDDDMGFSLFE*
>AT2G43810.1 |  small nuclear ribonucleoprotein F putative / U6 snRNA-associated Sm-like protein putative / Sm protein F putative 
MSGVGEKASGTTKTPADFLKSIRGKPVVVKLNSGVDYRGILTCLDGYMNIAMEQTEEYVN 
GQLKNTYGDAFVRGNNVLYISTTKGTLSDGA*
>AT2G43810.2 |  small nuclear ribonucleoprotein F putative / U6 snRNA-associated Sm-like protein putative / Sm protein F putative 
MSGVGEKASGTTKTPADFLKSIRGKPVVVKLNSGVDYRGILTCLDGYMNIAMEQTEEYVN 
GQLKNTYGDAFVRGNNVLYISTTKGTLSDGA*
>AT1G23290.1 |  RPL27AB structural constituent of ribosome 
MATALKKNRKKRGHVSAGHGRIGKHRKHPGGRGNAGGMHHHRILFDKYHPGYFGKVGMRY 
FHKLRNKFFCPIVNLDKLWSLVPEDVKAKSSKDNVPLIDVTQHGFFKVLGKGHLPENKPF 
VVKAKLISKTAEKKIKEAGGAVVLTA*
>AT3G59810.1 |  small nuclear ribonucleoprotein F putative / U6 snRNA-associated Sm-like protein putative / Sm protein F putative 
MSGVEEKVSGTTKTPADFLKSIRGRPVVVKLNSGVDYRGTLTCLDGYMNIAMEQTEEYVN 
GQLKNKYGDAFIRGNNVLYISTVNMTVADGA*
>AT2G03870.2 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFVRDHDDPLKT 
TDQTRRLGLIVCRGTAVMLVSPTDGTEEIANPFVTAEAV*
>AT2G03870.1 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFVRDHDDPLKT 
TDQTRRLGLIVCRGTAVMLVSPTDGTEEIANPFVTAEAV*
>AT4G30220.1 |  RUXF (SMALL NUCLEAR RIBONUCLEOPROTEIN F) 
MATIPVNPKPFLNNLTGKTVIVKLKWGMEYKGFLASVDSYMNLQLGNTEEYIDGQLTGNL 
GEILIRCNNVLYVRGVPEDEELEDADQD*
>AT3G14080.1 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MSWAGPEEIYLSTSLASYLDRKLLVLLRDGRKLMGTLRSFDQFANAVLEGACERVIVGEQ 
YCDIPLGLYVIRGENVVLIGELDTEREELPPHMIRVSEAEIKRAQKVEREASELRGTMRK 
RMEFLDFD*
>AT3G14080.2 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MSWAGPEEIYLSTSLASYLDRKLLVLLRDGRKLMGTLRSFDQFANAVLEGACERVIVGEQ 
YCDIPLGLYVIRGENVVLIGELDTEREELPPHMIRVSEAEIKRAQKVEREASELRGTMRK 
RMEFLDFD*
>AT1G19120.1 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MSWAAPDDIFFSTSLAAYLDKKLLVLLRDGRKLMGLLRSFDQFANAVLEEAYERVIVGDL 
YCDIPLGLYIIRGENVVLIGELDVEKEELPAHMVQVPEAEIKRAQKAEKEEMLLKGTMRK 
RMEFLDLD*
>AT3G62840.1 |  FUNCTIONS IN molecular_function unknown LOCATED IN small nucleolar ribonucleoprotein complex nucleus EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Like-Sm ribonucleoprotein core (InterProIPR001163) Like-Sm ribonucleoprotein eukaryotic and archaea-type core (InterProIPR006649) Like-Sm ribonucleoprotein-related core (InterProIPR010920) BEST Arabidopsis thaliana protein match is small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative (TAIRAT2G476404) Has 535 Blast hits to 535 proteins in 164 species Archae - 2 Bacteria - 0 Metazoa - 239 Fungi - 109 Plants - 78 Viruses - 0 Other Eukaryotes - 107 (source NCBI BLink) 
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM 
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT3G62840.1 |  FUNCTIONS IN molecular_function unknown LOCATED IN small nucleolar ribonucleoprotein complex nucleus EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Like-Sm ribonucleoprotein core (InterProIPR001163) Like-Sm ribonucleoprotein eukaryotic and archaea-type core (InterProIPR006649) Like-Sm ribonucleoprotein-related core (InterProIPR010920) BEST Arabidopsis thaliana protein match is small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative (TAIRAT2G476403) Has 537 Blast hits to 537 proteins in 165 species Archae - 2 Bacteria - 0 Metazoa - 239 Fungi - 109 Plants - 78 Viruses - 0 Other Eukaryotes - 109 (source NCBI BLink) 
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM 
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT3G62840.2 |  FUNCTIONS IN molecular_function unknown LOCATED IN small nucleolar ribonucleoprotein complex nucleus EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Like-Sm ribonucleoprotein core (InterProIPR001163) Like-Sm ribonucleoprotein eukaryotic and archaea-type core (InterProIPR006649) Like-Sm ribonucleoprotein-related core (InterProIPR010920) BEST Arabidopsis thaliana protein match is small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative (TAIRAT2G476404) Has 535 Blast hits to 535 proteins in 164 species Archae - 2 Bacteria - 0 Metazoa - 239 Fungi - 109 Plants - 78 Viruses - 0 Other Eukaryotes - 107 (source NCBI BLink) 
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV 
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT3G62840.2 |  FUNCTIONS IN molecular_function unknown LOCATED IN small nucleolar ribonucleoprotein complex nucleus EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Like-Sm ribonucleoprotein core (InterProIPR001163) Like-Sm ribonucleoprotein eukaryotic and archaea-type core (InterProIPR006649) Like-Sm ribonucleoprotein-related core (InterProIPR010920) BEST Arabidopsis thaliana protein match is small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative (TAIRAT2G476403) Has 537 Blast hits to 537 proteins in 165 species Archae - 2 Bacteria - 0 Metazoa - 239 Fungi - 109 Plants - 78 Viruses - 0 Other Eukaryotes - 109 (source NCBI BLink) 
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV 
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT5G27720.1 |  emb1644 (embryo defective 1644) 
MLPLSLLKTAQGHPMLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECY 
IRGNTIKYLRVPDEVIDKVQEEKTRTDRKPPGVGRGRGRGVDDGGARGRGRGTSMGKMGG 
NRGAGRGRG*
>AT2G23930.1 |  SNRNP-G (PROBABLE SMALL NUCLEAR RIBONUCLEOPROTEIN G) 
MSRSGQPPDLKKYMDKKLQIKLNANRMVTGTLRGFDQFMNLVVDNTVEVNGNDKTDIGMV 
VIRGNSIVTVEALEPVGRSS*
>AT2G23930.2 |  SNRNP-G (PROBABLE SMALL NUCLEAR RIBONUCLEOPROTEIN G) 
MDKKLQIKLNANRMVTGTLRGFDQFMNLVVDNTVEVNGNDKTDIGMVVIRGNSIVTVEAL 
EPVGRSS*
>AT1G21190.1 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MSVEEDATVREPLDLIRLSIEERIYVKLRSDRELRGKLHAFDQHLNMILGDVEEVITTIE 
IDDETYEEIVRTTKRTVPFLFVRGDGVILVSPPLRTT*
>AT1G76860.1 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MSGEEEATVREPLDLIRLSLDERIYVKLRSDRELRGKLHAFDQHLNMILGDVEETITTVE 
IDDETYEEIVRTTKRTIEFLFVRGDGVILVSPPLRTAA*
>AT4G30330.1 |  small nuclear ribonucleoprotein E putative / snRNP-E putative / Sm protein E putative 
MASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRITGFDEYMNLVLDEAEEV 
SIKKKTRKPLGRILLKGDNITLMMNAGK*
>AT1G03330.1 |  small nuclear ribonucleoprotein D putative / snRNP core SM-like protein putative / U6 snRNA-associated Sm-like protein putative 
MLFFSYFKDLVGQEVTVELKNDLAIRGTLHSVDQYLNIKLENTRVVDQDKYPHMLSVRNC 
FIRGSVVRYVQLPKDGVDVDLLHDAARREARGG*
>AT1G65700.1 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MAATTGLETLVDQIISVITNDGRNIVGVLKGFDQATNIILDESHERVFSTKEGVQQHVLG 
LYIIRGDNIGVIGELDEELDASLDFSKLRAHPLKPVVH*
>AT1G65700.2 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MAATTGLETLVDQIISVITNDGRNIVGVLKGFDQATNIILDESHERVFSTKEGVQQHVLG 
LYIIRGDNIGVIGELDEELDASLDFSKLRAHPLKPVVH*
>AT2G47640.1 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM 
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.2 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV 
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.3 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNMV 
LENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G47640.4 |  small nuclear ribonucleoprotein D2 putative / snRNP core protein D2 putative / Sm protein D2 putative 
MSKPMEEDTNQGKTEEEEFNTGPLSVLMMSVKNNTQVLINCRNNRKLLGRVRAFDRHCNM 
VLENVREMWTEVPKTGKGKKKALPVNRDRFISKMFLRGDSVIIVLRNPK*
>AT2G41500.1 |  EMB2776 nucleotide binding 
MEPNKDDNVSLAATAQISAPPVLQDASSLPGFSAIPPVVPPSFPPPMAPIPMMPHPPVAR 
PPTFRPPVSQNGGVKTSDSDSESDDEHIEISEESKQVRERQEKALQDLLVKRRAAAMAVP 
TNDKAVRDRLRRLGEPITLFGEQEMERRARLTQLLTRYDINGQLDKLVKDHEEDVTPKEE 
VDDEVLEYPFFTEGPKELREARIEIAKFSVKRAAVRIQRAKRRRDDPDEDMDAETKWALK 
HAKHMALDCSNFGDDRPLTGCSFSRDGKILATCSLSGVTKLWEMPQVTNTIAVLKDHKER 
ATDVVFSPVDDCLATASADRTAKLWKTDGTLLQTFEGHLDRLARVAFHPSGKYLGTTSYD 
KTWRLWDINTGAELLLQEGHSRSVYGIAFQQDGALAASCGLDSLARVWDLRTGRSILVFQ 
GHIKPVFSVNFSPNGYHLASGGEDNQCRIWDLRMRKSLYIIPAHANLVSQVKYEPQEGYF 
LATASYDMKVNIWSGRDFSLVKSLAGHESKVASLDITADSSCIATVSHDRTIKLWTSSGN 
DDEDEEKETMDIDL*
>AT1G28060.1 |  small nuclear ribonucleoprotein family protein / snRNP family protein 
MDKERYSRSHRDDRDRDSSPDHSPQREGGRRRDRDVDSKRRDSDHYRSSRRGDREDERDR 
TKDRRGRSVERGEREGSRDREKHHHERSHEGSKEKESRSKRKDREEENGARDGKKKSRFA 
DGNGERRSRFEDVAIEVENKDAQVSEGSGATNPTSGVTMGASTYSSIPSEASAAPSQTLL 
TKVSSISTTDENKASVVRSHEVPGKSSTDGRPLSTAGKSSANLPLDSSALAAKARKALQL 
QKGLADRLKNLPLLKKATKPTSEGSPHTRVPPSTTTPAVSTGTSFASTLPHTGLAGFGSI 
ANIEAVKRAQELAANMGFHQDREFAPVINLFPGQAPSDMTVAQRPEKPPVLRVDALGREI 
DEHGNVISVTKPSNLSTLKVNINKKKKDAFQILKPQLEADLKENPYFDTRMGIDEKKILR 
PKRMSFQFVEEGKWTRDAENLKFKSHFGEAKAKELKVKQAQLAKANDDINPNLIEVSERV 
PRKEKPKEPIPDVEWWDANVLTNGEYGEITDGTITESHLKIEKLTHYIEHPRPIEPPAEA 
APPPPQPLKLTKKEQKKLRTQRRLAKEKEKQEMIRQGLLEPPKAKVKMSNLMKVLGSEAT 
QDPTKLEKEIRTAAAEREQAHTDRNAARKLTPAEKREKKERKLFDDPTTVETIVSVYKIK 
KLSHPKTRFKVEMNARENRLTGCSVMTDEMSVVVVEGKSKAIKRYGKLMMKRINWEEAER 
KEGNEDEEEEVNGGNKCWLVWQGSIGKPSFHRFHVHECVTESTAKKVFMDAGVVHYWDLA 
VNYSDD*
>AT1G20580.1 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MSRSLGIPVKLLHEASGHIVTVELKSGELYRGSMIECEDNWNCQLEDITYTAKDGKVSQL 
EHVFIRGSKVRFMVIPDILKHAPMFKRLDARIKGKSSSLGVGRGRGAMRGKPAAGPGRGT 
GGRGAVPPVRR*
>AT4G14300.1 |  heterogeneous nuclear ribonucleoprotein putative / hnRNP putative 
MDSDQGKLFVGGISWETDEDKLREHFTNYGEVSQAIVMRDKLTGRPRGFGFVIFSDPSVL 
DRVLQEKHSIDTREVDVKRAMSREEQQVSGRTGNLNTSRSSGGDAYNKTKKIFVGGLPPT 
LTDEEFRQYFEVYGPVTDVAIMYDQATNRPRGFGFVSFDSEDAVDSVLHKTFHDLSGKQV 
EVKRALPKDANPGGGGRSMGGGGSGGYQGYGGNESSYDGRMDSNRFLQHQSVGNGLPSYG 
SSGYGAGYGNGSNGAGYGAYGGYTGSAGGYGAGATAGYGATNIPGAGYGSSTGVAPRNSW 
DTPASSGYGNPGYGSGAAHSGYGVPGAAPPTQSPSGYSNQGYGYGGYSGSDSGYGNQAAY 
GVVGGRPSGGGSNNPGSGGYMGGGYGDGSWRSDPSQGYGGGYNDGQGRQGQ*
>AT4G17300.1 |  NS1 asparagine-tRNA ligase 
MAATFLPATSLRLTQNSTLRFLSFFTISNPSYSLFRPLRRRVLPPFDAFPANSRRRCFCT 
AVSESLGSGDGNKVESYEKRFGSKVGEFRKKLRIAEVKGGADEGLSRVGQSLNIMGWVRT 
LRSQSSVTFIEINDGSCLSNLQCVMTSDAEGYDQVESGSILTGASVSVQGTIVASQGTKQ 
KVELKVEKIIVVGECDSSYPIQKKRVSREFLRTKAHLRPRTNTFGAVARVRNTLAYATHK 
FFQESGFVWVASPIITASDCEGAGEQFCVTTLIPSSHENTDTSIDAIPKTKGGLIDWSQD 
FFGKPAFLTVSGQLNGETYATALSDVYTFGPTFRAENSNTSRHLAEFWMIEPELAFADLD 
DDMACATAYLQYVVKYVLDNCKEDMEFFDTWIEKGIIRRLSDVAEKEFLQLGYTDAIEIL 
LKANKKFDFPVKWGLDLQSEHERYITEEAFGGRPVIIRDYPKEIKAFYMRENDDGKTVAA 
MDMLVPRIGELIGGSQREERLEVLEARLDELKLNKESYWWYLDLRRYGSVPHAGFGLGFE 
RLVQFVTGIDNIRDVIPFPRTPASAEF*
>AT2G19870.1 |  tRNA/rRNA methyltransferase (SpoU) family protein 
MYCNLTRSQGFPMAIRVSLQQTTIKSCLQFSKSSYLPVKFHLNPKGSTFAHVSTGLVSSR 
LNSMKIRMLRNPEGVREFSSFDSQRFGKRSSSNSSRGKSGLGSKAYRDKRSGGSGRSSGD 
SIWVKSDEKPVEERFQRGDRPSWEKRDGRNDSGSDRRSRSRGYGETRNRDSFRGRRDDRI 
SEVEEESKKGGGNSIWVANDDKPAKEQSPRVNNRSSWDDRTRNQNSFSARGDDRITEEAE 
EETMNHAPEDGDGIVEEEPDNTRWSEIKNRFNRYDVRDQGRDDAAYRNWNRQESWGRKTW 
QEATESSVPRLEGEVVYGVSPVLAALSVGRREFYALYVQEGLDLSSNNRKKKDKKGFERV 
LKISEKLGLNIKETSKHDLNMVADNRPHQGLVLDASPLELVKVKELDPISSEEEKYSLWV 
ALDEVTDPQNLGAIIRSAYFFGATGVVVCAKNSAPLSAVVSKASAGSLEVMELRYCKNMM 
QFLEASAENGWRVVGGSVSPKAVALNEVLPGSPTILVLGNEGTGLRPLVERSCTDLVRIS 
GNMPNEVAVTESDDAEGEGFRSFLAVESLNVSVAAGLFLHHLIGNKASA*
>AT1G80070.1 |  SUS2 (ABNORMAL SUSPENSOR 2) 
MWNNNDGMPLAPPGTGGSMMPPPPAAHPSYTALPPPSNPTPPVEPTPEEAEAKLEEKARK 
WMQLNSKRYGDKRKFGFVETQKEDMPPEHVRKIIRLVFFSSFSTISKYSLLDNYFLARDH 
GDMSSKKFRHDKRVYLGALKFVPHAVFKLLENMPMPWEQVRDVKVLYHITGAITFVNEIP 
WVVEPIYMAQWGTMWIMMRREKRDRRHFKRMRFPPFDDEEPPLDYADNLLDVDPLEPIQL 
ELDEEEDSAVHTWFYDHKPLVKTKLINGPSYRRWNLSLPIMATLHRLAGQLLSDLIDRNY 
FYLFDMPSFFTAKALNMCIPGGPKFEPLYRDMEKGDEDWNEFNDINKLIIRSPLRTEYRI 
AFPHLYNNRPRKVKLCVYHSPMIMYIKTEDPDLPAFYYDPLIHPISNTNKEKRERKVYDD 
EDDFALPEGVEPLLRDTQLYTDTTAAGISLLFAPRPFNMRSGRTRRAEDIPLVSEWFKEH 
CPPAYPVKVRVSYQKLLKCYVLNELHHRPPKAQKKKHLFRSLAATKFFQSTELDWVEVGL 
QVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVKTLTTKERKKSRFGNAFHLCREILRLT 
KLVVDANVQFRLGNVDAFQLADGLQYIFSHVGQLTGMYRYKYRLMRQIRMCKDLKHLIYY 
RFNTGPVGKGPGCGFWAPMWRVWLFFLRGIVPLLERWLGNLLARQFEGRHSKGVAKTVTK 
QRVESHFDLELRAAVMHDVLDAMPEGIKQNKARTILQHLSEAWRCWKANIPWKVPGLPVP 
IENMILRYVKSKADWWTNVAHYNRERIRRGATVDKTVCRKNLGRLTRLWLKAEQERQHNY 
LKDGPYVTPEEALAIYTTTVHWLESRKFSPIPFPPLSYKHDTKLLILALERLKESYSVAV 
RLNQQQREELGLIEQAYDNPHEALSRIKRHLLTQRGFKEVGIEFMDLYSYLIPVYEIEPL 
EKITDAYLDQYLWYEGDKRHLFPNWIKPADSEPPPLLVYKWCQGINNLQGIWDTGDGQCV 
VMLQTKFEKFFEKIDLTMLNRLLRLVLDHNIADYVSAKNNVVLSYKDMSHTNSYGLIRGL 
QFASFVVQFYGLLLDLLLLGLTRASEIAGPPQMPNEFMTFWDTKVETRHPIRLYSRYIDK 
VHIMFKFTHEEARDLIQRYLTEHPDPNNENMVGYNNKKCWPRDARMRLMKHDVNLGRSVF 
WDMKNRLPRSITTLEWENGFVSVYSKDNPNLLFSMCGFEVRILPKIRMTQEAFSNTKDGV 
WNLQNEQTKERTAVAFLRVDDEHMKVFENRVRQILMSSGSTTFTKIVNKWNTALIGLMTY 
FREATVHTQELLDLLVKCENKIQTRIKIGLNSKMPSRFPPVIFYTPKEIGGLGMLSMGHI 
LIPQSDLRYSKQTDVGVTHFRSGMSHEEDQLIPNLYRYIQPWESEFIDSQRVWAEYALKR 
QEAQAQNRRLTLEDLEDSWDRGIPRINTLFQKDRHTLAYDKGWRVRTDFKQYQVLKQNPF 
WWTHQRHDGKLWNLNNYRTDVIQALGGVEGILEHTLFKGTYFPTWEGLFWEKASGFEESM 
KYKKLTNAQRSGLNQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGIFMHGKIPTLKISL 
IQIFRAHLWQKIHESVVMDLCQVLDQELDALEIETVQKETIHPRKSYKMNSSCADVLLFA 
AHKWPMSKPSLVAESKDMFDQKASNKYWIDVQLRWGDYDSHDIERYTRAKFMDYTTDNMS 
IYPSPTGVMIGLDLAYNLHSAFGNWFPGSKPLLAQAMNKIMKSNPALYVLRERIRKGLQL 
YSSEPTEPYLSSQNYGEIFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPR 
TGQLFLKVIHTSVWAGQKRLGQLAKWKTAEEVAALVRSLPVEEQPKQIIVTRKGMLDPLE 
VHLLDFPNIVIKGSELQLPFQACLKIEKFGDLILKATEPQMVLFNIYDDWLKSISSYTAF 
SRLILILRALHVNNEKAKMLLKPDKSVVTEPHHIWPSLTDDQWMKVEVALRDLILSDYAK 
KNNVNTSALTQSEIRDIILGAEITPPSQQRQQIAEIEKQAKEASQLTAVTTRTTNVHGDE 
LIVTTTSPYEQSAFGSKTDWRVRAISATNLYLRVNHIYVNSDDIKETGYTYIMPKNILKK 
FICVADLRTQIAGYLYGISPPDNPQVKEIRCVVMVPQWGNHQLVHLPSSLPEHDFLNDLE 
PLGWLHTQPNELPQLSPQDVTSHSRILENNKQWDGEKCIILTCSFTPGSCSLTSYKLTQT 
GYEWGRLNKDNGSNPHGYLPTHYEKVQMLLSDRFLGFYMVPESGPWNYSFTGVKHTLSMK 
YSVKLGSPKEFYHEEHRPTHFLEFSNMEEADITEGDREDTFT*
>AT2G47000.1 |  ABCB4 (ATP BINDING CASSETTE SUBFAMILY B4) ATPase coupled to transmembrane movement of substances / xenobiotic-transporting ATPase 
MASESGLNGDPNILEEVSETKRDKEEEEEVKKTEKKDEEHEKTKTVPFYKLFAFADSFDF 
LLMILGTLGSIGNGLGFPLMTLLFGDLIDAFGENQTNTTDKVSKVALKFVWLGIGTFAAA 
FLQLSGWMISGERQAARIRSLYLKTILRQDIAFFDIDTNTGEVVGRMSGDTVLIQDAMGE 
KVGKAIQLLATFVGGFVIAFVRGWLLTLVMLSSIPLLVMAGALLAIVIAKTASRGQTAYA 
KAATVVEQTIGSIRTVASFTGEKQAISNYNKHLVTAYKAGVIEGGSTGLGLGTLFLVVFC 
SYALAVWYGGKLILDKGYTGGQVLNIIIAVLTGSMSLGQTSPCLSAFAAGQAAAYKMFET 
IERRPNIDSYSTNGKVLDDIKGDIELKDVYFTYPARPDEQIFRGFSLFISSGTTVALVGQ 
SGSGKSTVVSLIERFYDPQAGDVLIDGINLKEFQLKWIRSKIGLVSQEPVLFTASIKDNI 
AYGKEDATTEEIKAAAELANASKFVDKLPQGLDTMVGEHGTQLSGGQKQRIAVARAILKD 
PRILLLDEATSALDAESERVVQEALDRIMVNRTTVVVAHRLSTVRNADMIAVIHQGKIVE 
KGSHTELLKDPEGAYSQLIRLQEEKKSDENAAEEQKMSSIESFKQSSLRKSSLGRSLSKG 
GSSRGNSSRHSFNMFGFPAGIDGNVVQDQEEDDTTQPKTEPKKVSIFRIAALNKPEIPVL 
ILGSISAAANGVILPIFGILISSVIKAFFQPPKKLKEDTSFWAIIFMVLGFASIIAYPAQ 
TFFFAIAGCKLVQRIRSMCFEKVVHMEVGWFDEPENSSGTIGARLSADAATIRGLVGDSL 
AQTVQNLSSILAGLIIAFLACWQLAFVVLAMLPLIALNGFLYMKFMKGFSADAKKMYGEA 
SQVANDAVGSIRTVASFCAEDKVMNMYSKKCEGPMKNGIRQGIVSGIGFGFSFFVLFSSY 
AASFYVGARLVDDGKTTFDSVFRVFFALTMAAMAISQSSSLSPDSSKADVAAASIFAIMD 
RESKIDPSVESGRVLDNVKGDIELRHVSFKYPARPDVQIFQDLCLSIRAGKTVALVGESG 
SGKSTVIALLQRFYDPDSGEITLDGVEIKSLRLKWLRQQTGLVSQEPILFNETIRANIAY 
GKGGDASESEIVSSAELSNAHGFISGLQQGYDTMVGERGIQLSGGQKQRVAIARAIVKDP 
KVLLLDEATSALDAESERVVQDALDRVMVNRTTIVVAHRLSTIKNADVIAVVKNGVIVEK 
GKHDTLINIKDGVYASLVQLHLTAAS*
>AT1G70600.1 |  structural constituent of ribosome 
MTTRFKKNRKKRGHVSAGHGRIGKHRKHPGGRGNAGGMHHHRILFDKYHPGYFGKVGMRY 
FHKLRNKFFCPIVNLDKLWSLVPEDVKAKSTKDNVPLIDVTQHGFFKVLGKGHLPENKPF 
VVKAKLISKTAEKKIKEAGGAVVLTA*
>AT1G02520.1 |  PGP11 (P-GLYCOPROTEIN 11) ATPase coupled to transmembrane movement of substances 
MNGDGAREGDSVSHEPSTSKSPKEGEETKKEEKSEEKANTVPFYKLFAFADSSDVLLMIC 
GSIGAIGNGMSLPFMTLLFGDLIDSFGKNQNNKDIVDVVSKVCLKFVYLGLGTLGAAFLQ 
VACWMITGERQAARIRSTYLKTILRQDIGFFDVETNTGEVVGRMSGDTVLIQDAMGEKVG 
KFIQLVSTFVGGFVLAFIKGWLLTLVMLTSIPLLAMAGAAMALIVTRASSRGQAAYAKAA 
TVVEQTIGSIRTVASFTGEKQAINSYKKFITSAYKSSIQQGFSTGLGLGVMFFVFFSSYA 
LAIWFGGKMILEKGYTGGAVINVIIIVVAGSMSLGQTSPCVTAFAAGQAAAYKMFETIKR 
KPLIDAYDVNGKVLEDIRGDIELKDVHFSYPARPDEEIFDGFSLFIPSGATAALVGESGS 
GKSTVISLIERFYDPKSGAVLIDGVNLKEFQLKWIRSKIGLVSQEPVLFSSSIMENIAYG 
KENATVEEIKAATELANAAKFIDKLPQGLDTMVGEHGTQLSGGQKQRIAIARAILKDPRI 
LLLDEATSALDAESERVVQEALDRVMVNRTTVIVAHRLSTVRNADMIAVIHRGKMVEKGS 
HSELLKDSEGAYSQLIRLQEINKDVKTSELSSGSSFRNSNLKKSMEGTSSVGNSSRHHSL 
NVLGLTTGLDLGSHSQRAGQDETGTASQEPLPKVSLTRIAALNKPEIPVLLLGTVAAAIN 
GAIFPLFGILISRVIEAFFKPAHELKRDSRFWAIIFVALGVTSLIVSPTQMYLFAVAGGK 
LIRRIRSMCFEKAVHMEVAWFDEPQNSSGTMGARLSADATLIRALVGDALSLAVQNVASA 
ASGLIIAFTASWELALIILVMLPLIGINGFVQVKFMKGFSADAKSKYEEASQVANDAVGS 
IRTVASFCAEEKVMQMYKKQCEGPIKDGIKQGFISGLGFGFSFFILFCVYATSFYAGARL 
VEDGKTTFNNVFQVFFALTMAAIGISQSSTFAPDSSKAKVAAASIFAIIDRKSKIDSSDE 
TGTVLENVKGDIELRHLSFTYPARPDIQIFRDLCLTIRAGKTVALVGESGSGKSTVISLL 
QRFYDPDSGHITLDGVELKKLQLKWLRQQMGLVGQEPVLFNDTIRANIAYGKGSEEAATE 
SEIIAAAELANAHKFISSIQQGYDTVVGERGIQLSGGQKQRVAIARAIVKEPKILLLDEA 
TSALDAESERVVQDALDRVMVNRTTIVVAHRLSTIKNADVIAVVKNGVIAEKGTHETLIK 
IEGGVYASLVQLHMTASN*
>AT5G26780.1 |  SHM2 (SERINE HYDROXYMETHYLTRANSFERASE 2) catalytic/ glycine hydroxymethyltransferase/ pyridoxal phosphate binding 
MALALRRLSSSVKKPISLLSSNGGSLRFMSSLSTAAMAESEKSRSSWIKQLNASLDEIDP 
EVADIIELEKARQWKGFELIPSENFTSLSVMQAVGSVMTNKYSEGYPGARYYGGNEYIDM 
AETLCQKRALEAFQLDPSKWGVNVQSLSGSPANFQVYTALLKPHERIMALDLPHGGHLSH 
GYQTDTKKISAVSIFFETMPYRLDENTGYIDYDQLEKSAVLFRPKLIVAGASAYARLYDY 
ARIRKVCNKQKAVMLADMAHISGLVAAGVIPSPFEYADVVTTTTHKSLRGPRGAMIFFRK 
GLKEINKQGKEVMYDYEDRINQAVFPGLQGGPHNHTITGLAVALKQARTPEYKAYQDQVL 
RNCSKFAETLLAKGYDLVSGGTDNHLVLVNLKNKGIDGSRVEKVLELVHIAANKNTVPGD 
VSAMVPGGIRMGTPALTSRGFIEEDFAKVAEYFDLAVKIALKIKAESQGTKLKDFVATMQ 
SNEKLQSEMSKLREMVEEYAKQFPTIGFEKETMRYKE*
>AT5G26780.3 |  SHM2 (SERINE HYDROXYMETHYLTRANSFERASE 2) catalytic/ glycine hydroxymethyltransferase/ pyridoxal phosphate binding 
MALALRRLSSSVKKPISLLSSNGGSLRFMSSLSTAAMAESEKSRSSWIKQLNASLDEIDP 
EVADIIELEKARQWKGFELIPSENFTSLSVMQAVGSVMTNKYSEGYPGARYYGGNEYIDM 
AETLCQKRALEAFQLDPSKWGVNVQSLSGSPANFQVYTALLKPHERIMALDLPHGGHLSH 
GYQTDTKKISAVSIFFETMPYRLDENTGYIDYDQLEKSAVLFRPKLIVAGASAYARLYDY 
ARIRKVCNKQKAVMLADMAHISGLVAAGVIPSPFEYADVVTTTTHKSLRGPRGAMIFFRK 
GLKEINKQGKEVMYDYEDRINQAVFPGLQGGPHNHTITGLAVALKQARTPEYKAYQDQVL 
RNCSKFAELDIRPTVIISYGLSMQTLLAKGYDLVSGGTDNHLVLVNLKNKGIDGSRVEKV 
LELVHIAANKNTVPGDVSAMVPGGIRMGTPALTSRGFIEEDFAKVAEYFDLAVKIALKIK 
AESQGTKLKDFVATMQSNEKLQSEMSKLREMVEEYAKQFPTIGFEKETMRYKE*
>AT5G26780.2 |  SHM2 (SERINE HYDROXYMETHYLTRANSFERASE 2) catalytic/ glycine hydroxymethyltransferase/ pyridoxal phosphate binding 
MALALRRLSSSVKKPISLLSSNGGSLRFMSSLSTAAMAESEKSRSSWIKQLNASLDEIDP 
EVADIIELEKARQWKGFELIPSENFTSLSVMQAVGSVMTNKYSEGYPGARYYGGNEYIDM 
AETLCQKRALEAFQLDPSKWGVNVQSLSGSPANFQVYTALLKPHERIMALDLPHGGHLSH 
GYQTDTKKISAVSIFFETMPYRLDENTGYIDYDQLEKSAVLFRPKLIVAGASAYARLYDY 
ARIRKVCNKQKAVMLADMAHISGLVAAGVIPSPFEYADVVTTTTHKSLRGPRGAMIFFRK 
GLKEINKQGKEVMYDYEDRINQAVFPGLQGGPHNHTITGLAVALKQARTPEYKAYQDQVL 
RNCSKFAELDIRPTVIISYGLSMQTLLAKGYDLVSGGTDNHLVLVNLKNKGIDGSRVEKV 
LELVHIAANKNTVPGDVSAMVPGGIRMGTPALTSRGFIEEDFAKVAEYFDLAVKIALKIK 
AESQGTKLKDFVATMQSNEKLQSEMSKLREMVEEYAKQFPTIGFEKETMRYKE*
>AT1G70980.1 |  SYNC3 ATP binding / aminoacyl-tRNA ligase/ asparagine-tRNA ligase/ aspartate-tRNA ligase/ nucleic acid binding / nucleotide binding 
MGDEIVPPANQLAADNLENDGSTVQKAQFSDRVLIRSILGGGAKLAGQKVRIGGWVKTGR 
QQGKGTFAFLEVNDGSCPANLQVMVDSSLYDLSRLVATGTCVTVDGVLKIPPEGKGLKQS 
IELSVETVIAVGTVDPTTYPLPKTKLTPEFLRDVLHLRSRTNLISAVARIRNALAFATHS 
FFQEHSFLYIHTPIITTSDCEGAGEMFQVTTLINHTERVEQDLIDNPPPTEADVEAARLI 
VKERGEAVAQLKVAKASKEEITASVAQLSVAKASLAHVEERLRLKPGLPKNDGKIDYSND 
FFGRQAFLTVSGQLQVETYACALSSVYTFGPTFRAENSHTSRHLAEFWMVEPEIAFADIH 
DDMNCAEAYVKYMCKWLMDKCGDDMELMDKNVDEGCTKRLNMVAKASFKRVTYTEAIERL 
EKAVAQGKVVFDNKVEWGIDLASEHERYLTEVEFDQKPIIVYNYPKGIKAFYMRLNDDEK 
TVAAMDVLVPKVGELIGGSQREERYDVIKQRIEEMGLPMEPYEWYLDLRRYGTVKHCGFG 
LGFERMIQFATGIDNIRDVIPFPRYPGKADL*
>AT3G56340.1 |  40S ribosomal protein S26 (RPS26C) 
MTFKRRNGGRNKHNRGHVKPIRCSNCGKCCPKDKAIKRFIVRNIVEQAAIRDVQEASVYE 
GYTLPKLYAKTQYCVSCAIHSHVVRVRSRTNRRVRTPPPRFARRKEDTPKPAQPGQAPRP 
AGGAPAAPRA*
>AT3G46210.1 |  3 exoribonuclease family domain 1-containing protein 
MEIDREDGRTPNQLRPLACSRNILHRPHGSASWSQGDTKVLAAVYGPKAGTKKNENAEKA 
CFEVIWKPKSGQIGKVEKEYEMILKRTIQSICVLTVNPNTTTSVIIQVVHDDGSLLPCAI 
NAACAALVDAGIPMKHLAVAICCCLAENGYLVLDPNKLEEKKMTAFAYLVFPNTTLSVLP 
EGSSVAEGEPVEHGIITSITHGVMSVDDYFLCVENGRAATASLSAFFRKNFQQSSSKAG*
>AT3G46210.2 |  3 exoribonuclease family domain 1-containing protein 
MEIDREDGRTPNQLRPLACSRNILHRPHGSASWSQGDTKVLAAVYGPKAGTKKNENAEKA 
CFEVIWKPKSGQIGKVEKEYEMILKRTIQSICVLTVNPNTTTSVIIQVVHDDGSLLPCAI 
NAACAALVDAGIPMKHLAVAICCCLAENGYLVLDPNKLEEKKMTAFAYLVFPNTTLSVLP 
EGSSVAEGEPVEHGIITSITHGVMSVDDYFLCVENGRAATASLSAFFRKNFQQSSSKAG*
>AT3G46210.3 |  3 exoribonuclease family domain 1-containing protein 
MEIDREDGRTPNQLRPLACSRNILHRPHGSASWSQGDTKVLAAVYGPKAGTKKNENAEKA 
CFEVIWKPKSGQIGKVEKEYEMILKRTIQSICVLTVNPNTTTSVIIQVVHDDGSLLPCAI 
NAACAALVDAGIPMKHLAVAICCCLAENGYLVLDPNKLEEKKMTAFAYLVFPNTTLSVLP 
EGSSVAEGEPVEHGIITSITHGVMSVDDYFLCVENGRAATASLSAFFRKNFQQSSSKAG*
>AT3G46210.4 |  3 exoribonuclease family domain 1-containing protein 
MEIDREDGRTPNQLRPLACSRNILHRPHGSASWSQGDTKVLAAVYGPKAGTKKNENAEKA 
CFEVIWKPKSGQIGKVEKEYEMILKRTIQSICVLTVNPNTTTSVIIQVVHDDGSLLPCAI 
NAACAALVDAGIPMKHLAVAICCCLAENGYLVLDPNKLEEKKMTAFAYLVFPNTTLSVLP 
EGSSVAEGEPVEHGIITSITHGVMSVDDYFLCVENGRAATASLSAFFRKNFQQSSSKAG*
>AT3G46210.5 |  3 exoribonuclease family domain 1-containing protein 
MEIDREDGRTPNQLRPLACSRNILHRPHGSASWSQGDTKVLAAVYGPKAGTKKNENAEKA 
CFEVIWKPKSGQIGKVEKEYEMILKRTIQSICVLTVNPNTTTSVIIQVVHDDGSLLPCAI 
NAACAALVDAGIPMKHLAVAICCCLAENGYLVLDPNKLEEKKMTAFAYLVFPNTTLSVLP 
EGSSVAEGEPVEHGIITSITHGVMSVDDYFLCVENGRAATASLSAFFRKNFQQSSSKAG*
>AT1G04170.1 |  EIF2 GAMMA translation factor nucleic acid binding / translation initiation factor 
MSRNKGLAEQDLKKLDVTVLHPLSPEVISRQATINIGTIGHVAHGKSTVVKAISGVQTVR 
FKNELERNITIKLGYANAKIYKCEDEKCPRPMCYKAYGSGKEDTPNCDVPGFENSKMKLL 
RHVSFVDCPGHDILMATMLNGAAIMDGALLLIAANETCPQPQTSEHLAAVEIMQLKHIII 
LQNKIDLIQENVAINQHEAIQKFIMNTVADAAPIVPVSAQLKYNIDVVCEYIVKKIPIPE 
RNFVSPPNMIVIRSFDVNKPGYEVDEIKGGVAGGSILRGVLRVNQLIEIRPGIVTKDERG 
NSKCTPIYSRIISLYAEQNELQFAVPGGLIGVGTTMDPTLTRADRLVGQVLGEIGSLPDV 
FVELEVNFFLLRRLLGVRTKGSEKQGKVSKLTKGEILMLNIGSMSTGAKVVGVKVDLAKL 
QLTAPVCTSKGEKVALSRRVEKHWRLIGWGQIQAGTTIEVPPSPF*
>AT1G29940.1 |  NRPA2 DNA binding / DNA-directed RNA polymerase/ ribonucleoside binding 
MVVNAKDSTVPTMEDFKELHNLVTHHIESFDYMTLKGLDVMFNRIKPVSVYDPNTENELS 
IWLENPLVFAPQKESFKSTSRKEPLLPFECRQAKISYTGTFMADVCFKYNDGVVVRDKFD 
FGQFPIMLMSKLCSLKGADCRKLLKCKESTSEMGGYFILNGIERVFRCVIAPKRNHPTSM 
IRNSFRDRKEGYSSKAVVTRCVRDDQSSVTVKLYYLRNGSARVGFWIVGREYLLPVGLVL 
KALTNSCDEEIYESLNCCYSEHYGRGDGAIGTQLVRERAKIILDEVRDLGLFTREQCRKH 
LGQHFQPVLDGVKKESLSIVAEAVLRDYLFVHLDNDHDKFNLLIFIIQKLYSLVDQTSLP 
DNPDSLQNQEILVPGHVITIYLKEKLEEWLRKCKSLLKDELDNTNSKFSFESLADVKKLI 
NKNPPRSIGTSIETLLKTGALKTQSGLDLQQRAGYTVQAERLNFLRFLSFFRAVHRGASF 
AGLRTTTVRKLLPESWGFLCPVHTPDGTPCGLLNHMTRTSRITSQFDSKGNIRDFLKIRK 
SVVDVLTGAGMVPSLPKLVRAGPPKVIHVLLDGQVVGTLSSNLVTKVVSYIRRLKVEAPS 
VIPEDLEVGYVPTSMGGSYPGLYLASCPARFIRPVKNISIPSDNIELIGPFEQVFMEISC 
PDGGNGGRNNSSLATHEEIHPTGMISVVANLTPWSDHNQSPRNMYQCQMAKQTMAYSTQA 
LQFRADQKIYHLQTPQSPVVRTKTYTTYSIDENPTGTNAIVAVLAHTGFDMEDAMILNKS 
SVERGMCHGQIYQTENIDLSDQNSRFDSGSKSFRRSTNKAEHFRIDADGLPSVGQKLYPD 
EPYCSIYDEVTNKTRHMKRKGTDPVIVDFVSVDMKSKKHPQRANIRFRHARNPIIGDKFS 
SRHGQKGVCSQLWPDIDMPFNGVTGMRPDLIINPHAFPSRMTIAMLLESIAAKGGSLHGK 
FVDATPFRDAVKKTNGEEESKSSLLVDDLGSMLKEKGFNHYGTETLYSGYLGVELKCEIF 
MGPVYYQRLRHMVSDKFQVRSTGQVDQLTHQPIKGRKRGGGIRFGEMERDSLLAHGASYL 
LHDRLHTSSDHHIADVCSLCGSLLTSSVVNVQQKKLIQEIGKLPPGRTPKKVTCYSCKTS 
KGMETVAMPYVFRYLAAELASMNIKMTLQLSDREGVTD*
>AT1G53880.1 |  GTP binding / translation initiation factor 
MVKIHPDKAFPVDFSVGEGKTSPYLTTEKESFTIWMRSLVFHSKGCTVFDSKGNLIYRVD 
NYNSKSCSEVYLMDLYGKILFTLRQKKLGLFKSWKGYNSTGTRFQLRKNFKILPKGSSSS 
YKVVMGSRIVDGDHQSCYKIVKRKSVFTIEDGSGRLLAEVKKKQSNIKSLDLGKDVLTMM 
VEPQLETEIFEEFIKMWRESASFILDKHQNNKPISLTHSIDSPPPPSASMADENPNPNPI 
SAYYQTRAAHHGIVTSEWLEQAQAAVRRYPDRDSLVSGRPFSVIEDFNSWRQQPDLAEAV 
AAIRALAAVIRASEATTMMELEIELKKASDTLKSWDTTSISLTAGCDLFMRYVTRTSALE 
FEDFNSAKSRVLERAEKFGEISCKARTIIAMLSQDFIFDGCTILVHGFSRVVFEILKTSA 
QNKKLFRVLCTEGRPDKTGVLLANELAKLDIPVKLLIDSAVAYSMDEVDMVFVGADGVVE 
SGGIINMMGTYQIALVAQSMNKPVYVAAESYKFARLYPLDQKDLEPALRPIDFSVPVPPK 
VEVERSARDYTPPQYLTLLFTDLGVLTPSVVSDELIQLYL*
>AT4G03430.1 |  EMB2770 (EMBRYO DEFECTIVE 2770) RNA splicing factor transesterification mechanism 
MVFLSIPNGKTLSIDVNPNSTTISAFEQLAHQRSDVPQSFLRYSLRMRNPSRVFVDSKDS 
DSILLSDLGVSRFSTVIIHVLLLGGMQAAPPKPRLDFLNSKPPSNYVAGLGRGATGFTTR 
SDIGPARAAPDLPDRSALATAAAPGVGRGAGKPSEAEAEDDEEAEEKRYDENQTFDEFEG 
NDVGLFANAEYDEDDKEADAIWESIDQRMDSRRKDRREAKLKEEIEKYRASNPKITEQFA 
DLKRKLHTLSADEWDSIPEIGDYSLRNKKKKFESFVPIPDTLLEKAKKEKELVMALDPKS 
RAAGGSETPWGQTPVTDLTAVGEGRGTVLSLKLDNLSDSVSGQTVVDPKGYLTDLKSMKR 
TTDEEIYDRNRARLLYKSLTQSNPKNPNGWIAAARVEEVDGKIKAARFQIQRGCEECPKN 
EDVWLEACRLANPEDAKGVIAKGVKLIPNSVKLWLEAAKLEHDVENKSRVLRKGLEHIPD 
SVRLWKAVVELANEEDARILLHRAVECCPLHLELWVALARLETYAESKKVLNKAREKLPK 
EPAIWITAAKLEEANGKLDEANDNTAMVGKIIDRGIKTLQREGVVIDRENWMSEAEACER 
VGSVATCQAIIKNTIGIGVEEEDRKRTWVADADECKKRGSIETARAIYAHALSVFLTKKS 
IWLKAAQLEKSHGSRESLDALLRKAVTYVPQAEVLWLMGAKEKWLAGDVPAARAILQEAY 
AAIPNSEEIWLAAFKLEFENKEPERARMLLAKARERGGTERVWMKSAIVERELGNVEEER 
RLLNEGLKQFPTFFKLWLMLGQLEERFKHLEQARKAYDTGLKHCPHCIPLWLSLADLEEK 
VNGLNKARAILTTARKKNPGGAELWLAAIRAELRHDNKREAEHLMSKALQDCPKSGILWA 
ADIEMAPRPRRKTKSIDAMKKCDRDPHVTIAVAKLFWQDKKVEKARAWFERAVTVGPDIG 
DFWALFYKFELQHGSDEDRKEVVAKCVACEPKHGEKWQAISKAVENAHQPIEVILKRVVN 
ALSKEENSA*
>AT4G08960.1 |  phosphotyrosyl phosphatase activator (PTPA) family protein 
MEPPKEQQNTPEKSISDAPATISSAFPPSGCCTNCGGPTISEAPPPLASFPEMSPPPNYR 
PIRAPAINLPHNSQAIILSPVPHAEQVPVVSPPYHFQSPVKRIHSPDDIRRFHESASCKN 
FLGFVVSLSESIRGFKISDPCHISPTVAAIVSILETLLQWIDEIPPAQQSARYGNVSFRS 
WHERLRERGESLILEFLPEEFKESVIEIVPYFFDSFGNSSRIDYGTGHETNFAAWLYCLA 
RMGIVKEEDYHSLVARVFVKYLELMRKLQMVYCLEPAGSHGVWGLDDYHFLPFIFGSSQL 
IDHKYMKPKSIHNDDILENFSSEYMYLSCIAFVKKVKKGLFAEHSPLLDDISGVPNWKKV 
NSGLLKMYRVEVLEKVPIMQHFLFGWLIKWEE*
>AT5G38890.1 |  exoribonuclease-related 
MTTGLVTPGDVIGKATEFKAGKGAYVNDATIYASLTGTCRIVSPLPESIDQRAIVEVTGH 
KAHGPIPETGSVVIARVTKVMTKMAAVDILCVGSKAVRENFAGVIRQQDVRATEIDKVDM 
HQSFHAGDIVRAMVLSLGDARAYYLSTAKNELGVVSAESAAGETMVPISWTEMQCPLSGQ 
TEQRKVAKVGN*
>AT2G32170.1 |  LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321603) Has 362 Blast hits to 319 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 132 Fungi - 130 Plants - 25 Viruses - 0 Other Eukaryotes - 75 (source NCBI BLink) 
MVSPSEMTETAPAIMTTESERIESSRELVDNEEEEKIRRQKKLEEALEAKSLRRIISAYL 
NYPEASEEDLKRWERSYRKLSPAHKALVPHYPMKFQRLRRCISANSYFIFNMLQAFEPPI 
DLSQELDGCEDSNLDCAPHERYTLDERHDSSCQPALTNSCTYKEESKHIRDPITGVSIEE 
LQRKEAHDHSPKDDSADTRINDKTCDCHEGQLNHDHGSVSFSSHDWLDSSLQTHVPLVDV 
DKVRCIIRNIVRDWAAEGQRERDQCYKPILEELDSLFPDRLKESTPPACLVPGAGLGRLA 
LEISCLGFISQGNEFSYYMMICSSFILNYTQVPGEWTIYPWIHSNCNSLSDNDQLRPIAI 
PDIHPASAGITEGFSMCGGDFVEVYNESSHAGMWDAVVTCFFIDTAHNVIEYIQTISKIL 
KDGGVWINLGPLLYHFADTYGHENEMSIELSLEDVKRVASHFGFVIEKERTIETTYTTNP 
RAMMQNRYYTAFWTMRKKCAITTT*
>AT2G32170.1 |  LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321603) 
MVSPSEMTETAPAIMTTESERIESSRELVDNEEEEKIRRQKKLEEALEAKSLRRIISAYL 
NYPEASEEDLKRWERSYRKLSPAHKALVPHYPMKFQRLRRCISANSYFIFNMLQAFEPPI 
DLSQELDGCEDSNLDCAPHERYTLDERHDSSCQPALTNSCTYKEESKHIRDPITGVSIEE 
LQRKEAHDHSPKDDSADTRINDKTCDCHEGQLNHDHGSVSFSSHDWLDSSLQTHVPLVDV 
DKVRCIIRNIVRDWAAEGQRERDQCYKPILEELDSLFPDRLKESTPPACLVPGAGLGRLA 
LEISCLGFISQGNEFSYYMMICSSFILNYTQVPGEWTIYPWIHSNCNSLSDNDQLRPIAI 
PDIHPASAGITEGFSMCGGDFVEVYNESSHAGMWDAVVTCFFIDTAHNVIEYIQTISKIL 
KDGGVWINLGPLLYHFADTYGHENEMSIELSLEDVKRVASHFGFVIEKERTIETTYTTNP 
RAMMQNRYYTAFWTMRKKCAITTT*
>AT2G32170.2 |  LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321603) Has 362 Blast hits to 319 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 132 Fungi - 130 Plants - 25 Viruses - 0 Other Eukaryotes - 75 (source NCBI BLink) 
MVSPSEMTETAPAIMTTESERIESSRELVDNEEEEKIRRQKKLEEALEAKSLRRIISAYL 
NYPEASEEDLKRWERSYRKLSPAHKALVPHYPMKFQRLRRCISANSYFIFNMLQAFEPPI 
DLSQELDGCEDSNLDCAPHERYTLDERHDSSCQPALTNSCTYKEESKHIRDPITGVSIEE 
LQRKEAHDHSPKDDSADTRINDKTCDCHEGQLNHDHGSVSFSSHDWLDSSLQTHVPLVDV 
DKRETKIIMCHVRTYSQSLLVPLGSLVSYEGWFAHVALHGEVNPVELKCTFLHVRCIIRN 
IVRDWAAEGQRERDQCYKPILEELDSLFPDRLKESTCCIHSPYKVDYMICSTPPACLVPG 
AGLGRLALEISCLGFISQGNEFSYYMMICSSFILNYTQVPGEWTIYPWIHSNCNSLSDND 
QLRPIAIPDIHPASAGITEGFSMCGGDFVEVYNESSHAGMWDAVVTCFFIDTAHNVIEYI 
QTISKILKDGGVWINLGPLLYHFADTYGHENEMSIELSLEDVKRVASHFGFVIEKERTIE 
TTYTTNPRAMMQNRYYTAFWTMRKKCAITTT*
>AT2G32170.2 |  LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321603) 
MVSPSEMTETAPAIMTTESERIESSRELVDNEEEEKIRRQKKLEEALEAKSLRRIISAYL 
NYPEASEEDLKRWERSYRKLSPAHKALVPHYPMKFQRLRRCISANSYFIFNMLQAFEPPI 
DLSQELDGCEDSNLDCAPHERYTLDERHDSSCQPALTNSCTYKEESKHIRDPITGVSIEE 
LQRKEAHDHSPKDDSADTRINDKTCDCHEGQLNHDHGSVSFSSHDWLDSSLQTHVPLVDV 
DKRETKIIMCHVRTYSQSLLVPLGSLVSYEGWFAHVALHGEVNPVELKCTFLHVRCIIRN 
IVRDWAAEGQRERDQCYKPILEELDSLFPDRLKESTCCIHSPYKVDYMICSTPPACLVPG 
AGLGRLALEISCLGFISQGNEFSYYMMICSSFILNYTQVPGEWTIYPWIHSNCNSLSDND 
QLRPIAIPDIHPASAGITEGFSMCGGDFVEVYNESSHAGMWDAVVTCFFIDTAHNVIEYI 
QTISKILKDGGVWINLGPLLYHFADTYGHENEMSIELSLEDVKRVASHFGFVIEKERTIE 
TTYTTNPRAMMQNRYYTAFWTMRKKCAITTT*
>AT4G32720.1 |  AtLa1 (Arabidopsis thaliana La protein 1) RNA binding 
MSIPCLTEETAKTVLRQVEFYFSDSNLPIDDFLKKTVTESEDGLVSLALICSFSKMRGYL 
KLGDSKGDDIPEDTIKAVADTLRTSSALKISDDGKKVGRSTELLKLEDLIEQLNARTVAA 
SPFSYDVKREDVESFFSQYGKVNSVRMPRHVAESRIFSGVALVEFPTEEDAQNVMKQNLV 
FAGQELELKPKKEFDNEREKDEVKFANYQPQKGSANQKNGSDHKNNSAYEPDYPKGLIIS 
FTLKRSAEEGTTEQKSSEEPTDKTMEESETKPADTPDADKENTGEVQAEGAEDEDDEKEE 
KGALATHKDNKDVVLREDLKAVFGKFGDVKFVDFKMGSETGYLRFDEPEASQKARAAAVL 
ANEGGLAVKNFIAVLEPVIGEAEKEYWTLLRSKDRFDKGGRGGRGGRRGGRFGRKRGSDS 
PGGRWNKSQKVEA*
>AT4G32720.2 |  AtLa1 (Arabidopsis thaliana La protein 1) RNA binding 
MSIPCLTEETAKTVLRQVEFYFSDSNLPIDDFLKKTVTESEDGLVSLALICSFSKMRGYL 
KLGDSKGDDIPEDTIKAVADTLRTSSALKISDDGKKVGRSTELLKLEDLIEQLNARTVAA 
SPFSYDVKREDVESFFSQYGKVNSVRMPRHVAESRIFSGVALVEFPTEEDAQNVMKQNLV 
FAGQELELKPKKEFDNEREKDEVKFANYQPQKGSANQKNGSDHKNNSAYEPDYPKGLIIS 
FTLKRSAEEGTTEQKSSEEPTDKTMEESETKPADTPDADKENTGEVQAEGAEDEDDEKEE 
KGALATHKDNKDVVLREDLKAVFGKFGDVKFVDFKMGSETGYLRFDEPEASQKARAAAVL 
ANEGGLAVKNFIAVLEPVIGEAEKEYWTLLRSKDRFDKGGRGGR*
>AT5G41770.1 |  crooked neck protein putative / cell cycle protein putative 
MASGGKDSDRTLGYMTRKDTEVKLPRPTRVKNKTPAPIQITAEQILREARERQEAEIRPP 
KQKITDSTELSDYRLRRRKEFEDQIRRARWNIQVWVKYAQWEESQKDYARARSVWERAIE 
GDYRNHTLWLKYAEFEMKNKFVNSARNVWDRAVTLLPRVDQLWYKYIHMEEILGNIAGAR 
QIFERWMDWSPDQQGWLSFIKFELRYNEIERARTIYERFVLCHPKVSAYIRYAKFEMKGG 
EVARCRSVYERATEKLADDEEAEILFVAFAEFEERCKEVERARFIYKFALDHIPKGRAED 
LYRKFVAFEKQYGDKEGIEDAIVGKRRFQYEDEVRKSPSNYDSWFDYVRLEESVGNKDRI 
REIYERAIANVPPAEEKRYWQRYIYLWINYALFEEIETEDIERTRDVYRECLKLIPHSKF 
SFAKIWLLAAQFEIRQLNLTGARQILGNAIGKAPKDKIFKKYIEIELQLGNMDRCRKLYE 
RYLEWSPENCYAWSKYAELERSLVETERARAIFELAISQPALDMPELLWKAYIDFEISEG 
ELERTRALYERLLDRTKHYKVWVSFAKFEASAAELEEDENEDEDQEEDVIEHKKDCIKRA 
RAIFDRANTYYKDSTPELKEERATLLEDWLNMESSFGNLGDVSIVQSKLPKKLKKRKAIT 
REDGSTEYEEYIDYLYPEESQTTNLKILEAAYKWKKQKVAASEDD*
>AT2G32160.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 325 Blast hits to 315 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 70 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV 
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG 
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL 
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET 
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKKQ*
>AT2G32160.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 327 Blast hits to 317 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV 
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG 
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL 
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET 
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKKQ*
>AT2G32160.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 328 Blast hits to 321 proteins in 142 species Archae - 0 Bacteria - 0 Metazoa - 99 Fungi - 132 Plants - 25 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV 
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG 
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL 
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET 
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKKQ*
>AT2G32160.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 325 Blast hits to 315 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 70 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV 
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG 
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL 
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET 
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKEKTIETT 
YSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 327 Blast hits to 317 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV 
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG 
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL 
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET 
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKEKTIETT 
YSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 328 Blast hits to 321 proteins in 142 species Archae - 0 Bacteria - 0 Metazoa - 99 Fungi - 132 Plants - 25 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV 
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG 
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL 
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET 
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKEKTIETT 
YSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 325 Blast hits to 315 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 70 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSVPF 
SSHDWLDSSLQAHVPLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRH 
KESTRPACLVPGAGLGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPW 
IHTNCNSLSDDDQLRPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCF 
FIDTAHNIIEYIETISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASH 
YGFEMEKEKTIETTYSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 327 Blast hits to 317 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSVPF 
SSHDWLDSSLQAHVPLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRH 
KESTRPACLVPGAGLGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPW 
IHTNCNSLSDDDQLRPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCF 
FIDTAHNIIEYIETISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASH 
YGFEMEKEKTIETTYSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 328 Blast hits to 321 proteins in 142 species Archae - 0 Bacteria - 0 Metazoa - 99 Fungi - 132 Plants - 25 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSVPF 
SSHDWLDSSLQAHVPLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRH 
KESTRPACLVPGAGLGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPW 
IHTNCNSLSDDDQLRPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCF 
FIDTAHNIIEYIETISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASH 
YGFEMEKEKTIETTYSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT5G15770.1 |  AtGNA1 (Arabidopsis thaliana glucose-6-phosphate acetyltransferase 1) N-acetyltransferase/ glucosamine 6-phosphate N-acetyltransferase 
MAETFKIRKLEISDKRKGFIELLGQLTVTGSVTDEEFDRRFEEIRSYGDDHVICVIEEET 
SGKIAATGSVMIEKKFLRNCGKAGHIEDVVVDSRFRGKQLGKKVVEFLMDHCKSMGCYKV 
ILDCSVENKVFYEKCGMSNKSIQMSKYFD*
>AT1G79880.2 |  La domain-containing protein 
MRNLLGLGNINREDIPPRIVEEVANLLRTSDFLKVSNNGQRIGRGTKLSKPEEVLEQVHR 
RTLAASPFEYSIKMEDVSSFFSQYAKVNSVRLPPNIADKRRFCGTALVEFSSEQDTQDIL 
RQSLVYAGADLVLIPKSDFDCQRENMIKQLGKSESHNEFRRGQIVKFALKWIASEEKVTN 
KEKPSALKNKIKEKEDKETGIADREKENGDNSCASLCKDNTDQLVVPPWNNSNSVSSEVL 
KDLFQRFGSVEHIEYSGGLDSGYVWFTDSETAMKARAAVEFVGGLVVKNNFSVALEAING 
EMERELWKRLSSAELEGGKEGHKKEKGKDECFENVQPTKKARKEP*
>AT1G79880.3 |  La domain-containing protein 
MRNLLGLGNINREDIPPRIVEEVANLLRTSDFLKVSNNGQRIGRGTKLSKPEEVLEQVHR 
RTLAASPFEYSIKMEDVSSFFSQYAKVNSVRLPPNIADKRRFCGTALVEFSSEQDTQDIL 
RQSLVYAGADLVLIPKSDFDCQRENMIKQLGKSESHNEFRRGQIVKFALKWIASEEKVTN 
KEKPSALKNKIKEKEDKETGIADREKENGDNSCASLCKDNTDQLVVPPWNNSNSVSSEVL 
KDLFQRFGSVEHIEYSGGLDSGYVWFTDSETAMKARAAVEFVGGLVVKNNFSVALEAING 
EMERELWKRLSSAELEGGYDIKPNLVYSSLKEGHKKEKGKDECFENVQPTKKARKEP*
>AT1G79880.1 |  La domain-containing protein 
MASSFNEETAKKLLTQVEFYFSDSNLPTDGFLNREVTKSKDGLVSLPLVCSFSRMRNLLG 
LGNINREDIPPRIVEEVANLLRTSDFLKVSNNGQRIGRGTKLSKPEEVLEQVHRRTLAAS 
PFEYSIKMEDVSSFFSQYAKVNSVRLPPNIADKRRFCGTALVEFSSEQDTQDILRQSLVY 
AGADLVLIPKSDFDCQRENMIKQLGKSESHNEFRRGQIVKFALKWIASEEKVTNKEKPSA 
LKNKIKEKEDKETGIADREKENGDNSCASLCKDNTDQLVVPPWNNSNSVSSEVLKDLFQR 
FGSVEHIEYSGGLDSGYVWFTDSETAMKARAAVEFVGGLVVKNNFSVALEAINGEMEREL 
WKRLSSAELEGGKEGHKKEKGKDECFENVQPTKKARKEP*
>AT1G72340.1 |  eukaryotic translation initiation factor 2B family protein / eIF-2B family protein 
MWRRSPSFILDERRSSNSPPMADTTRGPFQNPDSISAYYQTRAAHHGVITSDWLAQAQAA 
VGGVSGDAQHDLSVTDLGNEKSFNVIEEFNNWRKQPDLAEAVAAIRALAAVIRASEASTM 
MELEIELKKASDTLKSWDKTSISLTAGCDLFIRYVTRTSALEYEDFNSAKSRLLERAEKF 
GEISCKARRIIAMLSQDFIFDGCTILVHGLSRVVLEILKTAAQNNKLFRVLCTEGRPDGT 
GVLLSSELSKLDIPVKLLLDSAVAYSMDEVDMVFVGADGVVESGGIINMMGTYQIALVAH 
SMNKPVYVAAESYKFARLYPLDQKDMAPALRPIEFGVKIPTKVEVERSARDYTPPQYLTL 
LFTDLGVLSPSVVSDELIQLYL*