>AT4G25630.1 |  FIB2 (FIBRILLARIN 2) snoRNA binding 
MRPPLTGSGGGFSGGRGRGGYSGGRGDGGFSGGRGGGGRGGGRGFSDRGGRGRGRGPPRG 
GARGGRGPAGRGGMKGGSKVIVEPHRHAGVFIAKGKEDALVTKNLVPGEAVYNEKRISVQ 
NEDGTKTEYRVWNPFRSKLAAAILGGVDNIWIKPGAKVLYLGAASGTTVSHVSDLVGPEG 
CVYAVEFSHRSGRDLVNMAKKRTNVIPIIEDARHPAKYRMLVGMVDVIFSDVAQPDQARI 
LALNASYFLKSGGHFVISIKANCIDSTVPAEAVFQTEVKKLQQEQFKPAEQVTLEPFERD 
HACVVGGYRMPKKPKAATAA*
>AT2G34570.1 |  MEE21 (maternal effect embryo arrest 21) 
MRVKRQKKNRRTVRFFTVCYGFRQPYKVLCDGTFVHHLVTNEITPADTAVSELLGGPVKL 
FTTRCVIAELEKLGKDFAESLEAAQTLNTATCEHEEAKTADECLSEVIGVQNTEHFFLGT 
QDAEFRRKLQQESIVPLVFGLRNILLIDQPSDFQRQSAKDSENKRLTMTDTEKKLLVKRT 
AKIIASNRKEATIANEEWGMPRVVSTKNGLGVKDRPQFKRNRAKGPNPLSCMKKKKENPQ 
SKSKADSNSNAQKEKKEGGSDTQKRSRKRSKKGKSGPERTE*
>AT5G66540.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN rRNA processing LOCATED IN cytosol nucleolus nucleus EXPRESSED IN 22 plant structures EXPRESSED DURING 12 growth stages CONTAINS InterPro DOMAIN/s U3 small nucleolar ribonucleoprotein complex subunit Mpp10p (InterProIPR012173) Mpp10 protein (InterProIPR007151) Has 76240 Blast hits to 38667 proteins in 1479 species Archae - 252 Bacteria - 6537 Metazoa - 31185 Fungi - 9935 Plants - 3937 Viruses - 750 Other Eukaryotes - 23644 (source NCBI BLink) 
MATVKDSGFEALEKLKATEPPVFLAPSSISEDARSASQYLFMKLKPHNPKCPFDQLSSDG 
FDAEQIWQQIDMQSQPLLTSLRQEVKRFAKNPEEIRKLGKLALKVSHEDDIDEMDMDGFD 
SDDVDDEDKEIESNDSEGEDEEEEEEDEEEEEEEEEEEEEEKDGDNEGIEDKFFKIKELE 
EFLEEGEAEEYGIDHKNKKGVAQRKKQNLSDDEDEEDDDDEEEDVEFDAFAGGDDEETDK 
LGKARYDDFFGGKKETKMKLKDLSEDEEAEIENKGNEKLSTHERARLKLQSKIEQMEKAN 
LDPKHWTMQGEITAAKRPMNSALEVDLDFEHNARPAPVITEEVTASLEDLIKSRIIEARF 
DDVQRAPRLPTKGKREAKELDESKSKKGLAEVYEAEYFQKANPAFAPTTHSDELKKEASM 
LFKKLCLKLDALSHFHFTPKPVIEEMSIPNVSAIAMEEVAPVAVSDAAMLAPEEIFSGKG 
DIKDESELTQEDRKRRRANKKRKFKAESANEPPKKALDTSTKNP*
>AT4G38220.1 |  aminoacylase putative / N-acyl-L-amino-acid amidohydrolase putative 
MSLLRLLLVVVVLHLSAVAGDDAIVSRFQEYLRINTVQPNPEYYKAVDFIISQAKPLSLE 
SQTIEFVKGKPLLLLKWVGSDPTLPAFLLNSHTDVVPFEDSKWTHHPLQAHMDHHGDIYA 
RGSQDMKCVGMQYLEAIRKLQASGFKPLRSVYLSFVPDEEIGGHDGAEKFAESQLFKSLN 
IAIVLDEGLPSPTESYRVFYGERSPWWLVIKAKGPPGHGAKLYDNSAMENLLKSIESIRR 
FRASQFDLLKAGGIAEGDVVSVNMAFLKAGTPSPTGFVMNLQPSEAEAGFDIRVPPSVDA 
EALERRLVEEWAPAARNMSFEFKQKLTGKQFLTAADDSNPWWGLLENAVKEAGGRTSKPE 
IFPASTDARYFRKAGVPAFGFSPISNTPSLLHDHNEYLGKAEYLKGIEVYVSIIKAYASY 
ESKSGSRDEL*
>AT4G38220.1 |  aminoacylase putative / N-acyl-L-amino-acid amidohydrolase putative 
MSLLRLLLVVVVLHLSAVAGDDAIVSRFQEYLRINTVQPNPEYYKAVDFIISQAKPLSLE 
SQTIEFVKGKPLLLLKWVGSDPTLPAFLLNSHTDVVPFEDSKWTHHPLQAHMDHHGDIYA 
RGSQDMKCVGMQYLEAIRKLQASGFKPLRSVYLSFVPDEEIGGHDGAEKFAESQLFKSLN 
IAIVLDEGLPSPTESYRVFYGERSPWWLVIKAKGPPGHGAKLYDNSAMENLLKSIESIRR 
FRASQFDLLKAGGIAEGDVVSVNMAFLKAGTPSPTGFVMNLQPSEAEAGFDIRVPPSVDA 
EALERRLVEEWAPAARNMSFEFKQKLTGKQFLTAADDSNPWWGLLENAVKEAGGRTSKPE 
IFPASTDARYFRKAGVPAFGFSPISNTPSLLHDHNEYLGKAEYLKGIEVYVSIIKAYASY 
ESKSGSRDEL*
>AT4G38220.2 |  aminoacylase putative / N-acyl-L-amino-acid amidohydrolase putative 
MSLLRLLLVVVVLHLSAVAGDDAIVSRFQEYLRINTVQPNPEYYKAVDFIISQAKPLSLE 
SQTIEFVKGKPLLLLKWVGSDPTLPAFLLNSHTDVVPFEDSKWTHHPLQAHMDHHGDIYA 
RGSQDMKCVGMQYLEAIRKLQASGFKPLRSVYLSFVPDEEIGGHDGAEKFAESQLFKSLN 
IAIVLDEGLPSPTESYRVFYGERSPWWLVIKAKGPPGHGAKLYDNSAMENLLKSIESIRR 
FRASQFDLLKAGGIAEGDVVSVNMAFLKAGTPSPTGFVMNLQPSEAEAGFDIRVPPSVDA 
EALERRLVEEWAPAARNMSFELGQFKQKLTGKQFLTAADDSNPWWGLLENAVKEAGGRTS 
KPEIFPASTDARYFRKAGVPAFGFSPISNTPSLLHDHNEYLGKAEYLKGIEVYVSIIKAY 
ASYESKSGSRDEL*
>AT4G38220.2 |  aminoacylase putative / N-acyl-L-amino-acid amidohydrolase putative 
MSLLRLLLVVVVLHLSAVAGDDAIVSRFQEYLRINTVQPNPEYYKAVDFIISQAKPLSLE 
SQTIEFVKGKPLLLLKWVGSDPTLPAFLLNSHTDVVPFEDSKWTHHPLQAHMDHHGDIYA 
RGSQDMKCVGMQYLEAIRKLQASGFKPLRSVYLSFVPDEEIGGHDGAEKFAESQLFKSLN 
IAIVLDEGLPSPTESYRVFYGERSPWWLVIKAKGPPGHGAKLYDNSAMENLLKSIESIRR 
FRASQFDLLKAGGIAEGDVVSVNMAFLKAGTPSPTGFVMNLQPSEAEAGFDIRVPPSVDA 
EALERRLVEEWAPAARNMSFELGQFKQKLTGKQFLTAADDSNPWWGLLENAVKEAGGRTS 
KPEIFPASTDARYFRKAGVPAFGFSPISNTPSLLHDHNEYLGKAEYLKGIEVYVSIIKAY 
ASYESKSGSRDEL*
>AT2G32160.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 325 Blast hits to 315 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 70 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV 
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG 
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL 
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET 
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKKQ*
>AT2G32160.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 327 Blast hits to 317 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV 
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG 
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL 
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET 
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKKQ*
>AT2G32160.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 328 Blast hits to 321 proteins in 142 species Archae - 0 Bacteria - 0 Metazoa - 99 Fungi - 132 Plants - 25 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV 
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG 
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL 
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET 
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKKQ*
>AT2G32160.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 325 Blast hits to 315 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 70 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV 
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG 
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL 
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET 
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKEKTIETT 
YSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 327 Blast hits to 317 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV 
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG 
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL 
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET 
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKEKTIETT 
YSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 328 Blast hits to 321 proteins in 142 species Archae - 0 Bacteria - 0 Metazoa - 99 Fungi - 132 Plants - 25 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV 
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG 
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL 
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET 
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKEKTIETT 
YSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 325 Blast hits to 315 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 70 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSVPF 
SSHDWLDSSLQAHVPLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRH 
KESTRPACLVPGAGLGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPW 
IHTNCNSLSDDDQLRPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCF 
FIDTAHNIIEYIETISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASH 
YGFEMEKEKTIETTYSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 327 Blast hits to 317 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSVPF 
SSHDWLDSSLQAHVPLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRH 
KESTRPACLVPGAGLGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPW 
IHTNCNSLSDDDQLRPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCF 
FIDTAHNIIEYIETISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASH 
YGFEMEKEKTIETTYSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 328 Blast hits to 321 proteins in 142 species Archae - 0 Bacteria - 0 Metazoa - 99 Fungi - 132 Plants - 25 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink) 
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA 
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP 
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSVPF 
SSHDWLDSSLQAHVPLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRH 
KESTRPACLVPGAGLGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPW 
IHTNCNSLSDDDQLRPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCF 
FIDTAHNIIEYIETISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASH 
YGFEMEKEKTIETTYSTNPRSMMKNRYYPVFWTMRKKCAITTT*