>AT4G25630.1 | FIB2 (FIBRILLARIN 2) snoRNA binding
MRPPLTGSGGGFSGGRGRGGYSGGRGDGGFSGGRGGGGRGGGRGFSDRGGRGRGRGPPRG
GARGGRGPAGRGGMKGGSKVIVEPHRHAGVFIAKGKEDALVTKNLVPGEAVYNEKRISVQ
NEDGTKTEYRVWNPFRSKLAAAILGGVDNIWIKPGAKVLYLGAASGTTVSHVSDLVGPEG
CVYAVEFSHRSGRDLVNMAKKRTNVIPIIEDARHPAKYRMLVGMVDVIFSDVAQPDQARI
LALNASYFLKSGGHFVISIKANCIDSTVPAEAVFQTEVKKLQQEQFKPAEQVTLEPFERD
HACVVGGYRMPKKPKAATAA*
>AT2G34570.1 | MEE21 (maternal effect embryo arrest 21)
MRVKRQKKNRRTVRFFTVCYGFRQPYKVLCDGTFVHHLVTNEITPADTAVSELLGGPVKL
FTTRCVIAELEKLGKDFAESLEAAQTLNTATCEHEEAKTADECLSEVIGVQNTEHFFLGT
QDAEFRRKLQQESIVPLVFGLRNILLIDQPSDFQRQSAKDSENKRLTMTDTEKKLLVKRT
AKIIASNRKEATIANEEWGMPRVVSTKNGLGVKDRPQFKRNRAKGPNPLSCMKKKKENPQ
SKSKADSNSNAQKEKKEGGSDTQKRSRKRSKKGKSGPERTE*
>AT5G66540.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN rRNA processing LOCATED IN cytosol nucleolus nucleus EXPRESSED IN 22 plant structures EXPRESSED DURING 12 growth stages CONTAINS InterPro DOMAIN/s U3 small nucleolar ribonucleoprotein complex subunit Mpp10p (InterProIPR012173) Mpp10 protein (InterProIPR007151) Has 76240 Blast hits to 38667 proteins in 1479 species Archae - 252 Bacteria - 6537 Metazoa - 31185 Fungi - 9935 Plants - 3937 Viruses - 750 Other Eukaryotes - 23644 (source NCBI BLink)
MATVKDSGFEALEKLKATEPPVFLAPSSISEDARSASQYLFMKLKPHNPKCPFDQLSSDG
FDAEQIWQQIDMQSQPLLTSLRQEVKRFAKNPEEIRKLGKLALKVSHEDDIDEMDMDGFD
SDDVDDEDKEIESNDSEGEDEEEEEEDEEEEEEEEEEEEEEKDGDNEGIEDKFFKIKELE
EFLEEGEAEEYGIDHKNKKGVAQRKKQNLSDDEDEEDDDDEEEDVEFDAFAGGDDEETDK
LGKARYDDFFGGKKETKMKLKDLSEDEEAEIENKGNEKLSTHERARLKLQSKIEQMEKAN
LDPKHWTMQGEITAAKRPMNSALEVDLDFEHNARPAPVITEEVTASLEDLIKSRIIEARF
DDVQRAPRLPTKGKREAKELDESKSKKGLAEVYEAEYFQKANPAFAPTTHSDELKKEASM
LFKKLCLKLDALSHFHFTPKPVIEEMSIPNVSAIAMEEVAPVAVSDAAMLAPEEIFSGKG
DIKDESELTQEDRKRRRANKKRKFKAESANEPPKKALDTSTKNP*
>AT4G38220.1 | aminoacylase putative / N-acyl-L-amino-acid amidohydrolase putative
MSLLRLLLVVVVLHLSAVAGDDAIVSRFQEYLRINTVQPNPEYYKAVDFIISQAKPLSLE
SQTIEFVKGKPLLLLKWVGSDPTLPAFLLNSHTDVVPFEDSKWTHHPLQAHMDHHGDIYA
RGSQDMKCVGMQYLEAIRKLQASGFKPLRSVYLSFVPDEEIGGHDGAEKFAESQLFKSLN
IAIVLDEGLPSPTESYRVFYGERSPWWLVIKAKGPPGHGAKLYDNSAMENLLKSIESIRR
FRASQFDLLKAGGIAEGDVVSVNMAFLKAGTPSPTGFVMNLQPSEAEAGFDIRVPPSVDA
EALERRLVEEWAPAARNMSFEFKQKLTGKQFLTAADDSNPWWGLLENAVKEAGGRTSKPE
IFPASTDARYFRKAGVPAFGFSPISNTPSLLHDHNEYLGKAEYLKGIEVYVSIIKAYASY
ESKSGSRDEL*
>AT4G38220.1 | aminoacylase putative / N-acyl-L-amino-acid amidohydrolase putative
MSLLRLLLVVVVLHLSAVAGDDAIVSRFQEYLRINTVQPNPEYYKAVDFIISQAKPLSLE
SQTIEFVKGKPLLLLKWVGSDPTLPAFLLNSHTDVVPFEDSKWTHHPLQAHMDHHGDIYA
RGSQDMKCVGMQYLEAIRKLQASGFKPLRSVYLSFVPDEEIGGHDGAEKFAESQLFKSLN
IAIVLDEGLPSPTESYRVFYGERSPWWLVIKAKGPPGHGAKLYDNSAMENLLKSIESIRR
FRASQFDLLKAGGIAEGDVVSVNMAFLKAGTPSPTGFVMNLQPSEAEAGFDIRVPPSVDA
EALERRLVEEWAPAARNMSFEFKQKLTGKQFLTAADDSNPWWGLLENAVKEAGGRTSKPE
IFPASTDARYFRKAGVPAFGFSPISNTPSLLHDHNEYLGKAEYLKGIEVYVSIIKAYASY
ESKSGSRDEL*
>AT4G38220.2 | aminoacylase putative / N-acyl-L-amino-acid amidohydrolase putative
MSLLRLLLVVVVLHLSAVAGDDAIVSRFQEYLRINTVQPNPEYYKAVDFIISQAKPLSLE
SQTIEFVKGKPLLLLKWVGSDPTLPAFLLNSHTDVVPFEDSKWTHHPLQAHMDHHGDIYA
RGSQDMKCVGMQYLEAIRKLQASGFKPLRSVYLSFVPDEEIGGHDGAEKFAESQLFKSLN
IAIVLDEGLPSPTESYRVFYGERSPWWLVIKAKGPPGHGAKLYDNSAMENLLKSIESIRR
FRASQFDLLKAGGIAEGDVVSVNMAFLKAGTPSPTGFVMNLQPSEAEAGFDIRVPPSVDA
EALERRLVEEWAPAARNMSFELGQFKQKLTGKQFLTAADDSNPWWGLLENAVKEAGGRTS
KPEIFPASTDARYFRKAGVPAFGFSPISNTPSLLHDHNEYLGKAEYLKGIEVYVSIIKAY
ASYESKSGSRDEL*
>AT4G38220.2 | aminoacylase putative / N-acyl-L-amino-acid amidohydrolase putative
MSLLRLLLVVVVLHLSAVAGDDAIVSRFQEYLRINTVQPNPEYYKAVDFIISQAKPLSLE
SQTIEFVKGKPLLLLKWVGSDPTLPAFLLNSHTDVVPFEDSKWTHHPLQAHMDHHGDIYA
RGSQDMKCVGMQYLEAIRKLQASGFKPLRSVYLSFVPDEEIGGHDGAEKFAESQLFKSLN
IAIVLDEGLPSPTESYRVFYGERSPWWLVIKAKGPPGHGAKLYDNSAMENLLKSIESIRR
FRASQFDLLKAGGIAEGDVVSVNMAFLKAGTPSPTGFVMNLQPSEAEAGFDIRVPPSVDA
EALERRLVEEWAPAARNMSFELGQFKQKLTGKQFLTAADDSNPWWGLLENAVKEAGGRTS
KPEIFPASTDARYFRKAGVPAFGFSPISNTPSLLHDHNEYLGKAEYLKGIEVYVSIIKAY
ASYESKSGSRDEL*
>AT2G32160.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 325 Blast hits to 315 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 70 (source NCBI BLink)
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKKQ*
>AT2G32160.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 327 Blast hits to 317 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink)
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKKQ*
>AT2G32160.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 328 Blast hits to 321 proteins in 142 species Archae - 0 Bacteria - 0 Metazoa - 99 Fungi - 132 Plants - 25 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink)
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKKQ*
>AT2G32160.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 325 Blast hits to 315 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 70 (source NCBI BLink)
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKEKTIETT
YSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 327 Blast hits to 317 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink)
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKEKTIETT
YSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 328 Blast hits to 321 proteins in 142 species Archae - 0 Bacteria - 0 Metazoa - 99 Fungi - 132 Plants - 25 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink)
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSAHV
PLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRHKESTRPACLVPGAG
LGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPWIHTNCNSLSDDDQL
RPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCFFIDTAHNIIEYIET
ISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASHYGFEMEKEKTIETT
YSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 325 Blast hits to 315 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 70 (source NCBI BLink)
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSVPF
SSHDWLDSSLQAHVPLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRH
KESTRPACLVPGAGLGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPW
IHTNCNSLSDDDQLRPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCF
FIDTAHNIIEYIETISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASH
YGFEMEKEKTIETTYSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 327 Blast hits to 317 proteins in 141 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 130 Plants - 30 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink)
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSVPF
SSHDWLDSSLQAHVPLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRH
KESTRPACLVPGAGLGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPW
IHTNCNSLSDDDQLRPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCF
FIDTAHNIIEYIETISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASH
YGFEMEKEKTIETTYSTNPRSMMKNRYYPVFWTMRKKCAITTT*
>AT2G32160.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 19 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s N2227-like (InterProIPR012901) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G321701) Has 328 Blast hits to 321 proteins in 142 species Archae - 0 Bacteria - 0 Metazoa - 99 Fungi - 132 Plants - 25 Viruses - 0 Other Eukaryotes - 72 (source NCBI BLink)
MISSSEMMATEKAPEFVTAMATETERIGNSREVDEEKISRKKNPEEEALEAKCLPGIISA
YLNYPKAAEENLKKCERSYQKLSPAHKALVTHFPMKLQRLRRCILMNSHFIYNMLQAFEP
PIDLSKHMREPITGALSMEELQREEHHHDHSLKNAEIRLNNKTCEFDGGHLNHDHGSVPF
SSHDWLDSSLQAHVPLVDVNKVRWVIRNIVRDWGAEGQRERDECYKPILEELDSLFPDRH
KESTRPACLVPGAGLGRLALEISCLGFRSQGNEVSYYMMLCSSFILNYTQLPGEWTIYPW
IHTNCNSLSDDDQLRPISIPDIHPASAGVTESFSMCRGDFVEVFNESSQAGMWDAVVTCF
FIDTAHNIIEYIETISKILKDGGVLINLGPLLYHFADEQGLENEMSIELSLEDVKRVASH
YGFEMEKEKTIETTYSTNPRSMMKNRYYPVFWTMRKKCAITTT*