>AT5G66540.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN rRNA processing LOCATED IN cytosol nucleolus nucleus EXPRESSED IN 22 plant structures EXPRESSED DURING 12 growth stages CONTAINS InterPro DOMAIN/s U3 small nucleolar ribonucleoprotein complex subunit Mpp10p (InterProIPR012173) Mpp10 protein (InterProIPR007151) Has 76240 Blast hits to 38667 proteins in 1479 species Archae - 252 Bacteria - 6537 Metazoa - 31185 Fungi - 9935 Plants - 3937 Viruses - 750 Other Eukaryotes - 23644 (source NCBI BLink)
MATVKDSGFEALEKLKATEPPVFLAPSSISEDARSASQYLFMKLKPHNPKCPFDQLSSDG
FDAEQIWQQIDMQSQPLLTSLRQEVKRFAKNPEEIRKLGKLALKVSHEDDIDEMDMDGFD
SDDVDDEDKEIESNDSEGEDEEEEEEDEEEEEEEEEEEEEEKDGDNEGIEDKFFKIKELE
EFLEEGEAEEYGIDHKNKKGVAQRKKQNLSDDEDEEDDDDEEEDVEFDAFAGGDDEETDK
LGKARYDDFFGGKKETKMKLKDLSEDEEAEIENKGNEKLSTHERARLKLQSKIEQMEKAN
LDPKHWTMQGEITAAKRPMNSALEVDLDFEHNARPAPVITEEVTASLEDLIKSRIIEARF
DDVQRAPRLPTKGKREAKELDESKSKKGLAEVYEAEYFQKANPAFAPTTHSDELKKEASM
LFKKLCLKLDALSHFHFTPKPVIEEMSIPNVSAIAMEEVAPVAVSDAAMLAPEEIFSGKG
DIKDESELTQEDRKRRRANKKRKFKAESANEPPKKALDTSTKNP*
>AT4G31700.1 | RPS6 (RIBOSOMAL PROTEIN S6) structural constituent of ribosome
MKFNVANPTTGCQKKLEIDDDQKLRAFYDKRISQEVSGDALGEEFKGYVFKIKGGCDKQG
FPMKQGVLTPGRVRLLLHRGTPCFRGHGRRTGERRRKSVRGCIVSPDLSVLNLVIVKKGE
NDLPGLTDTEKPRMRGPKRASKIRKLFNLKKEDDVRTYVNTYRRKFTNKKGKEVSKAPKI
QRLVTPLTLQRKRARIADKKKKIAKANSDAADYQKLLASRLKEQRDRRSESLAKKRSRLS
SAAAKPSVTA*
>AT4G31700.1 | RPS6 (RIBOSOMAL PROTEIN S6) structural constituent of ribosome
MKFNVANPTTGCQKKLEIDDDQKLRAFYDKRISQEVSGDALGEEFKGYVFKIKGGCDKQG
FPMKQGVLTPGRVRLLLHRGTPCFRGHGRRTGERRRKSVRGCIVSPDLSVLNLVIVKKGE
NDLPGLTDTEKPRMRGPKRASKIRKLFNLKKEDDVRTYVNTYRRKFTNKKGKEVSKAPKI
QRLVTPLTLQRKRARIADKKKKIAKANSDAADYQKLLASRLKEQRDRRSESLAKKRSRLS
SAAAKPSVTA*
>AT4G31700.2 | RPS6 (RIBOSOMAL PROTEIN S6) structural constituent of ribosome
MKQGVLTPGRVRLLLHRGTPCFRGHGRRTGERRRKSVRGCIVSPDLSVLNLVIVKKGEND
LPGLTDTEKPRMRGPKRASKIRKLFNLKKEDDVRTYVNTYRRKFTNKKGKEVSKAPKIQR
LVTPLTLQRKRARIADKKKKIAKANSDAADYQKLLASRLKEQRDRRSESLAKKRSRLSSA
AAKPSVTA*
>AT4G31700.2 | RPS6 (RIBOSOMAL PROTEIN S6) structural constituent of ribosome
MKQGVLTPGRVRLLLHRGTPCFRGHGRRTGERRRKSVRGCIVSPDLSVLNLVIVKKGEND
LPGLTDTEKPRMRGPKRASKIRKLFNLKKEDDVRTYVNTYRRKFTNKKGKEVSKAPKIQR
LVTPLTLQRKRARIADKKKKIAKANSDAADYQKLLASRLKEQRDRRSESLAKKRSRLSSA
AAKPSVTA*
>AT5G15200.1 | 40S ribosomal protein S9 (RPS9B)
MVHVCYYRNYGKTFKGPRRPYEKERLDSELKLVGEYGLRNKRELWRVQYSLSRIRNAARD
LLTLDEKSPRRIFEGEALLRRMNRYGLLDESQNKLDYVLALTVENFLERRLQTIVFKSGM
AKSIHHSRVLIRQRHIRVGKQLVNIPSFMVRLDSQKHIDFALTSPFGGGRPGRVKRRNEK
SASKKASGGGDADGDDEE*
>AT5G15200.1 | 40S ribosomal protein S9 (RPS9B)
MVHVCYYRNYGKTFKGPRRPYEKERLDSELKLVGEYGLRNKRELWRVQYSLSRIRNAARD
LLTLDEKSPRRIFEGEALLRRMNRYGLLDESQNKLDYVLALTVENFLERRLQTIVFKSGM
AKSIHHSRVLIRQRHIRVGKQLVNIPSFMVRLDSQKHIDFALTSPFGGGRPGRVKRRNEK
SASKKASGGGDADGDDEE*
>AT5G15200.2 | 40S ribosomal protein S9 (RPS9B)
MVHVCYYRNYGKTFKGPRRPYEKERLDSELKLVGEYGLRNKRELWRVQYSLSRIRNAARD
LLTLDEKSPRRIFEGEALLRRMNRYGLLDESQNKLDYVLALTVENFLERRLQTIVFKSGM
AKSIHHSRVLIRQRHIRYCISSIYLCLCMNYLCYIN*
>AT5G15200.2 | 40S ribosomal protein S9 (RPS9B)
MVHVCYYRNYGKTFKGPRRPYEKERLDSELKLVGEYGLRNKRELWRVQYSLSRIRNAARD
LLTLDEKSPRRIFEGEALLRRMNRYGLLDESQNKLDYVLALTVENFLERRLQTIVFKSGM
AKSIHHSRVLIRQRHIRYCISSIYLCLCMNYLCYIN*
>AT5G20290.1 | 40S ribosomal protein S8 (RPS8A)
MGISRDSIHKRRATGGKQKQWRKKRKYEMGRQPANTKLSSNKTVRRIRVRGGNVKWRALR
LDTGNYSWGSEATTRKTRVLDVVYNASNNELVRTKTLVKSAIVQVDAAPFKQWYLSHYGV
ELGRKKKSASSTKKDGEEGEEAAVAAPEEVKKSNHLLRKIASRQEGRSLDSHIEDQFASG
RLLACISSRPGQCGRADGYILEGKELEFYMKKIQKKKGKGAA*
>AT3G11964.1 | RNA binding
MVVPQKKFANGKRNDSTKSFKPMKKPFKKTKDDVAARSEAMALQLEDVPDFPRGGGTSLS
KKEREKLYEEVDAEFDADERVSKKSKGGKSKKRIPSDLDDLGLLFGGGLHGKRPRYANKI
TTKNISPGMKLLGVVTEVNQKDIVISLPGGLRGLVRASEVSDFTDRGIEDDENELLGDIF
SVGQLVPCIVLELDDDKKEAGKRKIWLSLRLSLLHKGFSFDSFQLGMVFSANVKSIEDHG
SILHFGLPSITGFIEISDDGNQESGMKTGQLIQGVVTKIDRDRKIVHLSSDPDSVAKCLT
KDLSGMSFDLLIPGMMVNARVQSVLENGILFDFLTYFNGTVDLFHLKNPLSNKSWKDEYN
QNKTVNARILFIDPSSRAVGLTLSPHVVCNKAPPLHVFSGDIFDEAKVVRIDKSGLLLEL
PSKPTPTPAYVSFKEGNHIRVRVLGLKQMEGLAVGTLKESAFEGPVFTHSDVKPGMVTKA
KVISVDTFGAIVQFSGGLKAMCPLRHMSEFEVTKPRKKFKVGAELVFRVLGCKSKRITVT
YKKTLVKSKLPILSSYTDATEGLVTHGWITKIEKHGCFVRFYNGVQGFVPRFELGLEPGS
DPDSVFHVGEVVKCRVTSAVHGTQRITLNDSIKLGSIVSGIIDTITSQAVIVRVKSKSVV
KGTISAEHLADHHEQAKLIMSLLRPGYELDKLLVLDIEGNNMALSSKYSLIKLAEELPSD
FNQLQPNSVVHGYVCNLIENGCFVRFLGRLTGFAPRSKAIDDPKADVSESFFVGQSVRAN
IVDVNQEKSRITLSLKQSSCASVDASFVQEYFLMDEKISDLQSSDITKSDCSWVEKFSIG
SLIKGTIQEQNDLGVVVNFDNINNVLGFIPQHHMGGATLVPGSVVNAVVLDISRAERLVD
LSLRPELLNNLTKEVSNSSKKKRKRGISKELEVHQRVSAVVEIVKEQHLVLSIPEHGYTI
GYASVSDYNTQKLPVKQFSTGQSVVASVKAVQNPLTSGRLLLLLDSVSGTSETSRSKRAK
KKSSCEVGSVVHAEITEIKPFELRVNFGNSFRGRIHITEVLVNDASTSDEPFAKFRVGQS
ISARVVAKPCHTDIKKTQLWELSVKPAMLKDSSEFNDTQESEQLEFAAGQCVIGYVYKVD
KEWVWLAVSRNVTARIFILDTSCKAHELEEFERRFPIGKAVSGYVLTYNKEKKTLRLVQR
PLLFIHKSIANGGGSKTDKPDSSIPGDDDTLFIHEGDILGGRISKILPGVGGLRVQLGPY
VFGRVHFTEINDSWVPDPLDGFREGQFVKCKVLEISSSSKGTWQIELSLRTSLDGMSSAD
HLSEDLKNNDNVCKRFERIEDLSPDMGVQGYVKNTMSKGCFIILSRTVEAKVRLSNLCDT
FVKEPEKEFPVGKLVTGRVLNVEPLSKRIEVTLKTVNAGGRPKSESYDLKKLHVGDMISG
RIRRVEPFGLFIDIDQTGMVGLCHISQLSDDRMENVQARYKAGESVRAKILKLDEEKKRI
SLGMKSSYLMNGDDDKAQPLSEDNTSMECDPINDPKSEVLAAVDDFGFQETSGGTSLVLA
QVESRASIPPLEVDLDDIEETDFDSSQNQEKLLGANKDEKSKRREKQKDKEEREKKIQAA
EGRLLEHHAPENADEFEKLVRSSPNSSFVWIKYMAFMLSLADIEKARSIAERALRTINIR
EEEEKLNIWVAYFNLENEHGNPPEESVKKVFERARQYCDPKKVYLALLGVYERTEQYKLA
DKLLDEMIKKFKQSCKIWLRKIQSSLKQNEEAIQSVVNRALLCLPRHKHIKFISQTAILE
FKCGVADRGRSLFEGVLREYPKRTDLWSVYLDQEIRLGEDDVIRSLFERAISLSLPPKKM
KFLFKKFLEYEKSVGDEERVEYVKQRAMEYANSTLA*
>AT3G11510.1 | 40S ribosomal protein S14 (RPS14B)
MSKRKTKEPKVETVTLGPSVREGEQVFGVVHIFASFNDTFIHVTDLSGRETLVRITGGMK
VKADRDESSPYAAMLAAQDVAQRCKELGITAMHVKLRATGGNKTKTPGPGAQSALRALAR
SGMKIGRIEDVTPIPTDSTRRKGGRRGRRL*
>AT4G00100.1 | ATRPS13A (ARABIDOPSIS THALIANA RIBOSOMAL PROTEIN S13A) structural constituent of ribosome
MGRMHSRGKGISASALPYKRSSPSWLKTTSQDVDESICKFAKKGLTPSQIGVILRDSHGI
PQVKSVTGSKILRILKAHGLAPEIPEDLYHLIKKAVAIRKHLERNRKDKDSKFRLILVES
RIHRLARYYKKTKKLPPVWKYESTTASTLVA*
>AT5G04600.1 | RNA recognition motif (RRM)-containing protein
MGAKAKKALKKNMKKVAASASSSQLPLPQNPKPSADFLPLEGGPARKAPVTTPPLQNKAT
VLYIGRIPHGFYETEIEAFFSQFGTVKRVRVARNKKTGKSKHFGFIQFEDPEVAEIAAGA
MNDYLLMEHMLKVHVIEPENVKPNLWRGFKCNFKPVDSVQIERRQLNKERTLEEHRKMLQ
KIVKKDQKRRKRIEAAGIEYECPELVGNTQPVPKRIKFSEED*
>AT4G25630.1 | FIB2 (FIBRILLARIN 2) snoRNA binding
MRPPLTGSGGGFSGGRGRGGYSGGRGDGGFSGGRGGGGRGGGRGFSDRGGRGRGRGPPRG
GARGGRGPAGRGGMKGGSKVIVEPHRHAGVFIAKGKEDALVTKNLVPGEAVYNEKRISVQ
NEDGTKTEYRVWNPFRSKLAAAILGGVDNIWIKPGAKVLYLGAASGTTVSHVSDLVGPEG
CVYAVEFSHRSGRDLVNMAKKRTNVIPIIEDARHPAKYRMLVGMVDVIFSDVAQPDQARI
LALNASYFLKSGGHFVISIKANCIDSTVPAEAVFQTEVKKLQQEQFKPAEQVTLEPFERD
HACVVGGYRMPKKPKAATAA*
>AT2G41500.1 | EMB2776 nucleotide binding
MEPNKDDNVSLAATAQISAPPVLQDASSLPGFSAIPPVVPPSFPPPMAPIPMMPHPPVAR
PPTFRPPVSQNGGVKTSDSDSESDDEHIEISEESKQVRERQEKALQDLLVKRRAAAMAVP
TNDKAVRDRLRRLGEPITLFGEQEMERRARLTQLLTRYDINGQLDKLVKDHEEDVTPKEE
VDDEVLEYPFFTEGPKELREARIEIAKFSVKRAAVRIQRAKRRRDDPDEDMDAETKWALK
HAKHMALDCSNFGDDRPLTGCSFSRDGKILATCSLSGVTKLWEMPQVTNTIAVLKDHKER
ATDVVFSPVDDCLATASADRTAKLWKTDGTLLQTFEGHLDRLARVAFHPSGKYLGTTSYD
KTWRLWDINTGAELLLQEGHSRSVYGIAFQQDGALAASCGLDSLARVWDLRTGRSILVFQ
GHIKPVFSVNFSPNGYHLASGGEDNQCRIWDLRMRKSLYIIPAHANLVSQVKYEPQEGYF
LATASYDMKVNIWSGRDFSLVKSLAGHESKVASLDITADSSCIATVSHDRTIKLWTSSGN
DDEDEEKETMDIDL*
>AT3G05060.1 | SAR DNA-binding protein putative
MVLVLYETAAGFALFKVKDEGKMANVEDLCKEFDTPDSARKMVKLKAFEKFDNTSEALEA
VAKLLEGAPSKGLRKFLKANCQGETLAVADSKLGNVIKEKLKIDCIHNNAVMELLRGVRS
QFTELISGLGDQDLAPMSLGLSHSLARYKLKFSSDKVDTMIIQAIGLLDDLDKELNTYAM
RVREWYGWHFPELAKIISDNILYAKSVKLMGNRVNAAKLDFSEILADEIEADLKDAAVIS
MGTEVSDLDLLHIRELCDQVLSLSEYRAQLYDYLKSRMNTIAPNLTALVGELVGARLISH
GGSLLNLSKQPGSTVQILGAEKALFRALKTKHATPKYGLIFHASLVGQAAPKHKGKISRS
LAAKTVLAIRVDALGDSQDNTMGLENRAKLEARLRNLEGKDLGRLSGSSKGKPKIEVYNK
DKKMGSGGLITPAKTYNTAADSLLGETSAKSEEPSKKKDKKKKKKVEEEKPEEEEPSEKK
KKKKAEAETEAVVEVAKEEKKKNKKKRKHEEEETTETPAKKKDKKEKKKKSKD*
>AT3G12860.1 | nucleolar protein Nop56 putative
MKIYLLSESPSGYGLFEGHGSDEIGQNTEAVRSSVSDLSRFGRVVQLTAFHPFQSALDAL
NQINAVSEGYMSDELRSFLELNLPKVKEGKKPKFSLGVSEPKIGSCIFEATKIPCQSNEF
VHELLRGVRQHFDRFIKDLKPGDLEKAQLGLAHSYSRAKVKFNVNRVDNMVIQAIFMLDT
LDKDINSFAMRVREWYSWHFPELVKIVNDNYLYAKVSKIIVDKSKLSEEHIPMLTEALGD
EDKAREVIEAGKASMGQDLSPVDLINVQTFAQRVMDLADYRKKLYDYLVTKMSDIAPNLA
ALIGEMVGARLISHAGSLTNLAKCPSSTLQILGAEKALFRALKTRGNTPKYGLIFHSSFI
GRASAKNKGRIARFLANKCSIASRIDCFSDNSTTAFGEKLREQVEERLDFYDKGVAPRKN
VDVMKEVLENLEKKDEGEKTVDASEKKKKRKTEEKEEEKEEEKSKKKKKKSKAVEGEELT
ATDNGHSKKKKKTKSQDDE*
>AT4G05410.1 | transducin family protein / WD-40 repeat family protein
MKYNNEKKKGGSFKRGGKKGSNERDPFFEEEPKKRRKVSYDDDDIESVDSDAEENGFTGG
DEDGRRVDGEVEDEDEFADETAGEKRKRLAEEMLNRRREAMRREREEADNDDDDDEDDDE
TIKKSLMQKQQEDSGRIRRLIASRVQEPLSTDGFSVIVKHRRSVVSVALSDDDSRGFSAS
KDGTIMHWDVSSGKTDKYIWPSDEILKSHGMKLREPRNKNHSRESLALAVSSDGRYLATG
GVDRHVHIWDVRTREHVQAFPGHRNTVSCLCFRYGTSELYSGSFDRTVKVWNVEDKAFIT
ENHGHQGEILAIDALRKERALTVGRDRTMLYHKVPESTRMIYRAPASSLESCCFISDNEY
LSGSDNGTVALWGMLKKKPVFVFKNAHQDIPDGITTNGILENGDHEPVNNNCSANSWVNA
VATSRGSDLAASGAGNGFVRLWAVETNAIRPLYELPLTGFVNSLAFAKSGKFLIAGVGQE
TRFGRWGCLKSAQNGVAIHPLRLA*
>AT1G63780.1 | IMP4
MQRRLVRLKKEYIYRKSLEGDERKVYEQKRLIREALQEGKPIPTELRNVEAKLRQEIDLE
DQNTAVPRSHIDDEYANATEADPKILLTTSRNPSAPLIRFTKELKFVFPNSQRINRGSQV
ISEIIETARSHDFTDVILVHEHRGVPDGLIISHLPFGPTAYFGLLNVVTRHDISDKKSIG
KMPEQYPHLIFNNFTTQMGQRVGNILKHIFPAPKLDAKRIVTFSNQSDYISFRNHVYDKG
EGGPKSIELKEIGPRFELRLYQVKLGTVEQNEAEIEWVIRPYMNTSKKRKFIGE*
>AT1G63810.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 21 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Nrap protein (InterProIPR005554) Has 268 Blast hits to 263 proteins in 124 species Archae - 3 Bacteria - 0 Metazoa - 116 Fungi - 89 Plants - 17 Viruses - 0 Other Eukaryotes - 43 (source NCBI BLink)
MEADTKTDSRTLKVNDLLKDARLDYDSLRKLVDDTVSSIKEAIDGIPEKFQVTSELAPSF
VEDIGADKEVEFSFKKPNGFNLCGSYSICGMAKPDTSVDLLVHLPKECFYEKDYMNHRYH
AKRCLYLCVIEKHLLSSSSIEKVVWSTLHNEARKPVLVVFPAKKLDQFPGFSIRLIPSAT
SLFSVAKLSISRNNVRSVTADGVPEPTPTYNSSILEDMFLEENSEFLKKTFSEWKELSDA
LILLKIWARQRSSIYVHDCLNGFLISVILSYLATHSKINKALSALDIFRVTLDFIATSKL
WERGLYLPPQSEIRVSKEEKMQFRELFPVVICDSSTFVNLAFRMTSVGFLELQDEASLTL
KCMEKLRDGGFEEIFMTKIDYPVKYDHCIRLQLKGKTAVSLSGFCLDKECWRLYEQKVHS
LLLEGLGDRAKSIRVVWRNTNQDWHVESGLSVLDREPLFIGISVSSTEKAYRTVDIGPDA
ENKIEALRFRKFWGEKSDLRRFKDGRISESTVWETQQWTKHLIMKQIVEYILKRHLSLTS
DDIVQLVDQLDFSLNYGGKDPISLSGNLVQAYEVLSKCLREIEGIPLKVSSVQSLDSALR
FTSVFPPEPHPVACEKIDSRRLQKLIPSCIPAMEVMIQLEGSGNWPMDDLAVEKTKSAFL
LKIAESLQNVKGIPCTATEDNVDVFIGGYAFRLRILHERGLSLVKREIGVDPVKHVSSTD
KMLFIRSQHASMINGLQGRFPVYAPVARLAKRWVSAHLFSGCLAEEAIELLVAYLFLTPL
PLGVPSSRINGFLRFLRLLADYEWMFYPLIVDINNDFGRNDEKEINDNFMSSRKGYEEDK
QNISSAMFLAAPYDKASEAWTSTSPNLLEQKRLVAYARSSANVLSKMVLQEHNDSVQWEC
LFRTPLNNYDAVILLHRDKLPYPRRLLFPSELNQGKHVARGKASRLFNPFMSPGDLKRSH
EELKNKLMVDFEPTKCLLSGLQEEFGTLKPWYDHIGGDAIGLTWNKHNSKKRERDEEEEE
EEESNPMEMLKAVGEMGKGLVRDIYLLKPPRFV*
>AT3G21540.1 | transducin family protein / WD-40 repeat family protein
MVKAYQRYDAATSFGVISSVDSNIAYDSTGKYVLAPALEKVGIWHVRQGVCSKTLTPSSS
RGGPSLAVTSIASSASSLVAVGYADGSIRIWDTEKGTCEVNFNSHKGAVTALRYNKVGSM
LASGSKDNDIILWDVVGESGLFRLRGHRDQVTDLVFLDGGKKLVSSSKDKFLRVWDLETQ
HCMQIVSGHHSEVWSVDTDPEERYVVTGSADQELRFYAVKEYSSNGSLVSDSNANEIKAS
EEHSTENKWEILKLFGEIQRQTKDRVARVRFNVSGKLLACQMAGKTIEIFRVLDEAEAKQ
KAKRRLRRKEKKSSKVGDENSTANGEASAKIELADAVSSPTVLDVFKLLQVIRAGRKISS
FSFCPTAPKESLGTLALSLNNNSLEFYSLKSSENAKTVTIEHQGHRSDVRSVTLSEDNTL
LMSTSHSEVKIWNPSTGSCLRTIDSGYGLCSLIVPQNKYGIVGTKSGVLEIIDIGSATKV
EEVKAHGGTIWSITPIPNDSGFVTVSADHEVKFWEYQVKQKSGKATKKLTVSNVKSMKMN
DDVLAVAISPDAKHIAVALLDSTVKVFYMDSLKFYLSLYGHKLPVMCIDISSDGELIVTG
SQDKNLKIWGLDFGDCHKSIFAHGDSVMGVKFVRNTHYLFSIGKDRLVKYWDADKFEHLL
TLEGHHAEIWCLAISNRGDFLVTGSHDRSMRRWDRSEEPFFLEEEKEKRLEELFESEIDN
AADDRHGPMEEIPEEGVAALAGKKTIDVLSAADSIIDALEVAEDETKRHAAYEEEKTKGK
VPEYLPNAVMFGLSPSEYVLRAISNVRTNDLEQTLLALPFSDSLKFLCYMKDWSLIPEKV
ELVSRIATIILQTHHNQLVTTPSARPILSVLRDILYSEIKACKDTIGFNLAAMDHVKQMM
DARSDAPFKDAKAKLLEIRSQQAKRMASRGDTKMEKKRKKKQKKLEEGQHGHALF*
>AT3G57000.1 | nucleolar essential protein-related
MVRPYGIKVNKRKEREERYDKEEDEVEEQPKFEQKQKARESSKKAKKESTSRAEEDNDEE
EVTVEATAAAEDIVGGIPIVLNAPNKEKSGIVFVLEKASLEVAKVGKTYQLLNSDDHANF
LKKNNRNPADYRPDITHQALLMILDSPVNKAGRLKAVYVRTEKGVLFEVKPHVRIPRTFK
RFAGIMLQLLQKLSITAVNSREKLLRCVKNPIEEHHLPVNSHRIGFSHSSEKLVNMQKHL
ATVCDDDRDTVFVVGAMAHGKIDCNYIDEFVSVSEYPLSAAYCISRICEALATNWNII*
>AT2G41840.1 | 40S ribosomal protein S2 (RPS2C)
MAERGGERGVERGGERGDFGRGFGGRGGRGDRGGRGRGGRGGRRGGRASEEEKWVPVTKL
GRLVAAGHIKQIEQIYLHSLPVKEYQIIDMLIGPTLKDEVMKIMPVQKQTRAGQRTRFKA
FVVVGDGNGHVGLGVKCSKEVATAIRGAIILAKLSVVPVRRGYWGNKIGKPHTVPCKVTG
KCGSVTVRMVPAPRGSGIVAARVPKKVLQFAGIDDVFTSSRGSTKTLGNFVKATFDCLQK
TYGFLTPEFWKETRFSRSPYQEHTDFLASKALSTSKPDPVVEDQA*
>AT2G36160.1 | 40S ribosomal protein S14 (RPS14A)
MSKRKTKEPKVDVVTLGPSVREGEQVFGVVHIFASFNDTFIHVTDLSGRETLVRITGGMK
VKADRDESSPYAAMLAAQDVAQRCKELGITAMHVKLRATGGNKTKTPGPGAQSALRALAR
SGMKIGRIEDVTPIPTDSTRRKGGRRGRRL*
>AT2G17360.1 | 40S ribosomal protein S4 (RPS4A)
MARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLVLIIRNRLKYALTYRE
VISILMQRHIQVDGKVRTDKTYPAGFMDVVSIPKTNENFRLLYDTKGRFRLHSIKDEEAK
FKLCKVRSIQFGQKGIPYLNTYDGRTIRYPDPLIKPNDTIKLDLEENKIVEFIKFDVGNV
VMVTGGRNRGRVGVIKNREKHKGSFETIHIQDSTGHEFATRLGNVYTIGKGTKPWVSLPK
GKGIKLTIIEEARKRLSAQQA*
>AT2G37270.1 | ATRPS5B (RIBOSOMAL PROTEIN 5B) structural constituent of ribosome
MAASAEIDAEIQQQLTNEVKLFNRWSFDDVSVTDISLVDYIGVQPSKHATFVPHTAGRYS
VKRFRKAQCPIVERLTNSLMMHGRNNGKKLMAVRIVKHAMEIIHLLSDLNPIQVIIDAIV
NSGPREDATRIGSAGVVRRQAVDISPLRRVNQAIFLLTTGAREAAFRNIKTIAECLADEL
INAAKGSSNSYAIKKKDEIERVAKANR*
>AT2G37270.1 | ATRPS5B (RIBOSOMAL PROTEIN 5B) structural constituent of ribosome
MAASAEIDAEIQQQLTNEVKLFNRWSFDDVSVTDISLVDYIGVQPSKHATFVPHTAGRYS
VKRFRKAQCPIVERLTNSLMMHGRNNGKKLMAVRIVKHAMEIIHLLSDLNPIQVIIDAIV
NSGPREDATRIGSAGVVRRQAVDISPLRRVNQAIFLLTTGAREAAFRNIKTIAECLADEL
INAAKGSSNSYAIKKKDEIERVAKANR*
>AT2G37270.2 | ATRPS5B (RIBOSOMAL PROTEIN 5B) structural constituent of ribosome
MAASAEIDAEIQQQLTNEVKLFNRWSFDDVSVTDISLVDYIGVQPSKHATFVPHTAGRYS
VKRFRKAQCPIVERLTNSLMMHGRNNGKKLMAVRIVKHAMEIIHLLSDLNPIQVIIDAIV
NSGPREDATRIGSAGVVRRQAVDISPLRRVNQAIFLLTTGAREAAFRNIKTIAECLADEL
INAAKGSSNSYAIKKKDEIERVAKANR*
>AT2G37270.2 | ATRPS5B (RIBOSOMAL PROTEIN 5B) structural constituent of ribosome
MAASAEIDAEIQQQLTNEVKLFNRWSFDDVSVTDISLVDYIGVQPSKHATFVPHTAGRYS
VKRFRKAQCPIVERLTNSLMMHGRNNGKKLMAVRIVKHAMEIIHLLSDLNPIQVIIDAIV
NSGPREDATRIGSAGVVRRQAVDISPLRRVNQAIFLLTTGAREAAFRNIKTIAECLADEL
INAAKGSSNSYAIKKKDEIERVAKANR*
>AT1G07770.1 | RPS15A (ribosomal protein s15a) structural constituent of ribosome
MVRISVLNDALKSMYNAEKRGKRQVMIRPSSKVIIKFLIVMQKHGYIGEFEYVDDHRSGK
IVVELNGRLNKCGVISPRFDVGVKEIEGWTARLLPSRQFGYIVLTTSAGIMDHEEARRKN
VGGKVLGFFY*
>AT1G07770.1 | RPS15A (ribosomal protein s15a) structural constituent of ribosome
MVRISVLNDALKSMYNAEKRGKRQVMIRPSSKVIIKFLIVMQKHGYIGEFEYVDDHRSGK
IVVELNGRLNKCGVISPRFDVGVKEIEGWTARLLPSRQFGYIVLTTSAGIMDHEEARRKN
VGGKVLGFFY*
>AT1G07770.2 | RPS15A (ribosomal protein s15a) structural constituent of ribosome
MVRISVLNDALKSMYNAEKRGKRQVMIRPSSKVIIKFLIVMQKHGYIGEFEYVDDHRSGK
IVVELNGRLNKCGVISPRFDVGVKEIEGWTARLLPSRQFGYIVLTTSAGIMDHEEARRKN
VGGKVLGFFY*
>AT1G07770.2 | RPS15A (ribosomal protein s15a) structural constituent of ribosome
MVRISVLNDALKSMYNAEKRGKRQVMIRPSSKVIIKFLIVMQKHGYIGEFEYVDDHRSGK
IVVELNGRLNKCGVISPRFDVGVKEIEGWTARLLPSRQFGYIVLTTSAGIMDHEEARRKN
VGGKVLGFFY*
>AT1G48830.1 | 40S ribosomal protein S7 (RPS7A)
MFSAQHKIHKEKGVELSELDEQVAQAFFDLENTNQELKSELKDLYVNSAVQVDISGGRKA
IVVNVPYRLRKAYRKIHVRLVRELEKKFSGKDVILIATRRIVRPPKKGSAAKRPRNRTLT
SVHEAILDDVVLPAEIVGKRTRYRLDGTKIMKVFLDPKERNNTEYKVEAFSAVYKKLTGK
DVVFEFPITEA*
>AT1G48830.1 | 40S ribosomal protein S7 (RPS7A)
MFSAQHKIHKEKGVELSELDEQVAQAFFDLENTNQELKSELKDLYVNSAVQVDISGGRKA
IVVNVPYRLRKAYRKIHVRLVRELEKKFSGKDVILIATRRIVRPPKKGSAAKRPRNRTLT
SVHEAILDDVVLPAEIVGKRTRYRLDGTKIMKVFLDPKERNNTEYKVEAFSAVYKKLTGK
DVVFEFPITEA*
>AT1G48830.2 | 40S ribosomal protein S7 (RPS7A)
MFSAQHKIHKEKGVELSELDEQVAQAFFDLENTNQELKSELKDLYVNSAVQVDISGGRKA
IVVNVPYRLRKAYRKIHVRLVRELEKKFSGKDVILIATRRIVRPPKKGSAAKRPRNRTLT
SVHEAILDDVVLPAEIVGKRTRYRLDGTKIMKVFLDPKERNNTEYKVEAFSAVYKKLTGK
DVVFEFPITEA*
>AT1G48830.2 | 40S ribosomal protein S7 (RPS7A)
MFSAQHKIHKEKGVELSELDEQVAQAFFDLENTNQELKSELKDLYVNSAVQVDISGGRKA
IVVNVPYRLRKAYRKIHVRLVRELEKKFSGKDVILIATRRIVRPPKKGSAAKRPRNRTLT
SVHEAILDDVVLPAEIVGKRTRYRLDGTKIMKVFLDPKERNNTEYKVEAFSAVYKKLTGK
DVVFEFPITEA*
>AT1G58380.1 | XW6 structural constituent of ribosome
MAERGGEGGAERGGDRGDFGRGFGGGRGGGRGRDRGPRGRGRRGGRASEETKWVPVTKLG
RLVADNKITKLEQIYLHSLPVKEYQIIDHLVGPTLKDEVMKIMPVQKQTRAGQRTRFKAF
VVVGDGNGHVGLGVKCSKEVATAIRGAIILAKLSVVPVRRGYWGNKIGKPHTVPCKVTGK
CGSVTVRMVPAPRGSGIVAARVPKKVLQFAGIDDVFTSSRGSTKTLGNFVKATFDCLQKT
YGFLTPEFWKETRFSRSPYQEHTDFLSTKAVSATKVITEGEDQA*
>AT3G06530.1 | binding
MSSSIVSQLQALKSVLQADTEPSKRPFTRPSILFSPKEAADFDIESIYELGLKGLEVLGN
KDERFKNYMNDLFSHKSKEIDRELLGKEENARIDSSISSYLRLLSGYLQFRASLETLEYL
IRRYKIHIYNLEDVVLCALPYHDTHAFVRIVQLLSTGNSKWKFLDGVKNSGAPPPRSVIV
QQCIRDKQVLEALCDYASRTKKYQPSKPVVSFSTAVVVGVLGSVPTVDGDIVKTILPFVD
SGLQSGVKGCLDQQAGALMVVGMLANRAVLNTNLIKRLMRSIIDIGREHAKESSDPHSLR
LSLMALINFVQLQSVDLIPRKALDLFNEIRDISGVLLGLSKEFNIKRFLAVLLDSLLFYS
SSDDKCCEVLASIIETVPVSNLVDHLISKVFSLCMTQYQKNSDFRSSTSGSWAKKFLVVV
SKKYPAELRAAVPKFLEATEVQSKKEDLKLEMLSCMLDGNSDMSHPFVDSKLWFRLHHPR
AAVRCAALSSLNGVLKDDSSKAENLVTIQDAILRQLWDDDLAVVQAALSFDKLPNIITSS
GLLDALLHVVKRCVGILVSGVSHNVQLAVDVVALSLKIAVSSFGNQTDSTEKVTSAMFPF
LLIQPKTWNLNLLVLKLGKDVNWPLFKNLAADDGMKKLPDIMSTNLSSISMDIINDLGEA
LSLDPDERRIELIERACNYKLSEVLETCSNIKCSEQDRNKLQKGLLIRESVSALNIDVIN
KLVEAFMMHPADYIQWLTTEWEELEVEVDVSLKELSKSNCQELLYQLLDTSDFTALNSKV
LICLFWKLGESFIKLEPAHDASVLNKRLSSGLEDLFFFFATTRLRHVFKEHLHFRVREAK
VCPVLFLSRLISREDVPPLVQIESLRCFSYLCSSGNNEWLIQVFSSFPVLLVPMSSDNQD
VKAAAINCIEALFNLRAAIYGSSFDELLGMIVQQRRLILSDNKFFASYLTSLLSSTTNDL
LVPVGLQKRFDQSTKENILSVILLCAEDLPAYGKLRVLSLLKDLGIMLMRDEIVKLLSQL
LDKRSQYYYKLDKTSQPLSDTEVDLLCLLLECSMMRTSSFKGQSLDDHILSALNVDCMAS
ERPAVISPCLTILEKLSNRFYDELQTDVQIRFFHKLVSMFRSSNGSIQNGAKEAVLRLKL
SSSTVVLALDRITQQDTLVIGSLSKKKKQKKNSKSCPEEDINSEEFRSGEKALSFIASLL
DMLLLKKDLTHRESLIRPLFKLLQRSMSKEWVKIAFSIEETSLQPPQDVRETTPTFISSI
QQTLLLILKDIFDSLNMNPLKAEVANEINVKMLVELAHSSNDGVTRNHIFSLFTAIVKFV
PDKVLDHIISILTLVGESTVTQIDSHSKSIFEGFISMVIPFWLSKTKSEEQLLQIFVKVL
PDIVEHRRRSIVAYLLGVIGERNGLPALLVLLFKSLISRKDSAWLGNANVSESFASIVKK
EWEYSFAMEICEQYSSSTWLSSLVILLQTISKDSKQCFLQMRLVLEFVFQKLQDPEFAFA
VSLEPRNNVSVGIQQELQELMKCCICLLQAIDAKKEKDVTSSVRNEIRMRIHDVLMTVTG
AMDLSIYFRVVTSLLQQQTDYNGTKKVLGLISERAKDTSSSKMKHKRKISNQKGRNSWLN
LDEVAVDSFGKMCEEIVHLINATDDESGVPVKRAAISTLEVLAGRFPSGHPIFRKCLAAV
AECISSKNLGVSSSCLRTTGALINVLGPKALIELPCIMKNLVKQSLEVSFASQSGRNATA
EEQLLMLSVLVTLEAVIDKLGGFLNPHLGDIMKIMVLHPEYVSDFDKNLKSKANAIRRLL
TDKIPVRLTLQPLLRIYNEAVSSGNASLVIAFNMLEDLVVKMDRSSIVSSHGKIFDQCLV
ALDIRRLNPAAIQNIDDAERSVTSAMVALTKKLTESEFRPLFIRSIDWAESDVVDGSGSE
NKSIDRAISFYGLVDRLCESHRSIFVPYFKYVLDGIVAHLTTAEASVSTRKKKKAKIQQT
SDSIQPKSWHLRALVLSCLKNCFLHDTGSLKFLDTNNFQVLLKPIVSQLVVEPPSSLKEH
PHVPSVDEVDDLLVSCIGQMAVASGSDLLWKPLNHEVLMQTRSESVRSRMLSLRSVKQML
DNLKEEYLVLLAETIPFLAELLEDVELSVKSLAQDIIKQMEEMSGESLAEYL*
>AT5G52640.1 | ATHSP901 (HEAT SHOCK PROTEIN 901) ATP binding / unfolded protein binding
MADVQMADAETFAFQAEINQLLSLIINTFYSNKEIFLRELISNSSDALDKIRFESLTDKS
KLDGQPELFIRLVPDKSNKTLSIIDSGIGMTKADLVNNLGTIARSGTKEFMEALQAGADV
SMIGQFGVGFYSAYLVAEKVVVTTKHNDDEQYVWESQAGGSFTVTRDVDGEPLGRGTKIT
LFLKDDQLEYLEERRLKDLVKKHSEFISYPIYLWTEKTTEKEISDDEDEDEPKKENEGEV
EEVDEEKEKDGKKKKKIKEVSHEWELINKQKPIWLRKPEEITKEEYAAFYKSLTNDWEDH
LAVKHFSVEGQLEFKAILFVPKRAPFDLFDTRKKLNNIKLYVRRVFIMDNCEELIPEYLS
FVKGVVDSDDLPLNISRETLQQNKILKVIRKNLVKKCIEMFNEIAENKEDYTKFYEAFSK
NLKLGIHEDSQNRGKIADLLRYHSTKSGDEMTSFKDYVTRMKEGQKDIFYITGESKKAVE
NSPFLERLKKRGYEVLYMVDAIDEYAVGQLKEYDGKKLVSATKEGLKLEDETEEEKKKRE
EKKKSFENLCKTIKEILGDKVEKVVVSDRIVDSPCCLVTGEYGWTANMERIMKAQALRDS
SMSGYMSSKKTMEINPDNGIMEELRKRAEADKNDKSVKDLVMLLYETALLTSGFSLDEPN
TFAARIHRMLKLGLSIDEDENVEEDGDMPELEEDAAEESKMEEVD*
>AT5G17310.1 | UTP--glucose-1-phosphate uridylyltransferase putative / UDP-glucose pyrophosphorylase putative / UGPase putative
MRYAVYSFFRFHFLFLFLCRSVIEVRDGLTFLDLIVIQIENLNNKYNCKVPLVLMNSFNT
HDDTQKIVEKYTKSNVDIHTFNQSKYPRVVADEFVPWPSKGKTDKDGWYPPGHGDVFPSL
MNSGKLDAFLSQGKEYVFIANSDNLGAIVDLKILKHLIQNKNEYCMEVTPKTLADVKGGT
LISYEGKVQLLEIAQVPDEHVNEFKSIEKFKIFNTNNLWVNLKAIKKLVEADALKMEIIP
NPKEVDGVKVLQLETAAGAAIRFFDNAIGVNVPRSRFLPVKATSDLLLVQSDLYTLVDGF
VTRNKARTNPTNPAIELGPEFKKVASFLSRFKSIPSIVELDSLKVSGDVWFGSGVVLKGK
VTVKANAGTKLEIPDNAVLENKDINGPEDL*
>AT5G17310.1 | UTP--glucose-1-phosphate uridylyltransferase putative / UDP-glucose pyrophosphorylase putative / UGPase putative
MRYAVYSFFRFHFLFLFLCRSVIEVRDGLTFLDLIVIQIENLNNKYNCKVPLVLMNSFNT
HDDTQKIVEKYTKSNVDIHTFNQSKYPRVVADEFVPWPSKGKTDKDGWYPPGHGDVFPSL
MNSGKLDAFLSQGKEYVFIANSDNLGAIVDLKILKHLIQNKNEYCMEVTPKTLADVKGGT
LISYEGKVQLLEIAQVPDEHVNEFKSIEKFKIFNTNNLWVNLKAIKKLVEADALKMEIIP
NPKEVDGVKVLQLETAAGAAIRFFDNAIGVNVPRSRFLPVKATSDLLLVQSDLYTLVDGF
VTRNKARTNPTNPAIELGPEFKKVASFLSRFKSIPSIVELDSLKVSGDVWFGSGVVLKGK
VTVKANAGTKLEIPDNAVLENKDINGPEDL*
>AT5G17310.2 | UTP--glucose-1-phosphate uridylyltransferase putative / UDP-glucose pyrophosphorylase putative / UGPase putative
MAATATEKLPQLKSAVDGLTEMSENEKSGFINLVSRYLSGEAQHIEWSKIQTPTDEIVVP
YDKMANVSEDASETKYLLDKLVVLKLNGGLGTTMGCTGPKSVIEVRDGLTFLDLIVIQIE
NLNNKYNCKVPLVLMNSFNTHDDTQKIVEKYTKSNVDIHTFNQSKYPRVVADEFVPWPSK
GKTDKDGWYPPGHGDVFPSLMNSGKLDAFLSQGKEYVFIANSDNLGAIVDLKILKHLIQN
KNEYCMEVTPKTLADVKGGTLISYEGKVQLLEIAQVPDEHVNEFKSIEKFKIFNTNNLWV
NLKAIKKLVEADALKMEIIPNPKEVDGVKVLQLETAAGAAIRFFDNAIGVNVPRSRFLPV
KATSDLLLVQSDLYTLVDGFVTRNKARTNPTNPAIELGPEFKKVASFLSRFKSIPSIVEL
DSLKVSGDVWFGSGVVLKGKVTVKANAGTKLEIPDNAVLENKDINGPEDL*
>AT5G17310.2 | UTP--glucose-1-phosphate uridylyltransferase putative / UDP-glucose pyrophosphorylase putative / UGPase putative
MAATATEKLPQLKSAVDGLTEMSENEKSGFINLVSRYLSGEAQHIEWSKIQTPTDEIVVP
YDKMANVSEDASETKYLLDKLVVLKLNGGLGTTMGCTGPKSVIEVRDGLTFLDLIVIQIE
NLNNKYNCKVPLVLMNSFNTHDDTQKIVEKYTKSNVDIHTFNQSKYPRVVADEFVPWPSK
GKTDKDGWYPPGHGDVFPSLMNSGKLDAFLSQGKEYVFIANSDNLGAIVDLKILKHLIQN
KNEYCMEVTPKTLADVKGGTLISYEGKVQLLEIAQVPDEHVNEFKSIEKFKIFNTNNLWV
NLKAIKKLVEADALKMEIIPNPKEVDGVKVLQLETAAGAAIRFFDNAIGVNVPRSRFLPV
KATSDLLLVQSDLYTLVDGFVTRNKARTNPTNPAIELGPEFKKVASFLSRFKSIPSIVEL
DSLKVSGDVWFGSGVVLKGKVTVKANAGTKLEIPDNAVLENKDINGPEDL*
>AT3G03250.1 | UGP (UDP-glucose pyrophosphorylase) UTPglucose-1-phosphate uridylyltransferase/ nucleotidyltransferase
MAATTENLPQLKSAVDGLTEMSESEKSGFISLVSRYLSGEAQHIEWSKIQTPTDEIVVPY
EKMTPVSQDVAETKNLLDKLVVLKLNGGLGTTMGCTGPKSVIEVRDGLTFLDLIVIQIEN
LNNKYGCKVPLVLMNSFNTHDDTHKIVEKYTNSNVDIHTFNQSKYPRVVADEFVPWPSKG
KTDKEGWYPPGHGDVFPALMNSGKLDTFLSQGKEYVFVANSDNLGAIVDLTILKHLIQNK
NEYCMEVTPKTLADVKGGTLISYEGKVQLLEIAQVPDEHVNEFKSIEKFKIFNTNNLWVN
LKAIKKLVEADALKMEIIPNPKEVDGVKVLQLETAAGAAIRFFDNAIGVNVPRSRFLPVK
ASSDLLLVQSDLYTLVDGFVTRNKARTNPSNPSIELGPEFKKVATFLSRFKSIPSIVELD
SLKVSGDVWFGSSIVLKGKVTVAAKSGVKLEIPDRAVVENKNINGPEDL*
>AT4G30990.1 | binding
MATSADARAVKSLNTSEGRKRFVFKSASQRTNDIDNISNYRNLDKVKAEPSEGSTFFRDC
LIEWRELNTAEDFILFYEEMLPSVQSLSLIIMQKERIFSNLVSRLQMKARLSLEPILRLI
AALSRDLLNDFIPFLPQIVNSFVTLLNNGAHNDPEIIQQVFTSWASIIVSLQKYLVCDIE
GILRDTLELRYYPKDNISEFMSESMSFLLRKAQDEQLEKGMKMILSEVAHPSKKAGGVGV
LYNVMRGTYGRLHSKAGRVLSFLLKDSTLSFLDNFPQGPCTVVEVVSLVLQRICEDLEAE
KLSAMWEYLYKKINKSISNKKSVHLSRLLSVLMAVVKIKEGRKVHDIPSLIGIVSRIVST
FFTSSETAVEGDNLSAVLDEVLELILCTINTVNEMETVASLWAPIFALKSSSLLTFLREF
LKKDQSVVKAFTKNILCAINNMIWESSEEVIPLLLTLCEEHKTQQTSHDVVNSISQTFES
RYERIHEFLEAKIKKVQQNIENAGLAQINEAELAAIWGVVKCYPYFKVDSSLLICFKKTL
RQHLAVSDVDTCSGPELMWQSLLGTTLRSCYKMTGINHSDLEEALSFAKDYKSCEQVLSP
VADVLEFMHRPALAHGRSKPYPELQANKAGDAFEIFSENLRHPNKNIRLMTLRILCHFET
LSSDPSFEEHPPKKKMKTEKNVLQLLLLFEETAPTVDTSRMLAGYISTIQDNLSAGRIHS
AYVKLVLNGMLGILHISYRPLCVQASECLAVLVRKYTGAVWSDFVCYLGQCQLKFETLHD
HSENANQSMSERHAYLNGRFNLFLFPPSAITPTATVSDVVSQLLQTLQKASSVAQSRASE
ILPLLLKFLGYNSENPGSVGSYNGRVCKGEDWKTVLVQWLTLLKLMKNPRFLDDNDAEIQ
TNVLECLLLANDFLLPHRQHLLNLIKPKELREELTTWNLSENIGEPHRSYIFSLVIRILM
PKVRTLKNSASRKHTSIRHRKAVLCFISQLDVNELALFFALLIKPLNIISEETMDSFWSS
GKSSLDYFQNSNFLKYFTVDTISTLSRNQKFGFLHVIQHILEVFDELRVRPFLDFMMGCV
VRLLVNYAPNVDEEMNIDSLALRNVTAAPSTSDDKENASINHDQAGTAFKQFKELRSLCL
KIIAHVLDKYEDCDLGSEFWDLFFSAVSPLIKSFKQEGSSSEKPSSLFSCFLSMSKSRNL
VNLLCREESLVPDIFSILTVTTASEAIKSSALKFIENLLCLDNVLGEDENMIRGFVDPYI
EALINSLHSLFIGDILKRKSVKYHGEREIKILKLLSKRMQDRSHVMKYLDVLLSFLNKSV
KDPDIRREALLAIQDIIAYLGMESTSKIINTVSPLLVDAELDVRLCICDLLESLAKIDFS
LDDVAKRVRDMNAISAMEVDDLDYEKIVNAYVEINADFFIKSSEQHTMIILSQSSILCRE
APAHSEFGKEVKNADEWILLIREMVTKLPDAANLSAFRPLCSEDENVDFFKAIVHIQAHR
RARAISRFSSVVKDSSLPEGVVRKLLVSVFFNMLLEGQDGKDNNVRNACTEALASISAHM
SWTSYYALLNRCFREMNKHTKKGKILLRLICLILDKFHFAKDGYPHEAEEIRTCLQKIVF
PRMQKLMNSDSDNVNVNSSVAALKVLKLLPEDVLDSNLSSIVHKIASFLKNRLESTRDEA
RLALVACLKELGLEYLQVVVNILRAILKRGSEVHVLGYTLNSILSKCLSNPTCGKLDHCL
VDLLAVVETDILGEVAEQKEVEKFASKMKETRKRKSFETLKLIAENVTFRSHGLKLLSPV
TAQLQRHLTPKIKTNLEKMLKQIAAGIEGNTSVDQGDLFLFIYGLVDDGINNRSGLGDQV
SLPPSKKKKKSRDLKETSGLCFGPKSCPHLITVFALDLFYNRMKKLRLDNTDEELLSKCF
TSLVKFPLPSLTSEADELKTALLTIAQSAVSSSSPLVQSCLKLLTTLLKNINITLSSEQL
KMLIQFPIFIDLESDSSFVTLSLLKAIMNRKLVVPEIYDIAIQVSKLMVNSQLESIRKKC
KHILLQFMVHYTLSEKRLEQHVNFLLENLRYEFPTGREAVLDMLHALILKFSEPNLGKQS
VLDQQSQKLFIQLTVCLSNETDRKVLPLVGAVIEVLIGRMSKDQVDSSLLYCLCWYKQQN
LSAAAAQVLGFFISAMKKTFRKHIYNTVEDARTILESAISASSLQLQDTVEEASLPFWKE
AYYSLVMIEKMLEQFPDLRFGKDLEDIWKMVFKFLLHPHAWLRNKSCRLLNLYFEALAGR
KRPECRTLVADSLLEKPSSLFMVAVSLCFQLKEQPTTGNIDVDLLTANIVFAVSSLHSLI
GQFDQATHNRFWSSLGEDEQVVFLKAFEVLDAGKGRSTFLALTSGKRTENGDDDVRNVMI
GSLLKRMGKIALDMESVQMRVMFNVYKSFASQLNQEECRLYAYKILLPLYKVCEGYTGKI
VTDELKQLAEEVRDSIRDKSLGNKMFVEVYSEIRNSLRTKRDKRKREEKLMAVVNPERNA
KRKLRLASKNKANKKRRMTSMKLSRWACS*
>AT4G30990.1 | binding
MATSADARAVKSLNTSEGRKRFVFKSASQRTNDIDNISNYRNLDKVKAEPSEGSTFFRDC
LIEWRELNTAEDFILFYEEMLPSVQSLSLIIMQKERIFSNLVSRLQMKARLSLEPILRLI
AALSRDLLNDFIPFLPQIVNSFVTLLNNGAHNDPEIIQQVFTSWASIIVSLQKYLVCDIE
GILRDTLELRYYPKDNISEFMSESMSFLLRKAQDEQLEKGMKMILSEVAHPSKKAGGVGV
LYNVMRGTYGRLHSKAGRVLSFLLKDSTLSFLDNFPQGPCTVVEVVSLVLQRICEDLEAE
KLSAMWEYLYKKINKSISNKKSVHLSRLLSVLMAVVKIKEGRKVHDIPSLIGIVSRIVST
FFTSSETAVEGDNLSAVLDEVLELILCTINTVNEMETVASLWAPIFALKSSSLLTFLREF
LKKDQSVVKAFTKNILCAINNMIWESSEEVIPLLLTLCEEHKTQQTSHDVVNSISQTFES
RYERIHEFLEAKIKKVQQNIENAGLAQINEAELAAIWGVVKCYPYFKVDSSLLICFKKTL
RQHLAVSDVDTCSGPELMWQSLLGTTLRSCYKMTGINHSDLEEALSFAKDYKSCEQVLSP
VADVLEFMHRPALAHGRSKPYPELQANKAGDAFEIFSENLRHPNKNIRLMTLRILCHFET
LSSDPSFEEHPPKKKMKTEKNVLQLLLLFEETAPTVDTSRMLAGYISTIQDNLSAGRIHS
AYVKLVLNGMLGILHISYRPLCVQASECLAVLVRKYTGAVWSDFVCYLGQCQLKFETLHD
HSENANQSMSERHAYLNGRFNLFLFPPSAITPTATVSDVVSQLLQTLQKASSVAQSRASE
ILPLLLKFLGYNSENPGSVGSYNGRVCKGEDWKTVLVQWLTLLKLMKNPRFLDDNDAEIQ
TNVLECLLLANDFLLPHRQHLLNLIKPKELREELTTWNLSENIGEPHRSYIFSLVIRILM
PKVRTLKNSASRKHTSIRHRKAVLCFISQLDVNELALFFALLIKPLNIISEETMDSFWSS
GKSSLDYFQNSNFLKYFTVDTISTLSRNQKFGFLHVIQHILEVFDELRVRPFLDFMMGCV
VRLLVNYAPNVDEEMNIDSLALRNVTAAPSTSDDKENASINHDQAGTAFKQFKELRSLCL
KIIAHVLDKYEDCDLGSEFWDLFFSAVSPLIKSFKQEGSSSEKPSSLFSCFLSMSKSRNL
VNLLCREESLVPDIFSILTVTTASEAIKSSALKFIENLLCLDNVLGEDENMIRGFVDPYI
EALINSLHSLFIGDILKRKSVKYHGEREIKILKLLSKRMQDRSHVMKYLDVLLSFLNKSV
KDPDIRREALLAIQDIIAYLGMESTSKIINTVSPLLVDAELDVRLCICDLLESLAKIDFS
LDDVAKRVRDMNAISAMEVDDLDYEKIVNAYVEINADFFIKSSEQHTMIILSQSSILCRE
APAHSEFGKEVKNADEWILLIREMVTKLPDAANLSAFRPLCSEDENVDFFKAIVHIQAHR
RARAISRFSSVVKDSSLPEGVVRKLLVSVFFNMLLEGQDGKDNNVRNACTEALASISAHM
SWTSYYALLNRCFREMNKHTKKGKILLRLICLILDKFHFAKDGYPHEAEEIRTCLQKIVF
PRMQKLMNSDSDNVNVNSSVAALKVLKLLPEDVLDSNLSSIVHKIASFLKNRLESTRDEA
RLALVACLKELGLEYLQVVVNILRAILKRGSEVHVLGYTLNSILSKCLSNPTCGKLDHCL
VDLLAVVETDILGEVAEQKEVEKFASKMKETRKRKSFETLKLIAENVTFRSHGLKLLSPV
TAQLQRHLTPKIKTNLEKMLKQIAAGIEGNTSVDQGDLFLFIYGLVDDGINNRSGLGDQV
SLPPSKKKKKSRDLKETSGLCFGPKSCPHLITVFALDLFYNRMKKLRLDNTDEELLSKCF
TSLVKFPLPSLTSEADELKTALLTIAQSAVSSSSPLVQSCLKLLTTLLKNINITLSSEQL
KMLIQFPIFIDLESDSSFVTLSLLKAIMNRKLVVPEIYDIAIQVSKLMVNSQLESIRKKC
KHILLQFMVHYTLSEKRLEQHVNFLLENLRYEFPTGREAVLDMLHALILKFSEPNLGKQS
VLDQQSQKLFIQLTVCLSNETDRKVLPLVGAVIEVLIGRMSKDQVDSSLLYCLCWYKQQN
LSAAAAQVLGFFISAMKKTFRKHIYNTVEDARTILESAISASSLQLQDTVEEASLPFWKE
AYYSLVMIEKMLEQFPDLRFGKDLEDIWKMVFKFLLHPHAWLRNKSCRLLNLYFEALAGR
KRPECRTLVADSLLEKPSSLFMVAVSLCFQLKEQPTTGNIDVDLLTANIVFAVSSLHSLI
GQFDQATHNRFWSSLGEDEQVVFLKAFEVLDAGKGRSTFLALTSGKRTENGDDDVRNVMI
GSLLKRMGKIALDMESVQMRVMFNVYKSFASQLNQEECRLYAYKILLPLYKVCEGYTGKI
VTDELKQLAEEVRDSIRDKSLGNKMFVEVYSEIRNSLRTKRDKRKREEKLMAVVNPERNA
KRKLRLASKNKANKKRRMTSMKLSRWACS*
>AT4G30990.2 | binding
MATSADARAVKSLNTSEGRKRFVFKSASQRTNDIDNISNYRNLDKVKAEPSEGSTFFRDC
LIEWRELNTAEDFILFYEEMLPSVQSLSLIIMQKERIFSNLVSRLQMKARLSLEPILRLI
AALSRDLLNDFIPFLPQIVNSFVTLLNNGAHNDPEIIQQVFTSWASIIVSLQKYLVCDIE
GILRDTLELRYYPKDNISEFMSESMSFLLRKAQDEQLEKGMKMILSEVAHPSKKAGGVGV
LYNVMRGTYGRLHSKAGRVLSFLLKDSTLSFLDNFPQGPCTVVEVVSLVLQRICEDLEAE
KLSAMWEYLYKKINKSISNKKSVHLSRLLSVLMAVVKIKEGRKVHDIPSLIGIVSRIVST
FFTSSETAVEGDNLSAVLDEVLELILCTINTVNEMETVASLWAPIFALKSSSLLTFLREF
LKKDQSVVKAFTKNILCAINNMIWESSEEVIPLLLTLCEEHKTQQTSHDVVNSISQTFES
RYERIHEFLEAKIKKVQQNIENAGLAQINEAELAAIWGVVKCYPYFKVDSSLLICFKKTL
RQHLAVSDVDTCSGPELMWQSLLGTTLRSCYKMTGINHSDLEEALSFAKDYKSCEQVLSP
VADVLEFMHRPALAHGRSKPYPELQANKAGDAFEIFSENLRHPNKNIRLMTLRILCHFET
LSSDPSFEEHPPKKKMKTEKNVLQLLLLFEETAPTVDTSRMLAGYISTIQDNLSAGRIHS
AYVKLVLNGMLGILHISYRPLCVQASECLAVLVRKYTGAVWSDFVCYLGQCQLKFETLHD
HSENANQSMSERHAYLNGRFNLFLFPPSAITPTATVSDVVSQLLQTLQKASSVAQSRASE
ILPLLLKFLGYNSENPGSVGSYNGRVCKGEDWKTVLVQWLTLLKLMKNPRFLDDNDAEIQ
TNVLECLLLANDFLLPHRQHLLNLIKPKELREELTTWNLSENIGEPHRSYIFSLVIRILM
PKVRTLKNSASRKHTSIRHRKAVLCFISQLDVNELALFFALLIKPLNIISEETMDSFWSS
GKSSLDYFQNSNFLKYFTVDTISTLSRNQKFGFLHVIQHILEVFDELRVRPFLDFMMGCV
VRLLVNYAPNVDEEMNIDSLALRNVTAAPSTSDDKENASINHDQAGTAFKQFKELRSLCL
KIIAHVLDKYEDCDLGSEFWDLFFSAVSPLIKSFKQEGSSSEKPSSLFSCFLSMSKSRNL
VNLLCREESLVPDIFSILTVTTASEAIKSSALKFIENLLCLDNVLGEDENMIRGFVDPYI
EALINSLHSLFIGDILKRKSVKYHGEREIKILKLLSKRMQDRSHVMKYLDVLLSFLNKSV
KDPDIRREALLAIQDIIAYLGMESTSKIINTVSPLLVDAELDVRLCICDLLESLAKIDFS
LDDVAKRVRDMNAISAMEVDDLDYEKIVNAYVEINADFFIKSSEQHTMIILSQSSILCRE
APAHSEFGKEVKNADEWILLIREMVTKLPDAANLSAFRPLCSEDENVDFFKAIVHIQAHR
RARAISRFSSVVKDSSLPEGVVRKLLVSVFFNMLLEGQDGKDNNVRNACTEALASISAHM
SWTSYYALLNRCFREMNKHTKKGKILLRLICLILDKFHFAKDGYPHEAEEIRTCLQKIVF
PRMQKLMNSDSDNVNVNSSVAALKVLKLLPEDVLDSNLSSIVHKIASFLKNRLESTRDEA
RLALVACLKELGLEYLQVVVNILRAILKRGSEVHVLGYTLNSILSKCLSNPTCGKLDHCL
VDLLAVVETDILGEVAEQKEVEKFASKMKETRKRKSFETLKLIAENVTFRSHGLKLLSPV
TAQLQRHLTPKIKTNLEKMLKQIAAGIEGNTSVDQGDLFLFIYGLVDDGINNRSGLGDQV
SLPPSKKKKKSRDLKETSGLCFGPKSCPHLITVFALDLFYNRMKKLRLDNTDEELLSKCF
TSLVKFPLPSLTSEADELKTALLTIAQSAVSSSSPLVQSCLKLLTTLLKNINITLSSEQL
KMLIQFPIFIDLESDSSFVTLSLLKAIMNRKLVVPEIYDIAIQVSKLMVNSQLESIRKKC
KHILLQFMVHYTLSEKRLEQHVNFLLENLRYEFPTGREAVLDMLHALILKFSEPNLGKQS
VLDQQSQKLFIQLTVCLSNETDRKVLPLVGAVIEVLIGRMSKDQVDSSLLYCLCWYKQQN
LSAAAAQVLGFFISAMKKTFRKHIYNTVEDARTILESAISASSLQLQDTVEEASLPFWKE
AYYSLVMIEKMLEQFPDLRFGKDLERSPARKITALALSASITTLDLDIWKMVFKFLLHPH
AWLRNKSCRLLNLYFEALAGRKRPECRTLVADSLLEKPSSLFMVAVSLCFQLKEQPTTGN
IDVDLLTANIVFAVSSLHSLIGQFDQATHNRFWSSLGEDEQVVFLKAFEVLDAGKGRSTF
LALTSGKRTENGDDDVRNVMIGSLLKRMGKIALDMESVQMRVMFNVYKSFASQLNQEECR
LYAYKILLPLYKVCEGYTGKIVTDELKQLAEEVRDSIRDKSLGNKMFVEVYSEIRNSLRT
KRDKRKREEKLMAVVNPERNAKRKLRLASKNKANKKRRMTSMKLSRWACS*
>AT4G30990.2 | binding
MATSADARAVKSLNTSEGRKRFVFKSASQRTNDIDNISNYRNLDKVKAEPSEGSTFFRDC
LIEWRELNTAEDFILFYEEMLPSVQSLSLIIMQKERIFSNLVSRLQMKARLSLEPILRLI
AALSRDLLNDFIPFLPQIVNSFVTLLNNGAHNDPEIIQQVFTSWASIIVSLQKYLVCDIE
GILRDTLELRYYPKDNISEFMSESMSFLLRKAQDEQLEKGMKMILSEVAHPSKKAGGVGV
LYNVMRGTYGRLHSKAGRVLSFLLKDSTLSFLDNFPQGPCTVVEVVSLVLQRICEDLEAE
KLSAMWEYLYKKINKSISNKKSVHLSRLLSVLMAVVKIKEGRKVHDIPSLIGIVSRIVST
FFTSSETAVEGDNLSAVLDEVLELILCTINTVNEMETVASLWAPIFALKSSSLLTFLREF
LKKDQSVVKAFTKNILCAINNMIWESSEEVIPLLLTLCEEHKTQQTSHDVVNSISQTFES
RYERIHEFLEAKIKKVQQNIENAGLAQINEAELAAIWGVVKCYPYFKVDSSLLICFKKTL
RQHLAVSDVDTCSGPELMWQSLLGTTLRSCYKMTGINHSDLEEALSFAKDYKSCEQVLSP
VADVLEFMHRPALAHGRSKPYPELQANKAGDAFEIFSENLRHPNKNIRLMTLRILCHFET
LSSDPSFEEHPPKKKMKTEKNVLQLLLLFEETAPTVDTSRMLAGYISTIQDNLSAGRIHS
AYVKLVLNGMLGILHISYRPLCVQASECLAVLVRKYTGAVWSDFVCYLGQCQLKFETLHD
HSENANQSMSERHAYLNGRFNLFLFPPSAITPTATVSDVVSQLLQTLQKASSVAQSRASE
ILPLLLKFLGYNSENPGSVGSYNGRVCKGEDWKTVLVQWLTLLKLMKNPRFLDDNDAEIQ
TNVLECLLLANDFLLPHRQHLLNLIKPKELREELTTWNLSENIGEPHRSYIFSLVIRILM
PKVRTLKNSASRKHTSIRHRKAVLCFISQLDVNELALFFALLIKPLNIISEETMDSFWSS
GKSSLDYFQNSNFLKYFTVDTISTLSRNQKFGFLHVIQHILEVFDELRVRPFLDFMMGCV
VRLLVNYAPNVDEEMNIDSLALRNVTAAPSTSDDKENASINHDQAGTAFKQFKELRSLCL
KIIAHVLDKYEDCDLGSEFWDLFFSAVSPLIKSFKQEGSSSEKPSSLFSCFLSMSKSRNL
VNLLCREESLVPDIFSILTVTTASEAIKSSALKFIENLLCLDNVLGEDENMIRGFVDPYI
EALINSLHSLFIGDILKRKSVKYHGEREIKILKLLSKRMQDRSHVMKYLDVLLSFLNKSV
KDPDIRREALLAIQDIIAYLGMESTSKIINTVSPLLVDAELDVRLCICDLLESLAKIDFS
LDDVAKRVRDMNAISAMEVDDLDYEKIVNAYVEINADFFIKSSEQHTMIILSQSSILCRE
APAHSEFGKEVKNADEWILLIREMVTKLPDAANLSAFRPLCSEDENVDFFKAIVHIQAHR
RARAISRFSSVVKDSSLPEGVVRKLLVSVFFNMLLEGQDGKDNNVRNACTEALASISAHM
SWTSYYALLNRCFREMNKHTKKGKILLRLICLILDKFHFAKDGYPHEAEEIRTCLQKIVF
PRMQKLMNSDSDNVNVNSSVAALKVLKLLPEDVLDSNLSSIVHKIASFLKNRLESTRDEA
RLALVACLKELGLEYLQVVVNILRAILKRGSEVHVLGYTLNSILSKCLSNPTCGKLDHCL
VDLLAVVETDILGEVAEQKEVEKFASKMKETRKRKSFETLKLIAENVTFRSHGLKLLSPV
TAQLQRHLTPKIKTNLEKMLKQIAAGIEGNTSVDQGDLFLFIYGLVDDGINNRSGLGDQV
SLPPSKKKKKSRDLKETSGLCFGPKSCPHLITVFALDLFYNRMKKLRLDNTDEELLSKCF
TSLVKFPLPSLTSEADELKTALLTIAQSAVSSSSPLVQSCLKLLTTLLKNINITLSSEQL
KMLIQFPIFIDLESDSSFVTLSLLKAIMNRKLVVPEIYDIAIQVSKLMVNSQLESIRKKC
KHILLQFMVHYTLSEKRLEQHVNFLLENLRYEFPTGREAVLDMLHALILKFSEPNLGKQS
VLDQQSQKLFIQLTVCLSNETDRKVLPLVGAVIEVLIGRMSKDQVDSSLLYCLCWYKQQN
LSAAAAQVLGFFISAMKKTFRKHIYNTVEDARTILESAISASSLQLQDTVEEASLPFWKE
AYYSLVMIEKMLEQFPDLRFGKDLERSPARKITALALSASITTLDLDIWKMVFKFLLHPH
AWLRNKSCRLLNLYFEALAGRKRPECRTLVADSLLEKPSSLFMVAVSLCFQLKEQPTTGN
IDVDLLTANIVFAVSSLHSLIGQFDQATHNRFWSSLGEDEQVVFLKAFEVLDAGKGRSTF
LALTSGKRTENGDDDVRNVMIGSLLKRMGKIALDMESVQMRVMFNVYKSFASQLNQEECR
LYAYKILLPLYKVCEGYTGKIVTDELKQLAEEVRDSIRDKSLGNKMFVEVYSEIRNSLRT
KRDKRKREEKLMAVVNPERNAKRKLRLASKNKANKKRRMTSMKLSRWACS*
>AT2G47990.1 | SWA1 (SLOW WALKER1) nucleotide binding
MEEELRVRLNDHQVSKVFPVKPKSTAKPVSESETPESRYWSSFKNHSTPNLVSSVAALAF
SPVHPHSLAVAHSATVSLFSSQSLSSSRRFSFRDVVSSVCFRSDGALFAACDLSGVVQVF
DIKERMALRTLRSHSAPARFVKYPVQDKLHLVSGGDDGVVKYWDVAGATVISDLLGHKDY
VRCGDCSPVNDSMLVTGSYDHTVKVWDARVHTSNWIAEINHGLPVEDVVYLPSGGLIATA
GGNSVKVWDLIGGGKMVCSMESHNKTVTSLRVARMESAESRLVSVALDGYMKVFDYGRAK
VTYSMRFPAPLMSLGLSPDGSTRVIGGSNGMVFAGKKKVRDVVGGQKKSLNLWSLISDVD
ESRRRALRPTYFRYFQRGQSEKPSKDDYLVKEKKGLKLTRHDKLLKKFRHKEALVSVLEE
KKPANVVAVMEELVARRKLMKCVSNMEEGELGMLLGFLQRYCTVQRYSGLLMGLTKKVLE
TRAEDIKGKNEFKGLLRNLKREVNQEIRIQQSLLEIQGVIAPLMRIAGRS*
>AT3G60360.1 | EDA14 (EMBRYO SAC DEVELOPMENT ARREST 14)
MSSLRNAIPRPAHKERSQPEARKRFGILEKHKDYIIRANAYHKKQETLKILRQKAAFKNP
DEFNFKMINSKTVDGRHRPKDEVNKYSAEELMIMKTQDIGYVFQKWQSEKNKIDKLTASL
QCTGDQSSRRHVYYAEDREEARELEVQGRSKSDISTVEIPKDIKKKMDRSYRDLEGRKSR
AKDLEKLYTDMSMQKELQKKGRKRKLRDDELLNPNGKAVYKWRADRKR*
>AT5G59240.1 | 40S ribosomal protein S8 (RPS8B)
MGISRDSIHKRRATGGKQKMWRKKRKYELGRQPANTKLSSNKTVRRIRVRGGNVKWRALR
LDTGNFSWGSEAVTRKTRILDVAYNASNNELVRTQTLVKSAIVQVDAAPFKQGYLQHYGV
DIGRKKKGEAVTTEEVKKSNHVQRKLEMRQEGRALDSHLEEQFSSGRLLACIASRPGQCG
RADGYILEGKELEFYMKKLQKKKGKNAGAA*
>AT5G22100.1 | RNA cyclase family protein
MVMMKKMKGSQSFRQRLLLSTLSSTPISIDEIRADETIPGLRPHEVNLLRLLEIVTDDAV
VDINETGTRLKYKPGTIVGGKNLVHSCSLSRSIGYYLEPLLLLGLFGKKPLSIRLKGITN
DPRDASVDTFRSTTLNIIKRFGVPAEDLELKIEARGVAPNGGGEVLLTVPNIKTLSAVHW
VEEGMVKKIRGTTFSTRVTSDFEHSMRFAARGIFNNLLPDVHIFQDHRAGAQAGKSPGYG
ISLAAETTTGCFISADTTVSCERPDETGELDVEKKERSPAEDTGVEVASWLLQEIEKGGV
VDSTHQGLLFLLCALSEQDVSKVRVGTLSPYAVETLRNIKEFLGVKFAIKPDPLTGTVIL
KCTGSGLINLSRKLS*
>AT1G10490.1 | unknown protein
MRKKVDERIRTLIENGVKLRHRSMFVIIGDKARDQIVNLHHILSKSVVKSNPSVLWCYKN
RLDISSHNKKRAKQLKKMKERGQLDPEKLDAFSLFLDVVDVTHCLYKDSERILGNTFGIC
ILQDFEALTPNLLARTIETVEGGGLVVLLLQSLASLTSLCTMVMDVHDRFRTESHSEASG
RFNERFLLSLASCKACVVMDDELNLLPLSSHIKSITKVPTKEDSEALSEAERDLKSLKDA
LNDDFPVGPLINKCCTLDQGKAVVTFFDAILDKTLRSIVALIASRGRGKSAALGLAVAGA
VAAGYSNIYVTAPSPDNLKTVFEFVCKGFDALEYKEHLEYDVVRSVNPEFNKAIVRINIF
KQHRQTIQYIQPHEHEKLSQVELLVIDEAAAIPLPVVKSLLGPYLVFLSSTVSGYEGTGR
SLSLKLLQQLEEQSRAPVTGVEGSLSGCLFKKIELSESIRYASGDPIESWLNGLLCLDVA
NCLPNPACHPLPSQCDLYYVNRDTLFSYHKDSELFLQRMMALCVSSHYKNSPNDLQLLSD
APAHHLFVLLGPVDESKNQLPDILCVIQVCLEGQISRKSAEKSLREGHSPHGDQIPWKFC
EQFRDVVFPKLSGARIVRIAVHPNAMKMGYGSAAVELLTRYFEGQLASISEGDDELEVEP
SPVRVTEAAAKVSLLEEQIKPRANLPPLLVPLRDRRPERLHYIGVSFGLTLDLFRFWRKH
KFAPFYISQIPSAVTGEHTCMLLKPLTLSNDEFEVDESDELGFFAPFYKDFRIRFSKLLS
DKFKKMDYKLAMSVLNPKINFPEVDLTGNSPDGFLKKLDGVLSPYDMERFRAYTANLVDF
NLVYDICKTLAHHYFQEKLPVSLSYVQASVLLCLGLQESDFSSIERQMQLERGQIYSLLL
KVGKKLYKYLNGIATKELESTLPRLKDRVLEPHKVSVDEDLREGAKEVEEQMRARIEELL
DPELLDQFAIGDKEAEALQKSKISSSGLISIESTKTDNKKEKPSGFDKSAKKRGNDKHSS
TSNKKRRA*
>AT5G15750.1 | RNA-binding S4 domain-containing protein
MRKLKYHEKKLIKKVNFLEWKREGNHRENEITYRYHMGSRDDYKKYSGLCRMVQKLTNIM
KQMDPADPFRIQMTDMLLEKLYNMGVIPTRKSLTLTERLSVSSFCRRRLSTVLVHLKFAE
HHKEAVTYIEQGHVRVGPETITDPAFLVTRNMEDFITWVDSSKIKRKVLEYNDTLDDYDM
LA*
>AT1G06720.1 | INVOLVED IN ribosome biogenesis LOCATED IN nucleus EXPRESSED IN 21 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s AARP2CN (InterProIPR012948) Protein of unknown function DUF663 (InterProIPR007034) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G424401) Has 7944 Blast hits to 5342 proteins in 373 species Archae - 33 Bacteria - 667 Metazoa - 2609 Fungi - 1045 Plants - 414 Viruses - 80 Other Eukaryotes - 3096 (source NCBI BLink)
MAADELMPSHRSHRTPKSGPTARKKSELDKKKRGISVDKQKNLKAFGVKSVVHAKKAKHH
AAEKEQKRLHLPKIDRNYGEAPPFVVVVQGPPGVGKSLVIKSLVKEFTKQNVPEVRGPIT
IVQGKQRRFQFVECPNDINAMVDCAKVADLALLVVDGSYGFEMETFEFLNIMQVHGFPRV
MGVLTHLDKFNDVKKLRKTKHHLKHRFWTEIYHGAKLFYLSGLIHGKYTPREVHNLARFV
IVIKPQPLTWRTAHPYVLVDRLEDVTPPEKVQMDKKCDRNITVFGYLRGCNFKKRMKVHI
AGVGDFIVAGVTALTDPCPLPSAGKKKGLRDRDKLFYAPMSGIGDLVYDKDAVYININSH
QVQYSKTDDGKGEPTNKGKGRDVGEDLVKSLQNTKYSVDEKLDKTFINFFGKKTSASSET
KLKAEDAYHSLPEGSDSESQSGDDEEDIVGNESEMKQETEIHGGRLRRKAIFKTDLNEDD
FEEADDLELDSYDPDTYDFEEADDAESDDNEVEDGGDDSASDSADGEPGDYQIDDKDSGN
ISQWKAPLKEIARKKNPNLMQIVYGASSLATPLINENHDISDDDESDDEDFFKPKGEQHK
NLGGGLDVGYVNSEDCSKFVNYGYLKNWKEKEVCESIRDRFTTGDWSKAALRDKNLGTGG
EGEDDELYGDFEDLETGEKHKSHENLESGANENEDEDAEVVERDGNNPRSQADEPGYADK
LKEAQEITKQRNELEYNDLDEETRIELAGFRTGTYLRLEIHNVPYEMVEFFDPCHPILVG
GIGFGEDNVGYMQARLKKHRWHKKVLKTRDPIIVSIGWRRYQTIPVFAIEDRNGRHRMLK
YTPEHMHCLASFWGPLVPPNTGFVAFQNLSNNQAGFRITATSVVLEFNHQARIVKKIKLV
GTPCKIKKKTAFIKDMFTSDLEIARFEGSSVRTVSGIRGQVKKAGKNMLDNKAEEGIARC
TFEDQIHMSDMVFLRAWTTVEVPQFYNPLTTALQPRDKTWNGMKTFGELRRELNIPIPVN
KDSLYKAIERKQKKFNPLQIPKRLEKDLPFMSKPKNIPKRKRPSLEDKRAVIMEPKERKE
HTIIQQFQLLQHHTMKKKKATDQKKRKEYEAEKAKNEEINKKRRREERRDRYREEDKQKK
KTRRSLD*
>AT2G17250.1 | EMB2762 (EMBRYO DEFECTIVE 2762)
MASILSKKQKKNEKYTLKELKSLGHDLLTSRSHINNLPLLLTFVSPESPPQFVVESLLSL
QSFFTPLLSQLPPTSSSPSSTKTEDPEVVFKAWLRSKFDEFVKLLLDVLVSQQSEDSLRG
IVLGTLMEFVKLLNAGRFHSSIYHRLLDAIIHSEVDIEIFLDILTSKYFKYIDVRYFTYI
SMEKFVKTLEASVSADRTVIENNEAESDSKESLELSVRKIYQVLSQIPPPEKQAEKSQHE
MWSGSDESISEKPTDKKKKTEKGDSTLLSPATISKRMKLKFTKAWISFLRLPLPIDVYKE
VLASIHLTVIPHLSNPTMLCDFLTKSYDIGGVVSVMALSSLFILMTQHGLEYPFFYEKLY
ALLVPSVFVAKHRAKFLQLLDACLKSSMLPAYLAASFTKKLSRLSLSIPPAGSLVITALI
YNLLRRNPTINHLVQEIVENADEANTEAGEHNESQPKTIKKRKLGIDYFNNQESDPKKSG
ALKSSLWEIDTLRHHYCPPVSRFISSLETNLTIRSKTTEMKIEDFCSGSYATIFGDEIRR
RVKQVPLAFYKTVPTSLFADSDFPGWTFTIPQEEGTC*
>AT2G34357.1 | binding
MELLCDDIGTSMCLTPSEPDLPVSEDFGEYMRSRLSQSKRPDHEHLCAVIEELSKTLAED
NHRRTPVAYFACTCRSLDSLFSAHAEPPVDVVQPHIVILSLVFPKVSAGVLKRDGLALRL
VLNVLRLKSATPECLISGLKCLVHLLTTVESIMVNEGSDSYNILLNFVTHSDGKVRKLAS
SCLRDVLQKSHGTKAWQSVSGAITEMFQNYLDLAHKSEVGSTEGARGAKQVLYILSTLKE
CLALMSKKHIATLIEGFKVLMILRDPYITRPVIDSLNAVCLNPTSEVPVEALLEVLSLAA
GLFSGHETSADAMTFTARLLKVGMTRSFTLNRDLCVVKLPSVFNGLNDIIASEHEEAIFA
ATDALKSLIFSCIDESLIREGVNEIRNSNLNVRKPSPTVIEKLCATVESLLDYKYHAVWD
MAFQVVSAMFDKLGEHSAYFMRNTLQGLSDMQDLPDEGFPYRKQLHECVGSALGAMGPET
FLSIVRLNLEANDLSEVKVWLFPILKQYTVGGRLSFFTEAIFSMVETMSHKAQKLKLQGL
PVASRSVDSLVYSLWALLPSFCNYPVDTVESFADLGRILCGVLQTQAETHGIICASLNIL
IQQNKEVVEGKEVPTNDASPAMQRATARYDSQHAAANLKVLRLCAPKLLDVLSRIFHECS
KDDGGSLQSAIGNLASIAEKKTVSKLLFKTLQELLEATKTAIAQDESPVSGMDVDNTADK
NSSSNLRARLFDLLVSLLPGLDGQEVDTIFSSLKPAMQDSKGLIQKKAYKVLSVILKSSD
GFVSKNLEELLVLMHNICHVSAKRHKLDCLYFLLAHASRTDDLKERKDIVSSFLPEVILA
LKEVNKKTRNRAYDVLVQIGHAYADEENGGDNEKLHGYFDMVVGCLAGEKPQMISAAVKG
VARLTYEFSDLISSAYNLLPSTFLLLQRKNKEITKANLGLLKVLVAKSPVEGLHANLKSM
VEGLLKWPEGTKNLFKAKVRLLLEMLIKKCGTEAVKSVMPEEHMKLLTNIRKIKERKEKK
YAAGSDISKSQHSKDTSSKVSRWNDTKIFSDVYADSEDSDGDDMDAESHGRSKASSLLKS
KASALRSKKSRNQSHLEVDESDDEPLDLMDQHKTRLALRSSELRKRKADSDEEAEFDVEG
RLVIREGERSKRKELSDADSDAKSSKGSRFSGNSSKKNQKRMKTSESGYAYTGKEYASKK
ASGDLKKKDKLEPYAYWPLDRKMMSRRPEQRAVAVRGMSSVVKMAKKMEGKSAAEALATT
KFKKFKRSGQKKSAGKKKNK*
>AT2G34570.1 | MEE21 (maternal effect embryo arrest 21)
MRVKRQKKNRRTVRFFTVCYGFRQPYKVLCDGTFVHHLVTNEITPADTAVSELLGGPVKL
FTTRCVIAELEKLGKDFAESLEAAQTLNTATCEHEEAKTADECLSEVIGVQNTEHFFLGT
QDAEFRRKLQQESIVPLVFGLRNILLIDQPSDFQRQSAKDSENKRLTMTDTEKKLLVKRT
AKIIASNRKEATIANEEWGMPRVVSTKNGLGVKDRPQFKRNRAKGPNPLSCMKKKKENPQ
SKSKADSNSNAQKEKKEGGSDTQKRSRKRSKKGKSGPERTE*
>AT2G46230.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF652 (InterProIPR006984) Nucleotide binding protein PINc (InterProIPR006596) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G265301) Has 493 Blast hits to 493 proteins in 165 species Archae - 21 Bacteria - 0 Metazoa - 192 Fungi - 143 Plants - 53 Viruses - 0 Other Eukaryotes - 84 (source NCBI BLink)
MGKAKKPQKFAVVKKMISHKALKHYKEEVLNPNKKDLTELPRNVPSVPAGLFFSYNSTLV
PPYRVLVDTNFINFSIQNKIDLEKGMRDCLYANCTPCITDCVMAELEKLGQKYRVALRIA
KDPHFERLPCIHKGTYADDCLVDRVTQHKCFIVATCDRDLKRRIRKIPGVPIMYVTNRKY
SIEKLPEATLGGAPRY*
>AT2G46230.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF652 (InterProIPR006984) Nucleotide binding protein PINc (InterProIPR006596) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G265301) Has 441 Blast hits to 441 proteins in 161 species Archae - 15 Bacteria - 0 Metazoa - 185 Fungi - 119 Plants - 49 Viruses - 0 Other Eukaryotes - 73 (source NCBI BLink)
MGKAKKPQKFAVVKKMISHKALKHYKEEVLNPNKKDLTELPRNVPSVPAGLFFSYNSTLV
PPYRVLVDTNFINFSIQNKIDLEKGMRDCLYANCTPCITDCVMAELEKLGQKYRVALRIA
KDPHFERLPCIHKGTYADDCLVDRVTQHKCFIVATCDRDLKRRIRKIPGVPIMYVTNRKY
SIEKLPEATLGGAPRY*
>AT2G46230.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF652 (InterProIPR006984) Nucleotide binding protein PINc (InterProIPR006596) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G265301) Has 493 Blast hits to 493 proteins in 165 species Archae - 21 Bacteria - 0 Metazoa - 192 Fungi - 143 Plants - 53 Viruses - 0 Other Eukaryotes - 84 (source NCBI BLink)
MGKAKKPQKFAVVKKMISHKALKHYKEEVLNPNKKDLTELPRNVPSVPAGLFFSYNSTLV
PPYRVLVDTNFINFSIQNKIDLEKGMRDCLYANCTPCITDCVMAELEKLGQKYRVALRIA
KDPHFERLPCIHKGTYADDCLVDRVTQHKCFIVATCDRDLKRRIRKVKIVCCISALVYIH
G*
>AT2G46230.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF652 (InterProIPR006984) Nucleotide binding protein PINc (InterProIPR006596) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G265301) Has 441 Blast hits to 441 proteins in 161 species Archae - 15 Bacteria - 0 Metazoa - 185 Fungi - 119 Plants - 49 Viruses - 0 Other Eukaryotes - 73 (source NCBI BLink)
MGKAKKPQKFAVVKKMISHKALKHYKEEVLNPNKKDLTELPRNVPSVPAGLFFSYNSTLV
PPYRVLVDTNFINFSIQNKIDLEKGMRDCLYANCTPCITDCVMAELEKLGQKYRVALRIA
KDPHFERLPCIHKGTYADDCLVDRVTQHKCFIVATCDRDLKRRIRKVKIVCCISALVYIH
G*
>AT2G47420.1 | dimethyladenosine transferase putative
MAGGKIRKEKPKASNRAPSNHYQGGISFHKSKGQHILKNPLLVDSIVQKAGIKSTDVILE
IGPGTGNLTKKLLEAGKEVIAVELDSRMVLELQRRFQGTPFSNRLKVIQGDVLKTELPRF
DICVANIPYQISSPLTFKLLFHPTSFRCAVIMYQREFAMRLVAQPGDNLYCRLSVNTQLY
ARVSHLLKVGKNNFRPPPKVDSSVVRIEPRRPGPQVNKKEWDGFLRVCFIRKNKTLGSIF
KQKSVLSMLEKNFKTLQAVLASLQNNGEPALNTTSMDLGDQSMGMEDDDNEMDDDDMEMD
EGEGDGGETSEFKEKVMNVLKEGGFEEKRSSKLSQQEFLYLLSLFNKSGIHFT*
>AT3G13230.1 | RNA binding
MAESTQMEVETATEGTVPLPPKPTFKPLKAHEMSDGKVQFRKIAVPPNRYSPLKKAWLDI
YTPIYDQMKVDIRMNLKARKVELKTRADTPDISNLQKSADFVHAFMLGFDIPDAISLLRM
DELYVESFEIKDVKTLKGEHLSRAIGRLSGKGGKTKFAIENSTKTRIVIADTRIHILGAF
SNIKVARSSLCSLIMGSPAGKVYSKLRSVSARLNE*
>AT4G04940.1 | transducin family protein / WD-40 repeat family protein
MGIFEPFRAIGYITSTVPFSVQRLGTETFVTVSVGKAFQIYNCAKLNLVIISPQLPKKIR
ALASYRDYTFVAFGNEIAVFRRAHQVATWSKHVAKVDLLLVFGEHVLSLDVEGNMFIWAF
KGIEEHLAPIGNLQLTGKFTPTSIVHPDTYLNKVLVGSQEGPLQLWNINTKKMLYQFKGW
GSSVTSCVSSPALDVVAIGCADGKIHVHNIKLDEEIVTFEHASRGAVTALSFSTDGRPLL
ASGGSFGVISIWNLNKKRLQSVIRDAHDSSIISLNFLANEPVLMSASADNSLKMWIFDTN
DGDPRLLRFRSGHSAPPLCIRFYSNGRHILSAGQDRAFRLFSVIQEQQSRELSQRHISRR
AKKLRLKEEELKLKPVVSFDCAEIRERDWCNVVTCHMDTAEAYVWRLQNFVLGEHILKPC
PENPTPIKACAISACGNFAVVGTAGGWIERFNLQSGISRGSYFDMSEKRRYAHDGEVIGV
ACDSTNTLMISAGYHGDLKVWDFKKRELKSQWDVGCSLVKIVYHRVNGLLATVADDFVIR
LYDVVTLKMVREFRGHTDRITDLCFSEDGKWVISSSMDGSLRIWDVILAKQIDGVHVDVP
ITALSLSPNMDVLATAHSDQNGVYLWVNQSMFSGLPSVESYASGKDVVNVKLPSVSALTS
SEADDDMDRQVLENSEALQASSFSISQKQIPELVTLSLLPKSQWQSLINLDIIKARNKPI
EPPKKPEKAPFFLPSIPSLSGDILFKANDSEADGENEENNKKDQNSMKNFDALESPFSKH
LKSSWDSKHFLDFTNYMKSLSPSALDMELRMLEIIDEDVEEELIKRPEFILIGQLLDYFI
NEVSCKNDFEFMQAVVKLFLKIHGETIRCHPSLQEKAKKLLETQSLVWQKMEKLFQSTRC
IVTFLSNSQF*
>AT4G28450.1 | nucleotide binding / protein binding
MKIKTLSRSVDEYTRERSQDLQRVFHNFDPSLRPMEKAVEYQRALTAAKLEKIFARPFVG
AMDGHRDGVSCMAKNPNYLKGIFSASMDGDIRLWDISSRRTVCQFPGHQGAVRGLTASTD
GNVLVSCGTDCTVRLWNVPRPSLEDSSISSENFIEPSATYVWKNAFWAVDHQFEGDLFAT
AGAQLDIWNHNRSQPVQSFQWGTDSVISVRFNPGEPNLLATSASDRSITIYDLRLSSAAR
KIIMMTKTNSIAWNPMEPMNLTAANEDGSCYSFDGRKLDEAKCVHKDHVSAVMDIDFSPT
GREFVTGSYDRSVRIFPYNGGHSREIYHTKRMQRVFCVKYSCDATYVISGSDDTNLRLWK
AKASEQLGVILPREQKKHEYNEAVKNRYKHLSEVKRIVRHRHLPKPIYKAMGIIRTVNDS
KRRKEARRKAHSAPGTVVTAPLRKRKIIKEVE*
Your search returns no results, please check if everything was correct, and feel free to resubmit your search