>AT1G73430.1 |  sec34-like family protein 
MATKAASSSSLPKSGAISKGYNFASTWEQSAPLTEQQQAAIVSLSHAVAERPFPANLVHE 
HVHRPENGLSVSVEDTHLGDSGAIEAVLVNTNQFYKWFTDLESAMKSETEEKYRHYVSTL 
TERIQTCDNILHQVDETLDLFNELQLQHQGVTTKTKTLHDACDRLLMEKQKLMEFAEALR 
SKLNYFDELENVSSNFYSPNMNVSNSNFLPLLKRLDECISYIEDNPQYAESSVYLLKFRQ 
LQSRALGMIRTYILAVLKTAASQVQAAFRGTGGNKTSVSEGVEASVIYVRFKAAANELKP 
VLEEIESRSARKEYVQILAECHRLYCEQRLSLVKGIVHQRVSDFAKKEALPSLTRSGCAY 
LMQVCHMEHQLFTHFFPASSEEVSSLAPLVDPLSTYLYDILRPKLIHEANIDLLCELVHI 
LKVEVLGDQSARQSEPLAGLRPTLQRILADVNERLTFRARTYIRDEIANYTPSDEDLDYP 
AKLEGSPNTTSETDLRDDENADVFKTWYPPLEKTLSCLSKLYRCLEQAVFTGLAQEAVEV 
CSLSIQKASKLIIKRSTTMDGQLFLIKHLLILREQIAPFDIEFSVTHKELDFSHLLEHLR 
RILRGQASLFDWSRSTSLARTLSPRVLESQIDAKKELEKCLKTTCEEFIMSVTKLVVDPM 
LSFVTKVTAIKVALSSGTQNHKVDSVMAKPLKEQAFATPDKVVELVQKVYAAIQQELLPI 
LAKMKLYLQNPSTRTILFKPIKTNIVEAHTQVESLLKAEYSAEEQANINMISIQDLQTQL 
DNFL*
>AT1G73430.2 |  sec34-like family protein 
MATKAASSSSLPKSGAISKGYNFASTWEQSAPLTEQQQAAIVSLSHAVAERPFPANLVHE 
HVHRPENGLSVSVEDTHLGDSGAIEAVLVNTNQFYKWFTDLESAMKSETEEKYRHYVSTL 
TERIQTCDNILHQVDETLDLFNELQLQHQGVTTKTKTLHDACDRLLMEKQKLMEFAEALR 
SKLNYFDELENVSSNFYSPNMNVSNSNFLPLLKRLDECISYIEDNPQYAESSVYLLKFRQ 
LQSRALGMIRTYILAVLKTAASQVQAAFRGTGGNKTSVSEGVEASVIYVRFKAAANELKP 
VLEEIESRSARKEYVQILAECHRLYCEQRLSLVKGIVHQRVSDFAKKEALPSLTRSGCAY 
LMQVCHMEHQLFTHFFPASSEEVSSLAPLVDPLSTYLYDILRPKLIHEANIDLLCELVHI 
LKVEVLGDQSARQSEPLAGLRPTLQRILADVNERLTFRARTYIRDEIANYTPSDEDLDYP 
AKLEGSPNTTSETDLRDDENADVFKTWYPPLEKTLSCLSKLYRCLEQAVFTGLAQEAVEV 
CSLSIQKASKLIIKRSTTMDGQLFLIKHLLILREQIAPFDIEFSVTHKELDFSHLLEHLR 
RILRGQASLFDWSRSTSLARTLSPRVLESQIDAKKELEKCLKTTCEEFIMSVTKLVVDPM 
LSFVTKVTAIKVALSSGTQNHKVDSVMAKPLKEQAFATPDKVVELVQKVYAAIQQELLPI 
LAKMKLYLQNPSTRTILFKPIKTNIVEAHTQVESLLKAEYSAEEQANINMISIQDLQTQL 
DNFL*
>AT1G02130.1 |  ARA-5 (ARABIDOPSIS RAS 5) GTP binding 
MNPEYDYLFKLLLIGDSGVGKSCLLLRFSDDSYVESYISTIGVDFKIRTVEQDGKTIKLQ 
IWDTAGQERFRTITSSYYRGAHGIIIVYDVTDEESFNNVKQWLSEIDRYASDNVNKLLVG 
NKSDLTENRAIPYETAKAFADEIGIPFMETSAKDATNVEQAFMAMSASIKERMASQPAGN 
NARPPTVQIRGQPVAQKNGCCST*
>AT1G31780.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Conserved oligomeric complex COG6 (InterProIPR010490) Has 281 Blast hits to 279 proteins in 131 species Archae - 0 Bacteria - 2 Metazoa - 132 Fungi - 106 Plants - 21 Viruses - 0 Other Eukaryotes - 20 (source NCBI BLink) 
MASTVGLAPGLSRKLKKVLDCRTDSPDLVASLNALSSFYDENSAHARRNLRSTIEKRALQ 
INSEFLNAADSTQIALDRVEEEVNALADCCDKIAAALSSSAATTSDIISTTERLKQELEV 
TTQRQEIVNCFLRDYQLSNEEIKALREDELNENFFQALSHVQEIHSNCKLLLRTHHQRAG 
LELMDMMAVYQEGAYERLCRWVQAECRKLGDTDNPEVSELLRTAVRCLKERPVLFKYCAE 
EVGNLRHNALFRRFISALTRGGPGGMPRPIEVHAHDPLRYVGDMLGWLHQALASERELVH 
ALFDIDSADHQSNAKNTSENIALKAGESDFTFVLDRIFEGVCRPFKVRVEQVLQSQPSLI 
ISYKLTNTLEFYSYTISDLLGRDTALCNTIGMVKDAAQKTFFDILKTRGEKLLRYPPPVA 
VDLSPPPAVREGVSLTLEIIENYNSMMVSASGEKPAFDPVLSALLDPIIKMCEQAAEAHK 
SKKSGQLPRRSRTSSDSSQLTSVDALLSSSPSPPQNNETPSKIFLINCLCAIQQPLLRHD 
VASQYVTNIGLMIENHINLLVQNEVDTLLHKCGLSDKMQIFRSSTSELPLSERQDTSPAM 
LSECLKAFFGLVLGSEGSLPEFEQIQVPKLRSEACVRVAKTLAEAYEVIYQAVTDQQNGY 
PDPKSLARHPPDQIRTILGI*
>AT4G24840.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN protein transport Golgi organization LOCATED IN vacuole EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s COG complex component COG2 (InterProIPR009316) Has 214 Blast hits to 204 proteins in 99 species Archae - 0 Bacteria - 0 Metazoa - 105 Fungi - 56 Plants - 22 Viruses - 0 Other Eukaryotes - 31 (source NCBI BLink) 
MSDLVATSPSPSSAPRSATDFFSDPYDSHPLWFKPSLFLSPNFDSESYISELRTFVPFDT 
LRSELRSHLASLNRELVDLINRDYADFVNLSTKLVDIDAAVVRMRAPLLELREKITGFRG 
SVEAALFALRNGLQQRSDAAAAREVLELLLDTFHVVSKVEKLIKVLPSTPSDWQNEDANS 
MGRSSMNDENSTQQDGTTMRETQSMLLERIASEMNRLKFYMAHAQNLPFIENMEKRIQSA 
SVLLDASLGHCFIDGLNNSDTSVLYNCLRAYAAIDNTNAAEEIFRTTIVAPFIQKIITHE 
TTTNAAGTSEDELENDYKQIKHFIAKDCKMLLEISSTDKSGLHVFDFLANSILKEVLWAI 
QKVKPGAFSPGRPTEFLKNYKASLDFLAYLEGYCPSRSAVTKFRAEAICVEFMKQWNVGV 
YFSLRFQEIAGALDSALTSPSLVFIQDSDKESSLNLILRQSDTLLECLRSCWKEDVLVFS 
AADKFLRLTLQLLSRYSFWVSSALNNRKSNASPSPGCEWAVSATAEDFVYVIHDVNCLVS 
EVCGDYLGHISQYLSSSSTEVLDVVRISIEQGGVSLEKVLPLLTKTIIDVIVDKSVEDLR 
QLRGITATFRMTNKPLPVRHSPYVVGLLRPVKAFLEGDKARNYLTQKTKEELLHGSVSEI 
TRRYYELAADVVSVARKTQSSLQKLRQNAQRRGGAASGVSDQNVSETDKMCMQLFLDIQE 
YGRNVSALGLKPADIPEYCSFWQCVAPADRQNSISV*
>AT2G45200.1 |  GOS12 (GOLGI SNARE 12) SNARE binding 
MTESSLDLQESGWEELRREARKIEGDLDVKLSSYAKLGARFTQGGYVDTGSPTVGSGRSW 
KSMEMEIQSLLEKLLDINDSMSRCAASAAPTTSVTQKLARHRDILHEYTQEFRRIKGNIN 
SLREHAELLSSVRDDISEYKASGSMSPGVQVLRERASIHGSISHIDDVIGQAQATRAVLG 
SQRSLFSDVQGKVKNLGDKFPVIRGLLGSIKRKRSRDTLILSAVIAACTLFLIIYWLSK*
>AT2G47610.1 |  60S ribosomal protein L7A (RPL7aA) 
MAPKKGVKVAAKKKTAEKVSNPLFERRPKQFGIGGALPPKKDLSRYIKWPKSIRLQRQKR 
ILKQRLKVPPALNQFTKTLDKNLATSLFKVLLKYRPEDKAAKKERLVKKAQAEAEGKPSE 
SKKPIVVKYGLNHVTYLIEQNKAQLVVIAHDVDPIELVVWLPALCRKMEVPYCIVKGKSR 
LGAVVHQKTASCLCLTTVKNEDKLEFSKILEAIKANFNDKYEEYRKKWGGGIMGSKSQAK 
TKAKERVIAKEAAQRMN*
>AT5G41790.1 |  CIP1 (COP1-INTERACTIVE PROTEIN 1) protein binding 
MKKHKFRETLKSFFEPHFDHEKGEMLKGTKTEIDEKVNKILGMVESGDVNEDESNRQVVA 
DLVKEFYSEYQSLYRQYDDLTGEIRKKVNGKGESSSSSSSDSDSDHSSKRKVKRNGNGKV 
EKDVELVTGALKQQIEAANLEIADLKGKLTTTVEEKEAVDSELELALMKLKESEEISSKL 
KLETEKLEDEKSIALSDNRELHQKLEVAGKTETDLNQKLEDIKKERDELQTERDNGIKRF 
QEAEKVAEDWKTTSDQLKDETSNLKQQLEASEQRVSELTSGMNSAEEENKSLSLKVSEIS 
DVIQQGQTTIQELISELGEMKEKYKEKESEHSSLVELHKTHERESSSQVKELEAHIESSE 
KLVADFTQSLNNAEEEKKLLSQKIAELSNEIQEAQNTMQELMSESGQLKESHSVKERELF 
SLRDIHEIHQRDSSTRASELEAQLESSKQQVSDLSASLKAAEEENKAISSKNVETMNKLE 
QTQNTIQELMAELGKLKDSHREKESELSSLVEVHETHQRDSSIHVKELEEQVESSKKLVA 
ELNQTLNNAEEEKKVLSQKIAELSNEIKEAQNTIQELVSESGQLKESHSVKDRDLFSLRD 
IHETHQRESSTRVSELEAQLESSEQRISDLTVDLKDAEEENKAISSKNLEIMDKLEQAQN 
TIKELMDELGELKDRHKEKESELSSLVKSADQQVADMKQSLDNAEEEKKMLSQRILDISN 
EIQEAQKTIQEHMSESEQLKESHGVKERELTGLRDIHETHQRESSTRLSELETQLKLLEQ 
RVVDLSASLNAAEEEKKSLSSMILEITDELKQAQSKVQELVTELAESKDTLTQKENELSS 
FVEVHEAHKRDSSSQVKELEARVESAEEQVKELNQNLNSSEEEKKILSQQISEMSIKIKR 
AESTIQELSSESERLKGSHAEKDNELFSLRDIHETHQRELSTQLRGLEAQLESSEHRVLE 
LSESLKAAEEESRTMSTKISETSDELERTQIMVQELTADSSKLKEQLAEKESKLFLLTEK 
DSKSQVQIKELEATVATLELELESVRARIIDLETEIASKTTVVEQLEAQNREMVARISEL 
EKTMEERGTELSALTQKLEDNDKQSSSSIETLTAEIDGLRAELDSMSVQKEEVEKQMVCK 
SEEASVKIKRLDDEVNGLRQQVASLDSQRAELEIQLEKKSEEISEYLSQITNLKEEIINK 
VKVHESILEEINGLSEKIKGRELELETLGKQRSELDEELRTKKEENVQMHDKINVASSEI 
MALTELINNLKNELDSLQVQKSETEAELEREKQEKSELSNQITDVQKALVEQEAAYNTLE 
EEHKQINELFKETEATLNKVTVDYKEAQRLLEERGKEVTSRDSTIGVHEETMESLRNELE 
MKGDEIETLMEKISNIEVKLRLSNQKLRVTEQVLTEKEEAFRKEEAKHLEEQALLEKNLT 
MTHETYRGMIKEIADKVNITVDGFQSMSEKLTEKQGRYEKTVMEASKILWTATNWVIERN 
HEKEKMNKEIEKKDEEIKKLGGKVREDEKEKEMMKETLMGLGEEKREAIRQLCVWIDHHR 
SRCEYLEEVLSKTVVARGQRRVSQRT*
>AT1G64330.1 |  myosin heavy chain-related 
MRKLSIRDSLKSFFEPHLHPDNGESLKGTKTEIDEKVKKILGIVESGDIEEDESKRLVVA 
ELVKDFYKEYESLYHQYDDLTGEIRKKVHGKGENDSSSSSSSDSDSDKKSKRNGRGENEI 
ELLKKQMEDANLEIADLKMKLATTDEHKEAVESEHQEILKKLKESDEICGNLRVETEKLT 
SENKELNEKLEVAGETESDLNQKLEDVKKERDGLEAELASKAKDHESTLEEVNRLQGQKN 
ETEAELEREKQEKPALLNQINDVQKALLEQEAAYNTLSQEHKQINGLFEEREATIKKLTD 
DYKQAREMLEEYMSKMEETERRMQETGKDVASRESAIVDLEETVESLRNEVERKGDEIES 
LMEKMSNIEVKLRLSNQKLRVTEQVLTEKEGELKRIEAKHLEEQALLEEKIATTHETYRG 
LIKEISERVDSTILNRFQSLSEKLEEKHKSYEKTVVEATKMLLTAKKCVVEMKKEKDEMA 
KEKEEVEKKLEGQVREEEKEKEKLKETLLGLGEEKREAIRQLCIWIEHHRDRCEYLEEVL 
SKMVVARGQRRSQRA*
>AT3G24350.1 |  SYP32 (SYNTAXIN OF PLANTS 32) SNAP receptor 
MSARHGQSSYRDRSDEFFKIVETLRRSIAPAPAANNVPYGNNRNDGARREDLINKSEFNK 
RASHIGLAINQTSQKLSKLAKLAKRTSVFDDPTQEIQELTVVIKQEISALNSALVDLQLF 
RSSQNDEGNNSRDRDKSTHSATVVDDLKYRLMDTTKEFKDVLTMRTENMKVHESRRQLFS 
SNASKESTNPFVRQRPLAAKAAASESVPLPWANGSSSSSSQLVPWKPGEGESSPLLQQSQ 
QQQQQQQQQMVPLQDTYMQGRAEALHTVESTIHELSSIFTQLATMVSQQGEIAIRIDQNM 
EDTLANVEGAQSQLARYLNSISSNRWLMMKIFFVLIAFLMIFLFFVA*
>AT5G58060.1 |  YKT61 
MKITALLVLKCAPEASDPVILSNASDVSHFGYFQRSSVKEFVVFVGRTVASRTPPSQRQS 
VQHEEYKVHAYNRNGLCAVGFMDDHYPVRSAFSLLNQVLDEYQKSFGESWRSAKEDSNQP 
WPYLTEALNKFQDPAEADKLLKIQRELDETKIILHKTIDSVLARGEKLDSLVEKSSDLSM 
ASQMFYKQAKKTNSCCTIL*
>AT5G58060.2 |  YKT61 
MKITALLVLKCAPEASDPVILSNASDVSHFGYFQRSSVKEFVVFVGRTVASRTPPSQRQS 
VQHEGCAPFLILDLPGLCPGFNFLRFYLIVHAYNRNGLCAVGFMDDHYPVRSAFSLLNQV 
LDEYQKSFGESWRSAKEDSNQPWPYLTEALNKFQDPAEADKLLKIQRELDETKIILHKTI 
DSVLARGEKLDSLVEKSSDLSMASQMFYKQAKKTNSCCTIL*
>AT1G11890.1 |  SEC22 transporter 
MVKMTLIARVTDGLPLAEGLDDGRDLPDSDMYKQQVKALFKNLSRGQNDASRMSVETGPY 
VFHYIIEGRVCYLTMCDRSYPKKLAFQYLEDLKNEFERVNGPNIETAARPYAFIKFDTFI 
QKTKKLYQDTRTQRNIAKLNDELYEVHQIMTRNVQEVLGVGEKLDQVSEMSSRLTSESRI 
YADKAKDLNRQALIRKWAPVAIVFGVVFLLFWVKNKLW*
>AT5G12480.1 |  CPK7 (calmodulin-domain protein kinase 7) ATP binding / calcium ion binding / calmodulin-dependent protein kinase/ kinase/ protein kinase/ protein serine/threonine kinase/ protein tyrosine kinase 
MGNCCGNPSSATNQSKQGKPKNKNNPFYSNEYATTDRSGAGFKLSVLKDPTGHDISLQYD 
LGREVGRGEFGITYLCTDKETGEKYACKSISKKKLRTAVDIEDVRREVEIMKHMPKHPNV 
VSLKDSFEDDDAVHIVMELCEGGELFDRIVARGHYTERAAAAVMKTIVEVVQICHKQGVM 
HRDLKPENFLFANKKETSALKAIDFGLSVFFKPGEQFNEIVGSPYYMAPEVLRRNYGPEI 
DVWSAGVILYILLCGVPPFWAETEQGVAQAIIRSVIDFKRDPWPRVSDSAKDLVRKMLEP 
DPKKRLTAAQVLEHTWILNAKKAPNVSLGETVKARLKQFSVMNKLKKRALRVIAEHLSVE 
EAAGIKEAFEMMDVNKRGKINLEELKYGLQKAGQQIADTDLQILMEATDVDGDGTLNYSE 
FVAVSVHLKKMANDEHLHKAFNFFDQNQSGYIEIDELREALNDELDNTSSEEVIAAIMQD 
VDTDKDGRISYEEFVAMMKAGTDWRKASRQYSRERFNSLSLKLMRDGSLQLEGET*
>AT5G12480.1 |  CPK7 (calmodulin-domain protein kinase 7) ATP binding / calcium ion binding / calmodulin-dependent protein kinase/ kinase/ protein kinase/ protein serine/threonine kinase 
MGNCCGNPSSATNQSKQGKPKNKNNPFYSNEYATTDRSGAGFKLSVLKDPTGHDISLQYD 
LGREVGRGEFGITYLCTDKETGEKYACKSISKKKLRTAVDIEDVRREVEIMKHMPKHPNV 
VSLKDSFEDDDAVHIVMELCEGGELFDRIVARGHYTERAAAAVMKTIVEVVQICHKQGVM 
HRDLKPENFLFANKKETSALKAIDFGLSVFFKPGEQFNEIVGSPYYMAPEVLRRNYGPEI 
DVWSAGVILYILLCGVPPFWAETEQGVAQAIIRSVIDFKRDPWPRVSDSAKDLVRKMLEP 
DPKKRLTAAQVLEHTWILNAKKAPNVSLGETVKARLKQFSVMNKLKKRALRVIAEHLSVE 
EAAGIKEAFEMMDVNKRGKINLEELKYGLQKAGQQIADTDLQILMEATDVDGDGTLNYSE 
FVAVSVHLKKMANDEHLHKAFNFFDQNQSGYIEIDELREALNDELDNTSSEEVIAAIMQD 
VDTDKDGRISYEEFVAMMKAGTDWRKASRQYSRERFNSLSLKLMRDGSLQLEGET*
>AT5G12480.2 |  CPK7 (calmodulin-domain protein kinase 7) ATP binding / calcium ion binding / calmodulin-dependent protein kinase/ kinase/ protein kinase/ protein serine/threonine kinase/ protein tyrosine kinase 
MGNCCGNPSSATNQSKQGKPKNKNNPFYSNEYATTDRSGAGFKLSVLKDPTGHDISLQGH 
YTERAAAAVMKTIVEVVQICHKQGVMHRDLKPENFLFANKKETSALKAIDFGLSVFFKPG 
EQFNEIVGSPYYMAPEVLRRNYGPEIDVWSAGVILYILLCGVPPFWAETEQGVAQAIIRS 
VIDFKRDPWPRVSDSAKDLVRKMLEPDPKKRLTAAQVLEHTWILNAKKAPNVSLGETVKA 
RLKQFSVMNKLKKRALRVIAEHLSVEEAAGIKEAFEMMDVNKRGKINLEELKYGLQKAGQ 
QIADTDLQILMEATDVDGDGTLNYSEFVAVSVHLKKMANDEHLHKAFNFFDQNQSGYIEI 
DELREALNDELDNTSSEEVIAAIMQDVDTDKDGRISYEEFVAMMKAGTDWRKASRQYSRE 
RFNSLSLKLMRDGSLQLEGET*
>AT5G12480.2 |  CPK7 (calmodulin-domain protein kinase 7) ATP binding / calcium ion binding / calmodulin-dependent protein kinase/ kinase/ protein kinase/ protein serine/threonine kinase 
MGNCCGNPSSATNQSKQGKPKNKNNPFYSNEYATTDRSGAGFKLSVLKDPTGHDISLQGH 
YTERAAAAVMKTIVEVVQICHKQGVMHRDLKPENFLFANKKETSALKAIDFGLSVFFKPG 
EQFNEIVGSPYYMAPEVLRRNYGPEIDVWSAGVILYILLCGVPPFWAETEQGVAQAIIRS 
VIDFKRDPWPRVSDSAKDLVRKMLEPDPKKRLTAAQVLEHTWILNAKKAPNVSLGETVKA 
RLKQFSVMNKLKKRALRVIAEHLSVEEAAGIKEAFEMMDVNKRGKINLEELKYGLQKAGQ 
QIADTDLQILMEATDVDGDGTLNYSEFVAVSVHLKKMANDEHLHKAFNFFDQNQSGYIEI 
DELREALNDELDNTSSEEVIAAIMQDVDTDKDGRISYEEFVAMMKAGTDWRKASRQYSRE 
RFNSLSLKLMRDGSLQLEGET*
>AT1G18260.1 |  suppressor of lin-12-like protein-related / sel-1 protein-related 
MRILSYGIVILSLLVFSFIEFGVHARPVVLVLSNDDLNSGGDDNGVGESSDFDEFGESEP 
KSEEELDPGSWRSIFEPDDSTVQAASPQYYSGLKKILSAASEGNFRLMEEAVDEIEAASS 
AGDPHAQSIMGFVYGIGMMREKSKSKSFLHHNFAAAGGNMQSKMALAFTYLRQDMHDKAV 
QLYAELAETAVNSFLISKDSPVVEPTRIHSGTEENKGALRKSRGEEDEDFQILEYQAQKG 
NANAMYKIGLFYYFGLRGLRRDHTKALHWFLKAVDKGEPRSMELLGEIYARGAGVERNYT 
KALEWLTLAAKEGLYSAFNGIGYLYVKGYGVDKKNYTKAREYFEKAVDNEDPSGHYNLGV 
LYLKGIGVNRDVRQATKYFFVAANAGQPKAFYQLAKMFHTGVGLKKNLEMATSFYKLVAE 
RGPWSSLSRWALEAYLKGDVGKALILYSRMAEMGYEVAQSNAAWILDKYGERSMCMGVSG 
FCTDKERHERAHSLWWRASEQGNEHAALLIGDAYYYGRGTERDFVRAAEAYMHAKSQSNA 
QAMFNLGYMHEHGQGLPFDLHLAKRYYDESLQSDAAARLPVTLALASLWLRRNYADTVLV 
RVVDSLPEVYPKVETWIENVVFEEGNATILTLFVCLITILYLRERQRRQVVVVADPVAAD 
VAQPLDADVAQHLAAFPR*
>AT1G13180.1 |  DIS1 (DISTORTED TRICHOMES 1) ATP binding / actin binding / protein binding / structural constituent of cytoskeleton 
MDPTSRPAIVIDNGTGYTKMGFAGNVEPCFILPTVVAVNESFLNQSKSSSKATWQTQHNA 
GVAADLDFYIGDEALAKSRSSSTHNLHYPIEHGQVEDWDAMERYWQQCIFNYLRCDPEDH 
YFLLTESPLTPPESREYTGEILFETFNVPGLYIAVNSVLALAAGYTTSKCEMTGVVVDVG 
DGATHVVPVAEGYVIGSCIKSIPIAGKDVTLFIQQLMRERGENIPPEDSFDVARKVKEMY 
CYTCSDIVKEFNKHDKEPAKYIKQWKGVKPKTGAPYTCDVGYERFLGPEVFFNPEIYSND 
FTTTLPAVIDKCIQSAPIDTRRALYKNIVLSGGSTMFKDFGRRLQRDLKKIVDARVLANN 
ARTGGEITSQPVEVNVVSHPVQRFAVWFGGSVLSSTPEFFASCRTKEEYEEYGASICRTN 
PVFKGMY*
>AT5G16300.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 204 Blast hits to 183 proteins in 76 species Archae - 0 Bacteria - 0 Metazoa - 127 Fungi - 34 Plants - 24 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASARNSWKAFTNGEQSQTSDLEENSNFGVAFKSFMQES 
TLKLGSILTDGQVGIFKDRSAAAMSTFGDILPAQAAGLLSSFTNTRSE*
>AT5G16300.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 199 Blast hits to 182 proteins in 76 species Archae - 0 Bacteria - 0 Metazoa - 122 Fungi - 34 Plants - 24 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASARNSWKAFTNGEQSQTSDLEENSNFGVAFKSFMQES 
TLKLGSILTDGQVGIFKDRSAAAMSTFGDILPAQAAGLLSSFTNTRSE*
>AT5G16300.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 201 Blast hits to 182 proteins in 75 species Archae - 0 Bacteria - 0 Metazoa - 125 Fungi - 34 Plants - 23 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASARNSWKAFTNGEQSQTSDLEENSNFGVAFKSFMQES 
TLKLGSILTDGQVGIFKDRSAAAMSTFGDILPAQAAGLLSSFTNTRSE*
>AT5G16300.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 204 Blast hits to 183 proteins in 76 species Archae - 0 Bacteria - 0 Metazoa - 127 Fungi - 34 Plants - 24 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASESTLKLGSILTDGQVGIFKDRSAAAMSTFGDILPAQ 
AAGLLSSFTNTRSE*
>AT5G16300.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 199 Blast hits to 182 proteins in 76 species Archae - 0 Bacteria - 0 Metazoa - 122 Fungi - 34 Plants - 24 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASESTLKLGSILTDGQVGIFKDRSAAAMSTFGDILPAQ 
AAGLLSSFTNTRSE*
>AT5G16300.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 201 Blast hits to 182 proteins in 75 species Archae - 0 Bacteria - 0 Metazoa - 125 Fungi - 34 Plants - 23 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASESTLKLGSILTDGQVGIFKDRSAAAMSTFGDILPAQ 
AAGLLSSFTNTRSE*
>AT5G16300.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 204 Blast hits to 183 proteins in 76 species Archae - 0 Bacteria - 0 Metazoa - 127 Fungi - 34 Plants - 24 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASARNSWKAFTNGEQSQTSDLEENSNFGVAFKSFMQVK 
HIETGININ*
>AT5G16300.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 199 Blast hits to 182 proteins in 76 species Archae - 0 Bacteria - 0 Metazoa - 122 Fungi - 34 Plants - 24 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASARNSWKAFTNGEQSQTSDLEENSNFGVAFKSFMQVK 
HIETGININ*
>AT5G16300.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 201 Blast hits to 182 proteins in 75 species Archae - 0 Bacteria - 0 Metazoa - 125 Fungi - 34 Plants - 23 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASARNSWKAFTNGEQSQTSDLEENSNFGVAFKSFMQVK 
HIETGININ*
>AT1G18830.1 |  transducin family protein / WD-40 repeat family protein 
MDCIKSIGRSAFVAIAPESPFIAAGTMAGAVDLSFSSSANLEIFELDFQSNDRELKLVGQ 
CQSSERFNRLAWGSYGSGSDGLIAGGLVDGNIGLWNPISSESGEIAHVRDLSKHKGPVRG 
LEFNVKSPNQLASGADDGTVCIWDLANPSKPSHYLKGTGSYMQSEISSLSWNKGFQHVLA 
STSHNGTTVIWDVNNEKIITDLKTTVRCSVLQWDPDHFNQILVASDEDSSPNVKLLDIRY 
LQSPVRTFVGHQRGVIAMEWCPSDSLYLLTCGKDNRTICWNTKTGKIVAELPTGQNWNFD 
VHWYPKMPGVISASSVDGKIGIYNLEGCSSYGTENQQHFLFHLLDADPLTAPKWWKRPAG 
ASFGFGGKLISFNKNLPEASEVFLHSLATEKSLVNRISKFEAALENGEKTSLRGLCEKKT 
EEAESEEEKETWGLLKIMLEEDGNAKTKLRSHLGFSLPSEENDQTANEPHATCSSTNVEE 
TQKVPEPEGEEEESSDPTFDDAIQRSLIVGDYKEAVAQCFSANKMADALVIAHVGGTELW 
ESTRDKYIRMSNAPYMKVVSAMMNNDLMTYLHTRQPKSWKETLALICTFAEGDEWISLCD 
ALASNLMAAGFTLAATLCYICAGNVDKTVDIWSMSLEKQSAGKSYAECVQDLMEKTLVLA 
LTTCNKRVSASLRKLFESYAEILASQGLIATAMKFLKLLESGDFSPELSILRDRISLYAE 
PEAANTSASTNTQPKISNPYQEKSFTPAPLSNAQPSRSITFFPLNPPRELKNADQYQQPT 
MDYHSFNRSAGPAYNAPPGPGSYRSIHSQVGPYINSKIPQTVAPPVRPMTPTHQVAVQPE 
PVAPPPTVQTADTSNVPAHQKPIVASLTRLFKETFEPLRGYSRDTPAKKREAEDNCSRKL 
GALFSKLNNGDISKNAAEKLTQLCQALDKRDFGAALKIQGLMTSTEWDECSSWLPTLKKM 
IVTGRQNVR*
>AT4G01400.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s COG4 transport (InterProIPR013167) Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G461001) Has 11639 Blast hits to 3936 proteins in 180 species Archae - 0 Bacteria - 2 Metazoa - 206 Fungi - 141 Plants - 11004 Viruses - 0 Other Eukaryotes - 286 (source NCBI BLink) 
MIRRPIYDFAAVFRHLTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIAS 
QSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGE 
IFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKS 
SRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQ 
VNGAMELLDDMLNKGFVPDRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFSVSNCLVKG 
FCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDAVKEEITGD 
TRIVDVGIENKKMPEIEQDDAAAETVDSSTVKFGTPEALEYVRSLTDVGAMTRLLHECIA 
YQRSLDSDLDTLLSQRTELDRNLVQLQRSAEILDIVKADADHMLGNVRSTCDLADQVSGK 
VRELDLAQSRVNVTLSRIDAIVERGNCIEGVKTALESEDYESAAKFVQRFLQIDLQYKDS 
GSDQSEQLHASKEQLEGIAKKKLLAAIDQRDHPTILRFVRLYSPLGMETEGLQLYVGYLK 
KVIALRGRMEYENVVELMEQGLGQVNFVGCLTNLFKDIVMAIEENDEILRGLCGEDGVAY 
AICELQEECDLRGSLILKKYMDFRKLAILASDINNSPNLNILPGGASEGPDPREVELYVE 
EILSLMQLGEDYTEFMVSKIKSLTSVDPELLPTATKAFRNKSFSKAIQDVTRYYVILEGF 
FMVENVRKAIRIDEHVPDSLTTSMVDDVFYVLQSCLRRAISTSNISSVIAVLSYAGSLLG 
NDYHEALQQKIREPNLGARLFLGGIGVENTGTEIATALNNMDVSCEYILKLKHEIEEQCT 
EVFPAPADRERIKSCLSELGELSSTFKQLLNSGMEQLVATVTPRIRPVLDTVATISYELT 
ETEYAENEVNDPWVQRLLHSVETNAAWLQPLMTSNNYDSFLHLIIDFIVKRLEVIMMQKR 
FSQLGGLQLDRDTRALVSHFSGMTQRTVRDKFARLTQMATILNLEKVSEILDFWGENSGP 
MTWRLTPAEVRRVLGLRVEFKPESIAALKL*
>AT4G01400.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s COG4 transport (InterProIPR013167) 
MIRRPIYDFAAVFRHLTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIAS 
QSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGE 
IFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKS 
SRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQ 
VNGAMELLDDMLNKGFVPDRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFSVSNCLVKG 
FCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDAVKEEITGD 
TRIVDVGIENKKMPEIEQDDAAAETVDSSTVKFGTPEALEYVRSLTDVGAMTRLLHECIA 
YQRSLDSDLDTLLSQRTELDRNLVQLQRSAEILDIVKADADHMLGNVRSTCDLADQVSGK 
VRELDLAQSRVNVTLSRIDAIVERGNCIEGVKTALESEDYESAAKFVQRFLQIDLQYKDS 
GSDQSEQLHASKEQLEGIAKKKLLAAIDQRDHPTILRFVRLYSPLGMETEGLQLYVGYLK 
KVIALRGRMEYENVVELMEQGLGQVNFVGCLTNLFKDIVMAIEENDEILRGLCGEDGVAY 
AICELQEECDLRGSLILKKYMDFRKLAILASDINNSPNLNILPGGASEGPDPREVELYVE 
EILSLMQLGEDYTEFMVSKIKSLTSVDPELLPTATKAFRNKSFSKAIQDVTRYYVILEGF 
FMVENVRKAIRIDEHVPDSLTTSMVDDVFYVLQSCLRRAISTSNISSVIAVLSYAGSLLG 
NDYHEALQQKIREPNLGARLFLGGIGVENTGTEIATALNNMDVSCEYILKLKHEIEEQCT 
EVFPAPADRERIKSCLSELGELSSTFKQLLNSGMEQLVATVTPRIRPVLDTVATISYELT 
ETEYAENEVNDPWVQRLLHSVETNAAWLQPLMTSNNYDSFLHLIIDFIVKRLEVIMMQKR 
FSQLGGLQLDRDTRALVSHFSGMTQRTVRDKFARLTQMATILNLEKVSEILDFWGENSGP 
MTWRLTPAEVRRVLGLRVEFKPESIAALKL*
>AT4G01400.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G461001) 
MIRRPIYDFAAVFRHLTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIAS 
QSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGE 
IFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKS 
SRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQ 
VNGAMELLDDMLNKGFVPDRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFSVSNCLVKG 
FCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDAVKEEITGD 
TRIVDVGIENKKMPEIEQDDAAAETVDSSTVKFGTPEALEYVRSLTDVGAMTRLLHECIA 
YQRSLDSDLDTLLSQRTELDRNLVQLQRSAEILDIVKADADHMLGNVRSTCDLADQVSGK 
VRELDLAQSRVNVTLSRIDAIVERGNCIEGVKTALESEDYESAAKFVQRFLQIDLQYKDS 
GSDQSEQLHASKEQLEGIAKKKLLAAIDQRDHPTILRFVRLYSPLGMETEGLQLYVGYLK 
KVIALRGRMEYENVVELMEQGLGQVNFVGCLTNLFKDIVMAIEENDEILRGLCGEDGVAY 
AICELQEECDLRGSLILKKYMDFRKLAILASDINNSPNLNILPGGASEGPDPREVELYVE 
EILSLMQLGEDYTEFMVSKIKSLTSVDPELLPTATKAFRNKSFSKAIQDVTRYYVILEGF 
FMVENVRKAIRIDEHVPDSLTTSMVDDVFYVLQSCLRRAISTSNISSVIAVLSYAGSLLG 
NDYHEALQQKIREPNLGARLFLGGIGVENTGTEIATALNNMDVSCEYILKLKHEIEEQCT 
EVFPAPADRERIKSCLSELGELSSTFKQLLNSGMEQLVATVTPRIRPVLDTVATISYELT 
ETEYAENEVNDPWVQRLLHSVETNAAWLQPLMTSNNYDSFLHLIIDFIVKRLEVIMMQKR 
FSQLGGLQLDRDTRALVSHFSGMTQRTVRDKFARLTQMATILNLEKVSEILDFWGENSGP 
MTWRLTPAEVRRVLGLRVEFKPESIAALKL*
>AT4G01400.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s COG4 transport (InterProIPR013167) Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G461001) Has 11639 Blast hits to 3936 proteins in 180 species Archae - 0 Bacteria - 2 Metazoa - 206 Fungi - 141 Plants - 11004 Viruses - 0 Other Eukaryotes - 286 (source NCBI BLink) 
MPEIEQDDAAAETVDSSTVKFGTPEALEYVRSLTDVGAMTRLLHECIAYQRSLDSDLDTL 
LSQRTELDRNLVQLQRSAEILDIVKADADHMLGNVRSTCDLADQVSGKVRELDLAQSRVN 
VTLSRIDAIVERGNCIEGVKTALESEDYESAAKFVQRFLQIDLQYKDSGSDQSEQLHASK 
EQLEGIAKKKLLAAIDQRDHPTILRFVRLYSPLGMETEGLQLYVGYLKKVIALRGRMEYE 
NVVELMEQGLGQVNFVGCLTNLFKDIVMAIEENDEILRGLCGEDGVAYAICELQEECDLR 
GSLILKKYMDFRKLAILASDINNSPNLNILPGGASEGPDPREVELYVEEILSLMQLGEDY 
TEFMVSKIKSLTSVDPELLPTATKAFRNKSFSKAIQDVTRYYVILEGFFMVENVRKAIRI 
DEHVPDSLTTSMVDDVFYVLQSCLRRAISTSNISSVIAVLSYAGSLLGNDYHEALQQKIR 
EPNLGARLFLGGIGVENTGTEIATALNNMDVSCEYILKLKHEIEEQCTEVFPAPADRERI 
KSCLSELGELSSTFKQLLNSGMEQLVATVTPRIRPVLDTVATISYELTETEYAENEVNDP 
WVQRLLHSVETNAAWLQPLMTSNNYDSFLHLIIDFIVKRLEVIMMQKRFSQLGGLQLDRD 
TRALVSHFSGMTQRTVRDKFARLTQMATILNLEKVSEILDFWGENSGPMTWRLTPAEVRR 
VLGLRVEFKPESIAALKL*
>AT4G01400.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s COG4 transport (InterProIPR013167) 
MPEIEQDDAAAETVDSSTVKFGTPEALEYVRSLTDVGAMTRLLHECIAYQRSLDSDLDTL 
LSQRTELDRNLVQLQRSAEILDIVKADADHMLGNVRSTCDLADQVSGKVRELDLAQSRVN 
VTLSRIDAIVERGNCIEGVKTALESEDYESAAKFVQRFLQIDLQYKDSGSDQSEQLHASK 
EQLEGIAKKKLLAAIDQRDHPTILRFVRLYSPLGMETEGLQLYVGYLKKVIALRGRMEYE 
NVVELMEQGLGQVNFVGCLTNLFKDIVMAIEENDEILRGLCGEDGVAYAICELQEECDLR 
GSLILKKYMDFRKLAILASDINNSPNLNILPGGASEGPDPREVELYVEEILSLMQLGEDY 
TEFMVSKIKSLTSVDPELLPTATKAFRNKSFSKAIQDVTRYYVILEGFFMVENVRKAIRI 
DEHVPDSLTTSMVDDVFYVLQSCLRRAISTSNISSVIAVLSYAGSLLGNDYHEALQQKIR 
EPNLGARLFLGGIGVENTGTEIATALNNMDVSCEYILKLKHEIEEQCTEVFPAPADRERI 
KSCLSELGELSSTFKQLLNSGMEQLVATVTPRIRPVLDTVATISYELTETEYAENEVNDP 
WVQRLLHSVETNAAWLQPLMTSNNYDSFLHLIIDFIVKRLEVIMMQKRFSQLGGLQLDRD 
TRALVSHFSGMTQRTVRDKFARLTQMATILNLEKVSEILDFWGENSGPMTWRLTPAEVRR 
VLGLRVEFKPESIAALKL*
>AT4G01400.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G461001) 
MPEIEQDDAAAETVDSSTVKFGTPEALEYVRSLTDVGAMTRLLHECIAYQRSLDSDLDTL 
LSQRTELDRNLVQLQRSAEILDIVKADADHMLGNVRSTCDLADQVSGKVRELDLAQSRVN 
VTLSRIDAIVERGNCIEGVKTALESEDYESAAKFVQRFLQIDLQYKDSGSDQSEQLHASK 
EQLEGIAKKKLLAAIDQRDHPTILRFVRLYSPLGMETEGLQLYVGYLKKVIALRGRMEYE 
NVVELMEQGLGQVNFVGCLTNLFKDIVMAIEENDEILRGLCGEDGVAYAICELQEECDLR 
GSLILKKYMDFRKLAILASDINNSPNLNILPGGASEGPDPREVELYVEEILSLMQLGEDY 
TEFMVSKIKSLTSVDPELLPTATKAFRNKSFSKAIQDVTRYYVILEGFFMVENVRKAIRI 
DEHVPDSLTTSMVDDVFYVLQSCLRRAISTSNISSVIAVLSYAGSLLGNDYHEALQQKIR 
EPNLGARLFLGGIGVENTGTEIATALNNMDVSCEYILKLKHEIEEQCTEVFPAPADRERI 
KSCLSELGELSSTFKQLLNSGMEQLVATVTPRIRPVLDTVATISYELTETEYAENEVNDP 
WVQRLLHSVETNAAWLQPLMTSNNYDSFLHLIIDFIVKRLEVIMMQKRFSQLGGLQLDRD 
TRALVSHFSGMTQRTVRDKFARLTQMATILNLEKVSEILDFWGENSGPMTWRLTPAEVRR 
VLGLRVEFKPESIAALKL*
>AT4G01400.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s COG4 transport (InterProIPR013167) Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G461001) Has 11639 Blast hits to 3936 proteins in 180 species Archae - 0 Bacteria - 2 Metazoa - 206 Fungi - 141 Plants - 11004 Viruses - 0 Other Eukaryotes - 286 (source NCBI BLink) 
MIRRPIYDFAAVFRHLTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIAS 
QSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGE 
IFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKS 
SRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQ 
VNGAMELLDDMLNKGFVPDRLSYTTLLNSLCRKTQLREAYKLLCRMKLKGCNPDLVHYNT 
MILGFCREDRAMDARKVLDDMLSNGCSPNSVSYRTLIGGLCDQGMFDEGKKYLEEMISKG 
FSPHFSVSNCLVKGFCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIK 
LFLEDAVKEEITGDTRIVDVGIGLGSYLSSKLQMKRKNARERRRHL*
>AT4G01400.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s COG4 transport (InterProIPR013167) 
MIRRPIYDFAAVFRHLTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIAS 
QSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGE 
IFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKS 
SRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQ 
VNGAMELLDDMLNKGFVPDRLSYTTLLNSLCRKTQLREAYKLLCRMKLKGCNPDLVHYNT 
MILGFCREDRAMDARKVLDDMLSNGCSPNSVSYRTLIGGLCDQGMFDEGKKYLEEMISKG 
FSPHFSVSNCLVKGFCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIK 
LFLEDAVKEEITGDTRIVDVGIGLGSYLSSKLQMKRKNARERRRHL*
>AT4G01400.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G461001) 
MIRRPIYDFAAVFRHLTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIAS 
QSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGE 
IFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKS 
SRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQ 
VNGAMELLDDMLNKGFVPDRLSYTTLLNSLCRKTQLREAYKLLCRMKLKGCNPDLVHYNT 
MILGFCREDRAMDARKVLDDMLSNGCSPNSVSYRTLIGGLCDQGMFDEGKKYLEEMISKG 
FSPHFSVSNCLVKGFCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIK 
LFLEDAVKEEITGDTRIVDVGIGLGSYLSSKLQMKRKNARERRRHL*
>AT2G17980.1 |  ATSLY1 protein transporter 
MALNLRQKQTECVIRMLNLNQPLNPSGTANEEVYKILIYDRFCQNILSPLTHVKDLRKHG 
VTLFFLIDKDRQPVHDVPAVYFVQPTESNLQRIIADASRSLYDTFHLNFSSSIPRKFLEE 
LASGTLKSGSVEKVSKVHDQYLEFVTLEDNLFSLAQQSTYVQMNDPSAGEKEINEIIERV 
ASGLFCVLVTLGVVPVIRCPSGGPAEMVASLLDQKLRDHLLSKNNLFTEGGGFMSSFQRP 
LLCIFDRNFELSVGIQHDFRYRPLVHDVLGLKLNQLKVQGEKGPPKSFELDSSDPFWSAN 
STLEFPDVAVEIETQLNKYKRDVEEVNKKTGGGSGAEFDGTDLIGNIHTEHLMNTVKSLP 
ELTERKKVIDKHTNIATALLGQIKERSIDAFTKKESDMMMRGGIDRTELMAALKGKGTKM 
DKLRFAIMYLISTETINQSEVEAVEAALNEAEADTSAFQYVKKIKSLNASFAATSANSAS 
RSNIVDWAEKLYGQSISAVTAGVKNLLSSDQQLAVTRTVEALTEGKPNPEIDSYRFLDPR 
APKSSSSGGSHVKGPFREAIVFMIGGGNYVEYGSLQELTQRQLTVKNVIYGATEILNGGE 
LVEQLGLLGKKMGLGGPVASTSLSGGH*
>AT4G34450.1 |  coatomer gamma-2 subunit putative / gamma-2 coat protein putative / gamma-2 COP putative 
MAQPLVKKDDDHDDELEYSPFMGIEKGAVLQEARVFNDPQVDPRRCSQVITKLLYLLNQG 
ESFTKVEATEVFFSVTKLFQSKDTGLRRMVYLIIKELSPSSDEVIIVTSSLMKDMNSKID 
MYRANAIRVLCRIIDGTLLTQIERYLKQAIVDKNPVVSSAALVSGLHLLKTNPEIVKRWS 
NEVQEGIQSRSALVQFHALALLHQIRQNDRLAVSKLVGSLTRGSVRSPLAQCLLIRYTSQ 
VIRDMANHGQSGERPFYEFLESCLRHKAEMVILEAARAITELDGVTSRELTPAITVLQLF 
LSSPRPVLRFAAVRTLNKVAMTHPMAVTNCNIDMESLISDQNRSIATLAITTLLKTGNES 
SVERLMKQITNFMSDIADEFKIVVVDAIRSLCVKFPLKYRSLMTFLSNILREEGGFEYKR 
AIVDSIVTIIRDIPDAKESGLLHLCEFIEDCEFTYLSTQILHFLGIEGPNTSDPSKYIRY 
IYNRVHLENATVRAAAVSTLAKFGFMVESLKPRITVLLKRCIYDSDDEVRDRATLYLSVL 
GGDGTVDTDKESKDFLFGSLEVPLVNMETSLKNYEPSEEAFDINSVPKEVKSQPLAEKKA 
QGKKPTGLGAPPAAPASGFDGYERLLSSIPEFAAFGKLFKSSLPVELTEAETEYAVNVVK 
HIFDSHVVFQYNCTNTIPEQLLERVNVIVDASEAEEFSEVTSKALNSLPYDSPGQAFVVF 
EKPAGVPAVGKFSNTLTFVVKEVDPSTGEAEDDGVEDEYQLEDLEVVAGDYMVKVGVSNF 
RNAWESMDEEDERVDEYGLGQRESLGEAVKAVMDLLGMQTCEGTETIPLNARSHTCLLSG 
VYIGNVKVLVRAQFGMDSSKDIAMKLTVRAEDVSVAEAIHEIVASG*
>AT2G22425.1 |  peptidase 
MDWQGQKLVEQLMQILLVISGVVAVVVGYTTESFRTMMLIYAGGVVLTTLVTVPNWPFYN 
LHPLKWLDPSEAEKHPKPEVVSVASKKKFSKK*
>AT2G22425.2 |  peptidase 
MDWQGQKLVEQLMQILLVISGVVAVVVGYTTESFRTMMLIYAGGVVLTTLVTVPNWPFYN 
LHPLKWLDPSEAEKHPKPEVVSVASKKKFSKK*