>AT4G01400.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s COG4 transport (InterProIPR013167) Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G461001) Has 11639 Blast hits to 3936 proteins in 180 species Archae - 0 Bacteria - 2 Metazoa - 206 Fungi - 141 Plants - 11004 Viruses - 0 Other Eukaryotes - 286 (source NCBI BLink) 
MIRRPIYDFAAVFRHLTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIAS 
QSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGE 
IFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKS 
SRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQ 
VNGAMELLDDMLNKGFVPDRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFSVSNCLVKG 
FCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDAVKEEITGD 
TRIVDVGIENKKMPEIEQDDAAAETVDSSTVKFGTPEALEYVRSLTDVGAMTRLLHECIA 
YQRSLDSDLDTLLSQRTELDRNLVQLQRSAEILDIVKADADHMLGNVRSTCDLADQVSGK 
VRELDLAQSRVNVTLSRIDAIVERGNCIEGVKTALESEDYESAAKFVQRFLQIDLQYKDS 
GSDQSEQLHASKEQLEGIAKKKLLAAIDQRDHPTILRFVRLYSPLGMETEGLQLYVGYLK 
KVIALRGRMEYENVVELMEQGLGQVNFVGCLTNLFKDIVMAIEENDEILRGLCGEDGVAY 
AICELQEECDLRGSLILKKYMDFRKLAILASDINNSPNLNILPGGASEGPDPREVELYVE 
EILSLMQLGEDYTEFMVSKIKSLTSVDPELLPTATKAFRNKSFSKAIQDVTRYYVILEGF 
FMVENVRKAIRIDEHVPDSLTTSMVDDVFYVLQSCLRRAISTSNISSVIAVLSYAGSLLG 
NDYHEALQQKIREPNLGARLFLGGIGVENTGTEIATALNNMDVSCEYILKLKHEIEEQCT 
EVFPAPADRERIKSCLSELGELSSTFKQLLNSGMEQLVATVTPRIRPVLDTVATISYELT 
ETEYAENEVNDPWVQRLLHSVETNAAWLQPLMTSNNYDSFLHLIIDFIVKRLEVIMMQKR 
FSQLGGLQLDRDTRALVSHFSGMTQRTVRDKFARLTQMATILNLEKVSEILDFWGENSGP 
MTWRLTPAEVRRVLGLRVEFKPESIAALKL*
>AT4G01400.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s COG4 transport (InterProIPR013167) 
MIRRPIYDFAAVFRHLTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIAS 
QSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGE 
IFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKS 
SRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQ 
VNGAMELLDDMLNKGFVPDRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFSVSNCLVKG 
FCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDAVKEEITGD 
TRIVDVGIENKKMPEIEQDDAAAETVDSSTVKFGTPEALEYVRSLTDVGAMTRLLHECIA 
YQRSLDSDLDTLLSQRTELDRNLVQLQRSAEILDIVKADADHMLGNVRSTCDLADQVSGK 
VRELDLAQSRVNVTLSRIDAIVERGNCIEGVKTALESEDYESAAKFVQRFLQIDLQYKDS 
GSDQSEQLHASKEQLEGIAKKKLLAAIDQRDHPTILRFVRLYSPLGMETEGLQLYVGYLK 
KVIALRGRMEYENVVELMEQGLGQVNFVGCLTNLFKDIVMAIEENDEILRGLCGEDGVAY 
AICELQEECDLRGSLILKKYMDFRKLAILASDINNSPNLNILPGGASEGPDPREVELYVE 
EILSLMQLGEDYTEFMVSKIKSLTSVDPELLPTATKAFRNKSFSKAIQDVTRYYVILEGF 
FMVENVRKAIRIDEHVPDSLTTSMVDDVFYVLQSCLRRAISTSNISSVIAVLSYAGSLLG 
NDYHEALQQKIREPNLGARLFLGGIGVENTGTEIATALNNMDVSCEYILKLKHEIEEQCT 
EVFPAPADRERIKSCLSELGELSSTFKQLLNSGMEQLVATVTPRIRPVLDTVATISYELT 
ETEYAENEVNDPWVQRLLHSVETNAAWLQPLMTSNNYDSFLHLIIDFIVKRLEVIMMQKR 
FSQLGGLQLDRDTRALVSHFSGMTQRTVRDKFARLTQMATILNLEKVSEILDFWGENSGP 
MTWRLTPAEVRRVLGLRVEFKPESIAALKL*
>AT4G01400.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G461001) 
MIRRPIYDFAAVFRHLTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIAS 
QSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGE 
IFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKS 
SRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQ 
VNGAMELLDDMLNKGFVPDRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFSVSNCLVKG 
FCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDAVKEEITGD 
TRIVDVGIENKKMPEIEQDDAAAETVDSSTVKFGTPEALEYVRSLTDVGAMTRLLHECIA 
YQRSLDSDLDTLLSQRTELDRNLVQLQRSAEILDIVKADADHMLGNVRSTCDLADQVSGK 
VRELDLAQSRVNVTLSRIDAIVERGNCIEGVKTALESEDYESAAKFVQRFLQIDLQYKDS 
GSDQSEQLHASKEQLEGIAKKKLLAAIDQRDHPTILRFVRLYSPLGMETEGLQLYVGYLK 
KVIALRGRMEYENVVELMEQGLGQVNFVGCLTNLFKDIVMAIEENDEILRGLCGEDGVAY 
AICELQEECDLRGSLILKKYMDFRKLAILASDINNSPNLNILPGGASEGPDPREVELYVE 
EILSLMQLGEDYTEFMVSKIKSLTSVDPELLPTATKAFRNKSFSKAIQDVTRYYVILEGF 
FMVENVRKAIRIDEHVPDSLTTSMVDDVFYVLQSCLRRAISTSNISSVIAVLSYAGSLLG 
NDYHEALQQKIREPNLGARLFLGGIGVENTGTEIATALNNMDVSCEYILKLKHEIEEQCT 
EVFPAPADRERIKSCLSELGELSSTFKQLLNSGMEQLVATVTPRIRPVLDTVATISYELT 
ETEYAENEVNDPWVQRLLHSVETNAAWLQPLMTSNNYDSFLHLIIDFIVKRLEVIMMQKR 
FSQLGGLQLDRDTRALVSHFSGMTQRTVRDKFARLTQMATILNLEKVSEILDFWGENSGP 
MTWRLTPAEVRRVLGLRVEFKPESIAALKL*
>AT4G01400.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s COG4 transport (InterProIPR013167) Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G461001) Has 11639 Blast hits to 3936 proteins in 180 species Archae - 0 Bacteria - 2 Metazoa - 206 Fungi - 141 Plants - 11004 Viruses - 0 Other Eukaryotes - 286 (source NCBI BLink) 
MPEIEQDDAAAETVDSSTVKFGTPEALEYVRSLTDVGAMTRLLHECIAYQRSLDSDLDTL 
LSQRTELDRNLVQLQRSAEILDIVKADADHMLGNVRSTCDLADQVSGKVRELDLAQSRVN 
VTLSRIDAIVERGNCIEGVKTALESEDYESAAKFVQRFLQIDLQYKDSGSDQSEQLHASK 
EQLEGIAKKKLLAAIDQRDHPTILRFVRLYSPLGMETEGLQLYVGYLKKVIALRGRMEYE 
NVVELMEQGLGQVNFVGCLTNLFKDIVMAIEENDEILRGLCGEDGVAYAICELQEECDLR 
GSLILKKYMDFRKLAILASDINNSPNLNILPGGASEGPDPREVELYVEEILSLMQLGEDY 
TEFMVSKIKSLTSVDPELLPTATKAFRNKSFSKAIQDVTRYYVILEGFFMVENVRKAIRI 
DEHVPDSLTTSMVDDVFYVLQSCLRRAISTSNISSVIAVLSYAGSLLGNDYHEALQQKIR 
EPNLGARLFLGGIGVENTGTEIATALNNMDVSCEYILKLKHEIEEQCTEVFPAPADRERI 
KSCLSELGELSSTFKQLLNSGMEQLVATVTPRIRPVLDTVATISYELTETEYAENEVNDP 
WVQRLLHSVETNAAWLQPLMTSNNYDSFLHLIIDFIVKRLEVIMMQKRFSQLGGLQLDRD 
TRALVSHFSGMTQRTVRDKFARLTQMATILNLEKVSEILDFWGENSGPMTWRLTPAEVRR 
VLGLRVEFKPESIAALKL*
>AT4G01400.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s COG4 transport (InterProIPR013167) 
MPEIEQDDAAAETVDSSTVKFGTPEALEYVRSLTDVGAMTRLLHECIAYQRSLDSDLDTL 
LSQRTELDRNLVQLQRSAEILDIVKADADHMLGNVRSTCDLADQVSGKVRELDLAQSRVN 
VTLSRIDAIVERGNCIEGVKTALESEDYESAAKFVQRFLQIDLQYKDSGSDQSEQLHASK 
EQLEGIAKKKLLAAIDQRDHPTILRFVRLYSPLGMETEGLQLYVGYLKKVIALRGRMEYE 
NVVELMEQGLGQVNFVGCLTNLFKDIVMAIEENDEILRGLCGEDGVAYAICELQEECDLR 
GSLILKKYMDFRKLAILASDINNSPNLNILPGGASEGPDPREVELYVEEILSLMQLGEDY 
TEFMVSKIKSLTSVDPELLPTATKAFRNKSFSKAIQDVTRYYVILEGFFMVENVRKAIRI 
DEHVPDSLTTSMVDDVFYVLQSCLRRAISTSNISSVIAVLSYAGSLLGNDYHEALQQKIR 
EPNLGARLFLGGIGVENTGTEIATALNNMDVSCEYILKLKHEIEEQCTEVFPAPADRERI 
KSCLSELGELSSTFKQLLNSGMEQLVATVTPRIRPVLDTVATISYELTETEYAENEVNDP 
WVQRLLHSVETNAAWLQPLMTSNNYDSFLHLIIDFIVKRLEVIMMQKRFSQLGGLQLDRD 
TRALVSHFSGMTQRTVRDKFARLTQMATILNLEKVSEILDFWGENSGPMTWRLTPAEVRR 
VLGLRVEFKPESIAALKL*
>AT4G01400.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G461001) 
MPEIEQDDAAAETVDSSTVKFGTPEALEYVRSLTDVGAMTRLLHECIAYQRSLDSDLDTL 
LSQRTELDRNLVQLQRSAEILDIVKADADHMLGNVRSTCDLADQVSGKVRELDLAQSRVN 
VTLSRIDAIVERGNCIEGVKTALESEDYESAAKFVQRFLQIDLQYKDSGSDQSEQLHASK 
EQLEGIAKKKLLAAIDQRDHPTILRFVRLYSPLGMETEGLQLYVGYLKKVIALRGRMEYE 
NVVELMEQGLGQVNFVGCLTNLFKDIVMAIEENDEILRGLCGEDGVAYAICELQEECDLR 
GSLILKKYMDFRKLAILASDINNSPNLNILPGGASEGPDPREVELYVEEILSLMQLGEDY 
TEFMVSKIKSLTSVDPELLPTATKAFRNKSFSKAIQDVTRYYVILEGFFMVENVRKAIRI 
DEHVPDSLTTSMVDDVFYVLQSCLRRAISTSNISSVIAVLSYAGSLLGNDYHEALQQKIR 
EPNLGARLFLGGIGVENTGTEIATALNNMDVSCEYILKLKHEIEEQCTEVFPAPADRERI 
KSCLSELGELSSTFKQLLNSGMEQLVATVTPRIRPVLDTVATISYELTETEYAENEVNDP 
WVQRLLHSVETNAAWLQPLMTSNNYDSFLHLIIDFIVKRLEVIMMQKRFSQLGGLQLDRD 
TRALVSHFSGMTQRTVRDKFARLTQMATILNLEKVSEILDFWGENSGPMTWRLTPAEVRR 
VLGLRVEFKPESIAALKL*
>AT4G01400.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s COG4 transport (InterProIPR013167) Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G461001) Has 11639 Blast hits to 3936 proteins in 180 species Archae - 0 Bacteria - 2 Metazoa - 206 Fungi - 141 Plants - 11004 Viruses - 0 Other Eukaryotes - 286 (source NCBI BLink) 
MIRRPIYDFAAVFRHLTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIAS 
QSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGE 
IFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKS 
SRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQ 
VNGAMELLDDMLNKGFVPDRLSYTTLLNSLCRKTQLREAYKLLCRMKLKGCNPDLVHYNT 
MILGFCREDRAMDARKVLDDMLSNGCSPNSVSYRTLIGGLCDQGMFDEGKKYLEEMISKG 
FSPHFSVSNCLVKGFCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIK 
LFLEDAVKEEITGDTRIVDVGIGLGSYLSSKLQMKRKNARERRRHL*
>AT4G01400.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s COG4 transport (InterProIPR013167) 
MIRRPIYDFAAVFRHLTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIAS 
QSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGE 
IFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKS 
SRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQ 
VNGAMELLDDMLNKGFVPDRLSYTTLLNSLCRKTQLREAYKLLCRMKLKGCNPDLVHYNT 
MILGFCREDRAMDARKVLDDMLSNGCSPNSVSYRTLIGGLCDQGMFDEGKKYLEEMISKG 
FSPHFSVSNCLVKGFCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIK 
LFLEDAVKEEITGDTRIVDVGIGLGSYLSSKLQMKRKNARERRRHL*
>AT4G01400.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G461001) 
MIRRPIYDFAAVFRHLTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIAS 
QSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGE 
IFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKS 
SRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQ 
VNGAMELLDDMLNKGFVPDRLSYTTLLNSLCRKTQLREAYKLLCRMKLKGCNPDLVHYNT 
MILGFCREDRAMDARKVLDDMLSNGCSPNSVSYRTLIGGLCDQGMFDEGKKYLEEMISKG 
FSPHFSVSNCLVKGFCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIK 
LFLEDAVKEEITGDTRIVDVGIGLGSYLSSKLQMKRKNARERRRHL*
>AT1G49240.1 |  ACT8 (ACTIN 8) copper ion binding / structural constituent of cytoskeleton 
MADADDIQPIVCDNGTGMVKAGFAGDDAPRAVFPSVVGRPRHHGVMVGMNQKDAYVGDEA 
QSKRGILTLKYPIEHGVVSNWDDMEKIWHHTFYNELRIAPEEHPVLLTEAPLNPKANREK 
MTQIMFETFNSPAMYVAIQAVLSLYASGRTTGIVLDSGDGVSHTVPIYEGFSLPHAILRL 
DLAGRDLTDYLMKILTERGYMFTTTAEREIVRDIKEKLSFVAVDYEQEMETSKTSSSIEK 
NYELPDGQVITIGAERFRCPEVLFQPSFVGMEAAGIHETTYNSIMKCDVDIRKDLYGNIV 
LSGGTTMFSGIADRMSKEITALAPSSMKIKVVAPPERKYSVWIGGSILASLSTFQQMWIS 
KAEYDEAGPGIVHRKCF*
>AT1G02130.1 |  ARA-5 (ARABIDOPSIS RAS 5) GTP binding 
MNPEYDYLFKLLLIGDSGVGKSCLLLRFSDDSYVESYISTIGVDFKIRTVEQDGKTIKLQ 
IWDTAGQERFRTITSSYYRGAHGIIIVYDVTDEESFNNVKQWLSEIDRYASDNVNKLLVG 
NKSDLTENRAIPYETAKAFADEIGIPFMETSAKDATNVEQAFMAMSASIKERMASQPAGN 
NARPPTVQIRGQPVAQKNGCCST*
>AT1G31780.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Conserved oligomeric complex COG6 (InterProIPR010490) Has 281 Blast hits to 279 proteins in 131 species Archae - 0 Bacteria - 2 Metazoa - 132 Fungi - 106 Plants - 21 Viruses - 0 Other Eukaryotes - 20 (source NCBI BLink) 
MASTVGLAPGLSRKLKKVLDCRTDSPDLVASLNALSSFYDENSAHARRNLRSTIEKRALQ 
INSEFLNAADSTQIALDRVEEEVNALADCCDKIAAALSSSAATTSDIISTTERLKQELEV 
TTQRQEIVNCFLRDYQLSNEEIKALREDELNENFFQALSHVQEIHSNCKLLLRTHHQRAG 
LELMDMMAVYQEGAYERLCRWVQAECRKLGDTDNPEVSELLRTAVRCLKERPVLFKYCAE 
EVGNLRHNALFRRFISALTRGGPGGMPRPIEVHAHDPLRYVGDMLGWLHQALASERELVH 
ALFDIDSADHQSNAKNTSENIALKAGESDFTFVLDRIFEGVCRPFKVRVEQVLQSQPSLI 
ISYKLTNTLEFYSYTISDLLGRDTALCNTIGMVKDAAQKTFFDILKTRGEKLLRYPPPVA 
VDLSPPPAVREGVSLTLEIIENYNSMMVSASGEKPAFDPVLSALLDPIIKMCEQAAEAHK 
SKKSGQLPRRSRTSSDSSQLTSVDALLSSSPSPPQNNETPSKIFLINCLCAIQQPLLRHD 
VASQYVTNIGLMIENHINLLVQNEVDTLLHKCGLSDKMQIFRSSTSELPLSERQDTSPAM 
LSECLKAFFGLVLGSEGSLPEFEQIQVPKLRSEACVRVAKTLAEAYEVIYQAVTDQQNGY 
PDPKSLARHPPDQIRTILGI*
>AT1G73430.1 |  sec34-like family protein 
MATKAASSSSLPKSGAISKGYNFASTWEQSAPLTEQQQAAIVSLSHAVAERPFPANLVHE 
HVHRPENGLSVSVEDTHLGDSGAIEAVLVNTNQFYKWFTDLESAMKSETEEKYRHYVSTL 
TERIQTCDNILHQVDETLDLFNELQLQHQGVTTKTKTLHDACDRLLMEKQKLMEFAEALR 
SKLNYFDELENVSSNFYSPNMNVSNSNFLPLLKRLDECISYIEDNPQYAESSVYLLKFRQ 
LQSRALGMIRTYILAVLKTAASQVQAAFRGTGGNKTSVSEGVEASVIYVRFKAAANELKP 
VLEEIESRSARKEYVQILAECHRLYCEQRLSLVKGIVHQRVSDFAKKEALPSLTRSGCAY 
LMQVCHMEHQLFTHFFPASSEEVSSLAPLVDPLSTYLYDILRPKLIHEANIDLLCELVHI 
LKVEVLGDQSARQSEPLAGLRPTLQRILADVNERLTFRARTYIRDEIANYTPSDEDLDYP 
AKLEGSPNTTSETDLRDDENADVFKTWYPPLEKTLSCLSKLYRCLEQAVFTGLAQEAVEV 
CSLSIQKASKLIIKRSTTMDGQLFLIKHLLILREQIAPFDIEFSVTHKELDFSHLLEHLR 
RILRGQASLFDWSRSTSLARTLSPRVLESQIDAKKELEKCLKTTCEEFIMSVTKLVVDPM 
LSFVTKVTAIKVALSSGTQNHKVDSVMAKPLKEQAFATPDKVVELVQKVYAAIQQELLPI 
LAKMKLYLQNPSTRTILFKPIKTNIVEAHTQVESLLKAEYSAEEQANINMISIQDLQTQL 
DNFL*
>AT1G73430.1 |  sec34-like family protein 
MATKAASSSSLPKSGAISKGYNFASTWEQSAPLTEQQQAAIVSLSHAVAERPFPANLVHE 
HVHRPENGLSVSVEDTHLGDSGAIEAVLVNTNQFYKWFTDLESAMKSETEEKYRHYVSTL 
TERIQTCDNILHQVDETLDLFNELQLQHQGVTTKTKTLHDACDRLLMEKQKLMEFAEALR 
SKLNYFDELENVSSNFYSPNMNVSNSNFLPLLKRLDECISYIEDNPQYAESSVYLLKFRQ 
LQSRALGMIRTYILAVLKTAASQVQAAFRGTGGNKTSVSEGVEASVIYVRFKAAANELKP 
VLEEIESRSARKEYVQILAECHRLYCEQRLSLVKGIVHQRVSDFAKKEALPSLTRSGCAY 
LMQVCHMEHQLFTHFFPASSEEVSSLAPLVDPLSTYLYDILRPKLIHEANIDLLCELVHI 
LKVEVLGDQSARQSEPLAGLRPTLQRILADVNERLTFRARTYIRDEIANYTPSDEDLDYP 
AKLEGSPNTTSETDLRDDENADVFKTWYPPLEKTLSCLSKLYRCLEQAVFTGLAQEAVEV 
CSLSIQKASKLIIKRSTTMDGQLFLIKHLLILREQIAPFDIEFSVTHKELDFSHLLEHLR 
RILRGQASLFDWSRSTSLARTLSPRVLESQIDAKKELEKCLKTTCEEFIMSVTKLVVDPM 
LSFVTKVTAIKVALSSGTQNHKVDSVMAKPLKEQAFATPDKVVELVQKVYAAIQQELLPI 
LAKMKLYLQNPSTRTILFKPIKTNIVEAHTQVESLLKAEYSAEEQANINMISIQDLQTQL 
DNFL*
>AT1G73430.2 |  sec34-like family protein 
MATKAASSSSLPKSGAISKGYNFASTWEQSAPLTEQQQAAIVSLSHAVAERPFPANLVHE 
HVHRPENGLSVSVEDTHLGDSGAIEAVLVNTNQFYKWFTDLESAMKSETEEKYRHYVSTL 
TERIQTCDNILHQVDETLDLFNELQLQHQGVTTKTKTLHDACDRLLMEKQKLMEFAEALR 
SKLNYFDELENVSSNFYSPNMNVSNSNFLPLLKRLDECISYIEDNPQYAESSVYLLKFRQ 
LQSRALGMIRTYILAVLKTAASQVQAAFRGTGGNKTSVSEGVEASVIYVRFKAAANELKP 
VLEEIESRSARKEYVQILAECHRLYCEQRLSLVKGIVHQRVSDFAKKEALPSLTRSGCAY 
LMQVCHMEHQLFTHFFPASSEEVSSLAPLVDPLSTYLYDILRPKLIHEANIDLLCELVHI 
LKVEVLGDQSARQSEPLAGLRPTLQRILADVNERLTFRARTYIRDEIANYTPSDEDLDYP 
AKLEGSPNTTSETDLRDDENADVFKTWYPPLEKTLSCLSKLYRCLEQAVFTGLAQEAVEV 
CSLSIQKASKLIIKRSTTMDGQLFLIKHLLILREQIAPFDIEFSVTHKELDFSHLLEHLR 
RILRGQASLFDWSRSTSLARTLSPRVLESQIDAKKELEKCLKTTCEEFIMSVTKLVVDPM 
LSFVTKVTAIKVALSSGTQNHKVDSVMAKPLKEQAFATPDKVVELVQKVYAAIQQELLPI 
LAKMKLYLQNPSTRTILFKPIKTNIVEAHTQVESLLKAEYSAEEQANINMISIQDLQTQL 
DNFL*
>AT1G73430.2 |  sec34-like family protein 
MATKAASSSSLPKSGAISKGYNFASTWEQSAPLTEQQQAAIVSLSHAVAERPFPANLVHE 
HVHRPENGLSVSVEDTHLGDSGAIEAVLVNTNQFYKWFTDLESAMKSETEEKYRHYVSTL 
TERIQTCDNILHQVDETLDLFNELQLQHQGVTTKTKTLHDACDRLLMEKQKLMEFAEALR 
SKLNYFDELENVSSNFYSPNMNVSNSNFLPLLKRLDECISYIEDNPQYAESSVYLLKFRQ 
LQSRALGMIRTYILAVLKTAASQVQAAFRGTGGNKTSVSEGVEASVIYVRFKAAANELKP 
VLEEIESRSARKEYVQILAECHRLYCEQRLSLVKGIVHQRVSDFAKKEALPSLTRSGCAY 
LMQVCHMEHQLFTHFFPASSEEVSSLAPLVDPLSTYLYDILRPKLIHEANIDLLCELVHI 
LKVEVLGDQSARQSEPLAGLRPTLQRILADVNERLTFRARTYIRDEIANYTPSDEDLDYP 
AKLEGSPNTTSETDLRDDENADVFKTWYPPLEKTLSCLSKLYRCLEQAVFTGLAQEAVEV 
CSLSIQKASKLIIKRSTTMDGQLFLIKHLLILREQIAPFDIEFSVTHKELDFSHLLEHLR 
RILRGQASLFDWSRSTSLARTLSPRVLESQIDAKKELEKCLKTTCEEFIMSVTKLVVDPM 
LSFVTKVTAIKVALSSGTQNHKVDSVMAKPLKEQAFATPDKVVELVQKVYAAIQQELLPI 
LAKMKLYLQNPSTRTILFKPIKTNIVEAHTQVESLLKAEYSAEEQANINMISIQDLQTQL 
DNFL*
>AT1G67930.1 |  Golgi transport complex protein-related 
MALPPSSPSPSSPSLQRLSTFKNPPPSSLSSGAPPPQTPSSSSSSPLDSFATDPILSPFL 
SSSFSSASFSSAALASGSPASTAERLHQAIRLLDSQLRNDVISRHPELLAQLSSLSHADV 
SLSSLRSSVSSLQSSIRRVRSDLSEPIKSIRSKSVQLSNLHTATELLSHSVRTLRLSKKL 
RDLADFPDPDKIDLTKAAQFHFEILTMCKEYDLFGIDVIDEEIKFVTEIGEKLRSEAMKV 
LERGMEGLNQAEVGTGLQVFYNLGELKSTVDQLVNKYKGMAVKSVSVAMDMKAITSGSGG 
GFGPGGIRSSGSPHIGGGAKVREALWQRMASCMEQLCSLVVAVWHLQRVLSKKRDPFTHV 
LLLDEVIKEGDSMLTDRVWDALVKAFTSQMKSAYTASSFVKEIFTMGYPKLVSMIENLLE 
RISRDTDVKGVLPAINLERKEQMVACIAIFQTAFLSLCFGRLSDLVNSIFPMSSRGSLPS 
KEQISQVLSHIQDEIEAVHPDARLTLLVLREIGKALSNLAQRAECQISTGPETRQISGPA 
TSTQIRNFTLCQHLQGIHTHISSMVADLPSIATDVLSPYLAAIYDAACEPVTPLFKAMRD 
KLESCILQIHDQNFGADDADMDNNASSYMEELQRSILHFRKEFLSRLLPSAANANTAGTE 
SICTRLTRQMASRVLIFYIRHASLVRPLSEWGKLRMAKDMAELELAVGQNLFPVEQLGAP 
YRALRAFRPLVFLETSQMGSSPLINDLPPSIVLHHLYTRGPDELESPMQKNRLSPKQYSL 
WLDNQREDQIWKGIKATLDDYAVKIRSRGDKEFSPVYPLMLQIGSSLTQENL*
>AT4G24840.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN protein transport Golgi organization LOCATED IN vacuole EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s COG complex component COG2 (InterProIPR009316) Has 214 Blast hits to 204 proteins in 99 species Archae - 0 Bacteria - 0 Metazoa - 105 Fungi - 56 Plants - 22 Viruses - 0 Other Eukaryotes - 31 (source NCBI BLink) 
MSDLVATSPSPSSAPRSATDFFSDPYDSHPLWFKPSLFLSPNFDSESYISELRTFVPFDT 
LRSELRSHLASLNRELVDLINRDYADFVNLSTKLVDIDAAVVRMRAPLLELREKITGFRG 
SVEAALFALRNGLQQRSDAAAAREVLELLLDTFHVVSKVEKLIKVLPSTPSDWQNEDANS 
MGRSSMNDENSTQQDGTTMRETQSMLLERIASEMNRLKFYMAHAQNLPFIENMEKRIQSA 
SVLLDASLGHCFIDGLNNSDTSVLYNCLRAYAAIDNTNAAEEIFRTTIVAPFIQKIITHE 
TTTNAAGTSEDELENDYKQIKHFIAKDCKMLLEISSTDKSGLHVFDFLANSILKEVLWAI 
QKVKPGAFSPGRPTEFLKNYKASLDFLAYLEGYCPSRSAVTKFRAEAICVEFMKQWNVGV 
YFSLRFQEIAGALDSALTSPSLVFIQDSDKESSLNLILRQSDTLLECLRSCWKEDVLVFS 
AADKFLRLTLQLLSRYSFWVSSALNNRKSNASPSPGCEWAVSATAEDFVYVIHDVNCLVS 
EVCGDYLGHISQYLSSSSTEVLDVVRISIEQGGVSLEKVLPLLTKTIIDVIVDKSVEDLR 
QLRGITATFRMTNKPLPVRHSPYVVGLLRPVKAFLEGDKARNYLTQKTKEELLHGSVSEI 
TRRYYELAADVVSVARKTQSSLQKLRQNAQRRGGAASGVSDQNVSETDKMCMQLFLDIQE 
YGRNVSALGLKPADIPEYCSFWQCVAPADRQNSISV*
>AT3G07680.1 |  emp24/gp25L/p24 family protein 
MSLKGTIVLLGLLWSFQATLGIRFVIDREECFSHKAEYEGDTLHVSFVVIKSDSQWHFNE 
DGVDLVIHGPTGEQIHDFREQISAKHDFVVQKKGVYRFCFTNKSPYHETIDFDVQLGHFA 
YYDQHAKDEHFTPLMEQISKLEEALYNIQFEQHWLEAQTDRQAIVNENMSKRAVHKALFE 
SFALIGASFLQVYLLRRLFERKLGMSRV*
>AT5G51430.1 |  EYE (EMBRYO YELLOW) 
MMLDLGPFSDEKFDAKRWVNSSCQARHPQDSLEKHLVDLEMKLQIASEEIGASLEEQSGG 
ALLRVPRATRDVLRLRDDAVSLRGSVAGILQKLKKAEGSSADCIAALARVDNVKQRMEAA 
YKTLQDAAGLTQLSSTVEDVFASGDLPRAAETLASMRNCLSAVGEVAEFANVRKQLEVLE 
DRLEAMVQPRLTDALTYHKVDVAQDLRVILIRIGRFKSLELQYSKVRLKPIKQLWEDFDT 
KQRANKLANERSESQRLSSGDEFQSTSSQTSFASWLTSFYDELLLYLEQEWKWCMVAFPD 
DYMTLVPKLLVETMGVLGASFVSRLNLATGDAVPETKALAKGVMDLLSGDLPKGINIQTK 
HLEALIELHNVTGSFARNIQHLFAESELRILIDTLKAVYSPFESFKQKYGKMERAILSSE 
IAVVDLRGAVTRGVGAQGIELSETVRRMEESIPQVVVLLEAAVERCIGFTGGSEADELIL 
ALDDIMLQYISMLQETLKSLRVVCGVDGTGDGVGSKKDASAEKRESSRKMDLTSNEEWSI 
VQGALQILTVADCLTSRSSVFEASLRATLARLNSSLSISLFGTNLDHNLSHLKSEQTAGD 
LSMAGRASMDVAAIRLVDVPEKAHKLLNLLEQSKDPRFHALPLASQRVAAFADTVNELVY 
DVLISKVRQRLGEVSRLPIWSSVEEQTAFPLPNFSSYPQSYVTSVGEYLLTLPQQLEPLA 
EGISTNGDSNNEDAQFFATEWMFKVAEGATALYMDQLRGIQYISDRGAQQLSVDIEYLSN 
VLSALSMPIPPVLATFQTCLATPRGELKDVMKSEAGNELDCPTANLVCKMRRISFD*
>AT3G22845.1 |  emp24/gp25L/p24 protein-related 
MERRQAKIHVFVLIGLILLNSINQISSLSVTVNDEECVQEYVLYEGDTVSGNFVVVDHDI 
FWGSDHPGLDFTVTSPAGNIVQTLKGTSGDKFEFKAPKSGMYKFCFHNPYSTPETVSFYI 
HVGHIPNEHDLAKDEHLDPVNVKIAELREALESVVAEQKYLKARDTRHRHTNESTRKRVI 
FYTVGEYIFLAAASGLQVLYIRKLFSKSVAYNRV*
>AT3G12110.1 |  ACT11 (actin-11) structural constituent of cytoskeleton 
MADGEDIQPLVCDNGTGMVKAGFAGDDAPRAVFPSIVGRPRHTGVMVGMGQKDAYVGDEA 
QSKRGILTLKYPIEHGIVSNWDDMEKIWHHTFYNELRVAPEEHPVLLTEAPLNPKANREK 
MTQIMFETFNTPAMYVAIQAVLSLYASGRTTGIVLDSGDGVSHTVPIYEGYALPHAILRL 
DLAGRDLTDYLMKILTERGYSFTTSAEREIVRDVKEKLAYIALDYEQEMETANTSSSVEK 
SYELPDGQVITIGGERFRCPEVLFQPSLVGMEAAGIHETTYNSIMKCDVDIRKDLYGNIV 
LSGGTTMFPGIADRMSKEITALAPSSMKIKVVAPPERKYSVWIGGSILASLSTFQQMWIA 
KAEYDESGPSIVHRKCF*
>AT3G24350.1 |  SYP32 (SYNTAXIN OF PLANTS 32) SNAP receptor 
MSARHGQSSYRDRSDEFFKIVETLRRSIAPAPAANNVPYGNNRNDGARREDLINKSEFNK 
RASHIGLAINQTSQKLSKLAKLAKRTSVFDDPTQEIQELTVVIKQEISALNSALVDLQLF 
RSSQNDEGNNSRDRDKSTHSATVVDDLKYRLMDTTKEFKDVLTMRTENMKVHESRRQLFS 
SNASKESTNPFVRQRPLAAKAAASESVPLPWANGSSSSSSQLVPWKPGEGESSPLLQQSQ 
QQQQQQQQQMVPLQDTYMQGRAEALHTVESTIHELSSIFTQLATMVSQQGEIAIRIDQNM 
EDTLANVEGAQSQLARYLNSISSNRWLMMKIFFVLIAFLMIFLFFVA*
>AT5G16300.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 204 Blast hits to 183 proteins in 76 species Archae - 0 Bacteria - 0 Metazoa - 127 Fungi - 34 Plants - 24 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASARNSWKAFTNGEQSQTSDLEENSNFGVAFKSFMQES 
TLKLGSILTDGQVGIFKDRSAAAMSTFGDILPAQAAGLLSSFTNTRSE*
>AT5G16300.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 199 Blast hits to 182 proteins in 76 species Archae - 0 Bacteria - 0 Metazoa - 122 Fungi - 34 Plants - 24 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASARNSWKAFTNGEQSQTSDLEENSNFGVAFKSFMQES 
TLKLGSILTDGQVGIFKDRSAAAMSTFGDILPAQAAGLLSSFTNTRSE*
>AT5G16300.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 201 Blast hits to 182 proteins in 75 species Archae - 0 Bacteria - 0 Metazoa - 125 Fungi - 34 Plants - 23 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASARNSWKAFTNGEQSQTSDLEENSNFGVAFKSFMQES 
TLKLGSILTDGQVGIFKDRSAAAMSTFGDILPAQAAGLLSSFTNTRSE*
>AT5G16300.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 204 Blast hits to 183 proteins in 76 species Archae - 0 Bacteria - 0 Metazoa - 127 Fungi - 34 Plants - 24 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASESTLKLGSILTDGQVGIFKDRSAAAMSTFGDILPAQ 
AAGLLSSFTNTRSE*
>AT5G16300.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 199 Blast hits to 182 proteins in 76 species Archae - 0 Bacteria - 0 Metazoa - 122 Fungi - 34 Plants - 24 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASESTLKLGSILTDGQVGIFKDRSAAAMSTFGDILPAQ 
AAGLLSSFTNTRSE*
>AT5G16300.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 201 Blast hits to 182 proteins in 75 species Archae - 0 Bacteria - 0 Metazoa - 125 Fungi - 34 Plants - 23 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASESTLKLGSILTDGQVGIFKDRSAAAMSTFGDILPAQ 
AAGLLSSFTNTRSE*
>AT5G16300.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 204 Blast hits to 183 proteins in 76 species Archae - 0 Bacteria - 0 Metazoa - 127 Fungi - 34 Plants - 24 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASARNSWKAFTNGEQSQTSDLEENSNFGVAFKSFMQVK 
HIETGININ*
>AT5G16300.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 199 Blast hits to 182 proteins in 76 species Archae - 0 Bacteria - 0 Metazoa - 122 Fungi - 34 Plants - 24 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASARNSWKAFTNGEQSQTSDLEENSNFGVAFKSFMQVK 
HIETGININ*
>AT5G16300.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 201 Blast hits to 182 proteins in 75 species Archae - 0 Bacteria - 0 Metazoa - 125 Fungi - 34 Plants - 23 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASARNSWKAFTNGEQSQTSDLEENSNFGVAFKSFMQVK 
HIETGININ*
>AT2G17980.1 |  ATSLY1 protein transporter 
MALNLRQKQTECVIRMLNLNQPLNPSGTANEEVYKILIYDRFCQNILSPLTHVKDLRKHG 
VTLFFLIDKDRQPVHDVPAVYFVQPTESNLQRIIADASRSLYDTFHLNFSSSIPRKFLEE 
LASGTLKSGSVEKVSKVHDQYLEFVTLEDNLFSLAQQSTYVQMNDPSAGEKEINEIIERV 
ASGLFCVLVTLGVVPVIRCPSGGPAEMVASLLDQKLRDHLLSKNNLFTEGGGFMSSFQRP 
LLCIFDRNFELSVGIQHDFRYRPLVHDVLGLKLNQLKVQGEKGPPKSFELDSSDPFWSAN 
STLEFPDVAVEIETQLNKYKRDVEEVNKKTGGGSGAEFDGTDLIGNIHTEHLMNTVKSLP 
ELTERKKVIDKHTNIATALLGQIKERSIDAFTKKESDMMMRGGIDRTELMAALKGKGTKM 
DKLRFAIMYLISTETINQSEVEAVEAALNEAEADTSAFQYVKKIKSLNASFAATSANSAS 
RSNIVDWAEKLYGQSISAVTAGVKNLLSSDQQLAVTRTVEALTEGKPNPEIDSYRFLDPR 
APKSSSSGGSHVKGPFREAIVFMIGGGNYVEYGSLQELTQRQLTVKNVIYGATEILNGGE 
LVEQLGLLGKKMGLGGPVASTSLSGGH*
>AT3G54630.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 11 plant structures EXPRESSED DURING 6 growth stages CONTAINS InterPro DOMAIN/s Kinetochore protein Ndc80 (InterProIPR005550) Has 19800 Blast hits to 12108 proteins in 845 species Archae - 345 Bacteria - 1709 Metazoa - 11031 Fungi - 1499 Plants - 797 Viruses - 39 Other Eukaryotes - 4380 (source NCBI BLink) 
MRGGAAGKRRTTVGFGGAPPPPPPSIEQQRHLFNSRDSDASFASSRPSSIGLGGRGASDD 
RSSMIRFINAFLSTHNFPISIRGNPVPSVKDISETLKFLLSALDYPCDSIKWDEDLVFFL 
KSQKCPFKITKSSLKAPNTPHNWPTVLAVVHWLAELARFHQHLVSNSTSVPEDNSMNFFA 
IQSFGHFIRGEDDKVNDLDSQFLGKLEAEKTSVAETISGCEKISGELEAKLESLRKGPSK 
KESLEKVKADLENDVNKFRTIVVEYTDRNPAMEKVVEEKAKELKAKEEERERISVENKEL 
KKSVELQNFSAADVNRMRRELQAVERDVADAEVARDGWDQKAWELNSQIRNQFHQIQTLA 
IDCNQALRRLKLDIQFAVNERGETPAAVMGVDYKSVVKPALCSLCDGIKGSSAEKVEELV 
TLQHHKSEMASKIESKRSLLGSIQLQINDLEEKMKLVKKETQELSTKCDLEAKTLVESVK 
AEALNLEVVEKEAAEFVKASELRLQEAVKESEEEVQACAAQLFALIDSISKQKEYMDSKI 
SEIKTGVADTASAVSEIYKANFKKNLGI*