>AT5G16300.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 204 Blast hits to 183 proteins in 76 species Archae - 0 Bacteria - 0 Metazoa - 127 Fungi - 34 Plants - 24 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink)
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP
ISAPALSSRSTNKVSIPVTSNDASARNSWKAFTNGEQSQTSDLEENSNFGVAFKSFMQES
TLKLGSILTDGQVGIFKDRSAAAMSTFGDILPAQAAGLLSSFTNTRSE*
>AT5G16300.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 199 Blast hits to 182 proteins in 76 species Archae - 0 Bacteria - 0 Metazoa - 122 Fungi - 34 Plants - 24 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink)
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP
ISAPALSSRSTNKVSIPVTSNDASARNSWKAFTNGEQSQTSDLEENSNFGVAFKSFMQES
TLKLGSILTDGQVGIFKDRSAAAMSTFGDILPAQAAGLLSSFTNTRSE*
>AT5G16300.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 201 Blast hits to 182 proteins in 75 species Archae - 0 Bacteria - 0 Metazoa - 125 Fungi - 34 Plants - 23 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink)
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP
ISAPALSSRSTNKVSIPVTSNDASARNSWKAFTNGEQSQTSDLEENSNFGVAFKSFMQES
TLKLGSILTDGQVGIFKDRSAAAMSTFGDILPAQAAGLLSSFTNTRSE*
>AT5G16300.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 204 Blast hits to 183 proteins in 76 species Archae - 0 Bacteria - 0 Metazoa - 127 Fungi - 34 Plants - 24 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink)
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP
ISAPALSSRSTNKVSIPVTSNDASESTLKLGSILTDGQVGIFKDRSAAAMSTFGDILPAQ
AAGLLSSFTNTRSE*
>AT5G16300.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 199 Blast hits to 182 proteins in 76 species Archae - 0 Bacteria - 0 Metazoa - 122 Fungi - 34 Plants - 24 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink)
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP
ISAPALSSRSTNKVSIPVTSNDASESTLKLGSILTDGQVGIFKDRSAAAMSTFGDILPAQ
AAGLLSSFTNTRSE*
>AT5G16300.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 201 Blast hits to 182 proteins in 75 species Archae - 0 Bacteria - 0 Metazoa - 125 Fungi - 34 Plants - 23 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink)
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP
ISAPALSSRSTNKVSIPVTSNDASESTLKLGSILTDGQVGIFKDRSAAAMSTFGDILPAQ
AAGLLSSFTNTRSE*
>AT5G16300.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 204 Blast hits to 183 proteins in 76 species Archae - 0 Bacteria - 0 Metazoa - 127 Fungi - 34 Plants - 24 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink)
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP
ISAPALSSRSTNKVSIPVTSNDASARNSWKAFTNGEQSQTSDLEENSNFGVAFKSFMQVK
HIETGININ*
>AT5G16300.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 199 Blast hits to 182 proteins in 76 species Archae - 0 Bacteria - 0 Metazoa - 122 Fungi - 34 Plants - 24 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink)
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP
ISAPALSSRSTNKVSIPVTSNDASARNSWKAFTNGEQSQTSDLEENSNFGVAFKSFMQVK
HIETGININ*
>AT5G16300.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 201 Blast hits to 182 proteins in 75 species Archae - 0 Bacteria - 0 Metazoa - 125 Fungi - 34 Plants - 23 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink)
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP
ISAPALSSRSTNKVSIPVTSNDASARNSWKAFTNGEQSQTSDLEENSNFGVAFKSFMQVK
HIETGININ*
>AT1G73430.1 | sec34-like family protein
MATKAASSSSLPKSGAISKGYNFASTWEQSAPLTEQQQAAIVSLSHAVAERPFPANLVHE
HVHRPENGLSVSVEDTHLGDSGAIEAVLVNTNQFYKWFTDLESAMKSETEEKYRHYVSTL
TERIQTCDNILHQVDETLDLFNELQLQHQGVTTKTKTLHDACDRLLMEKQKLMEFAEALR
SKLNYFDELENVSSNFYSPNMNVSNSNFLPLLKRLDECISYIEDNPQYAESSVYLLKFRQ
LQSRALGMIRTYILAVLKTAASQVQAAFRGTGGNKTSVSEGVEASVIYVRFKAAANELKP
VLEEIESRSARKEYVQILAECHRLYCEQRLSLVKGIVHQRVSDFAKKEALPSLTRSGCAY
LMQVCHMEHQLFTHFFPASSEEVSSLAPLVDPLSTYLYDILRPKLIHEANIDLLCELVHI
LKVEVLGDQSARQSEPLAGLRPTLQRILADVNERLTFRARTYIRDEIANYTPSDEDLDYP
AKLEGSPNTTSETDLRDDENADVFKTWYPPLEKTLSCLSKLYRCLEQAVFTGLAQEAVEV
CSLSIQKASKLIIKRSTTMDGQLFLIKHLLILREQIAPFDIEFSVTHKELDFSHLLEHLR
RILRGQASLFDWSRSTSLARTLSPRVLESQIDAKKELEKCLKTTCEEFIMSVTKLVVDPM
LSFVTKVTAIKVALSSGTQNHKVDSVMAKPLKEQAFATPDKVVELVQKVYAAIQQELLPI
LAKMKLYLQNPSTRTILFKPIKTNIVEAHTQVESLLKAEYSAEEQANINMISIQDLQTQL
DNFL*
>AT1G73430.1 | sec34-like family protein
MATKAASSSSLPKSGAISKGYNFASTWEQSAPLTEQQQAAIVSLSHAVAERPFPANLVHE
HVHRPENGLSVSVEDTHLGDSGAIEAVLVNTNQFYKWFTDLESAMKSETEEKYRHYVSTL
TERIQTCDNILHQVDETLDLFNELQLQHQGVTTKTKTLHDACDRLLMEKQKLMEFAEALR
SKLNYFDELENVSSNFYSPNMNVSNSNFLPLLKRLDECISYIEDNPQYAESSVYLLKFRQ
LQSRALGMIRTYILAVLKTAASQVQAAFRGTGGNKTSVSEGVEASVIYVRFKAAANELKP
VLEEIESRSARKEYVQILAECHRLYCEQRLSLVKGIVHQRVSDFAKKEALPSLTRSGCAY
LMQVCHMEHQLFTHFFPASSEEVSSLAPLVDPLSTYLYDILRPKLIHEANIDLLCELVHI
LKVEVLGDQSARQSEPLAGLRPTLQRILADVNERLTFRARTYIRDEIANYTPSDEDLDYP
AKLEGSPNTTSETDLRDDENADVFKTWYPPLEKTLSCLSKLYRCLEQAVFTGLAQEAVEV
CSLSIQKASKLIIKRSTTMDGQLFLIKHLLILREQIAPFDIEFSVTHKELDFSHLLEHLR
RILRGQASLFDWSRSTSLARTLSPRVLESQIDAKKELEKCLKTTCEEFIMSVTKLVVDPM
LSFVTKVTAIKVALSSGTQNHKVDSVMAKPLKEQAFATPDKVVELVQKVYAAIQQELLPI
LAKMKLYLQNPSTRTILFKPIKTNIVEAHTQVESLLKAEYSAEEQANINMISIQDLQTQL
DNFL*
>AT1G73430.2 | sec34-like family protein
MATKAASSSSLPKSGAISKGYNFASTWEQSAPLTEQQQAAIVSLSHAVAERPFPANLVHE
HVHRPENGLSVSVEDTHLGDSGAIEAVLVNTNQFYKWFTDLESAMKSETEEKYRHYVSTL
TERIQTCDNILHQVDETLDLFNELQLQHQGVTTKTKTLHDACDRLLMEKQKLMEFAEALR
SKLNYFDELENVSSNFYSPNMNVSNSNFLPLLKRLDECISYIEDNPQYAESSVYLLKFRQ
LQSRALGMIRTYILAVLKTAASQVQAAFRGTGGNKTSVSEGVEASVIYVRFKAAANELKP
VLEEIESRSARKEYVQILAECHRLYCEQRLSLVKGIVHQRVSDFAKKEALPSLTRSGCAY
LMQVCHMEHQLFTHFFPASSEEVSSLAPLVDPLSTYLYDILRPKLIHEANIDLLCELVHI
LKVEVLGDQSARQSEPLAGLRPTLQRILADVNERLTFRARTYIRDEIANYTPSDEDLDYP
AKLEGSPNTTSETDLRDDENADVFKTWYPPLEKTLSCLSKLYRCLEQAVFTGLAQEAVEV
CSLSIQKASKLIIKRSTTMDGQLFLIKHLLILREQIAPFDIEFSVTHKELDFSHLLEHLR
RILRGQASLFDWSRSTSLARTLSPRVLESQIDAKKELEKCLKTTCEEFIMSVTKLVVDPM
LSFVTKVTAIKVALSSGTQNHKVDSVMAKPLKEQAFATPDKVVELVQKVYAAIQQELLPI
LAKMKLYLQNPSTRTILFKPIKTNIVEAHTQVESLLKAEYSAEEQANINMISIQDLQTQL
DNFL*
>AT1G73430.2 | sec34-like family protein
MATKAASSSSLPKSGAISKGYNFASTWEQSAPLTEQQQAAIVSLSHAVAERPFPANLVHE
HVHRPENGLSVSVEDTHLGDSGAIEAVLVNTNQFYKWFTDLESAMKSETEEKYRHYVSTL
TERIQTCDNILHQVDETLDLFNELQLQHQGVTTKTKTLHDACDRLLMEKQKLMEFAEALR
SKLNYFDELENVSSNFYSPNMNVSNSNFLPLLKRLDECISYIEDNPQYAESSVYLLKFRQ
LQSRALGMIRTYILAVLKTAASQVQAAFRGTGGNKTSVSEGVEASVIYVRFKAAANELKP
VLEEIESRSARKEYVQILAECHRLYCEQRLSLVKGIVHQRVSDFAKKEALPSLTRSGCAY
LMQVCHMEHQLFTHFFPASSEEVSSLAPLVDPLSTYLYDILRPKLIHEANIDLLCELVHI
LKVEVLGDQSARQSEPLAGLRPTLQRILADVNERLTFRARTYIRDEIANYTPSDEDLDYP
AKLEGSPNTTSETDLRDDENADVFKTWYPPLEKTLSCLSKLYRCLEQAVFTGLAQEAVEV
CSLSIQKASKLIIKRSTTMDGQLFLIKHLLILREQIAPFDIEFSVTHKELDFSHLLEHLR
RILRGQASLFDWSRSTSLARTLSPRVLESQIDAKKELEKCLKTTCEEFIMSVTKLVVDPM
LSFVTKVTAIKVALSSGTQNHKVDSVMAKPLKEQAFATPDKVVELVQKVYAAIQQELLPI
LAKMKLYLQNPSTRTILFKPIKTNIVEAHTQVESLLKAEYSAEEQANINMISIQDLQTQL
DNFL*
>AT5G11980.1 | conserved oligomeric Golgi complex component-related / COG complex component-related
MAMEVGEMSQPEATASLLSLASATQQPYVSELLSFTLDRLHKEPELLRVDAERIQRQMQE
VAVGNYRAFITAADALLAIRQEVSSIDKHLESLIGEVPKLTSGCTEFIDSAENILEKRKM
NQALLANHSTLLDLLEIPQLMDTCVRNGNFDEALDLEAFVSKLATLHPKLPVIQALAAEV
RQTTQSLLSQLLQKLRSNIQLPECLRIIGYLRRIGVFGEYEMRLQFLRCREAWLTGILED
LDQKNAYEYLKGMINCHRMHLFDVVNQYRAIFSDDTSGSEENYDGGLLFSWAMHQITSHL
KTLKIMLPKITEGGSLSNILDQCMYCAMGLGGVGLDFRGLLPPLFEEAVLNLFSKNMSTA
VENFQLVLDSHRWVPLPSVGFPSSGINEDSKDDVTPPSYLMEHPPLAVFINGVSSALNEL
RPCAPLSLKNVVAHELIKGLQAVSDSLLRYNTTRMLRLSESNLFLSLCRAFVEVVFPHCA
TCFGRCYPGGATIVMDAKSAYEGLGRILAASSSQEPSNKSPKVISTDTKDASENGVASQP
EEKQAENPNAKEEDNSPIPLQTPEITPES*
>AT3G07180.1 | GPI transamidase component PIG-S-related
MEEISDRLNPGITPEFDPKTMRSTKPGLKRLFITSSVLFSFLLGVPFLWKSVEIYRSPLP
FHDIDSLSDQLESTPLRFPCNFHAVFVGFRSTDPDNLRSQIQDGINQLTHQSSQCGSCDF
SLSVTVQNREDQCSDTLAHSSTTCSYRCGVIKRNGFSVGLDDTVDESLNDVFSGCSENSG
KMYSVVVVNKENANGGDEVKAVVGKRRHAWIVGNGLEERYGDIVARVSEIFVQVFMNGGR
EEDSIQGEFMPVGSDGKIVLSFSLLNSNPRDWVYDWDFQRIDEALLAPVTKALAPIANIT
VESQVLYHTPKSSFSSWDKKLQSYIFRTSDLPFFVNSNEWHLDTSAGASGRSKILQFVVY
IPSGKECPLLLQLPNVEISKTNGFISPMWGGVIVWNPGNCDKDSESPSRNTISLQDLEQI
VEIFLGQFRQLFGFKSEAKYTTGLGTYKILTSERGFTEWELDVLSRKHTCFNLHSCSTTL
GSLSRLVRSLPRMIIKDEIGEQVKYSLKAAKLAQSNASLGGYSSSASSSREARSLAENAF
FHPSIMSVSYFSYEHCFAVYSPFFLPVVGHVVLAAVREWKRYKQEKAKYLTWLTRKKTT*
>AT3G07180.1 | GPI transamidase component PIG-S-related
MEEISDRLNPGITPEFDPKTMRSTKPGLKRLFITSSVLFSFLLGVPFLWKSVEIYRSPLP
FHDIDSLSDQLESTPLRFPCNFHAVFVGFRSTDPDNLRSQIQDGINQLTHQSSQCGSCDF
SLSVTVQNREDQCSDTLAHSSTTCSYRCGVIKRNGFSVGLDDTVDESLNDVFSGCSENSG
KMYSVVVVNKENANGGDEVKAVVGKRRHAWIVGNGLEERYGDIVARVSEIFVQVFMNGGR
EEDSIQGEFMPVGSDGKIVLSFSLLNSNPRDWVYDWDFQRIDEALLAPVTKALAPIANIT
VESQVLYHTPKSSFSSWDKKLQSYIFRTSDLPFFVNSNEWHLDTSAGASGRSKILQFVVY
IPSGKECPLLLQLPNVEISKTNGFISPMWGGVIVWNPGNCDKDSESPSRNTISLQDLEQI
VEIFLGQFRQLFGFKSEAKYTTGLGTYKILTSERGFTEWELDVLSRKHTCFNLHSCSTTL
GSLSRLVRSLPRMIIKDEIGEQVKYSLKAAKLAQSNASLGGYSSSASSSREARSLAENAF
FHPSIMSVSYFSYEHCFAVYSPFFLPVVGHVVLAAVREWKRYKQEKAKYLTWLTRKKTT*
>AT3G07180.2 | GPI transamidase component PIG-S-related
MEEISDRLNPGITPEFDPKTMRSTKPGLKRLFITSSVLFSFLLGVPFLWKSVEIYRSPLP
FHDIDSLSDQLESTPLRFPCNFHAVFVGFRSTDPDNLRSQIQDGINQLTHQSSQCGSCDF
SLSVTVQNREDQCSDTLAHSSTTCSYRCGVIKRNGFSVGLDDTVDESLNDVFSGCSENSG
KMYSVVVVNKENANGGDEVKAVVGKRRHAWIVGNGLEERYGDIVARVSEIFVQVFMNGGR
EEDSIQGEFMPVGSDGKIVLSFSLLNSNPRDWVYDWDFQRIDEALLAPVTKALAPIANIT
VESQVLYHTPKSSFSSWDKKLQSYIFRTSDLPFFVNSNEWHLDTSAGASGRSKILQFVKY
SLKAAKLAQSNASLGGYSSSASSSREARSLAENAFFHPSIMSVSYFSYEHCFAVYSPFFL
PVVGHVVLAAVREWKRYKQEKAKYLTWLTRKKTT*
>AT3G07180.2 | GPI transamidase component PIG-S-related
MEEISDRLNPGITPEFDPKTMRSTKPGLKRLFITSSVLFSFLLGVPFLWKSVEIYRSPLP
FHDIDSLSDQLESTPLRFPCNFHAVFVGFRSTDPDNLRSQIQDGINQLTHQSSQCGSCDF
SLSVTVQNREDQCSDTLAHSSTTCSYRCGVIKRNGFSVGLDDTVDESLNDVFSGCSENSG
KMYSVVVVNKENANGGDEVKAVVGKRRHAWIVGNGLEERYGDIVARVSEIFVQVFMNGGR
EEDSIQGEFMPVGSDGKIVLSFSLLNSNPRDWVYDWDFQRIDEALLAPVTKALAPIANIT
VESQVLYHTPKSSFSSWDKKLQSYIFRTSDLPFFVNSNEWHLDTSAGASGRSKILQFVKY
SLKAAKLAQSNASLGGYSSSASSSREARSLAENAFFHPSIMSVSYFSYEHCFAVYSPFFL
PVVGHVVLAAVREWKRYKQEKAKYLTWLTRKKTT*
>AT4G01400.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s COG4 transport (InterProIPR013167) Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G461001) Has 11639 Blast hits to 3936 proteins in 180 species Archae - 0 Bacteria - 2 Metazoa - 206 Fungi - 141 Plants - 11004 Viruses - 0 Other Eukaryotes - 286 (source NCBI BLink)
MIRRPIYDFAAVFRHLTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIAS
QSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGE
IFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKS
SRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQ
VNGAMELLDDMLNKGFVPDRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFSVSNCLVKG
FCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDAVKEEITGD
TRIVDVGIENKKMPEIEQDDAAAETVDSSTVKFGTPEALEYVRSLTDVGAMTRLLHECIA
YQRSLDSDLDTLLSQRTELDRNLVQLQRSAEILDIVKADADHMLGNVRSTCDLADQVSGK
VRELDLAQSRVNVTLSRIDAIVERGNCIEGVKTALESEDYESAAKFVQRFLQIDLQYKDS
GSDQSEQLHASKEQLEGIAKKKLLAAIDQRDHPTILRFVRLYSPLGMETEGLQLYVGYLK
KVIALRGRMEYENVVELMEQGLGQVNFVGCLTNLFKDIVMAIEENDEILRGLCGEDGVAY
AICELQEECDLRGSLILKKYMDFRKLAILASDINNSPNLNILPGGASEGPDPREVELYVE
EILSLMQLGEDYTEFMVSKIKSLTSVDPELLPTATKAFRNKSFSKAIQDVTRYYVILEGF
FMVENVRKAIRIDEHVPDSLTTSMVDDVFYVLQSCLRRAISTSNISSVIAVLSYAGSLLG
NDYHEALQQKIREPNLGARLFLGGIGVENTGTEIATALNNMDVSCEYILKLKHEIEEQCT
EVFPAPADRERIKSCLSELGELSSTFKQLLNSGMEQLVATVTPRIRPVLDTVATISYELT
ETEYAENEVNDPWVQRLLHSVETNAAWLQPLMTSNNYDSFLHLIIDFIVKRLEVIMMQKR
FSQLGGLQLDRDTRALVSHFSGMTQRTVRDKFARLTQMATILNLEKVSEILDFWGENSGP
MTWRLTPAEVRRVLGLRVEFKPESIAALKL*
>AT4G01400.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s COG4 transport (InterProIPR013167)
MIRRPIYDFAAVFRHLTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIAS
QSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGE
IFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKS
SRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQ
VNGAMELLDDMLNKGFVPDRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFSVSNCLVKG
FCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDAVKEEITGD
TRIVDVGIENKKMPEIEQDDAAAETVDSSTVKFGTPEALEYVRSLTDVGAMTRLLHECIA
YQRSLDSDLDTLLSQRTELDRNLVQLQRSAEILDIVKADADHMLGNVRSTCDLADQVSGK
VRELDLAQSRVNVTLSRIDAIVERGNCIEGVKTALESEDYESAAKFVQRFLQIDLQYKDS
GSDQSEQLHASKEQLEGIAKKKLLAAIDQRDHPTILRFVRLYSPLGMETEGLQLYVGYLK
KVIALRGRMEYENVVELMEQGLGQVNFVGCLTNLFKDIVMAIEENDEILRGLCGEDGVAY
AICELQEECDLRGSLILKKYMDFRKLAILASDINNSPNLNILPGGASEGPDPREVELYVE
EILSLMQLGEDYTEFMVSKIKSLTSVDPELLPTATKAFRNKSFSKAIQDVTRYYVILEGF
FMVENVRKAIRIDEHVPDSLTTSMVDDVFYVLQSCLRRAISTSNISSVIAVLSYAGSLLG
NDYHEALQQKIREPNLGARLFLGGIGVENTGTEIATALNNMDVSCEYILKLKHEIEEQCT
EVFPAPADRERIKSCLSELGELSSTFKQLLNSGMEQLVATVTPRIRPVLDTVATISYELT
ETEYAENEVNDPWVQRLLHSVETNAAWLQPLMTSNNYDSFLHLIIDFIVKRLEVIMMQKR
FSQLGGLQLDRDTRALVSHFSGMTQRTVRDKFARLTQMATILNLEKVSEILDFWGENSGP
MTWRLTPAEVRRVLGLRVEFKPESIAALKL*
>AT4G01400.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G461001)
MIRRPIYDFAAVFRHLTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIAS
QSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGE
IFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKS
SRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQ
VNGAMELLDDMLNKGFVPDRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFSVSNCLVKG
FCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDAVKEEITGD
TRIVDVGIENKKMPEIEQDDAAAETVDSSTVKFGTPEALEYVRSLTDVGAMTRLLHECIA
YQRSLDSDLDTLLSQRTELDRNLVQLQRSAEILDIVKADADHMLGNVRSTCDLADQVSGK
VRELDLAQSRVNVTLSRIDAIVERGNCIEGVKTALESEDYESAAKFVQRFLQIDLQYKDS
GSDQSEQLHASKEQLEGIAKKKLLAAIDQRDHPTILRFVRLYSPLGMETEGLQLYVGYLK
KVIALRGRMEYENVVELMEQGLGQVNFVGCLTNLFKDIVMAIEENDEILRGLCGEDGVAY
AICELQEECDLRGSLILKKYMDFRKLAILASDINNSPNLNILPGGASEGPDPREVELYVE
EILSLMQLGEDYTEFMVSKIKSLTSVDPELLPTATKAFRNKSFSKAIQDVTRYYVILEGF
FMVENVRKAIRIDEHVPDSLTTSMVDDVFYVLQSCLRRAISTSNISSVIAVLSYAGSLLG
NDYHEALQQKIREPNLGARLFLGGIGVENTGTEIATALNNMDVSCEYILKLKHEIEEQCT
EVFPAPADRERIKSCLSELGELSSTFKQLLNSGMEQLVATVTPRIRPVLDTVATISYELT
ETEYAENEVNDPWVQRLLHSVETNAAWLQPLMTSNNYDSFLHLIIDFIVKRLEVIMMQKR
FSQLGGLQLDRDTRALVSHFSGMTQRTVRDKFARLTQMATILNLEKVSEILDFWGENSGP
MTWRLTPAEVRRVLGLRVEFKPESIAALKL*
>AT4G01400.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s COG4 transport (InterProIPR013167) Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G461001) Has 11639 Blast hits to 3936 proteins in 180 species Archae - 0 Bacteria - 2 Metazoa - 206 Fungi - 141 Plants - 11004 Viruses - 0 Other Eukaryotes - 286 (source NCBI BLink)
MPEIEQDDAAAETVDSSTVKFGTPEALEYVRSLTDVGAMTRLLHECIAYQRSLDSDLDTL
LSQRTELDRNLVQLQRSAEILDIVKADADHMLGNVRSTCDLADQVSGKVRELDLAQSRVN
VTLSRIDAIVERGNCIEGVKTALESEDYESAAKFVQRFLQIDLQYKDSGSDQSEQLHASK
EQLEGIAKKKLLAAIDQRDHPTILRFVRLYSPLGMETEGLQLYVGYLKKVIALRGRMEYE
NVVELMEQGLGQVNFVGCLTNLFKDIVMAIEENDEILRGLCGEDGVAYAICELQEECDLR
GSLILKKYMDFRKLAILASDINNSPNLNILPGGASEGPDPREVELYVEEILSLMQLGEDY
TEFMVSKIKSLTSVDPELLPTATKAFRNKSFSKAIQDVTRYYVILEGFFMVENVRKAIRI
DEHVPDSLTTSMVDDVFYVLQSCLRRAISTSNISSVIAVLSYAGSLLGNDYHEALQQKIR
EPNLGARLFLGGIGVENTGTEIATALNNMDVSCEYILKLKHEIEEQCTEVFPAPADRERI
KSCLSELGELSSTFKQLLNSGMEQLVATVTPRIRPVLDTVATISYELTETEYAENEVNDP
WVQRLLHSVETNAAWLQPLMTSNNYDSFLHLIIDFIVKRLEVIMMQKRFSQLGGLQLDRD
TRALVSHFSGMTQRTVRDKFARLTQMATILNLEKVSEILDFWGENSGPMTWRLTPAEVRR
VLGLRVEFKPESIAALKL*
>AT4G01400.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s COG4 transport (InterProIPR013167)
MPEIEQDDAAAETVDSSTVKFGTPEALEYVRSLTDVGAMTRLLHECIAYQRSLDSDLDTL
LSQRTELDRNLVQLQRSAEILDIVKADADHMLGNVRSTCDLADQVSGKVRELDLAQSRVN
VTLSRIDAIVERGNCIEGVKTALESEDYESAAKFVQRFLQIDLQYKDSGSDQSEQLHASK
EQLEGIAKKKLLAAIDQRDHPTILRFVRLYSPLGMETEGLQLYVGYLKKVIALRGRMEYE
NVVELMEQGLGQVNFVGCLTNLFKDIVMAIEENDEILRGLCGEDGVAYAICELQEECDLR
GSLILKKYMDFRKLAILASDINNSPNLNILPGGASEGPDPREVELYVEEILSLMQLGEDY
TEFMVSKIKSLTSVDPELLPTATKAFRNKSFSKAIQDVTRYYVILEGFFMVENVRKAIRI
DEHVPDSLTTSMVDDVFYVLQSCLRRAISTSNISSVIAVLSYAGSLLGNDYHEALQQKIR
EPNLGARLFLGGIGVENTGTEIATALNNMDVSCEYILKLKHEIEEQCTEVFPAPADRERI
KSCLSELGELSSTFKQLLNSGMEQLVATVTPRIRPVLDTVATISYELTETEYAENEVNDP
WVQRLLHSVETNAAWLQPLMTSNNYDSFLHLIIDFIVKRLEVIMMQKRFSQLGGLQLDRD
TRALVSHFSGMTQRTVRDKFARLTQMATILNLEKVSEILDFWGENSGPMTWRLTPAEVRR
VLGLRVEFKPESIAALKL*
>AT4G01400.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G461001)
MPEIEQDDAAAETVDSSTVKFGTPEALEYVRSLTDVGAMTRLLHECIAYQRSLDSDLDTL
LSQRTELDRNLVQLQRSAEILDIVKADADHMLGNVRSTCDLADQVSGKVRELDLAQSRVN
VTLSRIDAIVERGNCIEGVKTALESEDYESAAKFVQRFLQIDLQYKDSGSDQSEQLHASK
EQLEGIAKKKLLAAIDQRDHPTILRFVRLYSPLGMETEGLQLYVGYLKKVIALRGRMEYE
NVVELMEQGLGQVNFVGCLTNLFKDIVMAIEENDEILRGLCGEDGVAYAICELQEECDLR
GSLILKKYMDFRKLAILASDINNSPNLNILPGGASEGPDPREVELYVEEILSLMQLGEDY
TEFMVSKIKSLTSVDPELLPTATKAFRNKSFSKAIQDVTRYYVILEGFFMVENVRKAIRI
DEHVPDSLTTSMVDDVFYVLQSCLRRAISTSNISSVIAVLSYAGSLLGNDYHEALQQKIR
EPNLGARLFLGGIGVENTGTEIATALNNMDVSCEYILKLKHEIEEQCTEVFPAPADRERI
KSCLSELGELSSTFKQLLNSGMEQLVATVTPRIRPVLDTVATISYELTETEYAENEVNDP
WVQRLLHSVETNAAWLQPLMTSNNYDSFLHLIIDFIVKRLEVIMMQKRFSQLGGLQLDRD
TRALVSHFSGMTQRTVRDKFARLTQMATILNLEKVSEILDFWGENSGPMTWRLTPAEVRR
VLGLRVEFKPESIAALKL*
>AT4G01400.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s COG4 transport (InterProIPR013167) Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G461001) Has 11639 Blast hits to 3936 proteins in 180 species Archae - 0 Bacteria - 2 Metazoa - 206 Fungi - 141 Plants - 11004 Viruses - 0 Other Eukaryotes - 286 (source NCBI BLink)
MIRRPIYDFAAVFRHLTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIAS
QSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGE
IFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKS
SRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQ
VNGAMELLDDMLNKGFVPDRLSYTTLLNSLCRKTQLREAYKLLCRMKLKGCNPDLVHYNT
MILGFCREDRAMDARKVLDDMLSNGCSPNSVSYRTLIGGLCDQGMFDEGKKYLEEMISKG
FSPHFSVSNCLVKGFCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIK
LFLEDAVKEEITGDTRIVDVGIGLGSYLSSKLQMKRKNARERRRHL*
>AT4G01400.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s COG4 transport (InterProIPR013167)
MIRRPIYDFAAVFRHLTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIAS
QSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGE
IFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKS
SRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQ
VNGAMELLDDMLNKGFVPDRLSYTTLLNSLCRKTQLREAYKLLCRMKLKGCNPDLVHYNT
MILGFCREDRAMDARKVLDDMLSNGCSPNSVSYRTLIGGLCDQGMFDEGKKYLEEMISKG
FSPHFSVSNCLVKGFCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIK
LFLEDAVKEEITGDTRIVDVGIGLGSYLSSKLQMKRKNARERRRHL*
>AT4G01400.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G461001)
MIRRPIYDFAAVFRHLTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIAS
QSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGE
IFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKS
SRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQ
VNGAMELLDDMLNKGFVPDRLSYTTLLNSLCRKTQLREAYKLLCRMKLKGCNPDLVHYNT
MILGFCREDRAMDARKVLDDMLSNGCSPNSVSYRTLIGGLCDQGMFDEGKKYLEEMISKG
FSPHFSVSNCLVKGFCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIK
LFLEDAVKEEITGDTRIVDVGIGLGSYLSSKLQMKRKNARERRRHL*