>AT1G47330.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN plasma membrane EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF21 (InterProIPR002550) Cystathionine beta-synthase core (InterProIPR000644) BEST Arabidopsis thaliana protein match is CBS domain-containing protein (TAIRAT2G145201) Has 6970 Blast hits to 6717 proteins in 1361 species Archae - 64 Bacteria - 4382 Metazoa - 390 Fungi - 183 Plants - 125 Viruses - 0 Other Eukaryotes - 1826 (source NCBI BLink) 
MSSDIPCCGTTFSLYVVIIIALVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRINAG 
KIFPVVKNQHLLLCTLLIGNSMAMEALPIFLDKIVPPWLAILLSVTLILVFGEIMPQAVC 
TRYGLKVGAIMAPFVRVLLVLFFPISYPISKVLDWMLGKGHGVLLRRAELKTFVNFHGNE 
AGKGGDLTTDETSIITGALELTEKTAKDAMTPISNAFSLELDTPLNLETLNTIMSVGHSR 
VPVYFRNPTHIIGLILVKNLLAVDARKEVPLRKMSMRKIPRVSETMPLYDILNEFQKGHS 
HIAVVYKDLDEQEQSPETSENGIERRKNKKTKDELFKDSCRKPKAQFEVSEKEVFKIETG 
DAKSGKSENGEEQQGSGKTSLLAAPAKKRHRGCSFCILDIENTPIPDFPTNEEVVGVITM 
EDVIEELLQEEILDETDEYVNIHNRIRVNMHASPENLPSVITSITQSSSGSTSPNQTSHM 
ATPDSSPTTKPSNSSPTRKPSVSSPTREPSDSSHSMAPKHEESTQTL*
>AT5G41170.1 |  pentatricopeptide (PPR) repeat-containing protein 
MAMRFFQLHRNRLVKGNSGKALSFSRLLDLSFWVRAFCNYREILRNGLHSLQFNEALDLF 
THMVESRPLPSIIDFTKLLNVIAKMKKFDVVINLCDHLQIMGVSHDLYTCNLLMNCFCQS 
SQPYLASSFLGKMMKLGFEPDIVTFTSLINGFCLGNRMEEAMSMVNQMVEMGIKPDVVMY 
TTIIDSLCKNGHVNYALSLFDQMENYGIRPDVVMYTSLVNGLCNSGRWRDADSLLRGMTK 
RKIKPDVITFNALIDAFVKEGKFLDAEELYNEMIRMSIAPNIFTYTSLINGFCMEGCVDE 
ARQMFYLMETKGCFPDVVAYTSLINGFCKCKKVDDAMKIFYEMSQKGLTGNTITYTTLIQ 
GFGQVGKPNVAQEVFSHMVSRGVPPNIRTYNVLLHCLCYNGKVKKALMIFEDMQKREMDG 
VAPNIWTYNVLLHGLCYNGKLEKALMVFEDMRKREMDIGIITYTIIIQGMCKAGKVKNAV 
NLFCSLPSKGVKPNVVTYTTMISGLFREGLKHEAHVLFRKMKEDGVS*