>AT2G02980.1 |  pentatricopeptide (PPR) repeat-containing protein 
MAISSASLISSFSHAETFTKHSKIDTVNTQNPILLISKCNSLRELMQIQAYAIKSHIEDV 
SFVAKLINFCTESPTESSMSYARHLFEAMSEPDIVIFNSMARGYSRFTNPLEVFSLFVEI 
LEDGILPDNYTFPSLLKACAVAKALEEGRQLHCLSMKLGLDDNVYVCPTLINMYTECEDV 
DSARCVFDRIVEPCVVCYNAMITGYARRNRPNEALSLFREMQGKYLKPNEITLLSVLSSC 
ALLGSLDLGKWIHKYAKKHSFCKYVKVNTALIDMFAKCGSLDDAVSIFEKMRYKDTQAWS 
AMIVAYANHGKAEKSMLMFERMRSENVQPDEITFLGLLNACSHTGRVEEGRKYFSQMVSK 
FGIVPSIKHYGSMVDLLSRAGNLEDAYEFIDKLPISPTPMLWRILLAACSSHNNLDLAEK 
VSERIFELDDSHGGDYVILSNLYARNKKWEYVDSLRKVMKDRKAVKVPGCSSIEVNNVVH 
EFFSGDGVKSATTKLHRALDEMVKELKLSGYVPDTSMVVHANMNDQEKEITLRYHSEKLA 
ITFGLLNTPPGTTIRVVKNLRVCRDCHNAAKLISLIFGRKVVLRDVQRFHHFEDGKCSCG 
DFW*
>AT4G34200.1 |  EDA9 (embryo sac development arrest 9) ATP binding 
MSATAAASSSIAVATNSLRNVTLSSRSPLPSAISVAFPSRGRNTLQRRLVLVSCSTGDGS 
KPTILVAEKLGDAGIKLLEDVANVDCSYNMTPEELNIKISLCDALIVRSGTKVGREVFES 
SHGRLKVVGRAGVGIDNVDLSAATEFGCLVVNAPTANTIAAAEHGIALMAAMARNVAQAD 
ASVKAGEWKRNKYVGVSLVGKTLAVLGFGKVGTEVARRAKGLGMRVIAHDPYAPADRAHA 
IGVDLVSFDEALATADFISLHMPLTPTTSKILNDETFAKMKKGVRIVNVARGGVIDEDAL 
VRALDAGIVAQAALDVFTKEPPAKDSKLVQHERVTVTPHLGASTMEAQEGVAIEIAEAVV 
GALNGELAATAVNAPMVSAEVLTELKPYVVLAEKLGRLAVQLVAGGSGVKNAKITYASAR 
ATDDLDTRLLRAMITKGIIEPISDVYVNLVNADFTAKQRGLRLSEERVLLDGSPESPLET 
ITVQLSNVESKFASSLSESGEVKVEGKVKDGVPHLTKVGSFEVDVTLEGSIILCRQVDQP 
GMIGTVGSILGESNVNVNFMSVGRIAPRKQAIMAIGVDDIPSKETLKKIGEIPAVEEFVF 
LKL*
>AT1G59720.1 |  CRR28 (CHLORORESPIRATORY REDUCTION28) endonuclease 
MVVRSIIVSPPTTITYYHPMSIGLLVHPLSPHIPPASSPSASTAGNHHQRIFSLAETCSD 
MSQLKQLHAFTLRTTYPEEPATLFLYGKILQLSSSFSDVNYAFRVFDSIENHSSFMWNTL 
IRACAHDVSRKEEAFMLYRKMLERGESSPDKHTFPFVLKACAYIFGFSEGKQVHCQIVKH 
GFGGDVYVNNGLIHLYGSCGCLDLARKVFDEMPERSLVSWNSMIDALVRFGEYDSALQLF 
REMQRSFEPDGYTMQSVLSACAGLGSLSLGTWAHAFLLRKCDVDVAMDVLVKNSLIEMYC 
KCGSLRMAEQVFQGMQKRDLASWNAMILGFATHGRAEEAMNFFDRMVDKRENVRPNSVTF 
VGLLIACNHRGFVNKGRQYFDMMVRDYCIEPALEHYGCIVDLIARAGYITEAIDMVMSMP 
MKPDAVIWRSLLDACCKKGASVELSEEIARNIIGTKEDNESSNGNCSGAYVLLSRVYASA 
SRWNDVGIVRKLMSEHGIRKEPGCSSIEINGISHEFFAGDTSHPQTKQIYQQLKVIDDRL 
RSIGYLPDRSQAPLVDATNDGSKEYSLRLHSERLAIAFGLINLPPQTPIRIFKNLRVCND 
CHEVTKLISKVFNTEIIVRDRVRFHHFKDGSCSCLDYW*
>AT2G03670.1 |  CDC48B ATP binding / ATPase/ nucleoside-triphosphatase/ nucleotide binding 
MLETESSVCDNIAGNEKWRAEAEIGGNERALQALRELIIFPFRYPLEARTLGLKWPRGLL 
LYGPPGTGKTSLVRAVVQECDAHLIVLSPHSVHRAHAGESEKVLREAFAEASSHAVSDKP 
SVIFIDEIDVLCPRRDARREQDVRIASQLFTLMDSNKPSSSAPRVVVVASTNRVDAIDPA 
LRRAGRFDALVEVSTPNEEDRLKILQLYTKKVNLDPSVDLQAIAISCNGYVGADLEALCR 
EATISASKRSSDSLILTSQDFKIAKSVVGPSINRGITVEIPKVTWDDVGGLKDLKKKLQQ 
AVEWPIKHSAAFVKMGISPMRGILLHGPPGCSKTTLAKAAANAAQASFFSLSCAELFSMY 
VGEGEALLRNTFQRARLASPSIIFFDEADVVACKRGDESSSNSSTVGERLLSTLLTEMDG 
LEEAKGILVLAATNRPYAIDAALMRPGRFDLVLYVPPPDLEARFEILQVHTRNMTLGDDV 
DLRKIAEETDLFTGAELEGLCRESGTVSLRENIAATAVFNRHFQTAKSSLKPALTIEEVE 
TYSSFRKAAKRSDSKPIPINKKKATSTVFGFSWQLGVLSLLLLATGNYYFNHTKHELLVA 
SAT*
>AT1G31920.1 |  pentatricopeptide (PPR) repeat-containing protein 
MIKAPILQSLLASRDDLTHNPEVNNFGGKEQECLYLLKRCHNIDEFKQVHARFIKLSLFY 
SSSFSASSVLAKCAHSGWENSMNYAASIFRGIDDPCTFDFNTMIRGYVNVMSFEEALCFY 
NEMMQRGNEPDNFTYPCLLKACTRLKSIREGKQIHGQVFKLGLEADVFVQNSLINMYGRC 
GEMELSSAVFEKLESKTAASWSSMVSARAGMGMWSECLLLFRGMCSETNLKAEESGMVSA 
LLACANTGALNLGMSIHGFLLRNISELNIIVQTSLVDMYVKCGCLDKALHIFQKMEKRNN 
LTYSAMISGLALHGEGESALRMFSKMIKEGLEPDHVVYVSVLNACSHSGLVKEGRRVFAE 
MLKEGKVEPTAEHYGCLVDLLGRAGLLEEALETIQSIPIEKNDVIWRTFLSQCRVRQNIE 
LGQIAAQELLKLSSHNPGDYLLISNLYSQGQMWDDVARTRTEIAIKGLKQTPGFSIVELK 
GKTHRFVSQDRSHPKCKEIYKMLHQMEWQLKFEGYSPDLTQILLNVDEEEKKERLKGHSQ 
KVAIAFGLLYTPPGSIIKIARNLRMCSDCHTYTKKISMIYEREIVVRDRNRFHLFKGGTC 
SCKDYW*
>AT5G48910.1 |  pentatricopeptide (PPR) repeat-containing protein 
MNPTQTLFSPGGNSPASSPASHPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAE 
ILRFCATSDLHHRDLDYAHKIFNQMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMS 
DEFVEPNRFTFPSVLKACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMK 
DARVLFYKNIIEKDMVVMTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVV 
SWNTMISGYSLNGFFKDAVEVFREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYA 
EDSGIRIDDVLGSALIDMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAI 
DCFCKMRQAGVRPSDVAYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDL 
LGRSGLLDEAEEFILNMPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAY 
VALSNMYASQGNWSEVSEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEIN 
SMLVEISDKLRLAGYRPITTQVLLNLEEEDKENVLHYHSEKIATAFGLISTSPGKPIRIV 
KNLRICEDCHSSIKLISKVYKRKITVRDRKRFHHFQDGSCSCMDYW*
>AT5G06540.1 |  pentatricopeptide (PPR) repeat-containing protein 
MSNIVLNTLRFKHPKLALLQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTF 
NKPTNLLGYAYGIFSQIQNPNLFVFNLLIRCFSTGAEPSKAFGFYTQMLKSRIWPDNITF 
PFLIKASSEMECVLVGEQTHSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQMGF 
RDVVSWTSMVAGYCKCGMVENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFM 
KREGVVANETVMVSVISSCAHLGALEFGERAYEYVVKSHMTVNLILGTALVDMFWRCGDI 
EKAIHVFEGLPETDSLSWSSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSAC 
SHGGLVEKGLEIYENMKKDHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPI 
LGALLGACKIYKNTEVAERVGNMLIKVKPEHSGYYVLLSNIYACAGQWDKIESLRDMMKE 
KLVKKPPGWSLIEIDGKINKFTMGDDQKHPEMGKIRRKWEEILGKIRLIGYKGNTGDAFF 
DVDEEEKESSIHMHSEKLAIAYGMMKTKPGTTIRIVKNLRVCEDCHTVTKLISEVYGREL 
IVRDRNRFHHFRNGVCSCRDYW*
>AT4G37380.1 |  pentatricopeptide (PPR) repeat-containing protein 
MASSPLLATSLPQNQLSTTATARFRLPPPEKLAVLIDKSQSVDEVLQIHAAILRHNLLLH 
PRYPVLNLKLHRAYASHGKIRHSLALFHQTIDPDLFLFTAAINTASINGLKDQAFLLYVQ 
LLSSEINPNEFTFSSLLKSCSTKSGKLIHTHVLKFGLGIDPYVATGLVDVYAKGGDVVSA 
QKVFDRMPERSLVSSTAMITCYAKQGNVEAARALFDSMCERDIVSWNVMIDGYAQHGFPN 
DALMLFQKLLAEGKPKPDEITVVAALSACSQIGALETGRWIHVFVKSSRIRLNVKVCTGL 
IDMYSKCGSLEEAVLVFNDTPRKDIVAWNAMIAGYAMHGYSQDALRLFNEMQGITGLQPT 
DITFIGTLQACAHAGLVNEGIRIFESMGQEYGIKPKIEHYGCLVSLLGRAGQLKRAYETI 
KNMNMDADSVLWSSVLGSCKLHGDFVLGKEIAEYLIGLNIKNSGIYVLLSNIYASVGDYE 
GVAKVRNLMKEKGIVKEPGISTIEIENKVHEFRAGDREHSKSKEIYTMLRKISERIKSHG 
YVPNTNTVLQDLEETEKEQSLQVHSERLAIAYGLISTKPGSPLKIFKNLRVCSDCHTVTK 
LISKITGRKIVMRDRNRFHHFTDGSCSCGDFW*
>AT2G36730.1 |  pentatricopeptide (PPR) repeat-containing protein 
MIWSSDSCFKSRKHQCLIFLKLCSSIKHLLQIHGQIHLSSLQNDSFIISELVRVSSLSLA 
KDLAFARTLLLHSSDSTPSTWNMLSRGYSSSDSPVESIWVYSEMKRRGIKPNKLTFPFLL 
KACASFLGLTAGRQIQVEVLKHGFDFDVYVGNNLIHLYGTCKKTSDARKVFDEMTERNVV 
SWNSIMTALVENGKLNLVFECFCEMIGKRFCPDETTMVVLLSACGGNLSLGKLVHSQVMV 
RELELNCRLGTALVDMYAKSGGLEYARLVFERMVDKNVWTWSAMIVGLAQYGFAEEALQL 
FSKMMKESSVRPNYVTFLGVLCACSHTGLVDDGYKYFHEMEKIHKIKPMMIHYGAMVDIL 
GRAGRLNEAYDFIKKMPFEPDAVVWRTLLSACSIHHDEDDEGIGEKVKKRLIELEPKRSG 
NLVIVANRFAEARMWAEAAEVRRVMKETKMKKIAGESCLELGGSFHRFFSGYDPRSEYVS 
IYELLDLFKFQLTCDYRLVSE*
>AT1G06145.1 |  EMB1444 (EMBRYO DEFECTIVE 1444) 
MNAFANVHSLRVPSHHLRDFSASLSLAPPNLKKIIKQCSTPKLLESALAAMIKTSLNQDC 
RLMNQFITACTSFKRLDLAVSTMTQMQEPNVFVYNALFKGFVTCSHPIRSLELYVRMLRD 
SVSPSSYTYSSLVKASSFASRFGESLQAHIWKFGFGFHVKIQTTLIDFYSATGRIREARK 
VFDEMPERDDIAWTTMVSAYRRVLDMDSANSLANQMSEKNEATSNCLINGYMGLGNLEQA 
ESLFNQMPVKDIISWTTMIKGYSQNKRYREAIAVFYKMMEEGIIPDEVTMSTVISACAHL 
GVLEIGKEVHMYTLQNGFVLDVYIGSALVDMYSKCGSLERALLVFFNLPKKNLFCWNSII 
EGLAAHGFAQEALKMFAKMEMESVKPNAVTFVSVFTACTHAGLVDEGRRIYRSMIDDYSI 
VSNVEHYGGMVHLFSKAGLIYEALELIGNMEFEPNAVIWGALLDGCRIHKNLVIAEIAFN 
KLMVLEPMNSGYYFLLVSMYAEQNRWRDVAEIRGRMRELGIEKICPGTSSIRIDKRDHLF 
AAADKSHSASDEVCLLLDEIYDQMGLAGYVQETENVY*
>AT5G40405.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown CONTAINS InterPro DOMAIN/s Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G665201) Has 12548 Blast hits to 5076 proteins in 168 species Archae - 0 Bacteria - 2 Metazoa - 87 Fungi - 66 Plants - 12146 Viruses - 0 Other Eukaryotes - 247 (source NCBI BLink) 
MSRIGKHPAIALLDSGITFKEVRQIHAKLYVDGTLKDDHLVGHFVKAVALSDHKYLDYAN 
QILDRSEKPTLFALNSMIRAHCKSPVPEKSFDFYRRILSSGNDLKPDNYTVNFLVQACTG 
LRMRETGLQVHGMTIRRGFDNDPHVQTGLISLYAELGCLDSCHKVFNSIPCPDFVCRTAM 
VTACARCGDVVFARKLFEGMPERDPIAWNAMISGYAQVGESREALNVFHLMQLEGVKVNG 
VAMISVLSACTQLGALDQGRWAHSYIERNKIKITVRLATTLVDLYAKCGDMEKAMEVFWG 
MEEKNVYTWSSALNGLAMNGFGEKCLELFSLMKQDGVTPNAVTFVSVLRGCSVVGFVDEG 
QRHFDSMRNEFGIEPQLEHYGCLVDLYARAGRLEDAVSIIQQMPMKPHAAVWSSLLHASR 
MYKNLELGVLASKKMLELETANHGAYVLLSNIYADSNDWDNVSHVRQSMKSKGVRKQPGC 
SVMEVNGEVHEFFVGDKSHPKYTQIDAVWKDISRRLRLAGYKADTTPVMFDIDEEEKEDA 
LCLHSEKAAIAFGIMSLKEDVPIRIVKNLRVCGDCHQVSMMISKIFNREIIVRDRNRFHH 
FKDGHCSCNGFW*
>AT3G13640.1 |  ATRLI1 transporter 
MSDRLTRIAIVSEDRCKPKKCRQECKKSCPVVKTGKLCIEVGSTSKSAFISEELCIGCGI 
CVKKCPFEAIQIINLPKDLAKDTTHRYGANGFKLHRLPIPRPGQVLGLVGTNGIGKSTAL 
KILAGKLKPNLGRFNTPPDWEEILTHFRGSELQSYFIRVVEENLKTAIKPQHVDYIKEVV 
RGNLGKMLEKLDERGLMEEICADMELNQVLEREARQVSGGELQRFAIAAVFVKKADIYMF 
DEPSSYLDVRQRLKAAQVIRSLLRHDSYVIVVEHDLSVLDYLSDFVCCLYGKPGAYGVVT 
LPFSVREGINVFLAGFIPTENLRFRDESLTFRVSETTQENDGEVKSYARYKYPNMTKQLG 
DFKLEVMEGEFTDSQIIVMLGENGTGKTTFIRMLAGAFPREEGVQSEIPEFNVSYKPQGN 
DSKRECTVRQLLHDKIRDACAHPQFMSDVIRPLQIEQLMDQVVKTLSGGEKQRVAITLCL 
GKPADIYLIDEPSAHLDSEQRITASKVIKRFILHAKKTAFIVEHDFIMATYLADRVIVYE 
GQPAVKCIAHSPQSLLSGMNHFLSHLNITFRRDPTNFRPRINKLESIKDKEQKTAGSYYY 
LDD*
>AT5G01320.1 |  pyruvate decarboxylase putative 
MDTKIGAIDTCKPTTGDIGSPPSNAVATIQDSAPITTTSESTLGRHLSRRLVQAGVTDVF 
SVPGDFNLTLLDHLIAEPELNNIGCCNELNAGYAADGYARSRGVGACVVTFTVGGLSVLN 
AIAGAYSENLPVICIVGGPNSNDFGTNRILHHTIGLPDFSQELRCFQTVTCYQAVVNNLE 
DAHEQIDKAIATALKESKPVYISISCNLAATPHPTFARDPVPFDLTPRMSNTMGLEAAVE 
ATLEFLNKAVKPVMVGGPKLRVAKASEAFLELADASGYPLAVMPSTKGLVPENHPHFIGT 
YWGAVSTPFCSEIVESADAYIFAGPIFNDYSSVGYSLLLKKEKAIIVHPDRVVVANGPTF 
GCVLMSDFFRELAKRVKRNETAYENYERIFVPEGKPLKCKPGEPLRVNAMFQHIQKMLSS 
ETAVIAETGDSWFNCQKLKLPKGCGYEFQMQYGSIGWSVGATLGYAQATPEKRVLSFIGD 
GSFQVTAQDISTMIRNGQKAIIFLINNGGYTIEVEIHDGPYNVIKNWNYTGLVDAIHNGE 
GKCWTTKVRYEEELVEAIKTATTEKKDSLCFIEVIVHKDDTSKELLEWGSRVSAANGRPP 
NPQ*
>AT2G17650.1 |  AMP-dependent synthetase and ligase family protein 
MRFLLTKRAFRIFNPRFQRLWLTSSPFSSTSNSGGFPDDSEPESWRTIEGLLRSPANFSP 
LSPITFLERSAKVYRDRTSLVFGSVKHTWFQTYQRCLRLASALTNLGISRGDVVAALAPN 
VPAMHELHFAVPMAGLILCPLNTRLDPSTLSVLLAHSEAKILFVDHQLLEIAHGALDLLA 
KSDKTRKSLKLVLISQSNDDDDSDEDSSSTFASKYSFDYEYETLLKSGDSEFEIIKPRCE 
WDPISINYTSGTTSRPKGVVYSHRGAYLNSLATVFLHQMSVYPVYLWTVPMFHCNGWCLV 
WGVAAQGGTNICLRKVSPKMIFKNIAMHKVTHMGGAPTVLNMIVNYTVTEHKPLPHRVEI 
MTGGSPPLPQILAKMEELGFNVSHLYGLTETYGPGTHCVWKPEWDSLSLEERTKLKARQG 
VQHLGLEGLDVKDPLTMETVPDDGLTMGEVMFRGNTVMSGYFKDIEATRKAFEGDWFHSG 
DLAVKYPDGYIEIKDRLKDVIISGGENISSVEVERVLCSHQAVLEAAVVARPDHHWGQTP 
CGFVKLKEGFDTIKPEEIIGFCRDHLPHYMAPKTIVFGDIPKTSTGKVQKYLLRKKADEM 
GSL*