>AT1G12620.1 |  pentatricopeptide (PPR) repeat-containing protein 
MRGLIQTRLLETGTLRTALFLSCYGRVFSSVSDGKGKVSYRERLRSGIVDIKEDDAVDLF 
QEMTRSRPRPRLIDFSRLFSVVARTKQYDLVLDLCKQMELKGIAHNLYTLSIMINCCCRC 
RKLSLAFSAMGKIIKLGYEPDTVTFSTLINGLCLEGRVSEALELVDRMVEMGHKPTLITL 
NALVNGLCLNGKVSDAVLLIDRMVETGFQPNEVTYGPVLKVMCKSGQTALAMELLRKMEE 
RKIKLDAVKYSIIIDGLCKDGSLDNAFNLFNEMEIKGFKADIIIYTTLIRGFCYAGRWDD 
GAKLLRDMIKRKITPDVVAFSALIDCFVKEGKLREAEELHKEMIQRGISPDTVTYTSLID 
GFCKENQLDKANHMLDLMVSKGCGPNIRTFNILINGYCKANLIDDGLELFRKMSLRGVVA 
DTVTYNTLIQGFCELGKLEVAKELFQEMVSRRVRPDIVSYKILLDGLCDNGEPEKALEIF 
EKIEKSKMELDIGIYNIIIHGMCNASKVDDAWDLFCSLPLKGVKPDVKTYNIMIGGLCKK 
GSLSEADLLFRKMEEDGHSPNGCTYNILIRAHLGEGDATKSAKLIEEIKRCGFSVDASTV 
KMVVDMLSDGRLKKSFLDMLS*
>AT2G29690.1 |  ASA2 (ANTHRANILATE SYNTHASE 2) anthranilate synthase 
MSAVSISAVKSDFFTVEAIAVTHHRTPHPPHFPSLRFPLSLKSPPATSLNLVAGSKLLHF 
SRRLPSIKCSYTPSLDLSEEQFTKFKKASEKGNLVPLFRCVFSDHLTPILAYRCLVKEDD 
RDAPSFLFESVEPGSQSSNIGRYSVVGAQPTIEIVAKGNVVTVMDHGASLRTEEEVDDPM 
MVPQKIMEEWNPQGIDELPEAFCGGWVGYFSYDTVRYVEKKKLPFSNAPEDDRSLPDVNL 
GLYDDVIVFDHVEKKAYVIHWVRIDKDRSVEENFREGMNRLESLTSRIQDQKPPKMPTGF 
IKLRTQLFGPKLEKSTMTSEAYKEAVVEAKEHILAGDIFQIVLSQRFERRTFADPFEIYR 
ALRIVNPSPYMAYLQVRGCILVASSPEILLRSKNRKITNRPLAGTVRRGKTPKEDLMLEK 
ELLSDEKQCAEHIMLVDLGRNDVGKVSKPGSVEVKKLKDIEWFSHVMHISSTVVGELLDH 
LTSWDALRAVLPVGTVSGAPKVKAMELIDELEVTRRGPYSGGFGGISFNGDMDIALALRT 
MVFPTNTRYDTLYSYKHPQRRREWIAHIQAGAGIVADSNPDDEHRECENKAAALARAIDL 
AESSFLEAPEFTTITPHINNI*
>AT2G46440.1 |  ATCNGC11 (CYCLIC NUCLEOTIDE-GATED CHANNELS) calmodulin binding / cation channel/ cyclic nucleotide binding / ion channel 
MNLQRRKFVRLDSTGVDGKLKSVRGRLKKVYGKMKTLENWRKTVLLACVVALAIDPLFLF 
IPLIDSQRFCFTFDKTLVAVVCVIRTFIDTFYVIHIIYYLITETIAPRSQASLRGEIVVH 
SKATLKTRLLFHFIVDIISVLPIPQVVVLTLIPLSASLVSERILKWIILSQYVPRIIRMY 
PLYKEVTRAFGTVAESKRVGAALNFFLYMLHSYVCGAFWYLSSIERKSTCWRAACARTSD 
CNLTVTDLLCKRAGSDNIRFLNTSCPLIDPAQITNSTDFDFGMYIDALKSGVLEVKPKDF 
PRKFVYCFWWGLRNISALGQNLETSNSAGEIFFAIIICVSGLLLFAVLIGNVQKYLQSST 
TRVDEMEEKKRDTEKWMSYREIPEYLKERIRRFEDYKWRRTKGTEEEALLRSLPKDLRLE 
TKRYLFLKLLKKVPLLQAMDDQLLDALCARLKTVHYTEKSYIVREGEPVEDMLFIMRGNL 
ISTTTYGGRTGFFNSVDLIAGDSCGDLLTWALYSLSSQFPISSRTVQALTEVEGFVISAD 
DLKFVATQYRRLHSKQLQHMFRFYSLQWQTWAACFIQAAWKRHCRRKLSKALREEEGKLH 
NTLQNDDSGGNKLNLGAAIYA*
>AT1G64100.1 |  pentatricopeptide (PPR) repeat-containing protein 
MTLQSQSSHCLNVHSSMVRDIEKCEQKTKRDIEKKNTKSGGVRLNSRRLIHGRVAEKGTK 
SLPSLTQVTFEGEELKLKSGSHYFKSLDDAIDFFDYMVRSRPFYTAVDCNKVIGVFVRMN 
RPDVAISLYRKMEIRRIPLNIYSFNILIKCFCDCHKLSFSLSTFGKLTKLGFQPDVVTFN 
TLLHGLCLEDRISEALALFGYMVETGFLEAVALFDQMVEIGLTPVVITFNTLINGLCLEG 
RVLEAAALVNKMVGKGLHIDVVTYGTIVNGMCKMGDTKSALNLLSKMEETHIKPDVVIYS 
AIIDRLCKDGHHSDAQYLFSEMLEKGIAPNVFTYNCMIDGFCSFGRWSDAQRLLRDMIER 
EINPDVLTFNALISASVKEGKLFEAEKLCDEMLHRCIFPDTVTYNSMIYGFCKHNRFDDA 
KHMFDLMASPDVVTFNTIIDVYCRAKRVDEGMQLLREISRRGLVANTTTYNTLIHGFCEV 
DNLNAAQDLFQEMISHGVCPDTITCNILLYGFCENEKLEEALELFEVIQMSKIDLDTVAY 
NIIIHGMCKGSKVDEAWDLFCSLPIHGVEPDVQTYNVMISGFCGKSAISDANVLFHKMKD 
NGHEPDNSTYNTLIRGCLKAGEIDKSIELISEMRSNGFSGDAFTIKMVADLITDGRLDKS 
FSDMLS*
>AT1G64100.2 |  pentatricopeptide (PPR) repeat-containing protein 
MTLQSQSSHCLNVHSSMVRDIEKCEQKTKRDIEKKNTKSGGVRLNSRRLIHGRVAEKGTK 
SLPSLTQVTFEGEELKLKSGSHYFKSLDDAIDFFDYMVRSRPFYTAVDCNKVIGVFVRMN 
RPDVAISLYRKMEIRRIPLNIYSFNILIKCFCDCHKLSFSLSTFGKLTKLGFQPDVVTFN 
TLLHGLCLEDRISEALALFGYMVETGFLEAVALFDQMVEIGLTPVVITFNTLINGLCLEG 
RVLEAAALVNKMVGKGLHIDVVTYGTIVNGMCKMGDTKSALNLLSKMEETHIKPDVVIYS 
AIIDRLCKDGHHSDAQYLFSEMLEKGIAPNVFTYNCMIDGFCSFGRWSDAQRLLRDMIER 
EINPDVLTFNALISASVKEGKLFEAEKLCDEMLHRCIFPDTVTYNSMIYGFCKHNRFDDA 
KHMFDLMASPDVVTFNTIIDVYCRAKRVDEGMQLLREISRRGLVANTTTYNTLIHGFCEV 
DNLNAAQDLFQEMISHGVCPDTITCNILLYGFCENEKLEEALELFEVIQMSKIDLDTVAY 
NIIIHGMCKGSKVDEAWDLFCSLPIHGVEPDVQTYNVMISGFCGKSAISDANVLFHKMKD 
NGHEPDNSTYNTLIRGCLKAGEIDKSIELISEMRSNGFSGDAFTIKMAEEIICRVSDEEI 
IENYLRPKINGETSSIPRYVVELAEELYTVEPWLLPRQTAPILNPGEWFYFGKRNRKYSN 
LEGVHCEGSWILEDGCIAVLSKETGEEIGGTTRFRYYYRNKGDKESRLKMSNWFMREYRL 
YYKSRRVFNGRQVFCIITCNDEHFIE*
>AT1G12775.1 |  LOCATED IN: mitochondrion; EXPRESSED IN: 7 plant structures; EXPRESSED DURING: F mature embryo stage, petal differentiation and expansion stage, D bilateral stage, E expanded cotyledon stage; CONTAINS InterPro DOMAIN/s: Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: pentatricopeptide (PPR) repeat-containing protein (TAIR:AT1G12300.1); Has 27792 Blast hits to 6269 proteins in 197 species: Archae - 2; Bacteria - 23; Metazoa - 895; Fungi - 767; Plants - 24554; Viruses - 0; Other Eukaryotes - 1551 (source: NCBI BLink)
MVRMMIRRLSSQASRFVQPRLLETGTLRIALINCPNELLFCCERGFSTFSDRNLSYRDKL 
SSGLVGIKADDAVDLFRDMIQSRPLPTVIDFNRLFSAIAKTKQYELVLALCKQMESKGIA 
HSIYTLSIMINCFCRCRKLSYAFSTMGKIMKLGYEPDTVIFNTLLNGLCLECRVSEALEL 
VDRMVEMGHKPTLITLNTLVNGLCLNGKVSDAVVLIDRMVETGFQPNEVTYGPVLNVMCK 
SGQTALAMELLRKMEERNIKLDAVKYSIIIDGLCKDGSLDNAFNLFNEMEIKGFKADIIT 
YNTLIGGFCNAGRWDDGAKLLRDMIKRKISPNVVTFSVLIDSFVKEGKLREADQLLKEMM 
QRGIAPNTITYNSLIDGFCKENRLEEAIQMVDLMISKGCDPDIMTFNILINGYCKANRID 
DGLELFREMSLRGVIANTVTYNTLVQGFCQSGKLEVAKKLFQEMVSRRVRPDIVSYKILL 
DGLCDNGELEKALEIFGKIEKSKMELDIGIYMIIIHGMCNASKVDDAWDLFCSLPLKGVK 
LDARAYNIMISELCRKDSLSKADILFRKMTEEGHAPDELTYNILIRAHLGDDDATTAAEL 
IEEMKSSGFPADVSTVKMVINMLSSGELDKSFLDMLSTTRASLK*
>AT3G48250.1 |  pentatricopeptide (PPR) repeat-containing protein 
MYRSMAILSSLRHAYSQISTRSYLSRSKVGFSSNLSSPLDSFAIVPSRFLWKFRTFSSKP 
DSMLQLVLENDWSKEVEEGLRKPDMSLTHETAIYVLRKLEKYPEKAYYFLDWVLRDSGLS 
PSTPLYSIMLRILVQQRSMKRFWMTLREMKQGGFYLDEDTYKTIYGELSKEKSKADAVAV 
AHFYERMLKENAMSVVAGEVSAVVTKGDWSCEVERELQEMKLVLSDNFVIRVLKELREHP 
LKALAFFHWVGGGGSSSGYQHSTVTYNAALRVLARPNSVAEFWSVVDEMKTAGYDMDLDT 
YIKVSRQFQKSRMMAETVKLYEYMMDGPFKPSIQDCSLLLRYLSGSPNPDLDLVFRVSRK 
YESTGKSLSKAVYDGIHRSLTSVGRFDEAEEITKAMRNAGYEPDNITYSQLVFGLCKAKR 
LEEARGVLDQMEAQGCFPDIKTWTILIQGHCKNNELDKALACFANMLEKGFDIDSNLLDV 
LIDGFVIHNKFEGASIFLMEMVKNANVKPWQSTYKLLIDKLLKIKKSEEALDLLQMMKKQ 
NYPAYAEAFDGYLAKFGTLEDAKKFLDVLSSKDSPSFAAYFHVIEAFYREGRLTDAKNLL 
FICPHHFKTHPKISELFGAAA*
>AT3G22470.1 |  pentatricopeptide (PPR) repeat-containing protein 
MIQRLIPLNRKASNFTQILEKGTSLLHYSSITEAKLSYKERLRNGIVDIKVNDAIDLFES 
MIQSRPLPTPIDFNRLCSAVARTKQYDLVLGFCKGMELNGIEHDMYTMTIMINCYCRKKK 
LLFAFSVLGRAWKLGYEPDTITFSTLVNGFCLEGRVSEAVALVDRMVEMKQRPDLVTVST 
LINGLCLKGRVSEALVLIDRMVEYGFQPDEVTYGPVLNRLCKSGNSALALDLFRKMEERN 
IKASVVQYSIVIDSLCKDGSFDDALSLFNEMEMKGIKADVVTYSSLIGGLCNDGKWDDGA 
KMLREMIGRNIIPDVVTFSALIDVFVKEGKLLEAKELYNEMITRGIAPDTITYNSLIDGF 
CKENCLHEANQMFDLMVSKGCEPDIVTYSILINSYCKAKRVDDGMRLFREISSKGLIPNT 
ITYNTLVLGFCQSGKLNAAKELFQEMVSRGVPPSVVTYGILLDGLCDNGELNKALEIFEK 
MQKSRMTLGIGIYNIIIHGMCNASKVDDAWSLFCSLSDKGVKPDVVTYNVMIGGLCKKGS 
LSEADMLFRKMKEDGCTPDDFTYNILIRAHLGGSGLISSVELIEEMKVCGFSADSSTIKM 
VIDMLSDRRLDKSFLDMLS*
>AT1G51580.1 |  KH domain-containing protein 
MEFSTSKRPATTATAAESVHFRLLCPATRTGAIIGKGGSVIRHLQSVTGSKIRVIDDIPV 
PSEERVVLIIAPSGKKKDESNVCDSENPGSEEPKQEKGSECAGTSGGDDEEAPSSAQMAL 
LRVFERIVFGDDAATVDGDELDKGESEGLCRMIVRGNQVDYLMSKGGKMIQKIREDSGAI 
VRISSTDQIPPCAFPGDVVIQMNGKFSSVKKALLLVTNCLQESGAPPTWDECPFPQPGYP 
PEYHSMEYHPQWDHPPPNPMPEDVGPFNRPVVEEEVAFRLLCPADKVGSLIGKGGAVVRA 
LQNESGASIKVSDPTHDSEERIIVISARENLERRHSLAQDGVMRVHNRIVEIGFEPSAAV 
VARLLVHSPYIGRLLGKGGHLISEMRRATGASIRVFAKDQATKYESQHDEIVQVIGNLKT 
VQDALFQILCRLREAMFPGRLPFQGMGGPPPPFMGPYPEPPPPFGPRQYPASPDRYHSPV 
GPFHERHCHGPGFDRPPGPGFDRPPSPMSWTPQPGIDGHPGGMVPPDVNHGFALRNEPIG 
SENPVMTSANVEIVIPQAYLGHVYGENCSNLNYIKQVSGANVVVHDPKAGTTEGLVVVSG 
TSDQAHFAQSLLHAFILCGQS*
>AT4G09730.1 |  DEAD/DEAH box helicase putative 
MVGASRTILSLSLSSSLFTFSKIPHVFPFLRLHKPRFHHAFRPLYSAAATTSSPTTETNV 
TDPDQLKHTILLERLRLRHLKESAKPPQQRPSSVVGVEEESSIRKKSKKLVENFQELGLS 
EEVMGALQELNIEVPTEIQCIGIPAVMERKSVVLGSHTGSGKTLAYLLPIVQLMREDEAN 
LGKKTKPRRPRTVVLCPTRELSEQVYRVAKSISHHARFRSILVSGGSRIRPQEDSLNNAI 
DMVVGTPGRILQHIEEGNMVYGDIAYLVLDEADTMFDRGFGPEIRKFLAPLNQRALKTND 
QGFQTVLVTATMTMAVQKLVDEEFQGIEHLRTSTLHKKIANARHDFIKLSGGEDKLEALL 
QVLEPSLAKGSKVMVFCNTLNSSRAVDHYLSENQISTVNYHGEVPAEQRVENLKKFKDEE 
GDCPTLVCTDLAARGLDLDVDHVVMFDFPKNSIDYLHRTGRTARMGAKGKVTSLVSRKDQ 
MLAARIEEAMRNNESLESLTTDNVRRDAARTHITQEKGRSVKQIREVSKQRNSRDKPSSS 
SPPARSTGGKTPVRKSSSSSFSKPRKASSPPEKSSKPKRKILKTVGSRSIAARGKTGSDR 
RPGKKLSVVGFRGKSSSARAS*
>AT1G61640.1 |  ABC1 family protein 
MTLISRFLISRVISRTLFSNQNNRTAINVSVKLPQFRIQSNGYHTLGLLHNVKGRFLGSN 
HHQLARRSYSIAVASNVVKQHAQVSWGRLVQRVTLSRSWNLPRISQIAQACSLSLARSHF 
LLPGLLALTYRQVAYAQRVVPNPVVYSPSHISPYRSSINFPIVLSSLVFSAVKGVVLIGR 
ALYLAILFSPNVIMALLGFACGPRYRQLQYEVLHRTLEKAGPAFIKFGQWIATRPDRFNK 
DLCLQLSKLHSNAPEHSFAFTKKSIENAFGRKLSEIFEEFDEAPVASGSIAQVHRASLKF 
QYAGQKVKSSEVAVKVRHPCVEETMKRDFVIINFVARLTTFIPGLNWLRLDECVQQFSVY 
MLSQVDLSREASHLSRFIYNFRGWKDVSFPKPIYPLIHPAVLVETYEHGESVARYVDGSE 
GQEKLKAKVAHIGTNALLKMLLVDNFIHADMHPGNILVRPNNTRRGLFRSRKPHIVFLDV 
GMTAELSKTDRDNLLGFFKAVARRDGRTAAERTLKLSKQQNCPDPQAFIKEVEEAFTFWG 
TEEGDLVHPADCMHELFEKMRSHRVNIDGNVSTVMFTTLVLEGWQRKLDPGYDVMRTLQT 
MLLKTDWMKSLSYTIDGLMAP*
>AT1G61640.2 |  ABC1 family protein 
MTLISRFLISRVISRTLFSNQNNRTAINVSVKLPQFRIQSNGYHTLGLLHNVKGRFLGSN 
HHQLARRSYSIAVASNVVKQHAQVSWGRLVQRVTLSRSWNLPRISQIAQACSLSLARSHF 
LLPGLLALTYRQVAYAQRVVPNPVVYSPSHISPYRSSINFPIVLSSLVFSAVKGVVLIGR 
ALYLAILFSPNVIMALLGFACGPRYRQLQYEVLHRTLEKAGPAFIKFGQWIATRPDRFNK 
DLCLQLSKLHSNAPEHSFAFTKKSIENAFGRKLSEIFEEFDEAPVASGSIAQVHRASLKF 
QYAGQKVKSSEVAVKVRHPCVEETMKRDFVIINFVARLTTFIPGLNWLRLDECVQQFSVY 
MLSQVDLSREASHLSRFIYNFRGWKDVSFPKPIYPLIHPAVLVETYEHGESVARYVDGSE 
GQEKLKAKVAHIGTNALLKMLLVLSLSSQCIFLFISVTYVTYSCFDSCFSGRQLHSC*