>AT5G52640.1 | ATHSP901 (HEAT SHOCK PROTEIN 901) ATP binding / unfolded protein binding
MADVQMADAETFAFQAEINQLLSLIINTFYSNKEIFLRELISNSSDALDKIRFESLTDKS
KLDGQPELFIRLVPDKSNKTLSIIDSGIGMTKADLVNNLGTIARSGTKEFMEALQAGADV
SMIGQFGVGFYSAYLVAEKVVVTTKHNDDEQYVWESQAGGSFTVTRDVDGEPLGRGTKIT
LFLKDDQLEYLEERRLKDLVKKHSEFISYPIYLWTEKTTEKEISDDEDEDEPKKENEGEV
EEVDEEKEKDGKKKKKIKEVSHEWELINKQKPIWLRKPEEITKEEYAAFYKSLTNDWEDH
LAVKHFSVEGQLEFKAILFVPKRAPFDLFDTRKKLNNIKLYVRRVFIMDNCEELIPEYLS
FVKGVVDSDDLPLNISRETLQQNKILKVIRKNLVKKCIEMFNEIAENKEDYTKFYEAFSK
NLKLGIHEDSQNRGKIADLLRYHSTKSGDEMTSFKDYVTRMKEGQKDIFYITGESKKAVE
NSPFLERLKKRGYEVLYMVDAIDEYAVGQLKEYDGKKLVSATKEGLKLEDETEEEKKKRE
EKKKSFENLCKTIKEILGDKVEKVVVSDRIVDSPCCLVTGEYGWTANMERIMKAQALRDS
SMSGYMSSKKTMEINPDNGIMEELRKRAEADKNDKSVKDLVMLLYETALLTSGFSLDEPN
TFAARIHRMLKLGLSIDEDENVEEDGDMPELEEDAAEESKMEEVD*
>AT3G55080.1 | SET domain-containing protein
MLFCISTVKLFGFQQRRNVSSLAKRFSLAGKLTLELQTQASLDNNFLPWLERIAGAKITN
TLSIGKSTYGRSLFASKVIYAGDCMLKVPFNAQITPDELPSDIRVLLSNEVGNIGMLAAV
LIREKKMGQKSRWVPYISRLPQPAEMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDF
SFVAQAFKQHCPIVTERPDLEDFMYAYALVGSRAWENSKRISLIPFADFMNHDGLSASIV
LRDEDNQLSEVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTFPYNIHDEVQIQMDVPND
DPLRNMKLGLLQTHHTRTVKDINIFHSSCDTFTIKEVKSAIGKGKGIPQSLRAFARVLCC
IIPQELNDLSKEAAQNDGRLARLPFKDGNRELEAHKILLSHINRLIEDHSVCIKEMEECY
FVSQRFAVRRQMARDLLYGELRVLRSAAEWLNHYCTTLLSETM*
>AT3G55080.1 | SET domain-containing protein
MLFCISTVKLFGFQQRRNVSSLAKRFSLAGKLTLELQTQASLDNNFLPWLERIAGAKITN
TLSIGKSTYGRSLFASKVIYAGDCMLKVPFNAQITPDELPSDIRVLLSNEVGNIGMLAAV
LIREKKMGQKSRWVPYISRLPQPAEMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDF
SFVAQAFKQHCPIVTERPDLEDFMYAYALVGSRAWENSKRISLIPFADFMNHDGLSASIV
LRDEDNQLSEVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTFPYNIHDEVQIQMDVPND
DPLRNMKLGLLQTHHTRTVKDINIFHSSCDTFTIKEVKSAIGKGKGIPQSLRAFARVLCC
IIPQELNDLSKEAAQNDGRLARLPFKDGNRELEAHKILLSHINRLIEDHSVCIKEMEECY
FVSQRFAVRRQMARDLLYGELRVLRSAAEWLNHYCTTLLSETM*
>AT3G55080.2 | SET domain-containing protein
MLAAVLIREKKMGQKSRWVPYISRLPQPAEMHSSIFWGEDELSMIRCSAVHQETVKQKAQ
IEKDFSFVAQAFKQHCPIVTERPDLEDFMYAYALVGSRAWENSKRISLIPFADFMNHDGL
SASIVLRDEDNQLSEFSTLQVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTFPYNIHDE
VQIQMDVPNDDPLRNMKLGLLQTHHTRTVKDINIFHSSCDTFTIKEVKSAIGKGKGIPQS
LRAFARVLCCIIPQELNDLSKEAAQNDGRLARLPFKDGNRELEAHKILLSHINRLIEDHS
VCIKEMEECYFVSQRFAVRRQMARDLLYGELRVLRSAAEWLNHYCTTLLSETM*
>AT3G55080.2 | SET domain-containing protein
MLAAVLIREKKMGQKSRWVPYISRLPQPAEMHSSIFWGEDELSMIRCSAVHQETVKQKAQ
IEKDFSFVAQAFKQHCPIVTERPDLEDFMYAYALVGSRAWENSKRISLIPFADFMNHDGL
SASIVLRDEDNQLSEFSTLQVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTFPYNIHDE
VQIQMDVPNDDPLRNMKLGLLQTHHTRTVKDINIFHSSCDTFTIKEVKSAIGKGKGIPQS
LRAFARVLCCIIPQELNDLSKEAAQNDGRLARLPFKDGNRELEAHKILLSHINRLIEDHS
VCIKEMEECYFVSQRFAVRRQMARDLLYGELRVLRSAAEWLNHYCTTLLSETM*
>AT1G02680.1 | TAF13 (TBP-ASSOCIATED FACTOR 13) DNA binding / RNA polymerase II transcription factor
MSNTPAAAASSSSKSKAAGTSQPQEKRKTLFQKELQHMMYGFGDEQNPLPESVALVEDIV
VEYVTDLTHKAQEIGSKRGRLLVDDFLYLIRKDLPKLNRCRELLAMQEELKQARKAFDVD
EKELVD*
>AT3G22142.1 | structural constituent of cell wall
MGFRTRNLSFLILLLLNFFAATYARECSPPKPSPKPHKPPKHSVVPPKPPAVKPHPHPKP
PTIKPPPPKRHPHPKPPTVKPHPHPKPPTKPHPHPKPPTKPHPHPKPPTIKPPPHPKPRP
HPKPPNVKPHPHPKPPTKPHPHPKPPTKHHPHPKPPTIKPPPKPPSVKPPPSTPKPPTTN
PPPSTPQPPTHKPPPCTPTPPVASPPMATPPTQMPPIATPPIAKSPVATPPIATPPTATP
PITIPPVATPPITTPPIANPPIIMPPIATPPVAAPPITNPPISKPPVTTPPTTTPPIAKP
PIATPPISTPPAATPPAATPPITTLPPAKPPVAISPIVTPPVTPIAQPPVATPPTATPPV
ATPPIATPPTSKSPISTPPISESPVATPPTATSPIKTPPPAKPPVATPPIAKSPIATPPT
ATPPVATPPIEKPPVATPPTTTPPTATPPVAKPPVETPPIATPPTAKPPISTPPISKPPV
ATPPAATPPITTPTPVKPPVATPPLAIPPVAKPPVVTPPTATPPIATPPIAKSPVATPPT
ATPPVATPPIAKPPVVTPPTTTPPTATPPVAKPPVATPPIATPPTAKPPISTPPISKSPV
ATPPAATPPITTPPPAKPPVATPPIATPPIAKPPVATPPTATPPIATSPVAKPPVATPPI
KTPPPAKPPVAIPPIATPPVAKPPVATPPTATPPIATPPIATPPVVTPPTATPPVATPPI
AKPPTTIPPTATPPVAMPPIATPPTAKPPIATPPIAIPPVAKPPVVTPPTATPPIATPPI
AKSPVATPPTATPPVATPPIAKPPVATPPTTAPPTATPPVAKPPVATPPIATPPTAKPPI
LTPPISKPPVATPPAATPPITTPPPAKPPVATPPIATPPIAKPPVATPPTATPPIATSPV
AKPPVAIPPIKTPPPAKPPVAIPPIATPPVAKPPVATPPTATPPIATSPIATPPVVTPPT
ATSPVATPPIAKPPTTTPPTATPPVAMPPIATPPTAKPPVATPPIANPPVEKPPVATPPI
AKPPTVLPPIAKPPVETSPTATPPTATPPVAIPPVVKPPVAIPPITKPPVATPPVTNPPT
AMPPIVTPPPIVTPPIAKSPIATPPVSTPPIAKPPIATPPVATTPIAKPPIATPPTANPP
VANPPIAKSPIAKPPIATPPTAMPSIATPPIGKPPVATPPMAKPPVASPPIATPPIIKPP
VATPPITKPPVATPPVATPPIAKPPVATSPIETPPVAKPPVTTPPVATPPIVKPPIVTPP
IATPPIAKSPIAPPPIGTPPIAKPPVATPPTATPPVATSPIAKPPVATPPPATPPVAKPP
VATPPTVTPPVATPPIAKPPGARPPVATPPVATPPIAKSPVATPPMTKPPVASSPIATPP
IAKTPIATPPTTMPKTCPIDTLKLGSCVDLLGGLVHIGIGKSAKEKCCPVVEGLVDLDAA
VCLCTTIKAKLLNIDVILPIALEVLLNCGKNPPPGFKCPA*
>AT5G65180.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown EXPRESSED IN 18 plant structures EXPRESSED DURING 9 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF618 (InterProIPR006903) Regulation of nuclear pre-mRNA protein (InterProIPR006569) ENTH/VHS (InterProIPR008942) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G100601) Has 3523 Blast hits to 3241 proteins in 333 species Archae - 25 Bacteria - 251 Metazoa - 1575 Fungi - 421 Plants - 142 Viruses - 17 Other Eukaryotes - 1092 (source NCBI BLink)
MSSPFSEEILIDNLAKLNSTQQSIQTLSQWCIVHRSEAELVVTTWEKQFHSTQIGQKVPL
LYLANDILQNSKRQGNEFVQEFWKVLPGALKDIVSLGDDYGKGVVSRLVNIWEERRVFGS
RSKSLKDVMLSEEAPPPLDVSKKRFRGSKSAKRDSKSTKTKLSSGGVSEKIVSAFNLVRA
ENSNEETEMNKCKSAVRRIRKMEKDVEDACSTAKDPRKESLAKELEEEENILRQSVEKLK
SVEESRTSLVNHLREALREQESELENLQSQIQVAQEQTEEAQNMQKRLNNETPVNNNNGT
SGQSAKITPASIAAMAEMLTSSTNSSMIMHSVLSSFAAEATQTSGLTKSNTSDTNAFVVP
PNPQQYHIIPNPAASQQFPYGFGNIPLMPPGALPPPPPGTLPPHMMNNNNQPNAAQQQAQ
QGQSFHPPGMMYFGPPHHS*
>AT5G65180.1 | unknown protein
MSSPFSEEILIDNLAKLNSTQQSIQTLSQWCIVHRSEAELVVTTWEKQFHSTQIGQKVPL
LYLANDILQNSKRQGNEFVQEFWKVLPGALKDIVSLGDDYGKGVVSRLVNIWEERRVFGS
RSKSLKDVMLSEEAPPPLDVSKKRFRGSKSAKRDSKSTKTKLSSGGVSEKIVSAFNLVRA
ENSNEETEMNKCKSAVRRIRKMEKDVEDACSTAKDPRKESLAKELEEEENILRQSVEKLK
SVEESRTSLVNHLREALREQESELENLQSQIQVAQEQTEEAQNMQKRLNNETPVNNNNGT
SGQSAKITPASIAAMAEMLTSSTNSSMIMHSVLSSFAAEATQTSGLTKSNTSDTNAFVVP
PNPQQYHIIPNPAASQQFPYGFGNIPLMPPGALPPPPPGTLPPHMMNNNNQPNAAQQQAQ
QGQSFHPPGMMYFGPPHHS*
>AT5G65180.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown EXPRESSED IN 18 plant structures EXPRESSED DURING 9 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF618 (InterProIPR006903) Regulation of nuclear pre-mRNA protein (InterProIPR006569) ENTH/VHS (InterProIPR008942) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G100601) Has 3523 Blast hits to 3241 proteins in 333 species Archae - 25 Bacteria - 251 Metazoa - 1575 Fungi - 421 Plants - 142 Viruses - 17 Other Eukaryotes - 1092 (source NCBI BLink)
MLSEEAPPPLDVSKKRFRGSKSAKRDSKSTKTKLSSGGVSEKIVSAFNLVRAENSNEETE
MNKCKSAVRRIRKMEKDVEDACSTAKDPRKESLAKELEEEENILRQSVEKLKSVEESRTS
LVNHLREALREQESELENLQSQIQVAQEQTEEAQNMQKRLNNETPVNNNNGTSGQSAKIT
PASIAAMAEMLTSSTNSSMIMHSVLSSFAAEATQTSGLTKSNTSDTNAFVVPPNPQQYHI
IPNPAASQQFPYGFGNIPLMPPGALPPPPPGTLPPHMMNNNNQPNAAQQQAQQGQSFHPP
GMMYFGPPHHS*
>AT5G65180.2 | unknown protein
MLSEEAPPPLDVSKKRFRGSKSAKRDSKSTKTKLSSGGVSEKIVSAFNLVRAENSNEETE
MNKCKSAVRRIRKMEKDVEDACSTAKDPRKESLAKELEEEENILRQSVEKLKSVEESRTS
LVNHLREALREQESELENLQSQIQVAQEQTEEAQNMQKRLNNETPVNNNNGTSGQSAKIT
PASIAAMAEMLTSSTNSSMIMHSVLSSFAAEATQTSGLTKSNTSDTNAFVVPPNPQQYHI
IPNPAASQQFPYGFGNIPLMPPGALPPPPPGTLPPHMMNNNNQPNAAQQQAQQGQSFHPP
GMMYFGPPHHS*