>AT5G52640.1 |  ATHSP901 (HEAT SHOCK PROTEIN 901) ATP binding / unfolded protein binding 
MADVQMADAETFAFQAEINQLLSLIINTFYSNKEIFLRELISNSSDALDKIRFESLTDKS 
KLDGQPELFIRLVPDKSNKTLSIIDSGIGMTKADLVNNLGTIARSGTKEFMEALQAGADV 
SMIGQFGVGFYSAYLVAEKVVVTTKHNDDEQYVWESQAGGSFTVTRDVDGEPLGRGTKIT 
LFLKDDQLEYLEERRLKDLVKKHSEFISYPIYLWTEKTTEKEISDDEDEDEPKKENEGEV 
EEVDEEKEKDGKKKKKIKEVSHEWELINKQKPIWLRKPEEITKEEYAAFYKSLTNDWEDH 
LAVKHFSVEGQLEFKAILFVPKRAPFDLFDTRKKLNNIKLYVRRVFIMDNCEELIPEYLS 
FVKGVVDSDDLPLNISRETLQQNKILKVIRKNLVKKCIEMFNEIAENKEDYTKFYEAFSK 
NLKLGIHEDSQNRGKIADLLRYHSTKSGDEMTSFKDYVTRMKEGQKDIFYITGESKKAVE 
NSPFLERLKKRGYEVLYMVDAIDEYAVGQLKEYDGKKLVSATKEGLKLEDETEEEKKKRE 
EKKKSFENLCKTIKEILGDKVEKVVVSDRIVDSPCCLVTGEYGWTANMERIMKAQALRDS 
SMSGYMSSKKTMEINPDNGIMEELRKRAEADKNDKSVKDLVMLLYETALLTSGFSLDEPN 
TFAARIHRMLKLGLSIDEDENVEEDGDMPELEEDAAEESKMEEVD*
>AT3G55080.1 |  SET domain-containing protein 
MLFCISTVKLFGFQQRRNVSSLAKRFSLAGKLTLELQTQASLDNNFLPWLERIAGAKITN 
TLSIGKSTYGRSLFASKVIYAGDCMLKVPFNAQITPDELPSDIRVLLSNEVGNIGMLAAV 
LIREKKMGQKSRWVPYISRLPQPAEMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDF 
SFVAQAFKQHCPIVTERPDLEDFMYAYALVGSRAWENSKRISLIPFADFMNHDGLSASIV 
LRDEDNQLSEVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTFPYNIHDEVQIQMDVPND 
DPLRNMKLGLLQTHHTRTVKDINIFHSSCDTFTIKEVKSAIGKGKGIPQSLRAFARVLCC 
IIPQELNDLSKEAAQNDGRLARLPFKDGNRELEAHKILLSHINRLIEDHSVCIKEMEECY 
FVSQRFAVRRQMARDLLYGELRVLRSAAEWLNHYCTTLLSETM*
>AT3G55080.1 |  SET domain-containing protein 
MLFCISTVKLFGFQQRRNVSSLAKRFSLAGKLTLELQTQASLDNNFLPWLERIAGAKITN 
TLSIGKSTYGRSLFASKVIYAGDCMLKVPFNAQITPDELPSDIRVLLSNEVGNIGMLAAV 
LIREKKMGQKSRWVPYISRLPQPAEMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDF 
SFVAQAFKQHCPIVTERPDLEDFMYAYALVGSRAWENSKRISLIPFADFMNHDGLSASIV 
LRDEDNQLSEVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTFPYNIHDEVQIQMDVPND 
DPLRNMKLGLLQTHHTRTVKDINIFHSSCDTFTIKEVKSAIGKGKGIPQSLRAFARVLCC 
IIPQELNDLSKEAAQNDGRLARLPFKDGNRELEAHKILLSHINRLIEDHSVCIKEMEECY 
FVSQRFAVRRQMARDLLYGELRVLRSAAEWLNHYCTTLLSETM*
>AT3G55080.2 |  SET domain-containing protein 
MLAAVLIREKKMGQKSRWVPYISRLPQPAEMHSSIFWGEDELSMIRCSAVHQETVKQKAQ 
IEKDFSFVAQAFKQHCPIVTERPDLEDFMYAYALVGSRAWENSKRISLIPFADFMNHDGL 
SASIVLRDEDNQLSEFSTLQVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTFPYNIHDE 
VQIQMDVPNDDPLRNMKLGLLQTHHTRTVKDINIFHSSCDTFTIKEVKSAIGKGKGIPQS 
LRAFARVLCCIIPQELNDLSKEAAQNDGRLARLPFKDGNRELEAHKILLSHINRLIEDHS 
VCIKEMEECYFVSQRFAVRRQMARDLLYGELRVLRSAAEWLNHYCTTLLSETM*
>AT3G55080.2 |  SET domain-containing protein 
MLAAVLIREKKMGQKSRWVPYISRLPQPAEMHSSIFWGEDELSMIRCSAVHQETVKQKAQ 
IEKDFSFVAQAFKQHCPIVTERPDLEDFMYAYALVGSRAWENSKRISLIPFADFMNHDGL 
SASIVLRDEDNQLSEFSTLQVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTFPYNIHDE 
VQIQMDVPNDDPLRNMKLGLLQTHHTRTVKDINIFHSSCDTFTIKEVKSAIGKGKGIPQS 
LRAFARVLCCIIPQELNDLSKEAAQNDGRLARLPFKDGNRELEAHKILLSHINRLIEDHS 
VCIKEMEECYFVSQRFAVRRQMARDLLYGELRVLRSAAEWLNHYCTTLLSETM*
>AT1G02680.1 |  TAF13 (TBP-ASSOCIATED FACTOR 13) DNA binding / RNA polymerase II transcription factor 
MSNTPAAAASSSSKSKAAGTSQPQEKRKTLFQKELQHMMYGFGDEQNPLPESVALVEDIV 
VEYVTDLTHKAQEIGSKRGRLLVDDFLYLIRKDLPKLNRCRELLAMQEELKQARKAFDVD 
EKELVD*
>AT3G22142.1 |  structural constituent of cell wall 
MGFRTRNLSFLILLLLNFFAATYARECSPPKPSPKPHKPPKHSVVPPKPPAVKPHPHPKP 
PTIKPPPPKRHPHPKPPTVKPHPHPKPPTKPHPHPKPPTKPHPHPKPPTIKPPPHPKPRP 
HPKPPNVKPHPHPKPPTKPHPHPKPPTKHHPHPKPPTIKPPPKPPSVKPPPSTPKPPTTN 
PPPSTPQPPTHKPPPCTPTPPVASPPMATPPTQMPPIATPPIAKSPVATPPIATPPTATP 
PITIPPVATPPITTPPIANPPIIMPPIATPPVAAPPITNPPISKPPVTTPPTTTPPIAKP 
PIATPPISTPPAATPPAATPPITTLPPAKPPVAISPIVTPPVTPIAQPPVATPPTATPPV 
ATPPIATPPTSKSPISTPPISESPVATPPTATSPIKTPPPAKPPVATPPIAKSPIATPPT 
ATPPVATPPIEKPPVATPPTTTPPTATPPVAKPPVETPPIATPPTAKPPISTPPISKPPV 
ATPPAATPPITTPTPVKPPVATPPLAIPPVAKPPVVTPPTATPPIATPPIAKSPVATPPT 
ATPPVATPPIAKPPVVTPPTTTPPTATPPVAKPPVATPPIATPPTAKPPISTPPISKSPV 
ATPPAATPPITTPPPAKPPVATPPIATPPIAKPPVATPPTATPPIATSPVAKPPVATPPI 
KTPPPAKPPVAIPPIATPPVAKPPVATPPTATPPIATPPIATPPVVTPPTATPPVATPPI 
AKPPTTIPPTATPPVAMPPIATPPTAKPPIATPPIAIPPVAKPPVVTPPTATPPIATPPI 
AKSPVATPPTATPPVATPPIAKPPVATPPTTAPPTATPPVAKPPVATPPIATPPTAKPPI 
LTPPISKPPVATPPAATPPITTPPPAKPPVATPPIATPPIAKPPVATPPTATPPIATSPV 
AKPPVAIPPIKTPPPAKPPVAIPPIATPPVAKPPVATPPTATPPIATSPIATPPVVTPPT 
ATSPVATPPIAKPPTTTPPTATPPVAMPPIATPPTAKPPVATPPIANPPVEKPPVATPPI 
AKPPTVLPPIAKPPVETSPTATPPTATPPVAIPPVVKPPVAIPPITKPPVATPPVTNPPT 
AMPPIVTPPPIVTPPIAKSPIATPPVSTPPIAKPPIATPPVATTPIAKPPIATPPTANPP 
VANPPIAKSPIAKPPIATPPTAMPSIATPPIGKPPVATPPMAKPPVASPPIATPPIIKPP 
VATPPITKPPVATPPVATPPIAKPPVATSPIETPPVAKPPVTTPPVATPPIVKPPIVTPP 
IATPPIAKSPIAPPPIGTPPIAKPPVATPPTATPPVATSPIAKPPVATPPPATPPVAKPP 
VATPPTVTPPVATPPIAKPPGARPPVATPPVATPPIAKSPVATPPMTKPPVASSPIATPP 
IAKTPIATPPTTMPKTCPIDTLKLGSCVDLLGGLVHIGIGKSAKEKCCPVVEGLVDLDAA 
VCLCTTIKAKLLNIDVILPIALEVLLNCGKNPPPGFKCPA*
>AT5G65180.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown EXPRESSED IN 18 plant structures EXPRESSED DURING 9 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF618 (InterProIPR006903) Regulation of nuclear pre-mRNA protein (InterProIPR006569) ENTH/VHS (InterProIPR008942) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G100601) Has 3523 Blast hits to 3241 proteins in 333 species Archae - 25 Bacteria - 251 Metazoa - 1575 Fungi - 421 Plants - 142 Viruses - 17 Other Eukaryotes - 1092 (source NCBI BLink) 
MSSPFSEEILIDNLAKLNSTQQSIQTLSQWCIVHRSEAELVVTTWEKQFHSTQIGQKVPL 
LYLANDILQNSKRQGNEFVQEFWKVLPGALKDIVSLGDDYGKGVVSRLVNIWEERRVFGS 
RSKSLKDVMLSEEAPPPLDVSKKRFRGSKSAKRDSKSTKTKLSSGGVSEKIVSAFNLVRA 
ENSNEETEMNKCKSAVRRIRKMEKDVEDACSTAKDPRKESLAKELEEEENILRQSVEKLK 
SVEESRTSLVNHLREALREQESELENLQSQIQVAQEQTEEAQNMQKRLNNETPVNNNNGT 
SGQSAKITPASIAAMAEMLTSSTNSSMIMHSVLSSFAAEATQTSGLTKSNTSDTNAFVVP 
PNPQQYHIIPNPAASQQFPYGFGNIPLMPPGALPPPPPGTLPPHMMNNNNQPNAAQQQAQ 
QGQSFHPPGMMYFGPPHHS*
>AT5G65180.1 |  unknown protein 
MSSPFSEEILIDNLAKLNSTQQSIQTLSQWCIVHRSEAELVVTTWEKQFHSTQIGQKVPL 
LYLANDILQNSKRQGNEFVQEFWKVLPGALKDIVSLGDDYGKGVVSRLVNIWEERRVFGS 
RSKSLKDVMLSEEAPPPLDVSKKRFRGSKSAKRDSKSTKTKLSSGGVSEKIVSAFNLVRA 
ENSNEETEMNKCKSAVRRIRKMEKDVEDACSTAKDPRKESLAKELEEEENILRQSVEKLK 
SVEESRTSLVNHLREALREQESELENLQSQIQVAQEQTEEAQNMQKRLNNETPVNNNNGT 
SGQSAKITPASIAAMAEMLTSSTNSSMIMHSVLSSFAAEATQTSGLTKSNTSDTNAFVVP 
PNPQQYHIIPNPAASQQFPYGFGNIPLMPPGALPPPPPGTLPPHMMNNNNQPNAAQQQAQ 
QGQSFHPPGMMYFGPPHHS*
>AT5G65180.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown EXPRESSED IN 18 plant structures EXPRESSED DURING 9 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF618 (InterProIPR006903) Regulation of nuclear pre-mRNA protein (InterProIPR006569) ENTH/VHS (InterProIPR008942) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G100601) Has 3523 Blast hits to 3241 proteins in 333 species Archae - 25 Bacteria - 251 Metazoa - 1575 Fungi - 421 Plants - 142 Viruses - 17 Other Eukaryotes - 1092 (source NCBI BLink) 
MLSEEAPPPLDVSKKRFRGSKSAKRDSKSTKTKLSSGGVSEKIVSAFNLVRAENSNEETE 
MNKCKSAVRRIRKMEKDVEDACSTAKDPRKESLAKELEEEENILRQSVEKLKSVEESRTS 
LVNHLREALREQESELENLQSQIQVAQEQTEEAQNMQKRLNNETPVNNNNGTSGQSAKIT 
PASIAAMAEMLTSSTNSSMIMHSVLSSFAAEATQTSGLTKSNTSDTNAFVVPPNPQQYHI 
IPNPAASQQFPYGFGNIPLMPPGALPPPPPGTLPPHMMNNNNQPNAAQQQAQQGQSFHPP 
GMMYFGPPHHS*
>AT5G65180.2 |  unknown protein 
MLSEEAPPPLDVSKKRFRGSKSAKRDSKSTKTKLSSGGVSEKIVSAFNLVRAENSNEETE 
MNKCKSAVRRIRKMEKDVEDACSTAKDPRKESLAKELEEEENILRQSVEKLKSVEESRTS 
LVNHLREALREQESELENLQSQIQVAQEQTEEAQNMQKRLNNETPVNNNNGTSGQSAKIT 
PASIAAMAEMLTSSTNSSMIMHSVLSSFAAEATQTSGLTKSNTSDTNAFVVPPNPQQYHI 
IPNPAASQQFPYGFGNIPLMPPGALPPPPPGTLPPHMMNNNNQPNAAQQQAQQGQSFHPP 
GMMYFGPPHHS*