>AT5G52640.1 |  ATHSP901 (HEAT SHOCK PROTEIN 901) ATP binding / unfolded protein binding 
MADVQMADAETFAFQAEINQLLSLIINTFYSNKEIFLRELISNSSDALDKIRFESLTDKS 
KLDGQPELFIRLVPDKSNKTLSIIDSGIGMTKADLVNNLGTIARSGTKEFMEALQAGADV 
SMIGQFGVGFYSAYLVAEKVVVTTKHNDDEQYVWESQAGGSFTVTRDVDGEPLGRGTKIT 
LFLKDDQLEYLEERRLKDLVKKHSEFISYPIYLWTEKTTEKEISDDEDEDEPKKENEGEV 
EEVDEEKEKDGKKKKKIKEVSHEWELINKQKPIWLRKPEEITKEEYAAFYKSLTNDWEDH 
LAVKHFSVEGQLEFKAILFVPKRAPFDLFDTRKKLNNIKLYVRRVFIMDNCEELIPEYLS 
FVKGVVDSDDLPLNISRETLQQNKILKVIRKNLVKKCIEMFNEIAENKEDYTKFYEAFSK 
NLKLGIHEDSQNRGKIADLLRYHSTKSGDEMTSFKDYVTRMKEGQKDIFYITGESKKAVE 
NSPFLERLKKRGYEVLYMVDAIDEYAVGQLKEYDGKKLVSATKEGLKLEDETEEEKKKRE 
EKKKSFENLCKTIKEILGDKVEKVVVSDRIVDSPCCLVTGEYGWTANMERIMKAQALRDS 
SMSGYMSSKKTMEINPDNGIMEELRKRAEADKNDKSVKDLVMLLYETALLTSGFSLDEPN 
TFAARIHRMLKLGLSIDEDENVEEDGDMPELEEDAAEESKMEEVD*
>AT2G22690.1 |  protein binding / zinc ion binding 
MELKNVKDAFDRVTKKQKLCYSKTHEVVDKMSQEIDKALKTIQEDNHESVVADLKKTFEE 
IAPINLLEASQKEINGVLTKYPKALDKTLNPDISTAYRNVKFDTHTVHQILAQFFYRQGM 
YDVGDCFISETGEVKPESSVTKAFMEMNMILEAMKERDLGPALKWVASNSDKLKEAKSDL 
ELKLHSLHFLEIAKDKTSEEAINYARKHFATYSADSCCFPEIQKLMCSLLWIRNLNKSPY 
SEFLSPVLWTNAAKELTRQYCILLGESPESPLSVTVAAGSQVLPTFLKYLNVLPEKRKEW 
QTMEQLLVPVELSEEYRFYSVFVCPVSKEHSSEDNPPMRLACGHVLCKQSINRMSRNGSR 
SFKCPYCPTDIDASQCKQLYF*
>AT2G22690.2 |  protein binding / zinc ion binding 
MELKNVKDAFDRVTKKQKLCYSKTHEVVDKMSQEIDKALKTIQEDNHESVVADLKKTFEE 
IAPINLLEASQKEINGVLTKYPKALDKTLNPDISTAYRNVKFDTHTVHQILAQFFYRQGM 
YDVGDCFISETGEVKPESSVTKAFMEMNMILEAMKERDLGPALKWVASNSDKLKEAKSDL 
ELKLHSLHFLEIAKDKTSEEAINYARKHFATYSADSCCFPEIQKLMCSLLWIRNLNKSPY 
SEFLSPVLWTNAAKELTRQYCILLGESPESPLSVTVAAGSQVLPTFLKYLNVLPEKRKEW 
QTMEQLLVPVELSEEYRFYSVFVCPVSKEHSSEDNPPMRLACGHVLCKQSINRMSRNGSR 
SFKCPYCPTDIDASQCKQLYF*
>AT2G13370.1 |  CHR5 (chromatin remodeling 5) ATP binding / DNA binding / chromatin binding / helicase/ nucleic acid binding 
MAFFRNYSNDTVSHNVLDENEERQNAATFQSSPLNEDVDGTYSERGFDMNMDVQYQSDPE 
PGCSIRQPNETAVDNVADPVDSHYQSSTKRLGVTGRWGSTFWKDCQPMGQREGSDPAKDS 
QSGYKEAYHSEDNHSNDRSEKLDSENENDNENEEEDNEMNKHQSGQADVPADEMLSDEYY 
EQDEDNQSDHVHYKGYSNPTNSRSLPKAGSAVHSNSRTSRAIHKNIHYSDSNHDHNGDAD 
MDYEEEEDEDDPEDADFEPYDAADDGGASKKHGQGWDVSDEDPESDEEIDLSDYEDDYGT 
KKPKVRQQSKGFRKSSAGLERKSFHVSSRQKRKTSYQDDDSEEDSENDNDEGFRSLARRG 
TTLRQNNGRSTNTIGQSSEVRSSTRSVRKVSYVESEDSEDIDDGKNRKNQKDDIEEEDAD 
VIEKVLWHQLKGMGEDVQTNNKSTVPVLVSQLFDTEPDWNEMEFLIKWKGQSHLHCQWKT 
LSDLQNLSGFKKVLNYTKKVTEEIRYRTALSREEIEVNDVSKEMDLDIIKQNSQVERIIA 
DRISKDGLGDVVPEYLVKWQGLSYAEATWEKDVDIAFAQVAIDEYKAREVSIAVQGKMVE 
QQRTKGKASLRKLDEQPEWLIGGTLRDYQLEGLNFLVNSWLNDTNVILADEMGLGKTVQS 
VSMLGFLQNTQQIPGPFLVVVPLSTLANWAKEFRKWLPGMNIIVYVGTRASREVCQQYEF 
YNEKKVGRPIKFNALLTTYEVVLKDKAVLSKIKWIYLMVDEAHRLKNSEAQLYTALLEFS 
TKNKLLITGTPLQNSVEELWALLHFLDPGKFKNKDEFVENYKNLSSFNESELANLHLELR 
PHILRRVIKDVEKSLPPKIERILRVEMSPLQKQYYKWILERNFHDLNKGVRGNQVSLLNI 
VVELKKCCNHPFLFESADHGYGGDINDNSKLDKIILSSGKLVILDKLLVRLRETKHRVLI 
FSQMVRMLDILAEYLSLRGFQFQRLDGSTKAELRQQAMDHFNAPASDDFCFLLSTRAGGL 
GINLATADTVVIFDSDWNPQNDLQAMSRAHRIGQQEVVNIYRFVTSKSVEEEILERAKRK 
MVLDHLVIQKLNAEGRLEKRETKKGSNFDKNELSAILRFGAEELFKEDKNDEESKKRLLS 
MDIDEILERAEQVEEKHTDETEHELLGAFKVANFCNAEDDGSFWSRWIKPDSVVTAEEAL 
APRAARNTKSYVDPSHPDRTSKRKKKGSEPPEHTERSQKRRKTEYFVPSTPLLEGTSAQV 
RGWSYGNLPKRDAQRFYRTVMKFGNHNQMACIAEEVGGVVEAAPEEAQVELFDALIDGCK 
ESVETGNFEPKGPVLDFFGVPVKANELLKRVQGLQLLSKRISRYNDPISQFRVLSYLKPS 
NWSKGCGWNQIDDARLLLGILYHGFGNWEKIRLDESLGLTKKIAPVELQHHETFLPRAPN 
LKERATALLEMELAAAGGKNTNAKASRKNSKKVKDNLINQFKAPARDRRGKSGPANVSLL 
STKDGPRKTQKAEPLVKEEGEMSDDGEVYEQFKEQKWMEWCEDVLADEIKTLGRLQRLQT 
TSADLPKEKVLFKIRRYLEILGRRIDAIVLEHEEDLYKQDRMTMRLWNYVSTFSNLSGDR 
LNQIYSKLKQEKEEEEGVGPSHLNGSRNFQRQQKFKTAGNSQGSQQVHKGIDTAKFEAWK 
RRRRTENDVQTERPTITNSNSLGILGPGPLDRSHRARQTGFPPR*
>AT2G38880.2 |  NF-YB1 (NUCLEAR FACTOR Y SUBUNIT B1) transcription factor 
MADTPSSPAGDGGESGGSVREQDRYLPIANISRIMKKALPPNGKIGKDAKDTVQECVSEF 
ISFITSEASDKCQKEKRKTVNGDDLLWAMATLGFEDYLEPLKIYLARYRELEGDNKGSGK 
SGDGSNRDAGGGVSGEEMPSW*
>AT2G38880.3 |  NF-YB1 (NUCLEAR FACTOR Y SUBUNIT B1) transcription factor 
MADTPSSPAGDGGESGGSVREQDRYLPIANISRIMKKALPPNGKIGKDAKDTVQECVSEF 
ISFITSEASDKCQKEKRKTVNGDDLLWAMATLGFEDYLEPLKIYLARYRELEGDNKGSGK 
SGDGSNRDAGGGVSGEEMPV*
>AT2G38880.1 |  NF-YB1 (NUCLEAR FACTOR Y SUBUNIT B1) transcription factor 
MADTPSSPAGDGGESGGSVREQDRYLPIANISRIMKKALPPNGKIGKDAKDTVQECVSEF 
ISFITSEASDKCQKEKRKTVNGDDLLWAMATLGFEDYLEPLKIYLARYRELEGDNKGSGK 
SGDGSNRDAGGGVSGEEMPSW*
>AT2G38880.4 |  NF-YB1 (NUCLEAR FACTOR Y SUBUNIT B1) transcription factor 
MADTPSSPAGDGGESGGSVREQDRYLPIANISRIMKKALPPNGKIGKDAKDTVQECVSEF 
ISFITSEASDKCQKEKRKTVNGDDLLWAMATLGFEDYLEPLKIYLARYREVW*
>AT2G38880.5 |  NF-YB1 (NUCLEAR FACTOR Y SUBUNIT B1) transcription factor 
MADTPSSPAGDGGESGGSVREQDRYLPIANISRIMKKALPPNGKIGKDAKDTVQECVSEF 
ISFITSEASDKCQKEKRKTVNGDDLLWAMATLGFEDYLEPLKIYLARYRELEGDNKGSGK 
SGDGSNRDAGGGVSGEEMPSW*
>AT2G38880.6 |  NF-YB1 (NUCLEAR FACTOR Y SUBUNIT B1) transcription factor 
MADTPSSPAGDGGESGGSVREQDRYLPIANISRIMKKALPPNGKIGKDAKDTVQECVSEF 
ISFITSEASDKCQKEKRKTVNGDDLLWAMATLGFEDYLEPLKIYLARYREVW*
>AT1G59900.1 |  AT-E1 ALPHA oxidoreductase acting on the aldehyde or oxo group of donors disulfide as acceptor / pyruvate dehydrogenase (acetyl-transferring) 
MALSRLSSRSNIITRPFSAAFSRLISTDTTPITIETSLPFTAHLCDPPSRSVESSSQELL 
DFFRTMALMRRMEIAADSLYKAKLIRGFCHLYDGQEAVAIGMEAAITKKDAIITAYRDHC 
IFLGRGGSLHEVFSELMGRQAGCSKGKGGSMHFYKKESSFYGGHGIVGAQVPLGCGIAFA 
QKYNKEEAVTFALYGDGAANQGQLFEALNISALWDLPAILVCENNHYGMGTAEWRAAKSP 
SYYKRGDYVPGLKVDGMDAFAVKQACKFAKQHALEKGPIILEMDTYRYHGHSMSDPGSTY 
RTRDEISGVRQERDPIERIKKLVLSHDLATEKELKDMEKEIRKEVDDAIAKAKDCPMPEP 
SELFTNVYVKGFGTESFGPDRKEVKASLP*
>AT1G06790.2 |  RNA polymerase Rpb7 N-terminal domain-containing protein 
MFYLSELEHSLRVPPHLLNLPLEDAIKSVLQNVFLDKVLADLGLCVSIYDIKSVEGGFVL 
PGDGAATYKVGLRIVVFRPFVGEVIAAKFKESDANGLRLTLGFFDDIYVPAPLMPKPNRC 
EPDPYNRKQMIWVWEYGEPKEDYIVDDACQIKFRVESISYPSVPTERAEDAKPFAPMVVT 
VSSFIKLIL*
>AT1G06790.1 |  RNA polymerase Rpb7 N-terminal domain-containing protein 
MFYLSELEHSLRVPPHLLNLPLEDAIKSVLQNVFLDKVLADLGLCVSIYDIKSVEGGFVL 
PGDGAATYKVGLRIVVFRPFVGEVIAAKFKESDANGLRLTLGFFDDIYVPAPLMPKPNRC 
EPDPYNRKQMIWVWEYGEPKEDYIVDDACQIKFRVESISYPSVPTERAEDAKPFAPMVVT 
GNMDDDGLGPVSWWDSYEQVDQEE*
>AT2G40110.1 |  yippee family protein 
MGRLFVVNLEGKIYSCKHCKTHLATYEDIISKSFHCKHGKAYLFNKVANVSIGETEERLM 
MTGKHTVADIFCVSCGSIVGWKYETAHEKNQKYKEGKSVLERFKISGPDGSNYWVSSHGR 
HIGGSDADDA*
>AT2G40110.2 |  yippee family protein 
MGRLFVVNLEGKIYSCKHCKTHLATYEDIISKSFHCKHGKAYLFNKVANVSIGETEERLM 
MTGKHTVADIFCVSCGSIVGWKYETAHEKNQKYKEGKSVLERFYFQHFRT*
>AT4G33100.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Mitochondrial distribution and morphology family 35/apoptosis (InterProIPR007918) Has 160 Blast hits to 160 proteins in 70 species Archae - 0 Bacteria - 0 Metazoa - 95 Fungi - 46 Plants - 13 Viruses - 0 Other Eukaryotes - 6 (source NCBI BLink) 
MGLLKKKDSTSARSSTSPCADLRNAYHNCFNKWYSEKFVKGQWDKEECVAEWKKYRDCLS 
ENLDGKLLTRILEVDGELNPTKQATDSKESSS*
>AT5G08560.1 |  transducin family protein / WD-40 repeat family protein 
MGVVEDTEPPLKRAKRLADEPNGFSANSSVRGSSVNSNSLGDLMARPLPSQGDDETIGSK 
GVIRKSEFVRIITRALYSLGYDKTGAMLEEESGISLHNSTIKLFLQQVKDGKWDQSVKTL 
HRIGFPDEKAVKAASFLLLEQKFLEFLKVEKIADALRTLRNEMAPLRINTKRVHELASSL 
ISPSSFISHTTSTPGKESVNSRSKVLEELQTLLPASVIIPEKRLECLVENSLHIQRDSCV 
FHNTLDSDLSLYSDHQCGKHQIPSQTAQILESHTDEVWFLQFSHNGKYLASSSKDQTAII 
WEISADGHISLKHTLVGHHKPVIAILWSPDDRQVLTCGAEEVIRRWDVDSGDCVHMYEKG 
GISPISCGWYPDGQGIIAGMTDRSICMWDLDGREKECWKGQRTQKVSDIAMTDDGKWLVS 
VCKDSVISLFDREATVERLIEEEDMITSFSLSNDNKYILVNLLNQEIRLWNIEGDPKIVS 
RYKGHKRSRFIIRSCFGGYKQAFIASGSEDSQVYIWHRSTGKLIVELPGHAGAVNCVSWS 
PTNLHMLASASDDGTIRIWGLDRINQQNQKKKLVQGSSSNGVIHRCNGN*
>AT5G08560.2 |  transducin family protein / WD-40 repeat family protein 
MGVVEDTEPPLKRAKRLADEPNGFSANSSVRGSSVNSNSLGDLMARPLPSQGDDETIGSK 
GVIRKSEFVRIITRALYSLGYDKTGAMLEEESGISLHNSTIKLFLQQVKDGKWDQSVKTL 
HRIGFPDEKAVKAASFLLLEQKFLEFLKVEKIADALRTLRNEMAPLRINTKRVHELASSL 
ISPSSFISHTTSTPGKESVNSRSKVLEELQTLLPASVIIPEKRLECLVENSLHIQRDSCV 
FHNTLDSDLSLYSDHQCGKHQIPSQTAQILESHTDEVWFLQFSHNGKYLASSSKDQTAII 
WEISADGHISLKHTLVGHHKPVIAILWSPDDRQVLTCGAEEVIRRWDVDSGDCVHMYEKG 
GISPISCGWYPDGQGIIAGMTDRSICMWDLDGREKECWKGQRTQKVSDIAMTDDGKWLVS 
VCKDSVISLFDREATVERLIEEEDMITSFSLSNDNKYILVNLLNQEIRLWNIEGDPKIVS 
RYKGHKRSRFIIRSCFGGYKQAFIASGSEDSQVYIWHRSTGKLIVELPGHAGAVNCVSWS 
PTNLHMLASASDDGTIRIWGLDRINQQNQKKKLVQGSSSNGVIHRCNGN*
>AT5G27970.1 |  binding 
MALVAALEADLRALSAEARRRYPAVKDGAEHAILKLRSSSSASDLSSNEDILRIFLMACG 
VRNTKLSVIGLSCLQKLISHDAVEPSSLKEILYTLKDAKQLSDAVFPYLQHSEMAEENIQ 
LKTLQTILIIFQSRLHPETEDNMVLGLSICLTLLDNNRPPSVYNTAAATFRQAVALIFDQ 
VVSAESLPMPKFGSSSQTARTGSVTGDLSQNINNSGPLEKDVIGGRLTIRDTLSETGKLG 
LRLLEDLTASAAGFASPNLFPSDDIASDQFRDFVTAQLEGEMVEPYFRRLVLRSVAHIIR 
LYSSSLITECEVQYAKLYLFILNFVLLTTKEQAHEMCSADDYDLCKFSLLMCVHPISVCA 
QVFLSMLVKATFLDLPLWHRILVLEILRGFCVEARTLRILFQNFDMHPKNTNVVESMVKA 
LARVVSSIQETSEESLAAVAGMFSSKAKGIEWILDNDASSAAVLVASEAHAITLAIEGLL 
GVVFTVATLTDEAVDVGELESPRYEHLPSSDYTGKTSLLCISMVDSLWLTILDAFSLILS 
RSQGEAIVLEILKGYQAFTQACGVLHAVEPLNSFLASLCKFTIVLPTDVERKSVVQSPVS 
KRSEVQVDLKDVIVLTPKNVQALRTLFNIAHRLHNVLGPSWVLVLETLAALDRAIHSPHA 
TTQEVATAVPKLTREPSRQYADFSILSSLNSQLFESSALMQVSSVKSLLSALHMLSHQSM 
TETSGSVSSASRVEPLWDQVVGHFLELAEHSNQNLRNMALDALDQSICAVLGSEQFGEDP 
ARSRDATLDVDSKSTEVKSVECAVLSSLRVLYFSAQKADVRVGSLKILLHVLERCGEKLY 
YSWSSILEMLRSVADASEKDVATLGFQSLRVIMSDGLPTLPEDCLHVCIDVTGAYSAQKT 
DLNISLTAIGLLWTLTDFVAKGLHHGSLVEKGSGFNNADSTPQQTNGEDGEKHMGSNSGK 
SDYEAPIQVVNHEKLLFLVFSLIQKLVDDERPEVRNSAVRTFFQILGSHGNKLSKSMWED 
CLWNYIFPMLDGASHKAATSSKDEWQGKEIGTRGGKAVHMLIHHSRNSAQKQWDETFVLV 
LGGIARLFRSYFPLLESLPNFWSGWESLLAFVKKSIFNGSKEVSLAAINCLQTAVVSHCV 
KGNLQLRYLNSVLDVYELVFQKSSSYTGDTAAKVKQEILHGLGELYVQSSKMFDDKMYMQ 
LLGIVDLAIKQAIINSENFETEYGHVPPVLRHVLEILPSLGPPEHLSSMWLILLREFLHY 
LPRVDSVLPNDEGEIQQNNTGSEVLEQKADASSETIPTTRITTNMFAEKLIPALIELLLQ 
APAVEKYILFPEVIQNLRRCMMTRRDNPDGSLWKVAAEGFNRLLVEDVKLCSVGGETELK 
ISKTARIRIWKEIGDVYDIFLVGYCGRALSSNSLPAATLKANETLEIALLNGLGDIILKS 
TVDAPREVLERLVSTLDRCASRTCSLPVETVELMPAHCSRFSLTCLQKLFSLSSSETENW 
HSTRAEVSKISITTLMARCEFILSRFLIDENNLGNRPIPTARLEEIIFTLQELYRLSIHP 
EVASVLPLQPYLKNVLREDNRDTRAHLLVLFPSLCEIVLSREMRVRELVQILLRAVATEL 
GLEKVSLSS*
>AT5G65180.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown EXPRESSED IN 18 plant structures EXPRESSED DURING 9 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF618 (InterProIPR006903) Regulation of nuclear pre-mRNA protein (InterProIPR006569) ENTH/VHS (InterProIPR008942) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G100601) Has 3523 Blast hits to 3241 proteins in 333 species Archae - 25 Bacteria - 251 Metazoa - 1575 Fungi - 421 Plants - 142 Viruses - 17 Other Eukaryotes - 1092 (source NCBI BLink) 
MSSPFSEEILIDNLAKLNSTQQSIQTLSQWCIVHRSEAELVVTTWEKQFHSTQIGQKVPL 
LYLANDILQNSKRQGNEFVQEFWKVLPGALKDIVSLGDDYGKGVVSRLVNIWEERRVFGS 
RSKSLKDVMLSEEAPPPLDVSKKRFRGSKSAKRDSKSTKTKLSSGGVSEKIVSAFNLVRA 
ENSNEETEMNKCKSAVRRIRKMEKDVEDACSTAKDPRKESLAKELEEEENILRQSVEKLK 
SVEESRTSLVNHLREALREQESELENLQSQIQVAQEQTEEAQNMQKRLNNETPVNNNNGT 
SGQSAKITPASIAAMAEMLTSSTNSSMIMHSVLSSFAAEATQTSGLTKSNTSDTNAFVVP 
PNPQQYHIIPNPAASQQFPYGFGNIPLMPPGALPPPPPGTLPPHMMNNNNQPNAAQQQAQ 
QGQSFHPPGMMYFGPPHHS*
>AT5G65180.1 |  unknown protein 
MSSPFSEEILIDNLAKLNSTQQSIQTLSQWCIVHRSEAELVVTTWEKQFHSTQIGQKVPL 
LYLANDILQNSKRQGNEFVQEFWKVLPGALKDIVSLGDDYGKGVVSRLVNIWEERRVFGS 
RSKSLKDVMLSEEAPPPLDVSKKRFRGSKSAKRDSKSTKTKLSSGGVSEKIVSAFNLVRA 
ENSNEETEMNKCKSAVRRIRKMEKDVEDACSTAKDPRKESLAKELEEEENILRQSVEKLK 
SVEESRTSLVNHLREALREQESELENLQSQIQVAQEQTEEAQNMQKRLNNETPVNNNNGT 
SGQSAKITPASIAAMAEMLTSSTNSSMIMHSVLSSFAAEATQTSGLTKSNTSDTNAFVVP 
PNPQQYHIIPNPAASQQFPYGFGNIPLMPPGALPPPPPGTLPPHMMNNNNQPNAAQQQAQ 
QGQSFHPPGMMYFGPPHHS*
>AT5G65180.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown EXPRESSED IN 18 plant structures EXPRESSED DURING 9 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF618 (InterProIPR006903) Regulation of nuclear pre-mRNA protein (InterProIPR006569) ENTH/VHS (InterProIPR008942) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G100601) Has 3523 Blast hits to 3241 proteins in 333 species Archae - 25 Bacteria - 251 Metazoa - 1575 Fungi - 421 Plants - 142 Viruses - 17 Other Eukaryotes - 1092 (source NCBI BLink) 
MLSEEAPPPLDVSKKRFRGSKSAKRDSKSTKTKLSSGGVSEKIVSAFNLVRAENSNEETE 
MNKCKSAVRRIRKMEKDVEDACSTAKDPRKESLAKELEEEENILRQSVEKLKSVEESRTS 
LVNHLREALREQESELENLQSQIQVAQEQTEEAQNMQKRLNNETPVNNNNGTSGQSAKIT 
PASIAAMAEMLTSSTNSSMIMHSVLSSFAAEATQTSGLTKSNTSDTNAFVVPPNPQQYHI 
IPNPAASQQFPYGFGNIPLMPPGALPPPPPGTLPPHMMNNNNQPNAAQQQAQQGQSFHPP 
GMMYFGPPHHS*
>AT5G65180.2 |  unknown protein 
MLSEEAPPPLDVSKKRFRGSKSAKRDSKSTKTKLSSGGVSEKIVSAFNLVRAENSNEETE 
MNKCKSAVRRIRKMEKDVEDACSTAKDPRKESLAKELEEEENILRQSVEKLKSVEESRTS 
LVNHLREALREQESELENLQSQIQVAQEQTEEAQNMQKRLNNETPVNNNNGTSGQSAKIT 
PASIAAMAEMLTSSTNSSMIMHSVLSSFAAEATQTSGLTKSNTSDTNAFVVPPNPQQYHI 
IPNPAASQQFPYGFGNIPLMPPGALPPPPPGTLPPHMMNNNNQPNAAQQQAQQGQSFHPP 
GMMYFGPPHHS*
>AT3G55070.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN chloroplast EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is protein binding / zinc ion binding (TAIRAT4G378801) Has 669 Blast hits to 649 proteins in 147 species Archae - 0 Bacteria - 0 Metazoa - 334 Fungi - 200 Plants - 87 Viruses - 0 Other Eukaryotes - 48 (source NCBI BLink) 
MEIDSATNGNSDTVMTESAATITPSPVVVSSSRSNQFTESLKLEHQLLRVPFEHYKKTIR 
TNHRSFEKEVSTIVNGVGELADSDWSKDDTVSRLTCLVTRLQGLKRKLEEGSNVENLQAQ 
RCRARIDHLDSVDVENITEWNNTKLKRILVDYMLRMSYFETATKLSESSNIMDLVDIDIF 
REAKKVIDALKNREVASALTWCADNKTRLKKSKSKFEFQLRLQEFIELVRVDTAESYKKA 
IQYARKHLASWGTTHMKELQHVLATLAFKSTTECSKYKVLFELRQWDVLVDQFKQEFCKL 
YGMTMEPLLNIYLQAGLSALKTPYGLEEGCTKEDPLSQENFRKLALPLPFSKQHHSKLVC 
YISKELMDTENPPQVLPNGYVYSTKALKEMAEKNGGKITCPRTGLVCNYTELVKAYIS*
>AT3G55070.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN chloroplast EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is protein binding / zinc ion binding (TAIRAT4G378801) Has 611 Blast hits to 608 proteins in 143 species Archae - 0 Bacteria - 0 Metazoa - 313 Fungi - 169 Plants - 83 Viruses - 0 Other Eukaryotes - 46 (source NCBI BLink) 
MEIDSATNGNSDTVMTESAATITPSPVVVSSSRSNQFTESLKLEHQLLRVPFEHYKKTIR 
TNHRSFEKEVSTIVNGVGELADSDWSKDDTVSRLTCLVTRLQGLKRKLEEGSNVENLQAQ 
RCRARIDHLDSVDVENITEWNNTKLKRILVDYMLRMSYFETATKLSESSNIMDLVDIDIF 
REAKKVIDALKNREVASALTWCADNKTRLKKSKSKFEFQLRLQEFIELVRVDTAESYKKA 
IQYARKHLASWGTTHMKELQHVLATLAFKSTTECSKYKVLFELRQWDVLVDQFKQEFCKL 
YGMTMEPLLNIYLQAGLSALKTPYGLEEGCTKEDPLSQENFRKLALPLPFSKQHHSKLVC 
YISKELMDTENPPQVLPNGYVYSTKALKEMAEKNGGKITCPRTGLVCNYTELVKAYIS*
>AT3G55070.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN chloroplast EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is protein binding / zinc ion binding (TAIRAT4G378801) Has 669 Blast hits to 649 proteins in 147 species Archae - 0 Bacteria - 0 Metazoa - 334 Fungi - 200 Plants - 87 Viruses - 0 Other Eukaryotes - 48 (source NCBI BLink) 
MEIDSATNGNSDTVMTESAATITPSPVVVSSSRSNQFTESLKLEHQLLRVPFEHYKKTIR 
TNHRSFEKEVSTIVNGVGELADSDWSKDDTVSRLTCLVTRLQGLKRKLEEGSNVENLQAQ 
RCRARIDHLDSVDVENITEWNNTKLKRILVDYMLRMSYFETATKLSESSNIMDLVDIDIF 
REAKKVIDALKNREVASALTWCADNKTRLKKSKSKFEFQLRLQEFIELVRVDTAESYKKA 
IQYARKHLASWGTTHMKELQHVLATLAFKSTTECSKYKVLFELRQWDVLVDQFKQEFCKL 
YGMTMEPLLNIYLQAGLSALKTP*
>AT3G55070.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN chloroplast EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is protein binding / zinc ion binding (TAIRAT4G378801) Has 611 Blast hits to 608 proteins in 143 species Archae - 0 Bacteria - 0 Metazoa - 313 Fungi - 169 Plants - 83 Viruses - 0 Other Eukaryotes - 46 (source NCBI BLink) 
MEIDSATNGNSDTVMTESAATITPSPVVVSSSRSNQFTESLKLEHQLLRVPFEHYKKTIR 
TNHRSFEKEVSTIVNGVGELADSDWSKDDTVSRLTCLVTRLQGLKRKLEEGSNVENLQAQ 
RCRARIDHLDSVDVENITEWNNTKLKRILVDYMLRMSYFETATKLSESSNIMDLVDIDIF 
REAKKVIDALKNREVASALTWCADNKTRLKKSKSKFEFQLRLQEFIELVRVDTAESYKKA 
IQYARKHLASWGTTHMKELQHVLATLAFKSTTECSKYKVLFELRQWDVLVDQFKQEFCKL 
YGMTMEPLLNIYLQAGLSALKTP*
>AT5G67320.1 |  HOS15 (high expression of osmotically responsive genes 15) 
MSSLTSVELNFLVFRYLQESGFTHAAFTLGYEAGINKSNIDGNMVPPGALIKFVQKGLQY 
MEMEANLSNSEVDIDEDFSFFQPLDLISKDVKELQDMLREKKRKERDMEKERDRSKENDK 
GVEREHEGDRNRAKEKDRHEKQKEREREREKLEREKEREREKIEREKEREREKMEREIFE 
REKDRLKLEKEREIEREREREKIEREKSHEKQLGDADREMVIDQTDKEIAGDGSTGAEPM 
DIVMTPTSQTSHIPNSDVRILEGHTSEVCACAWSPSASLLASGSGDATARIWSIPEGSFK 
AVHTGRNINALILKHAKGKSNEKSKDVTTLDWNGEGTLLATGSCDGQARIWTLNGELIST 
LSKHKGPIFSLKWNKKGDYLLTGSVDRTAVVWDVKAEEWKQQFEFHSGPTLDVDWRNNVS 
FATSSTDSMIYLCKIGETRPAKTFTGHQGEVNCVKWDPTGSLLASCSDDSTAKIWNIKQS 
TFVHDLREHTKEIYTIRWSPTGPGTNNPNKQLTLASASFDSTVKLWDAELGKMLCSFNGH 
REPVYSLAFSPNGEYIASGSLDKSIHIWSIKEGKIVKTYTGNGGIFEVCWNKEGNKIAAC 
FADNSVCVLDFRM*
>AT4G27740.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown CONTAINS InterPro DOMAIN/s Yippee-like protein (InterProIPR004910) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G277451) Has 684 Blast hits to 682 proteins in 148 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 132 Plants - 111 Viruses - 0 Other Eukaryotes - 52 (source NCBI BLink) 
MAANKTLPTYFCRNCENPLALGEDLISKKFVGASGPAFMFSHAMNVVVGPKIGRKLITGS 
YVVADVMCSKCGETLGWKYVETFDLKQRYKEGMFVIEKLKLTKRY*
>AT4G37880.1 |  protein binding / zinc ion binding 
MELKSIKDAFDRVATKQKLSYSKTNEIVHMLSQEIDKALSILEETPSSDTMLLDHRSILA 
DVKKVFMEIAPITQLEATEKELHAALTKYPKVLEKQLNPDISKAYRHNVEFDTHIVNQII 
ANFFYRQGMFDIGDCFVAETGESECSTRQSFVEMYRILEAMKRRDLEPALNWAVSNSDKL 
KEARSDLEMKLHSLHFLEIARGKNSKEAIDYARKHIATFADSCLPEIQKLMCSLLWNRKL 
DKSPYSEFLSPALWNNAVKELTRQYCNLLGESSESPLSITVTAGTQALPVLLKYMNVVMA 
NKKLDWQTMEQLPVDAQLSEEFQFHSVFVCPVSKEQSSDDNPPMMMSCGHVLCKQTINKM 
SKNGSKSSFKCPYCPTDVDISRCRQLHF*
>AT5G09630.1 |  protein binding / zinc ion binding 
MDVTGTVTVRDAFDRVSKKQKLYHSVTQDVIDLVCDGIQDTLTRIQLGNDDGVEPESVLT 
ELRRKLDALLPIIQLQKSHKETKWSLSKLVKLLEVSYHPDISLACFSVDFDINLVNKILI 
HHCYREGLFDVGDCLVKEAGREEETEVRSQFLEFHQIVDSLKLRNIEPAMRWIFANRGKL 
KQKSSKLEFKLLSLKYCDILREGKSDDALEYARTHFTQYPLHFKEIQKLITCLLWIGNFE 
KSPYAEIVSPSCWDKVTKELIMEYHHLLDQPINSPLKVALSAGYESLPSLLKLVHLMALT 
KQEWQAMKQLPVPLELGNEYKFHSAFVCPVSRDQSSEENPPMQLPCGHVISKQSMMRLSK 
NCAFRTFKCPYCPAETLASACRQLYF*