Analysis of the Kluyveromyces lactis Heat Shock Factor: A Representative of the Helix-Turn-Helix Transcription Factor Family |
|
SUMMARY DISEASE FAMILY FUNCTION STRUCTURE PROPERTIES SEQUENCE-FUNCTION STRUCTURE-FUNCTION EVOLUTION STRUCTURE COMPARISON |
EvolutionTo determine how the sequence of the Heat Shock Factor protein has evolved over time I discovered protein sequences that are closely homologous to HSF using two different search tools, one that aligns proteins based on local sequence similarities (BLAST) and one that aligns sequences based on global sequence similarities (FASTA). After determining which protein sequences are the most related to HSF, I searched for sequences which are less closely related using an iterated search program that takes into account areas of the subject protein with high homology and looks for other proteins containing related domains (Psi-BLAST).BLAST results
Protein E value FASTA resultsFASTA aligns sequences based on global sequence similarities. The top ten proteins that share homology with Kluyveromyces lactis HSF are shown in order of significance. Protein E value Psi-BLAST resultsPsi-BLAST is an iterated alignment program which is capable of uncovering distantly related homologs. The top five proteins that share homology with Kluyveromyces lactis HSF but are not considered to be Heat Shock Factors are shown in order of significance. Protein E value It's interesting that both the local and global search programs reported almost all of the same sequences but in a different order. This shows that local and global similarities differ some what between homologous sequences and that both types of searches agree (at least for the most part) on what sequences are the closest relatives. The Psi-BLAST search revealed three bacterial proteins which may be ancestral to the heat shock factor in yeast and showed that a portion of pherophorin proteins in plants may be distantly related to Heat Shock Factor. After determining which protein sequences are the most related to Kluyveromyces lactis HSF I decided to align several so that conserved residues within the sequence of all HSF homologs could be found. Below is a multiple sequence alignment of the two most conserved regions of Heat Shock Factor, the "winged" helix-turn-helix DNA binding domain and the trimerization domain. These two domains are highly conserved in all HSF homologs suggesting that DNA binding and multimerization are two of the most important functions of HSF.Sequence alignment of the DNA binding domains from several HSF homologs
Sequence alignment of the trimerization domains from several HSF homologs
Based on the multiple sequence alignment of HSF homologs, I was able to generate a rooted phylogenetic tree of the HSF sequences using a neighbor joining method that determines the most parsimonious linkage by finding the tree which requires the fewest and smallest branches. The image below shows that this method of tree building grouped together specific homologs. For instance all three of the yeast homolog where placed on one branch which is linked to a second branch containing all of the plant homologs. These groupings suggest that homologs in the same cluster are more closely related to one another than to any of the other homologs in the tree. Also, the tree shows what the closest ancestor of any two homologs is and how many times a "novel" homolog has arisen. For instance, the linkages between the vertebrate HSF homologs suggest that homolog 4 was the first to split off from the last common ancestor followed by homolog 2 and finally 3. Divergence from the common ancestor may suggest a novel functional role for the new homologs.
|