Gene Ther Mol Biol Vol 9, 229-246, 2005

 

Towards understanding the epigenetics of transcription by chromatin structure and the nuclear matrix

Review Article

 

Rui Pires Martins1 and Stephen A. Krawetz1, 2, *

1Center for Molecular Medicine and Genetics;

2Department of Obstetrics and Gynecology, Institute for Scientific Computing; Wayne State University School of Medicine, Detroit, MI 48201

__________________________________________________________________________________

*Correspondence: Stephen A. Krawetz, Ph.D., Charlotte B. Failing Professor of Fetal Therapy and Diagnosis, Department of Obstetrics and Gynecology, Center for Molecular Medicine and Genetics, Institute for Scientific Computing, Director, WSU Bioinformatics Node Michigan Life Sciences Corridor, Director, Center of Excellence: Paternal Impact of Toxicological Exposure, Wayne State University School of Medicine, 253 C.S. Mott Center for Human Growth and Development, 275 E. Hancock, Detroit, MI, 48201; e-mail: steve@compbio.med.wayne.edu

Key words: potentiation; epigenetics; DNA methylation; histone modification; RNA-mediated silencing, nuclear matrix; gene regulation; epigenetic therapy

Abbreviations: Differentially Methylated Region, (DMR); DNA MethylTransferases, (DNMTs); heterochromatin protein 1, (HP1); Histone AcetylTranserases, (HATs); Histone De-ACetylases, (HDACs); histone MethylTransferases, (MTases); Imitation SWItch, (ISWI); Methyl-CpG Binding Domain, (MBD); Nucleosome Remodelling and histone De-acetylases, (NuRDs); polycomb group, (PcG); RNA interference, (RNAi); RNA-dependent RNA Poymerase-based, (RdRP); RNA-induced Initiation of Transcriptional gene Silencing complex, (RITS); RNA-Induced Silencing Complex, (RISC); SWItch/Sucrose Non-Fermenting helicases, (SWI/SNF)

 

Received: 2 July 2005; Accepted: 16 September 2005; electronically published: September 2005

 

Summary

The eukaryotic nucleus houses a significant amount of information that is carefully ordered to ensure that genes can be transcribed as needed throughout development and differentiation. The genome is partitioned into regions containing functional transcription units, providing the means for the cell to selectively activate some, while keeping other regions of the genome silent. Over the last quarter of a century the structure of chromatin and how it is influenced by epigenetics has come into the forefront of modern biology. However, it has thus far failed to identify the mechanism by which individual genes or domains are selected for expression. Through covalent and structural modification of the DNA and chromatin proteins, epigenetics maintains both active and silent chromatin states. This is the “other” genetic code, often superseding that dictated by the nucleotide sequence. The nuclear matrix is rich in many of the factors that govern nuclear processes. It includes a host of unknown factors that may provide our first insight into the structural mechanism responsible for the genetic selectivity of a differentiating cell. This review will consider the nuclear matrix as an integral component of the epigenetic mechanism.

 


I. Introduction

Recent data mining of the 3.27 billion nucleotide base pairs of DNA that comprise the human genome, estimate our complement of genes above 24,000, coding for at least 35,000 transcripts (EMBL-EBI Ensembl, 2005). This significant amount of information has to be carefully ordered in the nucleus to ensure that genes can be transcribed as needed throughout development and differentiation. The majority of cell and tissue specific gene activity is controlled by DNA-sequence dependent forms of regulation mediated through molecular interactions at gene promoters and enhancers. There are additional mechanisms that modulate, if not supersede, transcriptional and post-transcriptional forms of regulation with a less-direct relationship to the DNA sequence.

 Mechanisms that control the outcome of heritable gene activity without manipulating, or relying on the DNA sequence, are termed “epigenetic” (Russo et al, 1996). These forms of regulation shift the focus from enhancer/promoter interactions to those that generally impact the chromatin structure of a locus. They address whether a locus is in an “open” conformation enabling trans-acting factor access as necessary for transcription. A gene locus or chromatin domain that has acquired an “open” transcriptionally permissive conformation is “potentiated” (Kramer et al, 1998).

Epigenetic forms of regulation were first suggested in the late 1950s when McClintock’s group began their studies directed toward understanding how transposable elements regulate maize coat color (McClintock, 1958). During this same period, Brink defined “paramutation”, the expression of a heterozygotic allele heritably altered by exposure to the other allele (Brink, 1958a,b). The field blossomed with new studies pursuing the basis of mammalian X-inactivation (Ohno et al, 1959; Lyon, 1961) along with those using insect models targeted towards understanding imprinting (Crouse, 1960). This set the stage for chromatin structure’s steady progression into the spotlight.

The balance between activating factors and silencing factors dictates the transcriptional outcome. This has been considered the language overlaying that used by DNA-sequence dependent forms of regulation (Strahl and Allis, 2000). Cross-talk and reinforcement between each of the different epigenetic mechanisms enrich the lexicon regulating gene activation. What is thus far lacking, is the first word of the sentence, i.e., the crucial factor or process that sets potentiation in motion. Evidence has been accumulating that interactions with the nuclear matrix may be the first word, since the nuclear matrix appears to recruit many epigenetic factors (Stauffer et al, 2001; Yasui et al, 2002). However, we require a greater appreciation for syntax. For example, the timing of each spoken word impacts how the sentence is interpreted. It communicates whether silencing or activation proceeds. The latter setting the stage for the recruitment of the transcription machinery to a gene’s enhancer and promoter regions as requisite for initiation and elongation.

This review will examine the body of work that has contributed to our understanding of the importance of chromatin structure and epigenetics with regard to its control over the expression of our genome. Interactions with the nuclear matrix that provide the framework for the molecular interactions that mediate epigenetic regulation will be considered.

 

II. Chromatin structure

Within the eukaryotic nucleus DNA associates with a number of proteins to achieve a high degree of compaction (Daban, 2003; Daban, 2000). This compensates for the increased size of the genome in a fixed nuclear space (Vinogradov, 2005). One of the building blocks, the nucleosome, has been characterized at the atomic level (Richmond and Davey, 2003). Nucleosomal packaging presents an obstacle to transcription (reviewed in Struhl, 1999). It is comprised of 146 base pairs of supercoiled DNA wrapped approximately one and three quarter times around a histone octamer core of H2A, H2B, H3 and H4 dimers. Each adjacent nucleosome is linked by approximately 15 base pairs of linker DNA complexed with histone (H1) to yield, in vitro, the “beads on a string”, or 10nm chromatin fibre. These structures coil, forming an approximately 30 nm diameter “solenoid” (Figure 1).

Heterochromatin are those transcriptionally silent chromosomal regions exhibiting higher levels of compaction. These structures are created and stabilized through a series of interactions with “heterochromatinizing” proteins such as heterochromatin protein 1 (HP1) or members of the polycomb group (PcG) of proteins (reviewed in Craig, 2005). In this case, trans-acting factor access is mitigated by the effects of higher order chromatin structure. Less compacted regions of chromatin, termed “euchromatin,” are those regions of the chromosome that can be permissive to transcription when they have assumed the open-potentiated conformation requisite for transcription. This is orchestrated through a number of epigenetic and transcription-promoting factors. Potentiated chromatin was originally viewed as the 10 nm fibre, although it is now known that other higher order euchromatic structures can support basal levels of transcription (Georgel et al, 2003).

Although the mechanistic intricacies mediating potentiation remain to be delineated, a series of clues to the mechanism are provided by the recently described host of epigenetic and chromatin structure-altering modifications. The modifiers include enzymes that act on the DNA directly, e.g., DNA MethylTransferases (DNMTs) including those that act on histones; histone MethylTransferases (MTases) and de-methylases, ATP-dependent chromatin-remodelling complexes, Histone AcetylTranserases (HATs) and Histone De-ACetylases (HDACs), mi-RNAs, as well as interactions with nuclear matrix constituents.

 

III. DNA methylation

Through covalent modification, DNMTs attach a methyl group to cytosine residues at the carbon-5 position (Figure 2a). In mammals, genomic methylation is localized predominantly to CpG dinucleotides (Holliday and Pugh, 1975; Riggs, 1975), although other non-CpG motifs have been shown to be methylated (Tasheva and Roufa, 1994; Clark et al, 1995). Eukaryotic DNMTs can be grouped into two main classes based on their activity (Riggs, 1975), maintenance or de novo (reviewed in Chen and Li, 2004; Ponger and Li, 2005).

Mammals possess five different DNA MethylTransferases, DNMT1, DNMT2, DNMT3a, 3b and 3L. Specific patterns of methylation are usually established in the early embryo by de novo MethylTransferases (MTases) DNMT3a and DNMT3b (Okano et al, 1999). DNMT3L has been shown to associate with DNMT3a or 3b and play an essential role in establishing parental imprints (Bourc'his et al, 2001; Hata et al, 2002; Webster et al, 2005) but possesses no catalytic activity (Aapola et al, 2001; Bourc'his et al, 2001). These patterns are perpetuated by the maintenance MTase DNMT1 after DNA replication. Lsh, a member of the SNF2 family of ATP-dependent helicases, is essential to maintaining genomic methylation (Dennis et al, 2001) in repetitive element rich regions (reviewed in Bourc'his and Bestor, 2002). DNMT2 is not-essential to mammalian methylation, as DNMT2 has only residual MTase activity in humans (Hermann et al, 2003). In addition, the corresponding targeted mouse ES cell knockout shows no change in the pattern of methylation (Okano et al, 1998). However, Drosophila DNMT2, plays an important role in non-CpG methylation (Kunert et al, 2003).

DNA methylation is usually associated with transcriptional repression. This is exemplified by the action of MeCP2 as shown in Figure 2b. This Methylation-sensitive chromatin protein binds via a specific methyl-CpG binding domain, MBD (Nan et al, 1993; Nan et al, 1996). The MBD recognizes symmetrically methylated CpG dinucleotides in the major groove (Rauch et al, 2005). Contact, critical to methyl-CpG recognition, is mediated by several residues in the loop domain between the second and third b-sheets (Free et al, 2001). Once bound, transcription is attenuated using a transcription repression domain to recruit mSin3A/HDAC, a nucleosome remodeller and histone de-acetylase (Deplus et al, 2002; Fuks et al, 2000; Fuks et al, 2001; Nan et al, 1998). MeCP2 can act in two different manners. First, by promoting histone de-acetylation, neighboring nucleosomes can be compacted to a silenced state. Second, by direct interaction with TFIIB, the formation of the pre-initiation complex can be inhibited (Wong and Privalsky, 1998).

Genomic methylation is also dependent on the methylation of a number of chromatin proteins (Fuks et al, 2002). For example, the amino-terminal of histone H3 can be methylated at lysine 9 (H3:K9) by Suv39-MTase. This promotes binding to members of the HP1 family of Heterochromatin Proteins (Nielsen et al, 2002) that are known to associate with yeast centromeric heterochromatin (Bannister et al, 2001). Failure to methylate H3:K9 can effect establishment, maintenance and inheritance of genomic methylation (Bannister et al, 2001; Lachner et al, 2001; Peters et al, 2001).

Sixty to ninety percent of vertebrate CpGs are methylated. The remaining non-methylated CpGs are commonly associated with gene promoters of constitutively expressed, housekeeping genes (Antequera and Bird, 1993). They are hypomethylated CpG islands defined as ~100 bp CºG rich regions, i.e. >55%, providing a universal control mechanism to ensure constitutive expression of housekeeping genes (Bird, 1986).


 

 

Figure 1. The nucleosome is the fundamental unit of DNA organization in the eukaryotic nucleus. Approximately 146 base pairs (bp) of DNA wraps 1.75 times around a histone octamer core composed of two of each of the H2A and H2B homodimers together with two H3-H4 dimers. Successive nucleosomes are linked by approximately 15 bp of DNA and the entire structure is stabilized associating with histone H1. In solution this arrangement forms the 10 nm “beads on a string” that is readily transcribed in the presence of all of the requisite trans-acting factors and transcription “machinery.” This form can be compacted by coiling into a solenoid form (~30 nm in diameter, in solution) that can be transcribed at basal levels. However, potentiation is a necessary transition from the compacted to the “beads on the string” conformation in order to allow higher degrees of transcription.

 

Figure 2. Genomic Methylation and silencing. a) Cytosine can be methylated at the carbon-5 position by any number of DNA Methyltransferases (DMNTs) with S-adenosyl-L-methionine (AdoMet) co-factor. Once modified, 5-methylcytosine is highly labile and can be readily deaminated to thymine. This modification is mutagenic yielding CºG ®5mCºG ®T=G®T=A transition essentially ablating trans-factor recognition, silencing Alu elements and other invading DNA elements. b) The primary mechanism of genomic methylation-based silencing requires the recognition of symmetrically methylated CpG dinucleotides (di-mCpG). For example, MeCP2, recognizes and binds to methyl-CpG Binding Domains (MBD) excluding transcription promoting interactions. The MBD interacts with the modified bases in the major groove and several residues in the loop domain between the second and third b-sheets have been shown to be critical to methyl-CpG recognition. Methyl-residues are shown in red, DNA phosphodiester backbone in blue. The ribbon model was adapted (Wakefield et al, 1999; Khorasanizadeh, 2004).

 

 


In mammals, principally primates, the Alu family of short interspersed repetitive elements are typically CpG rich. These transposable elements are often regarded as parasitic elements in genomes and contribute as many as one third of the CpGs in the genome (Schmid, 1991). In contrast to CpG islands, Alu elements are usually highly methylated to constitutively silence the endogenous RNA Polymerase III promoters embedded within these elements (Li et al, 2002). This is regarded as a genomic defense mechanism against transposable elements and viral sequences. Regulatory cis-elements in the Pol III promoters are silenced by methyl-cytosine deamination that transitions 5mCºG to T=A (Rideout et al, 1990). Thus, methylation based genomic silencing can be reinforced through histone modification and by mutating the invading sequences.

Genomic imprinting, the mono-allelic pattern of expression passed from parent to offspring, is largely controlled by methylation. The mouse Igf2-H19 locus on chromosome 7, shown in Figure 3, is a well characterized model of imprinting. H19 is expressed from the maternal allele and Igf2 from the paternal allele. Expression is dictated by the methylation status of three DMRs, Differentially Methylated Regions. The maternal allele is not methylated promoting the binding of the CTCF boundary element at the DMR, just upstream of H19 (Figure 3 top, region iii). This restricts the activity of the H19 enhancer. In comparison, the paternal allele (Figure 3 bottom) is heavily methylated at all DMRs (Figure 3, regions i, ii, iii). Thus CTCF cannot bind and the enhancer (Figure 3, EEE) can now interact with the Igf2 gene. Non-methylated DMR i, acts as an activator that likely loops out to physically interact with DMR iii to drive maternal H19 expression (reviewed in Kato and Sasaki, 2005). This interaction may be mediated by CTCF binding to unmethylated DMR iii. New evidence suggests that long-range chromatin looping mediated through the DMRs may position the transcribing alleles in active chromosome territory regions to facilitate their transcription (Murrell et al, 2004). DMR ii, located in intron 6 of Igf2, acts as a methylation-sensitive repressor. Paternal expression of Igf2 is promoted through its interaction in the unmethylated state with the downstream enhancer (EEE; Bell and Felsenfeld, 2000; Srivastava et al, 2000).

At the zygotic stage, most mammalian genomes are demethylated with the exception of sheep, where the degree of zygotic methylation loss is greatly decreased (Young and Beaujean, 2004). Methylation is then restored de novo at gastrulation (Monk et al, 1987). The mechanism of genome-wide demethylation has yet to be determined. Passive and progressive demethylation by down regulating maintenance DNMTs proceeded by successive rounds of replication and division has been proposed (Matsuo et al, 1998; Hsieh, 1999, 2000). However, this process would be exceedingly slow to account for the scale of change that occurs within a single day of development (Hajkova et al, 2002). Alternatively, active mechanisms of demethylation, employing DNA excision and repair mechanisms have also been implicated. For example, consider the expression of 5-methylcytosine DNA glycosylase that removes methyl-cytosine, cleaving the sugar backbone thus signaling to the DNA damage/repair mechanisms to replace the cleaved base for a cytosine. In vitro, over-expression demethylated the promoter of a stably co-transfected hormone-responsive reporter gene but failed to affect the level of genome-wide methylation (Zhu et al, 2001). This enzyme and the chicken homologue (Jost and Jost, 1994) preferentially utilize a hemi-methylated substrate. Accordingly genome-wide demethylation would require a concomitant decrease in DNMT activity. In essence, the active mechanism is dependent on the passive loss of DNA methylation. While the above have provided testable models, additional evidence of how re-activation of a methylated region of silenced DNA can be achieved is still lacking. However, the absence of an active mechanism


 

 

Figure 3. Genomic imprinting is governed by DNA methylation. Genomic methylation plays an essential role in genomic imprinting, the mono-allelic expression of specific loci. In mice, the Igf2-H19 locus provides a well-characterized model where differential methylation between alleles at various DMRs (differentially methylated regions) mediates the imprint. The unmethylated maternal allele, enables the CTCF boundary element to bind, restricting the activity of the upstream enhancers (EEE) on H19. In contrast, the paternal allele is heavily methylated at all three DMRs (red m’s). When unmethylated, DMRi, acts as an activator and could loop to physically interact with DMRiii to drive maternal H19 expression. This interaction may be mediated by CTCF, binding the unmethylated downstream DMR. DMRii acts as a methylation-sensitive repressor within intron 6 of Igf2. Upon interacting with the downstream enhancers unmethylated DMRii promotes paternal Igf2 expression.

 


would ensure that silent states are inherited as cells divide then maintained once terminal differentiation is achieved.

This raises the question what is the mechanism that directs or signals de novo genomic methylation? It has been proposed that paired DNA repeats (Hsieh and Fire, 2000) or spans of unpaired DNA that form unusual secondary structures, e.g., cruciforms or hairpins (Smith et al, 1991; Chen et al, 1995; Laayoun and Smith, 1995; Kho et al, 1998) could provide recognition sequences for de novo DNMTs, thus directing methylation to a specific locus. Recently, the recognition of multiple short A=T tracts in the minor groove have been implicated as triggers when establishing methylation in Neurospora crassa (Tamaru and Selker, 2003). Interestingly, plants utilize an RNA-mediated mechanism (Wassenegger et al, 1994). As shown in Figure 4, RNA-dependent DNA Methylation uses a ~21-26 nt, double stranded RNA (dsRNA; Mette et al, 2000; Sijen et al, 2001; Hamilton et al, 2002; Zilberman et al, 2003). These short RNase III-like nuclease processed dsRNAs, target and pair with specific regions of DNA e.g., promoters, (Mette et al, 2000; Jones et al, 2001; Sijen et al, 2001; Aufsatz et al, 2002) leading to the methylation of all cytosines contained within that processed dsRNA. Unlike X-ist RNA targeting in mammalian X-inactivation, this process is specific and does not spread. Only those cytosines within the short RNA are methylated (Jones et al, 2001; Aufsatz et al, 2002) to block the molecular recognition of cis-regulatory promoter elements silencing transcription.

 

IV. Post-translational histone modification

Histones are a dynamic substrate for post-translational modification at key amino acid residues. Modifications include acetylation, methylation, phosphorylation, glycosylation, ADP-ribosylation, ubiquitination and sumoylation. The sites of core histone modification are summarized in Figure 5. Each can signal, or results from, a host of cell processes including DNA repair, gene transcription, chromatin activation, silencing, heterochromatin formation, replication and/or recombination. Phosphorylation and ADP-ribosylation are more commonly associated with DNA repair mechanisms (de la Cruz et al, 2005; Lowndes and Toh, 2005) but can impact transcription. Sumoylation and ubiquitination are two related protein modification pathways associated with protein trafficking and degradation. Interestingly, sumoylation of the amino-terminus of H4 has been associated with transcriptionally quiescent DNA (Shiio and Eisenman, 2003; Gill, 2004). Detailed reviews of the various histone modifications have been presented (Fischle et al, 2003a; Lachner et al, 2003; Gill, 2004; Khorasanizadeh, 2004; de la Cruz et al, 2005; Lowndes and Toh, 2005). The sections that follow will focus on acetylation and methylation that are well characterized and known to change with transcriptional status.

 

A. Acetylation

Heritable states of active and silent chromatin can be achieved through various post-translational modifications to amino- and carboxy- histone termini. Acetylation and methylation of key lysine residues in the amino-termini of histones H3 and H4 are well characterized. Acetylation of the H3 and H4 lysines employ large Histone AcetylTransferase domain complexes such as GCN5. Transcription is generally facilitated by these modifications in two manners. First, lysine acetylation increases steric hindrance between adjacent nucleosomes,


 

 

Figure 4. RNA-dependent DNA methylation RdDM. A ~21-26 nt, double stranded RNA targets specific regions, e.g., promoters, for localized methylation of all cytosines. Only those cytosines within the short RNA are methylated to block the molecular recognition of cis-regulatory promoter elements silencing transcription. Unlike X-ist RNA targeting in mammalian X-inactivation, this process is specific and does not spread along a larger domain.

 

Figure 5. Post-translational histone modifications. The covalent modification of histones can serve to signal any number of cellular and genetic processes. This has been reviewed by (Fischle et al, 2003a; Lachner et al, 2003; Gill, 2004; Khorasanizadeh, 2004; de la Cruz et al, 2005; Lowndes and Toh, 2005), and (Upstate, Waltham, MA, USA).The effect is varied, depending on the residue and how it is modified. Lysines are subject to acetylation, methylation, ubiquitination and sumoylation, arginine to methylation while serine and tyrosine can be phosphorylated. Acetylation of H3 and H4 lysines are generally associated with transcriptionally active DNA, e.g. H3:K4, K9, K27. In contast, sumoylation of H4 lysine is associated with transcriptional repression, but the specific residues have not yet been identified. Arginine and lysine can be mono-, di- and tri-methylated associated with both active and silent chromatin. Specific modifications seldom exist to the exclusion of the other. In concert they form a code imparting silencing or activation.

 

 


thereby disrupting higher order chromatin folding (Fry and Peterson, 2001). The degree of order is substantially lessened upon hyperacetylation, when at least six of the H3 and H4 lysine residues are acetylated. In conjunction with other factors, this maintains a potentiated open chromatin state enabling trans-acting factors to interact with cis-regulatory elements. Secondly, these acetylated residues are recognized by the bromodomains of transcription-promoting factors such as TFIID of the pre-initiation complex as well as ATP-dependent chromatin remodeling complexes (Agalioti et al, 2002).

Histone de-acetylases (HDACs) can reverse these modifications. They are divided into three groups depending on their similarity to those originally cloned from yeast and whether they require NAD+ (reviewed in de Ruijter et al, 2003). Silencing by histone methylation usually accompanies deacetylation as exemplified by MT1, metallothionein I. For example, DNMT3a/b methylates the MT1 promoter at several sites becoming substrates for MeCP2. This recruits HDAC1 to the promoter and de-acetylates several key histones effectively silencing the MT1 promoter (Beisel et al, 2002). This class of associations between HDACs and other chromatin remodelers is termed NuRDs (Nucleosome Remodelling and histone De-acetylases).

 

B. Methylation

While acetylation generally correlates with transcriptionally competent chromatin, methylation lacks that specificity. Both lysine or arginine residues can be methylated. The extent of the modification and the position of the residue on the histone tail, promotes either active or silent chromatin states. For example, the methylation of H3:K9 and H3:K27 in Drosophila promotes chromodomain binding of HP1 and Polycomb protein to heritably silence spans of chromatin (Fischle et al, 2003b). In contrast, the interaction of RNA Polymerase II with methylated H3:K4 can be associated with transcriptional activation (Ng et al, 2003). Methylation of H3:K4, H3:K9 and H4:K20 by histone MTase Ash1 provides substrates for Brm, a SWI/SNF family homologue in Drosophila. In this case nucleosomes localized to several cis-regulatory trithorax response elements are remodeled (Beisel et al, 2002) thereby activating the Ubx promoter.

Unlike direct nucleotide methylation, recent studies have identified two novel histone de-methylation pathways. These include PAD4, PeptidylArginine Deiminase 4 and LSD1, Lysine Specific Demethylase 1. For example, PAD4 has been identified as catalyzing the deimination of arginine residues in humans. This is achieved by converting methylated H3:R17 and H4:R3 to citrulline. In turn this down regulates the transcription of an oestrogen-responsive gene (Wang et al, 2004). In comparison, using the FAD cofactor, yeast LSD1 directly demethylates H3:lys4. This leads to transcriptional repression (Shi et al, 2004).

 

V. ATP-dependent chromatin remodeling

Several ATP-dependent chromatin remodeling processes are mediated by large multi-subunit helicase containing complexes. These include SWI/SNF (SWItch/Sucrose Non-Fermenting helicases), ISWI (Imitation SWItch, nucleosome remodelers) and NuRD. By altering nucleo-histone interactions as well as protein-protein interactions these complexes can both modulate silencing as well as maintain open chromatin structures.

This group of ATP-dependent chromatin remodelers were initially characterized as part of an ~2 MDa, 11-subunit complex of proteins in yeast. Homologues have since been identified throughout eukaryotes (Becker and Horz, 2002; Natarajan et al, 1999; Pollard and Peterson, 1998). While SWI/SNF have been shown to act on genes independent of other factors (Natarajan et al, 1999), they usually form complexes with HATs such as SAGA and trans-activators like GAL-4 (VP16 and AH). In this manner they can cooperatively act to remodel chromatin and promote transcription (Neely et al, 1999; Yudkovsky et al, 1999). The order in which these large remodeling complexes are formed is still a matter of intense debate. For example, there is evidence suggesting that SWI/SNF and HATs are recruited by transcription factors, transcription associated factors and the pre-initiation complex, prior to associating with DNA (Soutoglou and Talianidis, 2002). Other models suggest that SWI/SNF remodeling must occur first, in order to enable additional factors to interact with the DNA (Peterson, 2000; Peterson and Logie, 2000; Peterson and Workman, 2000).

While some members of this family of enzymes repress transcription, e.g., NuRD (Wade et al, 1999), the majority of these factors activate transcription. This is achieved in an ATP-dependent manner by twisting the nucleosome thereby disrupting the association between DNA and histones in a ubiquitin independent manner (Flaus and Owen-Hughes, 2003 and reviewed Fry and Peterson, 2001). By disrupting nucleo-protein association at promoters or enhancers these regions become histone free and available to bind the necessary trans-acting factors to effect transcription. In addition to their role during transcription, chromatin remodellers are recruited to sites of replication, facilitating the re-association of epigenetic marks to ensure the heritability of the chromatin states (Poot et al, 2004, 2005).

 

VI. RNA-mediated regulation

The last decade has revolutionized how we view the role of RNA in the cell. It is well-documented that X chromosome dosage compensation in mammals is mediated by RNA-directed silencing. The Xist RNA coats and directs the inactivation (Penny et al, 1996; Marahrens et al, 1997) of the specific chromosome from which it is expressed. Different regions of this ~70 kb transcript regulate association to the X chromosome, as well as its ability to signal downstream silencing (Wutz et al, 2002). It is thought that several repeating stem-loops form in the 5’ region of the transcript to serve as binding sites for proteins that may direct the myriad of epigenetic modifications required to silence the inactive copy of the X chromosome (Brown and Baldry, 1996; Wutz et al, 2002). These include, methylation of H3:K9 and K27 by several histone MTases (Okamoto et al, 2004; Rougeulle et al, 2004).

No longer recognized as just an intermediate as DNA is decoded to protein, RNAs are now also appreciated as molecules of regulatory function from plant to man. RNA-mediated gene silencing, termed RNA interference (RNAi) in animals uses signaled transcript degradation or translational inhibition by targeted single-stranded 21-25 nt micro RNA (miRNA) molecules (Figure 6). Interestingly, genes coding for miRNAs are found throughout the genome (Lau et al, 2001; Lagos-Quintana et al, 2001; Lee and Ambros, 2001; Mourelatos et al, 2002) and when initially transcribed bear a 5’cap structure and poly(A)-tail as shown in Figure 6.1 (Lee et al, 2004). Functional miRNAs are derived from a series of maturation events catalyzed by various RNase III complexes. Subsequent to transcription, 70-nt nuclear hairpin pre-miRNAs are generated by Drosha-DGCR8 (Figure 6.2.3; Lee et al, 2003; Han et al, 2004). This dictates specificity and relative silencing ability (Lee et al, 2003; Lund et al, 2004). After transitioning to the cytoplasm Dicer cleaves the pre-miRNAs to yield ~22 nt mature miRNAs (Bernstein et al, 2001; Grishok et al, 2001; Hutvagner et al, 2001; Ketting et al, 2001; Knight and Bass, 2001). To specifically target transcripts for down-regulation (Hammond et al, 2000, 2001) these short molecules then pair with a large protein complex including members of the Argonaut family, termed RISC (RNA-Induced Silencing Complex) as summarized in Figure 6.7, 8. Down-regulation is achieved by either cleavage or translational inhibition of the target mRNA as determined by the type of Argonaut protein that is part of that RISC and the level complementation between the miRNA sequence and its target mRNA (reviewed in Tang, 2005).

RNAi impacts numerous developmental pathways. Members include Lin-4 (Lee et al, 1993; Wightman et al, 1993) and let-7 (Reinhart et al, 2000; Slack et al, 2000). They were the first miRNAs discovered and shown to regulate developmental and morphogenetic timing in the nematode caenorhabditis elegans. Interesting but still controversial, several miRNA-like transcripts have been identified in mature human sperm (Ostermeier et al, 2005) that could aid in gene regulation in the early embryo and imprinting (Krawetz, 2005).

In plants and fungi, an RNA-dependent RNA Poymerase-based (RdRP) amplification mechanism that reinforces and spreads silencing has been identified (reviewed in Caplen, 2004). A similar RdRP mechanism, has been shown to participate in the establishment of centromeric heterochromatin in budding yeast (Motamedi et al, 2004; Sugiyama et al, 2005). In this case, RNAs complimentary to the centromeric repeats are processed by the RNA-induced Initiation of Transcriptional gene Silencing complex (RITS). RITS contains an RdRP (Rdp1), Dicer and Chp1, a heterochromatin-associated protein (Verdel et al, 2004). To permanently silence the centromeric DNA the RITS complex then recruits a number of ATP-mediated chromatin remodelers (e.g., Swi6), heterochromatin assembly proteins (e.g., HP1), as well as the histone MTases Clr4 and Suv39H that methylate H3:K9, (Motamedi et al, 2004; Noma et al, 2004; Sugiyama et al, 2005).

 

VII. The nuclear matrix

The manner in which chromatin and its modifying factors are organized within the cell nucleus is a layer of complexity whose role in gene regulation is becoming apparent. Individual chromosomes occupy distinct territories within the cell nucleus with transcriptionally active segments generally localized to the periphery and transcriptionally inert regions localizing to the center of the territories (Kurz et al, 1996; Dietzel et al, 1999). While delimiting each chromosome in the interphase nucleus, territories appear as static regions. Recent evidence has emerged that transcriptionally active regions of DNA can loop out of chromosome territories (Chambeyron and Bickmore, 2004; Chambeyron et al, 2005; Spilianakis et al, 2005) to interact with the domains of co-regulated genes on other chromosomes (Spilianakis et al, 2005). Chromatin is organized by attachment or association to a network of proteins including ribonucleo-proteins within the nucleus that are interior to the nuclear envelope, termed the nuclear matrix (Cremer et al, 2000). The very existence of the nuclear matrix itself has been a great source of contention, as many of the studies conducted in the field use biochemically fractionated cell preparations that preclude examination of in vivo function (Mirkovitch et al, 1984; Balasubramaniam and Oleinick, 1995).


 

 

Figure 6. RNA-mediated silencing. Post-transcriptional RNA-mediated silencing termed RNAi, RNA interference is a multi-step mechanism. It begins in the nucleus, with expression of 5’ capped, polyadenylated transcripts that are ubiquitous throughout the genome (1). These are processed in the nucleus by an RNaseIII-like complex including Drosha (2) that cleaves the primary transcripts into ~70 nucleotide hairpin pre-micro RNAs (pre-miRNAs; 3) that are exported to the cytoplasm by Exportin 5, a nuclear pore protein (4). Upon reaching the cytoplasm, these pre-miRNAs are further processed by Dicer (5), another RNaseIII family member, to produce 21-25nt miRNAs (6). Silencing is achieved upon association of these miRNAs with the RNA-Induced Silencing Complex, RISC (7). RISC is a heterogeneous complex of additional RNaseIII-like factors as well as members of the Argonaut family. This association empowers RISC to down regulate a specific message by either targeted mRNA cleavage or targeted mRNA translational inhibition (8). Silencing efficiency is dependent upon the level of miRNA and target identity as well as the type of Argonaut protein that is associated with the RISC. The greater the level of identity drives the mechanism towards cleavage and degradation as opposed to translational repression of the target.

 

 


However, over time two paradigms of the nuclear matrix have emerged.

The first paradigm regards the nuclear matrix as a static structure. Chromatin is segmented into transcriptional domains ranging from 50-200 kb in length. The second paradigm includes both a structural and dynamic or transient component. The latter views the composition of the nuclear matrix as transient, reflecting the developmental competence of the cell. Just as cells undergoing differentiation utilize different promoters, enhancers and “epigenetic terms” to specify genetic fates, so too are different factors mobilized to their respective nuclear matricies. These are reflective of the types of interaction necessary to produce the intended outcome (Samuel et al, 1997a,b; Yudkovsky et al, 1999; Alvarez et al, 2000; Kontaraki et al, 2000; Vassetzky et al, 2000; Spencer and Davie, 2001; Sun et al, 2001; Beisel et al, 2002; Soutoglou and Talianidis, 2002).

 

A. Matrix attachment regions

Nuclear matrix association is mediated by Matrix Attachment Regions, (MARs), which anchor chromatin to the nuclear matrix and shield domains from neighboring enhancers and the silencing effects of neighboring chromosomal regions (McKnight et al, 1992; Namciu et al, 1998; Martins and Krawetz, 2004). While these and other roles of nuclear matrix-DNA interaction have been scrutinized (Bode et al, 2000), the exact nature of the association and its mechanism remain to be fully characterized.

The few examples of characterized MARs indicate that they vary in size ranging from 100 - 1000 base pairs in length and in primary sequence. Several “families” of matrix associating DNA motifs have been described in the literature (Singh et al, 1997). These include AT-rich spans, GT-rich spans, topoimerase II binding sites, regions of kinked and curved DNA secondary structures, as well as origins of replication