Review Article |
Corresponding author: Magdalena Achrem ( magdalena.achrem@usz.edu.pl ) Academic editor: Svetlana Galkina
© 2020 Magdalena Achrem, Izabela Szućko, Anna Kalinka.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Achrem M, Szućko I, Kalinka A (2020) The epigenetic regulation of centromeres and telomeres in plants and animals. Comparative Cytogenetics 14(2): 265-311. https://doi.org/10.3897/CompCytogen.v14i2.51895
|
The centromere is a chromosomal region where the kinetochore is formed, which is the attachment point of spindle fibers. Thus, it is responsible for the correct chromosome segregation during cell division. Telomeres protect chromosome ends against enzymatic degradation and fusions, and localize chromosomes in the cell nucleus. For this reason, centromeres and telomeres are parts of each linear chromosome that are necessary for their proper functioning. More and more research results show that the identity and functions of these chromosomal regions are epigenetically determined. Telomeres and centromeres are both usually described as highly condensed heterochromatin regions. However, the epigenetic nature of centromeres and telomeres is unique, as epigenetic modifications characteristic of both eu- and heterochromatin have been found in these areas. This specificity allows for the proper functioning of both regions, thereby affecting chromosome homeostasis. This review focuses on demonstrating the role of epigenetic mechanisms in the functioning of centromeres and telomeres in plants and animals.
cytosine methylation, histone code, non-coding RNA, pericentromeric, subtelomeric
The term epigenetics refers to a variety of processes that change gene expression independently of DNA sequence. An important feature of the epigenetic pattern is that it is stable and inherited through cell divisions, although it can be reversible (
DNA methylation is of great importance among the epigenetic mechanisms that regulate gene expression in plants and animals. DNA methylation is associated with gene silencing (
Chromatin remodeling results from the action of ATP-dependent complexes that change the association of DNA with core histones and from modifications of histone proteins, affecting the availability of DNA (
Post-translational modifications of histone proteins are another important epigenetic mechanism. Histones (H2A, H2B, H3 and H4)2 are the basic protein component of the nucleosome that forms the core around which a DNA strand of about 146 bp is wrapped (
Epigenetic regulators also include non-coding RNA (ncRNA). In epigenetic processes, the most important role among non-coding RNAs is played by those molecules that act in the RNAi (RNA interference) pathway and certain lncRNA (long non-coding RNAs, over 200 nt in length) (
Most of the presented epigenetic mechanisms are closely associated with each other to ensure stabilization and transmission of epigenetic patterns from cell to cell during cell divisions. They interact with each other in different ways. DNA methylation can promote changes in histone modification and vice versa. However, they can also change accidentally under the influence of stimuli coming from the internal and external environment (
However, there is a fairly close connection between epigenetic regulators and the spatial structure of the cell nucleus due to the fact that the organization of chromatin is epigenetically determined. In turn, the organization of chromatin influence the spatial structure of the cell nucleus. Based on the studies of the nucleus of mammalian cells, chromatin was divided into following compartments A – euchromatin, B – facultative heterochromatin (
The epigenetic nature of both centromeric and telomeric regions is not clearly defined. This is because these are regions built from repetitive sequences, which makes it difficult to accurately show epigenetic modifications of centromeres and telomeres. This review focuses on demonstrating how epigenetic mechanisms affect the functioning of centromeric and telomeric regions, taking into account differences in plants and animals.
The centromere was first described by Walther
The simplest centromere with a length of 125 bp is found in Saccharomyces cerevisiae (Meyen, 1883). This simple, small centromere contains a single cenH3 (centromere specific histone 3) nucleosome, which binds a single microtubule during cell division, which is why this centromere type is called the “point centromere” (
Holocentric chromosomes have been found in some plants (e.g. the genus Luzula Candolle and Lamarck, 1805) , animals (several arthopods and nematodes) and Rhizaria (Cavalier-Smith, 2002) (
In monocentric chromosomes of animals and plants the centromere region constitutes a segment from several kb to Mb in size, that contains satellite DNA with repeating monomers of ~100–400 bp (
In plants, the centromeric region is composed of alternating tandem repeats and retrotransposons. For example, sequencing of maize centromeric DNA revealed two types of repetitive sequences in this region: satellite CentC (156 bp monomer) and retrotransposon CRM (centromeric retrotransposon of maize) sequences (
Human centromeres are characterized by the presence of satellite tandem repeats of ~171 bp in size, arranged “head-to-tail”, that are further arranged in higher order repeats (HOR). Individual monomers share 50–70% sequence identity, but HORs have 95–98% similarity (
Centromeric DNA sequences are evolving relatively fast (
The centromeric core, which provides the kinetochore attachment site, is flanked by pericentromeric regions. Pericentromeric chromatin stabilizes the centromeric core, inhibiting internal recombination between core repeat sequences (
Pericentromeres, like the core centromere, mainly consist of repetitive sequences. Among the sequences included in pericentromeric DNA, there are satellite sequences, as well as transposons, LTR and non-LTR retrotransposons (
As previously mentioned, it is believed that satellite DNA is not essential for maintaining centromere structure and function. The term “centromere paradox” defines the fact that centromere sequences are very variable, while centromere function is conservatively maintained. However, as it turns out, centromere functionality does not result from the composition of the relevant DNA sequences, but the epigenetic mechanisms are responsible for it (
Epigenetic modifications of centromeric regions and their functions in plants and animals.
Epigenetic modification | Region | Function | Reference |
---|---|---|---|
histone variant cenH3 CENP-A | centromeric | specifies centromere location essential for kinetochore assembly |
|
H3K4me1, H3K4me2, H3K36me2, H3K36me3 | centromeric | maintenance of centromere stability RNA II pol activity recruitment of HJURP proteins CENP-A deposition |
|
H4K5ac and H4K12ac | centromeric | CENP-A deposition |
|
H4K20ac | centromeric | required for transcriptional activity required for kinetochore formation in human and Gallus cells |
|
H2AT133ph H2AT120ph | centromeric | recruitment of Shugoshin (Sgo1) protein prevents precocious separation of sister chromatids |
|
monoubiquitinated H2B (H2Bub1) | centromeric | required for transcriptional activity provides structural integrity required for proper chromosome segregation |
|
H3K9me | pericentromeric | chromatin condensation ensures chromatid cohesion provides structural integrity |
|
H4K20me | pericentromeric | chromatin condensation provides structural integrity |
|
H3K27me | pericentromeric | transcriptional repression of transposable elements |
|
H3 and H4 lysine residues acetylation | pericentromeric and centromeric | increase in chromatin compaction heterochromatin integrity |
|
Cytosine methylation of DNA | pericentromeric and centromeric | chromatin condensation provides structural integrity |
|
The results of studies on the epigenetic regulation of centromeric regions are ambiguous. The difficulty in studying these regions is caused by the fact that centromeres in most multicellular eukaryotes are formed of numerous copies of repetitive sequences (
Many studies on centromere chromatin in Arabidopsis thaliana (Linnaeus, 1753) have shown that it forms chromocentres in the interphase nuclei, it is rich in H3K9me2, characterized by DNA hypermethylation and enrichment in histone variant H2A.W (
In contrast, studies on centromeres in rice have shown that DNA sequences in a functional centromere can be both hypo- and hypermethylated. DNA methylation patterns appear to be correlated with specific sequence motifs (CG, CHG, CHH) in centromeric DNA (
Research on the level of DNA methylation in medaka fish (Oryzias latipes Temminck et Schlegel, 1846) demonstrated that centromeres are mainly hypermethylated, but have hypomethylated subregions (
Centromeric chromatin (CEN) is characterized by the presence of specific histone H3 variant – cenH3 (CENP-A in mammals, CID (centromere identifier) in Drosophila melanogaster (Fallén, 1823), cenH3 in plants) (
Model of the vertebrate mitotic centromere/kinetochore complex. Kinetochores assemble on chromatin marked by CENP-A containing nucleosomes. CENP-A nucleosome binds chromosomal passenger complex (CPC), which consists of four proteins: kinase Aurora B, INCENP, Survivin and Borealin. The kinetochore is composed mainly of CCAN (constitutive centromere-associated network) and Knl1-Mis12-Ndc80 complexes. The presence of CENP-A allows the recruitment of CCAN, which is a complex consisting of 16 centromeric proteins: CENP-C, CENP-T-W-S-X, CENP-H-I-K-M, CENP-N-L and CENP-O-P-Q-R-U. CENP-C and CENP-N bind CENP-A. The CENP-T-W-S-X complex creates a unique nucleosome-like structure that allows DNA binding in centromeric chromatin. CENP-N-L and CENP-H-I-K-M have regulatory roles. CENP-H-I-K-M-L-N help recruit CENP-C. CENP-C binds to the Mis12 complex, which then recruits Knl1 proteins interacting with microtubules and the Ndc80 complex. Ndc80 – kinetochore complex component (the complex consists of Ndc80-Nuf2-Spc24-Spc25 proteins); cenH3 – centromere specific histone 3 or histone H3 variant found at the centromere, CENP-A – centromere protein A, centromere specific histone 3 or histone H3 variant found at the centromere; CENP-C – centromere protein C; Mis 12 Complex – complex of the core kinetochore (the complex consists of Mis12-Dsn1-Nnf1-Nsl1 proteins); Knl1 – kinetochore scaffold 1; Zwint – kinetochore proteins; CCAN – constitutive centromere-associated network, CPC – chromosomal passenger complex (consisting of Borealin, Survivin, INCENP, and the Aurora B kinase), INCENP – Inner Centromere Protein; Ska Complex – spindle and kinetochore associated (the complex consists of Ska1-Ska2-Ska3 proteins).
In human CEN chromatin, nucleosomes containing the CENP-A variant alternate with nucleosomes with the canonical histone H3. Histones H3 in this region undergo methylation at lysine positions 4 and 36 (H3K4me1, H3K4me2, H3K36me2, H3K36me3), characteristic of transcriptionally active chromatin. They affect RNA pol II (RNA polymerase II) activity and play an important role in the recruitment of HJURP proteins that participate in the CENP-A deposition (
Interestingly, CEN is not usually associated with the presence of H3K9me2 or H3K4me3 heterochromatin markers, although H3K9me3 modification has been shown in this region to be associated with transcription repression (
In maize centromeres, the presence of histone post-translational modifications associated with transcriptional activity, such as histone H4 acetylation and H3K4me2, has been revealed. It was indicated that centromeres in this species are organized as euchromatin regions flanked by pericentromeric H3K9me2-enriched heterochromatin (
CENP-A is less likely to undergo post-translational modification than canonical histone H3 (Fig.
Epigenetic modifications in centromeric and pericentric chromatin. Centromeres consist of alternating blocks of nucleosomes containing H3 or cenH3. At pericentric sites, only H3-containing nucleosomes are present. Epigenetic markers in centomere and pericentromere regions characteristic for both plants and animals are marked with black color, only for plants with violet color, only for animals with rose color. (+) epigenetic marker always present; (-/+) epigenetic modification present or absent.
It has long been believed that centromeric chromatin is transcriptionally inactive because it is formed mainly by satellite sequences. It is now known that CEN transcription is mediated by RNA pol II, which was detected in centromeric regions in both S. pombe, Drosophila, mouse, human, Zea (Linnaeus, 1753), Oryza (Linnaeus, 1753) and neocentromeres, as well as in CEN of human artificial chromosomes (HAC) (
It has also been proven that the region directly adjacent to the centromere plays a role in sister chromatid cohesion (
Histones in pericentric chromatin are mostly hypoacetylated, which causes chromatin condensation. Pericentromeric areas are characterized by the presence of histone variants H3.3 and H2A.Z (
A characteristic protein of this region is HP1 or its homologs (
The analysis of human neocentromeres that showed centromere functioning without satellite repeats (although they had a slightly higher AT content, from 59.9 to 66.1% compared to genomic average of 59%). The acquisition of centromeric function by a chromatin region without changing the DNA sequence was called the “centromerization” phenomenon (
Studies on dicentric chromosomes also support this fact. Dicentric chromosomes are the result of genomic rearrangements placing two active centromeres on the same chromosome. Most dicentric chromosomes are unstable and only due to epigenetic mechanisms, which deactivate one of the centromeres, monocentric chromosomes can be formed that normally segregate during cell division (
Evolutionary repositioning or shift of the centromere along the chromosome with its function, leading to the formation of new evolutionary centromeres (ENCs), is another phenomenon that shows the epigenetic nature of these structures. This phenomenon was observed in primate chromosomes, other placental, marsupials and birds (
Telomeres are specialized structures located at the ends of linear eukaryotic chromosomes. Their function is to protect the ends of chromosomes from inappropriate enzymatic degradation. They are also responsible for chromosome localization in the cell nucleus and transcription regulation of genes located near telomeres (
Due to the important functions they perform in the cell, telomeres are evolutionarily conserved regions and their structure is only slightly different in individual species. However, the length of telomeric sequences shows individual, tissue and cellular variability (
In mammals, the DNA stretch comprising a telomere is terminated with single-stranded free G-overhangs of varying, species-specific length (
Telomeric chromatin has a typical organization, forming the nucleosome fiber at the basal level. This structure may be different only in regions where there are telomere-specific proteins (
Telomere structure in mammals; T-loop and D-loop are presented together with schematic representation of the shelterin complex on telomeric DNA. The shelterin complex consists of six proteins: TRF1 and TRF2 (telomere repeat-binding factor 1 and 2), RAP1 (repressor/activator protein 1), TIN2 (TRF1-interacting nuclear factor 2), TPP1 (TINT1/PTOP/PIP1 protein) and POT1 (protection of telomeres 1).
D. melanogaster telomeres have yet another structure. Three following retrotransposons have been identified in the telomere sequence: HeT-A, TART and TAHRE (HTT). At the ends of telomeres, there are numerous copies of HTT retrotransposon, while in the most proximal region, there are sequences called TAS (telomere associated sequence). The ends of telomeres are protected and stabilized by a protein complex. An important role is played by the heterochromatin 1 (HP1) protein, which binds to dimethyl lysine 9 in histone H3 (H3K9me2) (
In plants, telomeres are usually several kbs in size (A. thaliana – 2–9 kb), although they may be longer in some plants, e.g. tobacco telomeres may have a size of up to 150 kb (
The epigenetic nature of telomeres and subtelomeres remains controversial (
Epigenetic modifications in telomere and subtelomere chromatin and adjacent euchromatin. Epigenetic markers in telomere and subtelomere regions characteristic for both plants and animals are marked with black color, only for plants with violet colour, only for animals with rose color.
The variety of information regarding telomere regions may partly result from experimental limitations, but also due to the epigenetic diversity of animal (
Telomeric and subtelomeric chromatin studies in mouse showed the presence of histone modifications characteristic of the heterochromatin fraction (
However, in mouse cells, telomeres are enriched in modifications specific to heterochromatin (H3K9me3) and euchromatin (H3K4me3). Although the H3K4me3 modification was at a lower level compared to H3K9me3 (
Similar results were obtained in studies on plant telomeres. In Arabidopsis, heterochromatin modifications, such as H3K9me2 and H3K27me3, as well as euchromatin H3K4me3 modification have been reported (
Studies on telomere DNA methylation have not found so many discrepancies. Telomeres in mammalian cells are deprived of CpG dinucleotides, and therefore do not undergo DNA methylation (
While there is great controversy about the heterochromatic nature of telomeres, most studies show that this chromatin fraction is characteristic of subtelomeric regions. In animal and human cells, the subtelomeric regions are characterized by high CpG methylation and trimethylation of lysine 9 in histone H3 (H3K9me3) (
The heterochromatic state plays an important role in telomere biology, suggesting that the integrity of the subtelomeric heterochromatin may be important for the proper functioning of telomeres. A correlation was found between changes in the level of DNA methylation in the subtelomeric region and regulation of telomere length (
In budding yeasts, heterochromatinization of the subtelomeric region positively regulates telomere length (
Surprisingly, different reports have indicated that the length of telomeres does not change in epigenetic mutants (
For a long time, telomeres were perceived as silenced, transcriptionally inactive chromosome segments. This fact is negated by the presence of telomeric RNAs containing UUAGGG repeats, called TERRA, which are transcribed from the subtelomeric regions towards the ends of the chromosome by RNA pol II in yeasts, vertebrates and plants (
Binding of the TERRA transcripts to telomeres seems to be crucial for their structure and function (
The interaction of TERRA transcript with TRF1 and TRF2 proteins can facilitate the binding of TERRA to the ends of chromosomes. Due to the fact that TRF1 and TRF2 can interact with chromosomes also in different regions (especially with ITR) (
TERRA transcripts can promote homologous recombination between telomeres by creating RNA-DNA heteroduplex (R loops) at the ends of chromosomes (
Histone substitution with their variants is another epigenetic mechanism that plays a role in the functioning of telomeres. In human and mouse cells, histone H3.3 variant was correlated with TERRA transcriptional repression in telomeres and subtelomeres (
Centromeres and telomeres are indispensable elements of every functional chromosome in Eukaryota. Considering the conservative role, their structure should be similar, not only in the context of the DNA nucleotide sequence, but also at the level of chromatin organization. Whereas in the case of telomeres this can be seen, in centromeres the similarity is observed mainly at the level of epigenetic modifications, with a great diversity of nucleotide sequences. Although microscopic analysis indicates that they are heterochromatin elements, they should now be considered as specific regions of the so-called intermediate heterochromatin, i.e. having epigenetic features of both euchromatin and heterochromatin. Undoubtedly, epigenetic status plays an extremely important role in regulating both telomeres and centromeres. For it is the specific structure of chromatin, and not just the DNA sequence itself, that ensures the proper functioning of these regions during the entire cell cycle. Many analyses have been carried out, the results of which were often contradictory, hindering an unambiguous determination of epigenetic markers of centromeric and telomeric regions.
However, these analyses have allowed us to perceive the epigenetic nature of telomeres and centromeres as very complex systems, precisely regulated at many levels. Disorders of this regulation can lead to destabilization of the entire genome. It also turned out that adjacent regions, i.e. subtelomeres and pericentromeres, often no less important than key elements, were thought for a long time to be heterochromatin boundary areas. Currently, it seems that maintaining their epigenetic status affects the structure and functioning of telomeres and centromeres. There is a need for further research on other species that will allow better understanding of telomere and centromere regulation systems in all their complexity.