Research Article |
Corresponding author: Renhai Peng ( aydxprh@163.com ) Academic editor: Viktoria Shneyer
© 2020 Yuling Liu, Zhen Liu, Yangyang Wei, Yanjun Wang, Jiaran Shuang, Renhai Peng.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Liu Y, Liu Z, Wei Y, Wang Y, Shuang J, Peng R (2020) Cloning and preliminary verification of telomere-associated sequences in upland cotton. Comparative Cytogenetics 14(2): 183-195. https://doi.org/10.3897/CompCytogen.v14i2.49391
|
Telomeres are structures enriched in repetitive sequences at the end of chromosomes. In this study, using the telomere primer AA(CCCTAAA)3CCC for the single primer PCR, two DNA sequences were obtained from Gossypium hirsutum (Linnaeus, 1753) accession (acc.) TM-1. Sequence analysis showed that the two obtained sequences were all rich in A/T base, which was consistent with the characteristic of the telomere-associated sequence (TAS). They were designated as GhTAS1 and GhTAS2 respectively. GhTAS1 is 489 bp long, with 57.6% of A/T, and GhTAS2 is 539 bp long, with 63.9% of A/T. Fluorescence in situ hybridization results showed that both of the cloned TASs were located at the ends of the partial chromosomes of G. hirsutum, with the strong signals, which further confirmed that GhTAS1 and GhTAS2 were telomere-associated sequences including highly tandemly repetitive sequences. Results of blast against the assembled genome of G. hirsutum showed that GhTAS sequences may be missed on some assembled chromosomes. The results provide important evidence for the evaluation of the integrity of assembled chromosome end sequences, and will also contribute to the further perfection of the draft genomes of cotton.
G. hirsutum, telomere-associated sequence, cloning, FISH
Telomeres are DNA-protein complexes at the ends of chromosomes (
Telomere tandem repeats located at the end of chromosomes represent only a part of the end of chromosomes. Telomere-associated sequences (TASs) located directly proximal to telomere tandem repeats (
At present, there is relatively little research on cotton telomere. Combining FISH using the Arabidopsis-type telomere sequence amplified from Arabidopsis genomic DNA and BAL-31 digestion,
The plant material was G. hirsutum acc. TM-1 (AADD)1, which was planted in the experimental field of Anyang Institute of Technology, Henan, China. Genomic DNA was isolated from fresh young leaves using the modified CTAB method (
The eight single primers of the plant telomere repeat were selected from NCBI database (https://www.ncbi.nlm.nih.gov) according to the previous studies for single primer PCR (
The selected single primers of the plant telomere repeat sequence (Table
Name | Taxonomic name | Reference | Sequence |
TR1 | Oryza sativa (Linnaeus, 1753) |
|
(TTTAGGG)3 |
TR2 | Zea mays (Linnaeus, 1753) |
|
(TTTAGGG)4 |
TR3 | Othocallis siberica (Linnaeus, 1753) |
|
(TTTAGGG)5 |
TR4 | Ginkgo biloba (Linnaeus, 1771) |
|
(CCCTAAA)3 |
TR5 | Brassica campestris (Linnaeus, 1753) |
|
(CCCTAAA)3CCC |
TR6 | Othocallis siberica |
|
AA (CCCTAAA)3CCC |
TR7 | Zea mays |
|
(CCCTAAA)4 |
TR8 | Othocallis siberica |
|
(CCCTAAA)5 |
DNAMAN software was used for extraction and alignment of cloned sequences. Repetitive sequence analysis was performed using the online program CENSOR (https://www.girinst.org/censor/index.php). BLAST algorithm blastn (https://www.cottongen.org/blast) was used to identify GhTAS from G. hirsutum genome database (Gossypium hirsutum ZJU v2.1, a1) (
The TAS plasmid DNA was extracted using the TIANprep Mini Plasmid Kit according to the instructions. Then, TAS plasmid DNA was labeled with DIG-Nick Translation Mix (Roche). The 45S rDNA probes derived from Arabidopsis thaliana (
According to the melting temperature (TM) value distribution of the eight candidate single primers (55 °C–62 °C), two annealing temperatures were selected, namely 55 °C and 60 °C. The results of PCR amplification showed that an obvious band of roughly 500 bp was amplified using the single primer TR6 under the two annealing temperatures, especially, the band amplified under annealing temperature of 60 °C showed better specificity and higher brightness (Fig.
A single band with a size of roughly 500 bp was amplified using the single primer TR6 under the annealing temperature of 60 °C with Phanta Max Super-Fidelity DNA Polymerase (Fig.
Sequence component analysis
Sequence analysis of the eight positive clones revealed that all clones had the same forward and inverted telomere primer sequence at the two ends. Sequence alignment showed that there were two different internal sequences in eight sequences. So, the two different cloned DNA sequences with different size of 488 bp and 538 bp were selected and named as GhTAS1 and GhTAS2 (Fig.
The two sequences were rich in A/T bases, that is, 57.6% and 63.9% respectively. Repeat masking analysis indicated that the tandem repeats content were 31.35% in GhTAS1 and 42.38% in GhTAS2, which mainly consisted of satellite DNA and transposable elements. The above results are consistent with the typical characteristics of telomere-associated sequences (
Sequence alignment results of DNAMAN shown that GhTAS1 and GhTAS2 had low homology, with the sequence similarity of 38.90%, which may be due to their different chromosomal sources.
After comprehensive comparison of the obtained TASs of G. hirsutum and the TASs of Arabidopsis thaliana, Glycine max (Linnaeus, 1753), Oryza sativa (Linnaeus, 1753), Zea mays (Linnaeus, 1753), Larix gmelinii (Ruprecht, 1920) listed on NCBI, it was found that their similarity was low, ranging from 25% to 50% (Table
Similarity of telomere-associated sequences between G. hirsutum and other plants.
Species | NCBI accession No. | TASs of G. hirsutum | |
---|---|---|---|
GhTAS1 | GhTAS2 | ||
Arabidopsis thaliana | AC074298.1 | 39.60% | 36.71% |
AM177016.1 | 14.08% | 12.94% | |
AM177019.1 | 13.52% | 13.93% | |
AM177060.1 | 10.88% | 10.15% | |
Glycine max | AF041999.1 | 20.24% | 16.79% |
Oryza sativa | U12056.1 | 28.71% | 25.27% |
Zea mays | S46927.1 | 48.70% | 41.93% |
Larix gmelinii | EF474441.1 | 31.40% | 30.57% |
GhTAS1 and GhTAS2 were found using blastn with the latest G. hirsutum genome sequence (Gossypium hirsutum ZJU v2.1, a1) in Cottongen (https://www.cottongen.org/). Results showed that GhTAS1 was mapped onto five chromosomes and one scaffold of G. hirsutum, and GhTAS2 was mapped onto all 26 chromosomes and 14 scaffolds of G. hirsutum with different E-value. The partial blast results with lower E-value were listed in Table
To examine the chromosome physical location of GhTAS1 and GhTAS2, we carried out FISH on G. hirsutum metaphase chromosomes using a digoxin-labeled GhTAS probe and a biotion-labeled 45S rDNA probe. The results showed that GhTAS1 had signals at the end of nearly half of the chromosomes of G. hirsutum, and most of them were distributed at the single end. The signal intensity on different chromosomes was also different (Fig.
Sequence name | Genomic location | Query matches | Hit matches | Identity (%) |
GhTAS1 | D06 | 1–488 | 65407147–65406660 | 98.98% |
D03 | 184–281 | 26586–26683 | 78.57% | |
A01 | 171–237 | 118151185–118151119 | 82.09% | |
D02 | 138–219 | 69751633–69751551 | 79.52% | |
Scaffold515-obj | 184–281 | 9914–9817 | 75.51% | |
D01 | 184–281 | 64676574–64676477 | 75.51% | |
GhTAS2 | A06 | 14–537 | 126445179–126444656 | 99.62% |
D11 | 14–537 | 71336660–71336138 | 98.47% | |
A13 | 14–535 | 47688–48202 | 94.08% | |
A02 | 25–512 | 40084–39589 | 88.15% | |
D02 | 25–512 | 69751559–69752073 | 86.68% | |
A12 | 25–512 | 30186–29672 | 86.15% | |
Scaffold546-obj | 46–455 | 8556–8146 | 89.07% | |
Scaffold515-obj | 25–271 | 31264–31514 | 89.33% | |
A09 | 25–315 | 83200103–83200398 | 86.96% | |
A11 | 25–271 | 121355653–121355904 | 88.54% | |
scaffold407_obj_A03 | 59–271 | 36503–36719 | 92.24% | |
A07 | 25–271 | 96580716–96580969 | 88.24% | |
D10 | 278–455 | 66830830–66831007 | 93.26 | |
A10 | 25–271 | 115081227–115081476 | 87.75% | |
A05 | 285–455 | 39831434–39831267 | 93.60% | |
A01 | 278–455 | 118169784–118169962 | 91.06% | |
D08 | 278–512 | 69075939–69076196 | 84.11% | |
D03 | 278–455 | 23313–23139 | 91.01% | |
D09 | 278–442 | 51987281–51987445 | 91.52% |
In this study, the telomere primer AA(CCCTAAA)3CCC was used as a single primer to obtain the TAS sequences of G. hirsutum by single primer PCR. The homology of the two TASs is relatively low and with the similarity of 38.90%. Chromosome FISH localization of the two sequences also showed obvious differences in chromosome distribution and signal strength (Fig.
Since telomere and adjacent subtelomere regions could not be covered by PAC and BAC clones, sequencing efforts were unable to reveal the structure of these regions. In addition, the discovery of interstitial telomeric sequences (ITSs) makes telomeric minisatellites have double-faced character, which causes more problems in producing genomic assemblies (
We cloned two telomere-associated sequences from G. hirsutum acc. TM-1 using the single-primer PCR, and made analysis about the sequence characteristics of two TASs. The two TASs sequences were enriched in A/T, and were flanked by the forward and inverted primer sequences at each end respectively. By comparative analysis based on the results of blastn and FISH localization of the two TASs, we found that TASs at the end of some chromosomes were not assembled in the genome sequence map. Our study not only contributes to the analysis of telomere structure of cotton, but also provides intuitive evidence for the evaluation of the integrity of the assembled G. hirsutum genome.
The research was sponsored by National Key R&D Program of China (No. 2018YFD0100300), Innovation Scientists and Technicians Troop Construction Projects of Henan Province (20IRTSTHN021), Science and Technology Development Program of Anyang City (2018-66-133), Science and Technology Development Project of Henan Province (182102410041).