Data Paper |
Corresponding author: Fernando Roa ( ferroao@gmail.com ) Academic editor: Puneet Puneet
© 2017 Fernando Roa, Mariana Pires de Campos Telles.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Roa F, Telles MPC (2017) The Cerrado (Brazil) plant cytogenetics database. Comparative Cytogenetics 11(2): 285-297. https://doi.org/10.3897/CompCytogen.v11i2.11395
|
Cerrado is a biodiversity hotspot that has lost ca. 50% of its original vegetation cover and hosts ca. 11,000 species belonging to 1,423 genera of phanerogams. For a fraction of those species some cytogenetic characteristics like chromosome numbers and C-value were available in databases, while other valuable information such as karyotype formula and banding patterns are missing. In order to integrate and share all cytogenetic information published for Cerrado species, including frequency of cytogenetic attributes and scientometrics aspects, Cerrado plant species were searched in bibliographic sources, including the 50 richest genera (with more than 45 taxa) and 273 genera with only one species in Cerrado. Determination of frequencies and the database website (http://cyto.shinyapps.io/cerrado) were developed in R. Studies were pooled by employed technique and decade, showing a rise in non-conventional cytogenetics since 2000. However, C-value estimation, heterochromatin staining and molecular cytogenetics are still not common for any family. For the richest and best sampled families, the following modal 2n counts were observed: Oxalidaceae 2n = 12, Lythraceae 2n = 30, Sapindaceae 2n = 24, Solanaceae 2n = 24, Cyperaceae 2n = 10, Poaceae 2n = 20, Asteraceae 2n = 18 and Fabaceae 2n = 26. Chromosome number information is available for only 16.1% of species, while there are genome size data for only 1.25%, being lower than the global percentages. In general, genome sizes were small, ranging from 2C = ca. 1.5 to ca. 3.5 pg. Intra-specific 2n number variation and higher 2n counts were mainly related to polyploidy, which relates to the prevalence of even haploid numbers above the mode of 2n in most major plant clades. Several orphan genera with almost no cytogenetic studies for Cerrado were identified. This effort represents a complete diagnosis for cytogenetic attributes of plants of Cerrado.
Cerrado , cytogenetics, database, scientometrics, shiny-R
Cerrado, a phytogeographic domain from Brazil, is the third largest biodiversity hotspot in the world considering species endemism and degree of threat (
Since the chromosome number is the most basic karyotype feature and it can be observed by conventional staining of meristem cells or pollen mother cells, most cytogenetic databases deal only with this attribute (
The aim of this study was to assess the current cytogenetic knowledge of Cerrado plants, aiming to provide a consistent database that didn’t exclude any attribute and diagnose recent progress in the field.
Plant names in the Cerrado plant list (
Name | URL |
---|---|
Chromosome Counts Database* | ccdb.tau.ac.il |
Plant DNA C-values Database* | data.kew.org/cvalues |
International Organization of Plant Biosystematists | www.iopb.org |
Scholar Google | scholar.google.com |
Scopus | www.scopus.com |
Biodiversity Heritage Library | biodiversitylibrary.org |
JSTOR | www.jstor.org |
Real Jardín Botánico CSIC | bibdigital.rjb.csic.es/spa/index.php |
Naturalis Biodiversity Center | www.repository.naturalis.nl |
Botanicus Digital Library | www.botanicus.org |
Smithsonian Contributions to Botany | repository.si.edu/handle/10088/6943 |
Crossref | www.crossref.org |
All statistics and the online database were performed in R (
Even n numbers are more common than odd ones because of the prevalence of even 2n that eventually undergo duplication (
A subset of 38.9% (4,590 taxa) of the Cerrado Plant list was searched, including the 50 richest genera and 273 genera with only one species. Information of 1,431 accessions, from 366 available primary sources, were included in the database. 702 (16.8%) species of our searched sample (4182) had any cytogenetic data and about 500 of them had at least one accession collected in Brazil. Complete lack of information happens for 70% of the single-species genera addressed (Suppl. material
For the richest and best sampled families, the following modal 2n counts were observed: Oxalidaceae 2n = 12, Lythraceae 2n = 30, Sapindaceae 2n = 24, Solanaceae 2n = 24, Cyperaceae 2n = 10, Poaceae 2n = 20, Asteraceae 2n = 18 and Fabaceae 2n = 26 (Suppl. material
Frequencies of use of karyological techniques show that the first cytogenetic analysis were mainly based on meiosis from 1928 on, and subsequently by 1990 the analysis of mitosis (sporophyte) gained more prevalence. Measurements of karyotype formula (morphology) achieved significance after the 1990’s. A trend to include images in publications, only with a fall during the 2000’s decade was observed. Sophisticated techniques that show a substantial rise from the 2000’s decade are the estimation of the C-value, CMA banding (including C-CMA banding) and FISH. However, they have been applied to a limited number of species, ranging from 0.5 to 2.3% of the 1,431 entries in the database (Fig.
In order to track affiliation of authors over time, they were clustered by decade in a time-line. For accessions collected in Brazil [774 (54%)] the country of affiliation of the author showed an increase in the prevalence of Brazilian based research after the 1990’s. Other significant contributions for Brazilian samples have been made by Argentinian based authors (Fig.
699 (16.1%) species of the searched list have known chromosome numbers with a general median of 2n = 28. Fig.
Dot-plot of observed and calculated 2n numbers for Cerrado plant species considering APG major clades. The three most common numbers (frequency ties separated by comma) and sample size in parentheses are indicated. Dot size varies depending on the maximum frequency of each group.
Clade or group | n even/odd ratio | |
---|---|---|
Greater | Lesser | |
than n mode | ||
Monocots - Commelinids | 3.47 | 0.40 |
Other monocots | 4.00 | 0.12 |
Core eudicots-Superasterids-Asterids-Lamiids | 6.50 | 1.00 |
Core eudicots-Superasterids-Asterids-Campanulids | 1.50 | 1.57 |
Core eudicots-Superrosids-Malvids | 3.22 | 0.50 |
Core eudicots- Superrosids-Fabids | 3.40 | 0.56 |
Only 88 C-values were found for the taxa of Cerrado, most of them ranging from 2C = ca. 1.5 pg to ca. 3.5 pg with a median value of 2.38 pg (Fig.
Violin-plots for genome sizes in pg for Cerrado angiosperms. Y-axis contains clades, three highest peaks of the Gaussian kernel density and sample size. Bandwidth of the density estimator customizable in http://cyto.shinyapps.io/cerrado
There is strong interest for genomics and genetics of Cerrado species (
Discontinuities in chromosome numbers seen as multimodal distributions indicate that higher numbers are polyploids generated by genome duplication. A polyploid origin for most high n numbers is also evidenced by the high (> 1) even/odd ratios of n (
The Database of Cytogenetics of Cerrado plants and its website is presented, making an important step in facilitating access to most known cytogenetic attributes (http://cyto.shinyapps.io/cerrado/). The amount of chromosome number data for Cerrado is lower than the world total. The chromosome number, is still lacking in several rich genera and families, and therefore, they can be considered “orphan”. This complete lack of information also happens for 70% of the single-species genera addressed. Analysis of chromosome numbers at several taxonomical levels revealed a straightforward relationship between polyploidy and high chromosome numbers. Regarding other techniques, like heterochromatin staining, molecular cytogenetics and C-value estimation, they have been applied to a very small percentage of species, however, those studies have been steadily increasing since 2000. Though cytogenetic data are the basis for some evolutionary and ecological studies, there is a lack of those kind of interdisciplinary studies for Cerrado.
FR wishes to thank CAPES for their financial support, through a PNPD fellowship. MPCT is supported by productivity grants CNPq/FAPEG/Pró-Centro-Oeste (563839/2010-4 and 201110267000125), Pronex – FAPEG/CNPq (Funding Announcement No. 07/2012) and CNPq (447754/2014-9). This publication was funded by the PPGS CAPES/FAPEG F. A. No. 08/2014. Current research is developed in the context of the National Institutes for Science and Technology (INCT) in Ecology, Evolution and Biodiversity Conservation, supported by MCTIC/CNpq (proc. 465610/2014-5) and FAPEG.
Summary statistics for genera
Data type: cytogenetic attributes
Explanation note: more details in http://cyto.shinyapps.io/cerrado
Summary statistics for families
Data type: cytogenetic attributes
Explanation note: more details in http://cyto.shinyapps.io/cerrado
Summary statistics for major clades
Data type: cytogenetic attributes
Explanation note: more details in http://cyto.shinyapps.io/cerrado
Authors with more publications of cytogenetic data of Cerrado plants per decade
Data type: affiliation and taxa – categorical data table
Explanation note: more details in http://cyto.shinyapps.io/cerrado
Copyright notice: This dataset is made available under the Open Database License (http://opendatacommons.org/licenses/odbl/1.0/). The Open Database License (ODbL) is a license agreement intended to allow users to freely share, modify, and use this Dataset while maintaining this same freedom for others, provided that the original source and author(s) are credited.
Figure S1. Total chromosome length for Cerrado angiosperms
Data type: Histogram
Explanation note: See text for details.
Figure S2. Total chromosome area for Cerrado angiosperms
Data type: Histogram
Explanation note: See text for details.