Note to users. If you're seeing this message, it means that your browser cannot find this page's style/presentation instructions -- or possibly that you are using a browser that does not support current Web standards. Find out more about why this message is appearing, and what you can do to make your experience of our site the best it can be.


Science 18 August 2006:
Vol. 313. no. 5789, p. 918
DOI: 10.1126/science.1121225

Technical Comments

Comment on "Computational Improvements Reveal Great Bacterial Diversity and High Metal Toxicity in Soil"

Igor Volkov,1,2* Jayanth R. Banavar,1 Amos Maritan3

Based on analysis of the reassociation kinetics of bacterial DNA in soil, Gans et al. (Reports, 26 August 2005, p. 1387) claimed that millions of microbe species existed in 10 grams of pristine soil and that 99.9% of the diversity was lost as a result of toxic metals. We show that the data do not support these startling conclusions unambiguously.

1 Department of Physics, 104 Davey Lab, Pennsylvania State University, University Park, PA 16802, USA.
2 Department of Biology, 208 Mueller Lab, Pennsylvania State University, University Park, PA 16802, USA.
3 Dipartimento di Fisica Galileo Galilei, Università di Padova and Istituto Nazionale per la Fisica Della Materia, via Marzolo 8, 35131 Padova, Italy.

* To whom correspondence should be addressed. E-mail: volkov{at}psu.edu

Gans et al. (1) reanalyzed the reassociation kinetics for bacterial DNA from pristine and metal-polluted soils. They claimed that a power law best described the abundance distributions and that more than one million species existed in pristine soil—an increase of two orders of magnitude compared with earlier estimates. Our analysis shows that the data analyzed in (1) constrain neither the species-abundance relationship nor the effective parameters, including the total diversity.

Consider the rate equation

Formula(1)
where [Ci] is the concentration of single-stranded DNA of the i-th species at time t, [C0i] is the concentration at t = 0, the coefficient [(1 + {gamma})/({gamma})] determines the order of the reaction, and k is a measure of the reaction rate. The empirical retardation factor {gamma} is equal to 1 for a second-order reaction. As a solution, one obtains the basic equation used by Gans et al. (1).

Formula(2)

Following Gans et al., we reconsidered data fits to equation 2 in (1) obtained for the case of a species-abundance distribution P(n). We considered two of the models studied in (1) for the species-abundance relationship, delta and zipf, described by the following equations.

Formula(3)
where S is the total number of species and N is the total population, and

Formula(4)
where N0 ≤ n ≤ N0 + {Delta} and N0, {Delta}, and z are all free parameters. One can readily find analytic forms for the Cot equation in the two cases to be

Formula(5)
and

Formula(6)

Formula
respectively, where 2F1(·) is the hypergeometric function, {alpha} = 1 + [({Delta})/(N0)], and ß = [(k z(1–{alpha}z–1))/(S(1–z)({alpha}z–1))].

Figure 1 shows the results of a least-squares fit of Eq. 5 (the delta model) to the data (2) analyzed in (1). For the noncontaminated sample, one obtains k/S = 0.033, {gamma} = 0.092, and R2adj = 0.9965; for the low-contaminated sample, one obtains k/S = 0.034, {gamma} = 0.12, and R2adj = 0.9925; and for the highly contaminated sample, one obtains k/S = 0.035, {gamma} = 0.16, and R2adj = 0.9963. Note the high quality of the fits, the notably similar values of k/S, and the somewhat low values of {gamma}, the retardation factor. Were one to assume that k = 5.19 (as for Escherichia coli), one would find that S {approx} 150, which is unrealistically low and might merely reflect the number of abundant species. Interestingly, as shown in (1), the fit is much worse if {gamma} is chosen to be 0.45 and not allowed to be a free parameter.


Figure 1 Fig. 1. Cot curves for noncontaminated (red), low-contaminated (blue), and highly contaminated (green) samples, along with the fits to the delta model of species abundance. [View Larger Version of this Image (19K GIF file)]
 

The presence of a large number of species in the sample leads to the retardation factor {gamma} becoming smaller than that observed for a single species (e.g., E. coli). Equation 2 is valid only under the assumption of a special sample in which the species are segregated from each other. However, the actual sample is a mixture of all the species, such that the rate at which two single-stranded DNA molecules of the same species reassociate when in proximity to each other should decrease. This slower reassociation rate may be captured by a decrease of {gamma}, which determines the rate of change of [([C]')/([C])] {propto} [C]1/{gamma}. It would be important to carry out careful studies of the influence of sample heterogeneity on the form of the Cot curve and on the values of the empirical parameters. Furthermore, the value of {gamma} could very well depend on the degree of purity of the soil.

Figure 2 shows the fits of the data using Eq. 6 (the zipf model), with k = 5.19 (the reaction rate for E. coli) and {gamma} = 0.45 (the retardation factor for E. coli), as carried out in (1). Interestingly, the quality of the fits does not degrade significantly when S varies over a wide range.


Figure 2 Fig. 2. Cot curves for (A) noncontaminated (red), (B) low-contaminated (blue), and (C) highly contaminated (green) samples, along with the fits to the zipf model of species abundance. In each case, the data are fitted to several versions of the model with varying numbers of species S as shown in the inset. The blue solid lines represent the best fit in accord with analysis in (1). The other lines are fits in which the number of species is decreased by factors of 8.2 and 82 for the pristine soil and increased by factors of 3.56 and 50 for the two polluted cases, with little degradation in the quality of fit. The most sensitive region of the Cot curve occurs when [C]/[C0] becomes small compared with 1, and the experimental data does not extend to this regime. [View Larger Version of this Image (11K GIF file)]
 

Thus, our reanalysis of the data using the framework developed in (1) suggests that the data, in and of themselves, constrain neither the species-abundance relationship nor the effective parameters, including the total diversity. This problem is exacerbated because there are no error estimates in the experimental data, and one can observe great variation in the model behavior upon changing the empirical retardation factor {gamma}.

Accurately determining microbial diversity and gauging the impact of pollutants on it are extremely vital issues (3) that affect health, agriculture, and geochemical cycles. Unfortunately, the experimental data analyzed in (1) do not allow one to infer either that the diversity of pristine soil is orders of magnitude higher than previously thought or that metal pollutants have a devastating effect on microbial diversity.


References


Received for publication 11 October 2005. Accepted for publication 30 March 2006.





To Advertise     Find Products

ADVERTISEMENT

Featured Jobs

Science. ISSN 0036-8075 (print), 1095-9203 (online)