Fluctuation relations and fitness landscapes of growing cell populations

Genthon, Arthur; Lacoste, David

doi:10.1038/s41598-020-68444-x

Download PDF

Article
Open access
Published: 17 July 2020

Fluctuation relations and fitness landscapes of growing cell populations

Arthur Genthon¹ &
David Lacoste¹

Scientific Reports volume 10, Article number: 11889 (2020) Cite this article

1631 Accesses
11 Citations
2 Altmetric
Metrics details

Subjects

Abstract

We construct a pathwise formulation of a growing population of cells, based on two different samplings of lineages within the population, namely the forward and backward samplings. We show that a general symmetry relation, called fluctuation relation relates these two samplings, independently of the model used to generate divisions and growth in the cell population. These relations lead to estimators of the population growth rate, which can be very efficient as we demonstrate by an analysis of a set of mother machine data. These fluctuation relations lead to general and important inequalities between the mean number of divisions and the doubling time of the population. We also study the fitness landscape, a concept based on the two samplings mentioned above, which quantifies the correlations between a phenotypic trait of interest and the number of divisions. We obtain explicit results when the trait is the age or the size, for age and size-controlled models.

Cell population heterogeneity driven by stochastic partition and growth optimality

Article Open access 28 June 2019

Jorge Fernandez-de-Cossio-Diaz, Roberto Mulet & Alexei Vazquez

The physics of cell-size regulation across timescales

Article 19 August 2019

Clotilde Cadart, Larisa Venkova, … Matthieu Piel

First-passage-time statistics of growing microbial populations carry an imprint of initial conditions

Article Open access 04 December 2023

Eric W. Jones, Joshua Derrick, … David A. Sivak

Introduction

While the growth of cell populations appears deterministic, many processes occurring at the single cell level are stochastic. Among many possibilities, stochasticity at the single cell level can arise from stochasticity in the generation times¹, from stochasticity in the partition at division^2,3, or from the stochasticity of single cell growth rates, which are usually linked to stochastic gene expression⁴. Ideally one would like to be able to disentangle the various sources of stochasticity present in experimental data⁵. This would allow to understand and predict how the various sources of stochasticity affect macroscopic parameters of the cell population, such as the Malthusian population growth rate^6,7. Beyond this specific question, research in this field attempts to elucidate the fundamental physical constraints which control growth and divisions in cell populations.

With the advances in single cell experiments, where the growth and divisions of thousand of individual cells can be tracked, robust statistics can be acquired. New theoretical methods are needed to exploit this kind of data and to relate experiments carried out at the population level with experiments carried out at the single cell level. For instance, one would like to relate single-cell time-lapse video microscopy experiments of growing cell populations⁸, which provide information on all the lineages in the branched tree, with experiments carried out with the mother machine configuration, which provide information on single lineages^9,10.

Let us now review quickly how the issue was addressed theoretically. In 2015, a pathwise thermodynamic framework was built for population dynamics using large deviation theory. One important result was a variational principle for the population growth rate¹¹, which was formulated in terms of two key path distributions, namely the chronological and the retrospective probability distributions. Then, in order to explain their experimental observation that populations of Escherichia coli double faster than the mean doubling time of their constituent single cells, Hashimoto et al. extended the classical work of Powell⁶ for age models without mother-daugher correlations¹². Nozoe et al.¹³ then showed that the difference between the forward (chronological) and backward (retrospective) distributions can be used to define a quantity called phenotypic fitness landscape, which informs whether a specific phenotypic trait affects the population growth rate. In that work, they also derived the key relation between the two distributions, already known in the mathematical literature of branching processes¹⁴, but they did not connect this result with the field of fluctuation relations. Various theoretical works followed which addressed other aspects of the role of the stochasticity at the single cell level^3,15,16. As far as we can tell, the connection between the results of Nozoe and the field of fluctuation relations was only made explicit in our first work on this topic¹⁷. In that work, we also derived inequalities for mean generation times, already obtained in¹² for age models, but importantly we proved using the fluctuation relation that they are valid beyond age models, in particular for a broad class of size models.

In the present work, we further investigate the connection between the statistics at the single lineage and population levels using fluctuation relations. These fluctuation relations only depend on the structure of the branched tree but not on the class of dynamical variables (protein concentration, cell size or cell age.) defined on it. These relations imply that the above inequalities for the mean number of divisions or mean generation times hold regardless of the specific dynamics.

We then provide an interpretation of the fluctuation relations within Stochastic Thermodynamics¹⁸ and we explore some consequences. One application concerns the inference of the population growth rate using single lineage data¹⁹. We then introduce some specific dynamical models, which we simulate to confirm the fluctuation relations and their consequences for mean generation times. Then, for these specific models and for key phenotypic variables such as the size and the age, we also study the fitness landscape¹³.

Theoretical framework

The backward and forward processes

Let us consider a branched tree, starting with $N_0$ cells at time $t=0$ and ending with N(t) cells at time t as shown on Fig. 1. We assume that all lineages survive up to time t, and therefore the final number N(t) of cells corresponds to the number of lineages in the tree.

The most natural way to sample the lineages is to put uniform weights on all of them. This sampling is called backward, (or retrospective) because at the end of the experiment one randomly chooses one lineage among the N(t) with a uniform probability and then one traces the history of the lineage backward in time from time t to 0, until reaching the ancestor population. The backward weight associated with a lineage l is defined as

$$\begin{aligned} \omega _{\text {back}}(l)=N(t)^{-1} \,. \end{aligned}$$

(1)

In a tree, some lineages divide more often than others, which results in an over-representation of lineages that have divided more often than the average. Therefore by choosing a lineage with uniform distribution, we are more likely to choose a lineage with more divisions than the average number of divisions in the tree.

The other way of sampling a tree is the forward (or chronological) one and consists in putting the weight

$$\begin{aligned} \omega _\text {for}(l)= N_0^{-1} m^{-K(l)} \,, \end{aligned}$$

(2)

on a lineage l with K(l) divisions, where m is the number of offspring at division. This choice of weights is called forward because one starts at time 0 by uniformly choosing one cell among the $N_0$ initial cells, and one goes forward in time up to time t, by choosing one of the m offspring with equal weight 1/m at each division. The backward and forward weights are properly normalized probabilities, defined on the N(t) lineages in the tree at time t: $\sum _{i=1}^{N(t)} \omega _{\text {back}}(l_i) = \sum _{i=1}^{N(t)} \omega _{\text {for}}(l_i) =1$.

Single lineage experiments are precisely described by a forward process since experimentally, at each division, only one of the two daughter cells is conserved while the other is eliminated (for instance flushed away in a microfluidic channel^{9, 10}). In these experiments, a tree is generated but at each division only one of the two lineages is conserved, with probability 1/2, while the rest of the tree is eliminated. This means that single lineage observables can be measured without single lineage experiments, provided population experiments are analyzed with the correct weights on lineages.

Link with the population growth rate

Since the backward weight put on a lineage depends on the number of cells at time t, it takes into account the reproductive performance of the colony but it is unaffected by the reproductive performance of the lineage considered. On the contrary, the forward weight put on a specific lineage depends on the number of divisions of that lineage but is unaffected by the reproductive performance of other lineages in the tree. Therefore, the difference between the values of the two weights for a particular lineage informs on the difference between the reproductive performance of the lineage with respect to the colony.

We now introduce the population growth rate:

$$\begin{aligned} \Lambda _t=\frac{1}{t} \ln \frac{N(t)}{N_0} \,, \end{aligned}$$

(3)

which is linked to forward weights by the relation

$$\begin{aligned} \frac{N(t)}{N_0}=\sum _{i=1}^{N(t)} m^{K_i} \ \omega _\text {for}(l_i) = \langle m^K \rangle _\text {for} \,, \end{aligned}$$

(4)

where $\langle \cdot \rangle _\text {for}$ is the average over the lineages weighted by $\omega _\text {for}$, and $K_i=K(l_i)$. Combining the two equations above, we obtain¹⁹:

$$\begin{aligned} \Lambda _t=\frac{1}{t} \ln \langle m^K \rangle _\text {for} \,, \end{aligned}$$

(5)

which allows an experimental estimation of the population growth rate from the knowledge of the forward statistics only.

Equation (4) can also be re-written to express the bias between the forward and backward weights of the same lineage

$$\begin{aligned} \frac{\omega _{\text {back}}(l)}{\omega _\text {for}(l)}=\frac{m^{K(l)}}{\langle m^K \rangle _\text {for}} \,, \end{aligned}$$

(6)

which is the reproductive performance of the lineage divided by its average in the colony with respect to $\omega _\text {for}$.

A similar relation is derived using the relation

$$\begin{aligned} \frac{N_0}{N(t)}=\sum _{i=1}^{N(t)} m^{-K_i} \ \omega _\text {back}(l_i) = \langle m^{-K} \rangle _\text {back} \,. \end{aligned}$$

(7)

Combining Eqs. (5) and (7) we obtain:

$$\begin{aligned} \Lambda _t= - \frac{1}{t} \ln \langle m^{-K} \rangle _\text {back} \,. \end{aligned}$$

(8)

A similar equation as Eq. (6) can be obtained in terms of the backward sampling and reads:

$$\begin{aligned} \frac{\omega _{\text {back}}(l)}{\omega _\text {for}(l)}=\frac{\langle m^{-K} \rangle _\text {back}}{m^{-K(l)}} \,. \end{aligned}$$

(9)

Combining Eqs. (1) to (3), we obtain the fluctuation relation^13,17:

$$\begin{aligned} \omega _{\text {back}}(l)= \omega _\text {for}(l) \ e^{K(l) \ln m - t \Lambda _t} \,. \end{aligned}$$

(10)

If we now introduce the probability distribution of the number of divisions for the forward sampling $p_\text {for}(K)=\sum _l \delta (K-K(l)) \omega _\text {for}(l)$ and similarly for the backward sampling, we can also recast the above relation as a fluctuation relation for the distribution of the number of divisions:

$$\begin{aligned} p_{\text {back}} (K,t)=p_{\text {for}} (K,t) \ e^{K \ln m - t \Lambda _t} \,. \end{aligned}$$

(11)

Let us now introduce the Kullback–Leibler divergence between two probability distributions p and q, which is the non-negative number:

$$\begin{aligned} {{\mathscr {D}}}_{\text {KL}}(p||q)=\int {\mathrm {d}}x \, p(x) \ln \frac{p(x)}{q(x)} \ge 0 \,. \end{aligned}$$

(12)

Using Eq. (10), we obtain

$$\begin{aligned} {{\mathscr {D}}}_{\text {KL}}(\omega _{\text {back}}|| \omega _\text {for}) = \langle K \rangle _{\text {back}} \ln m - t \Lambda _t \ge 0 \,. \end{aligned}$$

(13)

A similar inequality follows by considering ${{\mathscr {D}}}_{\text {KL}}(\omega _{\text {for}}|| \omega _\text {back})$. Finally we obtain

$$\begin{aligned} \frac{t}{\langle K \rangle _{\text {back}}} \le \frac{\ln m}{\Lambda _t} \le \frac{t}{\langle K \rangle _\text {for}} \,. \end{aligned}$$

(14)

In the long time limit, $\lim \nolimits _{t \rightarrow + \infty } t/\langle K \rangle _{\text {back}} = \langle \tau \rangle _{\text {back}}$, where $\tau$ is the inter-division time, or generation time, defined as the time between two consecutive divisions on a lineage. The same argument goes for the forward average. In the case of cell division where each cell only gives birth to two daughter cells ($m=2$), the center term in the inequality tends to the population doubling time $T_d$. Therefore, this inequality reads in the long time limit:

$$\begin{aligned} \langle \tau \rangle _{\text {back}} \le T_d \le \langle \tau \rangle _\text {for} \,. \end{aligned}$$

(15)

Let us now mention a minor but subtle point related to this long time limit. For a lineage with K divisions up to time t, we can write $t=a + \sum _{i=1}^{K} \tau _i$, where a is the age of the cell at time t and where $\tau _i$ is the generation time associated with the ith division. Then $t/ K= \tau _m + a/K$, where $\tau _m$ is the mean generation time along the lineage. For finite times, all we can deduce is $t/ K \ge \tau _m$. Therefore the left inequality of Eq. (15) always holds

$$\begin{aligned} \langle \tau \rangle _{\text {back}} \le \frac{t}{\langle K \rangle _{\text {back}}} \le \frac{\ln m}{\Lambda _t} \,, \end{aligned}$$

(16)

while the right inequality does not necessarily hold at finite time.

Inspired by work by Powell⁶, the inequalities of Eq. (15) have been theoretically derived in¹² for age models. In our previous work¹⁷, we have replotted the experimental data of¹² which confirm theses inequalities and we have shown theoretically that the same inequalities should also hold for size models. In fact, as the present derivation shows, the relation equation (14) is very general and only depends on the branching structure of the tree, while the relation equation (15) requires in addition the existence of a steady state. These inequalities and Eq. (11) express fundamental constraints between division and growth, which should hold for any model.

Stochastic thermodynamic interpretation

The results derived above have a form similar to that found in Stochastic Thermodynamics¹⁸. According to this framework, Eq. (5) is an integral fluctuation relation (similar to Jarzynski relation) while Eq. (11) is a detailed fluctuation relation (similar to Crooks fluctuation relation). Furthermore, the inequalities equation (14) represent a constraint equivalent to the second law of thermodynamics, which classically follows from the Jarzynski or Crooks fluctuation relations. It is known that these inequalities take a slightly different form when expressed at finite time or at steady state, which is indeed the case here when comparing Eq. (14) with Eq. (15). A difference between work fluctuation relations like Crooks or Jarzynski and equations (5) and (11), is that Crooks or Jarzynski describe non-autonomous systems which are driven out of equilibrium by the application of a time-dependent protocol, whereas the relations for cell growth derived here concern autonomous systems, in the absence of any external protocol.

One of the main applications of Jarzynski or Crooks fluctuation relations concerns the thermodynamic inference of free energies from non-equilibrium fluctuations. Similarly, Eq. (5) or Eq. (11) can be used as estimators of the population growth rate. The specific advantage of Eq. (5) with respect to Eq. (11) is that it only requires single lineage statistics, which can be obtained from mother machine experiments. Let us now show how this can be done in practice. We use the data from²⁰, where the growth of many independent lineages of E. coli have been recorded over 70 generations in a mother machine at three different temperatures (25 °C, 27 °C, and 37 °C), precisely 65 lineages for 25 °C, 54 for 27 °C, and 160 for 37 °C. For each temperature condition, we study the convergence of the estimator of the population growth rate based on Eq. (5), which we call $\Lambda _{\mathrm{lin}}$ as a function of the length t of the lineages for a fixed number of independent lineages L, and as a function of the number of independent lineages for a fixed observation time.

Firstly, for each temperature, we take into account all the lineages available and truncate them at an arbitrary time t smaller than the length of the shortest lineage of the set. On these portions of lineages of length t, we compute $\Lambda _{\mathrm{lin}}$ versus the time t as shown in Fig. 2a. We see that the estimator $\Lambda _{\mathrm{lin}}$ starts from zero, increases and eventually converges rather quickly towards a limiting value. The limit we found agree with the independent analysis carried out in¹⁹, with only one caveat, these authors reported that their estimator started at high values and then decreased towards the limit, while in our case, the estimator starts at zero and later increases towards the limit. In our case, the estimator needs to be zero at short times, before the first divisions occur.

Secondly, we truncate all the lineages at a fixed time equal to the length of the shortest lineage of the set, and compute $\Lambda _{\mathrm{lin}}$ versus the number L of lineages considered for the estimation, which have been randomly selected from the ensemble of available lineages. As shown in Fig. 2b for the case at $37^{\,\circ } \hbox {C}$ (curves for the other temperatures look exactly the same), the convergence is also excellent in that case. Although the value of the population growth rate which is obtained in this way can not be measured independently from the evolution of the population in the mother machine setup, this convergence is indicative of the success of the method. The figure also confirms that the value of the population growth rate deduced from the estimator $\Lambda _{\mathrm{lin}}$ is larger than $\ln (2)/\langle \tau \rangle _{\text {for}}$, as predicted by the right inequality of Eq. (15).

Here, the estimator is found to provide an excellent estimation, but this is not always so. For instance, for the inference of free energies from non-equilibrium work measurements, the exponential average of the estimator is often dominated by rare values, which are not accessible or not well sampled²¹. To understand why this problem does not arise here, we show in inset of Fig. 2b, the distribution P(K) of the number of divisions together with the same distribution weighted by the factor $2^K$ and normalized. The peak of that modified distribution informs on the dominant values in the estimator²¹. Here, we observe that both distributions have a narrow support and are close to each other. The weighted distribution is peaked at $K=67$ while P(K) is peaked at $K=66$, therefore typical and dominating values are very close, which explains why the estimator is good.

Let us now further develop the Stochastic Thermodynamic interpretation of our results by analyzing the implications of the previous fluctuation relations when dynamical variables are introduced on the branched tree of the population. Let us introduce M variables labeled $(y_1,y_2, \ldots ,y_M)$ to describe a dynamical state of the system, then a path is fully determined by the values of these variables at division, and the times of each division. We call ${\mathbf {y}}(t)=(y_1(t),y_2(t), \ldots ,y_M(t))$ a vector state at time t and $\{{\mathbf {y}}\}=\{{\mathbf {y}}(t_j)\}_{j=1}^{K}$ a path with K divisions. For cell growth models, the variables $y_i$ can typically be the size and age of the cell, or the concentration of a key protein.

The probability ${{\mathscr {P}}}$ of path $\{{\mathbf {y}}\}$ is defined as the sum over all lineages of the weights of the lineages that follow the path $\{{\mathbf {y}}\}$:

$$\begin{aligned} {{\mathscr {P}}}(\{{\mathbf {y}}\},K,t)=\sum _{i=1}^{N(t)} \omega (l_i) \, \delta (K-K_i) \delta (\{{\mathbf {y}}\} - \{{\mathbf {y}}\}_i) \,, \end{aligned}$$

(17)

where $\{{\mathbf {y}}\}_i$ is the path followed by lineage $l_i$. Using the normalization of the weights $\omega$ on the lineages, we show that ${{\mathscr {P}}}$ is properly normalized: $\int \mathrm {d}\{{\mathbf {y}}\} \sum _K {{\mathscr {P}}}(\{{\mathbf {y}}\},K,t) = 1$. We then define the number $n(\{{\mathbf {y}}\},K,t)$ of lineages in the tree at time t that follow the path $\{{\mathbf {y}}\}$ with K divisions:

$$\begin{aligned} n(\{{\mathbf {y}}\},K,t)=\sum _{i=1}^{N(t)} \delta (K-K_i) \delta (\{{\mathbf {y}}\} - \{{\mathbf {y}}\}_i) \,. \end{aligned}$$

(18)

This number of lineages is normalized as $\int \mathrm {d}\{{\mathbf {y}}\} \sum _K n(\{{\mathbf {y}}\},K,t) = N(t)$. Then, the path probability can be re-written as

$$\begin{aligned} {{\mathscr {P}}}(\{{\mathbf {y}}\},K,t) = n(\{{\mathbf {y}}\},K,t) \cdot \omega (l) \,. \end{aligned}$$

(19)

Since $n(\{{\mathbf {y}}\},K,t)$ is independent of a particular choice of lineage weighting, we obtain

$$\begin{aligned} \frac{{{\mathscr {P}}}_{\text {back}}(\{{\mathbf {y}}\},K,t)}{{{\mathscr {P}}}_\text {for} (\{{\mathbf {y}}\},K,t)}=\frac{\omega _{\text {back}}(l)}{\omega _\text {for}(l)}= \ e^{K \ln m - t \Lambda _t} \, , \end{aligned}$$

(20)

which generalizes Eq. (11). In our previous work¹⁷, we have derived this relation for size models with individual growth rate fluctuations (i.e. ${\mathbf {y}}=(x,\nu )$) but we were not aware of the weighting method introduced by¹³, and for this reason, we used the term ‘tree’ to denote the backward sampling, and the term ‘lineage’ to denote the forward sampling.

This relation has a familiar form in Stochastic Thermodynamics. The central quantity called entropy production can indeed be expressed similarly as the relative entropy between probability distributions associated with a forward and a backward evolution. In this analogy, $\{{\mathbf {y}}\}$ is analog to the trajectory and $t \Lambda _t - K \ln m$ is analog to the entropy production. Then, the equivalent of a reversible trajectory for which the entropy production is null is a lineage for which the number K of divisions is equal to $t \Lambda _t / \ln m$, that is, a lineage having the same reproductive performance as that of the colony. When all the lineages in a tree have this property, there is no variability of the number of divisions among them. In that case, the forward and backward distributions are identical, and the cost function $t \Lambda _t - K \ln m$ vanishes for all lineages.

Mixed age-size controlled models

Dynamics at the population level

The state of a cell is described by its size x, its age a and its individual growth rate $\nu$, with ${\mathbf {y}}=(x,a,\nu )$. Such mixed size-age model includes the ‘adder’ in which the cell divides after adding a constant volume to its birth volume^22,23,24,25.

The evolution of the number of cells $n({\mathbf {y}},K,t)$ in the state ${\mathbf {y}}$ at time t, that belong to a lineage with K divisions up to time t is governed by the equation

$$\begin{aligned} \left( \partial _t + \partial _a \right) n({\mathbf {y}},K,t) +\partial _x \left[ \nu x n({\mathbf {y}},K,t) \right] + B({\mathbf {y}})n({\mathbf {y}},K,t) =0 \,, \end{aligned}$$

(21)

and the boundary condition

$$\begin{aligned} n(x,a=0,\nu ,K,t)= m \int {\mathrm {d}}{\mathbf {y}}' B({\mathbf {y}}') \Sigma ({\mathbf {y}}|{\mathbf {y}}') n({\mathbf {y}}',K-1,t) \,, \end{aligned}$$

(22)

where $B({\mathbf {y}})$ is the division rate and $\Sigma ({\mathbf {y}}|{\mathbf {y}}')$ is the conditional probability (also called division kernel) for a newborn cell to be in state ${\mathbf {y}}$ knowing its mother divided while in state ${\mathbf {y}}'$, normalized as $\int \Sigma ({\mathbf {y}}|{\mathbf {y}}') \mathrm {d}{\mathbf {y}} =1$, for any ${\mathbf {y}}'$.

Dynamics at the probability level

While $n({\mathbf {y}},K,t)$ in Eq. (21) is independent of the choice of weights put on the lineages, we now turn to a description in terms of the probability $p({\mathbf {y}},K,t)$ for a cell to be in state $({\mathbf {y}},K)$ at time t if chosen randomly among the N(t) cells in the tree at that time. To do so, one has to choose how to weight each cell in the colony, which is equivalent to weight each lineage, since at time t each cell is the ending point of one lineage.

The first possibility is the backward sampling, for which each lineage is weighted uniformly. In this case, we define $p_{\text {back}}$ as

$$\begin{aligned} p_{\text {back}}({\mathbf {y}},K,t)=\frac{n({\mathbf {y}},K,t)}{N(t)} \,. \end{aligned}$$

(23)

Dividing Eq. (21) and the boundary condition equation (22) by N(t) we obtain

$$\begin{aligned} \left( \partial _t + \partial _a \right) p_{\text {back}}({\mathbf {y}},K,t) +\partial _x \left[ \nu x p_{\text {back}}({\mathbf {y}},K,t) \right] + \left[ B({\mathbf {y}}) + \Lambda _p(t) \right] p_{\text {back}}({\mathbf {y}},K,t) =0 \,, \end{aligned}$$

(24)

and

$$\begin{aligned} p_{\text {back}}(x,a=0,\nu ,K,t)= m \int {\mathrm {d}}{\mathbf {y}}' B({\mathbf {y}}') \Sigma ({\mathbf {y}}|{\mathbf {y}}') p_{\text {back}}({\mathbf {y}}',K-1,t) \,, \end{aligned}$$

(25)

where we defined the instantaneous population growth rate as

$$\begin{aligned} \Lambda _p(t)=\frac{{\dot{N}}}{N} \,. \end{aligned}$$

(26)

The instantaneous population growth rate and the population growth rate defined in Eq. (3) are related by:

$$\begin{aligned} \Lambda _t=\frac{1}{t} \int _{0}^{t} \Lambda _p(t') {\mathrm {d}}t' \,. \end{aligned}$$

(27)

In the long-time limit, N grows exponentially with constant rate $\Lambda _p$, and thus $\Lambda _t=\Lambda _p=\Lambda$.

The other possibility is to use the forward statistics, in which case we define the probability $p_{\text {for}}$, as

$$\begin{aligned} p_{\text {for}}({\mathbf {y}},K,t)=\frac{n({\mathbf {y}},K,t)}{m^K} \,. \end{aligned}$$

(28)

Dividing Eq. (21) and the boundary condition equation (22) by $m^K$ we obtain

$$\begin{aligned} \left( \partial _t + \partial _a \right) p_{\text {for}}({\mathbf {y}},K,t) +\partial _x \left[ \nu x p_{\text {for}}({\mathbf {y}},K,t) \right] + B({\mathbf {y}}) p_{\text {for}}({\mathbf {y}},K,t) =0 \,, \end{aligned}$$

(29)

and

$$\begin{aligned} p_{\text {for}}(x,a=0,\nu ,K,t)= \int {\mathrm {d}}{\mathbf {y}}' B({\mathbf {y}}') \Sigma ({\mathbf {y}}|{\mathbf {y}}') p_{\text {for}}({\mathbf {y}}',K-1,t) \,. \end{aligned}$$

(30)

One can notice that the backward statistics is well suited to study the population, while the forward statistics reproduce the behaviour of single lineage experiments. Indeed, by taking Eqs. (24) and (25) for the population/backward probability $p_{\text {back}}$, and choosing $\Lambda _p(t)=0$ and $m=1$ we recover Eqs. (29) and (30). This equation is then a population equation in which we follow only one cell, so that $\Lambda _p(t)=0$ and $m=1$, which we call single lineage experiment.

Illustration of the fluctuation relation

We simulated the time evolution of colonies of cells, obeying Eqs. (21) and (22), for age and size models in order to illustrate the fluctuation relation. Since results are very similar—as expected—for age models, we restrict ourselves to size models. We tested two results: the fluctuation relation for the number of divisions Eq. (11) and one of its consequences: the inequality for the mean number of divisions Eq. (14).

All simulations for size models were conducted with the division rate $B(x,\nu )=\nu x^{\alpha }$, where $\alpha$ is the strength of the control and x is the dimensionless size. Power law were found to be good approximations for empirical division rates B(x)^2,24,26. The factor $\nu$, being the only time scale for size models, gives B(x) its proper dimension. Similarly for age models²⁶, we choose $B(a,\nu )=\nu a^{\alpha }$.

On Fig. 3a, the backward and forward probability distributions of the number of divisions are shown for a size model. The two distributions intersect at the number of divisions $K=t \Lambda _t / \ln 2$. The inset of Fig. 3a shows the logarithm of the ratio $q(K,t)=p_{\text {for}} (K,t)/p_{\text {back}} (K,t)$ of the two distributions, which is as expected a straight line of slope $- \ln 2$ when plotted against the number of divisions. For convenience and for Fig. 3a only, noise in the volume partition at division has been introduced, by choosing for the conditional probability $\Sigma (x | x')$ a uniform distribution between sizes $x=0$ and $x=x'$. This has the effect of broadening the distributions P(K) with respect to the case of deterministic symmetrical volume partition.

Then, we tested the inequality on the mean numbers of divisions by varying the strength of the size-control $\alpha$. Results are shown on Fig. 3b. One one hand, we see that the less control on size, the more discrepancy between the two determinations $\langle K \rangle _{\text {back}}$ and $\langle K \rangle _{\text {for}}$. On the other hand, when increasing the control, the two determinations converge to the population doubling time, where no stochasticity in the number of divisions is left, and every lineage carries the same number of divisions, leading to the equality of the backward and forward statistics.

Phenotypic fitness landscapes

The fitness of a phenotypic trait s is a measure of the reproductive success of individuals carrying it. It is usually defined as the number of offsprings of one individual with a given value of the trait and is quite difficult to evaluate. Nozoe et al. suggested that one way to measure it could be to compare the chronological and retrospective marginal probabilities¹³ and accordingly defined it as:

$$\begin{aligned} h(s)=\Lambda _t + \frac{1}{t} \ln \left[ \frac{P_{\text {back}}(s)}{P_{\text {for}}(s)} \right] \,, \end{aligned}$$

(31)

so that

$$\begin{aligned} P_{\text {back}}(s)=P_{\text {for}}(s) \exp \left[ (h(s)-\Lambda _t)t \right] \,. \end{aligned}$$

(32)

This has again the form of a fluctuation relation similar to Eq. (11), except for the replacement of the factor $K \ln 2 /t$ by the function h(s). This suggests that the fitness landscape h(s) plays a role similar to that of an effective division rate, which depends on the trait s. In line with this interpretation, in the particular case where $s=K$, Eq. (11) leads to ${\tilde{h}}(K)=K \ln 2 /t$, where the fitness landscape for trait K is called the lineage fitness and is written ${\tilde{h}}$. In a branched tree, lineages with a large number of divisions K are exponentially over-represented in the population with the backward sampling as compared to the forward sampling. This means that lineages with large K have a larger fitness than the ones with a small K, which is coherent with ${\tilde{h}}(K)$ being an increasing function of K.

In the following, we rewrite the definition of h(s) in a slightly different way¹⁷ using

$$\begin{aligned} P_{\text {back}}(s) =e^{-t \Lambda _t} P_{\text {for}}(s) \sum _K 2^K R_{\text {for}}(K|s), \end{aligned}$$

(33)

where we have introduced the probability of the number of division events conditioned on trait s at the forward level, $R_{\text {for}}(K|s)$. Lastly, the fitness landscape reads¹⁷

$$\begin{aligned} h(s)=\frac{1}{t}\ln \left[ \sum _K 2^K R_{\text {for}}(K|s) \right] \,. \end{aligned}$$

(34)

An increasing or decreasing fitness landscape means a positive or negative correlation of the trait value with the capacity to divide, whereas a constant fitness landscape means that the trait is not correlated with the number of divisions. Indeed, if we consider a trait s which does not affect the number K of divisions, then $R_{\text {for}}(K|s)=P_{\text {for}}(K)$ and Eq. (34) reads $h(s)=\ln \left[ \sum _K 2^K P_{\text {for}}(K) \right] /t$, which is equal to $\Lambda _t$ according to Eq. (5). In that case, we find that the backward and forward probabilities for that trait s are equal.

In the next sections, we evaluate the relevance of the key variables from our model, namely the size and the age by evaluating their fitness landscapes in size and age models.

Size models

We start with a case where the fitness landscape is fully solvable namely a size model with no individual growth rate fluctuations and with symmetric division. Let us consider a colony starting with one ancestor cell of size $x_0$. Then, the available sizes at time t are discrete and given by $x=x_0 \exp [\nu t] / 2^K$ where K is the number of divisions undergone by the cell. Therefore a particular size x can be reached only if there is an integer K satisfying this relation, and this integer is unique, leading to

$$\begin{aligned} R_{\text {for}}(K|x)=\delta \left( K - \frac{\ln \left[ \frac{x_0 e^{\nu t}}{x} \right] }{\ln 2} \right) \,. \end{aligned}$$

(35)

Using this relation in Eq. (34), one finds

$$\begin{aligned} h(x)= \nu + \frac{1}{t} \ln \left( \frac{x_0}{x} \right) \,. \end{aligned}$$

(36)

The fitness landscape of the size is a decreasing function, which is coherent with the over-representation of cells that divided a lot, since these cells are more likely to be small due to the numerous divisions. Reporting this result in Eq. (33), we obtain a fluctuation relation for the size

$$\begin{aligned} P_{\text {back}}(x) = e^{\left( \nu - \Lambda _t \right) t} \frac{x_0}{x} P_{\text {for}}(x) \,, \end{aligned}$$

(37)

which in the long time limit becomes

$$\begin{aligned} P_{\text {back}}(x) = \frac{x_0}{x} P_{\text {for}}(x) \, , \end{aligned}$$

(38)

where we used the property that in a steady state, the population growth rate and the individual growth rate are equal when there is no individual growth rate variability.

In some setups, experiments do not start with a unique ancestor cell but with $N_0 > 1$ initial cells, with possibly heterogeneous sizes. We describe this heterogeneity by the average initial size $\langle x_0 \rangle$ and the standard deviation $\sigma _{x_0}$. In this case, accessible sizes are still discrete but depend on both the number of divisions and the initial cell that started the lineage, and are expressed as $x_0^i \exp [\nu t] /2^K$, where K takes integer values from 0 to $\infty$ and where $x_0^i \in {{\mathscr {X}}}_0$, with ${{\mathscr {X}}}_0$ the set of initial sizes. Consequently, a final size x can possibly be reached by different couples $(K_i,x_0^i)$.

In order to go further, we now introduce explicitly the initial sizes $x_0^i$ in Eq. (34) as

$$\begin{aligned} h(x)&=\frac{1}{t}\ln \left[ \sum _K \sum _{i} 2^K R_{\text {for}}(K,x_0^i|x) \right] \nonumber \\&= \frac{1}{t}\ln \left[ \sum _K \sum _{i} 2^K R_{\text {for}}(K|x,x_0^i) R_{\text {for}}(x_0^i|x)\right] \,. \end{aligned}$$

(39)

When conditioning on the initial size $x_0^i$, there is only one possible number of divisions K to reach size x, so that $R_{\text {for}}(K|x,x_0^i)$ obeys an equation similar to Eq. (35).

Let us examine two limit cases: (i) small variability in the initial sizes and (ii) large variability in the initial sizes.

Case (i) is characterized by a small number $N_0$ of initial cells and a small coefficient of variation $\sigma _{x_0}/\langle x_0 \rangle$. In this case, it is realistic to say that a final size x can only be reached by one couple $(K^*,x^*)$, because the sets of accessible sizes generated by each initial cell do not overlap. Therefore, $R_{\text {for}}(x_0^i|x)=\delta (x_0^i-x^*)$ and so for any final size x, only one initial size $x^*$ survives in the sum, so that Eq. (39) reads $h(x)=\nu + \ln \left( x^*/(x^* \exp [\nu t]/2^K) \right) /t ={\tilde{h}}(K)$. Thus cells that come from lineages with the same number of divisions K have the same fitness landscape value h(x) for the size, regardless of the size $x^*$ of the initial cell of their lineages. Thus, available values for h(x) are quantified by K and form plateaus, where points representing cells coming from different ancestors but with the same number of divisions accumulate, as shown in Fig. 4a.

Case (ii) is characterized by a large number $N_0$ of initial cells and a large coefficient of variation $\sigma _{x_0}/\langle x_0 \rangle$. Unlike in case (i), the sets of accessible sizes generated by each initial cell have many overlaps, so that a final size x can be reached by many different couples $(K_i,x_0^i)$. We make the hypothesis that a final size x can be reached by any initial cell with uniform probability, so that $R_{\text {for}}(x_0^i|x)=1/N_0$. Therefore, Eq. (39) becomes

$$\begin{aligned} h(x)&=\frac{1}{t}\ln \left[ \frac{1}{N_0} \sum _{i} \frac{x_0^i e^{\nu t}}{x} \right] \nonumber \\&=\nu + \frac{1}{t}\ln \frac{\langle x_0 \rangle }{x} \,. \end{aligned}$$

(40)

This behavior was tested numerically and the result plotted on Fig. 4b confirms that the plateaus observed in case (i) are replaced by a smooth curve depending on the mean initial size.

We observe the same effect, namely the loss of the plateaus, when introducing fluctuations in individual growth rates.

Age models

Constant individual growth rate

We consider the case where the individual growth rate is constant and equal to $\nu$. In steady-state, the forward age distribution reads (see¹⁷ where $p_{\text {for}}(a)$ (resp. $p_{\text {back}}(a)$) were denoted p(a) (resp. P(a))):

$$\begin{aligned} p_{\text {for}}(a)=p_{\text {for}}(0) \, \exp \left[ -\int _{0}^{a} B(a') {\mathrm {d}}a' \right] \,. \end{aligned}$$

(41)

To find the integration constant $p_{\text {for}}(0)$, we use the normalization of probability $p_{\text {for}}$:

$$\begin{aligned} Z=p_{\text {for}}(0)^{-1}=\int _{0}^{\infty } {\mathrm {d}}a \exp \left[ -\int _{0}^{a} B(a') {\mathrm {d}}a' \right] \,. \end{aligned}$$

(42)

Similarly, the steady-state backward distribution of ages reads

$$\begin{aligned} p_{\text {back}}(a)=p_{\text {back}}(0) \, \exp \left[ -\Lambda a -\int _{0}^{a} B(a') {\mathrm {d}}a' \right] \,. \end{aligned}$$

(43)

In this case, the integration constant $p_{\text {back}}(0)$ can be expressed both using the normalization of $p_{\text {back}}(a)$, as done for the forward case, or using $p_{\text {back}}(0)=2 \Lambda$, as shown in¹⁷.

Therefore, the ratio of the age distributions using the backward and forward statistics reads

$$\begin{aligned} \frac{p_{\text {back}}(a)}{p_{\text {for}}(a)}= 2 Z \Lambda e^{-\Lambda a} \,, \end{aligned}$$

(44)

where Z is defined in Eq. (42) and only depends on the division rate B(a). This relation has a similar form as the relation derived by Hashimoto et al.¹² for the distributions of generation times, except for the extra age-independent factor $Z \Lambda$. Finally, the fitness landscape reads

$$\begin{aligned} h(a)=\frac{1}{t} \left[ \Lambda (t-a) + \ln (2 Z \Lambda ) \right] \,. \end{aligned}$$

(45)

For the same reason as for h(x) in size models, h(a) in age models is a decreasing function of a because lineages that divided a lot are over-represented in the population and are therefore more likely to contain young cells at time t.

The initial condition does not play any role in this derivation, therefore, unlike size models, the results obtained are unchanged for any number $N_0$ of initial cells with heterogeneous initial ages.

The above calculation is general because we did not put any constraint on B(a). Let us now go into more details by choosing a power law for the division rate: $B(a)=\nu a^{\alpha }$. In this case, the integral of Eq. (42) is solvable and gives

$$\begin{aligned} Z=\frac{1}{\alpha +1} \left( \frac{\alpha +1}{\nu } \right) ^{\frac{1}{\alpha +1}} \Gamma \left( \frac{1}{\alpha +1} \right) \,, \end{aligned}$$

(46)

in terms of Gamma function $\Gamma (x)$. Results are plotted on Fig. 5a, which shows that theoretical predictions for the backward and forward age distributions are in good agreement with the numerical histograms. The inset plot shows the age fitness landscape, which follows the linear behavior predicted by Eq. (45).

Let us examine the particular case of uncontrolled models, for which the division rate is constant: $B=\nu$. This corresponds to the case $\alpha =0$ in the power law analysis conducted above. Replacing $\alpha$ by 0 in Eq. (46) leads to $Z=1/\nu$; moreover in steady state $\Lambda =\nu$, so that

$$\begin{aligned} p_{\text {back}}(a) = 2 \, p_{\text {for}}(a) \, e^{-\Lambda a} \,. \end{aligned}$$

(47)

Moreover, the distributions themselves are greatly simplified and read

$$\begin{aligned} p_{\text {for}}(a)&=\nu e^{ - \nu a} \,, \end{aligned}$$

(48)

$$\begin{aligned} p_{\text {back}}(a)&=2 \nu e^{ - 2 \nu a} \,, \end{aligned}$$

(49)

which shows that in this special case the age distributions are themselves identical with the generation time distributions.

Fluctuating individual growth rates

Another interesting extension of this calculation concerns the case of fluctuating individual growth rate $\nu$, for which the division rate then becomes a function of a and $\nu$: $B(a,\nu )$. Then, steady state age distributions are¹⁷:

$$\begin{aligned} p_{\text {for}}(a)= & {} \int {\mathrm{d}}\nu \, p_{\text {for}}(0,\nu ) \, \exp \left[ -\int _{0}^{a} B(a',\nu ) {\mathrm{d}}a' \right] \,, \end{aligned}$$

(50)

$$\begin{aligned} p_{\text {back}}(a)= & {} e^{-\Lambda a} \int {\mathrm {d}}\nu \, p_{\text {back}}(0,\nu ) \, \exp \left[ -\int _{0}^{a} B(a',\nu ) {\mathrm {d}}a' \right] \,, \end{aligned}$$

(51)

where $p_{\text {for}}(0,\nu )$ and $p_{\text {back}}(0,\nu )$ are given by the boundary conditions:

$$\begin{aligned} p_{\text {for}}(0,\nu )= & {} \int {\mathrm {d}}a {\mathrm {d}}\nu ' B(a,\nu ') \Sigma \left( \nu | \nu ' \right) p_{\text {for}}(a,\nu ') \,, \end{aligned}$$

(52)

$$\begin{aligned} p_{\text {back}}(0,\nu )= & {} 2 \int {\mathrm {d}}a {\mathrm {d}}\nu ' B(a,\nu ') \Sigma \left( \nu | \nu ' \right) p_{\text {back}}(a,\nu ') \,. \end{aligned}$$

(53)

In the absence of mother-daughter correlations for the individual growth rate, then $\Sigma \left( \nu | \nu ' \right) = {\hat{\Sigma }} \left( \nu \right)$, which implies that $p_{\text {for}}(0,\nu )$ and $p_{\text {back}}(0,\nu )$ have the same dependency in $\nu$:

$$\begin{aligned} p_{\text {for}}(0,\nu )&= {\hat{\Sigma }} \left( \nu \right) \int {\mathrm {d}}a {\mathrm {d}}\nu ' B(a,\nu ') p_{\text {for}}(a,\nu ') \,, \end{aligned}$$

(54)

$$\begin{aligned}&={\hat{\Sigma }}\left( \nu \right) \, {\hat{Z}}^{-1} \end{aligned}$$

(55)

$$\begin{aligned} p_{\text {back}}(0,\nu )&=2 {\hat{\Sigma }} \int {\mathrm {d}}a {\mathrm {d}}\nu ' B(a,\nu ') p_{\text {back}}(a,\nu ') \end{aligned}$$

(56)

$$\begin{aligned}&= 2 {\hat{\Sigma }} \left( \nu \right) \, \Lambda \,. \end{aligned}$$

(57)

Finally, the fluctuation relation for the age reads

$$\begin{aligned} \frac{p_{\text {back}}(a)}{p_{\text {for}}(a)}= 2 {\hat{Z}} \Lambda e^{-\Lambda a} \,, \end{aligned}$$

(58)

which is the equivalent of Eq. (44) for fluctuating growth rates without mother-daughter correlations. Therefore, the age fitness landscape features the same linear dependency in age with a slope $- \Lambda$ as in the case of constant individual growth rate.

In the general case with mother-daughter correlations, this statement is not necessarily true though, because $p_{\text {for}}(0,\nu )$ and $p_{\text {back}}(0,\nu )$ do not have in general the same dependency in $\nu$.

Consequently, looking at the slope of the age fitness landscape informs on the presence of mother-daughter correlations as illustrated numerically in Fig. 5b, where the age fitness landscape for models without mother-daughter correlations aligns with the theoretical prediction of slope $- \Lambda$; while the same function for models with mother-daughter correlations presents a non-linear age dependency.

Discussion

We have studied the relation between two different samplings of lineages in a branched tree: one sampling called backward or retrospective presents a statistical bias with respect to the forward or chronological sampling, an observation which is important to relate experiments carried out at the population level with the ones carried out at the single lineage level. This statistical bias can be rationalized by a set of fluctuation relations, which relate the probability distributions in the two ensembles and which are similar to fluctuation relations known in Stochastic Thermodynamics. This analogy leads to efficient methods to infer the population growth rate from an analysis of lineages, as we demonstrated by the analysis of the mother machine data of Tanouchi et al.²⁰. Important inequalities for the mean number of divisions or the mean generation times follow from these fluctuation relations, which have been verified experimentally¹² for various strains of E Coli. It would be interesting to generalize these studies to other cell types, and in the particular context of this paper, it would be useful to perform experimental studies in bulk or semi open configurations, to test the predictions which involve a comparison between forward and backward samplings.

By measuring the difference between these two samplings for a specific trait, one obtains the fitness landscape, introduced by Nozoe et al.¹³. While these authors have applied that concept to variables which are not reset or redistributed at division in their work, in the present paper, we used the concept of fitness landscape for variables like the size and the age, which precisely undergo a reset at division in size and age models. We derived expressions for these fitness landscapes, which agree with the statistical bias which we expect when measuring size or age distributions in cell populations. In addition, we also find that the precise form of the age fitness function appears to inform whether or not mother-daughter correlations are present in age models.

In the future, it would be valuable to extend our approach to include other important phenotypic state variables besides size or age, such as variables controlling replication dynamics^3,27. We hope that our work contributes to clarifying the connection between single lineage and population statistics and to understanding the fundamental constraints which cell growth and division must obey.

References

Sandler, O. et al. Lineage correlations of single cell division time as a probe of cell-cycle dynamics. Nature 519, 468–471. https://doi.org/10.1038/nature14318 (2015).
Article ADS CAS PubMed Google Scholar
Hosoda, K., Matsuura, T., Suzuki, H. & Yomo, T. Origin of lognormal-like distributions with a common width in a growth and division process. Phys. Rev. E 83, 031118. https://doi.org/10.1103/PhysRevE.83.031118 (2011).
Article ADS CAS Google Scholar
Thomas, P. Making sense of snapshot data: ergodic principle for clonal cell populations. J. R. Soc. Interface 14, 20170467. https://doi.org/10.1098/rsif.2017.0467 (2017).
Article PubMed PubMed Central Google Scholar
Elowitz, M. B., Levine, A. J., Siggia, E. D. & Swain, P. S. Stochastic gene expression in a single cell. Science 297, 1183–1186. https://doi.org/10.1126/science.1070919 (2002).
Article ADS CAS PubMed Google Scholar
Barizien, A., Suryateja Jammalamadaka, M. S., Amselem, G. & Baroud, C. N. Growing from a few cells: combined effects of initial stochasticity and cell-to-cell variability. J. R. Soc. Interface 16, 20180935. https://doi.org/10.1098/rsif.2018.0935 (2019).
Article CAS PubMed PubMed Central Google Scholar
Powell, E. O. Growth rate and generation time of bacteria, with special reference to continuous culture. J. Gen. Microbiol. 15, 492–511. https://doi.org/10.1099/00221287-15-3-492 (1956).
Article CAS PubMed Google Scholar
Olivier, A. How does variability in cell aging and growth rates influence the Malthus parameter?. Kinet. Relat. Mod. 10, 481–512. https://doi.org/10.3934/krm.2017019 (2017).
Article MathSciNet MATH Google Scholar
Kiviet, D. J. et al. Stochasticity of metabolism and growth at the single-cell level. Nature 514, 376–379. https://doi.org/10.1038/nature13582 (2014).
Article ADS CAS PubMed Google Scholar
Taheri-Araghi, S. et al. Cell-size control and homeostasis in bacteria. Curr. Biol. 25, 385–391. https://doi.org/10.1016/j.cub.2014.12.009 (2015).
Article CAS PubMed Google Scholar
Wang, P. et al. Robust growth of Escherichia coli. Curr. Biol. 20, 1099–1103. https://doi.org/10.1016/j.cub.2010.04.045 (2010).
Article CAS PubMed PubMed Central Google Scholar
Baake, E. & Georgii, H.-O. Mutation, selection, and ancestry in branching models: a variational approach. J. Math. Biol. 54, 257–303. https://doi.org/10.1007/s00285-006-0039-5 (2007).
Article MathSciNet PubMed MATH Google Scholar
Hashimoto, M. et al. Noise-driven growth rate gain in clonal cellular populations. Proc. Natl. Acad. Sci. U. S. A. 113, 3251–3256. https://doi.org/10.1073/pnas.1519412113 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Nozoe, T., Kussell, E. & Wakamoto, Y. Inferring fitness landscapes and selection on phenotypic states from single-cell genealogical data. PLoS Genet. 13, e1006653. https://doi.org/10.1371/journal.pgen.1006653 (2017).
Article CAS PubMed PubMed Central Google Scholar
Hoffmann, M. & Olivier, A. Nonparametric estimation of the division rate of an age dependent branching process. Stoc. Proc. Appl. 126, 1433–1471. https://doi.org/10.1016/j.spa.2015.11.009 (2016).
Article MathSciNet MATH Google Scholar
Jafarpour, F. et al. Bridging the timescales of single-cell and population dynamics. Phys. Rev. X 8, 021007. https://doi.org/10.1103/PhysRevX.8.021007 (2018).
Article CAS Google Scholar
Thomas, P. Single-cell histories in growing populations: relating physiological variability to population growth. BiorXiv https://doi.org/10.1101/100495 (2017).
García-García, R., Genthon, A. & Lacoste, D. Linking lineage and population observables in biological branching processes. Phys. Rev. E 99, 042413. https://doi.org/10.1103/PhysRevE.99.042413 (2019).
Article ADS PubMed Google Scholar
Seifert, U. Stochastic thermodynamics, fluctuation theorems and molecular machines. Rep. Prog. Phys. 75, 126001. https://doi.org/10.1088/0034-4885/75/12/126001 (2012).
Article ADS PubMed Google Scholar
Levien, E., GrandPre, T. & Amir, A. A large deviation principle linking lineage statistics to fitness in microbial populations. arXiv arXiv:2002.00019 (2020).
Tanouchi, Y. et al. Long-term growth data of Escherichia coli at a single-cell level. Sci. Data 4, 170036. https://doi.org/10.1038/sdata.2017.36 (2017).
Article CAS PubMed PubMed Central Google Scholar
Jarzynski, C. Rare events and the convergence of exponentially averaged work values. Phys. Rev. E 73, 046105. https://doi.org/10.1103/PhysRevE.73.046105 (2006).
Article ADS CAS Google Scholar
Jun, S., Si, F., Pugatch, R. & Scott, M. Fundamental principles in bacterial physiology–history, recent progress, and the future with focus on cell size control: a review. Rep. Prog. Phys. 81, 056601. https://doi.org/10.1088/1361-6633/aaa628 (2018).
Article ADS MathSciNet CAS PubMed PubMed Central Google Scholar
Amir, A. Cell size regulation in bacteria. Phys. Rev. Lett. 112, 208102. https://doi.org/10.1103/PhysRevLett.112.208102 (2014).
Article ADS CAS Google Scholar
Osella, M., Nugent, E. & Cosentino Lagomarsino, M. Concerted control of Escherichia coli cell division. Proc. Natl. Acad. Sci. U. S. A. 111, 3431–3435. https://doi.org/10.1073/pnas.1313715111 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Tzur, A., Kafri, R., LeBleu, V. S., Lahav, G. & Kirschner, M. W. Cell growth and size homeostasis in proliferating animal cells. Science 325, 167–171. https://doi.org/10.1126/science.1174294 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Robert, L. et al. Division in Escherichia coli is triggered by a size-sensing rather than a timing mechanism. BMC Biol. 12, 17. https://doi.org/10.1186/1741-7007-12-17 (2014).
Article CAS PubMed PubMed Central Google Scholar
Beentjes, C. H. L., Perez-Carrasco, R. & Grima, R. Exact solution of stochastic gene expression models with bursting, cell cycle and replication dynamics. Phys. Rev. E 101, 032403. https://doi.org/10.1103/PhysRevE.101.032403 (2020).
Article ADS PubMed Google Scholar

Download references

Acknowledgements

The authors acknowledge R. García-García for a previous collaboration, which made possible the present work. We would also like to thank L. Robert, P. Gaspard and J. Unterberger for stimulating discussions.

Author information

Authors and Affiliations

Gulliver, CNRS, ESPCI Paris, PSL University, 75005, Paris, France
Arthur Genthon & David Lacoste

Authors

Arthur Genthon
View author publications
You can also search for this author in PubMed Google Scholar
David Lacoste
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.L. and A.G. designed the study and wrote the manuscript. A.G. conducted the calculations and numerical simulations, and analyzed the data.

Corresponding author

Correspondence to Arthur Genthon.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Genthon, A., Lacoste, D. Fluctuation relations and fitness landscapes of growing cell populations. Sci Rep 10, 11889 (2020). https://doi.org/10.1038/s41598-020-68444-x

Download citation

Received: 20 April 2020
Accepted: 25 June 2020
Published: 17 July 2020
DOI: https://doi.org/10.1038/s41598-020-68444-x

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Fluctuation relations and fitness landscapes of growing cell populations

Subjects

Abstract

Similar content being viewed by others

Cell population heterogeneity driven by stochastic partition and growth optimality

The physics of cell-size regulation across timescales

First-passage-time statistics of growing microbial populations carry an imprint of initial conditions

Introduction

Theoretical framework

The backward and forward processes

Link with the population growth rate

Stochastic thermodynamic interpretation

Mixed age-size controlled models

Dynamics at the population level

Dynamics at the probability level

Illustration of the fluctuation relation

Phenotypic fitness landscapes

Size models

Age models

Constant individual growth rate

Fluctuating individual growth rates

Discussion

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Comments

Search

Quick links

Subjects

Abstract

Similar content being viewed by others

Cell population heterogeneity driven by stochastic partition and growth optimality

The physics of cell-size regulation across timescales

First-passage-time statistics of growing microbial populations carry an imprint of initial conditions

Introduction

Theoretical framework

The backward and forward processes

Link with the population growth rate

Stochastic thermodynamic interpretation

Mixed age-size controlled models

Dynamics at the population level

Dynamics at the probability level

Illustration of the fluctuation relation

Phenotypic fitness landscapes

Size models

Age models

Constant individual growth rate

Fluctuating individual growth rates

Discussion

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links