Team:WHU-China/Standard

From 2012.igem.org

(Difference between revisions)
Line 609: Line 609:
<p align="justify">
<p align="justify">
  By setting the right side of the equations to zeros, we get algebra equations about the five variables at the steady state. And after elimination we obtain the cubit equation  </p>
  By setting the right side of the equations to zeros, we get algebra equations about the five variables at the steady state. And after elimination we obtain the cubit equation  </p>
-
<p>
+
<p><center>
-
 
+
   <p>(ak<sub>1</sub>D<sup>3</sup>)x<sup>3</sup> + (D<sup>3</sup>k<sub>1</sub> - ak<sub>1</sub>D<sup>2</sup>E)x<sup>2</sup> + (k<sub>6</sub>D<sup>2</sup>V - k<sub>1</sub>D<sup>2</sup>E + k<sub>1</sub>k<sub>3</sub>DV + k<sub>2</sub>k<sub>6</sub>DV)x - k<sub>6</sub>DEV - k<sub>2</sub>k<sub>6</sub>EV = 0 </p>
-
<center>
+
-
   <img src="https://static.igem.org/mediawiki/2012/8/8e/Fatty_Fml_3.png" width="504" height="84" hspace="2" vspace="1" align="middle" />③
+
</center>  </p><p align="justify">
</center>  </p><p align="justify">
Line 625: Line 623:
  Then note that the balanced point in (4) may not be authentic when <i>k<sub>4</sub></i><<i>k<sub>3</sub></i> and <i>x<sub>3</sub><sup>*</sup></i> becomes negative, which is impossible to occur. So we stochastically generate 100 parameters in which <i>k<sub>4</sub></i><<i>k<sub>3</sub></i>, and after solve the ODE numerically we find that all variables tend to an asymptotic steady state except for <i>x<sub>3</sub></i>, which approaches +∞ as t →+∞ (Fig 2). Interestingly, we find that the authentic balanced point of <i>x<sub>1</sub></i> and <i>x<sub>2</sub></i> calculated by directly solving the ODE (1) numerically is very <i>close</i> to that calculated by formula (4). For example, when <i>E</i>=4.5249, <i>a</i>=8.0649, <i>V</i>=2.5906, <i>D</i>=1.6831, <i>k<sub>1</sub></i>=5.2315, <i>k<sub>2</sub></i>=8.6560, <i>k<sub>3</sub></i>=8.7696, <i>k<sub>4</sub></i>=1.0092, <i>k<sub>5</sub></i>=6.9635, <i>k<sub>6</sub></i>=9.3253, <i>x<sub>1</sub></i> and <i>x<sub>2</sub></i> finally approach to 2.6599 and 0.0686, respectively, while <i>x<sub>1</sub><sup>*</sup></i>=2.3952 and <i>x<sub>2</sub><sup>*</sup></i>=0.0758 (Fig 2). The term <i>close</i> may not be mathematically strict, but it plays an important role in the later discussion.  </p><p align="center">
  Then note that the balanced point in (4) may not be authentic when <i>k<sub>4</sub></i><<i>k<sub>3</sub></i> and <i>x<sub>3</sub><sup>*</sup></i> becomes negative, which is impossible to occur. So we stochastically generate 100 parameters in which <i>k<sub>4</sub></i><<i>k<sub>3</sub></i>, and after solve the ODE numerically we find that all variables tend to an asymptotic steady state except for <i>x<sub>3</sub></i>, which approaches +∞ as t →+∞ (Fig 2). Interestingly, we find that the authentic balanced point of <i>x<sub>1</sub></i> and <i>x<sub>2</sub></i> calculated by directly solving the ODE (1) numerically is very <i>close</i> to that calculated by formula (4). For example, when <i>E</i>=4.5249, <i>a</i>=8.0649, <i>V</i>=2.5906, <i>D</i>=1.6831, <i>k<sub>1</sub></i>=5.2315, <i>k<sub>2</sub></i>=8.6560, <i>k<sub>3</sub></i>=8.7696, <i>k<sub>4</sub></i>=1.0092, <i>k<sub>5</sub></i>=6.9635, <i>k<sub>6</sub></i>=9.3253, <i>x<sub>1</sub></i> and <i>x<sub>2</sub></i> finally approach to 2.6599 and 0.0686, respectively, while <i>x<sub>1</sub><sup>*</sup></i>=2.3952 and <i>x<sub>2</sub><sup>*</sup></i>=0.0758 (Fig 2). The term <i>close</i> may not be mathematically strict, but it plays an important role in the later discussion.  </p><p align="center">
-
<center><img src="https://static.igem.org/mediawiki/2012/1/16/Fatty_Fig_2.png" width="560" height="403" hspace="2" vspace="1" align="middle" /></p>
+
<center><img src="https://static.igem.org/mediawiki/2012/1/16/Fatty_Fig_2.png" width="500" height="360" hspace="2" vspace="1" align="middle" /></p>
<p><strong>Fig 2</strong> Numerical simulation when <i>k<sub>4</sub></i><<i>k<sub>3</sub></i>.  </p></center>
<p><strong>Fig 2</strong> Numerical simulation when <i>k<sub>4</sub></i><<i>k<sub>3</sub></i>.  </p></center>
Line 645: Line 643:
</div>
</div>
<div class="passage divcell2">
<div class="passage divcell2">
-
<h3>ccc</h3>
+
<h3>The Human Gut Microbiota Regulation by <i>E.coslim</i> and Mathematical Modeling</h3>
-
<p> ccccc </p>
+
<p>  
 +
<!-- Microbiota -->
 +
cccc
 +
  </p>
 +
</div>
 +
<div class="passage divcell3">
 +
<h3>Gene Expression Measurement by <i>FANCY</i></h3>
 +
<p>
 +
<!-- FANCY -->
 +
<h4>Abstract</h4>
 +
 
 +
<p>
 +
    The quantitative measurement of gene expression levels is of great importance in the research of molecule biology, but existing measuring methods are either semi-quantitative (Western Blotting) or require costly instruments and agents (Flow Cytometry). We invent a novel quantitative and inexpensive method, <i>FANCY</i>, which is short for <i>Fluorescent ANalysis of CYto-imaging</i> (also in the honor of a wise and pretty girl <a href="http:这里有一张成帆照片的超链接">Fan Cheng</a> in our team) and based on cell fluorescence imaging and Support Vector Machine, to measure the gene expression in our project. <i>FANCY</i> contains three sub-programme: <i>FANCYSelector</i> to select single cells manually and calculate their properties, <i>FANCYTrainer</i> to train an SVM, and <i>FANCYScanner</i> to identify single cells by SVM trained by <i>FANCYTrainer</i>. The application of <i>FANCY</i> to 5 typical images proves the effectiveness and the error rate is acceptable. All our programs are freely available online and we suggest <i>FANCY</i> more widely applied in iGEM and other further researches.
 +
</p>
 +
 
 +
<h4>Protocol</h4>
 +
 
 +
<p>
 +
  <strong>Preparation of fluorescent images </strong> </p><p>
 +
  1. Connect a red fluorescent protein gene (gene <i>rfp</i>, or biobrick 1-22O) with the objective gene under the same promoter in a plasmid by molecule cloning, i.e., make a polycistron which can encode both the objective gene product and RFP, thus making the expression levels of the two genes the same. Other fluorescent genes such as <i>gfp</i> or <i>yfp</i> may be also applicable.  </p><p>
 +
  2. Transform the plasmid into E.coli and incubate it until a proper density.  </p><p>
 +
  3. Prepare a slide sample of the transformed E.coli and dry it in the air. Cover the bacteria with anti-fluorescence quencher to maintain the fluorescence. The bacteria may be diluted to avoid cell mass formation on the slide during the preparation.  </p><p>
 +
  4. Take photos of the slide by fluorescent microscopy. Select visual fields with as many single cells and little cell masses as possible, and avoid photographing for too long a time since the fluorescent density may decrease under the exciting light.  </p><p>
 +
  <strong>Support vector machine training (FANCYSelector and FANCYTrainer)  </strong> </p><p>
 +
  1. 10 typical images are selected as the training images. </p><p>
 +
  2. Transfer the original RGB images to gray images <i>G</i> and binary images <i>B</i>. No image enhancement was processed in our project, but it is suggested to be done in other conditions if needed.  </p><p>
 +
  3. For each object detected in the binary image <i>B</i>, ask the user to classify whether it is a single cell or a cell mass, and after that the object will be marked TRUE or FALSE, respectively. Then calculate the area <i>A</i>, perimeter <i>P</i>, Euler number <i>E</i>, the maximal length of any two points in the object <i>L</i> and the fluorescence <i>F</i>. The fluorescence <i>F</i> is calculated by the numerical double integral of the gray level of the image <i>G</i> on the region identified by the current object in the image <i>B</i>. </p><p>
 +
  4. Record all the data obtained in Step 3 and save them in an ASCII or .mat file (optional).  </p><p>
 +
  5. Use the data <i>A</i>, <i>P</i>, <i>E</i> and <i>L</i> as the training data and the user's classification result as the group data to train the support vector machine (SVM). The Gaussian radial basis function with a scaling factor of 1 is selected as the kennel function of the SVM.  </p><p>
 +
  <strong>Single cell identification and fluorescence calculation (FANCYScanner)  </strong> </p><p>
 +
  1. For all the images taken from the slide, transfer them to gray and binary images respectively, calculate the <i>A</i>, <i>P</i>, <i>E</i>, <i>L</i> and <i>F</i> of each object in the binary images (the same as the former part) and classify them as single cells or cell masses using the SVM trained. All objects classified as single cells will be marked with * in the binary image <i>B</i>. And <i>B</i> will be saved for possible performance evaluation.  </p><p>
 +
      2. All data from objects classified as single cells will be recorded in a ASCII file. And the average and the standard variance of the fluorescent density (fluorescence <i>F</i> divided by area <i>A</i>) is calculated as the quantitative description of the objective gene expression level.
 +
</p>
 +
 
 +
<h4>Performance Evaluation</h4>
 +
 
 +
<p>
 +
    We randomly selected and apply both <i>FANCY</i> and manual identification to 5 other images to evaluate the performance of <i>FANCY</i>. The false positive and false negative rate of <i>FANCY</i> are calculated in comparison to manual identification, which defaults to be absolutely correct. 179 positive and 122 negative objects in all are identified by <i>FANCY</i> and the evaluation result is shown in Table 1.
 +
</p>
 +
 
 +
<p><br><center><strong>Table 1</strong></p>
 +
<div align="center"><img src="https://static.igem.org/mediawiki/2012/2/2c/Table_1_in_FANCY.png" width="726" height="122" hspace="2" vspace="1" align="center" /> 
 +
    </center>
 +
 
 +
    <br>
 +
 
 +
</div>
 +
<center>
 +
<img src="https://static.igem.org/mediawiki/2012/6/60/Fig_1_in_FANCY.png" width="446" height="184" hspace="2" vspace="1" align="center" />
 +
</center>
 +
 
 +
<p>
 +
<small><strong>Fig 1</strong>. Typical false positive and false negative results of <i>FANCY</i> identification. <strong>A</strong>. The identification result. <strong>B</strong>. The original fluorescent image (some cells in the original image are not able to be detected after image transformation).  </small>
 +
</p>
 +
 
 +
<p><br>
 +
    The false positive and negative rate may decrease with more training data and proper image enhancement.
 +
</p>
 +
 
 +
<h4>Source Code</h4>
 +
<p>
 +
    All source code programmed in Matlab can be freely downloaded <a href="https://static.igem.org/mediawiki/2012/8/80/FANCY.zip">here</a>. For detailed usage please refer to the annotation in the code.
 +
</p>
</div>
</div>
</div>
</div>

Revision as of 22:08, 24 September 2012

    Future Perspective

      To date, we have built all our three devices and tested the function and regulation of each biobrick. However, our aim is far more than just making a toy in the laboratory. We aim to create a product that can be widely applied in clinical and other areas.

      First, we will amalgamate the three devices, namely, the Fatty Acid Degradation, the Cellulose Synthesis, and the Colonization, into one whole system. Since there are tens of genes and regulation elements in all and their total length may exceed 30kb, a larger vector like λ phage rather than a plasmid may be adopted. It may also be a good choice to integrate the three devices into the chromosome.

      Second, the Escherichia coli may be a good model for molecule cloning operations, but not suit for pharmacy since its possible risk in infection and diseases. We propose Lactobacillus as a better model because not only its safety has been well demonstrated in food industry, but also the yogurt made from the genetically modified Lactobacillus will possess the property of making you slim!

      The function of the whole system in Escherichia coli or Lactobacillus will be tested both in vitro and in vivo to confirm the effectiveness of our project. The gut microbiota and E.coslim will be inoculated in a glass tube, through which plasma made from different kinds of food will flow. We will test the changes of the microbe community inoculated. Furthermore, a gut microbiota transplantation experiment on mice may also be conducted for further confirmation.

      Our product will be finally packaged into two capsules. Capsule A is E.coslim, which is able to influent the human body's absorption to high energy contained nutrients, regulate the gut microbiota and make you slimmer day by day. And Capsule B is xylose, which can induce the Death Device in E.coslim and avoid people from malnutrition after taking in E.coslim for a long enough time.

      Last but not least, we not only create a new microbe that can make people slim, but also provide tools to sense the fatty acid, the glucose and the xylose in other circumstances. So the new biobricks we submit may be applied in a variety of areas, such as the degradation of waste oil, urine sugar test for diabetes patients, safety control of genetic engineering, and so on. We are looking forward the day when E.coslim is truly applied in people's life, in clinical and more widely in other areas beyond our imagination.

    Model I: Fatty Acid Degradation

      The Fatty Acid Degradation Device may be the most complicated part in our project, along with its great importance. The antagonistic relationship between gene fadR and those related to β oxidation -- the fadL, fadD, etc, -- makes it regulatable to the concentration of fatty acid in the environment. Thus, it is necessary to explore the quantitative response corresponding to the concentration change of fatty acid. We build an ordinary differential equations-based mathematical model to describe the device and find a proper set of parameters under which the proportion of the steady expression level of fadL to fadR changes broadly from 0.2 to 3.5. The model mathematically demonstrates the effectiveness of the Fatty Acid Degradation Device and also provides meaningful clues for the optimization of the device in experiments.

    The Ordinary Differential Equations of the Model

      We conduct an evaluation by mathematical modeling and build the ordinary differential equations (ODE) as follows:

     

      For simplicity, all genes with a promoter PfadR and equally regulated by FadR are deemed as a whole and represented as FadX, i.e., FadX refers to FadL, FadD, FadE, FadA, FadB, FadI, FadJ. And the Complex, or variable x7, refers to the Fatty Acyl-CoA-FadR Complex.

      Parameters in the ODEs:

      ① E denotes the constitutive expression rate of FadR, and D the degradation rates of FadR, FadX and Complex, which is assumed equal.

      ② a denotes the affinity of FadR to the promoter PfadR, and V denotes the background expression rate of related genes.

      ③ k1 and k2 denote the forward and reverse reaction rate coefficients, respectively. k3 to k6 are parameters related to enzyme-catalyzed reactions based on the Michaelis-Menten Equation. Specially,

     


      while f denotes the concentration of fatty acid outside the bacteria, KL the Michaelis constant of FadL, and kL the maximal activity of FadL. Details for the ODE can be illustrated in Fig 1.

     

    Fig 1 Illustration of the meaning of the ODE

    Analysis on the Steady State of the ODE

      By setting the right side of the equations to zeros, we get algebra equations about the five variables at the steady state. And after elimination we obtain the cubit equation

    (ak1D3)x3 + (D3k1 - ak1D2E)x2 + (k6D2V - k1D2E + k1k3DV + k2k6DV)x - k6DEV - k2k6EV = 0 ③

      And the value of each variable in its steady state (the balanced point) is

      The ODE (1) is highly complicated and we adopt numerical methods to analyze its properties. First, we generate 100000 sets of parameters stochastically (all in interval [0,10], and this setting keeps unchanged without special statement) to see the root distribution of equation (3). The results show that there is only one real positive root in 99890 cases and 3 in the rest cases. No cases when the real positive root doesn't exist are found.

      Then note that the balanced point in (4) may not be authentic when k4k3 and x3* becomes negative, which is impossible to occur. So we stochastically generate 100 parameters in which k4k3, and after solve the ODE numerically we find that all variables tend to an asymptotic steady state except for x3, which approaches +∞ as t →+∞ (Fig 2). Interestingly, we find that the authentic balanced point of x1 and x2 calculated by directly solving the ODE (1) numerically is very close to that calculated by formula (4). For example, when E=4.5249, a=8.0649, V=2.5906, D=1.6831, k1=5.2315, k2=8.6560, k3=8.7696, k4=1.0092, k5=6.9635, k6=9.3253, x1 and x2 finally approach to 2.6599 and 0.0686, respectively, while x1*=2.3952 and x2*=0.0758 (Fig 2). The term close may not be mathematically strict, but it plays an important role in the later discussion.

    Fig 2 Numerical simulation when k4k3.

      Besides, x4* and x5* may also be negative. We also generate 100000 sets of parameters stochastically (without the limitation of k4k3) to see how frequently x4* or x5* will be negative and what will it be like. However, it turns out that in no case will x4* orx5* be negative. So we may draw a fuzzy conclusion according to all the results above that under most conditions (99.89%), we can obtain a balanced point of ODE (1) which may not be authentic by formula (4). Fuzzy as the conclusion is, it is still useful to serve as an indicator for searching for a proper set of parameters, under which the Fatty Acid Degradation Device is highly regulatable.

    Parameter Screening

      It is expected that when the concentration of fatty acid in the environment is high, the expression level of gene fadR is relatively high while that of gene fadX is relatively low, and vice versa. And among the 10 parameters in ODE (1), k3 is positively related to the concentration of fatty acid outside the bacteria according to formula (2). So we assume that the device is thought to be regulatable when k3 is equal to 0.5 and 1.5, the ratio of the expression level of fadX to fadR at steady state rises but is still lower than 0.5, and when k3 is equal to 8.5 and 9.5, the ratio also rises and is both greater than 2.0.

      We take advantage of the simplicity in calculating complexity of formula (4) to calculate the steady expression levels of fadR and fadX despite its possible errors. 10000 random parameters (k3 excluded) are generated and for each k3 in [0.5, 1.5, 8.5, 9.5], the balanced point of ODE (1) is calculated according to formula (4), respectively. Then compare the ratio of the expression levels of fadX to fadR at the balanced point and save it if it meets the condition above. We verify all the parameters saved by directly solving the ODE (1) numerically to see if it really meets the condition. 182 out of 10000 sets of parameters are saved and 57 of them remain after the verification. A typical example below (Fig 3) illustrates the change of the expression levels of fadR and fadX and their ratio corresponding to k3. As k3 increases, the ratio rises smoothly from 0.2 to 3.5, while the expression level of fadX rises from 0.6 to 2.0, and that of fadR decreases from 2.6 to 0.6.

    Fig 3 The change of the expression levels of fadR and fadX and their ratio corresponding to k3.

    Conclusion

      To evaluate the response of gene expression levels to the concentration of fatty acid in the environment quantitatively, we build a mathematical model based on ODE and demonstrate that the antagonistic relationship between fadR and fadX serves as a linear regulator to the gene expression. This is important for the function of Fatty Acid Degradation Device because the model suggests that the Device can adjust itself to an appropriate state when induced by fatty acid and function properly rather than changes drastically. So the Device is implied mathematically to possess a great potential of applications in human being.

    The Human Gut Microbiota Regulation by E.coslim and Mathematical Modeling

    cccc

    Gene Expression Measurement by FANCY

    Abstract

        The quantitative measurement of gene expression levels is of great importance in the research of molecule biology, but existing measuring methods are either semi-quantitative (Western Blotting) or require costly instruments and agents (Flow Cytometry). We invent a novel quantitative and inexpensive method, FANCY, which is short for Fluorescent ANalysis of CYto-imaging (also in the honor of a wise and pretty girl Fan Cheng in our team) and based on cell fluorescence imaging and Support Vector Machine, to measure the gene expression in our project. FANCY contains three sub-programme: FANCYSelector to select single cells manually and calculate their properties, FANCYTrainer to train an SVM, and FANCYScanner to identify single cells by SVM trained by FANCYTrainer. The application of FANCY to 5 typical images proves the effectiveness and the error rate is acceptable. All our programs are freely available online and we suggest FANCY more widely applied in iGEM and other further researches.

    Protocol

      Preparation of fluorescent images

      1. Connect a red fluorescent protein gene (gene rfp, or biobrick 1-22O) with the objective gene under the same promoter in a plasmid by molecule cloning, i.e., make a polycistron which can encode both the objective gene product and RFP, thus making the expression levels of the two genes the same. Other fluorescent genes such as gfp or yfp may be also applicable.

      2. Transform the plasmid into E.coli and incubate it until a proper density.

      3. Prepare a slide sample of the transformed E.coli and dry it in the air. Cover the bacteria with anti-fluorescence quencher to maintain the fluorescence. The bacteria may be diluted to avoid cell mass formation on the slide during the preparation.

      4. Take photos of the slide by fluorescent microscopy. Select visual fields with as many single cells and little cell masses as possible, and avoid photographing for too long a time since the fluorescent density may decrease under the exciting light.

      Support vector machine training (FANCYSelector and FANCYTrainer)

      1. 10 typical images are selected as the training images.

      2. Transfer the original RGB images to gray images G and binary images B. No image enhancement was processed in our project, but it is suggested to be done in other conditions if needed.

      3. For each object detected in the binary image B, ask the user to classify whether it is a single cell or a cell mass, and after that the object will be marked TRUE or FALSE, respectively. Then calculate the area A, perimeter P, Euler number E, the maximal length of any two points in the object L and the fluorescence F. The fluorescence F is calculated by the numerical double integral of the gray level of the image G on the region identified by the current object in the image B.

      4. Record all the data obtained in Step 3 and save them in an ASCII or .mat file (optional).

      5. Use the data A, P, E and L as the training data and the user's classification result as the group data to train the support vector machine (SVM). The Gaussian radial basis function with a scaling factor of 1 is selected as the kennel function of the SVM.

      Single cell identification and fluorescence calculation (FANCYScanner)

      1. For all the images taken from the slide, transfer them to gray and binary images respectively, calculate the A, P, E, L and F of each object in the binary images (the same as the former part) and classify them as single cells or cell masses using the SVM trained. All objects classified as single cells will be marked with * in the binary image B. And B will be saved for possible performance evaluation.

    2. All data from objects classified as single cells will be recorded in a ASCII file. And the average and the standard variance of the fluorescent density (fluorescence F divided by area A) is calculated as the quantitative description of the objective gene expression level.

    Performance Evaluation

        We randomly selected and apply both FANCY and manual identification to 5 other images to evaluate the performance of FANCY. The false positive and false negative rate of FANCY are calculated in comparison to manual identification, which defaults to be absolutely correct. 179 positive and 122 negative objects in all are identified by FANCY and the evaluation result is shown in Table 1.


    Table 1


    Fig 1. Typical false positive and false negative results of FANCY identification. A. The identification result. B. The original fluorescent image (some cells in the original image are not able to be detected after image transformation).


        The false positive and negative rate may decrease with more training data and proper image enhancement.

    Source Code

        All source code programmed in Matlab can be freely downloaded here. For detailed usage please refer to the annotation in the code.