Team:Colombia/Modeling/Paramterers

From 2012.igem.org

(Difference between revisions)
(How did we do it?)
(How did we do it?)
Line 34: Line 34:
== How did we do it? ==
== How did we do it? ==
 +
For Pest-busters we used options i) and iii). In the literature, it was possible to find the parameters for the complete CI activation box.  For the rest of values, we found an approximately rank where they could be. Thus, we proceeded to step iii).
-
For our biological system we use options i) and iii). In the literature it was possible to find the parameters for the complete CI activation box and for the rest of values we found an aproximately rank where they could be, so we proceeded to step iii).
+
We used a lot of resources and methods to approximate the limits for this ranges.  Here we show an explanation:
-
 
+
-
We used a lot of resources and methods to approximate the limits for this ranges here is its explanation:
+
[[File:tabla2.png|center|300x250pxpx]]
[[File:tabla2.png|center|300x250pxpx]]
Line 43: Line 42:
'''''a.Basal levels of the proteins:'''''
'''''a.Basal levels of the proteins:'''''
-
To determine a range in which this value we described mathematically a very simple process. Suppose there is this typical differential equation:
+
Determining a rank which this value could be,  we described mathematically a very simple process. Suppose there is a usual differential equation:
[[File:alfbsal1.png|center|120x120px]]
[[File:alfbsal1.png|center|120x120px]]
-
Where αo is the basal level, γ is the protein parameter for degradation and P is the protein concentration, now with this, we can assume that in steady state conditions we have that dP/dt= 0, then:
+
Where αo is the basal level, γ is the protein parameter for degradation and P is the protein concentration. Thus, we can assume that in steady state conditions we have that dP/dt= 0:
[[File:alfbasal2.png|center|80x60pxpx]]
[[File:alfbasal2.png|center|80x60pxpx]]
-
But γ is approximately 1/T, having that:
+
But γ is approximately 1/T, then:
[[File:alfbasal3.png|center|80x60pxpx]]
[[File:alfbasal3.png|center|80x60pxpx]]
-
To establish the range we assumed T as φ and we looked for a normal range of concentration of a protein in a cell [http://bionumbers.hms.harvard.edu/bionumber.aspx?&id=104520&ver=10&trm=protein]. Another important aspect we took into account is that there were constitutive's proteins and inducible's proteins, so we divided the basal levels in two class, one for the basal proteins and other for the constitutive proteins, now with this, the following ranges were defined:  
+
Establishing the rank, we assumed T as φ and we looked for a normal rank of concentration of a protein in a cell [http://bionumbers.hms.harvard.edu/bionumber.aspx?&id=104520&ver=10&trm=protein]. Another important aspect is that there were constitutive proteins and inducible proteins. Thus, we divided the basal levels in two class: i) one for basal proteins and ii)  one for constitutive proteins. Finally, following ranges were defined:  
Basal proteins range (LuxR, LuxI, HipB and Salicylic Acid):
Basal proteins range (LuxR, LuxI, HipB and Salicylic Acid):
[[File:range1.png|center|156x104pxpx]]
[[File:range1.png|center|156x104pxpx]]
-
+
 
Constitutive proteins range (HipA7,Sensor, Chitinase, Chitoporin and CBP):
Constitutive proteins range (HipA7,Sensor, Chitinase, Chitoporin and CBP):
[[File:z.png|center|153x105pxpx]]
[[File:z.png|center|153x105pxpx]]
-
By other part the basal production of HipA has to be constitutive and inducible because we want the cell to be always sleep (constitutive) and when there is an impulse of chitin or 3-OH-PAME that awakes the cell, we want a positive control to make it sleep again (inducible). The basal concentration used in the equations was the constitutive one because is the number that's gonna have more effect in the response.  
+
By the other hand, the basal production of HipA has to be constitutive and inducible because the cell has to have three stages. The first stage is always sleep (constitutive) due to HipA effect. Once an impulse of chitin or 3-OH-PAME is presented, it is going to awake the cell due to concentration decrease of HipA . This is the second stage of the cell. The third stage is when cell is slept again which it is achieve through a positive control (inducible). The basal concentration used in the equations was the constitutive one because it is going to have more effect in the response.  
'''b.Protein degradation:'''
'''b.Protein degradation:'''
-
Our unit of time is the bacteria's life's time, because in normal cells it's when the proteins are diluted and become less in the cell. One of the facilities of using for the values related with time the E.Coli’s lifetime is that we can establish easily the protein degradation, taking in count that a determined amount of proteins is always destructed each time our unit time is completed, we defined the protein degradation for almost all values as follows:
+
The chosen unit of time was the bacteria life time because it represents when the proteins are diluted and decrease in the cell. We can establish easily the protein degradation using values related to life time of E.Coli. We defined the protein degradation for almost all values as follows:  
[[File:degrada.png|center|100x80pxpx]]
[[File:degrada.png|center|100x80pxpx]]
-
Again we can found that there is a special case, the degradation of HB is greater so we assigned it a value of 4/φ.  
+
Again, we found a special case. Degradation of HB is greater, then we assigned it a value of 4/φ.  
'''c. Reaction constant:'''
'''c. Reaction constant:'''
-
This value can vary a lot depending of the process we are describing, fortunately we found an article that describes how to approximate this value [http://www.biokin.com/dynafit/scripting/html/node23.html#SECTION00431000000000000000] so we took the given approximation and turned it into our units establishing a range we considered prudent.  
+
This value can vary depending of the described process. Fortunately, we found in literature how to approximate this value [http://www.biokin.com/dynafit/scripting/html/node23.html#SECTION00431000000000000000]. However, we took the approximation and turned it into units described above.  
[[File:reactioncte.png|center|186x126pxpx]]
[[File:reactioncte.png|center|186x126pxpx]]
Line 83: Line 82:
'''d. Hill coefficients k:'''
'''d. Hill coefficients k:'''
-
First of all, we looked for some reference in different data sources where we found the coefficient of CI ([http://www.sciencemag.org/content/307/5717/1962.short  Nitzal et al.2005]), now with this, we decided to approximate a range taking this value as a reference.
+
We looked for some reference in different data sources and we found the coefficient of CI ([http://www.sciencemag.org/content/307/5717/1962.short  Nitzal et al.2005]). Then, we decided to approximate a rank taking this value as a reference.
[[File:zzzzzzz.png|center|145x95pxpx]]
[[File:zzzzzzz.png|center|145x95pxpx]]
Line 89: Line 88:
'''e. Hill coefficients n:'''
'''e. Hill coefficients n:'''
-
This case was similar to determining the hill coefficient k, we based on the value of n found for CI( [http://www.sciencemag.org/content/307/5717/1962.short NItzal et al.2005]), it’s important to say that in this case there was an analysis that took into account how many molecules interfere with the gene activation.
+
We based on the value "n" found for CI( [http://www.sciencemag.org/content/307/5717/1962.short NItzal et al.2005]). In this case, there was an analysis that took in account the number of molecules that interfere with gene activation.
-
                                     
+
 
[[File:zzzzRRzzz.png|center|260x172pxpx]]
[[File:zzzzRRzzz.png|center|260x172pxpx]]
'''f. Maximum cell production (β):'''
'''f. Maximum cell production (β):'''
-
Having that the concentration of Rubisco, which is about 20 times the average concentration of a protein [http://bionumbers.hms.harvard.edu/bionumber.aspx?&id=107431&ver=1&trm=rubisco], and the rate of a producing a average protein we assign the limits for this range.  
+
Considering the concentration of Rubisco, which is about 20 times the average concentration and rate of production of a protein, [http://bionumbers.hms.harvard.edu/bionumber.aspx?&id=107431&ver=1&trm=rubisco], we assign the limits for this range as follows.
[[File:RRMoizzz.png|center|136x90pxpx]]
[[File:RRMoizzz.png|center|136x90pxpx]]
-
 
'''CI PARAMETERS'''
'''CI PARAMETERS'''
-
All the parameters for the CI activation system were found in the literature. ([http://www.sciencemag.org/content/307/5717/1962.short  Nitzal et al.2005]) We only made an change of units  
+
All the parameters for CI activation system were found in the literature. ([http://www.sciencemag.org/content/307/5717/1962.short  Nitzal et al.2005]). We only made a change of units.
[[File:CI.png|center|150x150pxpx]]
[[File:CI.png|center|150x150pxpx]]
-
Once the ranks of the parameters were set, we proceed to do the screening. We tried a lot of techniques and most of the failed, here we present the steps of the succesful one:
+
Once the ranks of the parameters were set, we proceed to do the screening. We tried a lot of techniques but we failed. Here we present the steps for the successful one:
-
 
+
----
----

Revision as of 00:24, 27 September 2012

Team Colombia @ 2012 iGEM

Template:Https://2012.igem.org/User:Tabima

Parameters of the equations

When we want to model a biological process, it is necessary to write the differential equation that model the system which requires a number of constants. If the real value for the constants are unknown, the system can not have any biological sense. These entries are called parameters and they are crucial elements in the mathematical model made in this project.

There are three possible ways to find this parameters:



i) Literature. There are a lot of studies trying to find biological parameters, such as basal rate of protein production, kinetic constants, Monod's constants, etc. Moreover, there are so many biological systems and only a few of them have been characterized. Thus, it is difficult to find the parameters that are needed.



ii) Experimental way. If an experiment is made using the biological system of interest, it is possible to find the parameters for the equations that models the whole system. For this project, it was necessary to model the biological system first. Thus, experiments couldn't be performed to find the constant for the differential equations.




iii) Screening of parameters. Sometimes we don't have the exact number that we need, but we have a rank where it could be or the parameter for a similar biological system, then we can perfom a screening of parameter, where we try to find the value that perfectly fits the reponse of our circuit.

How did we do it?

For Pest-busters we used options i) and iii). In the literature, it was possible to find the parameters for the complete CI activation box. For the rest of values, we found an approximately rank where they could be. Thus, we proceeded to step iii).

We used a lot of resources and methods to approximate the limits for this ranges. Here we show an explanation:

Tabla2.png

a.Basal levels of the proteins:

Determining a rank which this value could be, we described mathematically a very simple process. Suppose there is a usual differential equation:

Alfbsal1.png

Where αo is the basal level, γ is the protein parameter for degradation and P is the protein concentration. Thus, we can assume that in steady state conditions we have that dP/dt= 0:

Alfbasal2.png

But γ is approximately 1/T, then:

Alfbasal3.png

Establishing the rank, we assumed T as φ and we looked for a normal rank of concentration of a protein in a cell [1]. Another important aspect is that there were constitutive proteins and inducible proteins. Thus, we divided the basal levels in two class: i) one for basal proteins and ii) one for constitutive proteins. Finally, following ranges were defined:

Basal proteins range (LuxR, LuxI, HipB and Salicylic Acid):

Range1.png

Constitutive proteins range (HipA7,Sensor, Chitinase, Chitoporin and CBP):

Z.png

By the other hand, the basal production of HipA has to be constitutive and inducible because the cell has to have three stages. The first stage is always sleep (constitutive) due to HipA effect. Once an impulse of chitin or 3-OH-PAME is presented, it is going to awake the cell due to concentration decrease of HipA . This is the second stage of the cell. The third stage is when cell is slept again which it is achieve through a positive control (inducible). The basal concentration used in the equations was the constitutive one because it is going to have more effect in the response.

b.Protein degradation:

The chosen unit of time was the bacteria life time because it represents when the proteins are diluted and decrease in the cell. We can establish easily the protein degradation using values related to life time of E.Coli. We defined the protein degradation for almost all values as follows:

Degrada.png

Again, we found a special case. Degradation of HB is greater, then we assigned it a value of 4/φ.

c. Reaction constant:

This value can vary depending of the described process. Fortunately, we found in literature how to approximate this value [2]. However, we took the approximation and turned it into units described above.

Reactioncte.png

d. Hill coefficients k:

We looked for some reference in different data sources and we found the coefficient of CI (Nitzal et al.2005). Then, we decided to approximate a rank taking this value as a reference.

Zzzzzzz.png

e. Hill coefficients n:

We based on the value "n" found for CI( NItzal et al.2005). In this case, there was an analysis that took in account the number of molecules that interfere with gene activation.

ZzzzRRzzz.png

f. Maximum cell production (β):

Considering the concentration of Rubisco, which is about 20 times the average concentration and rate of production of a protein, [3], we assign the limits for this range as follows.

RRMoizzz.png

CI PARAMETERS

All the parameters for CI activation system were found in the literature. (Nitzal et al.2005). We only made a change of units.

CI.png

Once the ranks of the parameters were set, we proceed to do the screening. We tried a lot of techniques but we failed. Here we present the steps for the successful one:



Step 1.Sensitivity Analysis

We asked ourselves for the weighted importance of each of these parameters to our design’s main outputs, namely Salicylic Acid, the toxin HipA7 and the antitoxin HipB. Hence, we came up with a sensitivity test in order to understand how every parameter adds up to our model. This test does not know about the optimal or most useful parameter values, but instead informs us of how and how much will our final response vary in function of each parameter.

The test consisted of the following: Knowing the ranges of the parameters (see above) we determined appropriate range divisions to obtain enough data for analysis (to be known as step size from here onwards). Then we iterated each parameter values while leaving the others fixed in our MATLAB code.

Note: the exact value of each parameter is not important. What matters is their relevance and how they change the response.

Here is an example:

Parameter: Chitinase leakage production

Range: 0.1-0.9 μM

Step size: 0.01

Sa.png

The diagram above shows that the chitinase leakage production does not have an important effect in salicylic acid, toxin, or antitoxin behaviour.

We make this procedure for all the parameters in both systems (Ralstonia and Rust), here are the results:

Rust:

The following table show us the different parameters involved in our coffee rust detection system and how they affect the final behaviour.

Resultsa.png

The following graphs represent some more relevant examples. It seems our coffee rust detection system is not directly sensible to some parameters. The blue line is the Salicylic Acid, the green is HipA7 and the red one is HipB:

Resutsa1.png

Ralstonia

We expose the results obtained by the sensibility test for Ralstonia. The next table shows the information obtained by the test so that we can extract two important aspects we can stand out from our analysis; the first one is to distinguish between those parameters that, when varying, make a relevant change in the model, from those that don’t, and the second aspect is, having relevant parameters, the classification of them taking into account those ones that are directly or inversely proportional to the salicylic acid response denoting them respectively as “+” and “-“.

Resultsa3.png


The following image shows the results obtained for almost all the relevant parameters graphing the response of the salicylic acid in function of the value of a determined parameter:

Resultsa4.png


Now we have an idea of which parameters affect the response the most. Although we still don't know what is the exact value of the parameter we know how the system is supose to response.With this two things we can proceed to step 2.



Step 2. Optimization of parameters

Within all the ranges of the parameters there is a number that combined with the other parameters makes the system behave just like is expected. There are many possible combinations where this can happen, and is represented as an area where the system woks. The figure below shows an example of the acceptable area of two parameters.

Param.png

In two dimensions this sees easy to identify but when we are dealing with more than 10 parameters, doing this by hand or with iterations in a computer takes a lot of time (weeks or even months).

The goal in this step is to find a point within this area and from here the screening can begin. And one way for doing this is by a process of optimization

Param2.png

A process of optimization takes a function and maximizes o minimizes changing some variable of it. The changing variables are named optimization variables and function is called objective function. If we want to have some limitations or behaves with the variable we can add restrictions to the system, and the function will be optimized without breaking this limitations.

In our case the objective function is a minimum difference of squares between the points of our expected behavior and the response of the equations with a set of optimization variable. When the distance between these points is minimum the parameters are found!!!

This optimization process was done discretizing the differential equations and putting them as restrictions, the same was done for the stable state concentration where the differential equation must be equal to zero when the chitin is zero. All was programed in the software Gamside.

Unfortunately the firsts results were not satisfactory, so we are now developing a new code to run.



3.Screening
If we have a set of parameters why do we do a screening?

In the last step we found a set of parameters that give the expected response of the system. But this is only a point... What does happen with the rest of points of the area? Does the optimization method “know” if these parameters are real or have biological sense? As long as we specified one single response for the parameters, we don’t know if there are other possible responses of valid solutions. Do they exist?

To answer these questions we performed a screening of the parameters. Beginning at the resulting point of the optimization, we started to move in all directions in a fixed range and then we examined when the acceptable area ends.

How much do we move, depends on the sensitivity analysis – If the response is not very sensible to the parameter, we take longer steps than the case in which the response is largely sensible.

Param3.png

Now we need to know if we are in the acceptable area or not. To solve this problem we defined some ranges where we want the Salicylic Acid to be depending on the impulse of chitin or 3-OH-PAME. The figure below shows this:

Param4.png
 -Note: Due to the fact that this takes a lot of computational time we are parallelizing the code 
and we expect to run the tests shortly