From 2012.igem.org

(Difference between revisions)

Revision as of 14:40, 23 September 2012

Modeling Contents

HGTBass

Propagation

Human

Calculation

In this page

Overview

Input of Our Calculation

Formula Derivation

The Calculation

Calculation Results of ΔG

Prediction Of Protein Expression Amount

Model Extension

Header-3

Link-1

Link-2

Link-3

Calculation and Derivation of the protein expression amount model in three states

Overview

Three problems came up while we start calculating: what sequences to input, which method to use and how cogent the result will be. As for sequence, we input the both the SD and the protein coding sequence.

Our goal is to calculate the total ΔG of each reaction and then predict the amount of protein expressed. There are several softwares dealing with DNA or RNA base-pairing progress, such as NUPACK, RBS-Calculator, and Vienna RNA etc. After comparison, we decide to use RBS-Calculator.

Due to the complication of translation progress and our lack of insight in this issue, the results of our modeling can’t be very precise. But at least it should have the precision of order of magnitude.

Input of Our Calculation

RBS sequence

RFP, normal RBS

ATTTCACACATACTAGAGAAAGAGGAGAAATACTAGATGGCTTCCTCCGAAGACGTTATCAAAGAGTT

CATGCGTT

RFP, orthogonal RBS

ATTTCACACATGTTCCGTACTAGATGGCTTCCTCCGAAGACGTTATCAAAGAGTTCATGCGTT

GFP, normal RBS

TACTAGAGAAAGAGGAGAAATACTAGATGCGTAAAGGAGAAGAACTTTTCACTGGAGTTGTCCCAAT

TCTTGTT

16S rRNA sequence

normal 16S: ACCTCCTTA
orthogonal 16S: ACGGAACTA

Formula Derivation

Basic Idea

For model design, please refer to Design part. Data of the curve Er-time is obtained from E_c=K∙E_r experiments. The function of our model is to work out the proportion factor 'K'.

Basic Assumption

The expression of RFP and GFP are independent.
The expression of the two proteins are determined by the percentage of normal and orthogonal ribosomes rather than the number of the two ribosomes.
The growth curve of bacteria do not change significantly after the transferred into orthogonal protein expression system.

Formula Derivation

We start from Formula 1 and 2. In Formula 1, m stands for the number of mRNA transcript, R_tot is the total number of ribosomes, β is the apparent Boltzmann constant, ∆G_tot is the total change of Gibbs free energy, k is proportion factor.

Because the GFP and RFP coding sequence are on the same mRNA transcript, the values of m of GFP and RFP are always same. In different state and time, the total number of ribosomes varies. We assume that at the same time R_tot of different state remain same. Calculation of DG_tot is the main job of this model. The proportion factor 'k' represents all unknown factors. Here we assume that as for the same protein in deferent state 'k' varies little.

The Calculation

Control State (c)

Figure 1. Diagram for control state

The two formulas above serve as denominators in following deduction. We get series of disjointed data of Function 3 and 4 through experiments. The amount of GFP and RFP can't be measured directly so we measured the fluorescence intensity of each protein. And because all the formula in this model are based on a singular cell, we must consider the influence caused by the number and growing condition of bacteria.

Experimental state 1 (E1)

Figure 2. Diagram for experimental state 1

The expression of GFP is composed of two part, the expression of n-RBS::n-16S and of n-RBS::o-16S.

There is no difference for the expression of GFP in Experimental state ONE from the control state, except for the distribution of ribosomes. For this reason, we presume that kc1,G and kr,G are nearly equal and thus Kc1,G equal to 1. The same thought is also shown in following derivation.

We noticed that the proportion factors in equation 6 and 8 are equal, which is not a coincident. This is because ΔG_3,G-ΔG_1,G≈ΔG_3,R-ΔG_1,R. The difference of ΔG_3,G and ΔG_1,G.

Experimental state 2 (E2)

Figure 3. Diagram for experimental state 2

Note : Strictly speaking, the factor K should better be obtained from experimental data rather than assumed to be 1 for such simplification could lead to much deviation from real value.

The Calculation of ΔG_tot

In most cases, the difference of delta G among the four pathways are mainly reflected by the ΔG_mRNA-rRNA. There are two ways to calculate the ΔG_mRNA-rRNA:

With the help of RBS calculator;
Use the method in literature Computational design of orthogonal ribosomes.

There are some differences between the input and output of the two method.

1st method:

input:
- standby + RBS + Spacing + Start Codon + Protein-Coding,
- 16S 3' last nine bases;
output: ΔG_mRNA:rRNA, ΔG_start, ΔG_spacing, ΔG_standby, ΔG_mRNA;
Software: RBS-Calculator;
Strength: taking more factors into our consideration and are more accessible to real condition;
Weakness: not sure of the input sequence.

2nd method:

input:
- ASD sequence,
- SD sequence (each are 6 bases long);
Output: ΔG under different conditions;
Software: RNA-Cofold in Vienna RNA web servers;
Strength: only need to input the SD and ASD sequence which is very easy;
Weakness: not as accurate as the first method.

Similarity of the two method：The core program of RBS-Calculator is based on Vienna RNA.

There are also two points that should be focused on during our modeling:

the specification of input;
the analysis of errors if there are any in the output.

As for analysis of errors, the most frequent errors are the Long-Range Paring which occurred when the head and the tile of the mRNA sequence complement with each other within the sequence itself. In such case, the ΔG_mRNA result is not accurate. We usually use RNA-Fold to calculate the accurate ΔG_mRNA to avoid such error.

Calculation Results of ΔG

Prediction of Protein Expression Amount

We should, first of all, measure the relative protein expression amount (fluorescence intensity) and to obtain the parameters in the model through regression. After the obtaining the model parameters, we can predict the amount of protein expressed and to compare them with that of wet lab results. The workflow of our prediction process is shown in Figure 4.

Figure 4. Workflow of our modeling prediction

Model Extension

In the previous protein expression model, we just suppose the existence of the orthogonal expression system do not have any significant impact on our system. However, to make our system more accurate, we also need to take the factor of estimation.

@@ Line 166: / Line 166: @@
 [[File:TJU2012-Mode-cal-equ-2.png|center|equation3_4]]
-The two formulas above serve as denominators in following deduction. We get series of disjointed data of function 3 and 4 through experiments. The amount of GFP and RFP can’t be measured directly so we measured the fluorescence intensity of each protein. And because all the formula in this model are based on a singular cell, we must consider the influence caused by the number and growing condition of bacteria.
+The two formulas above serve as denominators in following deduction. We get series of disjointed data of Function 3 and 4 through experiments. The amount of GFP and RFP can't be measured directly so we measured the fluorescence intensity of each protein. And because all the formula in this model are based on a singular cell, we must consider the influence caused by the number and growing condition of bacteria.
 ===Experimental state 1 (E1)===

Team:Tianjin/Modeling/Calculation

From 2012.igem.org

Revision as of 14:40, 23 September 2012

Contents

Overview

Input of Our Calculation

RBS sequence

16S rRNA sequence

Formula Derivation

Basic Idea

Basic Assumption

Formula Derivation

The Calculation

Control State (c)

Experimental state 1 (E1)

Experimental state 2 (E2)

The Calculation of ΔGtot

Calculation Results of ΔG

Prediction of Protein Expression Amount

Model Extension

The Calculation of ΔG_tot