Team:SUSTC-Shenzhen-B/algorithm

From 2012.igem.org

(Difference between revisions)
(Created page with "{{tempalte:sustc_shenzhen_b/1}} <html> <head> <meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> <title>SUSTC iGEM Theme - Free CSS Template</title> <meta name...")
Line 16: Line 16:
                 <h3>Introduction to scoring system 2</h3>
                 <h3>Introduction to scoring system 2</h3>
              
              
-
                 <p>Scoring System 2 is based on the model created by d'Aubenton Carafa.[1] The score of terminator consists of two parts, the free energy of stemloop and the score of 15 nt poly T tail. The free energy of stemloop is calculated using Loop Dependent Energy Rules[2]. The minimization of the free energy also determined the secondary structure of the stemloop. T tail score is calculated by the formula given by d’ Aubenton Carafa.  
+
                 <p>Scoring System 2 is based on the model created by d'Aubenton Carafa.[1] The score of terminator consists of two parts, the free energy of stemloop and the score of 15 nt poly T tail. The free energy of stemloop is calculated using Loop Dependent Energy Rules[2]. The minimization of the free energy also determined the secondary structure of the stemloop. T tail score is calculated by the formula given by d’ Aubenton Carafa. </p>
-
Detailed Calculation of Score
+
<p>Detailed Calculation of Score</p>
-
1. Some definitions[2]
+
<p>1. Some definitions[2]</p>
-
i. Closing Base Pair
+
<p>i. Closing Base Pair</p>
-
For an RNA sequence, we number it from 5’ to 3’ . If i < j and nucleotides ri and rj form a base pair,we denote it by i.j. We call base ri’ or base pair i’.j’ is accessible from i.j if i < i’ ( < j’ ) < j and if there is no other base pair k.l so that i < k < i’ ( < j’ ) < l < j. We denote the collection of base and base pair accessible from i.j by L(i,j). Then i.j is the closing base pair. Here “L” means loop.
+
<p>For an RNA sequence, we number it from 5’ to 3’ . If i < j and nucleotides ri and rj form a base pair,we denote it by i.j. We call base ri’ or base pair i’.j’ is accessible from i.j if i < i’ ( < j’ ) < j and if there is no other base pair k.l so that i < k < i’ ( < j’ ) < l < j. We denote the collection of base and base pair accessible from i.j by L(i,j). Then i.j is the closing base pair. Here “L” means loop.</p>
-
ii. n-loop
+
<p>ii. n-loop</p>
-
If the loop contain n – 1 base pairs, we denote it by n-loop. (Because there is a closing base pair, so we denote it by n-loop even though the closing base pair is not included in the loop.)
+
<p>If the loop contain n – 1 base pairs, we denote it by n-loop. (Because there is a closing base pair, so we denote it by n-loop even though the closing base pair is not included in the loop.)</p>
-
Here we can divide loops which may be formed in the terminator secondary structure into two kinds.
+
<p>Here we can divide loops which may be formed in the terminator secondary structure into two kinds.</p>
-
1-loop : Hairpin loop(size of loop shouldn’t be smaller than 3)         
+
<p>1-loop : Hairpin loop(size of loop shouldn’t be smaller than 3)        </p>
-
2-loop : Interior Loop(right strand size and left strand size are both bigger than 0.)  
+
<p>2-loop : Interior Loop(right strand size and left strand size are both bigger than 0.)</p>
-
Buldge(Size of one strand is bigger than 0 and that of another strand is 0.)
+
<p>Buldge(Size of one strand is bigger than 0 and that of another strand is 0.)</p>
-
Stack(size of the loop is 0.)
+
<p>Stack(size of the loop is 0.)</p>
-
2. Calculation of the Minimum Free Energy Change of Stemloop Formation[3]
+
<p>2. Calculation of the Minimum Free Energy Change of Stemloop Formation[3]</p>
-
Assume i.j is the closing base pair of the loop  
+
<p>Assume i.j is the closing base pair of the loop </p>
-
G(i,j)= min { GH ( i , j ) , GS( i , j ) + G ( i + 1 , j – 1 ) , GBI( i , j ) } ;
+
<p>G(i,j)= min { GH ( i , j ) , GS( i , j ) + G ( i + 1 , j – 1 ) , GBI( i , j ) } ;</p>
-
GBI ( i , j ) = min{ gbi( i , j , k , l ) + G( k , l ) } for all  0 < k – i + l – j - 2 < max_size
+
<p>GBI ( i , j ) = min{ gbi( i , j , k , l ) + G( k , l ) } for all  0 < k – i + l – j - 2 < max_size</p>
-
G(i,j) is the minimum free energy change of stemloop formation. GH is the free energy change to form a hairpin loop. GS is the free energy change to form a stack. GBI is to calculate the minimum free energy change of structure containing 2-loop. gbi(i,j,k,l) is the free energy change to form 2-loop.
+
<p>G(i,j) is the minimum free energy change of stemloop formation. GH is the free energy change to form a hairpin loop. GS is the free energy change to form a stack. GBI is to calculate the minimum free energy change of structure containing 2-loop. gbi(i,j,k,l) is the free energy change to form 2-loop.</p>
-
3.Calculation of T Tail Score
+
<p>3.Calculation of T Tail Score</p>
-
Here we consider 15 nucleotide in the downstream of stemloop. T tail score nT is calculated as follows :
+
<p>Here we consider 15 nucleotide in the downstream of stemloop. T tail score nT is calculated as follows :</p>
   
   
-
In our program, if the length of the T tail( n ) is less than 15, we will only consider n nucleotides. If TL is more than 15, we will only consider 15 nucleotides.
+
<p>In our program, if the length of the T tail( n ) is less than 15, we will only consider n nucleotides. If TL is more than 15, we will only consider 15 nucleotides.</p>
-
4.Calculation of Score
+
<p>4.Calculation of Score</p>
-
Score = nT * 18.16 + deltaG / LH * 96.59 – 116.87
+
<p>Score = nT * 18.16 + deltaG / LH * 96.59 – 116.87</p>
-
Here nT is T tail score. deltaG is the minimum free energy change of stemloop formation. LH is the length of stemloop.
+
<p>Here nT is T tail score. deltaG is the minimum free energy change of stemloop formation. LH is the length of stemloop.
</p>
</p>
              
              

Revision as of 14:53, 7 September 2012

SUSTC iGEM Theme - Free CSS Template

SUSTC iGEM Theme - Free CSS Template

Introduction to Scoring Systems

Introduction to scoring system 2

Scoring System 2 is based on the model created by d'Aubenton Carafa.[1] The score of terminator consists of two parts, the free energy of stemloop and the score of 15 nt poly T tail. The free energy of stemloop is calculated using Loop Dependent Energy Rules[2]. The minimization of the free energy also determined the secondary structure of the stemloop. T tail score is calculated by the formula given by d’ Aubenton Carafa.

Detailed Calculation of Score

1. Some definitions[2]

i. Closing Base Pair

For an RNA sequence, we number it from 5’ to 3’ . If i < j and nucleotides ri and rj form a base pair,we denote it by i.j. We call base ri’ or base pair i’.j’ is accessible from i.j if i < i’ ( < j’ ) < j and if there is no other base pair k.l so that i < k < i’ ( < j’ ) < l < j. We denote the collection of base and base pair accessible from i.j by L(i,j). Then i.j is the closing base pair. Here “L” means loop.

ii. n-loop

If the loop contain n – 1 base pairs, we denote it by n-loop. (Because there is a closing base pair, so we denote it by n-loop even though the closing base pair is not included in the loop.)

Here we can divide loops which may be formed in the terminator secondary structure into two kinds.

1-loop : Hairpin loop(size of loop shouldn’t be smaller than 3)

2-loop : Interior Loop(right strand size and left strand size are both bigger than 0.)

Buldge(Size of one strand is bigger than 0 and that of another strand is 0.)

Stack(size of the loop is 0.)

2. Calculation of the Minimum Free Energy Change of Stemloop Formation[3]

Assume i.j is the closing base pair of the loop

G(i,j)= min { GH ( i , j ) , GS( i , j ) + G ( i + 1 , j – 1 ) , GBI( i , j ) } ;

GBI ( i , j ) = min{ gbi( i , j , k , l ) + G( k , l ) } for all 0 < k – i + l – j - 2 < max_size

G(i,j) is the minimum free energy change of stemloop formation. GH is the free energy change to form a hairpin loop. GS is the free energy change to form a stack. GBI is to calculate the minimum free energy change of structure containing 2-loop. gbi(i,j,k,l) is the free energy change to form 2-loop.

3.Calculation of T Tail Score

Here we consider 15 nucleotide in the downstream of stemloop. T tail score nT is calculated as follows :

In our program, if the length of the T tail( n ) is less than 15, we will only consider n nucleotides. If TL is more than 15, we will only consider 15 nucleotides.

4.Calculation of Score

Score = nT * 18.16 + deltaG / LH * 96.59 – 116.87

Here nT is T tail score. deltaG is the minimum free energy change of stemloop formation. LH is the length of stemloop.