Team:Buenos Aires/Results/Bb1
From 2012.igem.org
(→Devices Design for Synthetic Ecology) |
(→Kozak Sequence) |
||
Line 49: | Line 49: | ||
== Kozak Sequence == | == Kozak Sequence == | ||
{| | {| | ||
- | | style="width: 65%" | The Kozak sequence is | + | | style="width: 65%" | The Kozak sequence is the eukaryotic analog to the bacterial RBS, it is required for efficient initiation of translation. There is only one yeast Kozak sequence in the registry (part [http://partsregistry.org/Part:BBa_J63003 BBa_J63003], distributed in the [http://partsregistry.org/assembly/libraries.cgi?id=42 2012 kit]). Note that this sequence codes for a glutamic acid (E) after the start codon. |
+ | |||
+ | We decided to create a new part ([http://partsregistry.org/Part:BBa_K792001 BBa_K792001]) using the 5’UTR of the MF-α1 gene of yeast, partly because we used the signal peptide from this gene (see below). This gene is efficiently translated in yeast, and therefore it stands to reason that translation is efficiently initiated on its mRNA. | ||
+ | |||
| align="center" | | | align="center" | | ||
{| class="wikitable" | {| class="wikitable" | ||
Line 56: | Line 59: | ||
! scope="row" style="background: #7ac5e8" |DNA Sequence | ! scope="row" style="background: #7ac5e8" |DNA Sequence | ||
|- | |- | ||
- | |BBa_J63003 | + | |[http://partsregistry.org/Part:BBa_J63003 BBa_J63003] |
|CCCGCCGCCACCATGGAG | |CCCGCCGCCACCATGGAG | ||
|- | |- |
Revision as of 15:34, 26 September 2012
Contents |
Devices Design for Synthetic Ecology
In order for the cross-feeding scheme to work we need each strain to export the amino acid they produce (either Histidine or Tryptophan). To achieve this we created a devices design to secrete to the medium an His (or Trp) rich peptides.
Due to time and resources limitation, we decided to simplify the construction (assembly) process as much as possible. The devices were ordered as a whole (as gBlocks gene fragments) instead of obtaining each constitutive part and then assembling them. Although this goes against the standard part base approach, it saved both time and money. We plan to obtain each constitutive part from the devices by PCR with suffix/prefix containing primers, as a contribution to the registry.
We decided to use yeast expression plasmids with repressible or constitutive promoters, to drive the expression of our devices. This decision was taken because such plasmids were readily available to us, they had adequate selection markers and they are a fairly standard approach in yeast genetics.
The design of the devices with their constitutive parts is schematically represented in Figure 1.
Figure 1: Schematic representation of the (generic) devices for the cross-feeding design. RS: Restriction Sites, Prefix, Suffix: BioBrick standard, Signal: Secretion signal peptide. The ORF is highlighted in violet.
Each device has the following parts:
- Prefix of the BioBrick standard.
- Kozak consensus sequence for initiation of translation.
- Signal peptide that targets the product of the gene for secretion.
- Trojan peptide to increase internalization in target cell.
- Payload: this is the exported amino acid rich domain of the protein.
- Suffix of the BioBrick standard.
Note: Although the prefix and suffix are not considered part of the device, they were included in the ordered gene fragment for future assembly.
In addition, we included convenient restriction sites (RS) for directional cloning into the yeast expression vectors (RS1 and RS3). RS2 will allow easy removal of the sequence coding for the trojan peptide, by restriction and re-ligation.
Assembly Standard Prefix and Suffix
Because the entire ORF is contained within the prefix and suffix, no care for in-frame assembly has to be taken. We decided to use the original RFC10 BioBrick standard.
Figure2: RFC10 BioBrick standard
Kozak Sequence
The Kozak sequence is the eukaryotic analog to the bacterial RBS, it is required for efficient initiation of translation. There is only one yeast Kozak sequence in the registry (part [http://partsregistry.org/Part:BBa_J63003 BBa_J63003], distributed in the [http://partsregistry.org/assembly/libraries.cgi?id=42 2012 kit]). Note that this sequence codes for a glutamic acid (E) after the start codon.
We decided to create a new part ([http://partsregistry.org/Part:BBa_K792001 BBa_K792001]) using the 5’UTR of the MF-α1 gene of yeast, partly because we used the signal peptide from this gene (see below). This gene is efficiently translated in yeast, and therefore it stands to reason that translation is efficiently initiated on its mRNA. |
|
Signal Peptide
The signal peptides target proteins for secretion, effectively exporting the payload. This peptides are cleaved once the protein is in the lumen of the ER, so they won't have any further relevance.
We have two main options for yeast secretion signal peptides:
- MRFPSIFTAVLFAASSALA
- MKVLIVLLAIFAALPLALAQPVISTTVGSAAEGSLDKR
Number 1 is the signal peptide for the yeast alpha-mating factor [Water et al 1987]. Because its a part of a yeast gene, we don’t need to optimize it for yeast. Number 2 is part BBa_K416003 from the registry (not included in the kit), and was designed for yeast [Clements 1991].
The DNA sequence are shown in Table 2
Name | DNA Sequence |
1: MF-alpha1 signal | atgagatttccttcaatttttactgcagttttattcgcagcatcctccgcattagct |
2: BBa_K416003 | atgaaagttttgattgttttgttggctattttcgctgctttgccattggctttggctcaaccagttatttctactactgttggttctgctgctgaaggttcactagataaaaga |
Table 2: DNA sequences for the signal peptides, atg codon in bold
Trojan peptide
Trojan peptides are short (15aa) sequences that penetrate through the plasma membrane inside the cell without the need of any receptor or endocitosis process [Derossi 1998]. We want to use them to increase the efficiency with which the payload enters the target cell. Ideally, they should not contain Trp or His, as those are the relevant amino acids for exportation. Two good candidates are the penetratin from the HIV TAT protein, and polyarginine [Jones et al 2005].
Penetratin | Residue sequence |
---|---|
TAT | YGRKKRRQRRR |
polyarginine | RRRRRRRRRRR |
Table 3: Primary structure for penetratins
This proteins are not from yeast, so we need to retro-translate them. Using the conventions in table 4, we get the DNA sequence in Table 5.
|
|
Table 4: Nucleotide convention for retro-translated sequences
Penetratin | DNA Sequence |
---|---|
TAT | TAYGGNMGNAARAARMGNMGNCARMGNMGN |
polyarginine | MGNMGNMGNMGNMGNMGNMGNMGNMGNMGN |
Table 5: DNA sequence for penetratins. MGN=CGN or AGR
Next we need to codon optimize for yeast, and if possible avoid mRNA secondary structures.
MANU, PONE COMO LO HICISTE Y CON QUE PROGRAMAS; SI HAY UNA TABLA DE CODONES OPTIMOS PARA LEVADURAS PONELA PORFA.
Payloads
The payloads are the elements of the synthetic gene that code for the “amino acid rich” region of the secreted protein. By “a.a. rich” we mean, rich in the amino acid we want to export, Trp or His in our case. This domain should be soluble enough not to cause precipitation of the protein, and should be relatively stable not to be degraded before they are actually secreted from the cell.
Payload | Residue sequence |
---|---|
PolyHa | HNHNHNHNHNHN |
TrpZipper2 | SWTWENGKWTWK |
PolyHb | HGDHDGHGKHKG HGDHDGHGKHKG HGDHDGHGKHKG |
PolyWb | WGDWDGWGKWKG WGDWDGWGKWKG WGDWDGWGKWKG |
Table 6: Payloads protein sequences
PolyHa is Clontech’s His tag sequence. TrpZipper is a small peptide that folds into a beta-hairpin secondary structure. The indole rings of the Trp form a hydrophobic core. The protein is water soluble and monomeric [Cochran 2001].
PolyHb and PolyWb were desing taking into acount the following consideration; 1) avoided repeating the same residue in tandem to minimize local tRNA depleation, 2) avoided Trp in tandem because of therir low solubility, 3) we included Gly to avoid the formation of stable structures and 4) included acidic and basic amino acids to increase solubility. Retro-translating using the conventions of table 4, we obtained the sequences in Table 7.
Payload DNA Sequence
PolyHa | CAYAAYCAYAAYCAYAAYCAYAAYCAYAAYCAY |
TrpZipper2 | WSNTGGACNTGGGARAAYGGNAARTGGACNTGG |
PolyHb | CAYGGNGAYCAYGAYGGNCAYGGNAARCAYAARCAYGGNGAYCAYGAYGGNCAYGGNAARCAYAARCAYGGNGAYCAYGAYGGNCAYGGNAARCAYAAR |
PolyWb | TGGGGNGAYTGGGAYGGNTGGGGNAARTGGAARTGGGGNGAYTGGGAYGGNTGGGGNAARTGGAARTGGGGNGAYTGGGAYGGNTGGGGNAARTGGAAR |
Table 7: Payload’s DNA sequence. WSN=TCN or AGY
ORFs Arrangements
We are planning to synthesize 4 genes with different combinations of signal peptides, trojans and payloads. We want these genes to be immediately functional (no cloning necessary), but we also want o to have the possibility to recombine the parts in different arrangements. The 4 ORFs we will synthesize are described in Table 8.
ORF# | Kozak | Signal | Trojan | Payload |
---|---|---|---|---|
1 | MF1[-12,6] | 1: MF1 signal | TAT | PoliHa |
2 | MF1[-12,6] | 1: MF1 signal | TAT | TrpZipper2 |
3 | BBa_J63003 | BBa_K416003 | polyarginine | PoliHb |
4 | BBa_J63003 | BBa_K416003 | polyarginine | PoliWb |
Table 8: Arrangement of the synthesized ORFs
This arrangement would allow to compare ORF1 with ORF2, and ORF3 with ORF4.
Expression Plasmids
To determine which restriction sites to use for cloning (RS1 and RS3 in Figure 1), we need to know the MCS of the expression plasmids we are going to use.
One of these plasmids will probably of the pCM180-5 series, which are centromeric plasmids with TRP1 marker, and with a doxycycline repressible promoter [Gari et al 1996].
The Multiple Cloning Site of these vectors is shown in Figure 3.
Figure 3. Multiple Cloning Site (MCS) of the pCM180 series plasmids.
The other plasmid we might use is pEG202, with a 2 ori, HIS3 marker and a constitutive promoter (PADH1).
Figure 4: pEG202 MCS sequence and restriction sites.
Comparing Figure 3 and Figure 4, BamHI and NotI appear in both MCS in the same order, so they are good candidates for RS1 and RS3 (Figure 1) respectively.
We will probably need to clone the construct in a general purpose plasmid for manipulation. For instance, to remove the trojan we need to clone the construct into a plasmid, cut it with the RE of RS2, precipitate the DNA (to get rid of the trojan fragment), and religate. For this to work we have to make sure that there is no RS2 in the vector. A common vector for this is pBluescript, wich has the MCS shown in Figure 5.
Figure 5: pBluescript Multiple Cloning Site
Restriction Enzymes
The restriction sites used for RS1-3 have to be different from the ones used in the BioBricks standard. The standard RE for BioBricks are EcoRI, NotI, XbaI, SpeI and PstI.
Unfortunately, NotI was our candidate for RS3, so we have a problem here. There are different solutions. We can either use two restriction sites instead of RS3, one for each plasmid, or we can change the BioBrick standard to something like RFC[21] (Berkeley standard) that has no NotI restriction site.
An other option would be to use the restriction sites in the prefix and suffix to clone the construct into the expression plasmid. This is appealing because we don´t need new REs. Anyhow we would need to include a RS in the 5’ end to be able to directionally clone into pCM180. The new design would look something like Figure 6.
Figure 6: Alternative scheme for the restriction sites
If we make RS4 a BamHI site, we can directionally clone the construct into both plasmids (pCM180 and pEG202 ) by cutting with BamHI and NotI. In this design we would not need RS1 and RS3, but we can include them just in case we need to clone them into an other vector.
Regarding RS2 we need a restriction enzyme that produces cohesive ends, codes for acceptable amino acids, is easily available and not used in an other part of the construct. Some candidate RE are listed in Table 9.
R. Enzyme | R. Site sequence | Overhang | Codes for |
---|---|---|---|
HindIII | A/AGCTT | AGCT | Lys-Leu (KL) |
XhoI | C/TCGAG | TCGA | Leu-Glu (LE) |
Table 9. Candidate restriction enzymes for RS2
Probably any of them will work, but the trojan peptide needs to be basic so the HindIII site looks better suited. If we want to remove the trojan, we will have to clone the construct into a vector with no HindIII site. One way to do this is to clone it into pBluescript in such a way that the HindIII restriction site of the MCS is removed.
Looking at Figure 5 we can see that if we cut pBluescript with XhoI and PstI, the HindIII site is removed. If we make RS1 -> XhoI (which is easily available) we can cut the construct with these same enzymes and directionally clone it into pBluescript.
Most likely we wont use RS3, but we can assign it a restriction site just in case. For example NcoI could be used instead of NotI to do the directional cloning into pEG202.
RS# | R. Enzyme | R. Site Sequence |
---|---|---|
RS1 | XhoI | C/TCGAG |
RS2 | HindIII | A/AGCTT |
RS3 | NcoI | C/CATGG |
RS4 | BamHI | G/GATCC |
Table 10. Assignation of restriction enzymes to the different restriction sites.
When we order the construct we will probably have to specify in which vector we want it shipped. RS3 (and perhaps RS1) could be changed depending on this vector. For example we could order the gene in pBluescript, in which case it might be convenient to make RS3 -> SacI. For other plasmids other RS might be needed, but if possible it would be convenient to retain RS1 -> XhoI that allows us to clone the gene into pBluescript as described above.