Team:Amsterdam/modeling/generaldesign

From 2012.igem.org

The molecular design in a nutshell

We established in a very early stage of the design process to use DNA methylation as the molecular mechanism to create our storage mechanism. DNA methylation, which means enriching a specific nucleotide with a methyl group, is performed by a group of proteins called the methyltransferases. In making this design choice, we were heavily influenced by the DamID technology (4,5). Van Steensel was interested in the binding sites in drosophila genomes of various eukaryotic transcription factors. By fusing a bacterial methyltransferase (MTase) to these transcription factors, he was able to infer the transcription factor binding sites by reading out which genomic regions were methylated.

We thought we could reverse this idea to create a memory unit, at the core constituted by an MTase that would methylate an especially designated genomic region but only in reaction to the sensing of a signal by the cell. The epigenetic status of the bit region could thus either be 1 (methylated or written) or 0 (ummethylated or unwritten), effectively forming a binary memory unit.

Finally, the methylation status of the bit region can be assessed using digestion of the plasmid extracts with the MTase-coupled RE, followed by analyzing the band lengths intensities of a gel electrophoresis of the product.

Choosing a methyltransferase

To create this system, we first required an MTase that: i) is not already present in the chassis organism E.coli, ii) has a binding motif that is not methylated by any other MTases in the system and iii) for which a restriction enzyme (RE) dependent upon the action of this methyltransferase has been identified. This last condition is always true for any methyltransferase which has been identified to be part of a bacterial [http://en.wikipedia.org/wiki/Restriction_modification_system Restriction/Modification system]. We wrote a python script to mine the [http://rebase.neb.com/rebase/rebase.html REBASE] database of REs and MTases for a MTase that meets our needs. We found an ideal candidate: M.ScaI, originally from Streptomyces caespitosus. The only remaining design question was how to control the activity of the fusion protein as we would want it to only be active in the presence of the generic signal. This design question will be more elaborately discussed in the next section.

Extending the idea to multiple bits

Extending the design idea further, we realized that it should be possible to register and store the presence of multiple signals smultaneously in a single cell. This extension can be accomplished by fusing DNA binding proteins to the methyltransferase and adding their corresponding DNA binding motifs to the bit regions of the DNA. Either traditional bacterial transcriptional regulators could be used for this purpose or the Zinc-Finger Array (ZFA) technology (1). ZFA allows the construction of highly specific artificial protein-DNA interactions (2).

By fusing the MTase to a Zinc Finger or traditional transcriptional regulator and extending the bit region with the transcriptional regulator’s binding motif, the binding affinity of the fusion protein for the DNA motif could increase about 18-fold (3) (a C5-cytosine MTase was studied here, not in the class of N4C-MTases to which M.ScaI belongs). This will heavily diminish the possible aspecific cross-talk between FPs and bits assigned to other FPs.

The wide range of available Zinc Fingers could also allows for the usage of the same MTase for multiple signal. This assumes that the site specificity of the Zinc Finger Array is much higher than the site specifity of the MTase, which is very plausible (3). In this setup there would be a very low chance of cross-talk between FPs and other signals’ bits.

The ZF-binding sites and methylation motif groups are distributed over the plasmid in a way that the distances between the TF-binding sites are unique: for all binding restriction sites <math>i</math> the intermediate genomic distance with the next bit, $D(i,i+1)$, has to be: $$ D(i,i+1) = \frac{D(i-1,i)}{2} $$ By designing the plasmid to adhere to this rule, a unique pattern of fragment lengths between each of the TF binding sites - or bits - will result for each combination of bit statuses. One plasmid will then be able to hold <math> \log_2 (\text{Plasmid BP length}) - \log_2 (256) </math> bits with the current achievable resolution in gel electrophoresis techniques (resolution down to 300 bp and <math>2^{7} = 256</math>). For the typical plasmid base pair length of <math>6 \cdot 10^{4}</math> nucleotides this is <math>8\ \text{bits}</math>.

File:Multiplebits.png

A plasmid storing three bits in which site 0 is always cleaved. Notice the unique distances between bits 1, 2 and 3. The value of each bit can be inferred from the unique band lengths pattern after digestion

Extending from multiple to ~64 bits

The amount of bits stored on a single plasmid could be enlarged even more. In the previous section all bits used the same MTase and are digested using the same restriction enzyme with the potential to store up to 8 bits. By finding a new MTase-RE combination and engineering the memory plasmid to have bits for this new combination slightly offset from the original MTase-RE binding sites, 8 additional bits will be storable on the plasmid. This requires that during the read-out step, digestions are performed for each RE individually. It is essential that only a single RE-type is for each individual plasmid extract to retain the ability to read out the bits from the unique band lengths. For each RE used, a separate band on the final gel must be used and determination of bit values will proceed in the same way as for single MTase memory modules. A plasmid bp-size of 60k will allow for about 8 different RE’s and thus:

$$ 8\ [\text{RE}] \cdot [\frac{8\ \text{bits}}{\text{RE}}] = 64\ \text{bits} $$

Signal strength measurement

Binary encoding of a signal's presence is useful, but often in biology gaining insight in the quantity of a signal is even more relevant. By coupling different promoters with different strengths as responses to the same signal and assigning each promoter its own unique bit, the range in which the strength of the signal is located can be measured.

References

^{Fu, Fengli, Jeffry D. Sander, Morgan Maeder, Stacey Thibodeau-Beganny, J. Keith Joung, Drena Dobbs, Leslie Miller, and Daniel F. Voytas. 2009. “Zinc Finger Database (ZiFDB): a repository for information on C2H2 zinc fingers and engineered zinc-finger arrays.” Nucleic acids research 37 (jan): 279. doi:10.1093/nar/gkn606. http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2686427\&tool=pmcentrez\&rendertype=abstract}

^{Kaseniit, K. E., Perli, S. D., & Lu, T. K. (2011). Designing extensible protein-DNA interactions for synthetic biology. 2011 IEEE Biomedical Circuits and Systems Conference (BioCAS), 349–352. doi:10.1109/BioCAS.2011.6107799}

^{McNamara, A. R., Hurd, P. J., Smith, A. E. F., & Ford, K. G. (2002). Characterisation of site-biased DNA methyltransferases: specificity, affinity and subsite relationships. Nucleic acids research, 30(17), 3818–30. Retrieved from http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=137423&tool=pmcentrez&rendertype=abstract}

^{van Steensel, B., Delrow, J., & Henikoff, S. (2001). Chromatin profiling using targeted DNA adenine methyltransferase. Nature genetics, 27(3), 304–8. doi:10.1038/85871
</span>}

^{van Steensel, B., & Henikoff, S. (2000). Identification of in vivo DNA targets of chromatin proteins using tethered dam methyltransferase. Nature biotechnology, 18(4), 424–8. doi:10.1038/74487}

</div>

</html>