Team:Peking/Project/Luminesensor/Design
From 2012.igem.org
This summer, our team wished to create a novel optogenetic tool that could really make a big difference in the field of synthetic biology. But what should the difference be? In order to answer this question, we should perform a brief retrospect to meditate upon what accomplishment has already been achieved and what problems remained to be solved in optogenetic research.
Design: A Retrospect
During the past decade, numerous light-sensitive optogenetic modules have been designed and characterized. Equipped with these optogenetic tools, scientists have been able to manipulate cellular behavior to achieve some truly fascinating goals, such as tuning the expression of genes through the intensity of light, making bacteria capable of detecting the edge of a light illuminated pattern, illuminating specific positions on the cell to trigger pseudopod formation or cell polarization, or even recruiting fluorescent proteins to cell membrane to print patterns on the cell.
Virtually all of these light-sensitive optogenetic modules have been designed following two similar general principles –- attaching a physiologically functional domain to a photoreceptor domain or rewiring physiological pathways to the downstream of the signaling pathways of photoreceptor proteins in order to use light to trigger physiological response. These two principles, whose concepts easy to understand, actually bear in their kernel the spirit of modular design which has been so strongly proposed in synthetic biology, with the former principle particularly versatile and easy to manipulate.
Thus we reasoned that the first principle is an appropriate paradigm to follow. However, in order to make our novel optogenetic tool an unprecedented success, the functional domain, photoreceptor domain and the connection between them must be carefully chosen and sophistically (?) designed to solve several important problems that have become gradually manifest along with the development of optogenetic research, for they may impede the future application of our optogentic tool.
First of all, in order to be widely applicable for biotechnological use, a photosensitive module must be truly ‘sensitive’. Some of the existing photosensitive modules work best around light intensity of 10W/m2, which is about the illuminance inside a room on a sunny day; some even require laser beam to activate, whose light intensity can hardly be achieved in natural environment. Apparently, the dependency on high light intensity is severely defective, because high-intensity light could be detrimental to cellular function or can even cause cell death. Moreover, this defect limits the application of optogenetics in future possible scenarios where photosensitive modules may be required to respond to light that is soft, cell-and tissue-friendly, or light emitted by biological organisms, or, even more stringent, light emitted by bacteria, which is by no means brighter than the dim light of the moon.
Secondly, the photosensitive modules must be widely compatible. Most of current photosensitive modules have utilized light-sensitive domains of photoreceptors of eukaryotic origin, which often fail to work in prokaryotic organisms. Yet comparing to eukaryotic organisms, prokaryotic cells require significantly less stringent culturing conditions and are considerably less susceptible to environmental pollutions. What’s more, some of current photosensitive modules utilize photoreceptor domains such as phytochromes where exogenously supplied specific chormophores are necessary for its normal function. These two obvious defects substantially block the way to apply optogenetics to synthetic biology where in-field application is very much considered.
Thirdly, the design of a successful photosensitive module should be instructive to future photosensitive module engineering, which is particularly important for synthetic biology. In another word, its design should be modular enough to make possible the reshuffling of functional domain to construct modules with novel physiological functions. With no much deduction, one can easily postulate that this would require equal amount of modularity in the original photosensor domain and physiologically functional domain based on which the design was constructed.
Now, with the little retrospect above, we have determined the ‘differences’ we would like to make: sensitivity, compatibility and modularity. But how to create a photosensitive optogenetic tool that is ultra-sensitive, widely compatible and highly modular all at the same time? To address this problem, we need to start from choosing a promising light-sensitive photoreceptor domain—the core of our design.
Design: Choosing the photoreceptor
domain
From the discussion above, we have already formed a few criteria that the photoreceptor domain must satisfy. It should have been proved to be highly sensitive, should incorporate no chromophore that cannot be synthesized by bacteria, and should be modular in structure and function. Based on these standards we can go through the photoreceptor proteins previously used in optogenetic research, examine them one by one, and rule out inappropriate ones one by one to finally reach the most proper light-sensitive photoreceptor domain for our design.
The light-sensitive domains of most of existing optogenetic modules can be categorized into three major groups: rhodopsins, phytochromes, and phototropins, each with distinct characters and utilities.
Rhodopsins are membrane anchored proteins that can be devided into two subtypes. Both of them detect light through a retinal chromophore. Type 1 rhodopsins, with bacteria rhodopsin as a prototypical representative, directly drive ionic currents through a cycle of light-induced conformational change. Type 2 rhodopsins, with bovine rod rhodopsin as a representative, regulate ionic currents through a downstream signaling pathway triggered by light-induced conformational change in the molecule. The type 1 rhodopsins proved very useful in neuron science. For example, by expressing Channelrhodopsin-2 (ChR2), a light-sensitive ion-channel from green algae which will become permeable to cations after being triggered by blue light, under the control of a tissue specific promoter, scientists have achieved precise and timed light-dependent activation of specific neuronal cells. Type 2 rhodopsons can also achieve light-dependent spatial-temporal control of gene expression through synthetic rewiring of calmodulin signaling pathway which will activate nuclear transcription to the downstream of signaling pathway of rhodopsins such as melanopsin, which will, upon light activation, trigger calcium influx. However, rhodopsins may not be suitable for our project, partly due to the extraordinary integrity of its structure and its membrane-anchored nature, which makes it difficult to fit into the framework of the first principle we have mentioned before, and partly due to the requirement of exogenous supply of the chromophore—retinal, which is not involved in common bacteria metabolism pathways. These two unique features clearly contradict with our goal—‘modularity’ and ‘compatibility’, and thus ruled rhodopsin out of our consideration.
The plant phytochrome B (PhyB) is an outstanding representative of phtochrome photosensors. PhyB photoconverts between two forms: when PhyB in its inactive form (Pr) absorbs red light, it converts into its active form (Pfr), and reconverts back into Pr form slowly, when left in the dark, or rapidly, when exposed to far red light. In its physiologically active form Pr, PhyB is able to interact with its nuclear signaling partener Phytochrome Interacting Factor 3 (PIF3) to form a dimer. By fusing DNA binding domain and transcription activating domain of GAL4 to PhyB and PIF3, respectively, scientists have been able to create a light based yeast-two-hybrid system. As a variation of the theme, scientists have managed to utilize this light-dependent protein-protein interaction to recruit cell signaling molecules to the plasma membrane of mammalian cells to trigger responses such as pseudopod formation and cell polarization. Although radiating with such promise, the phytochrome still cannot be included in our system, simply because it requires, again, the exogenous supply of chromophore (in the PhyB’s case, phycocyanobilin (PCB) ). Although solutions such as incorporating into cells metabolic pathways which will enable them to synthesize these chromophores have been proposed, these proposals will greatly complicate the process of design and application of the light-sensitive system. Although Phytochormes have been excluded from our plan, its mechanism greatly enlightened us. Those previous designs that have utilized the heterodimerization of PhyB and PIF3 have grabbed the essence of a common theme in the functioning of cellular processes—the protein-protein interaction, and have been exemplary in demonstrating a modular and versatile design. These particular characters are precisely the things we are looking for. When we are going further to seek our ideal photosensor domain, we shall keep this example in mind.
Now we have come to phtotropins. We gladly found that among all these three groups of commonly used light-sensitive domains, phototropins possesses the most distinct modular structure. Phototropins have a structurally conserved light sensor domain, termed LOV (light, oxygen and voltage) domain, which is easily discernable and often precedes, within a single reading frame, a sequence of an enzymatically functional domain, connected by the sequence of a linker domain. Despite its structural modularity, the LOV light sensitive domain also seems to possess an outstanding functional modularity, for it was found to combine with various functional domains to perform various cellular functions. Several classes of LOV domain–effector domain combinations have hitherto been described, such as LOV-HKs combination, performing histidine kinase function, LOV GGDEF–EAL combination, predicted to regulate the synthesis and hydrolysis of cyclic di-GMP, and LOV_HTH combination, performing light-induced DNA binding activity. Such versatility already existing in nature strongly suggested that a newly designed artificial fusion protein that fuses the LOV photosensitive domain to a physiologically functional domain to create a non-pre-existing combination would probably work. LOV photosensor domain seems to bear much more virtue than just its modularity. LOV domains have a non-covalently bound flavin (FMN or FAD) chromophore that is absolutely essential for its function. Unlike those choromophres of rhodopsins and phytochormes, flavin is an essential chemical compound deeply involved in the respiratory chain in all forms of life, including bacteria. In another word, in order to let this LOV domain function in bacteria, we only need to incorporate the gene into the cell, and the cell will automatically supply the protein with the chromophore. So far our good phototropin has satisfied two of our requirements: modularity and compatibility. So how about its sensitivity? To date, scientists have, for example, attached Rac1 protein to LOV2 domain from phototropin1 originated from Avena sativa to achieve photoactivatable cell motility, or shuffled the histidine kinase domain of FixL protein, which belongs to a two-component system, to the downstream of the LOV domain of B. subtilis YtvA protein to create a light-regulated histidine kinase, and so on. Among these designs, a particular one that utilizes the photosensor domain of a Vivid (VVD) protein, which originated from Neurospora. Crassa, camptured our attention. The VVD-GAL fusion protein thus designed achieved a rather high light-sensitivity—about 0.04W/m2. This is definitely thrilling news. Moreover, VVD was shown to form a rapidly exchanging homodimer upon blue-light activation, which means, like the Phy-PIF system, the general theme of protein-protein interaction might also be applied to this particular protein. Adding more promise to this VVD protein is its size—the smallest LOV domain containing protein known. This feature makes it easy to engineer and more likely to be stably expressed in bacteria systems.
Based on the discussion above, we have finally chosen the smallest LOV protein (or phototropin) VVD protein’s photosensor domain as our photoreceptor domain. Due to its small size, structural and functional modularity, bacteria compatibility, previous experience of being highly sensitive and its photoswitching mechanism, we reasoned that VVD photosensor domain would be an enabling tool in our coming design of a novel optogenetic module.
Our next step is to choose a physiologically functional domain for our new optogenetic module.
Design: Choosing the physiologically
functional domain
Now we have to choose a physiologically functional domain for our optogenetic module following the general principle we have described in our brief retrospect.
To do this, we need to decide in first place what physiological function we would like our module to achieve. Considering the limited amount of time we have, we need the function of our light-sensitive module easy to manipulate and characterize; and to make our light-sensitive module widely applicable in biotechnological field, we need its function to be modular (?). So we choose the most conventional and fundamental way of controlling cellular behavior as our module’s function: tuning gene expression through transcriptional control.
In bacteria system, transcriptional control of gene expression is often achieved trough the association and disassociation of transcriptional factors (TF) with target sequence on or around promoter site of a particular gene. Among these bacteria Transcriptional Factors, transcription inhibitors take up the major proportion and, coincidentally, many of the inhibitors require dimerization for its inhibitory function. What’s more, these transcriptional factors that require dimerization to function usually consist a DNA binding domain. Now, based on these facts, we can envisage a rough blueprint of our design: the DNA binding domain of a transcription inhibitor shall be fused to the downstream of VVD photosensor domain, and upon blue-light activation, VVD domain will dimerize, helping the DNA binding domain dimerize at the same time to enable its DNA binding activity and inhibition of transcription initiation.
But what transcription factor should we use exactly? In order to work well with our VVD photosensor domain, the DNA binding domain of the transcriptional factor must be easily discernable in the whole structure and had better been proved to function independent of the rest part of the protein. Of course, the DNA binding activity of the transcriptional factor we choose must be specific and rigorously require dimerization.
Based on these criteria, a LexA transcription repressor came into our sight.
LexA is a transcription repressor of all the genes in the SOS system in E.coli, and its crystal structure has been resolved at high-resolution. LexA protein consists of a N-terminal DNA binding domain and a C-terminal dimerization domain with a short hydrophilic linker linking the two separate domains, making the structure impressively modular. Under normal physiological conditions, two LexA proteins will form a homodimer through the dimerization domain interface and bind to the SOS box target gene sequence in promoters of genes in SOS system and create significant steric hindrance for transcription polymerase binding and thus inhibit transcription initiation. When E.coli cell encounters some extracellular factors that would cause intracellular damage, for example, DNA breakage that would result in ssDNA formation, the RecA protein, which also belongs to the SOS family, will bind to the ssDNA to form a filament and interact with LexA and cause its auto-cleavage. The cleavage will cause the DNA binding domain to dissociate from the dimerization domain. Without the help of dimerization domain, the DNA binding domain will no longer stay in its dimerized form. LexA DNA binding domain has been shown to have a thousand-fold lower binding affinity to its target sequence in its monomer form compared to its dimerized form. Thus the cleaved LexA can no longer bind efficiently to its target sequence to block the way for transcription initiation, and the target gene will be expressed.
From these facts, we can conclude that the LexA repressor is the suitable physiological functional domain for our light-sensitive module, as it matches all of our criteria. First of all, a high-resolution crystal structure clearly shows its structural modularity. Beside the apparent structural modularity, various researches have utilized LexA to create versatile gene expression regulating modules that have convincingly demonstrated its functional modularity, further adding to its promise. (For example, a LexA-based genetic system has been devised for monitoring and analyzing protein heterodimerization in Escherichia coli.) Moreover, the DNA binding domain of LexA rigorously requires dimerization to bind efficiently to its targeting site (SOS box), and DNA binding domain itself has no dimerization ability. In another word, when we fuse the LexA DNA binding domain to the VVD photosensor domain, the fusion protein can bind to the SOS box only when light has triggered the VVD domains to form a homodimer and thus helping the DNA binding domains to dimerize. Last but not least, LexA is bacteria endogenous protein, which means it is likely to be efficiently expressed and function normally in bacteria systems.
With the discussion above, we have decided to use DNA binding domain of LexA repressor protein as the physiologically functional domain. With a flexible linker linking the two distinct domains (?), we now have the panorama of our design of photosensitive fusion protein. We term this VVD-LexA based photosensitive fusion protein ‘luminesensor’.
Design: ensuring orthogonality
Perhaps a careful reader has already noticed a severe defect in our design. That is, by choosing the DNA binding domain of a bacteria endogenous transcription repressor, we must use reporter genes that will also be repressed by endogenous LexA protein. If we do not introduce changes into the DNA binding domain of our luminesensor, the only way to rule out the interference of endogenous LexA would be knocking out the genomic sequence of LexA protein. This will indubitably diminish its applicability in the future, since it will not work in any strain with endogenous LexA remaining.
So we set out to find a solution to this important problem. The most natural way would be creating a mutated form of LexA so that it will only recognize a target sequence different from the sequence the wild-type LexA would recognize. We gladly found that such a thing actually exists. It is called the LexA408 variant, which carries mutations PA40, NS41 and AS42 and has been shown much higher affinity for a symmetrically altered target sequence CCGT (N)8 ACGG, different from the wild type SOS box CTGT (N)8 ACAG, which is specifically recognized by wild-type LexA. Thus by introducing the 408 mutations into the DNA binding domain of our luminesensor, we can insulate our light-sensitive system from the interference from the genetic context of the host strain. This outstanding character that have ensured the orthogonality of our system to the endogenous SOS system will greatly enhance the compatibility of our module to those synthetic modules that have hitherto been designed and thus add to the promise of future application.
Design: dynamic performance optimization
Though our Luminesensors has outshined many previously designed optogenetic modules in many aspects, space still remains for its improvement. A particularly important matter would be the dynamic performance of our Luminesensors. Signal-background ratio is always a significant judging criterion in synthetic modules. The more digital-like the output may be, the more likely the module would function efficiently in a complex biological system where wanted and unwanted noises are inevitable. Though our Luminesensors has already shown a high background-signal ratio, we still deem dynamic range optimization necessary, since background-signal ratio can never be too high. Another concern would be whether the system is resettable. In our Luminesensors system, this equals to say whether our Luminesensors will be able to decay back fast to its dark state when it is no longer exposed to light. This is particular important when our system is to be applied in a scenario where cell movement is inevitable. For example, imagine we are using liquid medium to culture our Luminesensors containing bacteria and applying to a proportion of the medium blue light to inhibit reporter gene, say GFP, expression to create some sort of pattern, cells that have received blue light will constantly swim out of the illuminated area. If our Luminesensors’s decay half-life is very long, the light-received cells will eventually distribute evenly in those dark area, so no pattern would form. Unfortunately, VVD protein, the photosensor domain we use, has a typical light-to-dark decay half-life of hours. This makes our decay half-life optimization extremely important.
In order to rationally optimize our Luminesensors’s dynamic range and decay half-life, we need to build a dynamic model in the first place to simulate our Luminesensors system and determine which ones of those parameters in the system would be decisive to the two dynamic features we are interested in.
We listed all possible states of the molecules involved in our Luminesensors system and transitions between those states and created an ordinary deferential equation system based on law of mass action to describe the dynamic behavior of our system. After in-silico simulation, we singled out 4 critical parameters in our system that would determine the above two aspects of our system’s dynamic performance.
Two of them will affect the background-signal ratio, namely the VVD protein dimerization equilibrium constant and dimerized Luminesensors’s DNA binding equilibrium constant. This is easy to understand intuitively. Because the tighter the Luminesensorss dimerize when illuminated by light, the greater the intracellular concentration of dimerized Luminesensors will be. And the tighter the dimerized Luminesensorss bind the target DNA sequence, the greater proportion of target gene will be repressed, thus reducing the background of reporter gene expression under light. The other two parameters are responsible for the decay half-life, namely the rate constant of VVD photosensor domain’s decaying from the light-state back to its dark-state and rate constant of monomer luminesenor’s disassociating form target DNA sequence. This is also easy to understand intuitively. The faster the VVD proteins decay back to its dark-state, the faster the Luminesensors will turn from it dimerized form to its monomer form, which is significantly less efficient in binding target DNA sequence. And the faster the monomer Luminesensorss disassociate from the target sequence, the faster the steric hindrance to RNA polymerase created by DNA-bound Luminesensorss will be lifted, thus reducing the amount of time required for the system to reset.
Having singled out these four critical parameters, we set out to search through literature to find mutations that would enhance them. We found that there does not seem to exist mutations that would enhance dimerized Luminesensors’s DNA binding equilibrium constant. Rather, those mutations seem to undermine the DNA binding affinity, which would be detrimental to our system. We further reasoned that any mutation that would facilitate the disassociation of monomer Luminesensorss from target DNA sequence is also likely to undermine the DNA binding affinity. So we focused on looking for VVD photosensor domain mutations that would promote VVD dimerization and VVD decay. We eventually localized on two specific mutations of VVD photosensor domain: a M135I mutation that will enhance the VVD dimerization, and a I74V mutation in the vicinity of FAD molecule that will enhance the VVD decay rate.
Having determined the plausible mutations, when then introduced mutation M135I, I74V and the combination of the two mutations into our 408 form of Luminesensors. By characterizing the dynamic behavior of the mutated Luminesensors along time course, we discovered that mutation M135I indeed improved the signal-background ratio significantly. However, I74V mutation seems to reduce the signal-background ratio dramatically, rendering our Luminesensors useless. So disregarding of the faster decay half-life it may provide, we have to rule I74V out of our design.