Team:TU Munich/Project/Caffeine
From 2012.igem.org
Contents |
Background and principles
Caffeine is a purine- alkaloid and its biosynthesis is known in coffee plants and tea plants, for example. Its chemical structure is similar to the ribonucleoside adenosine. Hence it can block specific receptors in the hypothalamus in a competitive manner, which leads to decreased neurotransmitter- release and therefore decreased neuron activity. Biological background is to beware the brain of overexertion by inducing sleep and that is the reason for using coffeine to stay awake. On average, one cup of coffee contains about 50 - 130 mg Caffeine.
At higher doses (1g), caffeine leads to higher pulse rates and hyperactivity, but until that i think the beer would have done its work already...
Caffeine was shown to decrease the growth of E. Coli and Yeast reversibly as of a concentration of 0,1% by acting as a mutagen (Putrament et al., On the Specificity of Caffeine Effects, MGG, 1972), but previous caffeine synthesis experiments (see below) have only led to a concentration of about 5 µg/g (per g fresh weight of tobacco leaves), so i do not think we would reach the problematic concentration.
It has already been achieved to produce caffeine in tobacco plants ( Uefuji et al., 2005; Yun- Soo Kim et al., 2007 ), but has never been performed in yeast.
Biosynthesis:
The biosynthetic pathway of caffeine (1,3,7 Trimethylxanthine) starts with xanthosine, which is a natural component of the purine- metabolism of all organism. Necessary for its production are three distinct N- methyl transferases and one nucleosidase, whereupon it has not been totally elucidated whether the nucleosidase reaction is catalyzed by any purine nucleosidase or by the first N- methyl transferase of the reaction cascade shown in the picture (but the latter assumption is favoured (H. Ashihara et al., 2008), because an in vitro synthesis of caffeine with the three N- methyl transferases has already been shown). I have indicated, that the caffeine syntase (last reaction step) can catalyze both, the conversion of 7- methyl xanthine to theobromine and the methylation of theobromine to caffeine. This is true, indeed, but Uefuji et al. (2003) showed, that the affinity to 7- methyl xanthosine is less than one sixth of that of CaMXMT1 (there are two isoformes of CaMXMT), so it is much better to express both enzymes. One can also see the Km values for the required enzymes in this paper - it shows that the substrate affinity decreases continiously towards the endpoint (caffeine), "making the reaction proceed irreversibly and stepwise" (Uefuji et al., 2003, p.377).
The chemical compound xanthosine is produced via at least four different routes, shown in the picture "xanthosine routes". To improve caffeine production, these pathways could be a possible target for metabolic engineering. Anyway it should be interesting/ necessary for this project to determine the in vivo xanthosine concentration of yeast.
Catabolism:
Caffeine is demethylated to theophylline by 7N- demethylase (main pathway). The decreased rate of this reaction is the reason for the accumulating caffeine in the plant. Afterwards theophylline is degraded to xanthine via 3- methylxanthine and xanthine enters the conventional purine catabolism pathway (degradation to CO2 and NH3) (see H. Ashihara et al., 2008, p. 846). This catabolistic pathway is another possible target for metabolic engineering to increase the amount of caffeine (e.g. partially inhibition of 7N- demethylase (?))
Idea
The idea is to perform a heterologous gene expression of the distinct N-methyl transferases required for caffeine biosynthesis. The research groups which accomplished caffeine production in transgenic tobacco plants used the following three genes:
- CaXMT1 (AB048793)
- UniProt entry: Q9AVK0
- E.C.: 2.1.1.158
- PDB: 3D- Structure: http://www.pdb.org/pdb/explore/explore.do?structureId=2eg5 (Structure of C. canephora- CaXMT1)
- Complete mRNA- sequence: http://www.ncbi.nlm.nih.gov/nuccore/AB048793 CaXMT1 (coffea arabica)
- Coding sequence: bp 45 - 1163 ==> total length: 1119 bp
- Problematic restriction sites: EcoR1 at bp 104- 109
- CaMXMT1 (AB048794)
- UniProt entry: Q9AVJ9
- 2.1.1.159
- This gene is eventually not essential, for CaDXMT1 is able to catalyze this reaction, too.
- Complete mRNA- sequence: http://www.ncbi.nlm.nih.gov/nuccore/AB048794 CaMXMT1 (coffea arabica)
- Coding sequence: bp 32 - 1168 ==> total length: 1137 bp
- Problematic restriction sites: EcoR1 at bp 722- 728
- CaDXMT1 (AB084125)
- UniProt entry: Q8H0D2
- 2.1.1.160
- PDB: 3D- Structure: http://www.pdb.org/pdb/explore/explore.do?structureId=2efj (Structure of C. canephora- CaDXMT1)
- Complete mRNA- sequence: http://www.ncbi.nlm.nih.gov/nuccore/AB084125 CaDXMT1 (coffea arabica)
- Coding sequence: bp 1- 1155 ==> total length: 1155 bp
- Problematic restriction sites: EcoR1 at bp 691- 697 and at bp 705- 711
General:
In any case, we will have to order special RBS from the Partsregistry, for example http://partsregistry.org/wiki/index.php?title=Part:BBa_J63003 BBa_J63003, because they exhibit organism- specificity. However, an RBS is not essential in eukaryotic gene translation.
All mentionend methyltransferases use SAM als methyl- donor and are located in the cytoplasm of the plants. Furthermore they exist as homodimers, being also able to form heterodimers with each other (see [http://www.brenda-enzymes.org/php/result_flat.php4?ecno=2.1.1.160| BRENDA], also for further characteristics). The temperature and pH optimum of all three enzymes is quite similar between 20°C - 37°C and 7,5 - 8,5, respectively. (On a recent brewery guide tour i happend to learn that the pH of beer is slightly acid, but i do not know how much influence that would have on the enzyme activity).
General remarks and issues
Analytical Methods
- Detection and quantification of the produced caffeine can be performed by the use of HPLC. Uefuji et al., 2005; describe those and other relevant methods in this context.
- Proof of the gene- expression (of the necessary methyl- transferases) could be accomplished by standard methods (RT PCR and Western Blot, see also Uefuji et al., 2005; and Yun- Soo Kim and Hiroshi Sano, 2007;)
Gene Expression
- To increase gene expression it is possible to work with double promoter constructs. I am working with that in my bachelors thesis and my adviser said this would be an often underrated possibility to improve gene expression. Perhaps we can make use of this.
cDNA Synthesis
- If we do not want to order the sequences for the N- methyl transferases, another possibility would be an isolation of the corresponding genes by cDNA synthesis. Of course, it would be much more elaborate and i also do not know, wether it would be cheaper in the end. Anyway, Uefuji et al. (2003) show how they got their sequences. Interesting thing: Due to high sequence- homology of those three genes (partly up to 97%), one primer- pair is enough, to generate all three cDNAs, because their sequences (containing both, start and stop- codon) are absolutely identical in each gene.
Biobricks and sequences
CaXMT1
>gi|13365750|dbj|AB048793.1| Coffea arabica CaXMT1 mRNA for xanthosine methyltransferase, complete cds
1 ctttggcagt cccaatttga tttatgtaca agtcctgcat atgaatggag ctccaagaag 61 tcctgcggat gaatggaggc gaaggcgata caagctacgc caagaattca gcctacaatc 121 aactggttct cgccaaggtg aaacctgtcc ttgaacaatg cgtacgggaa ttgttgcggg 181 ccaacttgcc caacatcaac aagtgcatta aagttgcgga tttgggatgc gcttctggac 241 caaacacact tttaacagtt cgggacattg tccaaagtat tgacaaagtt ggccaggaaa 301 agaagaatga attagaacgt cccaccattc agatttttct gaatgatctt ttcccaaatg 361 atttcaattc ggttttcaag ttgctgccaa gcttctaccg caaacttgag aaagaaaatg 421 gacgcaaaat aggatcgtgc ctaatagggg caatgcccgg ctctttctac agcagactct 481 tccccgagga gtccatgcat tttttacact cttgttactg tcttcaatgg ttatctcagg 541 ttcctagcgg tttggtgact gaattgggga tcagtacgaa caaagggagc atttactctt 601 ccaaagcaag tcgtctgccc gtccagaagg catatttgga tcaatttacg aaagatttta 661 ccacatttct aaggattcat tcggaagagt tgttttcaca tggccgaatg ctccttactt 721 gcatttgtaa aggagttgaa ttagacgccc ggaatgccat agacttactt gagatggcaa 781 taaacgactt ggttgttgag ggacatctgg aggaagaaaa attggatagt ttcaatcttc 841 cagtctatat accttcagca gaagaagtaa agtgcatagt tgaggaggaa ggttcttttg 901 aaattttata cctggagact tttaaggtcc tttacgatgc tggcttctct attgacgatg 961 aacatattaa agcagagtat gttgcatctt ccgttagagc agtttacgaa cccatcctcg 1021 caagtcattt tggagaagct attatacctg acatattcca caggtttgcg aagcatgcag 1081 caaaggttct ccccttgggc aaaggcttct ataataatct tatcatttct ctcgccaaaa 1141 agccagagaa gtcagacgtg taaaagtttg tttttgtgtt ggggaaagga ataagtgccg 1201 ttgggggtct ttcgggtatt gtgcttttta tattatattg ttttgtatcc gtaataaaag 1261 tggtgtgtaa gaataagata tttgacatat attattttca aaaaaaaaaa aaaaaa
Name | Length | RFC10 | RFC25 | Codon Usage | NCBI |
7-Methylxanthosintransferase | 1119bp | 1x EcoRI(104-110) | ok after RFC10 | 5AS<10% | [http://www.ncbi.nlm.nih.gov/nuccore/AB048793 AB048793] |
CaMXMT1
>gi|13365752|dbj|AB048794.1| Coffea arabica CaMXMT1 mRNA for 7-methylxanthine N-methyltransferase, complete cds
1 agcagtcgca attcgattgt cctgcatatg aatggagctc caagaagtcc tgcatatgaa 61 tgaaggtgaa ggcgatacaa gctacgccaa gaatgcatcc tacaatctgg ctcttgccaa 121 ggtgaaacct ttccttgaac aatgcatacg agaattgttg cgggccaact tgcccaacat 181 caacaagtgc attaaagttg cggatttggg atgcgcttct ggaccaaaca cacttttaac 241 agtgcgggac attgtgcaaa gtattgacaa agttggccag gaagagaaga atgaattaga 301 acgtcccacc attcagattt ttctgaatga tcttttccaa aatgatttca attcggtttt 361 caagttgctg ccaagcttct accgcaaact cgagaaagaa aatggacgca agataggatc 421 gtgcctaata agcgcaatgc ctggctcttt ctacggcaga ctcttccccg aggagtccat 481 gcattttttg cactcttgtt acagtgttca ttggttatct caggttccca gcggtttggt 541 gattgaattg gggattggtg caaacaaagg gagtatttac tcttccaaag gatgtcgtcc 601 gcccgtccag aaggcatatt tggatcaatt tacgaaagat tttaccacat ttctaaggat 661 tcattcgaaa gagttgtttt cacgtggccg aatgctcctt acctgcattt gtaaagtaga 721 tgaattcgac gaaccgaatc ccctagactt acttgacatg gcaataaacg acttgattgt 781 tgagggactt ctggaggaag aaaaattgga tagtttcaat attccattct ttacaccttc 841 agcagaagaa gtaaagtgca tagttgagga ggaaggttct tgcgaaattt tatatctgga 901 gacttttaag gcccattatg atgctgcctt ctctattgat gatgattacc cagtaagatc 961 ccatgaacaa attaaagcag agtatgtggc atcattaatt agatcagttt acgaacccat 1021 cctcgcaagt cattttggag aagctattat gcctgactta ttccacaggc ttgcgaagca 1081 tgcagcaaag gttctccaca tgggcaaagg ctgctataat aatcttatca tttctctcgc 1141 caaaaagcca gagaagtcag acgtgtaaaa gtttgttttt agttggtttt tgtgccgttg 1201 ggggtctttc gggtattgtc gttttgtatt cgtaataaaa gtgatgtgca agaataagat 1261 atttagtaca atattttcat aaaaaaaaaa aaaaaaaa
Name | Length | RFC10 | RFC25 | Codon Usage | NCBI |
N-Methylnucleosidase | 1137bp | 1x EcoRI(722-728) | ok after RFC10 | 2AS<10% | [http://www.ncbi.nlm.nih.gov/nuccore/AB048794 AB048794] |
CaDXMT
>gi|30023549|dbj|AB084125.1| Coffea arabica CaDXMT1 mRNA for 3,7-dimethylxanthine N-methyltransferase, complete cds
1 atggagctcc aagaagtcct gcatatgaat ggaggcgaag gcgatacaag ctacgccaag 61 aactcattct acaatctgtt tctcatcagg gtgaaaccta tccttgaaca atgcatacaa 121 gaattgttgc gggccaactt gcccaacatc aacaagtgca ttaaagttgc ggatttggga 181 tgcgcttctg gaccaaacac acttttaaca gttcgggaca ttgtacaaag tattgacaaa 241 gttggccagg aaaagaagaa tgaattagaa cgtcccacca ttcagatttt tctgaatgat 301 cttttccaaa atgatttcaa ttcggttttc aagtcgctgc caagcttcta ccgcaaactt 361 gagaaagaaa atggacgcaa aataggatca tgcctgatag gcgcaatgcc tggctctttc 421 tacggcagac tcttccccga ggagtccatg cattttttac actcttgtta ctgtttgcat 481 tggttatctc aggttcccag cggtttggtg actgaattgg ggatcagtgc gaacaaaggg 541 tgcatttact cttccaaagc aagtcgtccg cccatccaga aggcatattt ggatcaattt 601 acgaaagatt ttaccacatt tcttaggatt cattcggaag agttgatttc acgtggccga 661 atgctcctta cttggatttg caaagaagat gaattcgaga acccgaattc catagactta 721 cttgagatgt caataaacga cttggttatt gagggacatc tggaggaaga aaaattggac 781 agtttcaatg ttccaatcta tgcaccttca acagaagaag taaagtgcat agttgaggag 841 gaaggttctt ttgaaatttt atacctggag acttttaagg tcccttatga tgctggcttc 901 tctattgatg atgattacca aggaagatcc cattccccag tatcctgcga tgaacatgct 961 agagcagcgc atgtggcatc tgtcgttaga tcaattttcg aacccatcgt cgcaagtcat 1021 tttggagaag ctatcatgcc tgacttatcc cacaggattg cgaagaatgc agcaaaggtt 1081 cttcgctccg gcaaaggctt ctatgatagt cttatcattt ctctcgccaa aaagccagag 1141 aagtcagacg tgtaa
Name | Length | RFC10 | RFC25 | Codon Usage | NCBI |
N-Methylnucleosidase | 1155bp | 2x EcoRI(691-697/705-711) | ok after RFC10 | 2AS<10% | [http://www.ncbi.nlm.nih.gov/nuccore/AB084125 AB084125] |
General
Theoretically, we only need the coding sequences of the genes, because we ought to use the special RBS and Terminator biobricks from the registry and thus do not need the existing ones (in the 5' UTR and 3' UTR, respectively) (Anyway, they would probably not work in yeast)
Finally ordered Sequences
The ordered sequences were constructed as follows:
- the 5' UTR and 3' UTR of the sequences above were removed
- the yeast consensus sequence for improved ribosome binding (TACACA) was added 5' of the start codon ATG
- according to n- end rule and the yeast consensus sequence for improved ribosome binding, the first triplet after ATG (GAG) was exchanged with TCT (serine), to optimize both, protein stability and mRNA translation. This decision was made after proofing the 3D- structure of the enzyme CaDXMT1. Due to the fact, that the the first two residues of the amino acid sequence are not shown in the crystalized structure (probably because of high flexibility), Prof. Skerra said we can risk the exchange of this amino acid, for it is probably not that necessary for the uptake of the ligands (uniprot entry further shows, that it is not immediately involved in ligand binding in one of the three enzymes). Because of the high similarity of the enzyme- sequences, we also exchanged this amino acid in the enzyme CaMXMT1, although here is no 3D- structure available
- we added a c- terminal strep-tag for purification
- the remaining coding sequence was extended with the standard RFC10 prefix and suffix, respectively
- at last we made an optimization of the coding sequences with respect to the codon usage and mRNA structures (online tool of a gene- synthesis company)
- annotated sequences:
References
- Putrament et al., On the Specificity of Caffeine Effects, MGG, 1972
- H. Uefuji et al., Plant Physiology, 2003, Vol. 132, pp. 372–380
- H. Uefuji et al., Plant Molecular Biology, 2005, Vol. 59, p. 221–227
- H. Ashihara et al., Phytochemistry, 2008, Vol. 69, p. 841–856
- Yun-Soo Kim, Hiroshi Sano, Phytochemistry, 2008, Vol. 69, p. 882–888