Team:Johns Hopkins-Software/theSoftware overview

From 2012.igem.org

(Difference between revisions)
 
(34 intermediate revisions not shown)
Line 1: Line 1:
-
{{:Team:Johns_Hopkins-Software/background}}
+
<html></div></div></div></div></div></div></div></div></div></div></div></div>
-
<html>
+
<link rel="stylesheet" href="https://2012.igem.org/Team:Johns_Hopkins-Software/Templates/cssindex?action=raw&ctype=text/css" type="text/css"/>
<link rel="stylesheet" href="https://2012.igem.org/Team:Johns_Hopkins-Software/Templates/cssindex?action=raw&ctype=text/css" type="text/css"/>
Line 6: Line 5:
<div id="title">
<div id="title">
-
Software Goals
+
Overview
-
</div>
+
-
 
+
-
<p>We recognized a need for a comprehensive piece of software that provided the means to achieve an easier Plasmid Design process. Manual annotation is simply too time consuming, tedious, and error-prone. We wanted to create something to fix this noticeable setback of existing synthetic biology software. AutoGene sets a new technological standard by bringing automation to the plasmid design process. That being said, we developed a list of specific goals we wished to achieve. We split these goals into two modules, and decided the first set of goals would be achieved by AutoPlasmid, the second set of goals would be achieved by AutoDesign, and together, we would create AutoGene.</p>
+
-
+
-
<img src="https://static.igem.org/mediawiki/2012/9/9c/SoftwareGoals11.png" width=700 height=420/>
+
-
 
+
-
<p> Each of these goals encompasses an extensive list of sub-goals. Here, we discuss the more detailed goals for our primary milestones listed above.</p>
+
-
<div id="title">AutoPlasmid</div>
+
-
<p>AutoPlasmid was developed to take the process of annotating a plasmid to a whole new level. Firstly, we wanted to create a centralized depot of many thousands of well-known features and their associated sequences. We chose what we thought were some of the more popular features, and relied on databases such as SGD, PlasMapper, and the University of Wisconsin Madison's E. Coli Database. We aggregated over 40,000 features and compiled them into a complex database with the following tables:<br>
+
-
1. Annotation<br>
+
-
2. Feature<br>
+
-
3. FeatureType<br>
+
-
4. Oligo<br>
+
-
5. Organism<br>
+
-
6. Pathogen<br>
+
-
7. Plasmid<br>
+
-
8. RegistryPart<br>
+
-
9. RegistryType<br>
+
-
<p>But we knew this database would grow as large as it did. To address this concern, we implemented Cloud Computing, what we feel is one of the most impressive aspects of our software application. A plasmid such as puc18 normally takes 4 hours on a slow computer to annotate and search through all our features. But in the cloud, with 30 parallel processes running at once, we can split up the algorithm and divvy up the work, resulting in a run of just 68 seconds, a dramatic decrease in time by 210-fold!</p>
+
-
 
+
-
<p>But we added many, many more features on top of this architectural backend. In particular, we gave users the ability to annotate their plasmid using imperfect matches selecting any threshold they want (ex. 95% match). While this significantly increases the time a particular annotation takes, it still results in rapidly fast annotations when the Cloud is used. In addition, users may select particular features they want to search for, such as Genes, Promoters, Terminators, and more. Lastly, users may translate DNA into amino acid in all 6 reading frames.</p>
+
-
 
+
-
<p> Once a plasmid is fully annotated, the user can do even more. They can look inside the plasmid, view details about each feature that was annotated, and manipulate the plasmid even more. They can add custom annotations, view where a particular segment of DNA is, and even isolate out features to later design with. Users can view oligo matches, and amino acid translation of particular DNA segments. One of the most useful features of the AutoPlasmid's plasmid view window is that a user can find exactly why a annotation was identified as an imperfect match. In puc18, it is well known that the origin has a 1 base pair mutation, for example. This can be easily located by viewing the details of the alignment. </p>
+
-
 
+
-
<p> Lastly, a user may export their annotated plasmid in any format they'd like: genbank, fasta, or SBOL. These standardized file formats are useful when using AutoGene in collaboration with other standard softwares, such as Ape. Our genbank format, in particular, preserves characteristics of annotations that may be crossed over to Ape, such as the color of each annotation.</p>
+
-
 
+
-
<p>***NOTE: scroll down below for examples images showing screenshots of all the aforementioned features.</p>
+
-
 
+
-
<div id="title">AutoDesign</div>
+
-
 
+
-
<p>AutoGene was developed to make a simple-to-use design tool whereby users can select features they want to design with, drag these features into the order they want, perform operations on these features, and then import their newly designed plasmid to the workspace.</p>
+
-
+
-
<p>A user starts by selecting features they want to design with. These features can be chosen from multiple plasmids in the workspace. To select a feature, a user simply drags that annotation into the Design Registry, a listing of features the user wants to save for later use. We wanted to make this process extremely intuitive and easy, so we implemented a user-friendly drag and drop system whereby a user can drag their cursor over multiple features from a plasmid and drag them into the design registry all at once. They pop inside the pane on the right side of the screen, waiting to be designed with</p>
+
-
 
+
-
<p>When a user is ready to begin designing, they continue the drag and drop process by dragging features from the private registry into the design pane. The whole process is especially user-friendly and simple to use. Anyone can use it, and this was also another one of our major goals. The design pane consists of three windows which are discussed below. As the plasmid is made, the associated DNA sequence and the associated picture of the plasmid is updated in real time so that a user can get a real view into the Plasmid they are designing. AutoPlasmid helps visualize this new plasmid. Once a plasmid is done being designed, the user may import it into the workspace and then export as a genbank, fasta, and SBOL as discussed above</p>
+
-
 
+
-
<div id="title">
+
-
Interface Organization - AutoPlasmid
+
</div>
</div>
<br>
<br>
-
1. PlasmidView: this is the AutoPlasmid window. On the left is a picture of the plasmid. On the right is the DNA and the list of features contained inside that plasmid. Using the many buttons in the toolbar on the right side, or by clicking annotations in the picture of the plasmid, the user may interact with AutoPlasmid in many ways described above.
+
AutoGene is an innovative plasmid design suite meant to streamline the processes of annotating and building sequences. Written in Java and Python and consisting of two modules -- AutoPlasmid and AutoDesign -- it first uses a highly curated database of features to search an imported plasmid, scanning the sequence for both perfect and imperfect alignments, and then generating an interactive visualization of the annotated plasmid. Once a plasmid is annotated, a user is then able to alter its contents, using the AutoGene feature database, the biobrick database, as well as custom components. Still in progress, the design module enables structural optimization by maintaining sets of sequence rules and taking an algorithmic approach to minimizing structural violations. Additionally the program reduces the problems caused by restriction sites during application of designs, and proposes the most suitable enzyme selections through an analysis of standard restriction sites libraries.  
<br><br>
<br><br>
-
2. Plasmid List: this is where AutoGene keeps track of the Plasmids that the user is annotation and/or designing with throughout their use of AutoGene. Anytime a plasmid is imported, it pops into the Plasmid List so that a user may easily open it at any time.
+
The processes will be run on the cloud as a standard web server gateway interface service to dramatically increasing its speed and accessibility while maintaining the same level of accuracy. As a cloud service, it will run its algorithms in parallel and thus have the ability to perform more computationally intense procedures, such as optimizing codons to increase gene expression, and designing the most efficient oligonucleotide sequences for PCR assembly.  
<br><br>
<br><br>
-
3. Private Registry of Features: this is where AutoGene keeps track of features that users would like to design with. A user drags features from the PlasmidView into the Private Registry as a type of Bookmarking utility. It is a way to quickly access features to later use in designing plasmids.
+
Furthermore, AutoGene has the potential for integrating fabrication or design software in the future. 3D visualization techniques could provide users with views of protein structures created from the sequences or that of a related ortholog. The process for fabrication of synthetic DNA sequences could be automated through building block design, helping the the user to assemble their oligonucleotides. A number of additional features, such as evaluating overlapping segments of oligonucleotides, and evaluating melting temperatures for PCR assembly, may also be built on top of this project.  
<br><br>
<br><br>
-
4. Log: this is AutoGene's log, which gives warnings or error messages to the user. For example, if the user tries to import a DNA sequence with incorrect characters. It also will notify users of tasks that are running, such as annotations (and features that have been found) or DNA inverting.
+
Using a clear user interface, a large, highly curated feature database, and through harnessing the power of cloud computing, we hope to simplify the process of plasmid design. AutoGene aims to make it faster and easier for a user to both identify and modify the contents of any plasmid, enhancing sequences for greater viability and expression.
<br><br>
<br><br>
-
<img src="https://static.igem.org/mediawiki/2012/b/b2/AutoPlasmidOverview.png" width=925 height=582/>
+
<div id="title">
-
<br><br>
+
Software Goals
 +
</div>
 +
<br>
 +
<p>We recognized a need for a comprehensive piece of software that provided the means to achieve an easier Plasmid Design process. Manual annotation is simply too time consuming, tedious, and error-prone. We wanted to create something to fix this noticeable setback of existing synthetic biology software. AutoGene sets a new technological standard by bringing automation to the plasmid design process. That being said, we developed a list of specific goals we wished to achieve. We split these goals into two modules, and decided the first set of goals would be achieved by AutoPlasmid, the second set of goals would be achieved by AutoDesign, and together, we would create AutoGene.</p>
 +
 +
<center><img src="https://static.igem.org/mediawiki/2012/9/9c/SoftwareGoals11.png" width=700 height=420/></center>
 +
<br><br> Each of these goals encompasses an extensive list of sub-goals. Here, we discuss the more detailed goals for our primary milestones listed above.<br>
 +
<br>
 +
 +
<a name="futuregoals">
<div id="title">
<div id="title">
-
Interface Organization - AutoDesign
+
Future Goals
</div>
</div>
 +
</a>
<br>
<br>
-
1. Design Pane: the user drags features from the Private Registry into the Design Pane. Each feature type has an associated icon (ex. a Terminator is a capital T). The user may user the design pane to rearrange features using an easy drag and drop process. They may also right click any feature to perform a number of operators on that feature, such as duplicate, delete, and invert.
+
Autogene is still evolving as we both finish up the design module, and fine tune the plasmid mapper. There are several features we’d like to develope in the future, including 3d protein visualization using MAYA, codon optimization, and primer design.
<br><br>
<br><br>
-
2. DNA Pane: the DNA Pane shows the current DNA that is contained inside the plasmid that is being designed. Different features are highlighted in different colors associated with their feature type. For example, Origins are in brown. The DNA updates as the user continues to design their plasmid in the Design Pane.
+
Codon optimization is a common synthetic biology technique used to improve the efficiency of protein translation. DNA is translated into amino acids in groups of three bases called codons, but only the first two bases determine into which amino acid the codon in translated. This allows researchers to change this third letter without affecting the resulting proteins. When a codon is ‘optimized’, this means that the codon is replaced with a version producing the same protein as the original, but which is more common in the organism’s DNA. If the codon is more common, then the thought is that it can be produced more efficiently.
<br><br>
<br><br>
-
3. PlasmidView: the PlasmidView shows the current Plasmid, and is updated just as the DNA Pane is updated while the user designs their plasmid.  
+
Primer design is one of the most common computational tasks in synthetic biology. In order to synthesize a DNA sequence, it is necessary to break the total sequence into small, overlapping parts, which are then combined using PCR. Once we finish this feature, Autogene will truly encompass all portions of plasmid design, using the simple philosophy of Scan/Edit/Print. From the annotation of the starting sequence, to the gathering and arranging of desired features, to the development of building blocks to actually synthesize the given sequence, everything will be done through Autogene.
<br><br>
<br><br>
-
 
+
Autogene is the first tool of its kind in many ways, and we hope that its development will demonstrate the potential of many new techniques and technologies when applied to synthetic biology. By exporting our alignments to the cloud, we’ve shown that complex computation, the kind that before could only be performed on very high-powered computers, can be accessed by anyone with an internet connection. An increase in computational power for every researcher across the board could revolutionize synthetic biology, and scientific research in general.
-
<img src="https://static.igem.org/mediawiki/2012/c/cd/AutoDesignExplanation.png" width=925 height=582/>
+
<br><br>
<br><br>
-
<div id="title">
 
-
How to use AutoPlasmid
 
</div>
</div>
-
<div id="largeFont"></div>
+
<br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br><br>
-
<br><font size ="4" color = "white">
+
-
 
+
-
<br>First, hand AutoPlasmid a sequence of DNA, however big or small you'd like. Import a file or copy and paste the sequence. <br><br>
+
-
<img src="https://static.igem.org/mediawiki/2012/4/47/ImportPlasmidScreenshot.png"/><br><br>
+
-
 
+
-
Then get the annotation loaded and started. AutoPlasmid will search through a database containing 40,000 features and look for matches. Not only does it find perfect matches, but if you specify, you can search for imperfect matches with any threshold you'd like (ex. 90% match).
+
-
 
+
-
<br><br><img src="https://static.igem.org/mediawiki/2012/8/8d/AnnotationOptionsJHU.png"/><br><br>  
+
-
 
+
-
After searching for annotations, open your plasmid to look inside.
+
-
<br><br><img src="https://static.igem.org/mediawiki/2012/6/6e/Puc18JHU.png"/><br><br>
+
-
                                 
+
-
Interact with it by selecting features, manipulating the DNA, and adding any custom annotations you'd like. You can even view amino acid translations or oligo matches. <br><br>
+
-
 
+
-
<img src="https://static.igem.org/mediawiki/2012/e/e5/CustomAnnotationWindow.png"/><br><br>
+
-
<img src="https://static.igem.org/mediawiki/2012/a/a1/TranslationScreenshot.png"/><br><br>
+
-
<img src="https://static.igem.org/mediawiki/2012/6/62/OligoScreenshot.png"/><br><br>
+
-
 
+
-
Lastly, you can view the details of an imperfect match.
+
-
 
+
-
<br><br><img src="https://static.igem.org/mediawiki/2012/5/5a/MatchDetailsJHU.png"/><br><br>
+
-
 
+
-
AutoPlasmid is also compatible with a variety of standard biology tools, such as Ape. A plasmid can be imported as a fasta, gb, and SBOL file, as well as being exported as a fasta, gb, or SBOL file.
+
-
<br><br><br><br><br>
+
-
<div id="title">
+
-
How to use AutoDesign </div>
+
-
<br>
+
-
 
+
-
Using the annotated features that AutoPlasmid finds, the user may select features to design a new plasmid with. First, the user chooses features and drags them into the private registry.<br><br>
+
-
<img src="https://static.igem.org/mediawiki/2012/c/c0/DesignRegistryScreenshot.png"/><br><br>
+
-
 
+
-
Next, the user drags features into the private registry. The user can change the order of features, invert features, duplicate features, and delete features. At the same time, the user sees an updated version of the plasmid they are designing and the associated DNA.<br><br>
+
-
<img src="https://static.igem.org/mediawiki/2012/2/27/JHUAutoDesignScreenshot.png"/><br><br>
+
-
 
+
-
 
+
-
<div id="title">Future Goals</div>
+
-
The future of AutoGene, we believe, includes a number of new features:<br>
+
-
1.
+
-
<br>
+
-
2.
+
-
<br>
+
-
3.
+
-
<br>
+
-
                        </font>
+
-
 
+
-
</div>
+
-
+
-
 
+
-
 
+
-
 
+
-
 
+
</html>
</html>
{{:Team:Johns_Hopkins-Software/header}}
{{:Team:Johns_Hopkins-Software/header}}

Latest revision as of 23:32, 26 October 2012

Overview

AutoGene is an innovative plasmid design suite meant to streamline the processes of annotating and building sequences. Written in Java and Python and consisting of two modules -- AutoPlasmid and AutoDesign -- it first uses a highly curated database of features to search an imported plasmid, scanning the sequence for both perfect and imperfect alignments, and then generating an interactive visualization of the annotated plasmid. Once a plasmid is annotated, a user is then able to alter its contents, using the AutoGene feature database, the biobrick database, as well as custom components. Still in progress, the design module enables structural optimization by maintaining sets of sequence rules and taking an algorithmic approach to minimizing structural violations. Additionally the program reduces the problems caused by restriction sites during application of designs, and proposes the most suitable enzyme selections through an analysis of standard restriction sites libraries.

The processes will be run on the cloud as a standard web server gateway interface service to dramatically increasing its speed and accessibility while maintaining the same level of accuracy. As a cloud service, it will run its algorithms in parallel and thus have the ability to perform more computationally intense procedures, such as optimizing codons to increase gene expression, and designing the most efficient oligonucleotide sequences for PCR assembly.

Furthermore, AutoGene has the potential for integrating fabrication or design software in the future. 3D visualization techniques could provide users with views of protein structures created from the sequences or that of a related ortholog. The process for fabrication of synthetic DNA sequences could be automated through building block design, helping the the user to assemble their oligonucleotides. A number of additional features, such as evaluating overlapping segments of oligonucleotides, and evaluating melting temperatures for PCR assembly, may also be built on top of this project.

Using a clear user interface, a large, highly curated feature database, and through harnessing the power of cloud computing, we hope to simplify the process of plasmid design. AutoGene aims to make it faster and easier for a user to both identify and modify the contents of any plasmid, enhancing sequences for greater viability and expression.

Software Goals

We recognized a need for a comprehensive piece of software that provided the means to achieve an easier Plasmid Design process. Manual annotation is simply too time consuming, tedious, and error-prone. We wanted to create something to fix this noticeable setback of existing synthetic biology software. AutoGene sets a new technological standard by bringing automation to the plasmid design process. That being said, we developed a list of specific goals we wished to achieve. We split these goals into two modules, and decided the first set of goals would be achieved by AutoPlasmid, the second set of goals would be achieved by AutoDesign, and together, we would create AutoGene.



Each of these goals encompasses an extensive list of sub-goals. Here, we discuss the more detailed goals for our primary milestones listed above.

Future Goals

Autogene is still evolving as we both finish up the design module, and fine tune the plasmid mapper. There are several features we’d like to develope in the future, including 3d protein visualization using MAYA, codon optimization, and primer design.

Codon optimization is a common synthetic biology technique used to improve the efficiency of protein translation. DNA is translated into amino acids in groups of three bases called codons, but only the first two bases determine into which amino acid the codon in translated. This allows researchers to change this third letter without affecting the resulting proteins. When a codon is ‘optimized’, this means that the codon is replaced with a version producing the same protein as the original, but which is more common in the organism’s DNA. If the codon is more common, then the thought is that it can be produced more efficiently.

Primer design is one of the most common computational tasks in synthetic biology. In order to synthesize a DNA sequence, it is necessary to break the total sequence into small, overlapping parts, which are then combined using PCR. Once we finish this feature, Autogene will truly encompass all portions of plasmid design, using the simple philosophy of Scan/Edit/Print. From the annotation of the starting sequence, to the gathering and arranging of desired features, to the development of building blocks to actually synthesize the given sequence, everything will be done through Autogene.

Autogene is the first tool of its kind in many ways, and we hope that its development will demonstrate the potential of many new techniques and technologies when applied to synthetic biology. By exporting our alignments to the cloud, we’ve shown that complex computation, the kind that before could only be performed on very high-powered computers, can be accessed by anyone with an internet connection. An increase in computational power for every researcher across the board could revolutionize synthetic biology, and scientific research in general.






































































































































































































Autogene

Retrieved from "http://2012.igem.org/Team:Johns_Hopkins-Software/theSoftware_overview"