Team:St Andrews/Human-practices

From 2012.igem.org

(Difference between revisions)
 
(227 intermediate revisions not shown)
Line 3: Line 3:
<html lang="en">
<html lang="en">
<head>
<head>
-
<link rel="stylesheet" href="https://2012.igem.org/Team:St_Andrews/Template:DocsCssFull?action=raw&ctype=text/css" type="text/css" />
+
<style type="text/css">
 +
.label-set .label {line-height:1.8}
 +
#content-container .page-header {padding-top:30px}
 +
.extra-spacing {padding-top:20px}
 +
.thumbnail {text-decoration:none !important;background-color:#ffffff}
 +
.thumbnail:hover {color:inherit !important}
 +
</style>
</head>
</head>
   <body data-spy="scroll" data-target=".subnav" data-offset="50" id="body-pattern">
   <body data-spy="scroll" data-target=".subnav" data-offset="50" id="body-pattern">
-
 
+
     <div class="container" id="content-container" style="background-image:url('https://static.igem.org/mediawiki/2012/a/ac/ImpactLogo_backgroundCorner_plusTransp_right.jpg');background-repeat:no-repeat;background-position:top right;background-size:35%">
-
     <div class="container" id="content-container">
+
<!-- Masthead
<!-- Masthead
Line 15: Line 20:
<header class="jumbotron subhead" id="overview">
<header class="jumbotron subhead" id="overview">
   <h1>Scientific impact of iGEM</h1>
   <h1>Scientific impact of iGEM</h1>
-
   <p class="lead">"The most influential Synthetic Biology competition" vs. "let the kids play"</p>
+
   <p class="lead">"Most influential synthetic biology competition" vs. "Just some kids playing"?</p>
   <div class="subnav">
   <div class="subnav">
     <ul class="nav nav-pills">
     <ul class="nav nav-pills">
       <li><a href="#introduction">Introduction</a></li>
       <li><a href="#introduction">Introduction</a></li>
       <li><a href="#data-collection">Data collection</a></li>
       <li><a href="#data-collection">Data collection</a></li>
 +
      <li><a href="#metrics">Metrics</a></li>
 +
      <li><a href="#data-analysis">Data analysis</a></li>
 +
      <li><a href="#citations">Citations</a></li>
 +
      <li><a href="#conclusion">Conclusion</a></li>
 +
      <li><a href="#references">References</a></li>
     </ul>
     </ul>
   </div>
   </div>
Line 29: Line 39:
<!-- Introduction
<!-- Introduction
================================================== -->
================================================== -->
-
<section id="introduction">
 
-
<br>
 
-
<h1>Introduction</h1>
 
-
<p>Team St Andrews forms, uniting nine students, seven world class researchers and four PhD advisors from disciplines as diverse as Computer Science and Physics, to Biology, Medicine and Chemistry.  </p>
+
<section id="introduction" class="span9">
-
+
<div class="page-header">
-
<h2>January - March 2012</h2>
+
<h1>Introduction</h1>
-
+
</div>
-
<p>Applications for sponsorship are made to specifically chosen businesses and organisations with an interest in advancing the Life Sciences. Very quickly, "BioSilta", "GenScript", "Clontech", "Geneious", "Integrated DNA Technologies" and "Thermo Fisher" pledge their support and Team St Andrews grows...</p>
+
<img src="https://static.igem.org/mediawiki/2012/a/ac/ImpactLogo_100.png" align="left"></img>
-
+
<p>We wanted to determine how relevant the iGEM competition is for the greater SynBio community. So we investigated the scientific attention garnered by both the iGEM and the Registry of Standard Parts. A data-driven approach was chosen: We extracted data from search results using various queries (such as <code>("iGEM" OR "International Genetically Engineered Machine") AND ("synthetic biology" OR "genetic engineering")</code>) from various publication search engines. We searched <a href="http://apps.webofknowledge.com/">Web of Knowledge</a>, <a href="http://www.scopus.com/home.url">Scopus</a>, <a href="http://www.ncbi.nlm.nih.gov/pubmed/">PubMed</a> and <a href="http://scholar.google.co.uk/">Google Scholar</a>. Google Scholar was chosen to perform more detailed data analysis, as we found the alternatives to have various shortcomings.</p>
-
<h2>10 March 2012</h2>
+
<p>We found that our data results are conclusive with our initial hypothesis: iGEM is an important contributor to the SynBio community. These findings have some implications for the iGEM competition, which we discuss.</p>
-
+
<p>In order to quantify these results further, we analyzed how exactly iGEM and the Registry has been cited. Examining around 50 papers in closer detail, we recommend all papers published by iGEM teams or related to iGEM or the Registry to use a standard citation.</p>
-
<p>National Science and Engineering Week explodes in Fife with a regional "Science Discovery Day".  Team St Andrews works to convey fundamental concepts in Genetic Engineering and Synthetic Biology to members of the public in new and exciting ways. The interactive "Codon Game", the 3 Dimensional visualisations of DNA and DNA polymerase 3 and display "E. Coli: under the Microscope" are all well received.  Children and adults alike are fascinated when DNA is extracted from bananas, using everyday kitchen utensils, before their eyes.</p>
+
-
+
-
<h2>April 2012</h2>
+
-
+
-
<p>Brainstorming sessions are held as the team researches project ideas.  Some promising titles include:</p>
+
-
+
-
<ul>
+
-
<li><p>"E. Coli and Omega 3: A Project to Feed the Minds of Our Generation"</p></li>
+
-
<li><p>"Enzymatic Methane Conversion in Cows: a Sweet Smelling Approach to Reducing Climate Change"</p></li>
+
-
<li><p>"Resurfacing Science, Resurfacing our Roads: Cell Factories and Metal Binding Proteins Recover Pavement Platinum"</p></li>
+
-
<li><p>"Project Bio-logic-al: Optimizing Soil Composition by Method of Biological Computation"</p></li>
+
-
</ul>
+
-
+
-
<h2>27 April 2012</h2>
+
-
+
-
<p>Team member Josi presents "Spider Mutants and Bioterrorism - an Overview of Synthetic Biology as an Emerging Scientific Discipline" to a “TEDx” audience of over eighty scientists and non - scientists akin. Josi views the field as "ground breaking" and by the end of her talk, members of her audience too admit surprise at the wealth of possibilities that this new research area makes available.  There is excitement at the tantalizing proximity of reality of these ideas.</p>
+
-
+
-
<h2>May 2012</h2>
+
-
+
-
<p>Project ideas are discussed in greater detail and are filtered until only two research topics remain.  Those preferred ideas are: the production of Omega 3 Fatty Acids by E. Coli Cells ("E. Coli and Omega 3: A Project to Feed the Minds of Our Generation") and the production of Metal – Binding Proteins ("Resurfacing Science, Resurfacing our Roads: Cell Factories and Metal Binding Proteins Recover Pavement Platinum").</p>
+
</section>
</section>
Line 66: Line 53:
================================================== -->
================================================== -->
<section id="data-collection">
<section id="data-collection">
-
<br>
+
<div class="row">
-
<h1>Data collection</h1>
+
<div class="span12">
-
+
<div class="page-header">
-
<div class="accordion" id="accordion2">
+
<h1>Data collection <small>Data and where to find it</small></h1>
-
            <div class="accordion-group">
+
</div>
-
              <div class="accordion-heading">
+
<h2 id="query-summary">Query summary</h2>
-
                <a class="accordion-toggle" data-toggle="collapse" data-parent="#accordion2" href="#collapseOne">
+
<p>Here's a quick breakdown of what we queried for on Google Scholar and what sort of data was returned. (The <em>ID</em> matches the name of the data set in our <a href="https://docs.google.com/folder/d/0B6xW2A1PIytESjJ4SUlteG9rUEU/edit?pli=1">data tables</a>). The <i>h-</i> and <i>g-indexes</i> are explained just below!</p>
-
                  Elsevier's Scopus database, search query "iGEM". Irrelevant results (e.g. institution of gas engineers and managers) manually filtered. Performed on 26 June 2012.
+
<table class="table table-striped">
-
                </a>
+
 
-
              </div>
+
<tr>
-
              <div id="collapseOne" class="accordion-body collapse" style="height: 0px; ">
+
<th>Dataset ID</th>
-
                <div class="accordion-inner">
+
<th>Plain English query</th>
-
                  Anim pariatur cliche reprehenderit, enim eiusmod high life accusamus terry richardson ad squid. 3 wolf moon officia aute, non cupidatat skateboard dolor brunch. Food truck quinoa nesciunt laborum eiusmod. Brunch 3 wolf moon tempor, sunt aliqua put a bird on it squid single-origin coffee nulla assumenda shoreditch et. Nihil anim keffiyeh helvetica, craft beer labore wes anderson cred nesciunt sapiente ea proident. Ad vegan excepteur butcher vice lomo. Leggings occaecat craft beer farm-to-table, raw denim aesthetic synth nesciunt you probably haven't heard of them accusamus labore sustainable VHS.
+
<th>Query</th>
-
                </div>
+
<th>Nº Papers</th>
-
              </div>
+
<th>Nº Citations</th>
-
            </div>
+
<th>h-index</th>
-
            <div class="accordion-group">
+
<th>g-index</th>
-
              <div class="accordion-heading">
+
<th>Query date</th>
-
                <a class="accordion-toggle" data-toggle="collapse" data-parent="#accordion2" href="#collapseTwo">
+
</tr>
-
                  Collapsible Group Item #2
+
 
-
                </a>
+
<tr>
-
              </div>
+
<td><a href="https://docs.google.com/folder/d/0B6xW2A1PIytESjJ4SUlteG9rUEU/edit?pli=1&docId=0AqxW2A1PIytEdDBNV2ExQnluOXJ6SzlqeEZSQ2twc1E">1</a></td>
-
              <div id="collapseTwo" class="accordion-body in collapse" style="height: 0px; ">
+
<td>Papers mentioning iGEM in context of synbio</td>
-
                <div class="accordion-inner">
+
<td><code>(“synthetic biology” OR "genetic engineering") AND (“iGEM” OR “International Genetically Engineered Machine")</code></td>
-
                  <table>
+
<td>770</td>
-
<tr><td>Authors</td><td>Title</td><td>Year</td><td>Source title</td><td>Cited by</td><td>Link</td></tr>
+
<td>3253</td>
-
<tr><td>Boyle P.M.</td><td> Burrill D.R.</td><td> Inniss M.C.</td><td> Agapakis C.M.</td><td> Deardon A.</td><td> DeWerd J.G.</td><td> Gedeon M.A.</td><td> Quinn J.Y.</td><td> Paull M.L.</td><td> Raman A.M.</td><td> Theilmann M.R.</td><td> Wang L.</td><td> Winn J.C.</td><td> Medvedik O.</td><td> Schellenberg K.</td><td> Haynes K.A.</td><td> Viel A.</td><td> Brenner T.J.</td><td> Church G.M.</td><td> Shah J.V.</td><td> Silver P.A."</td><td>A BioBrick compatible strategy for genetic modification of plants</td><td>2012</td><td>Journal of Biological Engineering</td><td></td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-84862311887&partnerID=40&md5=c7d8f6c120046b10d7c2846035e8967a</td></tr>
+
<td>26</td>
-
<tr><td>Hesselman M.C.</td><td> Odoni D.I.</td><td> Ryback B.M.</td><td> de Groot S.</td><td> van Heck R.G.A.</td><td> Keijsers J.</td><td> Kolkman P.</td><td> Nieuwenhuijse D.</td><td> van Nuland Y.M.</td><td> Sebus E.</td><td> Spee R.</td><td> de Vries H.</td><td> Wapenaar M.T.</td><td> Ingham C.J.</td><td> Schroen K.</td><td> Martins dos Santos V.A.P.</td><td> Spaans S.K.</td><td> Hugenholtz F.</td><td> van Passel M.W.J."</td><td>A multi-platform flow device for microbial (co-) cultivation and microscopic analysis</td><td>2012</td><td>PLoS ONE</td><td></td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-84861008844&partnerID=40&md5=1d589179f1026c6b18561886eb95137f</td></tr>
+
<td>45</td>
-
<tr><td>Materi W.</td><td>Leading a successful iGEM team</td><td>2012</td><td>Methods in Molecular Biology</td><td></td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-84858373911&partnerID=40&md5=c042ab8445472bde80fd097c2c7f5840</td></tr>
+
<td>17/7/2012</td>
-
<tr><td>Ho-Shing O.</td><td> Lau K.H.</td><td> Vernon W.</td><td> Eckdahl T.T.</td><td> Campbell A.M."</td><td>Assembly of standardized DNA parts using biobrick ends in E. coli</td><td>2012</td><td>Methods in Molecular Biology</td><td></td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-84858416895&partnerID=40&md5=257cf76e51c458476c1ba8c45ad53993</td></tr>
+
</tr>
-
<tr><td>Giavitto J.-L.</td><td>The modeling and the simulation of the fluid machines of synthetic biology</td><td>2012</td><td>Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)</td><td></td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-84856118515&partnerID=40&md5=b482e017a0312fc2b37771c0a2ebd016</td></tr>
+
 
-
<tr><td>Muller K.M.</td><td> Arndt K.M."</td><td>Standardization in synthetic biology</td><td>2012</td><td>Methods in Molecular Biology</td><td></td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-82355181974&partnerID=40&md5=19e06cbf0e67855c668e52423d0eee2b</td></tr>
+
 
-
<tr><td>Kuldell N.</td><td>Living machines: Some assembly required: Kit-based competitions challenge teams of students to learn microbiology and design principles in the context of synthetic biology</td><td>2012</td><td>Microbe</td><td></td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-84855890383&partnerID=40&md5=704e642863324ba616224b7d2000c638</td></tr>
+
<tr>
-
<tr><td>Stewart D.</td><td> Wilson-Kanamori J.R."</td><td>Modular modelling in synthetic biology: Light-based communication in E. coli</td><td>2011</td><td>Electronic Notes in Theoretical Computer Science</td><td></td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-80055072775&partnerID=40&md5=728eadae0407c4fb4d43d3c8d48648b9</td></tr>
+
<td><a href="https://docs.google.com/folder/d/0B6xW2A1PIytESjJ4SUlteG9rUEU/edit?pli=1&docId=0AqxW2A1PIytEdDVBSnY4ekt0ZUdJRXZqaFREVmJrUkE">2</a></td>
-
<tr><td>Dixon J.</td><td> Kuldell N."</td><td>BioBuilding: Using banana-scented bacteria to teach synthetic biology</td><td>2011</td><td>Methods in Enzymology</td><td>1</td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-79957526685&partnerID=40&md5=67f18898be70d64e4a5ca05e4f62b83b</td></tr>
+
<td>All synthetic biology</td>
-
<tr><td>Mitchell R.</td><td> Dori Y.J.</td><td> Kuldell N.H."</td><td>Experiential Engineering Through iGEM-An Undergraduate Summer Competition in Synthetic Biology</td><td>2011</td><td>Journal of Science Education and Technology</td><td></td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-79953037825&partnerID=40&md5=26f1a53f3e91cc6943854fc9a0b3bd02</td></tr>
+
<td><code>synthetic biology</code></td>
-
<tr><td>Dress L.</td><td>iGEM 2010: Synthetic biologists compete for the future</td><td>2010</td><td>Industrial Biotechnology</td><td></td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-78650724541&partnerID=40&md5=c97c238e5d9425acfb58d30242e71484</td></tr>
+
<td>1000</td>
-
<tr><td>Hafner S.</td><td>IGEM 2009: Synthethic biology and ethics [Les iGEM 2009 la biologie synth-éthique]</td><td>2010</td><td>Medecine/Sciences</td><td>2</td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-78650166973&partnerID=40&md5=a6b997075a88a932c2dc7551f868bda7</td></tr>
+
<td>68482</td>
-
<tr><td>Gu X.</td><td> Trybilo M.</td><td> Ramsay S.</td><td> Jensen M.</td><td> Fulton R.</td><td> Rosser S.</td><td> Gilbert D."</td><td>Engineering a novel self-powering electrochemical biosensor</td><td>2010</td><td>Systems and Synthetic Biology</td><td></td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-77958488960&partnerID=40&md5=141f80bffe6c579c0029d9d7540fe9a0</td></tr>
+
<td>127</td>
-
<tr><td>Cai Y.</td><td> Wilson M.L.</td><td> Peccoud J."</td><td>GenoCAD for iGEM: A grammatical approach to the design of standard-compliant constructs</td><td>2010</td><td>Nucleic Acids Research</td><td>12</td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-77952526654&partnerID=40&md5=ae3ab35cb311e0e1ddabb0cff9036f77</td></tr>
+
<td>214</td>
-
<tr><td>Cooling M.T.</td><td> Rouilly V.</td><td> Misirli G.</td><td> Lawson J.</td><td> Yu T.</td><td> Hallinan J.</td><td> Wipat A."</td><td>Standard virtual biological parts: A repository of modular modeling components for synthetic biology</td><td>2010</td><td>Bioinformatics</td><td>10</td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-77951969603&partnerID=40&md5=6776faf8875e836fea00b273c28fa315</td></tr>
+
<td>17/7/2012</td>
-
<tr><td>Smolke C.D.</td><td>Building outside of the box: IGEM and the BioBricks Foundation</td><td>2009</td><td>Nature Biotechnology</td><td>15</td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-71849085372&partnerID=40&md5=ad462a440b094c7dc24c9ac9491546f1</td></tr>
+
</tr>
-
<tr><td>Hafner S.</td><td>"iGEM competition 2008</td><td> two French teams! [Compétition iGEM 2008: Deux équipes françaises!]"</td><td>2009</td><td>Medecine/Sciences</td><td>2</td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-66849142197&partnerID=40&md5=715e4423dfdd513949af6798ad32ea0c</td></tr>
+
 
-
<tr><td>Jerala R.</td><td>I like iGEM</td><td>2009</td><td>Scientist</td><td></td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-65349196705&partnerID=40&md5=cbfa881ac65c74d323c1bbf94cd47a0b</td></tr>
+
<tr>
-
<tr><td>Edwards C.</td><td>The gene machines</td><td>2008</td><td>Engineering and Technology</td><td></td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-44449168043&partnerID=40&md5=6c08b9a9258980533bfca7f89c1a6d2f</td></tr>
+
<td><a href="https://docs.google.com/folder/d/0B6xW2A1PIytESjJ4SUlteG9rUEU/edit?pli=1&docId=0AqxW2A1PIytEdFFSV2hEMURWb1pCUmpHY3EyWndHeEE">3</a></td>
-
<tr><td>Bikard D.</td><td> Kepes F."</td><td>First French team success during iGEM Synthetic biology competition [Succès de la première équipe française lors de la compétition iGEM de biologie synthétique]</td><td>2008</td><td>Medecine/Sciences</td><td>6</td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-44949101352&partnerID=40&md5=b4b37b907403fc09040dc4d9dd01f9b8</td></tr>
+
<td>Papers mentioning Registry of Parts</td>
-
<tr><td>Bachman R.</td><td>An iGEM of an idea? [2]</td><td>2008</td><td>Scientist</td><td></td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-39849096038&partnerID=40&md5=983ce8270f7b0b54a9a6dc9b35c61689</td></tr>
+
<td><code>"Registry of Standard Biological Parts" OR "partsregistry.org" OR "parts.mit.edu"</code></td>
-
<tr><td>Kaczkowski P.</td><td>An iGEM of an idea? [1]</td><td>2008</td><td>Scientist</td><td></td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-39849093164&partnerID=40&md5=1af981458828ff33ece806cc2ce4443a</td></tr>
+
<td>751</td>
-
<tr><td>Goodman C.</td><td>Engineering ingenuity at iGEM</td><td>2008</td><td>Nature Chemical Biology</td><td>16</td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-37249032831&partnerID=40&md5=464370c4b42769a4b3a9a8a717a1c206</td></tr>
+
<td>6442</td>
-
<tr><td>Gallagher R.</td><td>An iGEM of an idea</td><td>2007</td><td>Scientist</td><td>2</td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-37749033035&partnerID=40&md5=2963cb1a67d5ffe45b0bed31a43f7471</td></tr>
+
<td>39</td>
-
<tr><td>Brown J.</td><td>The iGEM competition: Building with biology</td><td>2007</td><td>IET Synthetic Biology</td><td>7</td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-34547756879&partnerID=40&md5=7a4f9ee6f657126d039b6d14c32facdf</td></tr>
+
<td>69</td>
-
<tr><td>Haynes K.A.</td><td> Broderick M.L.</td><td> Brown A.D.</td><td> Butner T.L.</td><td> Harden L.</td><td> Heard L.</td><td> Jessen E.</td><td> Malloy K.</td><td> Ogden B.</td><td> Rosemond S.</td><td> Simpson S.</td><td> Zwack E.</td><td> Campbell A.M.</td><td> Eckdahl T.</td><td> Heyer L.J.</td><td> Poet J.L."</td><td>Computing with living hardware</td><td>2007</td><td>IET Synthetic Biology</td><td>2</td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-34547766474&partnerID=40&md5=5cbb92ec25cb1f7c8948678ed87f1eb9</td></tr>
+
<td>17/7/2012</td>
-
<tr><td>Dabholkar S.</td><td> Thattai M."</td><td>Brainstorming biology</td><td>2007</td><td>IET Synthetic Biology</td><td>1</td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-34547770766&partnerID=40&md5=1708fc19ef10eb3ad147917e93f98917</td></tr>
+
</tr>
-
<tr><td>Brown T.</td><td> Chang C.</td><td> Heinze B.</td><td> Hollinger P.</td><td> Kittleson J.</td><td> MacDow K.</td><td> Reavis D.</td><td> Curry J.</td><td> Riley M."</td><td>Development of an inducible three colour bacterial water colour system</td><td>2007</td><td>IET Synthetic Biology</td><td>1</td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-34547792851&partnerID=40&md5=87694ffa9b4e84807270bcd42a7ea429</td></tr>
+
 
-
<tr><td>Rodrigo G.</td><td> Montagud A.</td><td> Aparici A.</td><td> Aroca M.C.</td><td> Baguena M.</td><td> Carrera J.</td><td> Edo C.</td><td> Fernandez-De-Cordoba P.</td><td> Ferrando A.</td><td> Fuertes G.</td><td> Gimenez D.</td><td> Mata C.</td><td> Medrano J.V.</td><td> Navarrete C.</td><td> Navarro E.</td><td> Salgado J.</td><td> Tortosa P.</td><td> Urchueguia J.</td><td> Jaramillo A."</td><td>Vanillin cell sensor</td><td>2007</td><td>IET Synthetic Biology</td><td>1</td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-34547815072&partnerID=40&md5=f455a6c61e37a0c6f9f6b7b9dfb78584</td></tr>
+
<tr>
-
<tr><td>Quiroz J.N.A.</td><td> Flores R.B.</td><td> Cisneros T.G.B.</td><td> Rosales I.Y.F.</td><td> Naranjo A.G.</td><td> Sanchez J.C.G.</td><td> Jimenez M.E.G.</td><td> Padilla R.E.G.</td><td> Baena A.J.L.</td><td> Hernandez P.A.L.</td><td> Padilla P.G.</td><td> Miller R.P.</td><td> Gaspar I.N.R.</td><td> Chico J.C.R.</td><td> Martinez A.R.</td><td> Romero J.P.</td><td> Arzate A.S.</td><td> Barradas J.S.A.</td><td> Diaz D.A.</td><td> Bracho A.B.</td><td> Benitez C.</td><td> Arteag C.I.F.</td><td> Quiroz F.H.</td><td> Martinez G.J.</td><td> Rabadan J.L.</td><td> Salvador M.C.O.</td><td> Longoria P.P.</td><td> Orozco R.P.</td><td> Corona F.R.</td><td> Majarrez E.S.</td><td> Hassan E.S.</td><td> Sanchez C.S.</td><td> Saldana U.V.</td><td> Segura P.B.Z."</td><td>Biological implementation of algorithms and unconventional computing</td><td>2007</td><td>IET Synthetic Biology</td><td></td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-34547735452&partnerID=40&md5=d8366832fc70edd0d1fe9fde991ec608</td></tr>
+
<td><a href="https://docs.google.com/folder/d/0B6xW2A1PIytESjJ4SUlteG9rUEU/edit?pli=1&docId=0AqxW2A1PIytEdFBWY3B5X0tSQl9NTUFCTDhDVGU1dWc">4</a></td>
-
<tr><td>King P.</td><td> Lavrovsky V.</td><td> Von Mammen S.</td><td> Jacob C."</td><td>Teaching bacteria how to dance</td><td>2007</td><td>IET Synthetic Biology</td><td></td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-34547753985&partnerID=40&md5=ee8ac0a42229ae0664005b0e99f02df0</td></tr>
+
<td>Papers citing a particular Part</td>
-
<tr><td>Badalamenti J.P.</td><td> Weiss L.E.</td><td> Buckno C.J.</td><td> Richard T.L.</td><td> Weiss P.S.</td><td> Cirino P.C."</td><td>Synthetic sports: A bacterial relay race</td><td>2007</td><td>IET Synthetic Biology</td><td>1</td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-34547744852&partnerID=40&md5=8845204e23c68968ca36d7c44e37fc3e</td></tr>
+
<td><code>partsregistry.org/Part:</code></td>
-
<tr><td>Lohmueller J.</td><td> Neretti N.</td><td> Hickey B.</td><td> Kaka A.</td><td> Gao A.</td><td> Lemon J.</td><td> Lattanzi V.</td><td> Goldstein P.</td><td> Tam L.-K.</td><td> Schmidt M.</td><td> Brodsky A.S.</td><td> Haberstroh K.</td><td> Morgan J.</td><td> Palmore T.</td><td> Wessel G.</td><td> Jaklenec A.</td><td> Urabe H.</td><td> Gagnon J.</td><td> Cumbers J."</td><td>Progress toward construction and modelling of a tri-stable toggle switch in E. coli</td><td>2007</td><td>IET Synthetic Biology</td><td>2</td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-34547820240&partnerID=40&md5=32b2a39638653846cb463e3803da25e6</td></tr>
+
<td>54</td>
-
<tr><td>Ravi M.</td><td> Ngeleka M.</td><td> Kim S.-H.</td><td> Gyles C.</td><td> Berthiaume F.</td><td> Mourez M.</td><td> Middleton D.</td><td> Simko E."</td><td>Contribution of AIDA-I to the pathogenicity of a porcine diarrheagenic Escherichia coli and to intestinal colonization through biofilm formation in pigs</td><td>2007</td><td>Veterinary Microbiology</td><td>10</td><td>http://www.scopus.com/inward/record.url?eid=2-s2.0-33846781661&partnerID=40&md5=9f9eddba925a109f0fe648b758895db6</td></tr>
+
<td>263</td>
 +
<td>5</td>
 +
<td>16</td>
 +
<td>17/7/2012</td>
 +
</tr>
 +
 
 +
<tr>
 +
<td><a href="https://docs.google.com/folder/d/0B6xW2A1PIytESjJ4SUlteG9rUEU/edit?pli=1&docId=0AqxW2A1PIytEdFY5ay1oX0lJNEl5Ql84Y0diaUpaLXc">5</a></td>
 +
<td>Papers mentioning iGEM</td>
 +
<td><code>iGEM OR "International Genetically Engineered Machine"</code></td>
 +
<td>1000</td>
 +
<td>9095</td>
 +
<td>36</td>
 +
<td>64</td>
 +
<td>17/7/2012</td>
 +
</tr>
 +
 
 +
<tr>
 +
<td><a href="https://docs.google.com/folder/d/0B6xW2A1PIytESjJ4SUlteG9rUEU/edit?pli=1&docId=0AqxW2A1PIytEdGVLRGtzLVF4NXFjOFlOR3dsRENuekE">6</a></td>
 +
<td>Papers mentioning iGEM and Registry</td>
 +
<td><code>("iGEM" OR "International Genetically Engineered Machine") AND("Registry of Standard Biological Parts" OR "partsregistry.org" OR "parts.mit.edu")</code></td>
 +
<td>330</td>
 +
<td>2208</td>
 +
<td>23</td>
 +
<td>42</td>
 +
<td>17/7/2012</td>
 +
</tr>
 +
 
</table>
</table>
-
                </div>
+
<p><span class="label label-info">Note</span> Searches were capped at a maximum of 1000 results. Hence getting 1000 results for a query implies that <em>more exist</em>! Those first 1000 are only the ones the search engine judged most relevant.
-
              </div>
+
</p>
-
            </div>
+
</div>
-
            <div class="accordion-group">
+
<div class="span8 extra-spacing">
-
              <div class="accordion-heading">
+
<h2>Why we used Google Scholar</h2>
-
                <a class="accordion-toggle" data-toggle="collapse" data-parent="#accordion2" href="#collapseThree">
+
<p>All in all, we found Google Scholar to most closely meet our analytic needs.</p>
-
                  Collapsible Group Item #3
+
        <p>As WoK, Scopus and PubMed are strictly curated databases and limited in scope, they missed many obviously relevant publications. We also found their search options unsuitable: Many of them did not support either full text search (they looked at titles, keywords and abstracts only) or boolean operators. But for the following reasons, we needed a search to include both:
-
                </a>
+
<ul>
-
              </div>
+
<li>iGEM cannot be expected to always be the main subject of a paper, hence full text search.</li>
-
              <div id="collapseThree" class="accordion-body collapse" style="height: 0px; ">
+
<li>There are many relevant terms floating about iGEM, hence boolean operators like "OR" to allow treating papers that contain "International Genetically Engineered Machine" or the acronym "iGEM" equally.</li>
-
                <div class="accordion-inner">
+
</ul>
-
                  Anim pariatur cliche reprehenderit, enim eiusmod high life accusamus terry richardson ad squid. 3 wolf moon officia aute, non cupidatat skateboard dolor brunch. Food truck quinoa nesciunt laborum eiusmod. Brunch 3 wolf moon tempor, sunt aliqua put a bird on it squid single-origin coffee nulla assumenda shoreditch et. Nihil anim keffiyeh helvetica, craft beer labore wes anderson cred nesciunt sapiente ea proident. Ad vegan excepteur butcher vice lomo. Leggings occaecat craft beer farm-to-table, raw denim aesthetic synth nesciunt you probably haven't heard of them accusamus labore sustainable VHS.
+
        <p></p>
-
                </div>
+
<p>Just how much wider is Google's search scope? Here is an example: PubMed gave so few results (<strong>16</strong> for <code>iGEM genetic*</code>) that we quickly discarded it. Manually merging the Web of Knowledge and Scopus results for the query <code>iGEM AND genetic*</code> (discarding obviously irrelevant results) gave <strong>43</strong> results. Then we queried Google Scholar. It gave us <strong>770</strong> for <code>(“synthetic biology” OR "genetic engineering") AND (“iGEM” OR “International Genetically Engineered Machine")</code>. </p>
-
              </div>
+
<p>Of course, Google Scholar too is but a bronze bullet: It brings its own drawbacks. It is engineered to pick up things that only <em>seem like</em> scholarly articles. Like Google's search results in general, the results are not curated by a human. This has been criticised in the literature (<a href="http://www.emeraldinsight.com/journals.htm?articleid=1550726&show=abstract">Péter J., 2006.</a>). We found the occasional <a href="https://2012.igem.org/File:TokyoTechNotReallyNature.png">hilarious total miss</a>. Google Scholar is also known to somewhat overestimate citation counts (<a href="http://oneentry.wordpress.com/2006/08/11/research-on-citation-search-in-web-of-science-scopus-and-google-scholar/">Iselid, L., 2006.</a>). However, we concluded from empirical manual examination of a random sample that the majority of the results are plausible and (most importantly) <em>far</em> greater in scope than searches in curated databases. Taking these aspects into consideration, we found Google Scholar best fulfilled our requirements.</p>
-
            </div>
+
<p><span class="label label-warning">Caveat!</span> We only wanted to identify trends. Large and coarse pieces of data with some error were sufficient for this. These values should not be taken as exact!</p>
-
          </div>
+
</div>
-
 
+
 
 +
<div class="span4">
 +
<h2>Browse the data</h2>
 +
<p><form action="https://docs.google.com/folder/d/0B6xW2A1PIytESjJ4SUlteG9rUEU/edit">
 +
<button type="submit" class="btn btn-large btn-info" href="https://docs.google.com/folder/d/0B6xW2A1PIytESjJ4SUlteG9rUEU/edit"><img src="https://static.igem.org/mediawiki/2012/6/61/Glyphicons_090_table.png"></img> Our tables</button></form></p>
 +
<p>...are online in a nifty and very usable Google Docs folder.</p>
 +
<p>An introduction is included in case you get lost or want more information.</p>
 +
 
 +
<h2 class="extra-spacing">On extraction tools</h2>
 +
<p>We made extensive use of Harzing Publish or Perish (<a href="http://www.harzing.com/pop.htm">Harzing, A.W., 2007.</a>) to scrape Google Scholar results. The tool has many limitations. However, in our experience it is the best available resource for managing the many locations in which scientific publications are deposited.</p>
 +
<p>We did try other things: Manual methods were too slow for hundreds of papers. Various Firefox browser plugins failed due to the search APIs having changed, were extremely awkward to use or produced clearly erroneous results. The Mac OS program <em>Papers</em> was easy to use and found large numbers of papers (as like Google Scholar it could access many sources), but had unacceptably high rates of error, problems with duplicates and could not export the results into a form we could easily process. Hence <em>Publish or Perish</em>.</p>  
 +
</div>
 +
 
 +
</div>
 +
 
 +
</section> <!-- End "data-collection" -->
 +
 
 +
<!-- Metrics
 +
================================================== -->
 +
<section id="metrics">
 +
<div class="page-header">
 +
<h1>Metrics <small>Quantifying scientific impact</small></h1>
 +
</div>
 +
<p>Once we had scraped our data from Google Scholar, we needed a method to quantify the relevance of a given scientific article. There are many ways you can quantify success of a paper. Here are a few we investigated:</p>
 +
<div class="row-fluid">
 +
<div class="span6">
 +
<div class="well">
 +
<h3>Plain citation count</h3>
 +
<p>High citation count can generally be taken as an indicator of a high-quality or high-impact paper. This is the most traditional method of ranking the influence of papers.</p>
 +
<p>The main disadvantage of the "citation count"-method is its lack of consistent standards. Even papers within a distinct scientific fields will have differing reference approaches and citation counts. It is also significant that old papers have an edge over newer papers, as they've had more time to be cited. </p>
 +
</div>
 +
<div class="well">
 +
<h3>h-index</h3>
 +
<p>The h-index is an integer unique to a set of papers. It is used to measure the output and influence of a set of scientists. A greater h-index implies more productive and more influential authors. It was invented by physicist <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1283832/?tool=pmcentrez">J.E. Hirsch (2005)</a> and has since been automatically calculated by many citation databases. Here is its definition: "A set of papers has h-index <em>h</em> if <em>h</em> papers out of that set have been cited at least <em>h</em> times." An image (<a href="http://en.wikipedia.org/wiki/H-index">"Ael 2" and "Vulpecula", 2012.</a>) clarifies:
 +
<br /><img src="https://static.igem.org/mediawiki/2012/c/c1/HIndex.png"/></p>
 +
</div>
 +
</div>
 +
<div class="span6">
 +
<div class="well">
 +
<h3>g-index</h3>
 +
<p>The g-index is a citation index meant to quantify the influence of papers. It was proposed by <a href="http://www.springerlink.com/content/4119257t25h0852w/?MUD=MP">Leo Egghe (2006)</a> as a variation to the h-index. It puts more emphasis on the most cited papers and Egghe argues that it ranks highly cited authors more fairly. He gives the following definition: "A set of papers has a g-index <em>g</em> if <em>g</em> is the highest rank such that the top <em>g</em> papers have, together, at least <em>g²</em> citations." Here's a clarifying image by our Polish friend <a href="http://en.wikipedia.org/wiki/File:Gindex1.jpg">("Ael 2", 2012.)</a> again:
 +
<br /><img src="https://static.igem.org/mediawiki/2012/d/df/GIndex.png" /></p>
 +
</div>
 +
<div class="well">
 +
<h3>Algorithmic methods</h3>
 +
<p>It's worth noting that there are many other ways of quantifying productivity and impact of a set of papers or scientists. For example, <a href="http://iopscience.iop.org/1367-2630/14/3/033033/">Y.B. Zhou et al (2012)</a> propose a more complete method for "distinguishing prestige from popularity". In their algorithm, the weight of a citation to the influence of a paper is also dependent on the (already calculated) influences of the citing papers and their authors. This requires running a recursive algorithm on sufficiently complete bipartite network of papers and their authors.</p>
 +
</div>
 +
</div><!--close well-->
 +
</div><!--close row-->
 +
<div class="row-fluid">
 +
<div class="span12">
 +
<p>Though the alternatives look enticing, we ended up looking mainly at citation count.</p>
 +
<p>The algorithmic methods were beyond our reach of data availability: We would have had to find the names of every involved person in every iGEM team and all papers they've written, filtering out large amounts of false matches. This was impracticable.</p>
 +
<p>The h and g-indexes don't actually show more than the raw citation counts when it comes to tendencies over time. Also, we had relatively few search queries to compare against each other, given that two were discarded for having various flaws (discussed just below). This meant that the h and g-index, while valuable methods, were not suitable for our particular data analysis.</p>
 +
</div>
 +
 
</section>
</section>
-
    </div><!-- /container -->
+
<!-- Data analysis
 +
================================================== -->
 +
<section id="data-analysis">
 +
<div class="page-header">
 +
<h1>Data analysis <small>What does this data actually mean?</small></h1>
 +
</div>
 +
<h2>Is our data usable?</h2>
 +
<h3>Yes. Mostly...</h3>
 +
<p>Given our doubts over accuracy of Google Scholar's data, we considered it a priority to exercise caution with our search query results. This paid off: data we compiled using the search queries of IDs 5 and 2 had fatal flaws. They were rejected from further analysis (discussed below). The other data sets were found to be suitable.</p>
 +
<p>Our method to examine data suitability was empirical: Random samplings of each data set were passed under human eyes. For <em>most</em> queries, this observation of random subsets showed acceptably low levels of "background static" (i.e. results that Google Scholar had automatically matched to the query, which were not actually relevant). These would form only a drop of error in the ocean of relevant data.</p>
-
   </body>
+
<h3>...but <em>only</em> mostly.</h3>
 +
<p><strong>Query 5</strong> (<code>iGEM OR "International Genetically Engineered Machine"</code>) was found to have an unacceptably large level of static. The reason was quickly identified: Because the two quoted terms in Query ID 5 (<code>"iGEM"</code>, <code>"International Genetically Engineered Machine"</code>) were separated by a disjunction (<code>OR</code>), the query would easily match anything that contained just the acronym "IGEM"! This meant acronyms in economics such as "Inter-temporal General Equilibrium Model (IGEM)" or the British "Institution of Gas Engineers & Managers (IGEM)" and various medical terms and chemical names snuck in. The entire data set with ID 5 was dismissed from further analysis.</p>
 +
<p>The lesson we took from this is not to search for short acronyms by themselves. Query 1 (<code>(“synthetic biology” OR "genetic engineering") AND (“iGEM” OR “International Genetically Engineered Machine")</code>) could be thought of as the "Version Two" of the problematic Query 5. It searches for the same terms, but includes a conjunction (<code>AND</code>) with either <em>synthetic biology</em> or <em>genetic engineering</em>, which bends results toward <em>our</em> iGEM. This tunes static down to an acceptable level.</p>
 +
 
 +
<p><strong>Query 2</strong> (<code>synthetic biology</code>) had a different problem: It was too big. The query was an attempt to capture stats for the entire field of synthetic biology, so we could statistically determine the relative influence of the iGEM competition. However, we had forgotten the 1000-result cap imposed by Google Scholar. It is impossible to retrieve results beyond this 1k "event horizon". Google does not publish information regarding how the order of results is determined. Hence these first 1000 results (out of what are likely to be 10s or 100s of thousands of papers) are all biased by some unknown force. Were more cited papers favoured? Were papers published more recently favoured? No conclusions can be drawn from a biased and small subset of the full data. We also discounted data set 2 from any further analysis.</p>
 +
 
 +
<p>To emphasize: Data sets 2 and 5 are <em>not included</em> in any further analysis.</p>
 +
 
 +
<h2>What's is iGEM's impact?</h2>
 +
<p>What does "impact" mean within a scientific context? We will operate under the assumption that <em>having impact</em> correlates strongly with <em>being mentioned</em> in scholarly articles. Hence, we quantify the impact that a term is having simply by searching for that term, and summing up result counts.</p>
 +
<p>Here is a chart summarising how many papers mention various terms floating about iGEM over time:</p>
 +
<div class="row-fluid">
 +
<div class="span8">
 +
    <a class="thumbnail" href="#modal-attention-summary" data-toggle="modal">
 +
      <img src="https://static.igem.org/mediawiki/2012/a/af/DataSummaryStats.png" alt="">
 +
<p class="label-set">
 +
<span class="label" style="background-color:#ffaf2b">Number of teams</span>
 +
<span class="label" style="background-color:#d2eaa2;color:#353535">Parts submitted (10s)</span>
 +
<span class="label" style="background-color:#81ccf1">Papers mentioning iGEM</span>
 +
<span class="label" style="background-color:#7fb02b">Papers mentioning Registry of Parts</span>
 +
<span class="label" style="background-color:#cc6200">Papers mentioning specific Registry Parts</span></p>
 +
      <h5>Chart 1: <em>Summary of iGEM-related attention over time</em></h5>
 +
      <p>This chart shows the amount of "attention" the iGEM competition is receiving over time, shown relative to the growth of the competition itself (quantified by amount of registered teams). "Attention" is measured by amount of papers mentioning certain keywords deemed to be relevant to iGEM or the Registry of Standard Parts. An overall positive trend in proportion with the growth of the competition can be seen.</p>
 +
</a>
 +
</div>
 +
 
 +
<div class="span4">
 +
<h3>And the answer is...</h3>
 +
<p>Things are looking good for iGEM! Since the first iGEM competition in 2003, more teams are participating each year and more parts are being submitted. The efforts to expose iGEM to the community has paid off - the amount of scientific papers mentioning iGEM and the Registry of Standard Parts has risen, proportional to the increase of the competition itself.</p>
 +
<p>Papers mentioning a specific Registry BioBrick have only begun to appear in recent years, but the numbers show growth. We hypothesize that the Registry's contents is only now reaching the critical mass to become a useful research tool. The founders' dream (<a href="https://igem.org/Press_Kit">iGEM Foundation, 2012b</a>) of "making genetics modular" is becoming reality.</p>
 +
 
 +
<h3>Where this data comes from</h3>
 +
<p>Data about participating teams and number of submitted BioBricks comes from the <a href="https://igem.org/Previous_iGEM_Competitions">iGEM Foundation (2012a)</a>. The other data sets come from the results Google Scholar queries with IDs 1, 3 and 4 (see query summary table above).</p>
 +
</div>
 +
</div>
 +
<h2>Is the relationship between iGEM and the <abbr title="Registry of Standard Biological Parts">Registry</abbr> clear?</h2>
 +
<div class="row-fluid">
 +
<div class="span12 well">
 +
    <ul class="thumbnails">
 +
    <li class="span4">
 +
    <a href="#modal-igem-attention" data-toggle="modal" class="thumbnail">
 +
    <img src="https://static.igem.org/mediawiki/2012/1/12/Data1Chart_ignoresBefore2k.png" alt="">
 +
    <p class="label-set">
 +
    <span class="label" style="background-color:#ff7900">Papers published in that year</span>
 +
    <span class="label" style="background-color:#b0db54">Times those papers were cited</span>
 +
    </p>
 +
    <p><h5>Chart 2: <em>"iGEM" attention</em></h5></p>
 +
    </a>
 +
    </li>
 +
    <li class="span4">
 +
    <a href="#modal-registry-attention" data-toggle="modal" class="thumbnail">
 +
    <img src="https://static.igem.org/mediawiki/2012/3/3f/Data3Chart.png" >
 +
    <p class="label-set">
 +
    <span class="label" style="background-color:#ff7900">Papers published in that year</span>
 +
    <span class="label" style="background-color:#b0db54">Times those papers were cited</span>
 +
    </p>
 +
    <p><h5>Chart 3: <em>"Registry" attention</em></h5></p>
 +
    </a>
 +
    </li>
 +
    <li class="span4">
 +
    <a href="#modal-igem-registry-attention" data-toggle="modal" class="thumbnail">
 +
    <img src="https://static.igem.org/mediawiki/2012/e/ee/Data6Chart.png" alt="">
 +
    <p class="label-set">
 +
    <span class="label" style="background-color:#ff7900">Papers published in that year</span>
 +
    <span class="label" style="background-color:#b0db54">Times those papers were cited</span>
 +
    </p>
 +
    <p><h5>Chart 4: <em>"iGEM and Registry" attention</em></h5></p>
 +
    </a>
 +
    </li>
 +
    </ul>
 +
<p>These three graphs show the amount of papers published each year containing certain search queries, as well as the number of times these papers were cited. All graphs show positive tendencies: the competition is becoming more wide-spread and more iGEM-related papers are being published and recognized. The search queries were chosen to show which part of iGEM is usually cited: the iGEM competition, the Registry of Standard Parts, or both. The data shows that only around half of papers will cite both elements. Some far outlying data points (clear errors) were not plotted to maintain uniform scale.</p>
 +
</div>
 +
</div>
 +
 
 +
<div class="row">
 +
<div class="span5">
 +
<h3>Expectations</h3>
 +
<p>One of the iGEM competition's important goals is to build up well-characterized BioBrick content in the Registry. Thus, the iGEM competition and the Registry are inherently linked. Hence we would expect publications that mention iGEM to also refer to the Registry. However, as the Registry is not <em>only</em> used by iGEM, we expected to also find a large number of papers that mention the Registry <em>without</em> mentioning iGEM.</p>
 +
<p>In other words, we expected:
 +
<ul>
 +
<li>a <em>large proportion</em> of papers that mention <em>iGEM and the Registry</em>,</li>
 +
<li>a <em>large proportion</em> that mention <em>only the Registry</em> and
 +
<li>only a <em>very small proportion</em> that mention <em>only iGEM</em>.</li>
 +
</ul>
 +
</p>
 +
<h3>Yes and No</h3>
 +
<p>The expected proportion of papers mentioning <em>only the Registry</em> was found. However, of the papers that mention iGEM, about half do not mention the Registry. This was a greater proportion than we expected.</p>
 +
<p>We theorise that this may be due to the iGEM Foundation and iGEM teams <em>under-emphasizing</em> their importance to part standardisation. Indeed iGEM is usually presented by the Foundation and teams as "a synthetic biology competition", when really that's just half the picture. We're not just competing, but we also exist for a greater good: <em>Make the Registry better</em> and do our part in helping organise synthetic biology!</p>
 +
</div>
 +
<div class="span7">
 +
<a href="#modal-proportional-mentions" data-toggle="modal" class="thumbnail">
 +
<img src="https://static.igem.org/mediawiki/2012/1/15/DataIgemVsRegistry.png" />
 +
    <p class="label-set">
 +
    <span class="label" style="background-color:#b0db54">Just iGEM</span>
 +
    <span class="label" style="background-color:#ff7900">iGEM and Parts Registry</span>
 +
    <span class="label" style="background-color:#81ccf1">Just Parts Registry</span>
 +
    </p>
 +
<h5>Chart 5: <em>Proportional mentions of iGEM, Registry and Both</em></h5>
 +
<p>This chart is a <strong>stacked</strong> combination of the three charts above. It shows how often <em>iGEM</em>, <em>the Registry</em> and <em>both</em> are mentioned in scientific articles (i.e. how many papers from each year match our search terms). Each of the three quantities has stayed <em>roughly equal</em> to the others throughout the years.</p>
 +
</a><!-- /thumbnail -->
 +
</div>
 +
</div><!-- Close row -->
 +
</section><!-- Close data analysis -->
 +
 
 +
 
 +
<section id="citations">
 +
<div style="background-image:url('https://static.igem.org/mediawiki/2012/6/61/Papers_colour.jpg');background-position:top right;background-repeat:no-repeat;background-size:30%" />
 +
<div class="row">
 +
<div class="span9">
 +
<div class="page-header">
 +
<h1>Citations <small>How is iGEM cited and what can we learn from it?</small></h1>
 +
</div>
 +
<div class="row-fluid">
 +
<div class="span6">
 +
<h2>Sourcing data</h2>
 +
<p>In order to analyse how exactly iGEM and the Registry are being cited, we decided to manually examine a set of the papers in our results. We had discarded Scopus and Web of Knowledge earlier when carrying out wide-range data collection. However, for this focused search, the small but certain selection of papers that Scopus and Web of Knowledge gave us was perfect!</p>
 +
 
 +
<p>We manually combined the results obtained from Scopus and Web of Knowledge. We then deleted duplicates and the few remaining irrelevant publications.</p>
 +
 
 +
<p>The keyword “iGEM” gave <strong>41</strong> publications combined from Scopus and <acronym title="Web of Knowledge">WoK</acronym>. Of these, we discarded 16 publications for various reasons:
 +
<ul>
 +
<li>5 texts were non-research articles, such as magazine articles</li>
 +
<li>2 were articles on bioethics that mentioned iGEM only in passing</li>
 +
<li>1 article cited an articles with “iGEM” in the title but was itself unrelated</li>
 +
<li>6 were articles we couldn’t locate or access despite the University of St Andrews having subscriptions to various publishers</li>
 +
<li>2 were in French.</li>
 +
</ul>
 +
</p>
 +
 
 +
<p>This left <strong>25</strong> articles.</p>
 +
</div>
 +
<div class="span6">
 +
<h2>Discussion</h2>
 +
 
 +
<p>We considered citations <em>sufficient</em> when they cited the Registry or named the specific BioBricks used (when appropriate) <em>and</em> citing either the Knight or Endy paper about BioBrick assembly.</p>
 +
 
 +
<p>We examined those 25 articles for how they cited iGEM and the Registry. We found that <strong>11</strong> articles cited iGEM and/or the Registry of Standard Biological Parts satisfactorily.</p>
 +
 
 +
<p>There were two common forms of citation:</p>
 +
<p>
 +
<ul>
 +
<li><em>Registry of Standard Biological Parts [http://www.partsregistry.org].</em></li>
 +
<li><em>Knight, T. F. (2003). Idempotent Vector Design for Standard Assembly of Biobricks.<br />
 +
DOI: 1721.1/21168.</em></li>
 +
</ul>
 +
</p>
 +
 
 +
<p>5 articles did not cite the Registry or BioBricks sufficiently according to our demands.  For one of them, we question its content relevancy. The remaining 4 show examples of simple in-text citation of the Registry, with or without a hyperlink, no mention of the specific BioBrick used, or no mention of BioBricks at all. Surprisingly, each of these papers was associated with an iGEM team.</p>
 +
 
 +
<p>There are 9 additional papers we found in the journal IELTS Synthetic Biology, all of which show vast variations in content, research, and citation quality. The journal seems to have been published once, and seems to have asked all 2006 iGEM finalists to submit a paper based on their research. This resulted in some sub-par articles. Because they weren't peer-reviewed and due to their strong biasing affiliation with iGEM, we disregarded them from our overall data set.</p>
 +
</div>
 +
</div>
 +
</div>
 +
<div class="span3"></div>
 +
</div>
 +
<div class="well">
 +
<h2>Our recommendations</h2>
 +
<p>Reviewing the data, we have concluded that there are standard methods of citation being used by the scientific community to refer to the Parts Registry. In order for the Registry to uphold <em>referencing standards</em> as well as Parts standards within synthetic biology, we think this method of referencing should be officially recommended on the Registry and iGEM website.</p>
 +
 
 +
<p>A clear and standard citation method would support teams who are attempting to publish and let them set an example of citation style to the rest of the scientific community. Additionally, clearly stating a standard method of citation would make citing of the Registry easier and so motivate its citation in general. Of course, more citations means more attention and adoption within the field of synthetic biology.</p>
 +
 
 +
<p>Teams participating in iGEM should be encouraged to cite properly and to try to publish their work. A tutorial of some kind hosted on the iGEM website would help. Such things (among many other bright ideas) were proposed in 2008  by <a href="http://openwetware.org/wiki/User:Macowell/Making_iGEM_Better">Cowell</a>, but haven't been implemented. We see a standardised citation method as a high priority for maximising iGEM's scientific influence.</p>
 +
</div>
 +
</section>
 +
 
 +
<section id="conclusion">
 +
<div class="page-header">
 +
<h1>Conclusion <small>Our <em>Human Practices</em> in a nutshell...</small></h1>
 +
</div>
 +
<p>We found that <span style="background-color:#ecf7d5"><strong>the iGEM competition is making a positive impact.</strong></span> The competition is growing in size and scope, and both iGEM and the Registry are netting a proportionally increasing amount of attention from the scientific community. We are doing well!</p>
 +
 
 +
<p>However, we also found that <span style="background-color:#d1edf9"><strong>quite a number of discussions of iGEM <em>miss</em> the important connection between our iGEM competition and the Parts Registry</strong></span>. We recommend that the iGEM Foundation and future teams emphasise the iGEM competition's <abbr title="reason for existence">raison d'être</abbr> clearly in the future.</p>
 +
 
 +
<p>We also noticed that <span style="background-color:#ffe6cd"><strong>some papers do not give sufficient or clear credit to the Registry.</strong></span> We interpret this as confusion as to how the Registry should be cited. We recommend that a standardised referencing should be introduced. This would support the publishing process for inexperienced undergraduates involved in the competition, gather greater attention for the iGEM Foundation and Parts Registry and hence further our aims of synthetic biology standardisation. ∎</p>
 +
</section>
 +
 
 +
<!-- References
 +
================================================== -->
 +
<section id="references">
 +
<div class="page-header">
 +
<h1>References</h1>
 +
</div>
 +
 
 +
<p><strong>"Ael 2" and "Vulpecula", 2012.</strong> h-index (Hirsch). <i>Wikipedia</i>. [image online] Available at: &lt;<a href="http://en.wikipedia.org/wiki/File:H-index-en.svg">http://en.wikipedia.org/wiki/File:H-index-en.svg</a>&gt; [Accessed Jul 27, 2012].</p>
 +
 
 +
<p><strong>"Ael 2", 2012.</strong> Illustrated example for the g-index proposed by Egghe. <i>Wikipedia</i> [image online]  Available at: &lt;<a href="http://en.wikipedia.org/wiki/File:Gindex1.jpg">http://en.wikipedia.org/wiki/File:Gindex1.jpg</a>&gt; [Accessed Jul 27, 2012].</p>
 +
 
 +
<p><strong>Cowell, M.L., 2008.</strong> Making iGEM Better. [web page] Available at &lt;<a href="http://openwetware.org/wiki/User:Macowell/Making_iGEM_Better">http://openwetware.org/wiki/User:Macowell/Making_iGEM_Better</a>&gt; [Accessed Jul 19, 2012].</p>
 +
 
 +
<p><strong>Egghe, L., 2006.</strong> Theory and practise of the g-index. <i>Scientometrics</i> [online], Volume 69 (Issue 1), p.131-152. Available at: &lt;<a href="www.springerlink.com/content/4119257t25h0852w/?MUD=MP">www.springerlink.com/content/4119257t25h0852w/?MUD=MP</a>&gt; [Accessed Jun 7, 2012].</p>
 +
 
 +
<p><strong>Harzing, A.W., 2007.</strong> Publish or Perish. [computer program] Available from &lt;<a href="http://www.harzing.com/pop.htm">http://www.harzing.com/pop.htm</a>&gt;</p>
 +
 
 +
<p><strong>Hirsch, J.E., 2005.</strong> An index to quantify an individual's scientific research output. <i>Proceedings of the National Academy of Sciences of the United States of America</i>, Volume 102 (Issue 46). [online] Available at: http://&lt;<a href="www.ncbi.nlm.nih.gov/pmc/articles/PMC1283832/?tool=pmcentrez">www.ncbi.nlm.nih.gov/pmc/articles/PMC1283832/?tool=pmcentrez</a>&gt; [Accessed 5th Jul, 2012]</p>
 +
 
 +
<p><strong>iGEM Foundation, 2012a.</strong> Previous iGEM Competitions. [web page] Available at: &lt;<a href="https://igem.org/Previous_iGEM_Competitions">https://igem.org/Previous_iGEM_Competitions</a>&gt; [Accessed Jul 30, 2012]</p>
 +
 
 +
<p><strong>iGEM Foundation, 2012b.</strong> Press Kit. [web page] Available at: &lt;<a href="https://igem.org/Press_Kit">https://igem.org/Press_Kit</a>&gt; [Accessed Aug 3, 2012]</p>
 +
 
 +
<p><strong>Iselid, L., 2006.</strong> Research on citation search in Web of Science, Scopus and Google Scholar. <i>One Entry to Research</i> [blog] Available at: &lt;<a href="http://oneentry.wordpress.com/2006/08/11/research-on-citation-search-in-web-of-science-scopus-and-google-scholar/">http://oneentry.wordpress.com/2006/08/11/research-on-citation-search-in-web-of-science-scopus-and-google-scholar/</a>&gt; [Accessed Jun 20, 2012].</p>
 +
 
 +
<p><strong>Péter J., 2006.</strong> Dubious hit counts and cuckoo's eggs. <i>Online Information Review</i> [online] Volume 30 (Issue 2) p.188-193. Available at: &lt;<a href="http://www.emeraldinsight.com/journals.htm?articleid=1550726&show=abstract">http://www.emeraldinsight.com/journals.htm?articleid=1550726&show=abstract</a>&gt; [Accessed Jun 20, 2012].</p>
 +
 
 +
<p><strong>Zhou  Y., Liyan L. and Menghui L., 2012.</strong> Quantifying the influence of scientists and their publications: distinguishing between prestige and popularity. <i>New Journal of Physics</i>, [online] Volume 14 (March 2012) Available at: &lt;<a href="http://iopscience.iop.org/1367-2630/14/3/033033/">http://iopscience.iop.org/1367-2630/14/3/033033/</a>&gt; [Accessed Jun 7, 2012].</p>
 +
 
 +
</section><!-- Close references -->
 +
 
 +
 
 +
</div><!-- /container -->
 +
 
 +
<!-- Chart modals -->
 +
 
 +
<div class="modal large hide" id="modal-attention-summary">
 +
   <div class="modal-header">
 +
    <button type="button" class="close" data-dismiss="modal">×</button>
 +
    <h3>Chart zoom: <em>iGEM success summary</em></h3>
 +
  </div>
 +
  <div class="modal-body">
 +
    <img src="https://static.igem.org/mediawiki/2012/a/af/DataSummaryStats.png" alt="">
 +
  </div>
 +
</div>
 +
 
 +
<div class="modal large hide" id="modal-igem-attention">
 +
  <div class="modal-header">
 +
    <button type="button" class="close" data-dismiss="modal">×</button>
 +
    <h3>Chart zoom: <em>iGEM attention</em></h3>
 +
  </div>
 +
  <div class="modal-body">
 +
    <img src="https://static.igem.org/mediawiki/2012/1/12/Data1Chart_ignoresBefore2k.png" alt="">
 +
  </div>
 +
</div>
 +
 
 +
 
 +
<div class="modal large hide" id="modal-igem-registry-attention">
 +
  <div class="modal-header">
 +
    <button type="button" class="close" data-dismiss="modal">×</button>
 +
    <h3>Chart zoom: <em>iGEM and Registry attention</em></h3>
 +
  </div>
 +
  <div class="modal-body">
 +
    <img src="https://static.igem.org/mediawiki/2012/e/ee/Data6Chart.png" alt="">
 +
  </div>
 +
</div>
 +
 
 +
<div class="modal large hide" id="modal-registry-attention">
 +
  <div class="modal-header">
 +
    <button type="button" class="close" data-dismiss="modal">×</button>
 +
    <h3>Chart zoom: <em>Registry attention</em></h3>
 +
  </div>
 +
  <div class="modal-body">
 +
    <img src="https://static.igem.org/mediawiki/2012/3/3f/Data3Chart.png" alt="">
 +
  </div>
 +
</div>
 +
 
 +
<div class="modal large hide" id="modal-proportional-mentions">
 +
  <div class="modal-header">
 +
    <button type="button" class="close" data-dismiss="modal">×</button>
 +
    <h3>Chart zoom: <em>Proportional mentions</em></h3>
 +
  </div>
 +
  <div class="modal-body">
 +
    <img src="https://static.igem.org/mediawiki/2012/1/15/DataIgemVsRegistry.png" alt="">
 +
  </div>
 +
</div>
 +
 
 +
<!-- End of chart modals -->
 +
 
 +
</body>
</html>
</html>
 +
{{:Team:St_Andrews/Template:Footer}}
{{:Team:St_Andrews/Template:Footer}}

Latest revision as of 20:00, 26 September 2012

Scientific impact of iGEM

"Most influential synthetic biology competition" vs. "Just some kids playing"?

We wanted to determine how relevant the iGEM competition is for the greater SynBio community. So we investigated the scientific attention garnered by both the iGEM and the Registry of Standard Parts. A data-driven approach was chosen: We extracted data from search results using various queries (such as ("iGEM" OR "International Genetically Engineered Machine") AND ("synthetic biology" OR "genetic engineering")) from various publication search engines. We searched Web of Knowledge, Scopus, PubMed and Google Scholar. Google Scholar was chosen to perform more detailed data analysis, as we found the alternatives to have various shortcomings.

We found that our data results are conclusive with our initial hypothesis: iGEM is an important contributor to the SynBio community. These findings have some implications for the iGEM competition, which we discuss.

In order to quantify these results further, we analyzed how exactly iGEM and the Registry has been cited. Examining around 50 papers in closer detail, we recommend all papers published by iGEM teams or related to iGEM or the Registry to use a standard citation.

Query summary

Here's a quick breakdown of what we queried for on Google Scholar and what sort of data was returned. (The ID matches the name of the data set in our data tables). The h- and g-indexes are explained just below!

Dataset ID Plain English query Query Nº Papers Nº Citations h-index g-index Query date
1 Papers mentioning iGEM in context of synbio (“synthetic biology” OR "genetic engineering") AND (“iGEM” OR “International Genetically Engineered Machine") 770 3253 26 45 17/7/2012
2 All synthetic biology synthetic biology 1000 68482 127 214 17/7/2012
3 Papers mentioning Registry of Parts "Registry of Standard Biological Parts" OR "partsregistry.org" OR "parts.mit.edu" 751 6442 39 69 17/7/2012
4 Papers citing a particular Part partsregistry.org/Part: 54 263 5 16 17/7/2012
5 Papers mentioning iGEM iGEM OR "International Genetically Engineered Machine" 1000 9095 36 64 17/7/2012
6 Papers mentioning iGEM and Registry ("iGEM" OR "International Genetically Engineered Machine") AND("Registry of Standard Biological Parts" OR "partsregistry.org" OR "parts.mit.edu") 330 2208 23 42 17/7/2012

Note Searches were capped at a maximum of 1000 results. Hence getting 1000 results for a query implies that more exist! Those first 1000 are only the ones the search engine judged most relevant.

Why we used Google Scholar

All in all, we found Google Scholar to most closely meet our analytic needs.

As WoK, Scopus and PubMed are strictly curated databases and limited in scope, they missed many obviously relevant publications. We also found their search options unsuitable: Many of them did not support either full text search (they looked at titles, keywords and abstracts only) or boolean operators. But for the following reasons, we needed a search to include both:

  • iGEM cannot be expected to always be the main subject of a paper, hence full text search.
  • There are many relevant terms floating about iGEM, hence boolean operators like "OR" to allow treating papers that contain "International Genetically Engineered Machine" or the acronym "iGEM" equally.

Just how much wider is Google's search scope? Here is an example: PubMed gave so few results (16 for iGEM genetic*) that we quickly discarded it. Manually merging the Web of Knowledge and Scopus results for the query iGEM AND genetic* (discarding obviously irrelevant results) gave 43 results. Then we queried Google Scholar. It gave us 770 for (“synthetic biology” OR "genetic engineering") AND (“iGEM” OR “International Genetically Engineered Machine").

Of course, Google Scholar too is but a bronze bullet: It brings its own drawbacks. It is engineered to pick up things that only seem like scholarly articles. Like Google's search results in general, the results are not curated by a human. This has been criticised in the literature (Péter J., 2006.). We found the occasional hilarious total miss. Google Scholar is also known to somewhat overestimate citation counts (Iselid, L., 2006.). However, we concluded from empirical manual examination of a random sample that the majority of the results are plausible and (most importantly) far greater in scope than searches in curated databases. Taking these aspects into consideration, we found Google Scholar best fulfilled our requirements.

Caveat! We only wanted to identify trends. Large and coarse pieces of data with some error were sufficient for this. These values should not be taken as exact!

Browse the data

...are online in a nifty and very usable Google Docs folder.

An introduction is included in case you get lost or want more information.

On extraction tools

We made extensive use of Harzing Publish or Perish (Harzing, A.W., 2007.) to scrape Google Scholar results. The tool has many limitations. However, in our experience it is the best available resource for managing the many locations in which scientific publications are deposited.

We did try other things: Manual methods were too slow for hundreds of papers. Various Firefox browser plugins failed due to the search APIs having changed, were extremely awkward to use or produced clearly erroneous results. The Mac OS program Papers was easy to use and found large numbers of papers (as like Google Scholar it could access many sources), but had unacceptably high rates of error, problems with duplicates and could not export the results into a form we could easily process. Hence Publish or Perish.

Once we had scraped our data from Google Scholar, we needed a method to quantify the relevance of a given scientific article. There are many ways you can quantify success of a paper. Here are a few we investigated:

Plain citation count

High citation count can generally be taken as an indicator of a high-quality or high-impact paper. This is the most traditional method of ranking the influence of papers.

The main disadvantage of the "citation count"-method is its lack of consistent standards. Even papers within a distinct scientific fields will have differing reference approaches and citation counts. It is also significant that old papers have an edge over newer papers, as they've had more time to be cited.

h-index

The h-index is an integer unique to a set of papers. It is used to measure the output and influence of a set of scientists. A greater h-index implies more productive and more influential authors. It was invented by physicist J.E. Hirsch (2005) and has since been automatically calculated by many citation databases. Here is its definition: "A set of papers has h-index h if h papers out of that set have been cited at least h times." An image ("Ael 2" and "Vulpecula", 2012.) clarifies:

g-index

The g-index is a citation index meant to quantify the influence of papers. It was proposed by Leo Egghe (2006) as a variation to the h-index. It puts more emphasis on the most cited papers and Egghe argues that it ranks highly cited authors more fairly. He gives the following definition: "A set of papers has a g-index g if g is the highest rank such that the top g papers have, together, at least citations." Here's a clarifying image by our Polish friend ("Ael 2", 2012.) again:

Algorithmic methods

It's worth noting that there are many other ways of quantifying productivity and impact of a set of papers or scientists. For example, Y.B. Zhou et al (2012) propose a more complete method for "distinguishing prestige from popularity". In their algorithm, the weight of a citation to the influence of a paper is also dependent on the (already calculated) influences of the citing papers and their authors. This requires running a recursive algorithm on sufficiently complete bipartite network of papers and their authors.

Though the alternatives look enticing, we ended up looking mainly at citation count.

The algorithmic methods were beyond our reach of data availability: We would have had to find the names of every involved person in every iGEM team and all papers they've written, filtering out large amounts of false matches. This was impracticable.

The h and g-indexes don't actually show more than the raw citation counts when it comes to tendencies over time. Also, we had relatively few search queries to compare against each other, given that two were discarded for having various flaws (discussed just below). This meant that the h and g-index, while valuable methods, were not suitable for our particular data analysis.

Is our data usable?

Yes. Mostly...

Given our doubts over accuracy of Google Scholar's data, we considered it a priority to exercise caution with our search query results. This paid off: data we compiled using the search queries of IDs 5 and 2 had fatal flaws. They were rejected from further analysis (discussed below). The other data sets were found to be suitable.

Our method to examine data suitability was empirical: Random samplings of each data set were passed under human eyes. For most queries, this observation of random subsets showed acceptably low levels of "background static" (i.e. results that Google Scholar had automatically matched to the query, which were not actually relevant). These would form only a drop of error in the ocean of relevant data.

...but only mostly.

Query 5 (iGEM OR "International Genetically Engineered Machine") was found to have an unacceptably large level of static. The reason was quickly identified: Because the two quoted terms in Query ID 5 ("iGEM", "International Genetically Engineered Machine") were separated by a disjunction (OR), the query would easily match anything that contained just the acronym "IGEM"! This meant acronyms in economics such as "Inter-temporal General Equilibrium Model (IGEM)" or the British "Institution of Gas Engineers & Managers (IGEM)" and various medical terms and chemical names snuck in. The entire data set with ID 5 was dismissed from further analysis.

The lesson we took from this is not to search for short acronyms by themselves. Query 1 ((“synthetic biology” OR "genetic engineering") AND (“iGEM” OR “International Genetically Engineered Machine")) could be thought of as the "Version Two" of the problematic Query 5. It searches for the same terms, but includes a conjunction (AND) with either synthetic biology or genetic engineering, which bends results toward our iGEM. This tunes static down to an acceptable level.

Query 2 (synthetic biology) had a different problem: It was too big. The query was an attempt to capture stats for the entire field of synthetic biology, so we could statistically determine the relative influence of the iGEM competition. However, we had forgotten the 1000-result cap imposed by Google Scholar. It is impossible to retrieve results beyond this 1k "event horizon". Google does not publish information regarding how the order of results is determined. Hence these first 1000 results (out of what are likely to be 10s or 100s of thousands of papers) are all biased by some unknown force. Were more cited papers favoured? Were papers published more recently favoured? No conclusions can be drawn from a biased and small subset of the full data. We also discounted data set 2 from any further analysis.

To emphasize: Data sets 2 and 5 are not included in any further analysis.

What's is iGEM's impact?

What does "impact" mean within a scientific context? We will operate under the assumption that having impact correlates strongly with being mentioned in scholarly articles. Hence, we quantify the impact that a term is having simply by searching for that term, and summing up result counts.

Here is a chart summarising how many papers mention various terms floating about iGEM over time:

And the answer is...

Things are looking good for iGEM! Since the first iGEM competition in 2003, more teams are participating each year and more parts are being submitted. The efforts to expose iGEM to the community has paid off - the amount of scientific papers mentioning iGEM and the Registry of Standard Parts has risen, proportional to the increase of the competition itself.

Papers mentioning a specific Registry BioBrick have only begun to appear in recent years, but the numbers show growth. We hypothesize that the Registry's contents is only now reaching the critical mass to become a useful research tool. The founders' dream (iGEM Foundation, 2012b) of "making genetics modular" is becoming reality.

Where this data comes from

Data about participating teams and number of submitted BioBricks comes from the iGEM Foundation (2012a). The other data sets come from the results Google Scholar queries with IDs 1, 3 and 4 (see query summary table above).

Is the relationship between iGEM and the Registry clear?

These three graphs show the amount of papers published each year containing certain search queries, as well as the number of times these papers were cited. All graphs show positive tendencies: the competition is becoming more wide-spread and more iGEM-related papers are being published and recognized. The search queries were chosen to show which part of iGEM is usually cited: the iGEM competition, the Registry of Standard Parts, or both. The data shows that only around half of papers will cite both elements. Some far outlying data points (clear errors) were not plotted to maintain uniform scale.

Expectations

One of the iGEM competition's important goals is to build up well-characterized BioBrick content in the Registry. Thus, the iGEM competition and the Registry are inherently linked. Hence we would expect publications that mention iGEM to also refer to the Registry. However, as the Registry is not only used by iGEM, we expected to also find a large number of papers that mention the Registry without mentioning iGEM.

In other words, we expected:

  • a large proportion of papers that mention iGEM and the Registry,
  • a large proportion that mention only the Registry and
  • only a very small proportion that mention only iGEM.

Yes and No

The expected proportion of papers mentioning only the Registry was found. However, of the papers that mention iGEM, about half do not mention the Registry. This was a greater proportion than we expected.

We theorise that this may be due to the iGEM Foundation and iGEM teams under-emphasizing their importance to part standardisation. Indeed iGEM is usually presented by the Foundation and teams as "a synthetic biology competition", when really that's just half the picture. We're not just competing, but we also exist for a greater good: Make the Registry better and do our part in helping organise synthetic biology!

Sourcing data

In order to analyse how exactly iGEM and the Registry are being cited, we decided to manually examine a set of the papers in our results. We had discarded Scopus and Web of Knowledge earlier when carrying out wide-range data collection. However, for this focused search, the small but certain selection of papers that Scopus and Web of Knowledge gave us was perfect!

We manually combined the results obtained from Scopus and Web of Knowledge. We then deleted duplicates and the few remaining irrelevant publications.

The keyword “iGEM” gave 41 publications combined from Scopus and WoK. Of these, we discarded 16 publications for various reasons:

  • 5 texts were non-research articles, such as magazine articles
  • 2 were articles on bioethics that mentioned iGEM only in passing
  • 1 article cited an articles with “iGEM” in the title but was itself unrelated
  • 6 were articles we couldn’t locate or access despite the University of St Andrews having subscriptions to various publishers
  • 2 were in French.

This left 25 articles.

Discussion

We considered citations sufficient when they cited the Registry or named the specific BioBricks used (when appropriate) and citing either the Knight or Endy paper about BioBrick assembly.

We examined those 25 articles for how they cited iGEM and the Registry. We found that 11 articles cited iGEM and/or the Registry of Standard Biological Parts satisfactorily.

There were two common forms of citation:

  • Registry of Standard Biological Parts [http://www.partsregistry.org].
  • Knight, T. F. (2003). Idempotent Vector Design for Standard Assembly of Biobricks.
    DOI: 1721.1/21168.

5 articles did not cite the Registry or BioBricks sufficiently according to our demands. For one of them, we question its content relevancy. The remaining 4 show examples of simple in-text citation of the Registry, with or without a hyperlink, no mention of the specific BioBrick used, or no mention of BioBricks at all. Surprisingly, each of these papers was associated with an iGEM team.

There are 9 additional papers we found in the journal IELTS Synthetic Biology, all of which show vast variations in content, research, and citation quality. The journal seems to have been published once, and seems to have asked all 2006 iGEM finalists to submit a paper based on their research. This resulted in some sub-par articles. Because they weren't peer-reviewed and due to their strong biasing affiliation with iGEM, we disregarded them from our overall data set.

Our recommendations

Reviewing the data, we have concluded that there are standard methods of citation being used by the scientific community to refer to the Parts Registry. In order for the Registry to uphold referencing standards as well as Parts standards within synthetic biology, we think this method of referencing should be officially recommended on the Registry and iGEM website.

A clear and standard citation method would support teams who are attempting to publish and let them set an example of citation style to the rest of the scientific community. Additionally, clearly stating a standard method of citation would make citing of the Registry easier and so motivate its citation in general. Of course, more citations means more attention and adoption within the field of synthetic biology.

Teams participating in iGEM should be encouraged to cite properly and to try to publish their work. A tutorial of some kind hosted on the iGEM website would help. Such things (among many other bright ideas) were proposed in 2008 by Cowell, but haven't been implemented. We see a standardised citation method as a high priority for maximising iGEM's scientific influence.

We found that the iGEM competition is making a positive impact. The competition is growing in size and scope, and both iGEM and the Registry are netting a proportionally increasing amount of attention from the scientific community. We are doing well!

However, we also found that quite a number of discussions of iGEM miss the important connection between our iGEM competition and the Parts Registry. We recommend that the iGEM Foundation and future teams emphasise the iGEM competition's raison d'être clearly in the future.

We also noticed that some papers do not give sufficient or clear credit to the Registry. We interpret this as confusion as to how the Registry should be cited. We recommend that a standardised referencing should be introduced. This would support the publishing process for inexperienced undergraduates involved in the competition, gather greater attention for the iGEM Foundation and Parts Registry and hence further our aims of synthetic biology standardisation. ∎

"Ael 2" and "Vulpecula", 2012. h-index (Hirsch). Wikipedia. [image online] Available at: <http://en.wikipedia.org/wiki/File:H-index-en.svg> [Accessed Jul 27, 2012].

"Ael 2", 2012. Illustrated example for the g-index proposed by Egghe. Wikipedia [image online] Available at: <http://en.wikipedia.org/wiki/File:Gindex1.jpg> [Accessed Jul 27, 2012].

Cowell, M.L., 2008. Making iGEM Better. [web page] Available at <http://openwetware.org/wiki/User:Macowell/Making_iGEM_Better> [Accessed Jul 19, 2012].

Egghe, L., 2006. Theory and practise of the g-index. Scientometrics [online], Volume 69 (Issue 1), p.131-152. Available at: <www.springerlink.com/content/4119257t25h0852w/?MUD=MP> [Accessed Jun 7, 2012].

Harzing, A.W., 2007. Publish or Perish. [computer program] Available from <http://www.harzing.com/pop.htm>

Hirsch, J.E., 2005. An index to quantify an individual's scientific research output. Proceedings of the National Academy of Sciences of the United States of America, Volume 102 (Issue 46). [online] Available at: http://<www.ncbi.nlm.nih.gov/pmc/articles/PMC1283832/?tool=pmcentrez> [Accessed 5th Jul, 2012]

iGEM Foundation, 2012a. Previous iGEM Competitions. [web page] Available at: <https://igem.org/Previous_iGEM_Competitions> [Accessed Jul 30, 2012]

iGEM Foundation, 2012b. Press Kit. [web page] Available at: <https://igem.org/Press_Kit> [Accessed Aug 3, 2012]

Iselid, L., 2006. Research on citation search in Web of Science, Scopus and Google Scholar. One Entry to Research [blog] Available at: <http://oneentry.wordpress.com/2006/08/11/research-on-citation-search-in-web-of-science-scopus-and-google-scholar/> [Accessed Jun 20, 2012].

Péter J., 2006. Dubious hit counts and cuckoo's eggs. Online Information Review [online] Volume 30 (Issue 2) p.188-193. Available at: <http://www.emeraldinsight.com/journals.htm?articleid=1550726&show=abstract> [Accessed Jun 20, 2012].

Zhou Y., Liyan L. and Menghui L., 2012. Quantifying the influence of scientists and their publications: distinguishing between prestige and popularity. New Journal of Physics, [online] Volume 14 (March 2012) Available at: <http://iopscience.iop.org/1367-2630/14/3/033033/> [Accessed Jun 7, 2012].

Back to top

University of St Andrews, 2012.

Contact us: igem2012@st-andrews.ac.uk, Twitter, Facebook

This iGEM team has been funded by the MSD Scottish Life Sciences Fund. The opinions expressed by this iGEM team are those of the team members and do not necessarily represent those of Merck Sharp & Dohme Limited, nor its Affiliates.