Team:Wellesley HCI/SynBio Search
From 2012.igem.org
SynBio Search
Tool Overview
SynBio Search is an online tool that generates data sheets for over 2700 biological parts by aggregating data from various publicly available resources. It integrates and links information from various data sources, including the Registry of Standard Biological Parts, the iGEM Archive, Google Scholar, and PubMed. SynBio Search builds on the collected sources by providing a structured view that relates heterogeneous information, links back to original data sources, and allows users to customize and organize the display. It enables researchers to discover the most comprehensive view of freely available data about biological parts from a single online search. SynBio Search allows users to search by keyword (e.g. tuberculosis) or by part name.
Implementation
As a web application, SynBio Search was implemented using multiple platforms. Information is gathered by crawling through multiple databases using CGI scripts written in Ruby. Then, the search results are then outputted into JSON files. After, the files are parsed, and the information is displayed in a clear and logical format on a web page (using HTML 5.0, CSS 3.0, JavaScript, and JQuery) for the user to access.
Purpose
While there are already several search engines for synthetic biology, none are exactly what users neither want nor need. For example, many iGEM participants and biology scientists and engineers use the Registry of Standard Biological Parts to document their experience on biological parts. However, the Registry is poorly organized, information is hard to locate, and certain data are missing. Thus, the purpose of our project is to create a resource that is not only qualitatively and quantitatively comprehensive, but also more accessible to users. It collects results from multiple data sources, allowing users to view more complete information in an organized and efficient manner.
Results
We conducted a preliminary user study for SynBio Search with 14 students from a biology background. The students were given various tasks so they could explore every aspect of the web application. The users found it useful to have multiple sources listed in one place; while some students preferred to use Google Scholar and Pub Med over the other two sources because they were "more reliable sources" and "more well-known," others liked that they were able to "search between literature databases and parts/iGEM databases" all in one search engine. In addition, there was overwhelming support for the intuitiveness of the application and the ability to link to many other resources. Users suggested displaying a count of entries found by each database source and ordering the results by relevance or top hit.
Users were asked to measure the level of difficulty (1 being Very Easy to 5 being Very Difficult) for sub-tasks after the usability study
Quotes from users
- "Fusing search results from multiple sources is very useful. I did not have to retype my query."
- "count of entries found by each search might be helpful"
- "I liked that I could narrow down the searches based on database source."
- "It's intuitive to use and seems to gather all related data into one place, which is very helpful for project design purposes. I particularly like being able to filter out publications and just search for protocols/ lab notebooks."
- "I think its useful because instead of having to search in 100 different places for resources on one topic, all I had to do was look at one page."
- "I found the ability to search between literature databases and parts/ iGEM databases in one search engine very helpful. It was nice to be able to limit my search to only literature or only parts/ lab notebooks, depending on the stage of the research project I might be in. I also liked how hyperlinked everything was - I could navigate from the datasheet to the lab notebooks describing that part, or to publications that might be related. It was very nice to have everything laid out in one place."
Demo Video
Future Work
- Integrate more databases such as: Clotho, SynBERC, Based on user feedback.
- Include more information in the data sheet for each part.
- Increasing the search engine capabilities by including options for customizable search, such as boolean operators (AND, OR).
- Iterate on the design based on user feedback.
- Incorporate a sophisticated feedback and experience feature.