A quick round-up of possibilities for open science: research, equipment, education, data sharing, publication, funding.
Table of Contents
Open Science
Over the past few weeks I’ve been intensively researching how best to contribute to the fields of science that interest me. In the process I’ve read a lot about the problems and benefits of academic science.
Academic science unites training, knowledge dissemination, funding, and equipment. Most of the world’s advances in science occur in academia, and it remains a good way to get into science.
On the other hand, several folks who have been there and done that have told me there are more efficient ways to get educated and begin research. More broadly, and concerningly, institutional scientific practice suffers from poor data practices that lead to inaccurate results - exhibit 1, Ioannidis’ conclusions about medical results, exhibit 2, the social science reproducibility crisis.
If, like me, you get frustrated when you read about negative results that disappear due to the “file-drawer” effect, about data hidden by frightened, lazy, or greedy researchers, about papers behind paywalls, about inefficient scientific education, or what have you, take heart.
The tools for a completely open path to research, equipment, education, data management and analysis, funding, and publication already exist! While I’ll admit I still worry about scaling up research (surely a big academic lab is going to be hard to beat if you want your research to mean something), this is a very encouraging discovery.
A lot of other people, senior and junior, are thinking about these problems too. There is a broad coalition already in place. All we have to do is join them.
Research
This one has recently become much easier. LibGen is a huge repository of the bulk of scientific research ever published. If you can’t find it elsewhere, you can often find it there. The sister project is Sci-Hub, which uses members’ library access to find results, and then donates a copy to LibGen.
Equipment: How to Build a Synbio Lab
To get started, take a look at DIYbio’s list, The Quest for the $500 Home Molecular Biology Lab, Hackpad’s list. OpenWetWare has another good list, and here’s a slightly older article on the same quest.
In addition, here are two collections of open-source tools.
One of the links on OpenWetWare disrecommends working with live animals for awhile, as the data will be too noisy.
Education
I’m most excited about Synthetic Biology: A Primer, and Synthetic Biology: A Lab Manual. I’ll also look at Draft Primer for Synthetic Biology and Principles of Synthetic Biology.
You can also look around for good synbio starter projects, and - especially - can pick a published result and try to replicate it.
In addition to those and standard molecular biology textbooks, I’ve become convinced that a high level of statistics knowledge is essential to do anything worthwhile.
Preregister experiments on the blockchain?
Suppose an independent or beginner researcher wants to practice open science. Open data is easy - as Jeffrey Rouder shows, a shell script with git makes automatically backing up data easy. Proper analysis is harder, but again the math and techniques can’t be that hard to figure out.
But how about pre-registration? Using a blockchain to pre-register your experiments would be the simplest and most flexible approach, as it doesn’t require a journal or any institution. You just register whatever experiment you design and go on your merry way, secure in the knowledge that you’ve publicly committed yourself to your experiment. No one can say you changed horses mid-course.
I didn’t find any website specifically devoted to pre-registering experiments, though it will probably be easy just to upload a document to existing blockchain websites.
These websites hash - encode - the contents of the file in such a way that no one can read it, but preserves the identity of your document. In this way, a blockchain proves your document (for instance, your experiment design) existed at that point in time.
I did find these two articles on using the blockchain to store all sorts of data. They seemed more speculative than just using the blockchain to pre-register, but interesting nonetheless.
Data management and sharing for ongoing projects
Now suppose I’ve open-sourced my data, workflow, techniques, and analysis. I publish to an open-access online journal, I’m commenting on my own work and reviewing others’ work on my blog or some other platform. Congratulations, I’m now Doing Science Openly.
However, a million atomized researchers all working in perfect openness have solved the larger half of the problem of making science work faster - transparency, reproducibility in principle - yet we now have a search problem. How do you efficiently find other work, and how do others find (and credit you for!) your work?
Dryad, Figshare, Dataverse Project, and Open Science Framework are four pieces of that puzzle. Figshare and Dataverse function like GitHub in that they are repositories for open data, and also say they assist with publication venues.
OSF appears to be related to The Center for Open Science. Open Science Framework deserves a section of its own really: it’s a full-fledged scientific project management platform, which assists in finding and citing other projects. In fact, while I delved into the blockchain above, OSF boasts undeletable preregistration, in addition to its host of integrations (Box, Drobbox, GitHub, Amazon S3), wiki, privacy settings, citation assistance, and more, all for free!
Open access, post-pub peer review
PLoS, BiorXiv and rXiv are obvious examples. eLife and PeerJ look like wonderful venues, and Peerage of Science does free peer reviews and publishing (!).
In addition, The Winnower and Hypothesis look like good ways to collect and annotate existing research.
Funding
Walacea and Experiment.com both look like excellent sources of crowd-funding, Kickstarter-style. There is an art to running a successful crowdfunding campaign, just as there’s an art to a successful grant application. Maybe the first can lead to the second: maybe some successful crowd-funded experiments can give you the credentials to land a real grant via Instrumentl.
A hypothetical work process
Having run through all of that, let me throw out a hypothetical path to getting a small yet serious lab up and running.
First, spend a year or so with the synthetic biology Primer and Lab Manual, and other educational projects. In this way you’ll learn lab techniques, a lot of science, and maybe even start looking for a project that no one has done yet - original research.
Second, as you self-educate, build open data practices into your workflow. Pre-register, back up your data and techniques and workflow publicly, make your analysis open.
Third, when you have an original line of research you’d like to pursue, use Experiment.com to get funding.
Fourth, when you have a result worth publishing, use OSF, eLife, BiorXiv, and so on to publish first. Then look for another open-access venue in which to publish.
When you’re published, or when someone cites your work, break open a bottle of champagne! You’re a real scientist.