Here’s an example write-up from a past class that demonstrates the kind of high-quality report we’re looking for.
One of the awesome things about crowdsourcing is that it lets us quickly run scientific experiments. You’re going to practice doing this by reading an academic paper that uses crowdsourcing and trying to replicate its findings. Good science should be repeatable, so you’re helping science by replicating someone else’s experiments. It’s also good for your own understanding of crowdsourcing and academic research. Did you know that your professors secretly spend most of their time doing research? It’s true. If you think that their lectures sometimes seem underprepared, it’s likely because they’re in their lab, bringing down lightning from the gods and cackling like Dr. Frankenstein until their calendar app reminds them to go to the lecture hall.
For this assignment, you can work in teams of 4-6 people. Your final project will require that you work in groups, so this is a good chance for you to form a group and test out whether you want to work with those people.
You have five options for academic papers that you can replicate:
I have sorted the papers roughly in order of how difficult they will be to replicate. Because the difficulty level varies, and because I don’t want to read 50 demographic studies of Mechanical Turk, I’m going to award different maximum point values to them based on their difficulty. You can choose whichever one you want to work on.
Here are the steps and deliverables for this project:
You should begin this project by reading through 3 of the academic papers. The goals of this step are for you to get a sense of what constitutes interesting research on crowdsourcing, and for you to see how academic papers describe their experimental designs and how they present their results. While you are reading the papers, you should try to estimate how long it will take you to replicate one or more of the experiments that were presented in each paper.
After you’ve read the papers, you should write a 1-2 paragraph summary of what they were about and what their main findings were. The summary of the paper should be written in your own words, and not copied and pasted from the paper itself.
Your team should meet and come to a consensus about which paper you want to reproduce. You should pick a paper that you find interesting, and that you will be able to re-create in less than two weeks. It is not necessary to re-create every finding in a paper; it’s fine to pick one or more of the experiments.
For this part of the homework assignment, you should write a complete description of the experimental methodology described in the paper, and what its findings were. For the experimental methodology you should:
Write up the experimental methodology in your final report. Once again, you should use your own words. If you want to reproduce a figure or a table from the paper, that is fine, so long as you attribute it.
Pick one or more of the experiments that you want to replicate. If the paper described a set of experiments, then you can pick one of them. For instance, Financial Incentives and the Performance of Crowds performs two studies: one where they have workers organize traffic cam photos chronologically and another where they have workers perform word puzzles. Similarly, Exploring Iterative and Parallel Human Computation Processes had 3 studies: one involving writing image descriptions, one on brainstorming, and one on blurry image recognition. If you are reproducing either of those papers, you only need to pick one of those studies.
When you create your own version of the paper’s experiment, it’s fine to deviate somewhat from the paper’s design. For instance, if the crowd was shown a particular set of images and asked to write captions or label the images, it might not be possible for you to get exactly the same set of images. It’s fine to choose your own. When you deviate from the setup of the original experiment, you should note why. You should also briefly explain if you think it might result in a different outcome than the findings of the original paper.
In this step, you should collect any materials that you’ll be presenting to workers (images, prompts, survey questions, etc). You should save these in a directory and write a README describing them. You’ll submit them along with your final writeup.
Create a task on Mechanical Turk that you’ll use to perform the experiment. You should write clear instructions on what you’d like the workers to do (and how they will be rewarded, if that’s relevant to the experiment).
Please save your instructions, and take a screenshot of your task design and/or submit an HTML template for it. If you write any code for it (like a JavaScript alert for the Crowds in Two Seconds paper), then also include that code and a README describing what it does.
Decide on an appropriate reward amount, and an appropriate number of crowd workers that you need to hire to complete your experiment, and then run it. You may consider starting with a small-scale pilot version of your experiment to make sure that everything is working properly, and that your instructions are written clearly enough for workers to understand.
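Before you launch, it helps to work out what the pilot and the full run will cost. Here is a minimal sketch of that arithmetic. The function name, the example numbers, and especially the default `fee_percent` are assumptions for illustration; check the current Mechanical Turk pricing for the commission that actually applies to your task.

```python
def estimate_cost_cents(reward_cents, num_items, workers_per_item, fee_percent=20):
    """Estimate the cost of a batch, in whole cents.

    fee_percent is an ASSUMED platform commission, not an official figure.
    Look up the current Mechanical Turk pricing before you rely on it.
    """
    # One assignment = one worker completing one item.
    assignments = num_items * workers_per_item
    worker_pay = reward_cents * assignments
    fees = round(worker_pay * fee_percent / 100)
    return worker_pay, fees, worker_pay + fees


# Hypothetical numbers: a 10-item pilot, then a 100-item full run,
# each item answered by 3 workers at 5 cents per assignment.
pilot = estimate_cost_cents(reward_cents=5, num_items=10, workers_per_item=3)
full = estimate_cost_cents(reward_cents=5, num_items=100, workers_per_item=3)
print("pilot total (cents):", pilot[2])  # 180
print("full total (cents):", full[2])    # 1800
```

Working in whole cents avoids floating-point rounding surprises when you add up many small rewards.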
In your final write up on your experiment, you should describe how many workers you hired, and how much you paid them. If you placed restrictions on who can participate (based on their country or their past approval rate, etc), then document that too. If you had to remove any workers for giving spammy results, describe what criteria you used to select whom to exclude from your experiment.
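If you do exclude workers, write the criteria down as code so that they are applied the same way to everyone and are easy to report. The sketch below shows one possible approach, not a required one. It assumes you planted a few items with known ("gold") answers and recorded how long each response took; the field names, the function name, and both thresholds are hypothetical, so pick values that make sense for your task.

```python
import statistics
from collections import defaultdict


def find_workers_to_exclude(responses, gold_answers, min_accuracy=0.5, min_seconds=5.0):
    """Return {worker_id: [reasons]} for workers who fail either check.

    responses: list of dicts with keys worker_id, item_id, answer, seconds
    gold_answers: dict mapping item_id -> known correct answer
    Both thresholds are illustrative assumptions.
    """
    gold_seen = defaultdict(int)
    gold_correct = defaultdict(int)
    times = defaultdict(list)

    for r in responses:
        worker = r["worker_id"]
        times[worker].append(r["seconds"])
        if r["item_id"] in gold_answers:
            gold_seen[worker] += 1
            if r["answer"] == gold_answers[r["item_id"]]:
                gold_correct[worker] += 1

    excluded = {}
    for worker, secs in times.items():
        reasons = []
        # Only judge accuracy for workers who actually saw a gold item.
        if gold_seen[worker] and gold_correct[worker] / gold_seen[worker] < min_accuracy:
            reasons.append("low gold accuracy")
        if statistics.median(secs) < min_seconds:
            reasons.append("too fast")
        if reasons:
            excluded[worker] = reasons
    return excluded
```

Returning the reasons alongside each worker ID makes it straightforward to describe in your report who was excluded and why.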
Analyze the results of your experiment. Try to perform the same analysis that the original paper did. You should also present your results in a similar format (for instance, a similar style of graph). Are your high-level findings the same as the original paper or different? If they are different, how so? What do you attribute the differences to?
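A common first step in this kind of analysis is to summarize each experimental condition before you plot or compare them. The sketch below assumes your cleaned results are a list of (condition, score) pairs; that layout, the function name, and the choice of mean and standard error are illustrative assumptions. Use whatever statistics the paper you are replicating actually reports.

```python
import math
import statistics
from collections import defaultdict


def summarize_by_condition(rows):
    """Return {condition: (n, mean, standard_error)} for (condition, score) rows."""
    groups = defaultdict(list)
    for condition, score in rows:
        groups[condition].append(score)

    summary = {}
    for condition, scores in groups.items():
        n = len(scores)
        mean = statistics.mean(scores)
        # The sample standard deviation needs at least two observations.
        if n > 1:
            sem = statistics.stdev(scores) / math.sqrt(n)
        else:
            sem = float("nan")
        summary[condition] = (n, mean, sem)
    return summary


# Placeholder values only, not real experimental data.
example_rows = [("low_pay", 1), ("low_pay", 3), ("high_pay", 2), ("high_pay", 4), ("high_pay", 6)]
for condition, (n, mean, sem) in summarize_by_condition(example_rows).items():
    print(f"{condition}: n={n}, mean={mean:.2f}, sem={sem:.2f}")
```

The per-condition means and standard errors can go straight into a bar chart with error bars, which makes it easier to set your results next to the original paper’s figures.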
We had many excellent reports written the last time this class was offered. Here is one example submission that demonstrates the kind of high-quality report we’re looking for. It replicates the Financial Incentives and the Performance of Crowds paper.
Write a final report. Your final report should be submitted as a PDF, and should be approximately 3,500 words long (~5 pages excluding figures and appendix). The maximum length is 3,750 words. If you need more than that, you’re welcome to include an appendix containing additional figures or analysis, but your main document should be constructed so that it is readable as a standalone report.
Your paper should include the following information:
You will submit your final report and your other materials via Gradescope. Your report should be titled report.pdf and your other deliverables should be in a zip file titled deliverables.
This assignment is worth 5 points of your overall grade in the course. The rubric for the writeup is given below.
and contrast the two.