Introduction
-
Overview of uses of crowdsourcing
-
Non-language uses of crowdsourcing
-
types of problems (labels, text, speech, people...)
</td>
|
</tr>
Taxonomy of crowdsourcing and human computation
-
Categorization system: motivation, quality control, aggregation, human skill, process flow.
</td>
|
</tr>
Crowdsourcing platforms: Mechanical Turk and CrowdFlower
-
Terminology and mechanics: Turkers, Requesters, HITs, micropayments
-
Demographics and motivation of Mechanical Turk workers
</td>
|
</tr>
How to set up and run experiment
-
Designing and running experiments (step-by-step overview)
-
Example of quickly using MTurk via the web interface
-
Accessing MTurk via the boto API and the CrowdFlower API that People Pattern created
</td>
|
</tr>
Ethics of Crowdsourcing
-
Fair wages?
-
Privacy/IRB considerations
</td>
|
</tr>
Quality Control
-
MTUrk's Reputation system and qualifications
-
Aggregation and redundancy
-
Embedded gold standard data
-
Second-pass reviewing
-
Economic incentives
-
Statistical models
-
Should we do quality control if we are training a statistical model?
</td>
|
</tr>
Statistical analysis of MTurk data
-
Quality versus quantity
-
Active data collection
-
Accounting for worker variation
</td>
|
</tr>
Case studies in NLP: Chris Callison-Burch
-
Crowdsourcing Translation
-
Using crowdsourcing to collect data to train SMT systems
</td>
-
Crowdsourcing Translation: Professional Quality from Non-Professionals
by Omar Zaidan and Chris Callison-Burch
-
The Language Demographics of Amazon Mechanical Turk
by Ellie Pavlick, Matt Post, Ann Irvine, Dmitry Kachaev, and Chris Callison-Burch
-
Machine Translation of Arabic Dialects
by Rabih Zbib, Erika Malchiodi, Jacob Devlin, David Stallard, Spyros Matsoukas, Richard Schwartz, John Makhoul, Omar F. Zaidan, and Chris Callison-Burch
|
</tr>
Case studies in NLP: Lyle Ungar
-
Word Sense Disambiguation
-
Computational Social Science at the World Well-Being Project (WWBP)
</td>
-
World Well-Being Project (WWBP)
-
New Insights from Coarse Word Sense Disambiguation in the Crowd
by Adam Kapelner, Krishna Kaliannan, H. Andrew Schwartz, Lyle Ungar and Dean Foster
-
Personality, Gender, and Age in the Language of Social Media: The Open-Vocabulary Approach
by Schwartz, H. A., Eichstaedt, J. C., Kern, M. L., Dziurzynski, L., Ramones, S. M., Agrawal, M., Shah, A., Kosinski, M., Stillwell, D., Seligman, M. E., & Ungar, L. H
-
Psychological Language on Twitter Predicts County-Level Heart Disease Mortality
by Eichstaedt, J. C., Schwartz, H. A., Kern, M. L., Park, G., Labarthe, D. R., Merchant, R. M., Jha, S., Agrawal, M., Dziurzynski, L. A., Sap, M., Weeg, C., Larson, E. E., Ungar, L. H., & Seligman, M. E.
|
</tr>
Wrap up
|
|
</tbody>
</table>
| | | | | | | | |