
Crowd-sourced text analysis: reproducible and agile production of political data

Benoit, Kenneth, Conway, Drew, Lauderdale, Benjamin E., Laver, Michael and Mikhaylov, Slava (2016) Crowd-sourced text analysis: reproducible and agile production of political data. American Political Science Review, 110 (2). pp. 278-295. ISSN 0003-0554

PDF - Accepted Version

Abstract

Empirical social science often relies on data that are not observed in the field, but are transformed into quantitative variables by expert researchers who analyze and interpret qualitative raw sources. While generally considered the most valid way to produce data, this expert-driven process is inherently difficult to replicate or to assess on grounds of reliability. Using crowd-sourcing to distribute text for reading and interpretation by massive numbers of non-experts, we generate results comparable to those using experts to read and interpret the same texts, but do so far more quickly and flexibly. Crucially, the data we collect can be reproduced and extended transparently, making crowd-sourced datasets intrinsically reproducible. This focuses researchers’ attention on the fundamental scientific objective of specifying reliable and replicable methods for collecting the data needed, rather than on the content of any particular dataset. We also show that our approach works straightforwardly with different types of political text, written in different languages. While findings reported here concern text analysis, they have far-reaching implications for expert-generated data in the social sciences.

Item Type: Article
Official URL: http://journals.cambridge.org/action/displayJourna...
Additional Information: © 2015 The Authors
Library of Congress subject classification: H Social Sciences > H Social Sciences (General)
J Political Science > JA Political science (General)
Sets: Departments > Methodology
Project and Funder Information:
Project ID: 2011-StG 283794-QUANTESS
Funder Name: European Research Council
Funder ID: http://dx.doi.org/10.13039/501100000781
Projects: 2011-StG 283794-QUANTESS
Funders: European Research Council
Date Deposited: 08 Jun 2015 16:02
URL: http://eprints.lse.ac.uk/62242/
