09:30 - 11:10
PS6
Room:
Room: Club C
Panel Session 6
Charles 1, William Marx - GPT-2 Generates Biased Texts
Sebastian Block - Cross-Domain Classification Of Political Texts: Introducing A Lean And Versatile Two-Step Workflow
Alona Dolinsky - Multilingualism in Computational Text Analysis Methods—Evidence From A Pre-Registered Survey Experiment.
Hauke Licht - Going cross-lingual: A guide to multilingual text analysis
Going cross-lingual: A guide to multilingual text analysis
PS6-4
Presented by: Hauke Licht
Hauke Licht 1, Fabienne Lind 2
1 University of Zurich, Department of Political Science
2 University of Vienna , Department of Communication
Text-as-data methods have revolutionized the study of political behavior and human communication more generally, and the increasing availability of multilingual text collections promises exciting new applications of these methods in comparative research. To encourage researchers to seize these opportunities, we provide a guide to multilingual quantitative text analysis. We first discuss common challenges researchers face when dealing with multilingual text corpora, including concerns about resource requirements, limits to translatability, and the measurement of context-dependent concepts. We then discuss reasons for why it can nevertheless be worthwhile to go cross-lingual in applied research. We complement this discussion with a systematic overview of multilingual text analysis methods developed for political science and communication research, focusing on cross-lingual dictionary analysis, supervised text classification, topic modeling, and text scaling. To structure this overview, we distinguish between output alignment, bridging, and input alignment approaches to cross-lingual text analysis and thus integrate existing methods into a common framework that provides orientation in a very active field of research. We conclude with an outlook on future directions for method development and potential fields of applications.