Search on EES

Date/Time
Date(s) - 11/07/2023
0:00

Categories


AERA

Date: Tuesday, 11th July 2023
Instructor: Manuel S. Gonzalez Canche, University of Pennsylvania

Description:
Labeling or classifying textual data is an expensive and consequential challenge for Mixed Methods and Qualitative researchers. The rigor and consistency behind the construction of these labels may ultimately shape research findings and conclusions. A methodological conundrum to address this challenge is the need for human reasoning for classification that leads to deeper and more nuanced understandings, but at the same time manual human classification comes with the well-documented increase in classification inconsistencies and errors, particularly when dealing with vast amounts of texts and teams of coders.

This course offers an analytic framework designed to leverage the power of machine learning to classify textual data while also leveraging the importance of human reasoning in this classification process. This framework was designed to mirror as close as possible the line-by-line coding employed in manual code identification, but relying instead on latent Dirichlet allocation, text mining, MCMC, Gibbs sampling and advanced data retrieval and visualization. A set of analytic output provides complete transparency of the classification process and aids to recreate the contextualized meanings embedded in the original texts.

Further Information