2015 Spring - Law 217

Location

Boalt Hall, room 134

Start Date

27-4-2015 12:15 PM

End Date

27-4-2015 1:55 PM

Description

Abstract:

Social scientists are embracing the idea of using `text as data’ as a way to quantify and evaluate social theories. I’ll discuss a brief history of how this strategy has worked and evolved, and pitch some new approaches for combining social measurement with state-of-the-art natural language processing. We'll focus on the massive multinomial regression models that serve as a basis for text analysis and the distributed computing strategies that allow inference on truly Big Data. I'll then work through a number of examples of social science questions being asked and answered via statistical NLP, with data from online reviews on Yelp, the US congressional record, and communications between buyers and sellers on eBay.

Comments

Speaker Profile: Matt Taddy

Share

COinS
 
Apr 27th, 12:15 PM Apr 27th, 1:55 PM

Measuring Rhetoric: Statistical Language Models in Social Science

Boalt Hall, room 134

Abstract:

Social scientists are embracing the idea of using `text as data’ as a way to quantify and evaluate social theories. I’ll discuss a brief history of how this strategy has worked and evolved, and pitch some new approaches for combining social measurement with state-of-the-art natural language processing. We'll focus on the massive multinomial regression models that serve as a basis for text analysis and the distributed computing strategies that allow inference on truly Big Data. I'll then work through a number of examples of social science questions being asked and answered via statistical NLP, with data from online reviews on Yelp, the US congressional record, and communications between buyers and sellers on eBay.