Spring 2026 Data Science for Policy IA7140 section 001

Text as Data

Call Number 10284
Day & Time
Location
R 10:00am-11:50am
To be announced
Points 3
Grading Mode Standard
Approvals Required None
Instructor Tamar Mitts
Type SEMINAR
Method of Instruction In-Person
Course Description

Pre-req: SIPA IA6500 - Quant I, and prior experience with R are required. Instructor permission required. Join the waitlist in Vergil to request registration. 

This course introduces students to the quantitative analysis of text, an increasingly important method in the social sciences and public policy. With vast amounts of textual information now available from sources such as social media, news articles, political speeches, and government documents, the ability to analyze text systematically is essential. Students will learn how to collect, process, and analyze text data to answer meaningful research questions.

The course covers a range of methods including dictionary-based approaches, supervised classification, topic modeling, word embeddings, and emerging applications of Large Language Models. Emphasis is placed on practical application through hands-on exercises using the R programming language. By the end of the semester, students will develop an original research project using text as data.

Web Site Vergil
Department Data Science for Policy
Enrollment 25 students (25 max) as of 12:06PM Friday, November 28, 2025
Status Full
Subject Data Science for Policy
Number IA7140
Section 001
Division School of International and Public Affairs
Open To Architecture, Schools of the Arts, Business, Engineering:Graduate, GSAS, SIPA, Journalism, Law, Public Health, Professional Studies, Social Work
Section key 20261DSPC7140U001