Spring 2026 Data Science for Policy IA7140 section 001

Text as Data

Call Number 10284
Points 3
Grading Mode Standard
Approvals Required None
Instructor Tamar Mitts
Type SEMINAR
Method of Instruction In-Person
Course Description

Pre-req: SIPA IA6500 - Quant I, and prior experience with R are required. This course introduces students to the quantitative analysis of text, an increasingly important method in the social sciences and public policy. With vast amounts of textual information now available from sources such as social media, news articles, political speeches, and government documents, the ability to analyze text systematically is essential. Students will learn how to collect, process, and analyze text data to answer meaningful research questions.

The course covers a range of methods including dictionary-based approaches, supervised classification, topic modeling, word embeddings, and emerging applications of Large Language Models. Emphasis is placed on practical application through hands-on exercises using the R programming language. By the end of the semester, students will develop an original research project using text as data.

Web Site Vergil
Department Data Science for Policy
Enrollment 0 students (25 max) as of 5:06PM Monday, August 11, 2025
Subject Data Science for Policy
Number IA7140
Section 001
Division School of International and Public Affairs
Open To SIPA
Section key 20261DSPC7140U001