Summer 2024 Political Science BC3730 section 002

Data Science for Politics

DATA SCIENCE FOR POLITICS

Call Number 00054
Day & Time
Location
MW 1:00pm-4:10pm
To be announced
Points 3
Grading Mode Standard
Approvals Required None
Instructor Michael Miller
Type LECTURE
Course Description

This course explores techniques to harness the power of ``big data'' to answer questions related to political science and/or American politics. Students will learn how to use R---a popular open-source programming language---to obtain, clean, analyze, and visualize data. No previous knowledge of R is required.

We will focus on applied problems using real data wherever possible, using R's ``Tidyverse.'' In total, in this course we will cover concepts such as reading data in various formats (including ``cracking'' atypical government data sources and pdf documents); web scraping; data joins; data manipulation and cleaning (including string variables and regular expressions); data mining; making effective data visualizations; using data to make informed prediction, and basic text analysis. We will also cover programming basics including writing functions and loops in R. Finally, we will discuss how to use R Markdown to communicate our results effectively to outside audiences. Class sessions are applied in nature, and our exercises are designed around practical problems: Predicting election outcomes, determining the author of anonymous texts, and cleaning up messy government data so we can use it. 

Web Site Vergil
Subterm 05/20-06/28 (A)
Department BARNARD SUMMER PROGRAMS
Enrollment 1 student (10 max) as of 11:44PM Monday, June 16, 2025
Subject Political Science
Number BC3730
Section 002
Division Barnard College
Open To Schools of the Arts, Business, Columbia College, Engineering:Undergraduate, Engineering:Graduate, GSAS, General Studies, SIPA, Professional Studies, Social Work
Campus Barnard College
Note All Columbia students must register for Section 002
Section key 20242POLS3730X002