| Call Number | 00054 |
|---|---|
| Day & Time Location |
MW 1:00pm-4:10pm To be announced |
| Points | 3 |
| Grading Mode | Standard |
| Approvals Required | None |
| Instructor | Michael Miller |
| Type | LECTURE |
| Course Description | This course explores techniques to harness the power of ``big data'' to answer questions related to political science and/or American politics. Students will learn how to use R---a popular open-source programming language---to obtain, clean, analyze, and visualize data. No previous knowledge of R is required. We will focus on applied problems using real data wherever possible, using R's ``Tidyverse.'' In total, in this course we will cover concepts such as reading data in various formats (including ``cracking'' atypical government data sources and pdf documents); web scraping; data joins; data manipulation and cleaning (including string variables and regular expressions); data mining; making effective data visualizations; using data to make informed prediction, and basic text analysis. We will also cover programming basics including writing functions and loops in R. Finally, we will discuss how to use R Markdown to communicate our results effectively to outside audiences. Class sessions are applied in nature, and our exercises are designed around practical problems: Predicting election outcomes, determining the author of anonymous texts, and cleaning up messy government data so we can use it. |
| Web Site | Vergil |
| Subterm | 05/20-06/28 (A) |
| Department | BARNARD SUMMER PROGRAMS |
| Enrollment | 1 student (10 max) as of 5:05PM Saturday, December 13, 2025 |
| Subject | Political Science |
| Number | BC3730 |
| Section | 002 |
| Division | Barnard College |
| Open To | Schools of the Arts, Business, Columbia College, Engineering:Undergraduate, Engineering:Graduate, GSAS, General Studies, SIPA, Professional Studies, Social Work |
| Campus | Barnard College |
| Note | All Columbia students must register for Section 002 |
| Section key | 20242POLS3730X002 |