Assignments

Assignment titles that are clickable are either due, or contain some boilerplate information that will be expanded upon. Non-clickable assignments are in flux. When finalize, they will be announced in class well-in-advance of their due dates. But you can anticipate them for now.

Assignments are also listed, in brief, in the Schedule/Syllabus

Week 1
Write a FOIA letter to send the FBI, requesting the files they've gathered for a well-known deceased person,
Thursday
Sep 29, 1:30 PM
5
Let's build our own dataset of Google self-driving accidents, even if it means reading each of their monthly reports and manually recording the accident information into a spreadsheet.
Thursday
Sep 29, 1:30 PM
10
Week 2
A little warmup of how to use functions and pivot tables in Google Sheets, and an introduction to evictions data.
Tuesday
Oct 4, 12:00 AM
5
Create a spreadsheet that compares datapoints from five non-profits across a single year.
Tuesday
Oct 4, 12:00 AM
10
For an investigative story that you've read, use LexisNexis to do 5 queries, to find 5 more stories of at least 500+ words, from 5 different years and publications that have insights/context not found in the investigative story you read.
Thursday
Oct 6, 1:30 PM
5
Read an investigative story, then document all of its sources with a spreadsheet. You should get an idea of how many people and organizations were contacted, as well as the document trail that was followed.
Thursday
Oct 6, 1:30 PM
10
Quiz: Quiz 01
Quiz covering the first 2 chapters of "Homicide" and "Art of Access". And some spreadsheets.
Thursday
Oct 6, 1:30 PM
5
Week 3
Browse MuckRock. Document 5 successful and 5 rejected public records requests.
Thursday
Oct 13, 12:00 AM
10
Week 4
Week 5
Quiz: SQL Basics
A review of basic syntax for SELECT statements, from FROM to some aggregations with GROUP BY
Tuesday
Oct 25, 1:50 PM
10
Log into Lexis-Nexis again and find 10 stories from the past, 5 each focusing specifically on Rep. Mike Honda and his challenger for the CA-17, Ro Khanna. From five of the 10 stories, come up with a question that you think is worth asking.
Thursday
Oct 27, 1:30 PM
5
Week 6
Quiz covering chapters 3, 4, 5 of "Homicide" and "Art of Access". And some SQL.
Tuesday
Nov 1, 1:30 PM
5
Exercises from last year
Tuesday
Nov 1, 1:30 PM
0
Just some exercises covering SQL up to GROUP BY, and to familiarize you with earthquake data.
Tuesday
Nov 1, 1:30 PM
5
Ryan Shapiro is a Ph.D. candidate at MIT and a research affiliate at the Berkman Center for Internet & Society at Harvard University. He is an historian of national security specializing in governmental transparency and the policing of dissent. Politico has referred to Shapiro as “a FOIA guru at the Massachusetts Institute of Technology”, while the FBI has declared Shapiro’s FOIA research methodologies themselves to be a threat to national security. Shapiro will speak to us about his research and journey from animal rights activist to FOIA-powered-scholar and transparency activist.
Thursday
Nov 3, 9:00 AM
5
This is a walkthrough as exercise, trying out some SQL JOINs, and slight practice on data-wrangling outside of the database.
Thursday
Nov 3, 1:30 PM
5
Week 7
Today is Election Day. Learn new SQL statement, compete with your classmates, and win the prediction pool by copying Nate Silver.
Tuesday
Nov 8, 5:00 PM
5
(Reviewing) An in-class midterm evaluating your understanding and proficiency of Structured Query Language and relational database concepts. If you've already taken the midterm, here are the answers.
Thursday
Nov 10, 2:50 PM
33
Week 8
Some readings, a quick signup, stuff to prep for the last 2 weeks of data journalism work. No points, but please do the work or face a very difficult quiz after Thanksgiving...
Thursday
Nov 17, 1:30PM
0
Week 9
Week 10
Week 11
Final quiz covering the rest of "Homicide" and "Art of Access".
Tuesday
Dec 6, 1:30 PM
20
Get ready for a New Year and a new President with 10 public records requests (FOIA, state, and local)
Thursday
Dec 8, 1:30PM
50
Week 12
Alone, or with a partner, find a significant dataset or (several) that you want to dig into deeper. Use spreadsheets, SQL, Carto -- whatever gets the job done.
Thursday
Dec 15, 11:59 PM
100

Grading Policies

Rubric
Homework 10%
Quizzes 10%
Projects 30%
Exams 50%
Grading Scale
A 90+%
B 80+%
C 70+%
D 60+%

Pickiness

For assignments that require prose, I will not be penalizing students for failing to follow AP Style or even perfect grammar and spelling (within reason).

However, in the spirit of database code parsers, students will be expected to follow naming and other technical requirements to the exact letter. For example, if an assignment requires you to create a spreadsheet named foia-log, naming it foialog or Foia_log will result in a penalty. Get used to caring about the details when it comes to computational methods.

Late Assignments

Each assignment has a cutoff time. For the most part, I won't be checking things right at the crack of midnight, or the minute before class. However, assignments that are noticeably late, as in, a day or more later, will lose 10% of their maximum point value. After a week, late assignments cannot be turned in.

Attendance

You are allowed to drop one pop-quiz (but must turn in homework beforehand) if you give 3 days notice. In extreme situations, we can consider makeup work.

Why This Work?

The intent behind this coursework is to better acclimate journalism students, who have a wide range of technical and mathematical backgrounds, to the rigorous mindset needed to learn database programming and efficiently collect, wrangle, and analyze data. The lowest common denominator is to just provide as much practice as possible.

The other factor is that among the other required core classes for Stanford journalism students, Public Affairs Data Journalism I is by far the easiest and most predictable. I understand that students, at a moment's notice, will have to miss class to rush to a scene of a news event. All I ask is that you cram in the coursework on your own time and pace. Unfortunately, students without much technical experience assume they can cram programming knowledge like they can churn out pages for a term paper.

So the quantity of coursework is meant to encourage students to approach it at a regular and frequent pace, like practicing any other skill.

  • The homework is meant to be easy and somewhat repetitive, even to the point where the answers will be mostly given. The expectation is that you practice on your own until you can do it on your own, not just to get something turned in.
  • That quizzes at the beginning of class help enforce attendance and serve as a benchmark for how well students can recall the concepts covered in homework.
  • The midterm is almost entirely focused on database programming. And it is, for all intents and purposes, a final exam. If you've done the work so far and paid attention, it won't be difficult. If, however, you think you can skate on the homework and quizzes and cram the concepts, you'll learn in brutal fashion that programming is not something to be crammed in. Don't learn this the hard way.
  • For students who've passed the midterm, the project work will feel routine. That's the point. Learning the programming means learning how to clearly think about data in its most fundamental form.