Thursdays 2:40-3:55pm
w/ Professor Dennis Yi Tenen
Office Hours M&W 4-5pm
Philosophy Hall, 408e
email dt2406
Readings: Moretti, Franco, and Dominique Pestre. "Bankspeak." New Left Review 92 (2015): 75–99.
Ideas: Close and distant reading
Lab Assignment: Weasel Words I. Due Monday, April 6 by midnight.
Readings: "Facing the Language Challenge" from Natural Language Processing with Python by Steven Bird, Ewan Klein and Edward Loper.
Ideas: files and folders, projects, home directory, plain text and binary formats, bits and bytes, form and content, pipes, lines and words, destructive vs. non-destructive transformations (data munging), dataflow programming, bag of words, stop words
Method & Tools: Natural Language Processing, command line, basic unix utilities
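A minimal sketch of two of the ideas above, bag of words and stop words, written in plain Python rather than with the unix utilities used in class. The stop-word list and the sample sentence are made up for illustration; real stop-word lists are much longer.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list (an assumption for this sketch).
STOP_WORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "it"}

def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z']+", text.lower())

def bag_of_words(text, stop_words=STOP_WORDS):
    """Count word occurrences, ignoring word order and dropping stop words."""
    return Counter(t for t in tokenize(text) if t not in stop_words)

# Invented sample sentence.
sample = "The bank of the river and the bank in the city."
bag = bag_of_words(sample)
print(bag.most_common(3))
```

The transformation is non-destructive: `sample` is left untouched and the counts live in a new object, so the original text can always be processed again in a different way.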
Readings: Mosteller, Frederick, and David L. Wallace. "Inference in an Authorship Problem." Journal of the American Statistical Association 58.302 (1963): 275–309.
Ideas: stem, lemma, bag of words, n-gram, frequency, collocation, model
Method & Tools: stylistics, iPython, NLTK
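The n-gram and frequency ideas above can be sketched in a few lines of plain Python. This is an illustration only, not the NLTK workflow used in the lab; the function name `ngrams` and the sample phrase are choices made for this sketch.

```python
from collections import Counter

def ngrams(tokens, n):
    """Return every run of n consecutive tokens, in order of appearance."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

tokens = "to be or not to be".split()

# Bigrams are n-grams with n = 2; trigrams have n = 3.
bigrams = ngrams(tokens, 2)
trigrams = ngrams(tokens, 3)

# Frequency: how often each bigram occurs in the token list.
bigram_freq = Counter(bigrams)
print(bigram_freq.most_common(1))
```

Bigrams that recur far more often than chance would predict are candidates for collocations; a proper collocation measure would also compare against the frequencies of the individual words.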
Lab Assignment: Weasel Words II. Due Monday, April 20 by midnight. Start on Automatic Essay Grader, due Monday, April 27 by midnight.
Reading:
- Evans, Courtney, and Ben Jasnow. "Mapping Homer’s Catalogue of Ships." Literary and Linguistic Computing 29.3 (2014): 317–325. Web. 24 Mar. 2015.
- Moretti, Franco. “Graphs, Maps, Trees - 2.” New Left Review 26 (2004): 79–103.
Ideas: more on stems, lemmas, n-grams, bag of words, named entities
Method: named-entity recognition
Lab Assignment: Go through the lecture notes for this week. Make sure you understand each step. A small homework: download "Around the World in 80 Days" by Jules Verne. Extract place names. Submit a .csv file along with your code.
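One possible starting point for the homework, assuming a simple gazetteer lookup rather than the named-entity recognition covered in the lecture notes. The `GAZETTEER` set, the function names, and the sample sentence are all hypothetical; a trained recognizer would find places that are not on any prepared list.

```python
import csv
import io
import re
from collections import Counter

# Hypothetical gazetteer: a hand-made list of place names to look for.
GAZETTEER = {"London", "Suez", "Bombay", "Calcutta", "Hong Kong",
             "Yokohama", "San Francisco", "New York"}

def extract_places(text, gazetteer=GAZETTEER):
    """Count whole-word occurrences of each gazetteer entry in the text."""
    counts = Counter()
    for place in gazetteer:
        hits = len(re.findall(r"\b" + re.escape(place) + r"\b", text))
        if hits:
            counts[place] = hits
    return counts

def to_csv(counts):
    """Render the counts as CSV text with a header row, sorted by place name."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["place", "count"])
    for place, n in sorted(counts.items()):
        writer.writerow([place, n])
    return buffer.getvalue()

# Invented sample sentence, not a quotation from the novel.
sample = "Fogg left London for Suez, then sailed from Suez to Bombay."
places = extract_places(sample)
csv_text = to_csv(places)
print(csv_text)
```

To produce a file for submission, the string returned by `to_csv` can be written out with `open("places.csv", "w")`; the filename is only an example.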
Reading:
- Fish, Stanley. “Literature in the Reader: Affective Stylistics.” New Literary History 2, no. 1 (October 1, 1970): 123–62.
- Reyes, Antonio, and Paolo Rosso. “Making Objective Decisions from Subjective Data: Detecting Irony in Customer Reviews.” Decision Support Systems 53, no. 4 (November 2012): 754–60.
Method: sentiment analysis
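A minimal sketch of lexicon-based sentiment scoring, one of the simplest forms of sentiment analysis. The two word lists and the sample sentences are made up for illustration and are far too small for real use.

```python
import re

# Hypothetical miniature sentiment lexicons (assumptions for this sketch).
POSITIVE = {"good", "great", "love", "excellent"}
NEGATIVE = {"bad", "poor", "hate", "terrible"}

def sentiment_score(text):
    """Return (positive word count) minus (negative word count)."""
    tokens = re.findall(r"[a-z']+", text.lower())
    pos = sum(t in POSITIVE for t in tokens)
    neg = sum(t in NEGATIVE for t in tokens)
    return pos - neg

sincere = sentiment_score("Great plot, excellent prose, terrible ending.")
ironic = sentiment_score("Oh great, another terrible sequel.")
print(sincere, ironic)
```

The second sentence scores as neutral even though a reader hears it as negative, which is the kind of case that motivates the irony-detection reading above.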
Method: network analysis
Lab Assignment: Web of Science
Method: topic modeling, supervised machine learning
Lab Assignment: Final Project