Machine Learning: A practical introduction for humanists and social scientists

Syllabus - Spring 2024

Sarah-Jane Leslie
sjleslie@princeton.edu
sarahjaneleslie.org

Lecture: Monday 1-3pm.

Machine learning – and in particular, deep learning – is rapidly opening new horizons for research in the humanities and social sciences. However, scholars in the humanities and social sciences can encounter barriers to learning about such techniques – for example, machine learning courses, especially at the graduate level, often require multivariate calculus, linear algebra and prior coding experience, which students in the humanities and less quantitative social sciences may lack. This course offers a practical introduction to deep learning for graduate students, without assuming knowledge of calculus or other college-level math, or any prior experience with coding. By the end of the course, students will:

  • be able to code and train a variety of basic deep learning models
  • develop an appreciation of the range of humanities/social science research questions to which deep learning can be applied
  • be fluent collaborators on research projects that involve machine learning experts
  • gain an understanding that will inform theorizing about machine learning (e.g., for research in AI ethics, technology policy, etc.)

Here is a Princeton homepage story on the spring 2023 version of the class, which features some helpful student perspectives.

Readings: The primary text for the course is François Chollet (2021), Deep Learning with Python, 2nd edition, Manning. We will work through the book in considerable detail. In the second half of the course, once we have worked through the basics, there are supplemental readings each week that give examples of how related models have been used in humanities/social science research. The supplemental readings aim to provide a sense of the breadth of applications, as well as to raise interesting issues for further thought.

Enrollment eligibility: Open only to Princeton graduate students, except with special permission of the instructor. Princeton undergraduates should take the undergraduate version of this course, SML 354.

Evaluation and assignments: There are simple weekly coding assignments to cement knowledge and build skills. Students taking the course for credit are required to complete these, though they are ungraded. That is, as long as a reasonable effort is made on them, students receive full credit for the week. In addition, there are three graded projects spaced throughout the semester. For these projects, students will be supplied with a data set and asked to build a model that addresses a question appropriate to the data set. Evaluation will be based on the appropriateness of the model for the question and the success of the code used to build/train the model, as well as answers to any other questions posed along the way. The three projects will cover the following areas respectively: classification/regression, computer vision, natural language processing. (For philosophy students only: successful completion of the above assignments will constitute a logic unit. Philosophy students wishing to receive a unit other than logic should contact me directly.)

Final grades are determined in the following way: 20% homework, 20% classification/regression assignment, 20% computer vision assignment, 30% natural language processing assignment, and 10% class participation.

Auditing: Graduate students are welcome to audit this course; however, I request that even auditors engage with the weekly coding assignments, since the value of this class lies largely in the development of practical skills. As with any kind of coding, these skills can only be learned by doing. Post-docs and other researchers/faculty are also welcome to audit the class informally.

Important: Getting started in Python

Prior coding experience is not required for this class, but students without Python experience will need to complete a crash course in the language. In particular, I have made video lectures covering Python basics, which will be available via Canvas upon registering for the class. The first several homeworks will assume that you have watched certain portions of the crash course, so you can work through it concurrently with the start of classes. However, if you are new to coding, I strongly recommend getting started in January so you can work at your own pace and do additional practice as needed; if you fall behind on this once the semester starts, it may be prohibitively difficult to catch up. The videos have accompanying code notebooks with exercises. If you can complete the exercises in the notebooks, you are on a good track. Students with prior experience in Python should test their understanding by making sure they can easily complete those exercises.

Please note that the video tutorial does not cover all aspects of Python; rather, it focuses on the key parts of the language required for the course.
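
To give a sense of the level involved, here is a short sketch of the kind of exercise the crash-course notebooks contain (this particular function is illustrative, not an actual notebook exercise): functions, loops, lists, and dictionaries.

    # Count word frequencies using a function, a loop, and a dictionary:
    # the kinds of Python basics the crash course covers.
    def count_words(sentences):
        """Return a dictionary mapping each word to how often it appears."""
        counts = {}
        for sentence in sentences:
            for word in sentence.lower().split():
                counts[word] = counts.get(word, 0) + 1
        return counts

    sentences = ["the cat sat", "the dog sat"]
    print(count_words(sentences))  # {'the': 2, 'cat': 1, 'sat': 2, 'dog': 1}

If code like this feels comfortable to read and write, you are well prepared for the start of the semester.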

Additional resources: A repository of good tutorials can be found here. For the W3Schools tutorial, I recommend working through it up to Classes/Objects. You might also consider attending a Wintersession course on Python.

Schedule of Topics

Week 1: Overview of machine learning
Reading: Chollet, chapter 1

Week 2: Fundamentals of neural networks: the forward pass
Reading: Chollet, chapter 2

Week 3: Fundamentals of neural networks: the backward pass
Reading: Chollet, chapter 2 cont.

Week 4: Introduction to Keras; Classification and regression
Reading: Softmax Tutorial; Tasks, Activations, and Losses
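
To preview what this material looks like in practice, below is a minimal sketch of a Keras classifier ending in a softmax layer, which converts raw scores into class probabilities. The toy data, layer sizes, and hyperparameters here are illustrative only, not part of the course materials.

    # A minimal Keras classifier on toy data: Dense layers ending in softmax.
    import numpy as np
    from tensorflow import keras
    from tensorflow.keras import layers

    # Illustrative toy data: 1000 samples, 20 features, 3 classes
    x = np.random.rand(1000, 20).astype("float32")
    y = np.random.randint(0, 3, size=(1000,))

    model = keras.Sequential([
        layers.Dense(16, activation="relu"),
        layers.Dense(3, activation="softmax"),   # probabilities over 3 classes
    ])
    model.compile(optimizer="rmsprop",
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    model.fit(x, y, epochs=5, batch_size=32, validation_split=0.2)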

Week 5: Introduction to Keras, continued; Training, validation, and test sets; Regularization
Reading: Chollet, sections 3.1-3.3, 3.6, & 4.1-4.3; section 7.2.2

Week 6: Practicalities of training models; Best practices for model evaluation; Begin computer vision
Reading: Chollet, chapters 5 & 6
Kapoor, S., & Narayanan, A. (2022). Leakage and the reproducibility crisis in ML-based science. arXiv:2207.07048.

Classification/regression project assignment distributed

Spring break

Week 7: Computer vision I: Introduction to convolutional neural networks
Reading: Chollet, sections 8.1-8.2
Supplemental reading: Wevers, M., & Smits, T. (2020). The visual digital turn: Using neural networks to study historical images. Digital Scholarship in the Humanities, 35(1), 194–207.

Classification/regression project due

Week 8: Computer vision II: Classification; Leveraging pretrained models
Reading: Chollet, sections 8.3 & 9.3
Supplemental reading: Pawlowicz, L. M., & Downum, C. E. (2021). Applications of deep learning to decorated ceramic typology and classification: A case study using Tusayan White Ware from Northeast Arizona. Journal of Archaeological Science, 130, 105375.

Computer vision project assignment distributed

Week 9: Natural Language Processing I: Representing words as numbers
Reading: Jay Alammar, The Illustrated Word2vec; Chollet, sections 11.1-11.3
Supplemental reading: Charlesworth, T. E. S., Caliskan, A., & Banaji, M. R. (2022). Historical representations of social groups across 200 years of word embeddings from Google Books. Proceedings of the National Academy of Sciences, 119(28).
Grand, G., Blank, I. A., Pereira, F., et al. (2022). Semantic projection recovers rich human knowledge of multiple object features from word embeddings. Nature Human Behaviour, 6, 975–987.
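
For a concrete feel for representing words as numbers ahead of this week, the short sketch below uses the gensim library and one of its stock pretrained GloVe models (neither is a course requirement; they stand in for the embedding tools discussed in the readings):

    # Words as vectors: vectors that are close together have related meanings.
    import gensim.downloader as api

    # Downloads a small set of pretrained 50-dimensional GloVe word vectors
    wv = api.load("glove-wiki-gigaword-50")

    print(wv.similarity("king", "queen"))    # relatively high cosine similarity
    print(wv.most_similar("paris", topn=3))  # nearest neighbors, e.g. other capitals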

Week 10: NLP II: Introduction to Transformers; Masked Language Models
Reading: Jay Alammar, The Illustrated Transformer; Chollet, section 11.4
Supplemental reading: Assael, Y., Sommerschield, T., Shillingford, B., et al. (2022). Restoring and attributing ancient texts using deep neural networks. Nature, 603, 280–283.
Manning, C. D., Clark, K., Hewitt, J., Khandelwal, U., & Levy, O. (2020). Emergent linguistic structure in artificial neural networks trained by self-supervision. Proceedings of the National Academy of Sciences, 117(48).

Computer vision project due

Week 11: NLP III: Generative models
Reading: Chollet, sections 11.5 & 12.1
Supplemental reading: Osmanovic Thunström, A. (2022). We Asked GPT-3 to Write an Academic Paper about Itself—Then We Tried to Get It Published. Scientific American, June 30, 2022.
GPT (Generative Pretrained Transformer), Osmanovic Thunström, A., & Steingrimsson, S. (2022). Can GPT-3 write an academic paper on itself, with minimal human input? HAL preprint hal-03701250.
Hutson, M. (2021). Robo-writers: The rise and risks of language-generating AI. Nature, 591, 22–25.
Stokel-Walker, C., & Van Noorden, R. (2023). What ChatGPT and generative AI mean for science. Nature.
Tregoning, J. (2023). AI writing tools could give scientists the gift of time. Nature.
van Dis, E. et al. (2023). ChatGPT: Five priorities for research. Nature.

NLP project distributed

Week 12: Machine learning ethics role play
Supplemental reading: Benn, C., & Lazar, S. (2022). What's wrong with automated influence. Canadian Journal of Philosophy, 1–24.

NLP project due on Dean's date