Course content

The aim of the course is to introduce fundamental skills and critical issues in the computational analysis of text. Students will learn key theoretical concepts and methods, and gain hands-on experience with tools and techniques, including the following:

  • quantitative properties of language, including frequencies
  • text processing in the Unix command-line environment
  • regular expressions
  • occurrences and co-occurrences
  • annotation using manual and machine learning methods
  • practice with tools for annotation, search, statistics and linguistic analysis
  • ethics, integrity and copyright

Learning outcomes

In order to pass the course, students are expected to be able to:

  • describe and apply methods for collection and cleaning of textual data, whether originally in digitized or non-digitized form
  • describe and apply methods for constructing, annotating, mining and analysing text corpora
  • critically review a dataset or corpus and assess the kinds of research questions it could be used to address

Practical information

The course has the following obligatory elements: active participation in seminars and completion of all written assignments.

The course will be examined through a learning journal of at most 4000 words, consisting of completed assignments and short written reflections relating to the student’s own field.

The course is taught as a series of seminars that combine theory-oriented lectures with practical, computer-based laboratory assignments. All sessions take place in a computer room. Each session is devoted to a particular theme and, with the exception of the last two, uses one or more digital tools in line with the course content. No technical skills are required at the outset. Students are encouraged to cooperate, and some assignments require collaboration.

The course is given in English.

NB. The course will not be offered online.

Period: Second half of Spring semester 2021

Course dates: see TimeEdit

Language of instruction: English

Course directors and examiners: Maria Kuteeva and Mats Wirén

Theme coordinator: Alison Klevnäs

Course title in Swedish: Digitala metoder för humaniora: text

The course is offered by the Department of Archaeology and Classical Studies and the Department of History, in collaboration with the Department of Culture and Aesthetics and the Department of Asian, Middle Eastern, and Turkish Studies.

The course is part of the theme Digital Humanities.