• Name: Harry Potter: Who Said What?
  • Role: Research, Data visualisation, Data processing
  • Year: Dec 2019
  • More Details: Full Report

Can the trends in a characterʼs dialogues over the 8 movies reflect their character development?

Background

When fiction books are developed into movies, character development is limited by screen time that is to be shared with plot development. Dialogues not only serve in carrying the story forward, but also reveal the nature of the character saying it, by what is said, and how it is said.

Method

Corpus: We found the scripts of all the movies online. We then converted them into text format, and used simple Python codes to extract the dialogues of Harry, Ron and Hermione separately.

Dialogue Tagging: We used NLTK, a tool in Python that allows us to deal with human language data. The program performed Dialogue Act Tagging, which identifies the nature of the dialogue, and tags it accordingly. We categorized our dialogues 4 main categories - Answers, Questions, Statements, Emphasis.

Data Visualization: We used Floursih, a data visualization tool to understand and identify the patterns and relationships in our data.