- Name: Harry Potter: Who Said What?
- Role: Research, Data visualisation, Data processing
- Year: Dec 2019
- More Details: Full Report
Can the trends in a characterʼs dialogues over the 8 movies reflect their character development?
Background
When fiction books are developed into movies, character development
is limited by screen time that is to be shared with plot development.
Dialogues not only serve in carrying the story forward,
but also reveal the nature of the character saying it, by what is
said, and how it is said.
Method
Corpus: We found the scripts of all the movies online. We then
converted them into text format, and used simple Python codes to
extract the dialogues of Harry, Ron and Hermione separately.
Dialogue Tagging: We used NLTK, a tool in Python that allows
us to deal with human language data. The program performed Dialogue
Act Tagging, which identifies the nature of the dialogue,
and tags it accordingly. We categorized our dialogues 4 main categories
- Answers, Questions, Statements, Emphasis.
Data Visualization: We used Floursih, a data visualization tool to understand and identify the patterns and relationships in our data.