When scholars try to make sense out of large collections of text, they frequently do two things: compare, and collect. They collect samples of “interesting” things, and compare them with each other along various relevant dimensions.
In this post, I demonstrate the collection and comparison features of WordSeer by using it to compare the usage of the word “love” in Shakespeares comedies and tragedies. You can watch the screencast, or simply read on.
The first thing to do is collect the comedies and tragedies into separate lists. To do this, I created a new collection called “tragedies” using the new “collections” feature.
Next, I had to collect all of Shakespeare’s tragedies into that collection. Figure 2 shows WordSeer’s list of plays. I walked down this list and clicked the checkboxes next to the tragedies, using Wikipedia as an authoritative source of tragedies.
Once I’d selected all the tragedies, I clicked the “Add Items” button to add them to a collection. I selected the “tragedies” collection and added the plays.
This populated the collection with the plays. I did the same for the comedies, ending up with two collections
I was now ready to compare my collections. I opened up two windows to the heat map view. One was going to visualize the tragedies, and one the comedies.
Finally, I was ready to compare the two. I was interested in the word “love”, and whether there would be any differences in how frequently it was used in the comedies and the tragedies. To that end, I typed in “love” into the comedies window and got the heat map in Figure 7.
Not surprisingly, “love” is everywhere. But what about the tragedies? In the other window, typing in “love” yielded the results in Figure 8.
To my surprise, the tragedies were equally full of “love”. Which, among other things, reveals my poor knowledge of Shakespeare.
Still, the hope is that our Shakespeare scholar, Michael Ullyot, (@ullyot) will use collections and heat maps to discover something truly interesting.