Vocabulary Analysis of the Gutenberg Collection

Vocabulary Analysis of the Gutenberg Collection

I found this page when looking for things on vocabulary density – something of relevance for reading books designed for language learners.  The guy who wrote it is also interesting, in that he has a non-traditional career path into academia.

He shows his analysis is of ~2000 Gutenberg texts based on vocabulary – the kind of thing I like to muck around with.  It is “unpublished” work, so lacks a few things, like references and axis labels, making it less useful than it otherwise might be.

Advertisements

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s