Something I have been pondering lately is the enormous vocabulary load that occurs in some graded readers that are intended for beginners. The grammar is simple but the vocabulary load is huge. Sure, things are often glossed, which speeds up the process of finding out the meaning of words, but it still prevents fluent reading.
When I first found booklets from the Bibliobus series published by Mary Glasgow, I thought they were wonderful. I only had 3 of them, at levels 6 and 8. I also loved Le Chapeau Rouge released by the same publisher. I’ve since collected more Bibliobus stories, and also acquired a collection of Lire Davantage booklets. What is clear to me, and I have been reminded of it by a friend who has been reading them lately, is that there is not much text but a lot of vocabulary load. The stories do vary a little in terms of quality and difficulty within the published levels, so some are probably of greater value than others.
In theory, these high vocabulary load stories provide language exposure that will increase a learner’s skill, since there are things that are unknown. Provided the learner can read them quickly they have some value for extensive reading. However, if there is a choice between another story with lower vocabulary load, more text and a smoother gain in vocabulary, then that would be better. It’s all down to availability. However, ultimately what matters is whether the story appeals to you enough that you are keen to read it. If not, it is best to find something else to read. As long as you are reading at least 10 minutes per day at a level that allows you to fluently read, follow the story, but not already know all the language that you encounter then you will improve your language knowledge.
Here are a couple more books/stories I’ve analysed for general vocabulary size at the ~95% cut-off, based on the first 100 words. Note that just using this figure in isolation is a bit misleading, because books like the one by Ford and Hicks use a lot of repetition and a relatively small vocabulary overall, making it possible to learn relatively easily. It just isn’t necessarily all highly frequent vocabulary. This is where vocabulary density is also a useful guide, so I’ve included this figure as well.
If the 95% general vocabulary size is high and the vocabulary density is high, it means that you may be able to read comfortably, learning the vocabulary of the given text, but its relative usefulness will depend on whether it matches the vocabulary that you need for your language goals.
|Title||Author||Publisher/Series||Gen Vocab Size at 95%||Vocab Density at (n) words|
|Reading approach to French||Ford and Hicks||J.M. Dent and Sons (Canada) Ltd.||12,059||2.77 (122)|
|Le Visiteur||Sue Finnie||Mary Glasgow/Bibliobus||11,260||2.05 (123)|
This follows on from my previous table of figures. When using general vocabulary rank frequency lists it certainly seems normal for graded readers to effectively use a very wide vocabulary, leading to an expected general (raw) vocabulary size of 4,000 to 12,000. To do something considerably less requires careful vocabulary control, such as occurs in my Gnomeville series, which achieves this through exclusively using French-English cognates and the most frequently occurring words. Initially it may seem a little artificial, but becomes more natural and flowing as the stories progress. A similar approach is used in Si Nous Lisions, in that a very small vocabulary is used initially, and then a new word is added every 90 or so words. While I came up with the idea independently, the concept of vocabulary control is attributed to Michael West.
At some point I’ll publish a comprehensive list of graded readers with these statistics, but I’ll first need to automate the process a bit more and get rid of a few bugs. Meanwhile, let’s keep reading at least 10 minutes a day of easy but not too easy text in the languages we want to learn.