Reader Levels: Thoughts as I do another Tadoku month


Level 0: Single-word nouns or adjectives – if the book is nicely illustrated in a way that makes the words identifiable, not too long, and maybe has some punchline equivalent at the end, as some do, then these are good for practising an unfamiliar alphabet such as hiragana and katakana.  The words are typically not high priority words, but tend to recur in stories anyway.  I have had enough repetition of certain animal words that I know them, even though they are not very useful for me when communicating to others.

Level 1: Repeated sentence structure – as above, these are excellent reading practice, and can help people learn some basic grammatical structures, while a story of some kind is told via the repeated sentence having different substituted nouns that are identifiably illustrated.  The LOTE series by Nelson Price Milburn are very good in this regard.  If they were longer than they are, then they would be tedious, but there are about 6-7 repetitions with minor variations, followed by a punchline of some sort.  The books by Evrat Jones, published by PCS Publications, are not as good, largely because of the illustrations.  Maybe I’m biased against old-fashioned repetitive images that look like dorky Grade 1 readers from the sixties, but their lack of appeal makes them more of a chore to read through.  They would also benefit from a glossary at the back.

Level 2: Small vocabulary and a small set of grammatical constructions.  Here is where the typical vocabulary-controlled reader fits into the scheme of things.  Within this level are all the stages of most published reading schemes, taking readers from around 300 words of vocabulary to 2,000, and from present tense to all the normal grammatical constructions.

Level 3: Native text.

Reading at levels 0 and 1 for the past week or so has me thinking there is a niche for books at these levels for adults.  Given an adult’s greater world knowledge and sophistication, it should be possible to create a more interesting narrative with these levels than is currently seen.

Thoughts on Up Goer Five and Constrained Vocabulary Writing


When I first saw the Up Goer Five comic by xkcd, I loved it.  It epitomised what I do with my comic book and my research, and is a convenient example to show people, when explaining the idea of constrained vocabulary writing.

Fans figured out that the 1,000 words used by xkcd for it were the contemporary fiction list, shown in Wiktionary.  This frequency list is based on over 9 million words of on-line contemporary fiction.  It combines plurals and simple verb forms into one listed word (lemmas), which is a good choice, since if the root word is known, then the plurals with s, and simple verb forms are usually also understood.

As someone who writes using lists generated based on frequency, I’ve noticed that several problems arise.  One is that, typically, male pronouns and nouns occur at higher frequencies than female ones.  The Wiktionary list is not overly biased in this way, possibly because it is based on contemporary fiction.  “he” is ranked at 8, “her” and “she” at 12 and 13 respectively, and “his” at 16.  However, we find “man” at 163 and “woman” at 452, but “girl” is at 133 and “boy” at 217.  This hints at what has been termed the systemic “infantilization” of women in society.  The figures are probably quite different due to the common pairing of “guy” (at 178) with “girl” in colloquial speech.  Google’s auto-suggest, which is also based on frequency, has occasionally come up with phrases that are considered racist, sexist or otherwise problematic – and it is purely a reflection of what we as a society tend to write.  When writing in a principled manner for language learners, it may be important to balance what word frequency lists tell us, with what is a more equitable representation.  I didn’t really think very much about this when I started writing Gnomeville years ago, but have become more aware of these issues thanks to some of my friends who are more knowledgeable in them.

Another issue that needs to be considered is what is culturally appropriate to write for the target audience.  For example, I have recently been made aware that it is inappropriate to use words referring to alcoholic beverages when the audience is Islamic.  Obviously for work intended for children (or for experimental subjects) it is customary to exclude expletives.  For this reason, several words on the list would need to be excluded.  There seems to be an expressive set of expletives in the list.

For the method of writing I employ in the Gnomeville story, I  introduce one new high frequency word per page of story, and somewhat less frequently I introduce a grammatical pattern.  Sometimes I’ve changed the order in which I add words due to the story.  This happened in episode one, in which I introduced “se” very early instead of after about a dozen other words.  Also, I recall that “le” was added before “de”, even though their ranks are reversed.  Having said that, my first 20 words were based on a corpus of newspaper articles.  Every corpus gives a different ranking of words.  There are some similarities across corpora however.  For example, if the corpus is large enough, the frequency of the word “the” is likely to be about 7% for English text.

Anyway, back to Up Goer Five.  The upcoming book “Thing Explainer”, as well as the text uploaded to the up goer five text editor provide some good practice at reading for people still consolidating their first 1000 words of the English language.  If going beyond that, the writing should have less than 5% of words outside the vocabulary set to be suitable for improving language skill while fluently reading for comprehension.  A text editor with more flexibility is the OGTE Editor, designed for writing English text for different language learner levels.

Children’s books


While searching for books that are simple enough for me to read in Japanese, I’ve been musing about the attributes that make these suitable for language learners.
If starting from scratch you need repetitious text with obviously illustrated nouns. This describes some of the books I purchased. The downside of the simplest books is that they become an illustrated list of nouns (or adjectives like colours), and therefore have no narrative.
In Japanese you have the added complication of the writing system. Beginner books use hiragana only. Then there are some that have katakana with hiragana transliterations. Then there are a few that use both the alphabets without guides. At the next level kanji are included with hiragana guides. The level of support for kanji varies.

Mots Croisés (Crosswords)



I was recently in Perth for a couple of weeks, so made the most of the opportunity to visit le forum in Fremantle.  It is a lovely little French bookstore in an almost abandoned small shopping mall at the edge of the shopping area of the city.

I came away with a large bundle of books, mostly for reading as a foreign language, but also a couple of crossword books for children.  People may have the impression that crossword books for 8-year olds would be easy for language learners, but that is not the case.  Children seem to be exposed to and know many more nouns than the typical language learner.  I worked my way through a few crosswords in my new Mots Fléchés book for 8-year olds and it revealed the huge gaps in my vocabulary.  Each crossword had a particular theme.  I did ok on common animals (lion, tigre, zebre, léopard, éléphant), European cities and words about Asia (sumo, sushi, sari, panda), but completely failed on words about the snow, medicine or the kitchen.

My other purchase, “Jeux de mots” for 8+, was easier, due to the dense French-style crossword grids that have clues like “the first and 4th letters of the alphabet”, “double vowel”,  “the first two letters of italien”,  “the second person singular of the present tense of avoir”.  (I’ve translated the clues here.)  These provide lots of hints for the other words of the puzzle.  There were also many more core vocabulary words like man, woman, place, with, pretty etc.

I’ve done other crosswords in French or for French in the past.  One book starts with small crossword triangles with simple words up to 3 letters long and 3 clues in total, and works its way up to 12 by 12 grids and the more difficult verb tenses.  I seem to have lost the original though, so I can’t tell you its name or publisher.  ELI has a book entitled “Jeux faciles en français”, which is for primary school children.  They have a page of vocabulary, such as the numbers from 1 to 10, then a join the word to the item, followed by a crossword and word search.  This pattern repeats for each set of vocabulary.  I remember enjoying this kind of activity as a 5-year-old, so perhaps it works for young kids.

I have a couple of books of vocabulary games including crosswords by Maurie N. Taylor, published by the National Textbook Company.  These are for English-speaking students of French, and are not immersive, but can help cement vocabulary.  However, I think that given the amount of vocabulary and language knowledge assumed in the books, that less English could have been used in the book to give more practice.

I also do crosswords in Dutch sometimes.  The children’s ones are possibly marginally easier for me than the simplest adult ones, but again, the adult ones usually provide more cross-clues, which offsets the slightly more difficult language.

I believe it is possible to create immersive crosswords for use at the earliest stages of language learning – certainly for European languages.  I do this in my comic book.  The crossword uses the episode’s target vocabulary and incidental cognate vocabulary, as well as the sentences in the story that have just been read, to provide reading practice and vocabulary production practice without reverting to English.

All the Drama


I’ve been reading a lot of easy readers in French and German of late, and some of the recent ones are of the Lectures CLE en français facile series.  I’m now at a point where reading the 500-600 word vocab ones are easy enough for the story to be enjoyed.

In the past I’ve not looked forward to reading abridged and adapted versions of classics, either because I wanted to read the original at some point, or just because I found them uninteresting.  However, I’ve changed my mind.  The CLE 500-word vocabulary adaptations of En Famille (by Hector Malot) and La Guerre des Boutons (by Louis Pergaud) were both thoroughly engaging.  I’m now reading Jacquou le croquant by Eugene le Roy, which is a 600-word vocab adaptation.  There seems to be a lot of 19th century French literature about people living in poverty and hard times, and CLE has made it accessible to learners of French.  I find the lengths of the books to be about right too, at about 50 pages of story – probably about 12,000 words all up.

I hope that I can find a similar resource for German. 

Green Eggs and Ham


Continuing my articles on easy readers, today’s post is dedicated to Dr Seuss.  His first constrained vocabulary book was The Cat in the Hat, which was released in 1957.  It had a vocabulary of 236 words (though other sources state slightly different numbers) , and yet was entertaining.  Dr Seuss then went on to write other books with a reduced vocabulary, including Green Eggs and Ham, which has only 50 different words.  Wikipedia lists the words used as: a, am, and, anywhere, are, be, boat, box, car, could, dark, do, eat, eggs, fox, goat, good, green, ham, here, house, I, if, in, let, like, may, me, mouse, not, on, or, rain, Sam, say, see, so, thank, that, the, them, there, they, train, tree, try, will, with, would, you.

These books are truly brilliant, in that they make children want to keep reading, while being easy for them to read.  A 50 word vocabulary is an amazing achievement.

Green Eggs and Ham has been translated into many languages.  We have a copy in Italian: “Prosciutto e uova verdi”.  However, once translated, the number of distinct words increased to about 127.  A translator has to somehow convey the original story as well as the rhyme and rhythm.  I’m not sure if the translator considered the goal of using the smallest vocabulary possible, as 127 for a text length of 620 words seems quite high.

I’m reminded of a section in the book “Gödel, Escher, Bach: an eternal golden braid” by Douglas Hofstadter, which looks at German and French translations of Lewis Carroll’s Jabberwocky poem.  The original has many made-up words that hint at meanings due to their phonetic similarity to other words.  For example “slithy” hints at “slimy” and “slither”, among other words.  It is translated to “lubricilleux” in the French and “schlichten” in the German version.  Combining made-up words with rhyme and meter makes it a very difficult translation challenge.

But back to easy readers.  While it may be useful to translate an easy reader to another language – particularly when they are as entertaining as those of Dr Seuss –  I think the best work can be achieved by exploiting the quirks of the original language.  Design your easy reader for your specific target language.


Easy Reader Hall of Shame


Having finished reading yet another boring German easy reader, I thought I’d dedicate today’s post to easy readers that should never have existed.

Authors of easy readers are trying to trick language learners into learning target language while being entertained with a story.  However, some authors don’t do enough entertaining, and it is transparently obvious that the story (or part of it) is just a vehicle for exposure to vocabulary or, in the case of CEFR-based books, practising common scenarios required for surviving in the target language.

So here is my list of the worst easy readers I’ve read:

  1. “Sur les Routes de France” by G. Colquhoun and E. Guergady, published by John Murray, (French).
    Not only was this boring but it was long: 137 pages of tedium.  It is basically a story of a family going on holiday in France, following their trek through different locations.  I read this over a number of weeks as it took enormous willpower to finish it.  Happily this is out of print.  I thought that I must have the last copy in the world, but a quick Google search reveals that there are second-hand copies on the market.  What amazes me is that this work, that was first published in 1959, got reprinted several times.  I have a second edition copy from 1964, but I see on-line that there was a 1973 3rd edition .  That means that at least 14 years of students had to suffer this text!  Steer clear of this one.  There are many better stories to read.
  2. “Lustige Dialoge” by Harry A. Walbruck, published by National Textbook Company in 1985 (German).
    This contains 30 short dialogues that are “humorous”.  My Goodreads review states it succinctly: “Full of annoying anecdotes involving dated stereotypes passed off as humour. I really struggled to finish the book.
    To be fair, some of the dialogues were somewhat humorous, but for the purpose of providing extensive reading material it fails, as the individual dialogues are too short, and there is no motivating reason to read the next one.
  3. “Deuxième Acte” by O.M. Fordham and V.L.R.Lewis, published by Harrap in 1965 (French).
    This book is a sequence of letters written between members of two families.  The content of the letters, as well as the dialogues in the back of the book are very mundane.  It was another book that was difficult to finish, but was mercifully short (compared to no. 1) at 76 pages, including exercises that I tend to skip over.  One thing that I have found too many of is stories of typical traditional nuclear families doing ordinary things, such as the mother in the kitchen or doing the shopping, the father coming home from work, blah blah.  Admittedly some of these books are old, the authors of the various books probably didn’t know of each others work, and the average student wouldn’t have been exposed to more than one of these, but I’m rather sick of the genre.
  4. “Glück gehabt” by Theo Scherling and Elke Burger, published by Langenscheidt in 2010 (German).
    There are many other books I could list that are like no. 3 in their irksomeness, but I’ve decided to go for variety rather than comprehensiveness.  Langenscheidt do a good job at making their text and their audio easy.  They also do well in avoiding stereotypes for their characters.  For example, in Leo & Co, one of the regular characters is a single mother who is a car mechanic, working with her dad, while Leo himself is an artist who runs a pub/eatery.  On the down side, the stories are a bit on the mundane side, with this particular one being the worst I’ve read so far.  The story mostly follows the lead character in the act of getting a new apartment.  While listening to it I kept waiting for the plot to start, but nothing really happens other than finding out about an apartment, getting it, then moving into it, with a small side story of a colleague hurting their back.  Skip this one unless you’re desperate for the practice.
  5. “Technik und Natur” by Mike Lynch, published by Heinemann 1996.
    I’ve chosen this reading booklet as an example of another problem that often occurs with supposedly easy readers.  Some authors control the grammar but use far too much difficult vocabulary.  This particular booklet has interesting subject matter for those interested in science and technology, but the percentage of difficult words is too high.  There is vocabulary support, but text should really be written in a way that less than 5% of the words require it, since it interrupts the flow of reading.

I think that will do for now.  So the moral of the story is, when writing for language language learners:

  • Don’t be boring,
  • Optimise the length of the text to allow a good reading session,
  • Don’t inundate with vocabulary.

It is possible to do the above.  There are many examples.  I love some of the stories in the Découverte series, such as “Dans la Maison Bleue”, which are full of imagination, and have illustrations that complement the stories.  Here’s hoping there are more interesting books in the future.  I hope mine will also be enjoyed.  I’ve had good feedback so far.