Category Archives: Resources

Resources for language learning and for producing language learning resources

Book cover with musketeer holding a boot, saying "Diable !"

Repetition in Graded Readers

There is a class of graded readers that make much use of repetition in order to give learners the best chance of picking up vocabulary via their reading. This is the approach used by TPRS, Wayside Publishing, and the Old English book Osweald Bera. On re-reading Otto Bond’s Sept d’un Coup, published in 1936, I noticed that a similar technique was used, albeit more subtly, so it is not a new idea.

My observation of reading books with lots of repetition is that on the first read this is fine and helpful, but on rereading it becomes extra tedious. If things get too dull, the learner won’t engage. It occurs to me that my approach in the Gnomeville comics saves people from rereading unless they really want to, via their design.

The first couple of episodes of Gnomeville are moderately repetitive, in that they are not the most repetitive French graded readers around (Bootstrapping the Three Musketeers is the most repetitive I know of, followed by Le français par la methode nature, followed by Édi l’élephant), but they are more repetitive than other books, based on analyses of the first roughly 100 words. I try to ensure at least five occurrences of each new vocabulary word in the comic but I don’t force it into the Gnomeville story. Instead, I have a separate story (La Question du Moment), which uses any words that haven’t had enough repetition yet. Then the following episode has a revision page at the start, which recaps the story so far, using all the vocabulary and grammar used so far. Learners can read this and decide whether they are familiar with everything or need to revisit the previous episode. When revisiting, learners can skip La Question with all its repetition and just reread the Gnomeville story. Or they can use some other method of checking the words that are still unfamiliar.

When rereading the first few chapters of Osweald Bera, which I do periodically because I only dip into the book occasionally in my world of many demands, commitments and distractions, I have been wishing for a summary for each chapter, which summarises the story so far and includes all the vocabulary I should have picked up from it without the additional repetition. I consider it my “homework” to create this for my own use. For languages like French, Italian, German and Spanish, there are quite a few options for reading material, so it is probably better to read something else instead of trying to reread these often long, verbose, repetitive books. For Old English, however, Osweald is the only reader of its kind, so rereading is likely to be inevitable.

Episode 4: Easy Authentic Sentences from French Classics

On this page are short extracts, titles and sentences which occur in classic French texts and only use the vocabulary of Episode 4 of my Gnomeville comic book series, that is, 39 frequently occurring words in French newspapers plus exact cognates and names. (I may update this list periodically, when I find more things with my scripts.)

Un signal !
Un danger pour nous !

Cinq semaines en ballon by Jule Verne.

Ah!
C’est vous, Tréville !

Oh !
C’est vous, c’est vous !

Les trois mousquétaires by Alexandre Dumas.

Il descend !

Elle est en prison ?

Mais qui ?

Elle est dans la prison de Mantes.

Mais silence, silence !

Qui, elle ?

Une lettre !

Ah mais !

Elle !

Impossible de continuer le commerce.

Mais ce secret ?

Signez, Milord !

Plus horrible !

Mais c’est horrible !

Qui est-elle ? et qu’est-elle ?

Mais, c’est impossible !

Ce n’est pas ELLE !

Ah ça !
Ah ça !

C’est vous, madame Poisson ?

Ça c’est grave !

C’est une supposition.

C’est un secret.

C’est ça !

Oh ! c’est elle ! c’est elle !

Le Canada est content de vous !

C’est nous !
Nous, Blanche et Rose !

C’est elle , n’est-ce pas ?
C’est elle.

Qu’est-ce que c est que ça ?

C’est ça , Lieutenant.

Est-ce vous , Adolphe ?

Et ça , qu’est-ce que c’est ?

Mais c’est abominable !

C’est pour ça.

C’est pour vous.

Avec elle , c’est impossible !
Ah !

Ah !
Et une lettre de madame Bonaparte.

C’est terrible, ça !

Oh !
Impossible de refuser.

Zdenko a de la consolation, consolation, consolation !

Entre !

Je n’entre pas.

Un ordre de Madame.
Un ordre de Madame ?

C’est un monstre !

C’est une lettre !

Mais c’est une fortune !

C’est ça qui est une transition !
Ah !

Oh ! qu’est-ce qu’il a ?

Ah ! c’est vous, baron !

(Hamlet entre.)

C’est possible ! mais … ce n’est pas certain.

C’est grave, ça !

Il est avec nous.

Mais la Bastille ! la Bastille !

Ça n’est pas possible !

Est-elle noble !

La phrase est-elle simple ?

Mais il est innocent !

Continuez, madame.

Un aide de camp entre.

Mais qui est-ce ? … Qui est-ce ?

Mais c’est Julien !

Qu’est-ce qui arrive ?

Elle est en France !

C’est d’elle !

Mais c’est un commencement , c’est un signe.

Mais, madame, c’est impossible.

Elle est l’unique force.

Gnomeville Comics are Easier than I Thought

On reviewing my readability measure results for various items in my collection, I suddenly thought, “hang on, how can the expected vocabulary size for Gnomeville Episode 1 be 25 when only 12 very frequent words are introduced?” Clearly something had gone wrong somewhere.

I blame the fact that part of my analysis is manual, and I probably didn’t follow the procedure very well. I run various scripts to produce a ranked list of words in the text in the frequency order of a large corpus of written French (mostly from Project Gutenberg). The manual bit is counting up cognates, or at least starting at the least frequent word end and counting up until I find 5% of the words that are not cognates or names. I think I went astray previously by having a less reliable process.

Results can differ depending on decisions that are made, such as whether to include titles (which I treat as sentences), the “Présentation” section that has brief notes about each character, and what is counted as a cognate. It is reasonably clear-cut for Gnomeville, but for other texts, it is less clear. Should “habiter” be considered a cognate due to its similarity to “inhabit”? And there are other words that are cognates in the linguistic sense but not particularly obvious from a learner perspective. The choice of general frequency list will also make a difference. Spoken text has different characteristics to written text, especially in French. Also, the very frequent words used for Episode 1 and 2 are the 20 most frequent in French newspapers, which is not the same set of words as any other corpus of text. The text I use for calculating expected vocabulary size has some of those words at lower ranks (“se” at 25, “au” at 31, and “on” at 40), which explains why there was the potential for the expected vocabulary size to be larger than the number of words introduced. But unless those words made up about 5% of the extract it was unlikely they would receive those scores.

Anyway, on revisiting my incorrect assessments of the Gnomeville episodes, I have the following updated vocabulary sizes.

EpisodeOld Expected Vocab SizeNew Expected Vocab SizeNew Readability Score
12532.20
216143.23
340173.83
4153.66

You may notice that Episode 4 has a lower expected vocabulary size at 95% and a lower readability score than Episode 3. There’s not a lot in it, but Episode 3 had longer sentences in the extract.

Well, there you are. Gnomeville’s expected vocabulary size is much smaller than originally calculated – at least for Episodes 1 and 3.

Review: Le Français par la méthode nature

I’ve seen this book by Arthur M. Jensen mentioned a few times and I thought it was worth a look, given its philosophy. This book, which was originally published in 1958 (and still in copyright, according to the death+70 rule), is a reading-based introduction to French, with the pronunciation of the text in IPA under each line. The text is quite mundane and repetititve, but the repetition is intentional to allow the language to be acquired by reading. Things get a little more interesting after a dozen or so chapters. A similar approach is used in several (relatively entertaining) stories in French, such as those by Wayside Publishing and TPRS, in addition to other languages. It is also used for an engaging story in Old English called Osweald Bera.

Jensen’s book makes no assumption about the learner’s first language. There are no glosses or definitions in another language. There are pictures to illustrate nouns that are introduced and names of characters talked about. This makes it a good choice for those with a language background that is not English, since many books assume English (or another common European language). It may be less useful for those who have no prior exposure to the Roman alphabet. That would need to be learnt first.

I ran my usual analysis on the first approximately 100 words and confirmed that based on my measures, it is easy French. Its type-token ratio (measure of repetition and learnability from text) is the lowest I have found so far, meaning there is a high chance of learning the vocabulary when reading the text. Its overall score, encompassing expected vocabulary at 95% coverage and sentence length (assuming that the text above and below the pictures on the first page, such as “une fille”, were sentences) was 4.84, making it the 7th easiest in my small table of the readability of French texts for learners.

One weakness of the text is that it is a bit old-fashioned. A lot of the conversation examples are not how people speak nowadays. Also, some of the words are more what would be read rather than spoken, such as “demeurer”, which I only come across in texts, whereas to my knowledge, “habiter” has been the most common verb to use for decades.

Ayan Academy has audio of many chapters on Youtube. This can be useful simple audio comprehensible input.

In summary, its strengths are that there is no assumption of first language, it is comprehensive, and there is a high chance of learning the language from reading due to its high level of repetitivness. Its weaknesses are it is dull and dated.

Extensive Listening

Mostly I have focused on extensive reading, both in my research and practice. This is obvious from my DELF results, where listening, and especially speaking, were much worse than my reading and writing results in French. To build skill in listening requires suitable comprehensible input. There are various podcast and youtube channels that are recommended. Some have beginner content, others intermediate, and others cater to multiple skill levels. Partly for my own benefit and partly as a resource for others, I thought I’d list those I have found out about.

I should also mention that if something is too hard to comprehend because of the speed, you can always slow it down. If it is too hard because you are unfamiliar with most of the words, it is best to start with something easier and/or do more reading first. (Also, if it is too easy, you can speed it up).

Beginner channels.

There are also a few easy songs you can listen to (based on repetitiousness and vocabulary size). I’ve also become aware of a few easy children’s programmes.

Intermediate channels

For the intermediate channels, you may need to have developed your vocabulary a bit more via extensive reading with graded readers and the like.

At the intermediate level, the innerfrench.com is the most recommended. It starts slowly and has many episodes, which generally become gradually more difficult, apart from a few exceptions. There are some interesting topics amongst the series.

Another that has some interesting comprehensible content is Français avec Fluidité. There is an A2+ playlist (start here) as well as various interesting topic playlists. I feel as though it is slightly more difficult than the innerfrench podcasts but still quite comprehensible for an intermediate.

Little Talk in Slow French is beginner-intermediate level but has some words translated into English during the podcast.

Lingua.com has recordings that are labelled with CEFR competence levels, from A1 to B2, with questions to test your comprehension. They also have resources for other languages.

Native Content

Some documentaries start to be accessible at intermediate levels. An example recently recommended is https://www.youtube.com/watch?v=qcaSou1FUAQ.

Ted talks are often more clearly enunciated than conversations but still sufficiently challenging.

Elsewhere

Since creating this post, I have learnt that there is a comprehensible input wiki with resources for many languages. The French list of listening resources is more comprehensive than what I list here, so worth checking out.

Excerpt of sheet music for the song La Mission by Uitdenbogerd

The Easiest Songs in French

Someone recently asked whether there were any A0 songs in French for improving listening skills. There may be some learner songs. I wrote a few to go with my comic books but they are optimised for reading, not listening.

So what would be the criteria for easy songs? I think that the easiest would have a very small set of words and be repetitive. Where two songs equate in vocabulary and repetition, perhaps the one with the easiest grammar or the most standard expressions would be ranked easier.

I cannot comment on whether there are any easy popular songs. Most have extensive lyrics, with only the chorus being repetitive. However, in the collection of children’s songs (comptines) and folk songs, several can be found. Some were printed in my old Horan and Wheeler textbooks. Others I have found elsewhere. Given the above criteria, here is my list and a rough sequence of difficulty based on vocabulary size; type-token ratio, which captures repetitiveness; and a grammar scale. Beginners will probably want to read the lyrics for the first few listens.

SongVocabulary SizeLyrics LengthType Token RatioGrammar LevelMeasure
Bonsoir mes amis3210.14300.06
Savez-vous planter les choux ?191180.16110.23
Frère Jacques7140.50010.25
Sur le pont d’Avignon231300.17710.27
Alouette13960.13530.28
Quand trois poules14290.48310.29
Dansons la capucine
(or more seriously)
26910.28610.32
J’ai mangé un croissant10200.50010.32
Au petit pas militaire12390.3130.33
J’ai mangé un croissant remix19560.33930.39
Didi et Dada32720.44410.41
J’ai du bon tabac341110.30630.48
La mission33480.68810.49
Au clair de la lune701330.52640.86

Book cover with musketeer holding a boot, saying "Diable !"

Bootstrapping the Three Musketeers

Those who have visited my blog this year will know that I have put up some “filtered French”, such as a list of the most common one-word sentences in French classic literature, and sentences that fit the highly constrained vocabulary of my comic books. After musing on language acquisition, in particular how babies learn, not to mention our experience of picking up a few words and phrases in a foreign language by ear, I thought I’d try a different approach. This has resulted in producing a book (with more volumes to come) where I filter Les trois mousquetaires, and add vocabulary one word at a time based on which word will complete the most sentences. Using a combination of manual and automatic filtering, I have created extracts that have sufficient repetition in their vocabulary for people to become familiar with the words.

It has been fascinating to see what happens as I add each new word. The algorithm tends to find dialogue first, gradually increasing in average sentence length, then short non-dialogue sentences – after the 93rd word of vocabulary was added.

Anyway, if you’d like to have a look, it’s on Amazon, with a substantial preview.

(Affiliate links in this post.)

French Novels Recommended to Learners of French

At some point learners of French should tackle novels in French that are written for native speakers. However, they vary considerably in difficulty. I thought I’d keep a list of those I’ve seen recommended. At some point I’ll add a readability score, as modelled for learners of French with an English-speaking background.

Several others are listed at https://www.private-frenchlessons-paris.com/blog/10-books-for-french-learners.

Novels that are translations from English to French are often easier than those originally written in French, but it is definitely not always the case.

(This page includes affiliate links.)

Common One-Word Sentences in French, Revisited

I’ve been playing with my scripts lately, filtering French. Previously I have published a list of the top one-word sentences in a corpus of French classic texts, as well as my lists of very easy extracts, based on the language repertoire covered by my Gnomeville comics. Today, while waiting for my very inefficient scripts to finish processing my old download of the French texts from Project Gutenberg, I revisited the frequent one-word sentences. I decided to keep the exclamation marks and question marks this time, so it is clear whether something is being used as a question or not. Here is what is coming up so far…

  1. Ah !
  2. Oh !
  3. Eh !
  4. Hélas !
  5. Oui.
  6. Non !
  7. Non.
  8. Oui !
  9. Comment !
  10. Quoi !
  11. Bah !
  12. <name>. (most likely names of characters in a play, the first one being Bonaparte.)
  13. Pourquoi ?
  14. Bon !
  15. etc. (probably an artifact of how things were processed)
  16. Allons !
  17. Ha !
  18. Tiens !
  19. Hé !
  20. Moi.

There’s quite a bit in common with the previous list of one-word sentences. The exclamations that showed in the previous list (Diable ! Parbleu !) still occur in the top 30, so there isn’t a lot of change despite the much larger corpus. I suspect further changes to be quite minor as the processed corpus grows.

The Second Easiest Series of Books in French

At last I’ve found them. The books that can be read after Gnomeville.

As those who have been following my blog or buying my comics know, my comics start from a vocabulary of zero French but an English speaking background. Episode 1 introduces twelve very frequent words (with over 300 words of text); Episode 2 adds the remaining eight of the top twenty words occurring in French newspapers (while giving over 700 words of text to read); Episode 3 adds nine more frequent words (with 1200 words of text) and Episode 4 adds ten more (in 1800 words of text). This makes a total of 39 frequently occurring words. In addition, the comic uses many French-English cognates to make entertaining stories.

While I’m sure that the books I’ve found don’t restrict themselves to frequent words, they do start with a very small vocabulary and include repetition to allow the vocabulary to be acquired easily. The book with the smallest, at 55 words, in an illustrated text of 2100 words, is Edi l’éléphant. From there you can go to Les abeilles exploratrice at 88 words, then Émeraude, le bébé tortue, at 90 words. From there you can go to Brandon Brown dit la verité (95), Brandon Brown veut un chien (104), Brandon Brown à la conquête de Québec (165), and Obsession dangereuse (200). Some of the “Novice Mid” books have smaller vocabularies than these but use past tense.

I’ve now had a chance to look at a couple of sample chapters of two of the books. I can say there is definitely a narrative, but the low vocabulary in many words of text mean that there is quite a bit of repetition. This is great for acquiring vocabulary, but if you already have this vocabulary, you will probably want to choose something a little more challenging.