Tag Archives: research

Function word frustrations

I recently re-watched Dilili in Paris, which is a fabulous animation movie for children, with French dialogue that is slow enough for French language learners to follow. I originally watched the movie during the Melbourne French Film Festival and considered buying the movie later so I could try watching it without English subtitles.

Frustration 1: Memory

There is a frequently repeated phrase when Dilili meets new people: “Je suis heureuse de vous rencontrer”. It was semi-humorous, and certainly designed to be remembered, to teach how to be polite when meeting someone new. However, what I actually remembered after a week or two was: “Je suis heureuse __ vous rencontrer.” Despite being exposed to many occurrences, the function word was lost. Function words don’t provide semantic content and therefore appear to be harder to retain. There is certainly research evidence that concrete nouns are easier to remember than various other types of words. This movie brought that home to me in a big way.

Frustration 2: Resources

(Not really about function words…)

I bought the DVD of the movie, and then when viewing it, discovered that the subtitles could not be switch off, and that the only subtitles were in English. I don’t know who makes these decisions when preparing DVDs for sale, but perhaps they don’t really consider their audience carefully enough. A French movie sold in Australia would have various audience segments: French ex-pats – possibly including some French people who are hard of hearing, Australian francophiles, Australians learning French. To me, movies and TV episodes are highly useful for practising comprehension of the spoken language. Ideally it can be done at three levels of difficulty (with the example given for L2 referring to the language being learnt and L1 referring to the native language):

  1. L2 audio with L1 subtitles,
  2. L2 audio with L2 subtitles,
  3. L2 audio without subtitles

I even do this with DVDs that were originally in English. I’ve watched two entire series of Perry Mason with French audio, which was quite illuminating. If you are short of practice material, check your DVD collection for audio in your target language. You may be pleasantly surprised to find a good selection amongst your favourite shows.

Frustration 3: Vocabulary Size

(Function words are frequent words…)

One of the excellent things about some graded readers was that they were designed for a specific vocabulary size. For me, vocabulary makes all the difference between a readable text and an unreadable one.  CLE International used to publish books targeting a specific vocabulary size. For example, Niveau 1 had vocabularies of 400-700 words. Through extensive reading, I have successfully moved from 300-word vocabulary books to 700-1000 word ones, and I hope to continue to progress through further reading. However, as with other publishers, the publications have now been converted to CEFR levels: A1, A2 etc. and as far as I can tell, the subtleties of vocabulary size have been removed from the book information.

I have completed a CEFR B1 in French, yet I’m most comfortable reading A1 texts (and texts with less than 1000 word vocabularies) and with few exceptions they are not easy apart from the grammar, which is too easy for me, but the books are still sometimes challenging vocabulary-wise. What frustrates me is that A2 covers such a wide range of vocabularies, depending on the source material, from readable to incomprehensible. Published vocabulary sizes for A2, where they occur at all, vary from 400 to >1200 words. The level of frustration with some of these graded readers is the same as for texts written for native speakers. I oscillate between A1, A2, native texts and back again. The original memoirs of Céleste de Chabrillan are as easy and more exciting than many A2 texts.

CEFR is designed, as far as I can tell, to describe a person’s practical skill in a language, and for that it is useful. However, the jumps between levels are quite large, so that the defined levels are not very useful for the learner themselves. Some publishers solve this by dividing up levels. ELI uses A0, A1, A1.1. The Danish Teen Readers/Easy Readers also divide up the levels, and still appear to quote target vocabulary sizes. Indie publishers tend to ignore vocabulary size in their writing. However, writers and publishers should remember that:

  1. Extensive reading is at its best if learners are reading at a comfortable level while not being familiar with all vocabulary. Ideally learners should know 98% of the words in text they are reading.
  2. Readability of text largely consists of grammar and vocabulary components.
  3. The more readable AND interesting reading material is, the more learners will read, the better their vocabularies will become, and the better their skill in a language will be.
  4. Publishing vocabulary levels required for 95-98% coverage of the text will assist learners in finding materials of the right level for them at any point. Vocabulary levels should be (loosely) based on general word frequency.

This is why I write my comic books for language learners. This is why I research extensive reading, readability and language acquisition.

Advertisement

Extensive Reading Musings

I’ve been reading some more research on extensive reading and readability lately. One paper showed gains in reading rate, vocabulary and comprehension with students reading about 150K words over 15 weeks at an intermediate level. This was contrasted with another study where learners read ~65K words over 28 weeks and failed to show improvement. I think there is probably a threshold of some kind where you need to read a certain amount per week to improve language skill. The amount probably varies with the level of skill you already have. Someone still improving their knowledge of the most frequent 400 words of the language will not need to read as much to achieve vocabulary gain (assuming appropriate graded readers) as someone reading at the 2000 word level. The study that showed gains had students reading with vocabularies of 800+.

Given the 10K words per week guide, and the typical reading rate in foreign languages often being around 150 words per minute, that equates to about an hour of reading per week, or 10 minutes a day. That’s not a bad aim for maintaining and hopefully improving your language skills.

Vocabulary Analysis of the Gutenberg Collection

Vocabulary Analysis of the Gutenberg Collection

I found this page when looking for things on vocabulary density – something of relevance for reading books designed for language learners.  The guy who wrote it is also interesting, in that he has a non-traditional career path into academia.

He shows his analysis is of ~2000 Gutenberg texts based on vocabulary – the kind of thing I like to muck around with.  It is “unpublished” work, so lacks a few things, like references and axis labels, making it less useful than it otherwise might be.