All posts by Sandra Bogerd

Unknown's avatar

About Sandra Bogerd

Music: composer, singer-songwriter, recording artist, conductor. Language: author and illustrator (A. L. Uitdenbogerd) of the Gnomeville comic book series for absolute beginners in French. Work: Chief Scientist of Ad Hoc Software Pty Ltd trading as Agile Electronics. Academia: RMIT honorary fellow with computer science research interests in music, language, and usability.

Thoughts on Up Goer Five and Constrained Vocabulary Writing

When I first saw the Up Goer Five comic by xkcd, I loved it.  It epitomised what I do with my comic book and my research, and is a convenient example to show people, when explaining the idea of constrained vocabulary writing.

Fans figured out that the 1,000 words used by xkcd for it were the contemporary fiction list, shown in Wiktionary.  This frequency list is based on over 9 million words of on-line contemporary fiction.  It combines plurals and simple verb forms into one listed word (lemmas), which is a good choice, since if the root word is known, then the plurals with s, and simple verb forms are usually also understood.

As someone who writes using lists generated based on frequency, I’ve noticed that several problems arise.  One is that, typically, male pronouns and nouns occur at higher frequencies than female ones.  The Wiktionary list is not overly biased in this way, possibly because it is based on contemporary fiction.  “he” is ranked at 8, “her” and “she” at 12 and 13 respectively, and “his” at 16.  However, we find “man” at 163 and “woman” at 452, but “girl” is at 133 and “boy” at 217.  This hints at what has been termed the systemic “infantilization” of women in society.  The figures are probably quite different due to the common pairing of “guy” (at 178) with “girl” in colloquial speech.  Google’s auto-suggest, which is also based on frequency, has occasionally come up with phrases that are considered racist, sexist or otherwise problematic – and it is purely a reflection of what we as a society tend to write.  When writing in a principled manner for language learners, it may be important to balance what word frequency lists tell us, with what is a more equitable representation.  I didn’t really think very much about this when I started writing Gnomeville years ago, but have become more aware of these issues thanks to some of my friends who are more knowledgeable in them.

Another issue that needs to be considered is what is culturally appropriate to write for the target audience.  For example, I have recently been made aware that it is inappropriate to use words referring to alcoholic beverages when the audience is Islamic.  Obviously for work intended for children (or for experimental subjects) it is customary to exclude expletives.  For this reason, several words on the list would need to be excluded.  There seems to be an expressive set of expletives in the list.

For the method of writing I employ in the Gnomeville story, I  introduce one new high frequency word per page of story, and somewhat less frequently I introduce a grammatical pattern.  Sometimes I’ve changed the order in which I add words due to the story.  This happened in episode one, in which I introduced “se” very early instead of after about a dozen other words.  Also, I recall that “le” was added before “de”, even though their ranks are reversed.  Having said that, my first 20 words were based on a corpus of newspaper articles.  Every corpus gives a different ranking of words.  There are some similarities across corpora however.  For example, if the corpus is large enough, the frequency of the word “the” is likely to be about 7% for English text.

Anyway, back to Up Goer Five.  The upcoming book “Thing Explainer”, as well as the text uploaded to the up goer five text editor provide some good practice at reading for people still consolidating their first 1000 words of the English language.  If going beyond that, the writing should have less than 5% of words outside the vocabulary set to be suitable for improving language skill while fluently reading for comprehension.  A text editor with more flexibility is the OGTE Editor, designed for writing English text for different language learner levels.

An extract of my French comic book is now available

I have finally produced an extract of the comic book for people to look at.  It contains 12 of the 28 pages, with images reduced to readable low resolution.

The extract contains all the text that explains the rationale for the approach, as well as showing a summary of the language covered in the first episode.  There are 3 pages of the Gnomeville story in the extract.  The first two show how the story begins with no prior French knowledge, and how the language is introduced.  The third page shows how the text increases in complexity and length later in the story, with a very short word definition on the page, so that the person reading is not slowed down too much in their reading in French.

Note that the extract doesn’t show the true page format, as it is an ordinary A4 pdf file, whereas normally the pages are processed into book form, re-numbered appropriately and trimmed to size.  The story pages are colour right to the edge of the paper in the physical copies.

Children’s books

While searching for books that are simple enough for me to read in Japanese, I’ve been musing about the attributes that make these suitable for language learners.
If starting from scratch you need repetitious text with obviously illustrated nouns. This describes some of the books I purchased. The downside of the simplest books is that they become an illustrated list of nouns (or adjectives like colours), and therefore have no narrative.
In Japanese you have the added complication of the writing system. Beginner books use hiragana only. Then there are some that have katakana with hiragana transliterations. Then there are a few that use both the alphabets without guides. At the next level kanji are included with hiragana guides. The level of support for kanji varies.

Recent language learning experiences.

Recently I sat my French B1 exam. In preparation I had conversations with colleagues and friends in French over a few drinks, switched my various gadgets and accounts to French, read books in French, did a few exercises from textbooks, revised a few points of grammar, tried duolingo, and did some practice exams. Looking at my results, my main weakness is speaking. My French colleague says that my main problem is being hesitant. Add a few drinks and I’m more fluent.
Currently my obsession is Japanese, while I’m in Japan for a conference. I’m getting a bit better at katakana and kanji thanks to repeated efforts at reading signs.

Constrained Writing

In 1996 I first heard about a book written without the letter E (Gadsby, by Wright, published 1939). Then a couple of years later I met a French colleague and was telling him about my comic book in French that exclusively uses French-English cognates and one new French word per page. It reminded him of constrained writing, particularly “lipograms”, and he introduced me to the work of Georges Perec, who wrote various works with or without certain vowels. We exchanged DNA poetry. More recently I dabbled in pilish, adding the constraint of writing in haiku verses.

A recent blog post about OULIPO reminded me about my fascination with such things.

The experience of writing my comic in French is quite different to my dabblings in German and Dutch, due to the differences in cognates (similar looking words with similar meaning) in the different languages.  In French it is hard to generate much text initially, but there is soon an abundance of identically spelt nouns, adjectives and verbs (albeit with slightly different endings).  In Dutch and to a lesser extent in German it is possible to write 20-odd words of meaningful text entirely using exact cognates.  But eventually you hit a wall where there are not many verbs to work with.  I’m still figuring out how to get past that wall before I commit to drawing the (publishable) artwork for and publishing a first episode in those languages.

 

Mots Croisés (Crosswords)

crosswordpic

I was recently in Perth for a couple of weeks, so made the most of the opportunity to visit le forum in Fremantle.  It is a lovely little French bookstore in an almost abandoned small shopping mall at the edge of the shopping area of the city.

I came away with a large bundle of books, mostly for reading as a foreign language, but also a couple of crossword books for children.  People may have the impression that crossword books for 8-year olds would be easy for language learners, but that is not the case.  Children seem to be exposed to and know many more nouns than the typical language learner.  I worked my way through a few crosswords in my new Mots Fléchés book for 8-year olds and it revealed the huge gaps in my vocabulary.  Each crossword had a particular theme.  I did ok on common animals (lion, tigre, zebre, léopard, éléphant), European cities and words about Asia (sumo, sushi, sari, panda), but completely failed on words about the snow, medicine or the kitchen.

My other purchase, “Jeux de mots” for 8+, was easier, due to the dense French-style crossword grids that have clues like “the first and 4th letters of the alphabet”, “double vowel”,  “the first two letters of italien”,  “the second person singular of the present tense of avoir”.  (I’ve translated the clues here.)  These provide lots of hints for the other words of the puzzle.  There were also many more core vocabulary words like man, woman, place, with, pretty etc.

I’ve done other crosswords in French or for French in the past.  One book starts with small crossword triangles with simple words up to 3 letters long and 3 clues in total, and works its way up to 12 by 12 grids and the more difficult verb tenses.  I seem to have lost the original though, so I can’t tell you its name or publisher.  ELI has a book entitled “Jeux faciles en français”, which is for primary school children.  They have a page of vocabulary, such as the numbers from 1 to 10, then a join the word to the item, followed by a crossword and word search.  This pattern repeats for each set of vocabulary.  I remember enjoying this kind of activity as a 5-year-old, so perhaps it works for young kids.

I have a couple of books of vocabulary games including crosswords by Maurie N. Taylor, published by the National Textbook Company.  These are for English-speaking students of French, and are not immersive, but can help cement vocabulary.  However, I think that given the amount of vocabulary and language knowledge assumed in the books, that less English could have been used in the book to give more practice.

I also do crosswords in Dutch sometimes.  The children’s ones are possibly marginally easier for me than the simplest adult ones, but again, the adult ones usually provide more cross-clues, which offsets the slightly more difficult language.

I believe it is possible to create immersive crosswords for use at the earliest stages of language learning – certainly for European languages.  I do this in my comic book.  The crossword uses the episode’s target vocabulary and incidental cognate vocabulary, as well as the sentences in the story that have just been read, to provide reading practice and vocabulary production practice without reverting to English.

All the Drama

I’ve been reading a lot of easy readers in French and German of late, and some of the recent ones are of the Lectures CLE en français facile series.  I’m now at a point where reading the 500-600 word vocab ones are easy enough for the story to be enjoyed.

In the past I’ve not looked forward to reading abridged and adapted versions of classics, either because I wanted to read the original at some point, or just because I found them uninteresting.  However, I’ve changed my mind.  The CLE 500-word vocabulary adaptations of En Famille (by Hector Malot) and La Guerre des Boutons (by Louis Pergaud) were both thoroughly engaging.  I’m now reading Jacquou le croquant by Eugene le Roy, which is a 600-word vocab adaptation.  There seems to be a lot of 19th century French literature about people living in poverty and hard times, and CLE has made it accessible to learners of French.  I find the lengths of the books to be about right too, at about 50 pages of story – probably about 12,000 words all up.

I hope that I can find a similar resource for German. 

Gnomeville Episode 1 is done!

Having given myself a hard deadline of Friday so it would be ready for launching at a concert of French music, it is done!

The “I Can’t Believe I’m Reading French” Comic Book Series, Episode 1: Gnomeville: Dragon!

Gnomeville comic book cover containing head of dragon with smoke billowing out of its mouth and the title "DRAGON!" in large red letters

  • 28-page booklet plus audio CD containing 3 stories in comic book format, a crossword, a song and a language summary.
  • Assumes no prior knowledge of French but a native or near-native English speaking background.
  • Introduces 12 of the twenty most commonly occurring words in French newspapers, one new word per page of the Gnomeville story.
  • Uses the words that are common to French and English to ensure that all words in the stories are familiar, such as “dragon”, “gnome”, “arrive”.
  • The Fido story provides further reading practice, ensuring each target word has been read at least 5 times.
  • The crossword allows you to actively use the language to increase retention
  • The songs provide further practice, including pronunciation
  • The narration provides pronunciation information and listening practice
  • The Taxi story is a nice easy story to read after the other stories.
  • The audio CD contains the stories read by a native speaker, plus two songs related to the stories.
  • $20 plus postage from Melbourne Australia
  • Currently only available from me directly, but stay tuned for updates.

Top French Movies

One of the ways that I pick up foreign language listening skills is watching movies and TV series in the language.  For Italian I took it one step further in that I made bingo sheets of common expressions to encourage listening instead of just reading subtitles.

I will probably do the same for French at some point.  Here is a list from 2011 that I found, of the top 10 French films.

  1. Bienvenue chez les Ch’tis
  2. La Grande vadrouille
  3. Astérix et Obélix : Mission Cléopâtre
  4. Les Visiteurs
  5. Le Petit monde de Don Camillo
  6. Corniaud
  7. Taxi 2
  8. 3 hommes et un couffin
  9. Les Bronzés 3 amis pour la vie

Oops, one missing from the top 10.  Maybe I misunderstood the last paragraph of the article.