Xn xx n
On December 17th 2012, I got a nice letter from Mark Mayzner, a retired 85-year-old researcher who studied the frequency of letter combinations in English words in the early 1960s.His 1965 publication has been cited in hundreds of articles.(If you want you can download the word count file; note that it is 1.5 MB.) My distillation of the Google books data gives us 97,565 distinct words, which were mentioned 743,842,922,321 times (37 million times more than in Mayzner's 20,000-mention collection).Each distinct word is called a "type" and each mention is called a "token." To no surprise, the most common word is "the".
The band consists of Klaudia Szafrańska (vocals) and Michał Wasilewski a.k.a Voo (instrumentals).The Sxe'xn'x are a First Nations people of the Lower Nicola Valley in the southern Southern Interior of the Canadian province of British Columbia.They are part of the larger Nlaka'pamux (Thompson) people and closely connected to the Scw'exmx, a major subgroup of the Nlakapamuxtsin-speaking peoples.This sample broke down as follows: three-letter words, 6,807 tokens, 187 types; four-letter words, 5,456 tokens, 641 types; five-letter words, 3,422 tokens, 856 types; six-letter words, 2,264 tokens, 868 types; seven-letter words, 2,051 tokens, 924 types.I then proceeded to construct tables that showed the frequency counts for three, four, five, six, and seven-letter words, but most importantly, broken down by word length and letter position, which had never been done before to my knowledge.