|
Coverage Calculator v.3 4 NEW:: NFL-0 hyper-FAMS May 2026 The percentage of words in a list that appear in a corpus |
This program calculates how many times the words on a list appear in a corpus. A list of the 2,000 most common word families is often said to 'cover' up to 80% of the individual words (tokens) in a general corpus of English - i.e., 80% of the words in the corpus are words from the list. || Treatment of proper nouns is a checkbox option.|| Headword lists can be expanded into family/lemma lists here || List coverage in texts can be calculated here (Demo 7). || Known max of this routine 2024: 13,000 wds in list by ≈ 1 million wds in corpus (test corpora/texts will be reduced by program if needed)RESEARCH: > 1. Nation (2006) 2. Laufer Ravenhorst (2010) 3. Schmitt Jiang Grabe (2011) 4. Schmitt Cobb et al (2015) 5. Laufer (2020) 6. Cobb Laufer (2021)