Home > Coverage
 Coverage Calculator v.2 +FRENCH       
    The percentage of words in a corpus that are covered by the words in a list
This program calculates how many times the words on a list appear in a corpus. A list of the 2000 most common word families is often said to 'cover' up to 80% of the individual words in a general corpus of English - i.e., 80% of the words in the corpus will be words from that list. || Treatment of proper nouns is a checkbox option.|| Headword lists can be expanded into family/lemma lists here || List coverage in texts can be calculated here (Demo 7). || Known max of this routine mid-2020: 13k wds in list x 2.5m wds in corpus

 

RESEARCH: >   1. Nation (2006) 2. Laufer Ravenhorst (2010) 3. Schmitt Jiang Grabe (2011) 4. Schmitt Cobb et al (2015)   5. Cobb Laufer (2021)  


DEMO LISTS

AWL Heads | Fams

BNC-1k Fams | Lems

NGSL Lems 1k | 1-2k

BN-Coca Fams
1k | 2k | 3k
1-2k | 1-3k

Nuclear (Eng)   [?]
nfl-0 nfl-1 nfl-2
nfl-3 nfl-4 nfl-5 nfl-6
Fams@7%
1k | 2k | 3k
1-2k | 1-3k

Nuclear (Fr) FAMS @ nfl-x
(members > x% of family)

fr_nfl-0
fr_nfl-1
fr_nfl-2
fr_nfl-3
fr_nfl-4
fr_nfl-5
fr_nfl-6
fr_nfl-7
fr_nfl-8
fr_nfl-9
fr_nfl-10
(1) Click or paste LIST/name

 

  (2) Choose Corpus
      (Fr at bottom)

 

(3) Handle
Propers 
Subtract
from corpus
Add
to list

(4) Click
   

(5) See result