  VP Count Profiler v.1.5 (fix 2026-03-25: problem with lists whose words were stuck together)
    Profile texts by individual word frequencies in a reference corpus
Renamed Sept 3, 2025: VP-COCA-Count Profiler --> VP Count Profiler
This method of text profiling involves no families, lemmas, or bands, which are arguably costly, artificial, and arbitrary. It matches every word of a text to its number of occurrences in a small 'standard corpus,' namely the 100,000 most frequent words in the 400-million-word COCA (Corpus of Contemporary American English; Davies). To learn more, see (a) Mark Davies' description of the 100k list; (b) the further description of this idea on VP-Compleat under 'count index'; or (c) the use of this type of profiler in a study by Crossley, Cobb & McNamara (2013).
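The matching step described above can be sketched in a few lines of Python. This is only an illustration, not the profiler's actual code: the names `REFERENCE_COUNTS`, `tokenize`, and `profile_in_sequence` are hypothetical, the counts are invented, and the tokenizer (lowercase alphabetic words, with an optional apostrophe part) is an assumption about how a text might be split into words.

```python
import re

# Hypothetical miniature frequency list. A real list would hold about
# 100,000 entries loaded from a file; these counts are made up.
REFERENCE_COUNTS = {"the": 1000, "on": 400, "cat": 50, "sat": 20, "mat": 5}


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())


def profile_in_sequence(text, counts=REFERENCE_COUNTS):
    """Pair each token, in text order, with its reference count.

    Words missing from the reference list get None (offlist).
    """
    return [(word, counts.get(word)) for word in tokenize(text)]


if __name__ == "__main__":
    for word, count in profile_in_sequence("The cat sat on the zyzzyva."):
        print(word, count if count is not None else "offlist")
```

Because each word form is looked up directly, no grouping into families, lemmas, or bands is needed; a form that is absent from the list is simply reported as offlist.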

(1) Enter a text, list, or Demo (NYT, Lit, AWL)

(2)

(3a) All listed words
In sequence (Clear output)      
(3b) Unique words
Most first, then offlist
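
One possible reading of the unique-words output is sketched below in Python. It assumes that 'most first' means highest reference count first and that offlist words are listed afterwards in alphabetical order; both are assumptions, and the function name `unique_most_first` is hypothetical. The input is a list of (word, count) pairs in which a count of None marks an offlist word.

```python
def unique_most_first(pairs):
    """Return (listed, offlist) for the unique words in pairs.

    listed:  (word, count) tuples sorted by count, highest first;
             ties are broken alphabetically.
    offlist: words whose count is None, sorted alphabetically.
    """
    unique = dict(pairs)  # collapses repeated words to one entry each
    listed = sorted(
        ((word, count) for word, count in unique.items() if count is not None),
        key=lambda item: (-item[1], item[0]),
    )
    offlist = sorted(word for word, count in unique.items() if count is None)
    return listed, offlist


if __name__ == "__main__":
    sample = [("the", 1000), ("cat", 50), ("zyzzyva", None), ("the", 1000), ("on", 400)]
    listed, offlist = unique_most_first(sample)
    for word, count in listed:
        print(word, count)
    for word in offlist:
        print(word, "offlist")
```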