VP Count Profiler v.1.5 (FIX 2026-3-25 OF PROBLEM lists with words stuck together)
Profile texts by individual word frequencies in a reference corpus
RENAMED SEPT 3 2025: VP-COCA-COUNT PROFILER --> VP COUNT PROFILER
This method of text profiling involves no families, lemmas, or bands, which are arguably costly, artificial, and arbitrary. It matches every word of a text to its number of occurrences in a small 'standard corpus,' namely the most frequent 100-thousand words in the 400-million word COCA (Corpus of Contemporary American, Davies) corpus. To learn more, see (a) Mark Davies' description of the 100k list; (b) further description of this idea on VP-Compleat under 'count index'; or (c) the use of this type of profiler in a study by Crossley, Cobb & McNamara (2013).