N-Gram Extractor (program under review, October 2018)
Identify repeated word or family strings throughout a text                         
RESEARCH:
  1. Review, this routine
  2. Review Alison Wray
  3. Review Nadia Nesselhauf
  4. Cobb (04) Learner Corpus
  5. Erman & Warren (2000)
  6. Cobb (2018) new

INPUT METHOD "A"
SMALL/MEDIUM TEXTS BUT RICHER OUTPUT (concordance lines); max 400,000 chars/60,000 words

 1   Enter the title of your text.      2   Copy or type the text in the space below.

          | Demo 1 (written) | Demo 2 (spoken) | NEW*Demo 3 (Intervening Items + Families)

 3   Choose max string: 2 |3 |4 |5 wds     4   Interveners (2+3 wd)   5   Families   [Inters+fams?]   6  
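The "max string" option above can be illustrated with a minimal sketch of repeated-string extraction. This is not the routine this page runs. It assumes simple lowercase word tokenization and counts contiguous strings only, so it handles neither Families nor Interveners. The function name `repeated_ngrams` and its parameters `max_n` and `min_count` are hypothetical.

```python
import re
from collections import Counter


def repeated_ngrams(text, max_n=5, min_count=2):
    """Map each string length n (2..max_n) to the word strings of that
    length that occur at least min_count times in text."""
    # Assumed tokenization: lowercase runs of letters and apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    results = {}
    for n in range(2, max_n + 1):
        # Slide a window of n words across the text and count each string.
        counts = Counter(
            " ".join(words[i:i + n]) for i in range(len(words) - n + 1)
        )
        # Keep only strings that repeat.
        results[n] = {gram: c for gram, c in counts.items() if c >= min_count}
    return results


if __name__ == "__main__":
    sample = "At the end of the day we met at the end of the road."
    for n, grams in repeated_ngrams(sample, max_n=5).items():
        print(n, grams)
```

Raising `max_n` widens the window and typically returns fewer, longer strings; a tool that supports intervening items would also need to match strings with a variable slot, which this sketch does not attempt.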

INPUT METHOD "B"
TEXT FILE UPLOAD - LARGER TEXTS - MORE COMPLETE OUTPUT (but Interveners are temporarily unavailable in Upload Mode)
    1.   2. Families? Interveners? 3.  Max string       4.  
OR small corpus (to approx. 350,000 wds @ max=6)
>> Families? +Max