Word Prevalence Values
As we collect more data using our online vocabulary tests, we will add word prevalence values for multiple languages and groups of speakers to this page.
Dutch
These are the word prevalence values for 54,319 Dutch words used in the paper: Word Knowledge in the Crowd: Measuring vocabulary size and word prevalence in a massive online experiment.
- In .csv format for Belgium and the Netherlands
- In .Rdata format for Belgium and the Netherlands (for use with R)
- In .xlsx format, all in one
- The Dutch word prevalence measures have been validated in the the Dutch Lexicon Project 2.
English
There are word prevalence values for 61,858 English words. You find them here.
We have the same prevalence measures for speakers of English as a second language.