Created by Mark Davies, BYU.
Funded by the US National Endowment for the Humanities
(2001-2002, 2015-2017). Part of the BYU collection of
The new addition to the Corpus del Espaņol (2016) contains nearly two billion words of data in web pages from 21 different Spanish-speaking
countries. This corpus allows you to look at very recent Spanish (the texts were collected 2013-14), and to compare
among the different dialects.
The new corpus is also much larger than the previous corpus -- more than 100
times as large for Modern Spanish (two billion words, compared to just 20
million words from the 1900s in the original corpus). So where you might have
10-12 tokens with the original corpus, you might have 1,000 or more with the new