el corpus del espaņol

Corpus size
Compare to other corpora
   Larger corpora
Related resources
Contact us


English Espaņol

Created by Mark Davies, BYU. Funded by the US National Endowment for the Humanities (2001-2002, 2015-2017).

  Corpus Size Created More info
1 Genre / Historical 100 million words 2001 Info
2 Web / Dialects 2 billion words 2016 Info
3 NOW (2012 - 2019) 5.5 billion words 2018 Info
4 Google Books n-grams (BYU) 45 billion words 2011 Info
5 WordAndPhrase Top 40,000 words 2017 Info

This is the "original" Corpus del Espaņol (2001), but with a new interface (2016).

The corpus contains 100 million words of data from the 1200s-1900s, and it can be used to look at the history of Spanish. For the 1900s, it is equally divided between spoken, fiction, newspaper, and academic texts, which means that you can use it to compare genres of Spanish.

The new interface for the corpus was released in Summer 2016. It allows you to create "virtual corpora" (e.g. a particular set of authors, sources, or topics). It also has a much "cleaner" design, better help files, and it works great on mobile devices.

(Click here to use the older interface from 2008).