el corpus del espaņol

el corpus del espaņol


OVERVIEW (PDF) (ES)   English Espaņol 

Created by Mark Davies. Funded by the US National Endowment for the Humanities (2001-2002, 2015-2017).

    Corpus Size Created
1 Info Genre / Historical 100 million words 2001
2 Info Web / Dialects * 2 billion words 2016
3 Info NOW (2012 - 2019) 7.3 billion words 2018
4 Info Google Books n-grams 45 billion words 2011

(Now part of #2) WordAndPhrase-Spanish allows you to search and browse through the top 40,000 words in Spanish (based on frequency in the corpus). For each word, you can see detailed information (all on one page) -- definition, synonyms, frequency by genre, frequency by country, collocates (nearby words, which give great insight into meaning and usage), topics (co-occurring words on the same web pages), and 200 sample concordance lines (to see the patterns in which it occurs) -- all with useful links from one word to another.

You can also enter and analyze entire texts, such as the content of a web page, or a composition written by a student. It will show you the keywords from the text (based on frequency in the CdE), and you can click on any word in the text to see detailed information, as discussed above. You can also highlight phrases in your text and have it search for related phrases in the Corpus del Espaņol, which helps in getting things to sound "just right".