O Corpus do Português



Created by Mark Davies, BYU. Funded by the US National Endowment for the Humanities (2004, 2015). Part of the BYU collection of corpora.

   Corpus              Size              Created
1  Genre / Historical  45 million words  2004-06
2  Web / Dialects      1 billion words   2015-16

This is the "original" Corpus do Português (2006), but with a new interface (2016).

The corpus contains 45 million words of data from the 1200s to the 1900s, so it can be used to study the history of Portuguese. The texts from the 1900s are evenly divided among spoken, fiction, newspaper, and academic sources, which means that you can also use the corpus to compare genres of Portuguese.

The new interface for the corpus was released in summer 2016. It allows you to create "virtual corpora" (e.g. texts from a particular set of authors, sources, or topics). It also has a much "cleaner" design and better help files, and it works well on mobile devices.

The older (2008) interface is also still available.