An analysis of: Brazilian names

analysis
statistics
Author

Vinícius Félix

Published

November 25, 2024

In this analysis, we seek to discover how the Brazilian population’s names were till 2010.

Context

The national institute of geography and statistics (IBGE) provided a dataset with the population’s frequency and first name as part of the 2010 Brazilian Census, this dataset was extracted from Brasil.io.

Biblical Roots

Maria and José are the most common names in Brazil because of their Christian roots and cultural history.

These names represent the country’s largely Catholic faith, which was established during Portuguese colonization, when naming children after biblical figures became customary. Religious celebrations honoring Santa Maria (Saint Mary) and São José (Saint Joseph) increase their significance. Not only that, but other popular names are also biblical, such as: João (John), Paulo (Paul), Pedro (Peter) and others.

Even with this effect is clear how Maria is more common than others names, a reason is that Maria is commonly used as the first name of a compound name, such as Maria Luiza, Maria José, and others.

_maria what?

In Brazil, laws regulate naming children to protect them from embarrassment or ridicule. Civil registries can reject names deemed offensive, overly complex, or difficult to spell or pronounce, while foreign and culturally influenced names are allowed, they should align with Brazilian phonetics.

Even with these restrictions, there is still a lot of room for creativity, thus adding maria as a component of the name is an option; this is not included in the Maria frequency analysis above, but still shows the impact of the name Maria.

How “high” can you go?

When looking at the distribution of the number of letters in a name, we see that we have names with 3 from 14 letters, to give the most popular examples from this extremes, we have:

  • 3 letters, for example: Ana, Eva, Ivo, Eli and Ari.

  • 14 letters, for example: Cristianderson and Vandercleisson.

Besides this, names with five to seven letters cover almost 70% of names in the population.

Sex dynamics in names

We can see that the majority of the names (almost 90%) are practically connected with names either completely associated to females or males.

Furthermore, only names that are less prevalent have a more evenly distributed male and female population, such as Edir, Darcy, and Tainan.