Help for package bacenR

Title:

Access Data from Brazilian Central Bank: IFdata, Active Institutions, Balance Sheets and Normative Acts

Version:

0.4.4

Description:

Provides functions to query, retrieve, and tidy economic and financial data from Brazilian Central Bank web services for use in R analyses and workflows. Active institutions information, balance sheets and normative acts.

License:

GPL (≥ 3)

URL:

https://github.com/rtheodoro/bacenR

BugReports:

https://github.com/rtheodoro/bacenR/issues

Depends:

R (≥ 4.1.0)

Imports:

data.table, dplyr, fs, glue, httr, janitor, jsonlite, lifecycle, purrr, readr, readxl, rlang, stringr, tibble, tidyr, tools, utils, xml2

Suggests:

knitr, rmarkdown, testthat (≥ 3.0.0)

VignetteBuilder:

knitr

Config/testthat/edition:

Encoding:

UTF-8

SystemRequirements:

qpdf

Config/roxygen2/version:

8.0.0

NeedsCompilation:

Packaged:

2026-07-03 12:31:50 UTC; charizard

Author:

Ricardo Theodoro

[aut, cre]

Maintainer:

Ricardo Theodoro <rtheodoro@usp.br>

Repository:

CRAN

Date/Publication:

2026-07-03 12:50:02 UTC

bacenR: Access Data from Brazilian Central Bank: IFdata, Active Institutions, Balance Sheets and Normative Acts

Description

Author(s)

Maintainer: Ricardo Theodoro rtheodoro@usp.br (ORCID)

Authors:

Ricardo Theodoro rtheodoro@usp.br (ORCID)

Download balance sheets from Brazilian Central Bank

Description

Downloads monthly "balancetes" (.CSV or .ZIP) released by the Brazilian Central Bank for one or more institution types and stores them in a local directory. ZIP files found in the destination directory are also extracted (errors during individual extractions are ignored).

Check the function tidy_balance_sheets() for tools to read and process

Usage

get_balance_sheets(
  institution,
  months = 12,
  first_year,
  final_year,
  out_dir,
  overwrite = FALSE
)

Arguments

institution

Character vector of institution types to download. Accepted values (case-insensitive): "BANCOS", "COOPERATIVAS", "CONSORCIOS" (Administradoras de consórcios), "CONGLOMERADOS" (Conglomerados financeiros), "SOCIEDADES", "BLOPRUDENCIAL" (Conglomerados Prudenciais), "COMBINADOS" (Combinados Cooperativos), "LIQUIDACAO" (Instituições em Regime Especial). The function maps these tokens to the type names used in the BCB URL.

months

Integer or integer vector specifying months to download. Values should be 1..12 (single values or vectors like c(6, 12)). When a month single-digit is provided it is zero-padded to two digits in filenames/URLs.

first_year

Integer, first year to download (inclusive). Defaults to 1993 (the oldest available balancete in the source).

final_year

Integer, last year to download (inclusive). Defaults to the previous calendar year.

out_dir

Character, directory where downloaded ZIP files (and extracted CSVs) will be written. The directory is created if it does not exist.

overwrite

Logical, whether to overwrite existing files in out_dir (default FALSE). If FALSE and a file already exists at the destination it is treated as a successful local result.

Details

For each combination of year, month and institution type the function constructs a URL of the form: ⁠https://www.bcb.gov.br/content/estabilidadefinanceira/cosif/{tipo}/{YYYY}{MM}{institution}.CSV.ZIP⁠ and attempts to download it with httr::GET. Successful and failed attempts are recorded in the returned tibble. After attempting all downloads the function tries to unzip any .zip files found in out_dir; unzip errors for individual files are caught and ignored so the process continues.

Invalid institution tokens cause an immediate error listing the invalid entries.

Value

A tibble with one row per attempted download and the following columns:

years: Year attempted (integer).
months: Month attempted (integer).
institution: Institution token used (character).
url: The URL attempted (character).
dest: Destination file path for the downloaded ZIP (character).
success: Logical indicating if the download (or existing-file fallback) was considered successful.
http_status: HTTP status code returned by the request (integer NA if not available).
error: Character string with an error message when success is FALSE, or a short note when a local file existed and was not overwritten.

References

Source Banco Central do Brasil (Bacen): https://www.bcb.gov.br/estabilidadefinanceira/balancetesbalancospatrimoniais Note: The values are not changed or processed in any way, they are downloaded as-is from the source.

Examples

# Download balance sheets of credit unions for December of 2023
get_balance_sheets(
  institution = "COOPERATIVAS",
  months = 12,
  first_year = 2023,
  final_year = 2023,
  out_dir = tempdir(),
  overwrite = FALSE
)


# Download balance sheets of credit unions for December of 2023
get_balance_sheets(
  institution = c("BANCOS", "COOPERATIVAS"),
  months = c(6, 12),
  first_year = 2022,
  final_year = 2023,
  out_dir = tempdir(),
  overwrite = FALSE
)

Download institution registry data from IFdata of Brazilian Central Bank (Bacen) Cadastro

Description

Download institution registry data from the Brazilian Central Bank (Bacen) IFdata API for specified years and months. The function handles multiple combinations of parameters and returns a consolidated data frame with the results.

Usage

get_ifdata_registry(year, month, verbose = TRUE)

Arguments

year

Numeric or vector. Year(s) to download (e.g., 2024 or c(2023, 2024))

month

Numeric or vector. Month(s) to download (1 to 12)

verbose

Logical. If TRUE, prints progress messages (default: TRUE)

Value

Data frame with institution registry data or NULL in case of error

Examples


# Single period
data <- get_ifdata_registry(
 year = 2024,
 month = 12
)
# Multiple years and months (all combinations), take some time to run
data <- get_ifdata_registry(
 year = c(2023, 2024),
 month = c(3, 6, 9, 12)
)

Download institution reports data from IFdata of Brazilian Central Bank (Bacen) Relatorios

Description

Download IFdata reports from the Brazilian Central Bank (Bacen) API for specified years, months, institution types, and report types. The function handles multiple combinations of parameters and returns a consolidated data frame with the results.

Also, check the page to see the available reports for each institution type and date: IFdata.

Usage

get_ifdata_reports(year, month, type_institution, report, verbose = TRUE)

Arguments

year

Numeric or vector. year (ex: 2024 or c(2023, 2024))

month

Numeric or vector. month (ex: 1 to 12, or c(6, 12))

type_institution

Numeric. Type of Institution:

1 = Conglomerados Prudenciais e Instituicoes Independentes
2 = Conglomerados Financeiros e Instituicoes Independentes
3 = Instituicoes Individuais
4 = Instituicoes com Operacoes de Cambio

report

Numeric. Report type:

1 = Resumo
2 = Ativo
3 = Passivo
4 = Demonstracao de Resultado
5 = Informacoes de Capital
6 = Segmentacao
7 = Carteira de Credito Ativa - Por indexador
8 = Carteira de Credito ativa - por nivel de risco da operacao
9 = Carteira de Credito ativa - por regiao geografica
10 = Carteira de Credito ativa - quantidade de clientes e de operacoes
11 = Carteira de Credito ativa Pessoa Fisica - modalidade e prazo de vencimento
12 = Carteira de Credito ativa Pessoa Juridica - por atividade economica (CNAE)
13 = Carteira de Credito ativa Pessoa Juridica - modalidade e prazo de vencimento
14 = Carteira de Credito ativa Pessoa Juridica - por porte do tomador
15 = Movimentacao de Cambio no Trimestre
16 = Carteira de Credito ativa - por carteiras de instrumentos financeiros

verbose

Logical. If TRUE, print progress messages (default: TRUE)

Value

Data frame with IFdata values or NULL in the case of errors.

Examples


# Unique institution type for a specific period
data <- get_ifdata_reports(
 year = 2024,
 month = 12,
 report = 1,
 type_institution = 2
)

# Multiple periods, take some time to run
data <- get_ifdata_reports(
  year = c(2023, 2024),
  month = c(6, 12),
  report = 1,
  type_institution = 2
)

Download and Process Brazilian Financial Institutions Data from Bacen

Description

This function downloads financial institutions data from the Brazilian Central Bank (Bacen) website for specified institution types and date ranges. The data is downloaded as ZIP files, extracted, and can be optionally cleaned up.

Usage

get_institutions(
  institution,
  start_date,
  end_date,
  out_dir,
  cleanup_zip = TRUE,
  verbose = TRUE
)

Arguments

institution

Character vector. Type(s) of financial institutions to download. Valid options are:

"CONGLOMERADOS"
"BANCOS"
"COOPERATIVAS"
"CONSORCIO"
"SOCIEDADES"

Default is "COOPERATIVAS". Case-insensitive. Check the details on Bacen's website.

start_date

Character. Start date in "YYYYMM" format (e.g., "200709") or a parsable date string (e.g., "2007-09-01"). Default is "200709".

end_date

Character. End date in "YYYYMM" format (e.g., "202409") or a parsable date string (e.g., "2024-09-01"). Default is "202409".

out_dir

Character. Directory path where downloaded files will be saved. Default is "data". The directory will be created if it doesn't exist.

cleanup_zip

Logical. If TRUE, removes ZIP files after extraction. Default is TRUE.

verbose

Logical. If TRUE, displays progress messages and warnings. Default is TRUE.

Details

The function performs the following steps:

Validates institution types against known valid options
Generates a sequence of months between start_date and end_date
Downloads ZIP files for each institution and month from BCB website
Extracts the downloaded ZIP files to the output directory
Optionally removes ZIP files after extraction
Displays progress information if verbose = TRUE

Institution type mappings:

CONGLOMERADOS: Conglomerados
BANCOS: Bancos comerciais, múltiplos e caixa
COOPERATIVAS: Cooperativas de crédito
CONSORCIO: Consórcios administrativos
SOCIEDADES: Bancos de investimentos, desenvolvimento e sociedades corretoras

Value

Invisible NULL. The function is called for its side effects (downloading and extracting files to the specified directory).

Examples

# Download cooperative credit unions data for 2023
get_institutions(
  institution = "COOPERATIVAS",
  start_date = "202311",
  end_date = "202312",
  out_dir = tempdir()
)

# Download multiple institution types
get_institutions(
  institution = c("BANCOS", "COOPERATIVAS"),
  start_date = "202201",
  end_date = "202212",
  out_dir = tempdir()
)

# Skip downloading, just use existing files
get_institutions(
  institution = "BANCOS",
  start_date = "202201",
  end_date = "202212",
  out_dir = tempdir(),
  verbose = FALSE
)

Download normative acts from the Brazilian Central Bank (Bacen) by terms and date range

Description

Queries the Bacen normative search API, collects all results within the provided date range and returns a data.frame with the records. Data are downloaded from https://www.bcb.gov.br/estabilidadefinanceira/buscanormas.

Usage

get_normative_data(terms, ini_date, end_date)

Arguments

terms

Character vector. Search terms; they will be concatenated with " OR " and URL-encoded.

ini_date

Character. Start date in "YYYY-MM-DD" format.

end_date

Character. End date in "YYYY-MM-DD" format.

Details

Builds the query URL using the terms and the date range.
Retrieves the total number of results to determine the page size.
Iterates over result pages (incrementing by 500) until a page returns fewer than 13 rows (stop condition).
Performs simple cleaning on returned fields:
- removes "string;#" prefixes,
- removes HTML tags in summaries and subjects,
- removes the decimal part of normative numbers.

Value

data.frame containing the records returned by the API and the post-processed columns. Returns an empty data.frame if the server is unavailable or no internet connection.

Examples

## Not run: 
ini_date <- "2025-01-01"
end_date <- "2025-01-05"
terms <- c("Cooperativas", "Cooperativa")
normas <- get_normative_data(terms, ini_date, end_date)

## End(Not run)

Download normative texts from the Brazilian Central Bank (Bacen) from a reference data.frame

Description

For each row of the input data.frame, constructs the appropriate URL for the Bacen normatives API, performs the request, extracts the returned content and applies basic cleaning to the text fields (Assunto and Texto). Returns a data.frame with all aggregated contents.

Usage

get_normative_txt(normative_data)

Arguments

normative_data

data.frame. A data.frame with at least the columns:

TipodoNormativoOWSCHCS: type of the normative (e.g., "Comunicado", "Ato de Diretor", etc.)
NumeroOWSNMBR: number/identifier of the normative to be queried.
Typically the result of a prior query (e.g., via get_normative_data).

Details

The function performs the following steps:

Iterates over each row of normative_data (prints the iteration index for monitoring).
Encodes the normative type for URL usage (using URLencode(..., reserved = TRUE)).
Chooses the API route depending on the type:
- for "Comunicado", "Ato de Diretor" and "Ato do Presidente" it uses the other-norms endpoint (exibeoutrasnormas);
- for the rest, it uses the standard normative endpoint (exibenormativo).
Makes the HTTP request (via httr::GET) and converts the returned JSON into a data.frame.
Extracts and cleans textual fields:
- converts HTML to plain text for the fields Assunto and Texto (using xml2::read_html + xml2::xml_text),
- removes line breaks and extra spaces (stringr::str_replace_all, stringr::str_squish).
Aggregates each result into a final data.frame returned by the function.

Value

data.frame with the records returned by the API for each reference in normative_data, including the extracted and post-processed fields (e.g., Assunto, Texto).

Note

The function assumes the BCB API returns a conteudo field with the expected fields.
Request errors or changes in the API structure may cause failures; consider handling exceptions around the call if needed.
Packages used internally: httr, jsonlite, xml2, stringr, dplyr, glue, purrr.

Examples


# First, download normative data
normative_data <- get_normative_data(
   "Cooperativa",
    "2023-08-01",
    "2023-08-10")
# Then, download the full texts for the retrieved normatives
normative_txt <- get_normative_txt(normative_data)

Import balancete CSV files with flexible header detection

Description

Imports CSV files from BCB by detecting the header line that contains the tokens "DOCUMENTO", "CONTA", and "SALDO" (case-insensitive). Automatically renames columns and removes unnecessary fields.

This function is intended for internal use by tidy_balance_sheets()

Usage

import_balance_sheets(file_paths)

Arguments

file_paths

Character vector of file paths to import.

Details

For each file:

Reads line-by-line searching for a header containing DOCUMENTO, CONTA, and SALDO
Auto-detects delimiter (⁠;⁠, ⁠,⁠, or whitespace)
Imports using data.table::fread for speed and robustness
Renames: DATA_BASE/DATA BASE/#DATA_BASE -> DATA, NOME INSTITUICAO -> nome_instituicao
Removes columns: ATRIBUTO, NOME CONTA, AGENCIA, COD_CONGL, NOME_CONGL, TAXONOMIA, NOME_CONTA
Returns combined data.frame from all files

Value

A data.frame combining all imported files, or NULL if no valid files.

Process and reshape balances CSV exports from the Brazilian Central Bank (Bacen) downloaded via `get_balance_sheets()`.

Description

tidy_balance_sheets reads raw CSV export files produced by the BCB and downloaded via get_balance_sheets(), normalizes filenames, locates the header row, imports rows matching a specific document number, cleans column names, pivots account balances so that each account becomes a column and each row corresponds to an institution/date, and optionally writes per-institution-type CSVs.

Usage

tidy_balance_sheets(path_raw, out_dir, doc_filter = 4010, save = TRUE)

Arguments

path_raw

Character scalar. Directory containing the raw CSV files. Filenames are expected to start with a YYYYMM prefix (e.g. "202012COOPERATIVAS.CSV"). The function will look for files matching "\.CSV$" (case-insensitive).

out_dir

Character scalar. Directory where output CSVs will be saved when save = TRUE. If it does not exist it will be created.

doc_filter

Numeric or integer scalar. The numeric value used to filter rows in the imported tables by the column DOCUMENTO (e.g. 4010). Other options are: 4010, 4413, 4423, 4433, 4060, 4016 and 4066. Check the webpage for more details: BCB balance page

save

Logical scalar. If TRUE (default) the processed data frames are also written to disk as CSV files (one per institution type). If FALSE the function only returns the processed data.frames.

Details

Behavior and implementation notes:

Filenames are first normalized by removing all underscores. If a target filename after normalization already exists, the function aborts to avoid overwriting files.
The function extracts year/month and an institution type token from the file basename by assuming the first six characters are YYYYMM and the remaining characters (without the ".CSV") represent the type.
For each file the function reads the file as text and attempts to locate the header row by scanning lines until it finds a row that contains (case-insensitive) tokens matching:
- "DOCUMENTO" (or "DOCUMENTOS"),
- "CNPJ",
- and "DATA BASE" (variants matched by regex such as "DATA_BASE" or "#DATA_BASE"). The delimiter for that header line is heuristically detected among semicolon (";"), comma (","), or whitespace. For whitespace-delimited data the function uses read.table's default behavior.
After locating the header, the table is read with utils::read.table using header = TRUE, the detected separator, latin1 encoding, and check.names = FALSE. Rows where the DOCUMENTO column equals doc_filter are kept.
Column names are cleaned via janitor::clean_names() before the wide pivot. The function expects at least the following cleaned column names to be present: conta, saldo, and data. The pivot uses the id columns (data, documento, cnpj, nome_instituicao), conta as names and saldo as values.
If save = TRUE each resulting data.frame is written to CSV using readr::write_csv() to the path returned by output_filename(type_inst) (wrapped with out_dir).
The function will stop with informative errors in several situations: missing CSVs in path_raw, failure to find a header line with the required tokens, missing required columns after import/cleaning, or failure to rename files when removing underscores.

Value

A named list of data.frames (one element per unique institution type detected in the filenames). Each data.frame has one row per unique combination of (data, documento, cnpj, nome_instituicao) and one column per account (the "conta" values) with balances populated from the "saldo" column.

Note

The function prints a message reminding that the numeric values in the balance sheets are not adjusted for inflation. See: https://www.bcb.gov.br/estabilidadefinanceira/balancetesbalancospatrimoniais

Examples

# First, download balance sheets
get_balance_sheets(
  institution = "BANCOS",
  months = 12,
  first_year = 2023,
  final_year = 2023,
  out_dir = tempdir(),
  overwrite = FALSE
)

# Now, tidy the files
# Do not write outputs
tidy_balance_sheets(
  path_raw = tempdir(),
  out_dir = tempdir(),
  doc_filter = 4010,
  save = FALSE
)

# Write outputs
tidy_balance_sheets(
  path_raw = tempdir(),
  out_dir = tempdir(),
  doc_filter = 4016,
  save = TRUE
)

Tidy ifdata Reports Data

Description

Transforms raw ifdata reports downloaded with get_ifdata_reports() into a clean, wide-format dataset suitable for analysis. This function handles data input from multiple sources (direct data.frame), removes unnecessary columns, renames identifiers, and pivots the data from long to wide format.

Usage

tidy_ifdata_reports(data)

Arguments

data

Optional. A data.frame/tibble. The data must be downloaded via get_ifdata_reports. If a data.frame, it should contain the raw ifdata reports structure with columns: cod_inst, grupo, conta, numero_relatorio, nome_relatorio, tipo_instituicao, descricao_coluna, nome_coluna, and saldo.

Details

The function performs the following operations in sequence:

Data Input Handling: Accepts data in two forms:
- Direct tibble/data.frame (preferred for in-memory operations)
Column Removal: Drops metadata columns that are redundant after pivoting (grupo, conta, numero_relatorio, etc.)
Identifier Rename: Renames cod_inst to cnpj for clarity that rows represent financial institutions by their CNPJ identifier
Pivot Operation: Transforms from long format (each report metric in a separate row) to wide format (each metric becomes a column). The pivot uses nome_coluna for column names and saldo for cell values.
Name Cleaning: Applies janitor's name standardization to ensure consistent, R-friendly column naming

Value

A tibble with the following transformations applied:

Columns removed: grupo, conta, numero_relatorio, nome_relatorio, tipo_instituicao, descricao_coluna
Column renamed: cod_inst → cnpj
Structure pivoted: Long format (multiple rows per institution) → Wide format (one row per institution with columns for each report item)
Column names: Cleaned to lowercase with underscores via clean_names

Examples


# Tidy an existing data.frame
raw_data <- get_ifdata_reports(year = 2024, month = 12, 
                                type_institution = 2, report = 4)
tidy_data <- tidy_ifdata_reports(data = raw_data)

Process and unify institution CSV exports from the Brazilian Central Bank (Bacen) downloaded via `get_institutions()`.

Description

This function reads and processes financial institutions data from the Brazilian Central Bank (Bacen) files that were previously downloaded using the get_institutions() function. It takes a directory path containing the institution files, processes them, and optionally saves the output to a specified directory.

Usage

tidy_institutions(path_dir, out_dir, verbose = TRUE)

Arguments

path_dir

Character string specifying the path to the directory containing institution files to be processed.

out_dir

Character string specifying the path to the directory where processed output files should be saved.

verbose

Logical value indicating whether to print progress messages during processing. Default is typically FALSE.

Value

A list of data.frames, where each element corresponds to a processed institution file. The names of the list elements typically correspond to institution identifiers or file names.

Examples

# First, download institution data
 get_institutions(
  institution = "COOPERATIVAS",
  start_date = "202311",
  end_date = "202312",
  out_dir = tempdir()
)
# Process institution files from a directory
institutions <- tidy_institutions(
  path_dir = tempdir(),
  out_dir = tempdir(),
  verbose = TRUE
)

Package {bacenR}

bacenR: Access Data from Brazilian Central Bank: IFdata, Active Institutions, Balance Sheets and Normative Acts

Description

Author(s)

See Also

Download balance sheets from Brazilian Central Bank

Description

Usage

Arguments

Details

Value

References

Examples

Download institution registry data from IFdata of Brazilian Central Bank (Bacen) Cadastro

Description

Usage

Arguments

Value

Examples

Download institution reports data from IFdata of Brazilian Central Bank (Bacen) Relatorios

Description

Usage

Arguments

Value

Examples

Download and Process Brazilian Financial Institutions Data from Bacen

Description

Usage

Arguments

Details

Value

See Also

Examples

Download normative acts from the Brazilian Central Bank (Bacen) by terms and date range

Description

Usage

Arguments

Details

Value

Examples

Download normative texts from the Brazilian Central Bank (Bacen) from a reference data.frame

Description

Usage

Arguments

Details

Value

Note

Examples

Import balancete CSV files with flexible header detection

Description

Usage

Arguments

Details

Value

Process and reshape balances CSV exports from the Brazilian Central Bank (Bacen) downloaded via get_balance_sheets().

Description

Usage

Arguments

Details

Value

Note

Examples

Tidy ifdata Reports Data

Description

Usage

Arguments

Details

Value

See Also

Examples

Process and unify institution CSV exports from the Brazilian Central Bank (Bacen) downloaded via get_institutions().

Description

Usage

Arguments

Value

Examples

Process and reshape balances CSV exports from the Brazilian Central Bank (Bacen) downloaded via `get_balance_sheets()`.

Process and unify institution CSV exports from the Brazilian Central Bank (Bacen) downloaded via `get_institutions()`.