
Downloads the full propose-reason texts (jean-iyu) for all 60,925 bills. The file is approximately 40 MB and is cached locally after the first download. Requires the arrow package to read parquet files.

Usage

get_bill_texts(cache_dir = NULL, force_download = FALSE)

Arguments

cache_dir

Directory to cache downloaded files. Defaults to tools::R_user_dir("assemblykor", "cache").

force_download

Logical. If TRUE, re-download even if cached.
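
As a rough illustration of how the two arguments might be combined, the sketch below caches the file in a custom directory and then forces a fresh download. It assumes the function is exported by a package named assemblykor (inferred from the default cache path); `custom_dir` is a hypothetical location chosen for this example. The calls are guarded so that nothing is downloaded unless both packages are installed and the session is interactive.

```r
# Default cache location, as given in the cache_dir description.
default_dir <- tools::R_user_dir("assemblykor", "cache")

# Hypothetical alternative cache location (a temporary folder for this sketch).
custom_dir <- file.path(tempdir(), "assemblykor-cache")
dir.create(custom_dir, recursive = TRUE, showWarnings = FALSE)

# The download is about 40 MB, so only run it on request.
# Assumption: the function lives in the 'assemblykor' namespace.
if (interactive() &&
    requireNamespace("assemblykor", quietly = TRUE) &&
    requireNamespace("arrow", quietly = TRUE)) {
  # First call downloads and caches; later calls should reuse the cached file.
  texts <- assemblykor::get_bill_texts(cache_dir = custom_dir)

  # Re-download even though a cached copy exists.
  texts <- assemblykor::get_bill_texts(cache_dir = custom_dir,
                                       force_download = TRUE)
}
```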

Value

A data frame with 60,925 rows and 3 variables:

bill_id

Bill identifier (links to bills$bill_id)

propose_reason

Full text of the propose-reason statement (Korean)

scrape_status

Data collection status: "ok", "empty", "no_csrf", or "error"
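
A minimal sketch of working with this structure: tabulate `scrape_status`, keep only the rows collected successfully, and join them to bill metadata on `bill_id`. The data frames below are small mock stand-ins, not real data; the `title` column in `bills` and the use of `NA` for rows without text are assumptions made for illustration.

```r
# Mock stand-in for the result of get_bill_texts() (same three columns).
texts <- data.frame(
  bill_id        = c("B1", "B2", "B3", "B4"),
  propose_reason = c("example text A", NA, NA, "example text B"),
  scrape_status  = c("ok", "empty", "error", "ok"),
  stringsAsFactors = FALSE
)

# Mock stand-in for the bills table; 'title' is a hypothetical column.
bills <- data.frame(
  bill_id = c("B1", "B2", "B3", "B4"),
  title   = c("Bill one", "Bill two", "Bill three", "Bill four"),
  stringsAsFactors = FALSE
)

# How many rows fall into each collection status?
status_counts <- table(texts$scrape_status)

# Keep only rows whose text was collected successfully.
ok_texts <- texts[texts$scrape_status == "ok", ]

# Attach the texts to bill metadata via the shared bill_id key.
merged <- merge(bills, ok_texts, by = "bill_id")
```

Filtering on `scrape_status` before analysis keeps rows without usable text out of length or word-count summaries.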

Examples

# Download the texts (or load them from the local cache)
texts <- get_bill_texts()
#> Downloading bill texts (~40 MB)...
#> Cached at: /home/runner/.cache/R/assemblykor/bill_texts.parquet

# Number of characters in each propose-reason text
nchar_dist <- nchar(texts$propose_reason)
hist(nchar_dist, breaks = 100, main = "Length of Propose-Reason Texts")