Scrape Ontario SIU Director's Reports into a tidy CSV
Source:R/siu_fetch.R
morie_siu_fetch_cases.RdPulls the SIU index, walks every linked case-detail page, and writes a
six-column CSV (case_number, police_service,
incident_iso, notification_iso, decision_iso,
director_decision_text, source_url) into
cache_dir.
Arguments
- years
Integer vector of fiscal years to scrape, or
NULL(default) to scrape the unfiltered index. Years above2023(the latest published as of release) may return empty results.- cache_dir
Output directory. Default
file.path(tempdir(), "morie", "siu"); passmorie_cache_dir("siu")for persistent caching.- overwrite
Logical; if
FALSEandSIU.csvalready exists, its path is returned without re-scraping.- progress
Logical; print a one-line status per index / case fetch when
TRUE(default).
Details
This is the lightweight R-only path. For the full 64-column corpus
use morie_fetch_siu (compiled C++ harvester).