dataset: Create Data Frames for Exchange and Reuse

The 'dataset' package helps create semantically rich, machine-readable, and interoperable datasets in R. It extends tidy data frames with metadata that preserves meaning, improves interoperability, and makes datasets easier to publish, exchange, and reuse in line with ISO and W3C standards.

Version: 0.4.0
Depends: R (≥ 3.5)
Imports: assertthat, haven, ISOcodes, labelled, pillar, tibble, utils, vctrs
Suggests: dplyr, jsonld, knitr, rdflib, rmarkdown, spelling, tidyr, testthat (≥ 3.0.0)
Published: 2025-08-26
Author: Daniel Antal ORCID iD [aut, cre], Marcelo Perlin ORCID iD [rev], Anna Márta Mester ORCID iD [rev], Mauro Lepore ORCID iD [rev]
Maintainer: Daniel Antal <daniel.antal at dataobservatory.eu>
BugReports: https://github.com/dataobservatory-eu/dataset/issues/
License: GPL (≥ 3)
URL: https://dataset.dataobservatory.eu/
NeedsCompilation: no
Language: en-GB
Citation: dataset citation info
Materials: README, NEWS
CRAN checks: dataset results

Documentation:

Reference manual: dataset.html , dataset.pdf
Vignettes: Modernising Citation Metadata in R: Introducing 'bibrecord' (source, R code)
dataset_df: Create Datasets that are Easy to Share Exchange and Extend (source, R code)
defined: Semantically Enriched Vectors (source, R code)
Design Principles & Future Work Semantically Enriched, Standards-Aligned Datasets in R (source, R code)
Example Dataset Definitions (source)
An Introduction to the dataset Package (source, R code)
From R to RDF (source, R code)

Downloads:

Package source: dataset_0.4.0.tar.gz
Windows binaries: r-devel: dataset_0.3.9.zip, r-release: dataset_0.3.9.zip, r-oldrel: dataset_0.3.9.zip
macOS binaries: r-release (arm64): dataset_0.4.0.tgz, r-oldrel (arm64): dataset_0.4.0.tgz, r-release (x86_64): dataset_0.4.0.tgz, r-oldrel (x86_64): dataset_0.4.0.tgz
Old sources: dataset archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=dataset to link to this page.