excelDataGuide is an R package that streamlines reading data from standardized Excel spreadsheet templates into R.
Spreadsheet templates are widely used in laboratories to standardize data recording and reduce errors. However, extracting data from these templates into R typically requires writing custom, template-specific code. This is tedious and error-prone.
The excelDataGuide package eliminates this burden by:
- Defining a data guide — a simple YAML file that describes where data are located in your template and how they should be interpreted
- Reading data with one command — the
read_data()function uses the guide to extract data correctly and automatically
The data guide approach also supports the FAIR principles by making your data structure explicit and machine-readable.
You can install the development version of excelDataGuide from GitHub with:
# install.packages("pak")
pak::pak("SystemsBioinformatics/excelDataGuide")Reading data from an Excel template requires just two files: the template itself and a data guide.
library(excelDataGuide)
# Path to your Excel file
datafile <- system.file("extdata", "example_data.xlsx", package = "excelDataGuide")
# Path to the data guide (YAML file)
guidefile <- system.file("extdata", "example_guide.yml", package = "excelDataGuide")
# Read the data
data <- read_data(datafile, guidefile)The output is a list containing the data organized according to your guide.
For detailed guidance on using this package:
-
Designing templates — Best practices for structuring your Excel templates (version numbers, protected cells, parameter sheets, etc.).
-
Writing data guides — Step-by-step instructions for creating YAML guides, with examples of all four data types (keyvalue, cells, table, platedata) and a complete working example.