combined_dataset {ouladFormat} | R Documentation |
Combines multiple OULAD .csv files into one tibble formatted for data analysis, in which each row represents a unique student.
combined_dataset(
  module = c("AAA", "BBB", "CCC", "DDD", "EEE", "FFF", "GGG"),
  presentation = c("2013J", "2014J", "2013B", "2014B", "All"),
  repeat_students = c("remove", "keep"),
  withdrawn_students = c("remove", "keep"),
  demographics = FALSE,
  registration = FALSE,
  VLE = c("omit", "daily", "weekly", "activity", "FSLM", "FSLSM", "OLS", "VARK"),
  VLE_clicks = c("total", "binary", "standardise1", "standardise2", "logarithmic"),
  week_begin = -4,
  week_end = 39,
  assessment = FALSE,
  na.rm = FALSE,
  example_data = FALSE
)
module |
name of the module to be included, either "AAA", "BBB", "CCC", "DDD", "EEE", "FFF" or "GGG". |
presentation |
name of the semester of the module to be included, either "2013J", "2014J", "2013B", "2014B" or "All". |
repeat_students |
indicator of whether students who had previous attempts at the module should be removed, either "remove" or "keep". |
withdrawn_students |
indicator of whether students who had withdrawn from the module should be removed, either "remove" or "keep". |
demographics |
logical. Indicates whether demographic data is included in the outputted data set. |
registration |
logical. Indicates whether registration data is included in the outputted data set. |
VLE |
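indicates whether VLE data is included in the outputted data set (default is "omit") and, if included, in which form: "daily", "weekly", "activity", "FSLM", "FSLSM", "OLS" or "VARK". |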
VLE_clicks |
indicates the format in which the VLE click data is returned, either "total", "binary", "standardise1", "standardise2" or "logarithmic". |
week_begin |
the first semester week of VLE and assessment data to be included in formatted data. Depending on the module presentation, students started to view activities four weeks prior to the initial module start date. Weeks prior to the initial module start are indicated by a negative integer. |
week_end |
the last semester week of VLE and assessment data to be included in the formatted data. Week 39 is the last week material was viewed (and earlier in some module presentations). This parameter is also used to remove withdrawn students. |
assessment |
logical. Indicates whether assessment data is included in the outputted data set. |
na.rm |
logical. Indicates whether NAs should be omitted from the average continuous assessment calculations or treated as zeroes (default). This calculation only includes continuous assessment that was due within the period set by week_begin and week_end. |
example_data |
logical. Indicates whether to run a subset of the VLE data as an example. |
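The interaction between week_begin, week_end and na.rm can be illustrated with a small base R sketch. This is not the package's implementation: the toy data frame, its column names (student, week_due, score) and the inclusive window are assumptions made purely for illustration.

```r
# Toy continuous-assessment scores for two hypothetical students.
# Column names are illustrative assumptions, not those used by ouladFormat.
scores <- data.frame(
  student  = c("s1", "s1", "s1", "s2", "s2", "s2"),
  week_due = c(3, 10, 30, 3, 10, 30),
  score    = c(80, NA, 60, 70, 90, NA)
)

week_begin <- -4
week_end   <- 14

# Keep only assessments due inside the window (assumed inclusive at both ends).
in_window <- scores[scores$week_due >= week_begin &
                      scores$week_due <= week_end, ]

# na.rm = TRUE: missing scores are dropped before averaging.
avg_omit <- tapply(in_window$score, in_window$student, mean, na.rm = TRUE)

# na.rm = FALSE (the documented default): missing scores are treated as zero.
zero_filled <- in_window
zero_filled$score[is.na(zero_filled$score)] <- 0
avg_zero <- tapply(zero_filled$score, zero_filled$student, mean)

print(avg_omit)
print(avg_zero)
```

In this toy case the week 30 assessments fall outside the window and are ignored, and student s1's missing week 10 score lowers the average only when it is counted as zero.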
Returns the inputs specified for whether assessment, demographics, registration and VLE variables are to be included, as well as the other inputs. Also, one tibble (object of class tbl_df) is returned, dataset_combined.
A tibble where each row represents a unique student. Depending on the inputs specified, the tibble includes assessment, demographics, registration and VLE data for each student.
Kuzilek, J., Hlosta, M., & Zdrahal, Z. (2017). Open university learning analytics dataset. Scientific Data, 4, pp. 1–8. https://doi.org/10.1038/sdata.2017.171
For more information on different inputs and variables in the dataset_combined tibble, see:
dataset_assessment() for information on the assessment performance data,
dataset_demographics() for information on the demographics data,
dataset_registration() for information on the registration data,
dataset_VLE_time() for information on the VLE daily or weekly data,
dataset_VLE_activity() for information on the VLE activity data,
VLE_learning_classification() for information on the VLE activities classified under a learning model, and
convert_VLE() for information on transforming the data type of the VLE data.
# Uses a subset of the VLE data set as an example
combined_dataset(module = "AAA", presentation = "2013J",
                 repeat_students = "remove", withdrawn_students = "remove",
                 demographics = TRUE, registration = TRUE,
                 assessment = TRUE, na.rm = FALSE,
                 VLE = "weekly", VLE_clicks = "total",
                 example_data = TRUE)

# Slow to run as it loads the full VLE data set
combined_dataset(module = "BBB", presentation = "2013J",
                 repeat_students = "remove", withdrawn_students = "remove",
                 demographics = TRUE, registration = FALSE,
                 assessment = TRUE, na.rm = FALSE,
                 VLE = "activity", VLE_clicks = "standardise1",
                 week_begin = -4, week_end = 14, example_data = FALSE)