spark_read_warc_sample {sparkwarc} | R Documentation
Loads the sample WARC file in Spark
spark_read_warc_sample(sc, filter = "", include = "")
sc | An active Spark connection.
filter | A regular expression used to filter each WARC entry; it is applied efficiently by running native code.
include | A regular expression used to keep only matching lines; it is applied efficiently by running native code.
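A minimal usage sketch follows, assuming a local Spark installation that sparklyr can connect to. The `master = "local"` setting and the `"<html"` pattern are illustrative assumptions, not values taken from this page, and the shape of the returned object is not documented here, so the sketch only prints it.

```r
# Sketch only: assumes sparklyr and sparkwarc are installed and a local
# Spark installation is available.
library(sparklyr)
library(sparkwarc)

# Open a connection to a local Spark instance (assumed setup).
sc <- spark_connect(master = "local")

# Load the bundled sample WARC file with the documented defaults
# (empty strings for both regular expressions).
sample_all <- spark_read_warc_sample(sc, filter = "", include = "")
print(sample_all)

# Hypothetical pattern: keep only lines that match "<html".
sample_html <- spark_read_warc_sample(sc, filter = "", include = "<html")
print(sample_html)

# Close the connection when finished.
spark_disconnect(sc)
```

Both `filter` and `include` default to the empty string in the signature above, so the first call is equivalent to `spark_read_warc_sample(sc)`.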