matrix_via_r {corporaexplorer} | R Documentation |
Create document term matrix for fast search of single words
Description
The characters removed
Usage
matrix_via_r(df, matrix_without_punctuation = TRUE)
Arguments
df |
A "data_dok" tibble |
matrix_without_punctuation |
Should punctuation and digits be stripped
from the text before constructing the document term matrix? If
If |
Value
List: 1) Document term matrix (data.table), 2) word vector (character vector).