fst_rm_stop_punct {finnsurveytext} | R Documentation |
Removes stopwords and punctuation from a dataframe containing Finnish survey text data which is already in CoNLL-U format.
fst_rm_stop_punct(data, stopword_list = "nltk")
data | A dataframe of Finnish text in CoNLL-U format.
stopword_list | A valid Finnish stopword list; the default is "nltk".
A dataframe of Finnish text in CoNLL-U format without stopwords and punctuation.
fst_rm_stop_punct(conllu_dev_q11_3)
fst_rm_stop_punct(conllu_dev_q11_1, stopword_list = "snowball")
fst_rm_stop_punct(conllu_cb_bullying, "stopwords-iso")
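To illustrate what removing stopwords and punctuation from a CoNLL-U style dataframe amounts to, here is a minimal base-R sketch. It is not the package's implementation: the toy dataframe, the three-word stopword vector, and the choice to match stopwords against the `lemma` column are all assumptions made for illustration. The only thing taken from the CoNLL-U convention is that punctuation tokens carry the `upos` tag "PUNCT".

```r
# Hypothetical toy data with three columns commonly found in a CoNLL-U dataframe.
toy <- data.frame(
  token = c("Koulu", "on", "iso", "ja", "uusi", "."),
  lemma = c("koulu", "olla", "iso", "ja", "uusi", "."),
  upos  = c("NOUN", "AUX", "ADJ", "CCONJ", "ADJ", "PUNCT"),
  stringsAsFactors = FALSE
)

# Hypothetical hand-picked stopwords (NOT the "nltk" list).
toy_stopwords <- c("olla", "ja", "se")

# Keep rows that are neither punctuation nor a stopword (matched on lemma,
# which is an assumption of this sketch).
keep <- toy$upos != "PUNCT" & !(toy$lemma %in% toy_stopwords)
cleaned <- toy[keep, ]

print(cleaned$lemma)
```

In this sketch the rows for "on" (lemma "olla"), "ja", and "." are dropped, and the remaining rows keep their original columns, so the result is still shaped like the input.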