book_sections {ascentTraining} | R Documentation |
A mixed up collection of words from different book sections of two books.
book_sections
A tibble with 108,657 observations, each a word on a document. This data set is designed to show how LDA can be used to separate a set of mixed documents into two distinct "topics" (or books).
word
Words from a given section within a book.
document
The book section ID that the word came from.
Data taken from two books of the Gutenberg Project