First corpus building function
assemble_corpus(data, stopwords, remove_twitter)
data | The data from which the corpus is drawn with documents in first column |
---|---|
stopwords | Words to exclude from the clustering |
remove_twitter | Whether to remove text associated with Twitter content, useful for when analyzing data from this source (defaults to FALSE) |
... | The metadata columns following the text column |
Puts together the corpus and dfm from the data frame provided