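I have some sentences, and I want to split each one into words so that every sentence becomes a row vector. However, the words of the shorter sentences get repeated (recycled) to match the length of the row for the longest sentence, which I don't want. I want each sentence's row to contain its words only once, no matter how long the sentence is.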
sentence <- c("case sweden", "meeting minutes ht board meeting st march now also attachment added agenda today s board meeting", "draft meeting minutes board meeting final meeting minutes ht board meeting rd april")
sentence <- cbind(sentence)
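# rbind() recycles the shorter word vectors to the length of the longest one,
# which is where the unwanted repetition comes from
word_table <- do.call(rbind, strsplit(as.character(sentence), " "))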
test <- cbind(sentence, word_table)
What I want is the same table, but without the repeated words.
Best answer
Solution from rawr:
sentence <- c("case sweden", "meeting minutes ht board meeting st march now also attachment added agenda today s board meeting", "draft meeting minutes board meeting final meeting minutes ht board meeting rd april")
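# Put one sentence on each line; fill = TRUE fills short rows with blank fields instead of recycling
dd <- read.table(text = paste(sentence, collapse = '\n'), fill = TRUE)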
test <- cbind(sentence, dd)
Alternatively,
# Strip any embedded newlines first so that each sentence stays on a single input line
cc <- read.table(text = paste(gsub('\n', '', sentence), collapse = '\n'), fill = TRUE)
test1 <- cbind(sentence, cc)
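For comparison, here is a minimal base R sketch that avoids the recycling directly: it pads each word vector with NA up to the length of the longest sentence before binding the rows. This is not part of rawr's answer; the names words, max_len and padded are made up for illustration, and it assumes words are separated by single spaces.

```r
sentence <- c("case sweden",
              "meeting minutes ht board meeting st march now also attachment added agenda today s board meeting",
              "draft meeting minutes board meeting final meeting minutes ht board meeting rd april")

# Split each sentence on single spaces (assumption: no repeated or leading spaces)
words <- strsplit(sentence, " ", fixed = TRUE)

# Number of words in the longest sentence
max_len <- max(lengths(words))

# Pad every word vector with NA up to max_len, so rbind() has nothing to recycle
padded <- lapply(words, function(w) c(w, rep(NA_character_, max_len - length(w))))

# One row per sentence, one column per word position
word_table <- do.call(rbind, padded)

test <- data.frame(sentence, word_table, stringsAsFactors = FALSE)
```

Here the unused cells are NA rather than empty strings. It may also be worth knowing that read.table() works out the number of columns from the first five lines of input unless col.names is supplied, so with many sentences the padding approach can be the safer choice when the longest sentence comes later.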
Thanks.
Regarding "r - Converting sentences to a word table using R", a similar question can be found on Stack Overflow: https://stackoverflow.com/questions/35855735/