r - Get the column number in each row that contains a specific character in an R data frame

Suppose I have the data frame shown below.

a <- c('A', 'b', 'c')
b <- c('b', 'c', 'A')
c <- c('c', 'A', 'b')
df <- data.frame(a, b, c)

df
  a b c
1 A b c
2 b c A
3 c A b

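I want to generate additional columns like the ones below. Essentially, df$b_pos indicates whether "b" comes before or after "A" in each row (the same applies to df$c_pos).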

df$b_pos <- c('after A', 'before A', 'after A')
df$c_pos <- c('after A', 'before A', 'before A')

df
  a b c    b_pos    c_pos
1 A b c  after A  after A
2 b c A before A before A
3 c A b  after A before A

I would like to write something like the following lines so that I can automate the process.

df$b_pos <- ifelse(get_the_column_index_of_A > 
                     get_the_column_index_of_b, 'before A', 'after A')
df$c_pos <- ifelse(get_the_column_index_of_A > 
                     get_the_column_index_of_c, 'before A', 'after A')

I would be very grateful if someone could suggest what to use in place of "get_the_column_index_of_A".

Best answer

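We can do this with max.col. Applied to a logical matrix such as df == "A" with ties.method "first", it returns, for each row, the index of the first column where the comparison is TRUE.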

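# For each of "b" and "c": compare the column index of "A" with the column index of x.
# TRUE + 1 selects "after A", FALSE + 1 selects "before A".
df[c('b_pos', 'c_pos')] <- lapply(letters[2:3], function(x) 
         c("before A", "after A")[1+(max.col(df=="A", "first") < max.col(df==x, "first"))])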
df
#  a b c    b_pos    c_pos
#1 A b c  after A  after A
#2 b c A before A before A
#3 c A b  after A before A
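
To connect this back to the placeholders in the question, the max.col call can be wrapped in a small helper. The following is a minimal sketch, not part of the original answer: the name col_index_of is hypothetical, and returning NA for rows that lack the value is an added assumption (on a row with no match, max.col(..., "first") would otherwise return 1).

```r
# Rebuild the example data frame from the question
df <- data.frame(a = c("A", "b", "c"),
                 b = c("b", "c", "A"),
                 c = c("c", "A", "b"))

# Hypothetical helper: for each row, the index of the first column equal to `value`
col_index_of <- function(data, value) {
  hit <- as.matrix(data) == value            # logical matrix, one cell per entry
  idx <- max.col(hit, ties.method = "first") # first TRUE in each row
  idx[rowSums(hit) == 0] <- NA_integer_      # assumption: report NA when the value is absent
  idx
}

cols  <- c("a", "b", "c")                    # search only the original columns
a_idx <- col_index_of(df[cols], "A")

# Same shape as the lines sketched in the question
df$b_pos <- ifelse(a_idx > col_index_of(df[cols], "b"), "before A", "after A")
df$c_pos <- ifelse(a_idx > col_index_of(df[cols], "c"), "before A", "after A")
df
```

Restricting the search to cols keeps the helper safe to rerun after b_pos and c_pos have already been added to df.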

Another option is to paste each row of the dataset into a single string and use grepl to check for a pattern.

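# Paste only the original columns a, b, c, so text in existing b_pos/c_pos columns cannot match the pattern
df[c('b_pos', 'c_pos')] <- lapply(c("A.*b",  "A.*c"), function(x) 
           c("before A", "after A")[grepl(x, do.call(paste0, df[c("a", "b", "c")]))+1L])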
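
The intermediate values make it easier to see why the pattern approach works. Here is a step-by-step sketch on the question's data; the variable names row_strings, b_after_A and b_pos are illustrative and do not come from the original answer.

```r
df <- data.frame(a = c("A", "b", "c"),
                 b = c("b", "c", "A"),
                 c = c("c", "A", "b"))

# Step 1: collapse each row into one string
row_strings <- do.call(paste0, df[c("a", "b", "c")])
row_strings
# "Abc" "bcA" "cAb"

# Step 2: TRUE where an "A" is followed, anywhere later in the string, by "b"
b_after_A <- grepl("A.*b", row_strings)
b_after_A
# TRUE FALSE TRUE

# Step 3: FALSE + 1 selects "before A", TRUE + 1 selects "after A"
b_pos <- c("before A", "after A")[b_after_A + 1L]
b_pos
# "after A" "before A" "after A"
```

This relies on every cell holding a single character. With longer cell values, the pasted string could produce accidental matches across cell boundaries, in which case comparing column indices with max.col is likely the safer choice.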

Regarding "r - Get the column number in each row that contains a specific character in an R data frame", we found a similar question on Stack Overflow: https://stackoverflow.com/questions/37477209/
