r - 使用 dplyr 管道将 for 循环的输出提取到 R 中的数据帧中

标签 r for-loop dplyr

无法弄清楚如何在 for 循环中进行一系列 t 测试,并在每次测试完成时获取输出并将结果附加到数据帧中。目标是一次运行多个 t 检验并生成所有结果的数据框。

这是用 mtcars 数据集缓慢完成的:

library(tidyverse)
library(rstatix)


# T-test to determine if there is a significant difference between mpg of 
# automatic vs manual transmissions (automatic=0, manual=1)
t1 <- mtcars %>% 
  t_test(mpg ~ am) %>% 
  mutate(var = "am") # add lable to merge by

# Calculate mean mpg of both groups
t1.1 <- mtcars %>% 
  group_by(am) %>% 
  summarize(Mean = mean(mpg, na.rm=TRUE)) %>% 
  pivot_wider(names_from = am, values_from = Mean) %>% # Bring to wide format to add to df
  mutate(var = "am") # add label to merge by

# T-test for vs (v-shape=0, straight line=1)
t2 <- mtcars %>% 
  t_test(mpg ~ vs) %>% 
  mutate(var = "vs") # add lable to merge by
# Calculate mean mpg of both groups
t2.1 <- mtcars %>% 
  group_by(vs) %>% 
  summarize(Mean = mean(mpg, na.rm=TRUE)) %>% 
  pivot_wider(names_from = vs, values_from = Mean) %>% # Bring to wide format to add to df
  mutate(var = "vs") # add label to merge by

# Merge dfs and rename
t_bind <- rbind(t1, t2)
t.1_bind <- rbind(t1.1, t2.1)
t.1_bind <- t.1_bind %>% rename("mean_0" = "0", "mean_1" = "1")
t_merge <- merge(t_bind, t.1_bind, by = "var")

但是当我尝试将其设置为循环时,我迷失了。看起来这应该是相当简单的,只是没有考虑清楚

t_vars <- c("am", "vs")  # etc.

for (i in t_vars) {
  x1 <- mtcars %>% 
    t_test(mpg ~ i) %>% 
    mutate(var = colnames(mpg[[i]]))
  df <- append(x1)
}

# Error: Can't extract columns that don't exist.
# x Column `i` doesn't exist.

感谢您的帮助!!

最佳答案

类似这样的东西吗?

bind_rows(lapply(c("am", "vs"), function(i) {
  mtcars %>% 
    t_test(formula(paste0("mpg ~ ",i)),detailed=T) %>% 
    mutate(var = i)
}))

输出:

# A tibble: 2 × 16
  estimate estimate1 estimate2 .y.   group1 group2    n1    n2 statistic       p    df conf.low conf.high method alternative var  
     <dbl>     <dbl>     <dbl> <chr> <chr>  <chr>  <int> <int>     <dbl>   <dbl> <dbl>    <dbl>     <dbl> <chr>  <chr>       <chr>
1    -7.24      17.1      24.4 mpg   0      1         19    13     -3.77 0.00137  18.3    -11.3     -3.21 T-test two.sided   am   
2    -7.94      16.6      24.6 mpg   0      1         18    14     -4.67 0.00011  22.7    -11.5     -4.42 T-test two.sided   vs   

关于r - 使用 dplyr 管道将 for 循环的输出提取到 R 中的数据帧中,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/70906592/

相关文章:

r - 将矩阵转换为 r 中的数据帧

r - R语言计算

r - 如何随机采样具有唯一列值的数据帧行

r - 如何在 R 中优化以下程序以提高性能? (涉及计算密集型置换测试的蒙特卡罗模拟)

r - R中的for循环下载 map 数据(栅格包)

r - 如何使用 lag/lead 和 ifelse/case_when(或其他解决方案)处理 R 中的纵向症状数据?

r - ggsave 抛出错误

r - 在第二个数据集中搜索第一个数据集中列表中出现的值

java - 比较for循环每一轮的值?

r - left_join 不合并所有值