r - 在 dplyr::funs 的命名参数中，我可以引用其他参数的名称吗？

考虑以下:

library(tidyverse)

df <- tibble(x = rnorm(100), y = rnorm(100, 10, 2), z = x * y)

df %>% 
mutate_all(funs(avg = mean(.), dev = sd(.), scaled = (. - mean(.)) / sd(.)))

有没有办法避免调用mean和 sd通过引用 avg 两次和 dev列。我的想法是

df %>% 
mutate_all(funs(avg = mean(.), dev = sd(.), scaled = (. - avg) / dev))

显然这行不通，因为没有列 avg和 dev , 但是 x_avg , x_dev , y_avg , y_dev ， ETC。

有没有好办法，内funs使用 rlang以编程方式创建这些列引用的工具，以便我可以将由先前命名参数创建的列引用到 funs (当 . 是 x 时，我会引用 x_mean 和 x_dev 来计算 x_scaled 等等)？

最佳答案

我认为如果您将数据转换为长格式会更容易

library(tidyverse)

set.seed(111)
df <- tibble(x = rnorm(100), y = rnorm(100, 10, 2), z = x * y)

df %>% 
  gather(key, value) %>% 
  group_by(key) %>% 
  mutate(avg    = mean(value),
         sd     = sd(value),
         scaled = (value - avg) / sd)
#> # A tibble: 300 x 5
#> # Groups:   key [3]
#>    key    value     avg    sd scaled
#>    <chr>  <dbl>   <dbl> <dbl>  <dbl>
#>  1 x      0.235 -0.0128  1.07  0.232
#>  2 x     -0.331 -0.0128  1.07 -0.297
#>  3 x     -0.312 -0.0128  1.07 -0.279
#>  4 x     -2.30  -0.0128  1.07 -2.14 
#>  5 x     -0.171 -0.0128  1.07 -0.148
#>  6 x      0.140 -0.0128  1.07  0.143
#>  7 x     -1.50  -0.0128  1.07 -1.39 
#>  8 x     -1.01  -0.0128  1.07 -0.931
#>  9 x     -0.948 -0.0128  1.07 -0.874
#> 10 x     -0.494 -0.0128  1.07 -0.449
#> # ... with 290 more rows

创建于 2018-11-04 由 reprex package (v0.2.1.9000)

关于r - 在 dplyr::funs 的命名参数中，我可以引用其他参数的名称吗？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/53143522/

r - 在 dplyr::funs 的命名参数中，我可以引用其他参数的名称吗？

上一篇：ruby-on-rails - Git push Heroku master 需要永远

下一篇：ruby-on-rails - Bundler 正在使用为不同 gem 创建的 binstub