r - 使用 ggplot2 复制食物可视化的节奏

标签 r ggplot2

我正在尝试复制 Google 的 Rhythm of Food 的精美可视化效果用我自己的数据集显示我公司每周雇用了多少人。数据集(名为 hiresbyweek)如下所示(这是 81 行中的第 25 行,link to full dataset here)

            Week Year total.Hires     Month WeekNum
  2014-05-05 0:00:00 2014           1       May      18
  2014-05-12 0:00:00 2014           1       May      19
  2014-05-19 0:00:00 2014           1       May      20
  2014-05-26 0:00:00 2014           1       May      21
  2014-08-04 0:00:00 2014           1    August      31
  2014-09-08 0:00:00 2014           1 September      36
  2015-02-23 0:00:00 2015           3  February      08
  2015-03-23 0:00:00 2015           4     March      12
  2015-05-04 0:00:00 2015           1       May      18
  2015-06-01 0:00:00 2015           1      June      22
  2015-06-08 0:00:00 2015           1      June      23
  2015-09-14 0:00:00 2015           3 September      37
  2015-09-21 0:00:00 2015           4 September      38
  2015-09-28 0:00:00 2015          15 September      39
  2015-10-05 0:00:00 2015          20   October      40
  2015-10-12 0:00:00 2015          47   October      41
  2015-10-19 0:00:00 2015          40   October      42
  2015-10-26 0:00:00 2015          39   October      43
  2015-11-02 0:00:00 2015           5  November      44
  2015-11-09 0:00:00 2015           2  November      45
  2015-11-16 0:00:00 2015           7  November      46
  2015-11-23 0:00:00 2015           1  November      47
  2015-11-30 0:00:00 2015           7  November      48
  2015-12-07 0:00:00 2015           3  December      49
  2015-12-14 0:00:00 2015           7  December      50

目前我已经做到了这一点:

ggplot(hiresbyweek,aes( x=WeekNum, y=total.Hires,fill=as.factor(Year)))
+geom_histogram(stat="identity", aes( x=WeekNum, y=total.Hires,fill=as.factor(Year)))
+coord_polar()
+scale_fill_manual(values=c("#ACD9F4","#005DA6","#EC008C"))
+scale_x_discrete(labels = as.factor(hiresbyweek$Month))
+scale_y_discrete(expand=c(0.5,0))
+theme(text=element_text(family="Avenir")
       , axis.ticks = element_blank()
       , panel.grid = element_blank()
       , panel.background = element_blank()
       )

这产生了一些接近的东西:

enter image description here

本质问题是:

1) 这些标签离它们应该在的地方不近: 请注意 10 月的最大数字,但根据图表,它们主要出现在 4 月或 3 月。

拥有的美好:

1) 我想按照食物图表的节奏对这些标题进行分组和旋转,这样标签会更简单

2) 我想大大减少所述条的相对大小;我已经将其作为计数(geom_historgram(stat="count")或 stat="bin")完成,但这使它们都相等并消除了比例的重要性,这是这里的关键。

3) 我想在条之间插入一些空格。我尝试在 color="white"中添加 ggplot(hiresbyweek,aes( x=WeekNum, y=total.Hires,colour="white",fill=as.factor(Year))) 和 geom_histogram(stat ="identity", aes( x=WeekNum, y=total.Hires,fill=as.factor(Year), color="white")) 奇怪的是都有粉红色的轮廓...

第一部分的帮助是最重要的(当时我觉得它很像样),但任何人都欢迎。感谢您的时间和想法。

最佳答案

我一直在等待其他人发布更好、更简洁的答案,但我希望在此期间能做到这一点。

# 1. We can control the order of geom_bars based on the levels of the factor of X. 
# So we make a new factor variable and ensure that the levels are in the order of 
# < January1, January2, ..., February2, ..., December3, December4 >  
hiresbyweek <- hiresbyweek[order(hiresbyweek$WeekNum),]
hiresbyweek$X <- factor(paste0(hiresbyweek$WeekNum, hiresbyweek$Month), 
                    levels = unique(paste0(hiresbyweek$WeekNum, hiresbyweek$Month)))

# 2. But we don't want the axis labels to be: "Jan1, Jan2, Jan3, ..."
# Instead we'll extract only the month out of the X variable (though notice the weekNum
# variable was important so we could get the right order and distinct factor levels)
# But we also don't want repeated axis labels: "Jan, "Jan", "Jan", "Feb", "Feb", ....
# So try to place the unique axis label close to the middle, and leave the rest blank
# (ie. "", "Jan", "", "", "Feb")
makeLabels <- function(x) {
  x <- gsub("[0-9]", "", x)
  labs <- c();
  for (a in unique(x)) {
    b <- rep("", length(x[x == a]))
    b[ ceiling(length(x[x==a])/2) ] <- a
    labs <- append(labs, b)
  }
  return(labs)
}

# 3. Angle the axis labels to imitate Google's Rhythm of Food
ang <- -360 / length(unique(hiresbyweek$X)) * seq_along(hiresbyweek$X)
ang[ang <= -90 & ang >= -300] <- ang[ang <= -90 & ang >= -300] -180

ggplot(hiresbyweek, aes( x = X, y = total.Hires,fill = as.factor(Year))) +
  geom_histogram(stat="identity", width = 0.5) + # Use width arg for more space between bars
  coord_polar() + 
  scale_x_discrete(labels = makeLabels) + # Apply getLabel function to X
  scale_y_discrete(expand=c(0.5,0)) + 
  scale_fill_manual(values=c("#ACD9F4","#005DA6","#EC008C")) + 
  theme(axis.ticks = element_blank(), 
    panel.grid = element_blank(), 
    panel.background = element_blank(),
    text = element_text(family="Avenir"),
    title = element_blank(), # Remove all titles
    axis.text.x = element_text(angle= ang)) # Apply angles to x-axis labels

结果: result

关于r - 使用 ggplot2 复制食物可视化的节奏,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/42075261/

相关文章:

r - 从 Tidy Dataframes 创建节点和边 Dataframes

r - 使用来自不同数据框的内容操作数据框,类似于 SQL 连接

python - python可以绘制具有3d效果的标记吗?

r - 具有相同功能名称的软件包

r - 使用循环 ggplot2 将各种函数添加到同一个绘图中

r - 如何安装 RHadoop 包(Rmr、Rhdfs、Rhbase)?

r - 在 mgcv::gam() 包装器中传递偏移量参数

r - 将十进制月份和年份转换为日期

r - 改进的马赛克图(如热图或气泡)

r - (ROSE绘图错误)错误:构面变量必须至少具有一个值