
Conditional Factor Level Selection in Aggregation of Data Table


I'm trying to aggregate a data.table to 1 row per ID. 
Suppose the first column represents ID and the last column is the factor of interest: 
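library(data.table)
mydt <- data.table(V1 = c(1, 1, 1, 2, 2), V2 = c(2, 12, 12, 12, 12), V3 = c("Level 1", "Level 0", "Level 0", "Level 3", "Level 2"))  # built column-wise so V2 stays numeric; going through matrix() would coerce every column to character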
mydt
   V1 V2      V3
1:  1  2 Level 1
2:  1 12 Level 0
3:  1 12 Level 0
4:  2 12 Level 3
5:  2 12 Level 2
 
I have non-intuitive rules for how to aggregate the factor: 
 
 if Level 1 exists in any row for an ID, the aggregated row should have Level 1 for that ID 
 if not, and Level 2 exists for that ID, use Level 2 
 if not, use Level 3 if it exists 
 if not, use Level 0  
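
The rules above can be written out literally as a small helper. This is only a sketch: the function name `pick_level` is made up for illustration, and it assumes the levels arrive as a plain character vector for a single ID.

```r
# Hypothetical helper (not part of data.table): applies the four rules
# in order to the levels observed for one ID.
pick_level <- function(x) {
  if ("Level 1" %in% x) {
    "Level 1"
  } else if ("Level 2" %in% x) {
    "Level 2"
  } else if ("Level 3" %in% x) {
    "Level 3"
  } else {
    "Level 0"   # fallback when none of the higher-priority levels is present
  }
}

id1 <- pick_level(c("Level 1", "Level 0", "Level 0"))  # levels seen for ID 1
id2 <- pick_level(c("Level 3", "Level 2"))             # levels seen for ID 2
print(c(id1, id2))
```

For the sample data this picks Level 1 for ID 1 and Level 2 for ID 2, which matches the desired result.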
 
The actual data.table is very large, and the real factor levels have no numeric component; they are just strings. This script will be run at least once per day, so I'm trying to avoid slow pre-processing with loops.  
The desired result would look like this: 
   V1    V2      V3
1:  1  8.67 Level 1
2:  2 12.00 Level 2
 
However, I can't find a suitable aggregation function... 
mydt[,.(V2 = mean(V2, na.rm = T), V3 = if("Level 1") "Level 1" else if("idk help me out?")), by = "V1"]
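
One possible way to express the priority rule inside the grouped aggregation, without an explicit loop, is to rank each string against a priority vector with `match()` and keep the best rank per group. This is a sketch under assumptions: the vector name `level_priority` is invented here, the sample table is rebuilt with a numeric `V2` so that `mean()` works, and performance on a very large table has not been measured.

```r
library(data.table)

# Sample data from the question, built column-wise so V2 is numeric.
mydt <- data.table(
  V1 = c(1, 1, 1, 2, 2),
  V2 = c(2, 12, 12, 12, 12),
  V3 = c("Level 1", "Level 0", "Level 0", "Level 3", "Level 2")
)

# Assumed name: levels listed from highest to lowest priority.
level_priority <- c("Level 1", "Level 2", "Level 3", "Level 0")

result <- mydt[, .(
  V2 = mean(V2, na.rm = TRUE),
  # match() gives each row's rank in level_priority; the smallest rank
  # in the group is the highest-priority level present for that ID.
  V3 = level_priority[min(match(V3, level_priority))]
), by = V1]

print(result)
```

Changing the rule then only means reordering `level_priority`. A value of `V3` that is not listed in the vector makes `match()` return `NA`, so that group's `V3` comes out as `NA` rather than silently falling back to Level 0.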

Source: https://stackoverflow.com/questions/35232201

Updated: 2022-11-19 20:11

Best answer


You're probably looking for the threshold option. 
plotOptions: {
    series: {
        threshold: 100
    }
}
 
Demo
