Mean over 2d numpy array with varying slices

I need to calculate the mean over the columns of a 2D numpy array where the slice per column varies. 
For example, I have an array 
    arr = np.arange(20).reshape(4, 5)
 
with the end index of the slice for each column mean defined as 
    bot_ix = np.array([3, 2, 2, 1, 2])
 
The mean of the first column would then be 
    arr[0:bot_ix[0], 0].mean()
 
What's the appropriate (i.e. Pythonic + efficient) way to do this? My array sizes are ~(50, 50K).
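The computation described above can be sketched without a Python loop by building a boolean mask through broadcasting. This is a minimal sketch rather than a definitive answer: the function name `column_prefix_means` is hypothetical, and it assumes every entry of `bot_ix` is at least 1 and no larger than the number of rows.

```python
import numpy as np


def column_prefix_means(arr, end_ix):
    """Mean of arr[0:end_ix[j], j] for every column j (hypothetical helper).

    Assumes 1 <= end_ix[j] <= arr.shape[0] for all j.
    """
    # Row indices as a column vector, shape (n_rows, 1)
    rows = np.arange(arr.shape[0])[:, None]
    # Broadcasting against end_ix (shape (n_cols,)) gives an
    # (n_rows, n_cols) mask that is True inside each column's slice
    mask = rows < end_ix
    # Zero out everything outside the slice, sum per column,
    # then divide by the slice length of that column
    return np.where(mask, arr, 0).sum(axis=0) / end_ix


arr = np.arange(20).reshape(4, 5)
bot_ix = np.array([3, 2, 2, 1, 2])
col_means = column_prefix_means(arr, bot_ix)
print(col_means)  # [5.  3.5 4.5 3.  6.5]
```

The mask has the same shape as the array, so for a (50, 50K) input it costs one extra boolean array of that size, which should be acceptable at the sizes mentioned in the question.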

Original: https://stackoverflow.com/questions/37822880

Updated: 2022-12-15 19:12

Best answer

There is a similar question here: 
How to add ROW INDEX as a column to SQL SELECT query? 
Extending from that question you want something like: 
    SET @row_num = 0;
    SELECT
        T.id, T.name, T.status, IFNULL(T.image, 'no-image.png') AS DP,
        (SELECT COUNT(*)
         FROM badminton_matches MT
         WHERE (MT.team_one = T.id OR MT.team_two = T.id)) AS played,
        (SELECT COUNT(*)
         FROM badminton_match_results R
         WHERE R.winner_id = T.id) AS won,
        (SELECT COUNT(*)
         FROM badminton_matches MT
         JOIN badminton_match_results MR ON (MR.match_id = MT.id)
         WHERE (MT.team_one = T.id OR MT.team_two = T.id)
           AND MR.winner_id != T.id) AS lost,
        (
            ((SELECT COUNT(*)
              FROM badminton_match_results R
              WHERE R.winner_id = T.id) * 2)
            +
            ((SELECT COUNT(*)
              FROM badminton_match_results R
              JOIN badminton_matches M ON (M.id = R.match_id AND M.match_type = 'quarter')
              WHERE R.winner_id = T.id))
        ) AS Points,

        /* here is the magic */
        (@row_num := @row_num + 1) < 4 AS row_index

    FROM badminton_teams T
    ORDER BY (Points) DESC
 
This will add an extra column called row_index where 1 means in top 3 and 0 means not in the top 3. 
Remember that you must run the SET statement before each SELECT, and within the same session.
