MySQL: Get Counts and Averages [duplicate]
select COUNT(pd.property_id) AS `Beginning Total File Count`,
       COUNT(pd.recv_dt) AS `average days in inventory`,
       AVG(pd.status = 'P') AS `average days in pre-marketing`,
       AVG(pd.status NOT IN ('I', 'C')) AS `average days on market`,
       AVG(pd.status = 'U') AS `average days under contract`,
       SUM(pd.status = 'O') AS `Total Files Occupied Status`,
       SUM(pd.status = 'O') / COUNT(pd.property_id) AS `percentage of Occupied / total file count`
from resnet.property_Details pd
I'm trying to get
- Beginning total file count
- Average days in inventory
- Average days in Pre-Marketing
- Average days on market
- Average days under contract
- Total files in occupied status
- Percentage of Occupied / total file count
Not sure if my query is written properly, please help :)
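One thing worth checking before anything else: in MySQL, `AVG(pd.status = 'P')` averages a 0/1 boolean, so it yields the *proportion* of rows in that status, not a number of days; an "average days" metric needs a date difference inside the aggregate. Below is a minimal, hedged sketch of that distinction using an in-memory SQLite stand-in (the table and column names mirror the question, but the sample rows and the `list_dt` column are made up for illustration; `julianday()` is SQLite's substitute for MySQL's `DATEDIFF()`):

```python
# Sketch: conditional aggregation vs. date arithmetic, on invented sample data.
# SQLite stands in for MySQL here; in MySQL you would write
# AVG(DATEDIFF(list_dt, recv_dt)) instead of the julianday() difference.
import sqlite3

con = sqlite3.connect(":memory:")
con.execute("""CREATE TABLE property_Details (
    property_id INTEGER, status TEXT, recv_dt TEXT, list_dt TEXT)""")
con.executemany(
    "INSERT INTO property_Details VALUES (?,?,?,?)",
    [(1, 'P', '2017-01-01', '2017-01-11'),   # 10 days
     (2, 'O', '2017-01-05', '2017-01-15'),   # 10 days
     (3, 'U', '2017-02-01', '2017-02-21'),   # 20 days
     (4, 'O', '2017-03-01', '2017-03-06')])  #  5 days

row = con.execute("""
    SELECT COUNT(property_id)  AS total_files,
           SUM(status = 'O')   AS occupied_files,
           AVG(status = 'O')   AS occupied_share,          -- a proportion in 0..1
           AVG(julianday(list_dt) - julianday(recv_dt))
                               AS avg_days_in_inventory    -- an actual day count
    FROM property_Details""").fetchone()
print(row)  # → (4, 2, 0.5, 11.25)
```

So `SUM(status = 'O') / COUNT(property_id)` in the original query is fine for the occupancy percentage, but the "average days" columns need a `DATEDIFF`-style expression rather than `AVG` over a status test.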
Source: https://stackoverflow.com/questions/43689973
Accepted answer
I think you need between with boolean indexing to filter first, then groupby with the size aggregation. The outputs are concatenated, and reindex adds the missing rows filled by 0:

print (df)
         Date ID
0  01/01/2016  a
1  05/01/2016  a
2  10/05/2017  a
3  05/05/2018  b
4  07/09/2014  b
5  07/09/2014  c
6  12/08/2018  b

#convert to datetime (if the first number is the day, add parameter dayfirst)
df['Date'] = pd.to_datetime(df['Date'], dayfirst=True)

now = pd.Timestamp.today()
print (now)

oneyearbeforenow = now - pd.offsets.DateOffset(years=1)
oneyearafternow = now + pd.offsets.DateOffset(years=1)

#first filter
a = df[df['Date'].between(oneyearbeforenow, now)].groupby('ID').size()
b = df[df['Date'].between(now, oneyearafternow)].groupby('ID').size()
print (a)
ID
a    1
dtype: int64

print (b)
ID
b    2
dtype: int64

df1 = pd.concat([a, b], axis=1).fillna(0).astype(int).reindex(df['ID'].unique(), fill_value=0)
print (df1)
   0  1
a  1  0
b  0  2
c  0  0
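Because the answer's code pivots on today's date, its printed counts change from day to day. A deterministic sketch of the same between → groupby → size → concat → reindex chain, with a fixed reference date standing in for today() (the date 2017-05-15 is an assumption chosen for reproducibility), looks like this:

```python
# Deterministic version of the answer's approach: count per-ID rows in the
# year before and the year after a fixed reference date, then align with
# concat/reindex so every ID appears even when a window is empty.
import pandas as pd

df = pd.DataFrame({
    'Date': pd.to_datetime(['01/01/2016', '05/01/2016', '10/05/2017',
                            '05/05/2018', '07/09/2014', '07/09/2014',
                            '12/08/2018'], dayfirst=True),
    'ID': ['a', 'a', 'a', 'b', 'b', 'c', 'b']})

now = pd.Timestamp('2017-05-15')          # assumed reference date, not today()
one_year = pd.offsets.DateOffset(years=1)

last_year = df[df['Date'].between(now - one_year, now)].groupby('ID').size()
next_year = df[df['Date'].between(now, now + one_year)].groupby('ID').size()

res = (pd.concat([last_year, next_year], axis=1)
         .fillna(0).astype(int)
         .reindex(df['ID'].unique(), fill_value=0))
print(res)
#    0  1
# a  1  0
# b  0  1
# c  0  0
```

The reindex step is what guarantees a row for 'c' even though it matched neither window.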
EDIT:

If you need to compare each date against the group's last date (x.iat[-1]) plus or minus a year offset, you need a custom function with the conditions and a sum of the True values per group:

offs = pd.offsets.DateOffset(years=1)
f = lambda x: pd.Series([(x > x.iat[-1] - offs).sum(), \
                         (x < x.iat[-1] + offs).sum()],
                        index=['last', 'next'])
df = df.groupby('ID')['Date'].apply(f).unstack(fill_value=0).reset_index()
print (df)
  ID  last  next
0  a     1     3
1  b     3     2
2  c     1     1
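The per-group variant above can be sketched end to end on a small made-up dataset (two IDs, deterministic dates, so the counts can be checked by hand). For each ID it counts how many of that group's dates fall within one year before, and within one year after, the group's last recorded date:

```python
# Per-group window counting: x.iat[-1] is the last date within each group
# (groupby preserves the original row order inside a group). Sample data is
# invented so the result is reproducible.
import pandas as pd

df = pd.DataFrame({
    'ID':   ['a', 'a', 'a', 'b', 'b'],
    'Date': pd.to_datetime(['2016-01-01', '2016-01-05', '2017-05-10',
                            '2014-09-07', '2018-08-12'])})

offs = pd.offsets.DateOffset(years=1)

def window_counts(x):
    # count dates after (last date - 1 year) and before (last date + 1 year)
    return pd.Series([(x > x.iat[-1] - offs).sum(),
                      (x < x.iat[-1] + offs).sum()],
                     index=['last', 'next'])

res = (df.groupby('ID')['Date'].apply(window_counts)
         .unstack(fill_value=0)
         .reset_index())
print(res)
#   ID  last  next
# 0  a     1     3
# 1  b     1     2
```

For group 'a' the last date is 2017-05-10, so only that date itself falls in the trailing year (last = 1) while all three dates precede 2018-05-10 (next = 3), which is what makes the strict >/< comparisons easy to verify by hand.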