首页 \ 问答 \ INSERT INTO与SELECT INTO(INSERT INTO vs SELECT INTO)

INSERT INTO与SELECT INTO(INSERT INTO vs SELECT INTO)

 使用有什么区别？  
SELECT ... INTO MyTable FROM...
 
 和  
INSERT INTO MyTable (...)
SELECT ... FROM ....
 
 ？  
 从BOL [ INSERT ， SELECT ... INTO ]，我知道使用SELECT ... INTO将在默认文件组中创建插入表，如果它不存在，并且该语句的日志记录取决于恢复数据库模型。  
 
  哪个说法最好？  
  是否有其他性能影响？  
  什么是SELECT ... INTO在INSERT INTO ...中的一个很好的用例？  
 
 编辑：我已经说过我知道那个SELECT INTO ...创建一个不存在的表。 我想知道的是，SQL包含这个声明是一个原因，是什么？ 它是否在幕后插入行，或者只是语法糖在CREATE TABLE和INSERT INTO之上。 

What is the difference between using 
SELECT ... INTO MyTable FROM...
 
and 
INSERT INTO MyTable (...)
SELECT ... FROM ....
 
? 
From BOL [ INSERT, SELECT...INTO ], I know that using SELECT...INTO will create the insertion table on the default file group if it doesn't already exist, and that the logging for this statement depends on the recovery model of the database. 
 
 Which statement is preferable?  
 Are there other performance implications? 
 What is a good use case for SELECT...INTO over INSERT INTO ...? 
 
Edit: I already stated that I know that that SELECT INTO... creates a table where it doesn't exist. What I want to know is that SQL includes this statement for a reason, what is it? Is it doing something different behind the scenes for inserting rows, or is it just syntactic sugar on top of a CREATE TABLE and INSERT INTO.

原文：https://stackoverflow.com/questions/6947983

更新时间：2024-03-05 13:03

最满意答案

 需要除以div with groupby by level day_of_week with transform for new Series with index with original df ：  
print (X.groupby(level='day_of_week')['count'].transform('sum'))
day_of_week  cat
0            0      145
             1      145
1            0       87
             1       87
2            0       82
             1       82
3            0      170
             1      170
4            0      150
             1      150
5            0      112
             1      112
6            0       25
             1       25
Name: count, dtype: int32
X['ratio'] = X['count'].div(X.groupby(level='day_of_week')['count'].transform('sum'))
print (X)
                 count     ratio
day_of_week cat                 
0           0       52  0.358621
            1       93  0.641379
1           0       15  0.172414
            1       72  0.827586
2           0       61  0.743902
            1       21  0.256098
3           0       83  0.488235
            1       87  0.511765
4           0       75  0.500000
            1       75  0.500000
5           0       88  0.785714
            1       24  0.214286
6           0        3  0.120000
            1       22  0.880000
 
 在最后一个pandas版本可能省略level ：  
X['ratio'] = X['count'].div(X.groupby('day_of_week')['count'].transform('sum'))

Need divide by div with groupby by level day_of_week with transform for new Series with same index as original df: 
print (X.groupby(level='day_of_week')['count'].transform('sum'))
day_of_week  cat
0            0      145
             1      145
1            0       87
             1       87
2            0       82
             1       82
3            0      170
             1      170
4            0      150
             1      150
5            0      112
             1      112
6            0       25
             1       25
Name: count, dtype: int32
X['ratio'] = X['count'].div(X.groupby(level='day_of_week')['count'].transform('sum'))
print (X)
                 count     ratio
day_of_week cat                 
0           0       52  0.358621
            1       93  0.641379
1           0       15  0.172414
            1       72  0.827586
2           0       61  0.743902
            1       21  0.256098
3           0       83  0.488235
            1       87  0.511765
4           0       75  0.500000
            1       75  0.500000
5           0       88  0.785714
            1       24  0.214286
6           0        3  0.120000
            1       22  0.880000
 
In last pandas version is possible omit level: 
X['ratio'] = X['count'].div(X.groupby('day_of_week')['count'].transform('sum'))

INSERT INTO与SELECT INTO(INSERT INTO vs SELECT INTO)

最满意答案

相关问答

pandas - 根据groupby指数水平绘制(pandas - plot according to groupby index level)[2022-06-05]

在熊猫中操纵子指数(Manipulating subindex in Pandas)[2021-05-13]

熊猫：抽样DataFrame [重复](Pandas: Sampling a DataFrame [duplicate])[2022-09-06]

pandas dataframe row multiindex跳过一个(pandas dataframe row multiindex skip one)[2024-05-02]

将列表的列表索引到块矩阵中X = [[A，B]，[C，D]] [重复](Subindex a list of lists into block matrices X = [ [A, B], [C, D]] [duplicate])[2022-01-21]

熊猫柱操纵(Pandas Column Manipulation)[2023-07-06]

提取熊猫中的多指数类型(extract multiindex types in pandas)[2023-03-20]

javascript修改字符串数组的子索引(javascript modify subindex of string array)[2022-02-28]

熊猫：每个小组按月汇总(Pandas: Aggregate by month for every subgroup)[2023-01-15]

Pandas multiIndex：为每个现有索引添加新索引(Pandas multiIndex: add new indexes for each existing index)[2022-02-17]

相关文章

最新问答