首页 \ 问答 \ 无法将具有相同优先级的对象插入ConcurrentSkipListSet(Unable to insert objects of equal priority into ConcurrentSkipListSet)

无法将具有相同优先级的对象插入ConcurrentSkipListSet(Unable to insert objects of equal priority into ConcurrentSkipListSet)

 我在使用ConcurrentSkipListSet处理观察者模式的简单线程安全实现时遇到问题，以便在插入期间跟踪观察者优先级。 大多数观察者都没有任何特殊的优先权归属于他们，并且遵循此Comparable＃compareTo方法在比较时将显示相同的优先级（其中优先级是五个优先级的枚举值，范围从最高到最低）：  
public int compareTo(BaseLink<?> link) {
    return this.priority.compareTo(link.getPriority());
}
 
 当我向ConcurrentSkipListSet添加具有相同优先级的观察者时，似乎在插入过程中会丢失一些添加的对象。 更改我在测试时创建的任何观察者的优先级会导致这些观察者被添加到集合中而没有问题，但我认为如果给予相同优先级的足够观察者，则问题将再次出现。  
 我不确定导致这个问题的原因，以及我应该做些什么来帮助解决这个问题。 我有什么办法可以解决这个问题吗？ 或者，如果这是ConcurrentSkipListSet的固有问题，是否还有其他线程安全的数据结构可以为独特对象提供合理的高效插入和排序时间？ 

I am having an issue with a simplistic thread-safe implementation of the observer pattern using a ConcurrentSkipListSet to handle keeping track of observer priorities during insertion. The majority of observers will not have any special priority attributed to them, and following this Comparable#compareTo method will show as equal priority when compared (where priority is a value in an enum of five priorities ranging from highest to lowest): 
public int compareTo(BaseLink<?> link) {
    return this.priority.compareTo(link.getPriority());
}
 
When I add observers of equal priorities to the ConcurrentSkipListSet , it seems like some of the added objects are simply lost during the insertion process. Changing the priorities of any of the observers I have created while testing this results in those observers being added to the set without issue, though I assume that given enough observers of the same priority the issue will arise again. 
I am unsure about what is causing this issue, and of what I should do to help resolve it. Is there anything I can do to resolve this issue? Alternatively if this is an inherent problem with the ConcurrentSkipListSet, are there any other thread-safe data structures that can give me reasonably performant insertion and sorting times for unique objects?

原文：https://stackoverflow.com/questions/37310754

更新时间：2023-11-03 13:11

最满意答案

 事实证明，我没有正确理解Keppler架构。 正如Greg Smith上面的评论中指出的那样，Keppler可以配置为具有8个字节的32个共享存储体。 在这种情况下，使用cudaDeviceSetSharedMemConfig( cudaSharedMemBankSizeEightByte ) ，共享内存布局如下所示：  
bank:  B0   B1   B2   B3   B4    ..   B31
       ----------------------------------
index: D00  D01  D02  D03  D04   ..   D31
       D32  D33  D34  D35  D36   ..   D63   
 
 现在，对于我的简单示例（使用itot=16 ），在一个warp中处理对例如前两行（ threadIdx.y=0 ， threadIdx.y=1 ）的共享内存的写入/读取。 这意味着对于threadIdx.y=0值D00..D15存储在B0..B15 ，则存在两个双精度的填充，之后在相同的变形值D18..D33中存储在B18..B31+B00..B01 ，导致B00-B01发生银行冲突。 如果没有填充（ ng=0 ），第一行将写入D00..D15中的B00..B15 ，即D00..D15中B00..B15中的第二行，因此不会发生库冲突。  
 对于blockDim.x>=32的线程块，不应该出现问题。 例如，对于itot=32 ， blockDim.x=32 ， ng=2 ，第一行存储在存储体B00..B31 ，然后存储两个单元格填充，第二行存储在B02..B31+B00..B01等中。 

It turns out that I didn’t understand the Keppler architecture correctly. As pointed out in one of the comments above by Greg Smith, Keppler can be configured to have 32 shared memory banks of 8 bytes. In such a case, using cudaDeviceSetSharedMemConfig( cudaSharedMemBankSizeEightByte ), the shared memory layout looks like: 
bank:  B0   B1   B2   B3   B4    ..   B31
       ----------------------------------
index: D00  D01  D02  D03  D04   ..   D31
       D32  D33  D34  D35  D36   ..   D63   
 
Now, for my simple example (using itot=16), the writing/reading to/from shared memory on e.g. the first two rows (threadIdx.y=0, threadIdx.y=1) is handled within one warp. This means that for threadIdx.y=0 values D00..D15 are stored in B0..B15, then there is a padding of two doubles, after which within the same warp values D18..D33 are stored in B18..B31+B00..B01, which causes a bank conflict on B00-B01. Without the padding (ng=0) the first row is written to D00..D15 in B00..B15, the second row in D16..D31 in B16..B31, so no bank conflict occurs.  
For a thread block of blockDim.x>=32 the problem shouldn’t occur. For example, for itot=32, blockDim.x=32, ng=2, the first row is stored in banks B00..B31, then two cells padding, second row in B02..B31+B00..B01, etc.

无法将具有相同优先级的对象插入ConcurrentSkipListSet(Unable to insert objects of equal priority into ConcurrentSkipListSet)

最满意答案

相关问答

什么是银行冲突？(What is a bank conflict? (Doing Cuda/OpenCL programming))[2021-10-31]

CUDA：最大限度地减少大数据类型的银行冲突(CUDA: minimize bank conflict for large data type)[2022-01-07]

cuda共享内存 - 不一致的结果(cuda shared memory - inconsistent results)[2023-04-01]

故意导致CUDA设备上的共享内存冲突(purposely causing bank conflicts for shared memory on CUDA device)[2022-12-17]

CUDA - 确定共享内存中的银行数量(CUDA - determine number of banks in shared memory)[2023-09-01]

针对64位线程独立共享内存最小化银行冲突的策略(Strategy for minimizing bank conflicts for 64-bit thread-separate shared memory)[2022-02-16]

银行冲突CUDA共享内存？(Bank conflict CUDA shared memory?)[2023-06-11]

CUDA中的共享内存分配(Shared memory allocation in CUDA)[2022-11-06]

为什么CUDA Profiler会指示重播说明：82％！=全局重播+本地重播+共享重播？(Why does CUDA Profiler indicate replayed instructions: 82% != global replay + local replay + shared replay?)[2024-01-28]

CUDA：syncthreads（）在共享内存操作之前？(CUDA: syncthreads() before shared memory operation?)[2022-01-29]

相关文章

最新问答

无法将具有相同优先级的对象插入ConcurrentSkipListSet(Unable to insert objects of equal priority into ConcurrentSkipListSet)

最满意答案

相关问答

什么是银行冲突？(What is a bank conflict? (Doing Cuda/OpenCL programming))[2021-10-31]

CUDA：最大限度地减少大数据类型的银行冲突(CUDA: minimize bank conflict for large data type)[2022-01-07]

cuda共享内存 - 不一致的结果(cuda shared memory - inconsistent results)[2023-04-01]

故意导致CUDA设备上的共享内存冲突(purposely causing bank conflicts for shared memory on CUDA device)[2022-12-17]

CUDA - 确定共享内存中的银行数量(CUDA - determine number of banks in shared memory)[2023-09-01]

针对64位线程独立共享内存最小化银行冲突的策略(Strategy for minimizing bank conflicts for 64-bit thread-separate shared memory)[2022-02-16]

银行冲突CUDA共享内存？(Bank conflict CUDA shared memory?)[2023-06-11]

CUDA中的共享内存分配(Shared memory allocation in CUDA)[2022-11-06]

为什么CUDA Profiler会指示重播说明：82％！=全局重播+本地重播+共享重播？(Why does CUDA Profiler indicate replayed instructions: 82% != global replay + local replay + shared replay?)[2024-01-28]

CUDA：__syncthreads（）在共享内存操作之前？(CUDA: __syncthreads() before shared memory operation?)[2022-01-29]

相关文章

最新问答

CUDA：syncthreads（）在共享内存操作之前？(CUDA: syncthreads() before shared memory operation?)[2022-01-29]