Case insensitive search in arrays for CosmosDB / DocumentDB

Let's say I have these documents in my CosmosDB (DocumentDB API, .NET SDK):
{
    // partition key of the collection
    "userId" : "0000-0000-0000-0000",
    "emailAddresses": [
        "someaddress@somedomain.com", "Another.Address@someotherdomain.com"
    ]
    // some more fields
}
 
I now need to find out whether a document exists for a given email address. However, the query needs to be case insensitive.
There are ways to do a case-insensitive search on a single field (although they do a full scan), as described in "How to do a Case Insensitive search on Azure DocumentDb?":
select * from json j where LOWER(j.name) = 'timbaktu'
e => e.Id.ToLower() == key.ToLower()
 
These do not work for arrays. Is there an alternative way? A user defined function looks like it could help. 
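The user-defined function idea mentioned above could look roughly like the following. Cosmos DB UDFs are written in JavaScript and are invoked from SQL with the `udf.` prefix; the function name `containsIgnoreCase` and its parameters are assumptions made for illustration, and the sketch is written in plain ES5 style so it also runs unchanged outside the database. Like the `LOWER()` approach, a UDF in a filter would presumably still be evaluated against every document.

```javascript
// Hypothetical UDF body; the name containsIgnoreCase is an assumption.
// Returns true when `values` is an array that contains a string equal to
// `needle`, ignoring case. Non-array input or non-string items never match.
function containsIgnoreCase(values, needle) {
  if (!Array.isArray(values) || typeof needle !== "string") {
    return false;
  }
  var target = needle.toLowerCase();
  return values.some(function (value) {
    return typeof value === "string" && value.toLowerCase() === target;
  });
}

// Once registered on the collection under that name, it would be called as:
//   SELECT * FROM c
//   WHERE udf.containsIgnoreCase(c.emailAddresses, "another.address@someotherdomain.com")
```

Guarding against missing or non-array `emailAddresses` keeps the function from throwing on documents that lack the field.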
I am mainly looking for a temporary low-effort solution to support the scenario (I have multiple collections like this). I probably need to switch to a data structure like this at some point: 
{
    "userId" : "0000-0000-0000-0000",
    // Option A
    "emailAddresses": [
        {
            "displayName": "someaddress@somedomain.com",
            "normalizedName" : "someaddress@somedomain.com"
        },
        {
            "displayName": "Another.Address@someotherdomain.com",
            "normalizedName" : "another.address@someotherdomain.com"
        }
    ],
    // Option B
    "emailAddressesNormalized": [
        "someaddress@somedomain.com", "another.address@someotherdomain.com"
    ]
}
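As a rough sketch of how Option B could be derived from an existing document, the helper below adds a lowercased copy of the addresses next to the original array. The function name `withNormalizedEmails` is hypothetical, and plain `toLowerCase()` is assumed to be an adequate normalization for these addresses.

```javascript
// Hypothetical helper: returns a shallow copy of `doc` with an added
// emailAddressesNormalized array (Option B). The input is not modified.
function withNormalizedEmails(doc) {
  const addresses = Array.isArray(doc.emailAddresses) ? doc.emailAddresses : [];
  const normalized = addresses.map((address) => String(address).toLowerCase());
  return Object.assign({}, doc, { emailAddressesNormalized: normalized });
}

const original = {
  userId: "0000-0000-0000-0000",
  emailAddresses: ["someaddress@somedomain.com", "Another.Address@someotherdomain.com"],
};
const updated = withNormalizedEmails(original);
console.log(updated.emailAddressesNormalized);
```

Keeping `emailAddresses` untouched preserves the display casing, while the new array could then be matched with an exact, case-sensitive comparison against a lowercased search value.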
 
Unfortunately, my production database already contains documents that would need to be updated to support the new structure. My production collections contain only hundreds of these items, so I am even tempted to just fetch all items and do the comparison in memory on the client.
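A minimal sketch of that client-side approach, assuming the documents have already been fetched into an array: build a lookup keyed by lowercased address once, then answer each query with a single map lookup. The name `buildEmailIndex` is hypothetical, and the sketch assumes each address belongs to at most one document (a later document would overwrite an earlier one).

```javascript
// Hypothetical helper: maps each lowercased email address to the userId
// of the document that contains it.
function buildEmailIndex(docs) {
  const index = new Map();
  for (const doc of docs) {
    for (const address of doc.emailAddresses || []) {
      index.set(String(address).toLowerCase(), doc.userId);
    }
  }
  return index;
}

const docs = [
  {
    userId: "0000-0000-0000-0000",
    emailAddresses: ["someaddress@somedomain.com", "Another.Address@someotherdomain.com"],
  },
];
const index = buildEmailIndex(docs);
// The search value is lowercased the same way before the lookup.
console.log(index.get("ANOTHER.ADDRESS@someotherdomain.com".toLowerCase())); // prints "0000-0000-0000-0000"
```

With only hundreds of items the index is cheap to rebuild, but it would go stale whenever documents change, so it suits the temporary use described here rather than a long-term design.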

Source: https://stackoverflow.com/questions/47596474

Updated: 2022-04-14 09:04

Best answer

There is very good documentation about the testthat package on this wiki: https://github.com/hadley/devtools/wiki/Testing
In a nutshell, you can embed multiple expect_that statements in each test_that.
Towards the end of the page, in the section 'Testing files and directories', there is information about the three different reporters (stop, minimal and summary).
I have found this to be quite robust. Even if test_that finds an error, it simply reports the error and carries on with the remainder of the tests.
PS. In my experience, the test results are printed to the console. I run my tests from within the R environment, not from the OS command line.
