首页 \ 问答 \ Elasticsearch查询字符串查询不能与同义词分析器一起使用(Elasticsearch Query String Query not working with synonym analyzer)

Elasticsearch查询字符串查询不能与同义词分析器一起使用(Elasticsearch Query String Query not working with synonym analyzer)

我正在尝试使用同义词配置弹性搜索。

这些是我的设置:

                "analysis": {
                    "analyzer": {
                        "category_synonym": {
                            "tokenizer": "whitespace",
                            "filter": [
                                "synonym_filter"
                            ]
                        }
                    },
                    "filter": {
                        "synonym_filter": {
                            "type": "synonym",
                            "synonyms_path": "synonyms.txt"
                        }
                    }
                }

映射配置:

        "category": {
            "properties": {
                "name": {
                    "type":"string",
                    "search_analyzer" : "category_synonym",
                    "index_analyzer" : "standard",
                    "fields": {
                        "raw": {
                            "type":  "string",
                            "index": "not_analyzed"
                        }
                    }
                }
            }
        }

以及我的同义词列表

film => video,
ooh => panels , poster,
commercial => advertisement,
print => magazine

我必须说我正在使用Elasticsearch Java API。 我正在使用QueryBuilders.queryStringQuery因为这是我将分析器设置为我的请求的唯一方法。 所以,当我在做的时候:

QueryBuilders.queryStringQuery("name:film").analyzer(analyzer)

它归还给我

[
  {
    "id": 71,
    "name": "Pitch video",
    "description": "... ",
    "parent": null
  },
  {
    "id": 25,
    "name": "Video",
    "description": "... ",
    "parent": null
  }
]

这对我来说是完美的,但是当我打电话的时候

QueryBuilders.queryStringQuery("name:vid").analyzer(analyzer)

我希望它应该返回相同的对象,但没有任何东西: []

所以,我在queryStringQuery添加了星号:

QueryBuilders.queryStringQuery("name:vid*").analyzer(analyzer)

效果很好,但现在

QueryBuilders.queryStringQuery("name:film*").analyzer(analyzer)

给我回复[]

那么,当我搜索videovidfilmfilm时,如何配置我的弹性搜索它将返回相同的对象?

提前致谢!


I am trying to configure elastic search with synonyms.

These are my settings:

                "analysis": {
                    "analyzer": {
                        "category_synonym": {
                            "tokenizer": "whitespace",
                            "filter": [
                                "synonym_filter"
                            ]
                        }
                    },
                    "filter": {
                        "synonym_filter": {
                            "type": "synonym",
                            "synonyms_path": "synonyms.txt"
                        }
                    }
                }

Mappings config:

        "category": {
            "properties": {
                "name": {
                    "type":"string",
                    "search_analyzer" : "category_synonym",
                    "index_analyzer" : "standard",
                    "fields": {
                        "raw": {
                            "type":  "string",
                            "index": "not_analyzed"
                        }
                    }
                }
            }
        }

And the list of my synonyms

film => video,
ooh => panels , poster,
commercial => advertisement,
print => magazine

I must say that I am using Elasticsearch Java API. I am using QueryBuilders.queryStringQuery because this is the only way how I set analyzers to my request. So, when I am making:

QueryBuilders.queryStringQuery("name:film").analyzer(analyzer)

It returns me

[
  {
    "id": 71,
    "name": "Pitch video",
    "description": "... ",
    "parent": null
  },
  {
    "id": 25,
    "name": "Video",
    "description": "... ",
    "parent": null
  }
]

That is perfect for me, but when I am calling something like this

QueryBuilders.queryStringQuery("name:vid").analyzer(analyzer)

I expect that it should return same objects, but there is nothing: []

So, I added asterisk to queryStringQuery:

QueryBuilders.queryStringQuery("name:vid*").analyzer(analyzer)

Works well, but now

QueryBuilders.queryStringQuery("name:film*").analyzer(analyzer)

returns me []

So, how can I configure my elastic search that it will return same objects when I am searching video, vid, film and fil?

Thanks in advance!


原文:https://stackoverflow.com/questions/43502362
更新时间:2022-05-12 10:05

最满意答案

连续显示两种方法:

1)使用额外的explode功能结合list功能:

$customstring = "Eye Width=3/4 in|Finish=Nickel|Hook Opening=7/16 in|Locking Type=Spring Loaded Plunger|Material=Zinc Die Cast|Mounting=Swivel Eye|Overall Length [Nom]=3 1/2 in|Type=Swiveled Securing Hook|Wt.=0.09 lb";

$pairs = explode("|", $customstring);
$result = [];
foreach ($pairs as $p) {
    list($k, $v) = explode('=', $p);
    $result[$k] = $v;
}

print_r($result);

2)另一种替代解决方案是使用preg_match_allarray_combine函数:

$customstring = "Eye Width=3/4 in|Finish=Nickel|Hook Opening=7/16 in|Locking Type=Spring Loaded Plunger|Material=Zinc Die Cast|Mounting=Swivel Eye|Overall Length [Nom]=3 1/2 in|Type=Swiveled Securing Hook|Wt.=0.09 lb";
preg_match_all("/([^=|]+)=([^+|]+)/", $customstring, $m);
$result = array_combine($m[1], $m[2]);

print_r($result);

输出(两种方法都相同):

Array
(
    [Eye Width] => 3/4 in
    [Finish] => Nickel
    [Hook Opening] => 7/16 in
    [Locking Type] => Spring Loaded Plunger
    [Material] => Zinc Die Cast
    [Mounting] => Swivel Eye
    [Overall Length [Nom]] => 3 1/2 in
    [Type] => Swiveled Securing Hook
    [Wt.] => 0.09 lb
)

Two approaches shown consecutively:

1) Use additional explode function combined with list function:

$customstring = "Eye Width=3/4 in|Finish=Nickel|Hook Opening=7/16 in|Locking Type=Spring Loaded Plunger|Material=Zinc Die Cast|Mounting=Swivel Eye|Overall Length [Nom]=3 1/2 in|Type=Swiveled Securing Hook|Wt.=0.09 lb";

$pairs = explode("|", $customstring);
$result = [];
foreach ($pairs as $p) {
    list($k, $v) = explode('=', $p);
    $result[$k] = $v;
}

print_r($result);

2) Another alternative solution would be using preg_match_all and array_combine functions:

$customstring = "Eye Width=3/4 in|Finish=Nickel|Hook Opening=7/16 in|Locking Type=Spring Loaded Plunger|Material=Zinc Die Cast|Mounting=Swivel Eye|Overall Length [Nom]=3 1/2 in|Type=Swiveled Securing Hook|Wt.=0.09 lb";
preg_match_all("/([^=|]+)=([^+|]+)/", $customstring, $m);
$result = array_combine($m[1], $m[2]);

print_r($result);

The output(same for both approaches):

Array
(
    [Eye Width] => 3/4 in
    [Finish] => Nickel
    [Hook Opening] => 7/16 in
    [Locking Type] => Spring Loaded Plunger
    [Material] => Zinc Die Cast
    [Mounting] => Swivel Eye
    [Overall Length [Nom]] => 3 1/2 in
    [Type] => Swiveled Securing Hook
    [Wt.] => 0.09 lb
)

相关问答

更多

相关文章

更多

最新问答

更多
  • 您如何使用git diff文件,并将其应用于同一存储库的副本的本地分支?(How do you take a git diff file, and apply it to a local branch that is a copy of the same repository?)
  • 将长浮点值剪切为2个小数点并复制到字符数组(Cut Long Float Value to 2 decimal points and copy to Character Array)
  • OctoberCMS侧边栏不呈现(OctoberCMS Sidebar not rendering)
  • 页面加载后对象是否有资格进行垃圾回收?(Are objects eligible for garbage collection after the page loads?)
  • codeigniter中的语言不能按预期工作(language in codeigniter doesn' t work as expected)
  • 在计算机拍照在哪里进入
  • 使用cin.get()从c ++中的输入流中丢弃不需要的字符(Using cin.get() to discard unwanted characters from the input stream in c++)
  • No for循环将在for循环中运行。(No for loop will run inside for loop. Testing for primes)
  • 单页应用程序:页面重新加载(Single Page Application: page reload)
  • 在循环中选择具有相似模式的列名称(Selecting Column Name With Similar Pattern in a Loop)
  • System.StackOverflow错误(System.StackOverflow error)
  • KnockoutJS未在嵌套模板上应用beforeRemove和afterAdd(KnockoutJS not applying beforeRemove and afterAdd on nested templates)
  • 散列包括方法和/或嵌套属性(Hash include methods and/or nested attributes)
  • android - 如何避免使用Samsung RFS文件系统延迟/冻结?(android - how to avoid lag/freezes with Samsung RFS filesystem?)
  • TensorFlow:基于索引列表创建新张量(TensorFlow: Create a new tensor based on list of indices)
  • 企业安全培训的各项内容
  • 错误:RPC失败;(error: RPC failed; curl transfer closed with outstanding read data remaining)
  • C#类名中允许哪些字符?(What characters are allowed in C# class name?)
  • NumPy:将int64值存储在np.array中并使用dtype float64并将其转换回整数是否安全?(NumPy: Is it safe to store an int64 value in an np.array with dtype float64 and later convert it back to integer?)
  • 注销后如何隐藏导航portlet?(How to hide navigation portlet after logout?)
  • 将多个行和可变行移动到列(moving multiple and variable rows to columns)
  • 提交表单时忽略基础href,而不使用Javascript(ignore base href when submitting form, without using Javascript)
  • 对setOnInfoWindowClickListener的意图(Intent on setOnInfoWindowClickListener)
  • Angular $资源不会改变方法(Angular $resource doesn't change method)
  • 在Angular 5中不是一个函数(is not a function in Angular 5)
  • 如何配置Composite C1以将.m和桌面作为同一站点提供服务(How to configure Composite C1 to serve .m and desktop as the same site)
  • 不适用:悬停在悬停时:在元素之前[复制](Don't apply :hover when hovering on :before element [duplicate])
  • 常见的python rpc和cli接口(Common python rpc and cli interface)
  • Mysql DB单个字段匹配多个其他字段(Mysql DB single field matching to multiple other fields)
  • 产品页面上的Magento Up出售对齐问题(Magento Up sell alignment issue on the products page)