首页 \ 问答 \ Solr MoreLike这不适用于多个分片？(Solr MoreLikeThis not working for multiple shards?)

Solr MoreLike这不适用于多个分片？(Solr MoreLikeThis not working for multiple shards?)

 我在SolrCloud中有5个节点集群，每个节点有2个分片，  
 Solr版本：6.3.0  
   
 现在，当我运行mlt查询时，它只返回每个节点的结果，并且不会将它们分布在所有分片/节点上，即  
 http://10.0.1.15:8983/solr/test_ingest/mlt?q=advertising_id%w72w9424620427042&fl=score&fl=advertising_id&mlt.fl=channel_name&mlt.fl=show_name&mlt.fl=language&mlt.mindf=1  
 没有结果  
 http://10.0.1.119:8983/solr/test_ingest/mlt?q=advertising_id%w72w9424620427042&fl=score&fl=advertising_id&mlt.fl=channel_name&mlt.fl=show_name&mlt.fl=language&mlt.mindf=1  
 给出结果，  
 我甚至尝试将其指定为param：  
 碎片= 10.0.1.84：8983 / solr的/ test_ingest_shard3_replica1,10.0.1.84：8983 / solr的/ test_ingest_shard8_replica1,10.0.1.206：8983 / solr的/ test_ingest_shard2_replica1,10.0.1.206：8983 / solr的/ test_ingest_shard7_replica1,10.0.1.15：8983 / solr的/ test_ingest_shard5_replica1,10.0.1.15：8983 / solr的/ test_ingest_shard10_replica1,10.0.1.207：8983 / solr的/ test_ingest_shard1_replica1,10.0.1.207：8983 / solr的/ test_ingest_shard6_replica1,10.0.1.119：8983 / solr的/ test_ingest_shard9_replica1,10.0.1.119：8983 / solr的/ test_ingest_shard4_replica1  
 我的请求处理程序  
 <requestHandler name="/mlt" class="solr.MoreLikeThisHandler">
 </requestHandler>
 
 如何配置mlt以运行分布式搜索？ 谢谢 

I have 5 node cluster in SolrCloud, with 2 shards per node, 
Solr version:6.3.0 
 
now when I run mlt query it only returns result per node and doesn't distribute them over all shards/nodes, i.e 
http://10.0.1.15:8983/solr/test_ingest/mlt?q=advertising_id%w72w9424620427042&fl=score&fl=advertising_id&mlt.fl=channel_name&mlt.fl=show_name&mlt.fl=language&mlt.mindf=1  
gives no results while  
http://10.0.1.119:8983/solr/test_ingest/mlt?q=advertising_id%w72w9424620427042&fl=score&fl=advertising_id&mlt.fl=channel_name&mlt.fl=show_name&mlt.fl=language&mlt.mindf=1 
gives results, 
I have even tried specifying this as param: 
shards=10.0.1.84:8983/solr/test_ingest_shard3_replica1,10.0.1.84:8983/solr/test_ingest_shard8_replica1,10.0.1.206:8983/solr/test_ingest_shard2_replica1,10.0.1.206:8983/solr/test_ingest_shard7_replica1,10.0.1.15:8983/solr/test_ingest_shard5_replica1,10.0.1.15:8983/solr/test_ingest_shard10_replica1,10.0.1.207:8983/solr/test_ingest_shard1_replica1,10.0.1.207:8983/solr/test_ingest_shard6_replica1,10.0.1.119:8983/solr/test_ingest_shard9_replica1,10.0.1.119:8983/solr/test_ingest_shard4_replica1 
My request handler: 
 <requestHandler name="/mlt" class="solr.MoreLikeThisHandler">
 </requestHandler>
 
How do I configure mlt to run a distributed search? Thanks

原文：https://stackoverflow.com/questions/43390064

更新时间：2024-05-16 06:05

最满意答案

 一种方法是@Victor Sorokin在他的回答中提出：将每个文件的处理封装在Runnable ，然后提交给Executor或仅从主线程调用run() 。  
 另一种可能性是始终在Runnable执行相同的包装并将其提交给始终给定的 Executor 。  
 每个文件的处理是否同时执行取决于给定的Executor的实现。  
 对于并行处理，您可以调用传递它的函数，即ThreadPoolExecutor作为参数，而对于顺序处理，您可以传入一个伪Executor ，即在调用程序线程中运行提交的任务的执行程序：  
public class FakeExecutor implements Executor {

    @Override
    public void execute(Runnable task) {
        task.run();
    }
}
 
 我相信这种方式是最灵活的方法。 

One way is as @Victor Sorokin suggests in his answer: wrap the processing of every file in a Runnable and then either submit to an Executor or just invoke run() from the main thread. 
Another possibility is to always do the same wrapping in a Runnable and submit it to an always-given Executor. 
Whether processing of each file is executed concurrently or not would depend on the given Executor's implementation.  
For parallel processing, you could invoke your function passing it i.e. a ThreadPoolExecutor as an argument, whereas for sequential processing you could pass in a fake Executor, i.e. one that runs submitted tasks in the caller thread: 
public class FakeExecutor implements Executor {

    @Override
    public void execute(Runnable task) {
        task.run();
    }
}
 
I believe this way is the most flexible approach.

Solr MoreLike这不适用于多个分片？(Solr MoreLikeThis not working for multiple shards?)

最满意答案

相关问答

误解单线程和多线程编程之间的区别(Misunderstanding the difference between single-threading and multi-threading programming)[2022-04-10]

如何在单线程中使用java实现多线程操作系统？(How should implement multi-thread in single-threaded Operating system using java?)[2022-01-21]

如何在Rust中的自定义单线程迭代器上并行`映射（...）`？(How to parallely `map(…)` on a custom, single-threaded iterator in Rust?)[2022-03-07]

Redis是单线程的，那么它如何并行I / O？(Redis is single-threaded, then how does it do concurrent I/O?)[2023-04-10]

将多线程可能性添加到单线程全文件目录迭代器实用程序函数中(Adding multi-threading possibility to a single-threaded all-files-in-directory iterator utility function)[2023-06-09]

Haskell多线程有多难？(How difficult is Haskell multi-threading?)[2019-11-21]

编译单线程v。多线程（和lib命名约定）的重要性？(Importance of compiling single-threaded v. multi-threaded (and lib naming conventions)?)[2023-04-27]

是javascript单线程吗？(Is javascript single-threaded?)[2023-06-02]

使用Swing进行多线程处理(Multi-threading with Swing)[2023-10-29]

Mapper使用多线程？(Mapper using multi-threading?)[2022-03-16]

相关文章

最新问答