首页 \ 问答 \ Solr,仅在EdgeNGramFilterFactory匹配时返回结果(Solr, only returning results if EdgeNGramFilterFactory matches)

Solr,仅在EdgeNGramFilterFactory匹配时返回结果(Solr, only returning results if EdgeNGramFilterFactory matches)

我正在搜索一个多值字段,定义为:

<fieldType name="text" class="solr.TextField" positionIncrementGap="100">
    <analyzer type="index">
        <tokenizer class="solr.StandardTokenizerFactory"/>
        <filter class="solr.LowerCaseFilterFactory"/>
        <filter class="solr.EdgeNGramFilterFactory" minGramSize="2" maxGramSize="15" 
                side="front"/>
    </analyzer>
    <analyzer type="query">
        <tokenizer class="solr.StandardTokenizerFactory"/>
        <filter class="solr.LowerCaseFilterFactory"/>
    </analyzer>
</fieldType>

但是,如果我正在搜索超过15的maxGramSize的单词,我就不会得到任何结果。 所以用以下索引:

vedlikeholdsskjema på toyota corolla Gode ting  kiwi kan spises med skall, banan

我得到的结果是“kiwi”,“banan”等,但不是“vedlikeholdsskjema”。 我想我可以增加maxGramSize大小,但我认为StandardTokenizerFactoryLowerCaseFilterFactory在这种情况下会给我匹配。

这是分析页面的屏幕截图 分析

我正在运行Solr 4.7.0。


I'm searching a multivalued field which is defined as:

<fieldType name="text" class="solr.TextField" positionIncrementGap="100">
    <analyzer type="index">
        <tokenizer class="solr.StandardTokenizerFactory"/>
        <filter class="solr.LowerCaseFilterFactory"/>
        <filter class="solr.EdgeNGramFilterFactory" minGramSize="2" maxGramSize="15" 
                side="front"/>
    </analyzer>
    <analyzer type="query">
        <tokenizer class="solr.StandardTokenizerFactory"/>
        <filter class="solr.LowerCaseFilterFactory"/>
    </analyzer>
</fieldType>

But if I'm searching for words which are longer than the maxGramSize of 15, I don't get any results. So with the following indexed:

vedlikeholdsskjema på toyota corolla Gode ting  kiwi kan spises med skall, banan

I get results on "kiwi", "banan" etc. but not on "vedlikeholdsskjema". I guess I could increase the maxGramSize size, but I would have thought the StandardTokenizerFactory and LowerCaseFilterFactory would have given me matches in this case.

Here is screenshot of the analysis page analysis.

I'm running Solr 4.7.0.


原文:https://stackoverflow.com/questions/28849289
更新时间:2024-02-22 22:02

最满意答案

这是一些Msdn示例: XPath示例 ; XPath参考

根据您的代码段,您可以选择包含属性数据奇数的所有标签TD

private void Form1_Load(object sender, EventArgs e)
{

        string url = "http://www.betexplorer.com/soccer/sweden/allsvenskan/results/";
        HtmlWeb web = new HtmlWeb();
        HtmlAgilityPack.HtmlDocument doc = web.Load(url);
        var nodes = doc.DocumentNode.SelectNodes("//td[@data-odd]");
        foreach (HtmlNode node in nodes)
        {
            //Here process your node
            //Example: to get data-odd value
            var val = node.GetAttributeValue("data-odd", "");
        }
}

Here it is some Msdn Examples: XPath Examples; XPath Reference

According to your snippet you could select all tags TD that contain attribute data-odd.

private void Form1_Load(object sender, EventArgs e)
{

        string url = "http://www.betexplorer.com/soccer/sweden/allsvenskan/results/";
        HtmlWeb web = new HtmlWeb();
        HtmlAgilityPack.HtmlDocument doc = web.Load(url);
        var nodes = doc.DocumentNode.SelectNodes("//td[@data-odd]");
        foreach (HtmlNode node in nodes)
        {
            //Here process your node
            //Example: to get data-odd value
            var val = node.GetAttributeValue("data-odd", "");
        }
}

相关问答

更多

相关文章

更多

最新问答

更多
  • 如何使用自由职业者帐户登录我的php网站?(How can I login into my php website using freelancer account? [closed])
  • 如何打破按钮上的生命周期循环(How to break do-while loop on button)
  • C#使用EF访问MVC上的部分类的自定义属性(C# access custom attributes of a partial class on MVC with EF)
  • 如何获得facebook app的publish_stream权限?(How to get publish_stream permissions for facebook app?)
  • 如何并排放置两个元件?(How to position two elements side by side?)
  • 在MySQL和/或多列中使用多个表用于Rails应用程序(Using multiple tables in MySQL and/or multiple columns for a Rails application)
  • 如何隐藏谷歌地图上的登录按钮?(How to hide the Sign in button from Google maps?)
  • Mysql左连接旋转90°表(Mysql Left join rotate 90° table)
  • 带有ImageMagick和许多图像的GIF动画(GIF animation with ImageMagick and many images)
  • 电脑高中毕业学习去哪里培训
  • 电脑系统专业就业状况如何啊?
  • IEnumerable linq表达式(IEnumerable linq expressions)
  • 如何在Spring测试中连接依赖关系(How to wire dependencies in Spring tests)
  • Solr可以在没有Lucene的情况下运行吗?(Can Solr run without Lucene?)
  • 如何保证Task在当前线程上同步运行?(How to guarantee that a Task runs synchronously on the current thread?)
  • 在保持每列的类的同时向数据框添加行(Adding row to data frame while maintaining the class of each column)
  • 的?(The ? marks in emacs/haskell and ghc mode)
  • 一个线程可以调用SuspendThread传递自己的线程ID吗?(Can a thread call SuspendThread passing its own thread ID?)
  • 延迟socket.io响应,并“警告 - websocket连接无效”(Delayed socket.io response, and “warn - websocket connection invalid”)
  • 悬停时的图像转换(Image transition on hover)
  • IIS 7.5仅显示homecontroller(IIS 7.5 only shows homecontroller)
  • 没有JavaScript的复选框“关闭”值(Checkbox 'off' value without JavaScript)
  • java分布式框架有哪些
  • Python:填写表单并点击按钮确认[关闭](Python: fill out a form and confirm with a button click [closed])
  • PHP将文件链接到根文件目录(PHP Linking Files to Root File Directory)
  • 我如何删除ListView中的项目?(How I can remove a item in my ListView?)
  • 您是否必须为TFS(云)中的每个BUG创建一个TASK以跟踪时间?(Do you have to create a TASK for every BUG in TFS (Cloud) to track time?)
  • typoscript TMENU ATagParams小写(typoscript TMENU ATagParams lowercase)
  • 武陟会计培训类的学校哪个好点?
  • 从链接中删除文本修饰(Remove text decoration from links)