首页 \ 问答 \ 使用php实现自动完成的Solr配置(Solr configuration for autocompletion implementation with php)

使用php实现自动完成的Solr配置(Solr configuration for autocompletion implementation with php)

 我如何索引我的数据并在solr中配置solr和我的搜索选项，可以实现具有以下要求的自动完成（如谷歌）：  
 产品： - 我们的产品有标题，描述，id，例如标题：toshiba tecra s1：centrino 1.5 ghz / xp pro / 15.0“tft / 40 gb / 256 mb + 256mb / cd-rw-dvd-rom / lan / wi-fi - 此产品的此产品或字段必须以下列方式编制索引（如果用户开始输入，则无法区分用户搜索searchterm的方式，例如TOSHIBA或tOSHiba）前三个字符“tos”最多20个结果（完整标题（短语）例如“toshiba tecra s1：centrino 1.5 ghz / xp pro / 15.0”tft / 40 gb / 256 mb + 256mb / cd-rw-dvd-rom / lan / wi-fi“）应出现在自动完成框中。 - 如果用户输入两个术语“toshiba tecra”，则搜索结果必须更加精确，并且只显示所有文档，其中包含（连贯的）术语“toshiba tecra”  
 获得任何提示，使用什么样的tokenizer / searchcomponent等会很棒。  
 我正在使用solr版本3.5  
 谢谢oyur想法Ramo 

how do i have to index my data and configure solr and my search options in solr, that an autocompletion (like google) with the following requirements is possible: 
Products: - We have products with their titles, descriptions, id's, e.g. for the title: toshiba tecra s1: centrino 1.5 ghz/xp pro/15.0" tft/40 gb/256 mb+256mb/cd-rw-dvd-rom/lan/wi-fi - this products or fields of this product has to be indexed in such a way that the following should be possible (no differentation how a user search for the searchterm, e.g. TOSHIBA or tOSHiba) - if a user starts entering the first three characters "tos" max. 20 results (the complete title (phrase) e.g. "toshiba tecra s1: centrino 1.5 ghz/xp pro/15.0" tft/40 gb/256 mb+256mb/cd-rw-dvd-rom/lan/wi-fi") should appear in the autocomplete box. - if a user enters e.g. two terms "toshiba tecra" the searchresult must be more precisly and just all documents should be shown, that contain the (coherent) terms "toshiba tecra" 
It would be great to get any hints for this, what kind of tokenizer/searchcomponent etc. to use. 
I'm using solr Version 3.5 
Thank you for oyur thoughts Ramo

原文：https://stackoverflow.com/questions/8459570

更新时间：2022-10-30 07:10

最满意答案

 您可以使用-1来始终获取最后一部分而不是第二部分。  
df['c'] = df['b'].apply(lambda x: x.split("'")[-1])

print(df)

#    a        b      c
# 0  1     ciao   ciao
# 1  2    hotel  hotel
# 2  3  l'hotel  hotel 
 
 但是，请记住，如果您有两个或更多撇号的字符串，这将会制动（但您的要求无论如何都没有指定在这些情况下要做什么）。 

You can use -1 to always get the last part rather than the second part. 
df['c'] = df['b'].apply(lambda x: x.split("'")[-1])

print(df)

#    a        b      c
# 0  1     ciao   ciao
# 1  2    hotel  hotel
# 2  3  l'hotel  hotel 
 
However, keep in mind that this will brake if you have have strings with 2 or more apostrophes (but your requirement doesn't specify what to do in these cases anyway).

使用php实现自动完成的Solr配置(Solr configuration for autocompletion implementation with php)

最满意答案

相关问答

在pandas数据框中填充一列字符串(Pad a column of strings in a pandas dataframe)[2022-02-04]

为pandas [duplicate]中每个字符串的出现创建一个新列(Create a new column for each occurence of character string in pandas [duplicate])[2023-05-10]

如何在Pandas的列中删除特殊字符前面的部分字符串？(How to remove part of string ahead of special character in a column in Pandas?)[2024-01-16]

熊猫在字符后删除列中的所有字符串(Pandas remove all of a string in a column after a character)[2024-01-13]

如何在pandas数据框列中的数值之前删除字符串？(How to remove strings before a numeric value in a pandas dataframe column?)[2022-03-07]

从Pandas DataFrame列中删除部分字符串(Removing part of string from Pandas DataFrame column)[2022-01-03]

第三次出现某些字符后删除部分字符串(Remove part of string after third occurrence of certain character)[2023-03-11]

如何使用jquery删除字符串中的以下符号/特殊字符¶(How to remove following symbol/special character in string using jquery ¶)[2023-06-24]

Pandas DataFrame：删除非数字字符后的所有内容(Pandas DataFrame: Remove everything after a non-digit character)[2022-06-04]

从字符串中删除单个特殊字符(Remove a single Special Character from a String)[2022-04-13]

相关文章

最新问答