首页 \ 问答 \ SOLR指数减少(SOLR index size reduction)

SOLR指数减少(SOLR index size reduction)

对于一个大型项目,我们有一些大规模的SOLR索引,并且其消耗的空间超过50 GB。

我们已经考虑了几种缩小与改变指数内容相关的规模的方法,但我很好奇是否可以对SOLR指数做出任何改变,从而将其规模缩小2个数量级或更多,它们与(1)我们可以运行的维护命令或(2)可能没有正确设置的简单配置参数直接相关。

另一个相关的问题是(3)是否有一种方法可以在SOLR内部交易业绩指数大小,如果是,它将如何工作?

任何想法,将不胜感激......谢谢!


We have a some massive SOLR indices for a large project, and its consuming above 50 GB of space .

We have considered several ways to reduce the size that are related to changing the content in the indices, but I am curious of wether or not there might be any changes we can make to a SOLR index which will reduce its size by 2 orders of magnitude or more, which are directly related to either (1) maintainance commands we can run or (2) simple configuration parameters which may not be set right.

Another relevant question is (3) Is there a way to trade index size for performance inside of SOLR, and if so , how would it work ?

Any thoughts on this would be appreciated... Thanks!


原文:https://stackoverflow.com/questions/10080881
更新时间:2023-07-24 18:07

最满意答案

,不是因为恶意用户只能访问该URL

www.site.com/domain/product.php?id=123'; DROP TABLE products;

并完全绕过你的mod_rewrite。 虽然只会重写整数,但是要重写它们的URL仍然可用且可访问。 无论何时进行SQL查询,都应使用PDO等清理当前进入查询的每个输入。


No it does not because a malicious user could simply go to the URL

www.site.com/domain/product.php?id=123'; DROP TABLE products;

and bypass your mod_rewrite entirely. Although only integers will be rewritten, the URL that they are being rewritten to use is still live and accessible. Any time you are making an SQL query you should sanitize every input going into your query at that point using PDO etc.

相关问答

更多
  • 试试这个: RewriteEngine on RewriteRule ^([A-Za-z0-9-]+)$ main.php?user=$1 [NC,L] Try this: RewriteEngine on RewriteRule ^([A-Za-z0-9-]+)$ main.php?user=$1 [NC,L]
  • 对于Apache,您可以使用mod_ssl强制使用SSLRequireSSL Directive SSL: 除非针对当前连接启用了HTTP over SSL(即HTTPS),否则此伪指令禁止访问。 这在启用SSL的虚拟主机或目录中非常方便,以防止发生应该受保护的内容的配置错误。 当此指令出现时,所有拒绝使用SSL的请求都被拒绝。 这不会重定向到https。 要重定向,请尝试以下与.htaccess文件中的mod_rewrite RewriteEngine On RewriteCond %{HTTPS} != ...
  • 不 ,不是因为恶意用户只能访问该URL www.site.com/domain/product.php?id=123'; DROP TABLE products; 并完全绕过你的mod_rewrite。 虽然只会重写整数,但是要重写它们的URL仍然可用且可访问。 无论何时进行SQL查询,都应使用PDO等清理当前进入查询的每个输入。 No it does not because a malicious user could simply go to the URL www.site.com/domain/p ...
  • 我对mod重写并不“很棒”,但你可以创建一个超出普通文件夹结构的代理脚本,并使用PHP来获取这些变量并将其传递回应用程序。 I am not "great" with mod rewrite but you can create a proxy script that is outside your normal folder structures and uses PHP to grab that variables and pass it back up to your application.
  • 如果在将数据放入查询之前未正确清理数据,则应始终认为自己容易受到SQL注入攻击。 使用.htaccess重写不是也不应该被视为防止SQL注入的做法。 使用mysql_real_escape_string或pg_escape_string (如果使用PHP)或使用POST或GET变量对所有查询使用pg_escape_string准备语句。 总是。 You should always consider yourself vulnerable to SQL injection attack if you don' ...
  • 重写引擎循环,因此当引擎循环时,重写到查询字符串的最后一条规则会被第一条规则捕获。 您需要匹配%{THE_REQUEST}变量,而不是匹配%{QUERY_STRING}变量,以便它不受其他重写的影响: RewriteEngine On RewriteCond %{THE_REQUEST} /results\.php\?query1=([^&]+)&query2=([^&\ ]+) [NC] RewriteRule ^results. ...
  • 由于您的网址和文件的名称非常相似,因此与MultiView可能会有一些冲突。 尝试禁用它: Options -MultiViews There might be some conflicts with MultiViews as your URL and files have a very similar name. Try to disable it: Options -MultiViews
  • 垂直条表示逻辑OR ,并允许您在“注册”之后指定尾部斜杠。 我更喜欢用'?' 斜线后,使其成为可选: RewriteRule ^registration/?$ /index.php The vertical bar stands for a logical OR, and lets you specify either a trailing slash after 'registration' or not. I prefer using a '?' after the slash, making it ...
  • 改变这一行: RewriteCond %{QUERY_STRING} ^.*(\[|\]|\(|\)|<|>|ê|"|;|\?|\*|=$).* [NC,OR] 至: RewriteCond %{QUERY_STRING} ^.*(\[|\]|\(|\)|<|>|ê|"|;|\?|\*).* [NC,OR] Change this line: RewriteCond %{QUERY_STRING} ^.*(\[|\]|\(|\)|<|>|ê|"|;|\?|\*|=$).* [NC,OR] to: Re ...
  • 你的第一套规则: #redirect domain.com to www.domain.com RewriteCond %{HTTP_HOST} !^www\.domain\.com$ [NC] RewriteRule .+ http://www.domain.com%{REQUEST_URI} [R=301,NC] 如果您没有空请求,则取代您拥有的任何子域。 第二套规则接近你想要的。 但是它们循环并且它们不会将多个子域条目转换为目录路径,因此请首先尝试使用这组规则: # Prevent looping ...

相关文章

更多

最新问答

更多
  • 您如何使用git diff文件,并将其应用于同一存储库的副本的本地分支?(How do you take a git diff file, and apply it to a local branch that is a copy of the same repository?)
  • 将长浮点值剪切为2个小数点并复制到字符数组(Cut Long Float Value to 2 decimal points and copy to Character Array)
  • OctoberCMS侧边栏不呈现(OctoberCMS Sidebar not rendering)
  • 页面加载后对象是否有资格进行垃圾回收?(Are objects eligible for garbage collection after the page loads?)
  • codeigniter中的语言不能按预期工作(language in codeigniter doesn' t work as expected)
  • 在计算机拍照在哪里进入
  • 使用cin.get()从c ++中的输入流中丢弃不需要的字符(Using cin.get() to discard unwanted characters from the input stream in c++)
  • No for循环将在for循环中运行。(No for loop will run inside for loop. Testing for primes)
  • 单页应用程序:页面重新加载(Single Page Application: page reload)
  • 在循环中选择具有相似模式的列名称(Selecting Column Name With Similar Pattern in a Loop)
  • System.StackOverflow错误(System.StackOverflow error)
  • KnockoutJS未在嵌套模板上应用beforeRemove和afterAdd(KnockoutJS not applying beforeRemove and afterAdd on nested templates)
  • 散列包括方法和/或嵌套属性(Hash include methods and/or nested attributes)
  • android - 如何避免使用Samsung RFS文件系统延迟/冻结?(android - how to avoid lag/freezes with Samsung RFS filesystem?)
  • TensorFlow:基于索引列表创建新张量(TensorFlow: Create a new tensor based on list of indices)
  • 企业安全培训的各项内容
  • 错误:RPC失败;(error: RPC failed; curl transfer closed with outstanding read data remaining)
  • C#类名中允许哪些字符?(What characters are allowed in C# class name?)
  • NumPy:将int64值存储在np.array中并使用dtype float64并将其转换回整数是否安全?(NumPy: Is it safe to store an int64 value in an np.array with dtype float64 and later convert it back to integer?)
  • 注销后如何隐藏导航portlet?(How to hide navigation portlet after logout?)
  • 将多个行和可变行移动到列(moving multiple and variable rows to columns)
  • 提交表单时忽略基础href,而不使用Javascript(ignore base href when submitting form, without using Javascript)
  • 对setOnInfoWindowClickListener的意图(Intent on setOnInfoWindowClickListener)
  • Angular $资源不会改变方法(Angular $resource doesn't change method)
  • 在Angular 5中不是一个函数(is not a function in Angular 5)
  • 如何配置Composite C1以将.m和桌面作为同一站点提供服务(How to configure Composite C1 to serve .m and desktop as the same site)
  • 不适用:悬停在悬停时:在元素之前[复制](Don't apply :hover when hovering on :before element [duplicate])
  • 常见的python rpc和cli接口(Common python rpc and cli interface)
  • Mysql DB单个字段匹配多个其他字段(Mysql DB single field matching to multiple other fields)
  • 产品页面上的Magento Up出售对齐问题(Magento Up sell alignment issue on the products page)