SolrCloud/ZooKeeper Optimization

一切依旧

2014-11-11


SolrCloud optimization:

1: CPU clock speed

2: ZooKeeper tuning items. Reference: http://zookeeper.apache.org/doc/trunk/zookeeperAdmin.html

Things to Avoid

Here are some common problems you can avoid by configuring ZooKeeper correctly:

inconsistent lists of servers

The list of ZooKeeper servers used by the clients must match the list of ZooKeeper servers that each ZooKeeper server has. Things work okay if the client list is a subset of the real list, but things will really act strange if clients have a list of ZooKeeper servers that are in different ZooKeeper clusters. Also, the server lists in each Zookeeper server configuration file should be consistent with one another.
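The consistency rule above can be checked mechanically. The following is a minimal sketch, assuming each server's zoo.cfg uses the usual `server.N=host:peerPort:electionPort` lines and that clients use a comma-separated connect string with an optional chroot suffix. The function names are hypothetical helpers, not part of ZooKeeper or Solr, and IPv6 addresses are not handled.

```python
def parse_ensemble(zoo_cfg_text):
    """Return the set of hosts named in server.N=host:port:port lines."""
    hosts = set()
    for line in zoo_cfg_text.splitlines():
        line = line.strip()
        if line.startswith("server.") and "=" in line:
            _, value = line.split("=", 1)
            hosts.add(value.split(":")[0].strip())
    return hosts


def parse_connect_string(connect_string):
    """Return the set of hosts in a client string such as 'zk1:2181,zk2:2181/solr'."""
    servers = connect_string.split("/", 1)[0]  # drop an optional chroot suffix
    return {item.split(":")[0].strip() for item in servers.split(",") if item.strip()}


def check_server_lists(connect_string, zoo_cfgs):
    """Return a list of problems; an empty list means the lists are consistent.

    zoo_cfgs maps a label (hypothetical, e.g. a host name) to that server's zoo.cfg text.
    """
    if not zoo_cfgs:
        return ["no server configurations given"]
    problems = []
    ensembles = {name: parse_ensemble(text) for name, text in zoo_cfgs.items()}
    reference_name, reference = next(iter(ensembles.items()))
    # Every server must agree on the same ensemble membership.
    for name, hosts in ensembles.items():
        if hosts != reference:
            problems.append(
                f"{name} lists {sorted(hosts)}, but {reference_name} lists {sorted(reference)}"
            )
    # The client list may be a subset of the ensemble, but must not name outsiders.
    unknown = parse_connect_string(connect_string) - reference
    if unknown:
        problems.append(f"client lists servers outside the ensemble: {sorted(unknown)}")
    return problems
```

A client string that names only some of the ensemble members passes, which matches the "subset is okay" remark in the quoted text.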

incorrect placement of transaction log

The most performance-critical part of ZooKeeper is the transaction log. ZooKeeper syncs transactions to media before it returns a response. A dedicated transaction log device is key to consistently good performance. Putting the log on a busy device will adversely affect performance. If you only have one storage device, put trace files on NFS and increase the snapshotCount; it doesn't eliminate the problem, but it should mitigate it.
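One way to catch a misplaced transaction log is to compare the devices behind `dataDir` and `dataLogDir` in zoo.cfg. The sketch below assumes those two standard zoo.cfg keys and uses the filesystem device ID from `os.stat` as a rough proxy for "dedicated device". The helper names are hypothetical. Two partitions on the same physical disk would still report different IDs, so a clean result is only a hint.

```python
import os


def parse_properties(text):
    """Parse simple key=value lines, skipping blanks and # comments."""
    props = {}
    for line in text.splitlines():
        line = line.strip()
        if line and not line.startswith("#") and "=" in line:
            key, value = line.split("=", 1)
            props[key.strip()] = value.strip()
    return props


def log_placement_warnings(zoo_cfg_text, device_of=lambda path: os.stat(path).st_dev):
    """Return warnings about where the transaction log will be written.

    device_of is injectable so the check can be exercised without real paths.
    """
    props = parse_properties(zoo_cfg_text)
    data_dir = props.get("dataDir")
    log_dir = props.get("dataLogDir")
    if data_dir is None:
        return ["dataDir is not set"]
    warnings = []
    if log_dir is None:
        # Without dataLogDir, ZooKeeper writes transaction logs under dataDir.
        warnings.append("dataLogDir is not set, so transaction logs share dataDir")
    elif device_of(log_dir) == device_of(data_dir):
        warnings.append("dataLogDir and dataDir are on the same device")
    return warnings
```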

incorrect Java heap size

You should take special care to set your Java max heap size correctly. In particular, you should not create a situation in which ZooKeeper swaps to disk. The disk is death to ZooKeeper. Everything is ordered, so if processing one request swaps the disk, all other queued requests will probably do the same. DON'T SWAP.

Be conservative in your estimates: if you have 4G of RAM, do not set the Java max heap size to 6G or even 4G. For example, it is more likely you would use a 3G heap for a 4G machine, as the operating system and the cache also need memory. The best and only recommended practice for estimating the heap size your system needs is to run load tests, and then make sure you are well below the usage limit that would cause the system to swap.
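As a rough pre-check before load testing, the heap setting can be compared against physical RAM. The sketch below parses the standard `-Xmx` JVM flag. The 0.75 ceiling is only an assumption taken from the "3G heap on a 4G machine" example above, not a rule from the ZooKeeper documentation, and the function names are hypothetical.

```python
import re

_UNITS = {"k": 1024, "m": 1024 ** 2, "g": 1024 ** 3}


def parse_xmx(jvm_flags):
    """Return the max heap in bytes from a -Xmx flag, or None if absent."""
    match = re.search(r"-Xmx(\d+)([kKmMgG]?)", jvm_flags)
    if not match:
        return None
    number, unit = match.groups()
    return int(number) * _UNITS.get(unit.lower(), 1)


def heap_is_conservative(jvm_flags, total_ram_bytes, max_fraction=0.75):
    """True if the heap leaves room for the OS and file cache.

    max_fraction is an illustrative threshold (assumption); load tests decide the real limit.
    """
    heap = parse_xmx(jvm_flags)
    if heap is None:
        raise ValueError("no -Xmx flag found")
    return heap <= total_ram_bytes * max_fraction
```

With 4G of RAM, `-Xmx3g` passes this check while `-Xmx4g` and `-Xmx6g` do not. Passing the check does not replace the load test the quoted text recommends.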

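3: Every maxBufferedDocs buffered documents are flushed as one segment, and every mergeFactor segments are merged into a single index file. Tune maxBufferedDocs and mergeFactor appropriately to optimize indexing.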
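To see how the two parameters interact, here is a toy simulation. It is a deliberately simplified model of log-style merging, not Lucene's actual merge policy: each flush writes one segment of max_buffered_docs documents, and whenever the newest merge_factor segments have equal size they are merged into one. The function name is hypothetical.

```python
def simulate_segments(total_docs, max_buffered_docs, merge_factor):
    """Return (segment sizes oldest first, number of merges) after indexing.

    Documents left in the RAM buffer (total_docs % max_buffered_docs) are not
    flushed in this model and therefore do not appear in the result.
    """
    if max_buffered_docs < 1 or merge_factor < 2:
        raise ValueError("max_buffered_docs must be >= 1 and merge_factor >= 2")
    segments = []
    merges = 0
    for _ in range(total_docs // max_buffered_docs):
        segments.append(max_buffered_docs)  # flush the full buffer as a new segment
        # Cascade: merge while the newest merge_factor segments are the same size.
        while (len(segments) >= merge_factor
               and len(set(segments[-merge_factor:])) == 1):
            merged = sum(segments[-merge_factor:])
            del segments[-merge_factor:]
            segments.append(merged)
            merges += 1
    return segments, merges
```

In this model, indexing 10000 documents with max_buffered_docs=1000 and merge_factor=10 ends with one segment after a single merge. With merge_factor=3 it ends with two segments (9000 and 1000 documents) after four merges. A smaller mergeFactor keeps fewer segments for searches to visit but spends more I/O on merging during indexing.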

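4: Clicking the Optimize button in the Solr admin UI merges the single index files into one index file. Optimize is a highly I/O-intensive task, and if the Solr data is updated frequently, the optimized index will not last long before it has to be optimized again.

5: Reference: http://www.solr.cc/blog/?p=788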

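1. Update frequency: how large is the daily data increment, and is data updated continuously or on a schedule?
2. Total data volume: how long must the data be retained?
3. Consistency requirements: how soon should updated data become visible, and what is the longest acceptable delay?
4. Data characteristics: which data sources are involved, and what is the average size of a single record?
5. Business characteristics: what sorting requirements and search conditions are there?
6. Resource reuse: what is the existing hardware configuration, and is an upgrade planned?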
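The answers to questions 1, 2, and 4 combine into a first rough volume estimate. The sketch below computes only the raw data volume. The resulting index size depends on the schema and on stored fields and is not modeled here. The function name and the sample figures are hypothetical.

```python
def estimate_raw_volume(docs_per_day, avg_record_bytes, retention_days):
    """Return (total documents, total raw bytes) held over the retention period."""
    total_docs = docs_per_day * retention_days
    return total_docs, total_docs * avg_record_bytes


# Hypothetical workload: 1,000,000 records per day, 2 KiB each, kept for 90 days.
docs, raw_bytes = estimate_raw_volume(1_000_000, 2048, 90)
print(docs, raw_bytes)  # prints: 90000000 184320000000
```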
