Cloudera Runtime Release NotesPDF version

What's New in Cloudera Search

Learn about the new features of Cloudera Search in Cloudera Runtime 7.2.8.

Cloudera Search now supports using local file system instead of HDFS. For more information, see Local file system support.

SolrCell is reintroduced to Cloudera Search with limited functionality, which means that it can only be used with Morphlines as the JAR is not added to the Solr server classpath.

The Tomcat HTTP client library uses a socket timeout of 10 minutes. Spark Crunch Indexer does not override this value, and in case indexing a single batch takes more than 10 minute the entire indexing job fails. This can happen especially if the morphlines contain DeleteByQuery requests.

As of this release, you can configure the socket timeout for the connection in the morphline file by adding the solrClientSocketTimeout parameter to the solrLocator command.

For example:

SOLR_LOCATOR :
{ 
  collection : test_collection 
  zkHost : "zookeeper1.example.com:2181/solr" 
# 20 minutes in milliseconds 
  solrClientSocketTimeout: 1200000  
  # Max number of documents to pass per RPC from morphline to Solr Server
  # batchSize : 10000

}

We want your opinion

How can we improve this page?

What kind of feedback do you have?