Dev:Solr

Solr and YaCy integration

YaCy can store document metadata and plain text in remote Solr indexes. This can be activated with a single click (see below). We are currently changing the YaCy architecture to include a Solr core as the embedded index by default.

Architecture

The remote index schema is similar to the one used by SolrCell, but extended; see http://wiki.apache.org/solr/ExtractingRequestHandler

Because this default schema is used, the schema from the default Solr example can be used as the Solr configuration. This is also the schema that Solr uses when documents are imported with Apache Tika.
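To make the idea of a SolrCell-like document concrete, the following is a minimal sketch of how one document could be serialized as a Solr XML update message. The `<add><doc><field name="...">` layout is Solr's standard XML update format. The field names (`id`, `title`, `content_type`, `text`) and the helper `build_add_message` are illustrative assumptions only; the fields YaCy actually writes are those listed in the integration servlet.

```python
import xml.etree.ElementTree as ET


def build_add_message(doc):
    """Serialize a dict of field name -> value(s) as a Solr XML <add> message.

    Hypothetical helper for illustration; not part of YaCy or Solr.
    """
    add = ET.Element("add")
    doc_el = ET.SubElement(add, "doc")
    for name, value in doc.items():
        # A multi-valued field is written as repeated <field> elements.
        values = value if isinstance(value, (list, tuple)) else [value]
        for v in values:
            field = ET.SubElement(doc_el, "field", name=name)
            field.text = str(v)
    return ET.tostring(add, encoding="unicode")


# Assumed, illustrative field names; the real schema has many more fields.
example = {
    "id": "example-doc-1",
    "title": "Example page",
    "content_type": "text/html",
    "text": "Plain text extracted from the page.",
}
message = build_add_message(example)
print(message)
```

A message of this shape is what a Solr update handler accepts; a client would send it to the update endpoint of the Solr server it is attached to.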


How to attach Solr

Federated Solr storage is switched off by default in YaCy, but you can switch it on at http://localhost:8090/IndexFederated_p.html

To attach a Solr server do the following:

It is not yet possible to search the attached Solr index through YaCy, but that may become an option in the future.
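Since YaCy itself does not search the attached index, one way to check that documents are arriving is to query Solr directly. The sketch below builds a standard Solr select request and reads the `numFound` value from a JSON response. The base URL `http://localhost:8983/solr` is an assumption (Solr's usual default port and a single-core layout); the helper names and the sample response body are hypothetical and only illustrate the response shape.

```python
import json
from urllib.parse import urlencode


def build_select_url(solr_base, query="*:*", rows=0):
    """Build a Solr select URL that asks for a JSON response."""
    params = urlencode({"q": query, "rows": rows, "wt": "json"})
    return solr_base.rstrip("/") + "/select?" + params


def parse_num_found(response_body):
    """Extract the total hit count from a Solr JSON select response."""
    return json.loads(response_body)["response"]["numFound"]


# Assumed Solr location; adjust to the server you attached in YaCy.
url = build_select_url("http://localhost:8983/solr")

# Hand-written sample body, not real output from any server.
sample = '{"response": {"numFound": 42, "start": 0, "docs": []}}'
count = parse_num_found(sample)
```

Fetching `url` with any HTTP client and passing the body to `parse_num_found` gives the number of indexed documents; with `rows=0` no documents are returned, only the count.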

Screenshot of the Solr integration servlet at http://localhost:8090/IndexFederated_p.html

... and there are many more attribute fields!

Remarks

This functionality is now available for the following purposes:

  • 1) to compare the functionality and the search speed of Solr and YaCy
  • 2) to let people who want to use Solr instead of YaCy use YaCy as a search appliance, when they need a crawler or the other source harvesting methods that YaCy provides (such as Dublin Core reading, Wikimedia dump reading, RSS feed reading, etc.)
  • 3) to experiment with and explore future uses of Solr inside YaCy