The Best Open Source Searching Platform

Erik Hatcher uses Solr should that be good enough? He uses Ruby on Rail and Solr in conjunction to support http://www.nines.org/.

So what is Solr?

In a nutshell Solr is a wrapper around Lucene which provides all of Lucene’s functionality as a web service.

Here is the description from Apache-Con
Apache Solr, a Lucene based full-text search server, with XML/HTTP interfaces, declarative specification of data types and text analysis with a schema, extensive caching, index replication, and a web admin interface. Solr is optimized for high volume low latency web traffic and has support for faceted browsing and dynamic results grouping.

Whats so great about Solr?

  • Replication: Solr can copy itself and still guarantee read access. Very nice for high availability and scalability
  • Language Independent: Solr makes searching with Lucene a web service, now you can access a Lucene collection in any language.
  • Support for Faceted Searching: Solr provides the infrastructure for faceted search with a plug in module and open bit-sets

Where does Solr shine

Solr works well with highly structured schemes, where the data stored is know well. Solr works great as a full text searching engine, which an update capability. It doesn’t work well as a front end database, the complexity of managing the system for rapid updates is just too hard.

Nutch is another application which uses Lucene, IMHO it looks like a great tool to support unstructured search when your need to scan through and index a lot of different documents.

Red-Piranha is another project which may interest some, but I’m not sure its active any more :(

Leave a Reply