Rdig

A Ruby-based library for indexing and searching web sites. RDig provides an HTTP crawler and content extraction utilities to help build a site search for web sites or intranets. Internally, it uses Ferret for full-text indexing. After you create a config file for your site, the index can be built with a single call to rdig. For parsing crawled HTML pages, hpricot and rubyful_soup are supported.
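
To give a rough idea of the crawl, extract, index, and search steps described above, here is a minimal toy sketch in plain Ruby. It is not RDig's or Ferret's API: the names PAGES, extract_words, build_index, and search are hypothetical, the "crawled" pages are hard-coded strings rather than fetched over HTTP, and the tag stripping is a naive regular expression rather than a real HTML parser.

```ruby
# Toy illustration of the crawl -> extract -> index -> search pipeline.
# This is NOT RDig's or Ferret's API; every name below is hypothetical.

# Stand-in for crawled pages: URL => raw HTML (no network access needed).
PAGES = {
  "http://example.com/" =>
    "<html><body><h1>Welcome</h1><p>Ruby search demo</p></body></html>",
  "http://example.com/about" =>
    "<html><body><p>About this Ruby site</p></body></html>"
}

# Content extraction: naively strip tags, then split into lowercase words.
def extract_words(html)
  html.gsub(/<[^>]*>/, " ").downcase.scan(/[a-z0-9]+/)
end

# Indexing: build an inverted index mapping each word to the URLs containing it.
def build_index(pages)
  index = Hash.new { |hash, word| hash[word] = [] }
  pages.each do |url, html|
    extract_words(html).uniq.each { |word| index[word] << url }
  end
  index
end

# Searching: return the URLs whose text contains the given word.
def search(index, word)
  index.fetch(word.downcase, [])
end

INDEX = build_index(PAGES)
puts search(INDEX, "ruby").inspect
puts search(INDEX, "welcome").inspect
```

A real full-text engine such as Ferret additionally handles tokenization, ranking, and on-disk storage, none of which this sketch attempts.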


Homepage: http://rdig.rubyforge.org/