TopAlter.com

Heritrix Alternatives

Heritrix Alternatives

Heritrix

Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

Heritrix (sometimes spelled heretrix, or misspelled or mis-said as heratrix/heritix/ heretix/heratix) is an archaic word for heiress (woman who inherits). Since our crawler seeks to collect and preserve the digital artifacts of our culture for the benefit of future researchers and generations, this name seemed apt.

Best Heritrix Alternatives for Web

You're looking for the best programs similar to Heritrix. Check out our top picks. Below, let's see if there are any Heritrix alternatives that support your platform.

Algolia

Algolia

Free PersonalWebAndroid SDKRubyPythonJavaScriptAngularJScURLRuby on RailsNode.JSObjective-C

Algolia helps product teams connect their users with information by providing the building blocks they need to create fast, relevant, personalized search.

Features:

  • Api
  • Developer Tools
  • Full text search
  • Indexed search
  • Real-time
  • REST API
  • Search engine
  • Search-server
Mixnode

Mixnode

CommercialWeb

Mixnode is a fast, flexible, massively scalable platform to extract and analyze data from the web. Mixnode allows you to think of all resources on the web as rows in...

Features:

  • Content-Type Filtering
  • Support for Amazon S3
  • URL Filtering
  • WARC Output
Google Custom Search Engine

Google Custom Search Engine

FreemiumWeb

With Google Custom Search, add a search box to your homepage to help people find what they need on your website.

Features:

  • Embeddable
  • Search engine

Upvote Comparison

Interest Trends

Heritrix Reviews

Add your reviews & share your experience when using Heritrix to the world. Your opinion will be useful to others who are looking for the best Heritrix alternatives.

Copyright © 2021 TopAlter.com

Sites we Love: AnswerBun, MenuIva, UKBizDB, Sharing RPP