Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
Heritrix (sometimes spelled heretrix, or misspelled or mis-said as heratrix/heritix/ heretix/heratix) is an archaic word for heiress (woman who inherits). Since our crawler seeks to collect and preserve the digital artifacts of our culture for the benefit of future researchers and generations, this name seemed apt.
Need an alternative to Heritrix? Read on. We've looked at the best Heritrix alternatives available for Windows, Mac and Android.
Algolia helps product teams connect their users with information by providing the building blocks they need to create fast, relevant, personalized search.
Features:
Mixnode is a fast, flexible, massively scalable platform to extract and analyze data from the web. Mixnode allows you to think of all resources on the web as rows in...
Features:
With Google Custom Search, add a search box to your homepage to help people find what they need on your website.
Features:
Expertrec custom search started as a replacement for google site search. It adds super-fast search autocomplete, spell correct, search listing pages to your website.
Features:
Apache Nutch is a highly extensible and scalable open source web crawler software project. Nutch is coded entirely in the Java programming language, but data is...
Features:
StormCrawler is an open source SDK for building distributed web crawlers with Apache Storm.
Search over millions of documents, and give to your users unique, amazing and unforgettable experiences.
Features:
Add your reviews & share your experience when using Heritrix to the world. Your opinion will be useful to others who are looking for the best Heritrix alternatives.
Popular Alternatives
iOS Alternatives
Android Alternatives
Copyright © 2021 TopAlter.com
Sites we Love: AnswerBun, MenuIva, UKBizDB, Sharing RPP