The Lemur Project develops search engines, browser toolbars, text analysis tools, and data resources that support research and development of information retrieval and text mining software. The project is best known for its Indri search engine, Lemur Toolbar, and ClueWeb09 dataset. Our software and datasets are used widely in scientific and research applications, as well as in some commercial applications.
Indri is a search engine that provides state-of-the-art text search and a rich structured query language for text collections of up to 50 million documents (single machine) or 500 million documents (distributed search). Available for Linux, Solaris, Windows and Mac OSX.
Features
Powerful Query Interface
Supports popular structured query operators from INQUERY
Suffix-based wildcard term matching
Field retrieval
Passage retrieval
Flexible Indexing and Document Support
Supports UTF-8 encoded text
Language independent tokenization of UTF-8 encoded documents.
Parses PDF, HTML, XML, and TREC documents
Word and PowerPoint parsing (Windows only)
Text Annotations
Document Metadata
Package Versatility
Open source, with a flexible BSD-inspired license
Includes both command line tools and a Java user interface
API can be used from Java, PHP, or C++
Works on Windows, Linux, Solaris and Mac OS X
Scalability and Efficiency
Best-in-class ad hoc retrieval performance
Can be used on a cluster of machines for faster indexing and retrieval
Scales to terabyte-sized collections
Download
Indri can be obtained from the SourceForge Lemur Project Page.
Release History
The first version (1.0) of Indri was released in Jan 2002. Subsequent releases have been made 2-3 times each year since then. Release notes for the current release can be found on SourceForge.
If you want similar software to Lemur Project, we have a list for that. Are there Lemur Project alternatives out there? Let's find out.
DocFetcher is a portable German/English open source desktop search application. It allows you search the contents of documents on your computer. - You can think of it as...
Features:
Agent Ransack is a tool for finding files and information on your hard drive fast and efficiently. .
Features:
File search utility that provides instant search by file name, and powerful search by file contents, size and date. Supports search inside PDF, Microsoft Office and...
Features:
Instantly find files, e-mails, and attachments stored anywhere on your PC with the free Copernic Desktop Search Home. Copernic Desktop Search (CDS) enables you to...
Features:
Full-text morphological search among documents and e-books of dozens of file types, including text in archives and disk images; on the your desktop, LAN, clouds (WebDAV)...
Features:
dtSearch - The Smart Choice for Text Retrieval since 1991.
Features:
"InSight Desktop Search" is a search engine that keeps track of all the files in your system and makes sure that you can access your files/folders easily and...
SSuite Desktop Search is a useful and extremely fast windows desktop search engine that can find files, folders, and file content. SSuite Desktop Search and Find is...
Features:
Exalead Desktop Search Instantly locates any document stored on local resources like internal and external hard drives, network drives and USB keys .
Features:
Add your reviews & share your experience when using Lemur Project to the world. Your opinion will be useful to others who are looking for the best Lemur Project alternatives.
Popular Alternatives
iOS Alternatives
Android Alternatives
Copyright © 2021 TopAlter.com
Sites we Love: AnswerBun, MenuIva, UKBizDB, Sharing RPP