Hyper Scalable Scales to 200 billion full pages and 100,000 servers.
Efficient Uses very few computers to support a huge index and a large number of queries per second.
Gigabot Gigablast's fast and feature-rich spider is highly configurable.
Real-Time URLs are indexed in real-time. Link analysis is done on the fly.
Intelligent Update Determines the update cycle of each document and tries to spider it at that frequency.
Dual Mode Uses idle cycles to spider and index documents, but will quickly yield resources to handle incoming queries.
Maintainable Comprehensive web-based GUI controls make it easy to administer.
Spam Protection Features a large array of anti-spam tools and algorithms used to keep spam out of the index.
Document Cache Has a cache to hold user-viewable copies of the pages it spiders and indexes. Obeys nocache meta tags as specified here.
Multiple Formats Indexes PDF, Micorosoft Word, Power Point , Excel and Postscript documents.
Dynamic Summaries Search result summaries are generated so that they contain the query terms.
Term Highlighting Performs query term highlighting on the view of cached pages.
Robust Query Syntax Features many different field searches, + and - operators.
Advanced Search Allows users to perform power searches quickly and easily.
Super Recall Returns extra results.
Default AND Capable Can easily limit search results to only pages that have all the query terms.
Boolean Queries Supports nested boolean queries using AND, OR and NOT operators.
Turing Test Uses simple Turing test to prevent real-time addurl abuse.
Redundancy If one server goes down then its twins take over for it.
Error Correction Corrupted data is automatically detected and patched from a mirror host.
Load Balancing Gigablast intelligently distributes load evenly among all hosts in the network.
Collections Allows the administrator to partition the index into many sub indexes.