Gigablast is a powerful, new open-source search engine that does real-time indexing.
Scalable to thousands of servers.
Has scaled to over 12 billion web pages on over 200 servers.
A dual quad-core machine with 32GB of RAM and two 160GB Intel SSDs, running 8 Gigablast instances, can do about 8 qps (queries per second) on an index of 10 million pages. The drives will be close to maximum storage capacity. Doubling the index size will more or less halve the qps rate. (Performance can be made about ten times faster, but I have not gotten around to it yet. Drive space usage will probably remain about the same because it is already pretty efficient.)
1 million web pages require 28.6GB of drive space. That includes the index, meta information and the compressed HTML of all the web pages.
Spider rate is around 1 page per second per core, so a dual quad-core machine can spider and index 8 pages per second, which is 691,200 pages per day.
4GB of RAM is required per Gigablast instance (instance = process).
Live demo at http://www.gigablast.com/
Written in C/C++ for optimal performance.
Over 500,000 lines of C/C++.
100% custom. A single binary. The web server, database and everything else are all contained in this source code in a highly efficient manner, which makes administration and troubleshooting easier.
Reliable. Has been tested in live production since 2002 on billions of queries on an index of over 12 billion unique web pages, 24 billion mirrored.
Super fast and efficient. One of a small handful of search engines that have hit such big numbers. The only open source search engine that has.
Supports all languages. Can give results in specified languages a boost over others at query time. Uses UTF-8 representation internally.
Track record. Has been used by many clients. Has been successfully used in distributed enterprise software.
Cached web pages with query term highlighting. ...
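The sizing figures quoted above (28.6GB per million pages, about 1 page per second per core, 4GB of RAM per instance, and about 8 qps at 10 million pages) can be turned into a rough capacity estimate. The sketch below is only back-of-the-envelope arithmetic on those numbers. The function names are hypothetical and not part of Gigablast, and the qps model simply assumes the "doubling the index halves qps" rule holds as an exact inverse proportion, which the description only states approximately.

```python
# Rough capacity arithmetic based on the figures quoted in the description.
# All function names here are illustrative assumptions, not Gigablast APIs.

GB_PER_MILLION_PAGES = 28.6     # index + meta information + compressed HTML
PAGES_PER_SEC_PER_CORE = 1      # quoted spider rate
RAM_GB_PER_INSTANCE = 4         # one instance = one process
BASELINE_QPS = 8.0              # quoted for the dual quad-core setup
BASELINE_PAGES = 10_000_000     # index size at which BASELINE_QPS was quoted
SECONDS_PER_DAY = 86_400


def disk_gb(pages: int) -> float:
    """Estimated drive space in GB for an index of `pages` web pages."""
    return pages / 1_000_000 * GB_PER_MILLION_PAGES


def pages_per_day(cores: int) -> int:
    """Estimated pages spidered and indexed per day on `cores` cores."""
    return cores * PAGES_PER_SEC_PER_CORE * SECONDS_PER_DAY


def ram_gb(instances: int) -> int:
    """RAM in GB needed to run `instances` Gigablast processes."""
    return instances * RAM_GB_PER_INSTANCE


def estimated_qps(pages: int) -> float:
    """Assumed model: qps is inversely proportional to index size."""
    return BASELINE_QPS * BASELINE_PAGES / pages


def days_to_spider(pages: int, cores: int) -> float:
    """Days needed to spider `pages` pages at the quoted per-core rate."""
    return pages / pages_per_day(cores)


if __name__ == "__main__":
    pages, cores, instances = 10_000_000, 8, 8
    print(f"disk:   {disk_gb(pages):.1f} GB")
    print(f"RAM:    {ram_gb(instances)} GB")
    print(f"spider: {pages_per_day(cores):,} pages/day")
    print(f"crawl:  {days_to_spider(pages, cores):.1f} days")
    print(f"qps:    {estimated_qps(pages):.1f}")
```

For the quoted machine this gives about 286GB of index on 320GB of SSD, which matches the remark that the drives will be close to full.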
Unlimited searching: no restrictions on number of searches, index size, document limit or file types you can index. Best UX available: our latest AJAX-based end-user interface is smooth and quick, exactly what users will expect. Craft a custom experience: build the best internal search for your app by using our open API and......
With an intuitive interface, ongoing innovation and the highest possible accuracy, RankTrackr strives to be the top rank tracker on the market.......
Banckle Site Search provides a free and secure search engine solution for your websites. Its advanced control panel allows you to re-index or schedule automatic indexing, and provides website visitors with relevant search results for exact phrases. They can also search most common file types like HTML, text, XML, Microsoft......
This is a site search engine script that uses MySQL to store your website's indexed pages, to add search functionality to your web site. It is built with PHP and JavaScript, and the search results are loaded via Ajax. The search system combines MySQL full text with SQL regexp, and word weights according......
Algolia provides a developer-friendly RESTful API for website and app instant search. Most web services and mobile apps, such as Spotify, Salesforce or Amazon, need to provide fast and meaningful access to database objects via a simple search box. People want to find songs, invoices, products in just a......
A small daemon that can index information using the new crawler.
* very fast crawling
* very small memory footprint
* no hammering of the system
* pluggable backend, currently clucene and hyperestraier, sqlite3 and xapian are in the works
* communication between daemon and search program over an abstract interface, this is currently a......
THE SEARCH ENGINE FROM EUROPE. 100% NEUTRAL, SECURE & CLEAN. MADE & HOSTED IN GERMANY.
- More than one Search Engine
- Unbiased search results
- Sustainable and private search
- So easy yet powerful
Search the whole web without being limited to a single search engine. We search through 20 selected web, info,......
We deliver proven metasearch technologies, white label tools and industry-standard best practices that have been tested and implemented on our own properties. Our long-standing relationships with Google, Yahoo! and Yandex enable us to provide greater......
nmzmail is a tool, primarily to be used with mutt, for indexing and searching maildir folders. Based on the result of a search query using the search engine namazu2, nmzmail generates a maildir folder containing symbolic links to the mail(s) matching the query. A simple mutt macro makes it easy......
Cuil (pronounced /ˈkuːl/ "cool" according to the creators) was a search engine that organized web pages by content and displayed relatively long entries along with thumbnail pictures for many results. It claimed to have a larger index than any other search engine, with about 120 billion web pages. It went......
Patent data doesn't have to be difficult to work with. IP Street provides semantic patent search, claim text analytics, automated due diligence, and clean patent data, all available at an API endpoint.......
Notmuch is a system for indexing, searching, reading, and tagging large collections of email messages. It uses the Xapian library to provide fast, full-text search of very large collections of email with a very convenient search syntax. It's more of a library than a real search engine, but nevertheless interesting.......
Site Plow is a free tool for information exploration on the Internet. An easy-to-use web app for building advanced search queries for Google and Bing. The tool also integrates your search with some of the most popular sites on the internet.......
ZoomEye is a search engine for Cyberspace. In ancient Chinese legends, there's a famous ghost buster named Zhong Kui. Just like him, ZoomEye was created for hunting the demons in Cyberspace.......
Welcome to enack.net, the world's meanest search engine. Enack is a search engine website with an unusual and radical user experience and interface design. We are redefining what people think search engine websites should be and how they interact with search sites in general.......
© 2015-2016 www.tpsort.com, Inc