search engines : Java Glossary

go to home page S words local find full screen, hide local find menu Google search web for more information on this topic jump to foot of page translate this page with Babelfish by Roedy Green ©1996-2009 Canadian Mind Products
index page for letter ⇒ punctuation 0-9 A B C D E F G H I J K L M N O P Q R S T U V W X Y Z (all)
CurrCon neededThe CurrCon Java Applet displays prices on this web page converted with today’s exchange rates into your local international currency, e.g. Euros, US dollars, Canadian dollars, British Pounds, Indian Rupees… CurrCon requires Java 1.1 or later, preferably 1.6.0_11. If you can’t see the prices, or if you just want to learn more about CurrCon, click here for help.
sherlock Holmes’ hatsearch engines
Search engines help you find material on the web. They are free. They make their money by showing you ads while you search. This entry will list the major search engines, tell you how to submit to them, and discuss future search engine technology.
Introduction Google Robots
General Search Obscure and National Search Engines Rant
Legend Set Up Your Own Search Engine Meta Words
Search Engine Sites Submit Your Website to Mulitiple Engines Links

Introduction

Search engines help you find material on the web. They are free. They make their money by showing you ads while you search. Submitting your site to search engines for indexing is a major way to increase traffic to your site.

My personal favourite is Google. It weeds out the trash and goes straight for the gold. If you are looking for a particular class of product, give the name of several examples rather than trying to describe the category. About 40% of people use Yahoo, which is not a true search engine. It in more a catalog of webpages, manually created.

General Search

JavaScript Required
General Search
Search for: ⇐ what to search for
Web Engine:         ⇐ which web search engine to use
Span:  |   |   |   |   |  ⇐ breadth of search
Website: return results from website: ⇐ any one particular website of interest?
Preferences: Advanced  |  Preferences  | Languages
Results: Return results |   |   
Google:
Yahoo:
Glossary:
eBay:
Bookstores:
Domains:

Legend

Submit Colour Code
submit code meaning
(add) Free to list
Pay to list
Does not accept submissions

Search Engine Sites

Name URL Submit Notes
GoodSearch logo goodsearch.com (add) Donates to your selected charity when you use it. You can add your favourite charity to its list. Powered by Yahoo. Has an optional toolbar. It tracks your previous choice of favourite charity with a cookie.
logo www.google.com (USA)
www.google.ca (Canada)
www.google.fr (France)
www.google.de (Germany)
www.google.ie(Ireland)
www.google.co.in (India)
www.google.co.il (Israel)
www.google.it (Italy)
www.google.co.jp (Japan)
www.google.co.nz (NZ)
www.google.ru (Russia)
www.google.es (Spain)
www.google.se (Sweden)
www.google.co.uk (UK)
others
(add)
(add Canada)

Limiting Searches

  • Limiting a Search To A Single Site

    Use site:mindprod.com in your search criteria to limit search to one site.
  • Finding a similar Page

    Use related:mindprod.com/religion/god.html Find pages similar to this URL.
  • Find a Definition

    Use define:zeugma definitions of a term.
  • Find a Website

    Use inurl:sun find websites with the word “sun” in the domain name.
  • Finding links To a Site

    Use link:mindprod.com to find links to that website from other websites. Find out who is talking about you.
  • Excluding a word

    nuts -almonds means documents containg he word nuts, but not documents also containing the word almonds.
  • Exact phrase

    "peanut butter" with the quotes, insists the words appear in that order with nothing in between.
  • Relaxed Search

    "almonds OR nuts" means get documents containing either almonds or nuts or both. A normal search insists on all the words.
  • Extension filetype

    filetype:pdf gets only Adobe pdf documents. filetype:html gets only html documents, (especially useful for Google Desktop that indexes more file types.) -filetype:html gets everything but pdf documents.
You can get at other limiters with Google Advanced Search checkboxes, then look at at the HTTP query generated, and learn to compose it directly yourself.

Google Parts

Google has many parts for searching and other services:
web maps translation
newsgroups earth Apple Macintosh
images desktop Directory Categories
videos toolbar Gmail
books Services & Tools iGoolgle
news retail catalogs Code
800go logo www.800go.com (add) née Magellan. Works by linking to 12 other search engines.
about logo www.about.com (add) human guides, categories and subcategorise links.
All The Web www.alltheweb.com (add) Good for finding very obscure entries. aka FAST. They now get their data from Yahoo.
logo www.altavista.com (add) Very large database — indexes everything on the web pages, not just the keywords. Popular with job recruiters. Tends to overwhelm. No longer has a free personal version called Discovery for searching your own hard disks. Buried on the main menu, on the left bottom, in tiny type, is "submit a URL" to submit your web pages for indexing. It takes one to two weeks for them to index your site.
logo www.altavistacanada.com (add) Like Altavista, but with more emphasis on Canadian content. Buried on the main menu, on the left, in tiny type, is "add a URL" to submit your web pages for indexing.
aol logo search.aol.com (add) I could not figure out how to add an URL.
logo www.askJeeves.com (add) I quite like this site because it collects data from several other search engines, and filters out most of the junk. The owners have also done quite a bit of work organising answers to common queries pointing you right away to a good site. I found it most useful for example for researching holidays. To submit a URL you write to url@askjeeves.com and a human decides if, where and how to include it in Jeeve’s database of questions.
BWP Web Search
www.blackwebportal.com (add) black community interest.
Copernic
logo
copernic.com It uses mama.com which queries other search engines. The image search is particularly good. Ties into the Copernic desktop search to index and search your desktop machine.
DMOZ
logo
dmoz.org (add) Has a Yahoo-like manual category system.
Dogpile
logo
www.dogpile.com Works by asking other search engines in parallel to look. Does not accept submissions.
excite logo www.excite.com (add) Slightly cleverer at putting most important hits first better than Altavista. Excite the company is buying up other search Engine companies such as Magellan. Excite also sells search engines to sites that want an index just of their site. The "add your site" link is very easy to overlook. It is just below the thin horizontal blue line. It takes two to six weeks for them to index your site.
Go Network
go.com logo
www.go.com (add)  
HotBot
HotBot Logo
www.hotbot.com (add) Now associated with either Ask Jeeve or Google. Formerly associated with Lycos.
HoundDog
HoundDog Logo
www.hounddog.com (add)  
Ilectric
ilectric logo
ilectric.com Metasearch that uses other search engines then compiles the results by category. Also has a great whois.
Infoseek www.infoseek.com (add) Defunct now Go.com.
InfoSpace
InfoSpace logo
www.infospace.com (add)  
iWon
Iwon Logo
www.iWon.com (add) a sort of Publisher’s Clearing house prize site.
Jayde
Jayde logo
www.jayde.com (add)  
LastMinuteSearch www.lastminutesearch.com (add) links to various national search engines.
LookSmart www.looksmart.com (add)  
Lycos
logo
www.lycos.com (add) Grandfather of all search engines. It takes two to six weeks for them to index your site. Lycos (and brothers) form the second largest database, second only to Google. However, it refreshes it’s 2.1 billion pages every nine to eleven days. It now collapses multiple hits to the same website to one hit per website.
Lycos Germany
logo
www.lycos.de (add) Only for Germany
Lycos Italia
logo
www.lycos.it (add) Only for Italy
InFind defunct.
MapPlanet www.mapplanet.com (add) Find things by latitude and longitude on a map. You can claim a cell for your web page.
Miva
logo
www.miva.com Must pay to list. Search rankings given to the highest bidder.
logo multimeta.com (add) Dogpile style uses Acoon, Altavista, Viola, Excite, Hotbot, Lycos, MSN, Infoseek, Yahoo.
MSN search.msn.com (add)
Northern Light
logo
www.northernlight.com Now a subscription based business news engine.
Overture
Overture Search the Web.
www.overture.com Must pay to list. Bid for rankings. Née GoTo.com.
PlanetOcean www.searchenginenews.com Search Engine News, paid journal
Power Search accessweb.ws (add) New kid on the block. Not much content yet. Crude category scheme.
Real Names www.realnames.com Must pay to list.
RocketLinks www.rocketlinks.com Must pay to list. Search rankings given to the highest content-providing bidder.
Search.com www.search.com (add) Searches through 200 search engines, auctions and newsgroups. Née SavvySearch. The ultimate domain name for a search engine.
SearchHound
logo
www.searchhound.com Must pay to list. Search rankings given to the highest bidder.
sympatico logo www.sympatico.msn.ca Has optional Canada-only filter. Lycos/MSN affiliate. Requires cookies turned on. Does not appear to accept submissions.
The Yellow Pages
TheYellowPages logo
www.theyellowpages.com (add)
TopClick www.topclick.com (add)  
Torrentz www.torrentz.com (add) a search engine for BitTorrents
WebCrawler www.webcrawler.com (add) originally owned by AOL, now part of Excite. It takes two to six weeks for them to index your site.
xrefer www.xrefer.com (can’t add) Index to published reference works such as dictionaries and encyclopedias.
Yahoo
logo
www.yahoo.com Has an organised library-like directory structure, not just keyword search. Sometimes free, but sometimes you must pay $199.00 USD (or $600.00 USD if there is any adult content to be listed). I think they are cutting their throats with these extremely high fees. It takes them 8 to 12 weeks to index your site. See the suggest a site button on each page of the category tree to submit.

Obscure and National Search Engines

Name URL Submit Notes
123 India www.123india.com (add) India Related Only
7Search 7search.com (add)  
AAA Australia www.aaa.com.au (add)  
Aache aache.com (add) Spanish
Aardvark www.aardvark.co.za (add) South African Only
Adm City admcity.com (add)  
AEIWI www.aeiwi.com (add)  
All Deal www.alldeal.com (add)  
Aqueous www.aqueous.com (add) Water Related Only
Austronaut austronaut.at (add)  
Banners Shoppe www.bannershoppe.com (add)  
Big Finder www.bigfinder.de (add)  
Big Stuff www.bigstuff.com (add) domain name search
Catalog Central www.catalogcentral.com (add) free mail order catalogs
Charred Classifieds www.charred.com (add)  
Columbus Finder www.columbus-finder.de (add) in German
GIF Animations www.gifanimations.com (add)  
Home Run Links hrun.com (add)  
Hot Info www.mini-mall.com (add)  
Latin World www.latinworld.com (add) Latin America
Linkz www.linkz.com (add)  
Made in USA madeinusa.org (add) search for American-made products, perhaps so you could boycott them as I do.
Mirago www.mirago.co.uk (add) UK and Ireland
MrMister www.mrmister.com (add) shopping links
Munkey.com munkey.com mostly advertising for goods and services
National Directory www.nationaldirectory.com (add) USA only.
Net Search www.netsearch.org (add)  
Ottawa Kiosk www.ottawakiosk.com (add) Ottawa Canada only
PrimeFind www.primefind.com (add) in five languages
Saydar’s Syrcrawler www.saydar.org (add) penpal links
SCI Seek www.sciseek.com (add) Science and Nature
Scrub The Web www.scrubtheweb.com (add) search engine toolbar
Submit One www.submit-one.com (add) Submit your site to multiple search engines
Super Promotions Web Search www.superpromo.com (add)  
The Hub www.thehub.co.nz (add) New Zealand News
The Net 1 www.thenet1.com (add)  
Teoma www.teoma.com   Works by ranking sites on how authorative they are.
The Web 100 www.web100.com (add)  
Traffic Dispatch www.trafficdispatch.com (add)  
Voila www.voila.fr (add) French
Witch www.witch.de (add) In German and English
Wombat www.webwombat.com.au (add) Only for Australia
Yellow www.yellow.com.mx (add) Mexico

Set Up Your Own Search Engine

Software to set up your own search engine:

Services To Submit Your Website to Multiple Search Engines

You can submit your website to be included in multiple search engines via services such as: They will submit your home page to a number of search engines for incorporation. They may charge $40.00 USD or more for the service. See my student project to write a URL submitter.

googod.net offers a service to optimise your site to raise its ranking in the search engines.

Robots

you can discourage search engines from indexing certain pages by using metatags like this:

<META name="robots" content="noindex, nofollow">

You can also have a central robots.txt file like this:

user-agent: * # directed to all robots
Disallow: zips # whatever directory/file you don’t want indexed.

Rant

I would like to get the authors of the search engine software and the authors of the browsers such as:
Click the corresponding browser icon to download the latest free browser software, or click the browser name for more information.
Opera 9.63 Opera 9.63with the Java 1.6.0_11 JRE (Java Runtime Environment). Again works with Java.
Firefox 3.0.5 Firefox 3.0.5with the Java 1.6.0_11 JRE (Java Runtime Environment). Most widely supported next to IE.
Sea Monkey 1.1.14 Sea Monkey 1.1.14with the Java 1.6.0_11 JRE (Java Runtime Environment). Similar to Firefox, with integrated Email.
Safari 3.2.1 Safari 3.2.1with the Java 1.6.0_11 JRE (Java Runtime Environment). Now works both on Macs and PCs. Some rendering problems.
Flock 2.0.3 Flock 2.0.3with the Java 1.6.0_11 JRE (Java Runtime Environment). Similar to Firefox for social networking.
Avant 11.7 Avant 11.7with the Java 1.6.0_11 JRE (Java Runtime Environment). It is a fast browser, especially at starting up. Has problems with JavaScript.
IE 8 beta 2 (8.0.6001.18241) IE 8 beta 2 (8.0.6001.18241)with the Java 1.6.0_11 JRE (Java Runtime Environment). It is pain to get Java Web Start working. It has many bugs and crashes frequently. Renders column classes correctly.
IE 7 (7.0.6000) IE 7 (7.0.6000)with the Java 1.6.0_11 JRE (Java Runtime Environment). It is pain to get Java Web Start working.
IE 6 (6.0.2800) IE 6 (6.0.2800)with the Java 1.6.0_11 JRE (Java Runtime Environment). Not secure.
Google Chrome 1.0.154.36 Google Chrome 1.0.154.36with the Java 1.6.0_11 JRE (Java Runtime Environment).
Get Java Sun’s Java1.6.0_11 JRE (Java Runtime Environment)
together for a month on some nice Caribbean island. They must learn to make their products work together more smoothly. Imagine the possibilities!

Meta Words

Meta words is something I hope search engines will start implementing. Unlike the features mentioned in my rant above, meta words do not require the co-operation of browser manufacturers.

Consider these sorts of question:

You can spend hours wading through more than you wanted to know about the battle of Hastings without ever finding out when it was fought. Similarly you will find all kinds detailed technical specs on ADSL, without ever getting a clue what it is or what it stands for. I propose adding meta words to queries that help direct the search engine. e.g. You won’t necessary find these metawords on the target pages. You infer them. E.g. HTML <DT> tags imply definition; retail sites imply buy numbers and A.D. imply dates. Right now search engines discard most potential meta words simply because they are too common as ordinary search targets.

Users could start using metawords without any change to their current habits. They are likely already including them unaware they are being ignored.

Copernic desktop search: free local search engine
Google
Lucene: free local search engine
people-finder
Search-It
spider
traffic

CMP homejump to top
CMP logo
feedback Please email your feedback for publication, errors, omissions, broken/redirected link reports
and suggestions to improve this page to Roedy Green : feedback email
made with CSS
HTML Checked!
ICRA ratings logo
mindprod.com IP:[65.110.21.43]
Your face IP:[38.103.63.62] The information on this page is for non-military use only.
You are visitor number 58,550. Military use includes use by defence contractors.
You can get a fresh copy of this page from: or possibly from your local J: drive (Java virtual drive/mindprod.com website mirror)
http://mindprod.com/jgloss/searchengines.html J:\mindprod\jgloss\searchengines.html