Effective Internet Searching
Tangled in the Web?
Using Search Guides & Engines
to Untangle Information Resources: A
Central Florida Library Cooperative Workshop
Scope:
This
page will examine both subject
searching sites and search engines
available for use on the Internet.
Background:
The
following sites provide much background information, tutorials, and guides
about searching the Internet and the World-Wide Web.
- AskScott - Your guide to finding it
on the Internet
- http://www.askscott.com/
- A well-organized site from a librarian trying to organize where to search
by type of information needed.
- CFLC's Original Search the Internet
Page
- http://cflc.net/search.htm
- NOTE: No longer maintained! This page has no tutorials, simply
listings by broad category; it was added to this page by request of CFLC
members.
- Finding Information: Search Engines
- http://www.philb.com/whichengine.htm
- A great page defining the type of question and identifying the best places to search.
- Geniusfind
- http://www.geniusfind.com/
- "Geniusfind categorizes thousands of topic-specific search engines and databases." A search directory of search directories and engines.
- How to Choose
a Search Engine or Research Database
- http://www.internettutorials.net/choose.html
- From the University at Albany Library, an excellent chart in an "if you
want..." format linking to selected search engines or subject searching
sites by specific features, fields, or options.
The same person has created an outstanding search guide to some of the newer search tools called Second Generation Searching on the Web.
- Information Literacy: Search Strategies
- http://www.noodletools.com/debbie/literacies/information/5locate/adviceengine.html
- From NoodleTools, an abbreviated checklist similar to the "How to Choose..." tool above.
- Internet
Web Text: Index
- http://www.december.com/web/text/index.html
- A good outline overview of the Web, which distinguishes
between "Subject-Oriented Searching" and "Keyword-Oriented Searching" and
links to many of the popular subject sites.
- NoodleQuest: Search Strategy Wizard
- http://www.noodletools.com/noodlequest/
- Also from NoodleTools, an interactive form that helps with search strategy choices.
- Pandia: Search Engine Tutorials
- http://www.pandia.com/resources/tutorials.html
- A list linking to a number of Web-based searching tutorials.
- Search Engine Use
- http://www.pewinternet.org/PPF/r/167/report_display.asp
- From the Pew Internet & American Life Project, this is a report from November of 2005 reflecting statistics about searching that show search edging up on e-mail in terms of overall Internet use.
- Web Search Tools : Databases & E-Resources (Library of Congress)
- http://www.loc.gov/rr/ElectronicResources/subjects.php?subjectID=69
- Provides links to a number of WWW search tools, organized alphabetically, with brief descriptions of individual sites plus links with further information.
- Yahoo:
Computers and Internet: Internet: World Wide Web: Searching the Web: How
to Search the Web
- http://www.yahoo.com/Computers_and_Internet/Internet/World_Wide_Web/
Searching_the_Web/How_to_Search_the_Web/
- At last count, lists seventeen sites with significant information about
how to search the Web, most including information about subject guides as
well as search engines.
Subject
Searching Sites
Definition
- Subject searching sites are those where human beings have indexed and
often rated and summarized Internet sites. For purposes of this page, "subject
searching" includes subject guides (often called "Webliographies"),
i.e., documents that list many types of Internet sources but about a single
topic, subject directories, i.e., sites that list one type of Internet
source but about multiple, categorized topics, and the newest entry in the
field, "hybrid" sites, i.e., those that attempt to rate Web sites. For
broad subject searching, any of these classified sites usually provides fewer
but more relevant results than those provided by search engines, which rely on
computer-generated algorithms searching on keyword hit numbers. For narrow
topics, a combination of subject searching and search engines can be the most
effective way to search.
WWW Subject Searching Sites
Background
- Pandia: Alternative Academic Search Directories
- http://www.pandia.com/sw-2002/16-directories.html
- Tutorial with lists and detailed descriptions of four major academic search directories.
- Pandia: The Best Search Directories
- http://www.pandia.com/resources/search-directories.html
- Tutorial with lists and detailed descriptions of search directory tools.
- Recommended Subject Directories
- http://www.lib.berkeley.edu/TeachingLib/Guides/Internet/SubjDirectories.html
- From the library at the University of Berkeley, this is a table showing features from several of the major subject directories.
Good Starting Points
(Alphabetically)
- About.com
- http://about.com
- Formerly The Mining Company, this site relies on human "Guides" to
index Internet content in their own areas of subject expertise. Designed
so that users may search by "channel" or by keyword, the results are annotated
lists with occasional added original content; guides are identified and their
background listed.
-
- INFOMINE: Scholarly Internet Resource
Collections
- http://infomine.ucr.edu/
- In its own words, "INFOMINE is intended for the introduction and use of
Internet/Web resources of relevance to faculty, students, and research staff
at the university level." It categorizes 20,000-plus scholarly Internet and
Web resources and provides indexing and annotations about the sites listed.
Begun in 1994 at the University of California (UC), Riverside, it is now
maintained by librarians at all nine UC campuses and Stanford University. A
few links are limited to UC patrons, but subscriptions to the same sources may
be available at other university libraries--ask!
- Internet Public Library: Subject
Collections
- http://www.ipl.org/div/subject/
- This is an ever-growing and increasingly useful annotated and indexed
collection of selective sites, hosted by the School of Information at the
University of Michigan.
- The Librarians' Internet Index
- http://lii.org
- Begun as one librarian's bookmark file to sites useful in a public library
setting, this is now an organization employing over 100 librarian-indexers. An
extremely well-organized and well-annotated site searchable in various ways.
- Yahoo!
- http://www.yahoo.com/
- "Yet Another Hierarchical Officious Oracle," YAHOO is one of the older and
better searching sites around. With broad categories determined by human
beings, keyword searching can be done across all categories or limited to a
specific category. Search results show site title, brief summary, Yahoo
category, and links to other sites and search engines. An excellent place to
start if searching for what sort of information can by found on the Web by
discipline.
Other Selected Subject Sites
- BUBL Information Service
- http://bubl.ac.uk/index.html
- Originated as BUlletin Board for Libraries, and while
it retains a strong library element, the subject trees (accessible
alphabetically or by Dewey Classification number) now provide
broader access to research and academic Internet sites.
- Internet Resources
For...
- http://www.ala.org/ala/acrl/acrlpubs/crlnews/internetresources.htm
- Archives of a monthly column that appears in College & Research
Libraries News, there are few bibliographies here, but those found are
extensive and highly authoritative. Listed by title and by date.
Now, a wiki updates the content; linked off this page.
- The World-Wide Web Virtual Library
- http://www.vlib.org/
- This is probably the oldest search site on the Internet. It allows keyword searching plus viewing the subject list either
hierarchically or alphabetically.
"Hybrid" Sites: Reviews and
Ratings of the Internet
- Many of the parent sites listed in this section show up in other portions
of this page, as they produce subject directories or search engines, but each
of these organizations also employs teams of professionals specifically to
review and/or rate Internet sites. In most cases, searching the master site
does not automatically include the associated reviewed directory.
- The Internet Public Library:
Ready Reference Collection
- http://ipl.org/div/subject/browse/ref00.00.00
- "Not intended to be a comprehensive hotlist to all sites on every subject,
but rather an annotated collection, chosen to help answer specific questions
quickly and efficiently. Sources are selected according to ease of use,
quality and quantity of information, frequency of updating, and
authoritativeness." Arranged by broad category then by subcategories, the
"collection" (several thousand items) is also searchable by keyword. Entries
give title, URL, a review, the site author, and the IPL subject headings and
keywords.
-
- The Scout
Report
- http://scout.wisc.edu/
- A weekly publication of the Internet Scout Project (part of the InterNIC),
this valuable current awareness tool is available by subscription or on the
Web. Professional librarians and subject matter experts select, research, and
annotate what they judge to be "the best" Internet resources available and
more than three years of issues are archived and searchable by keyword, fields,
subject category, or LC Classification.
-
"Also-Rans": Recommended With Qualifications
- With more time and work, these sites will probably eventually become quite
useful, but for now, they fall short of the mark. Use them for exploring
or surfing, but be warned that they are frustrating if you are on an
expeditious hunt for information!
- Ask
- http://www.ask.com/
- (Formerly AskJeeves.) Though really more in competition with search engines than with subject
searching sites, this is definitely an "also-ran." Ask uses
natural language processing and knowledge bases rather than just Boolean
keyword searching, but unfortunately, the "millions of researched answer
links" frequently do not contain the specific answers sought. Ask
responds with the questions for which it does have answers, checks
several of the popular search sites, and provides lists of their links.
Still a far cry from fulfilling its claim that "each answer link is guaranteed
to be relevant to the question asked," this might be useful for homework help
(see particularly Ask for Kids!),
but not yet for research.
- Britannica.com
- http://www.britannica.com/
- This is much more than the online version of the well-known encyclopedia, especially for subscribers. Even for non-subscribers, however, it provides expert reviews of selected Websites in search results.
- IPL Pathfinders Subject
Index
- http://www.ipl.org/div/pf/
- A relatively new service from the Internet Public Library, this is a
limited, but ever-growing, list of guides to starting research on particular
topics. They primarily link to Internet resources, but most also cite
books of interest.
- WebRing
- http://dir.webring.com/rw
- WebRing operates from a sound basic premise, i.e., linking like subject
sites together so that a searcher can move from one to the next and know that
they are related to the desired topic. The organization of and search
engine for the 50,000-plus rings greatly improved while it was previously
owned by Yahoo!, but the included pages
are still predominantly personal and commercial.
World-Wide Web Search Engines
Definition
- World-Wide Web search engines are sites that use software (often referred
to as spiders, crawlers, worms, or robots) to automatically create searchable
databases attempting to "index" the Internet. Many engines "weigh" the results
for relevancy, relying on a computer-generated algorithm to compare the
numbers of times keyword hits appear. Meta-search sites provide an
advantage, in that they will allow querying several search engines from a
single site (either one or several at a time), but they usually do so without
taking full advantage of the features of each search engine. For narrow
searches where a specific term is required, search engines are the most
effective way of finding those sites which use the term. For broader searches
going beyond the keyword into a discipline or subject area, a combination of
search engines with subject searching sites, or directories, is more
effective.
Background Information
-
- Pandia: The Best Search Engines
- http://www.pandia.com/resources/search-engines.html
- Tutorial with lists and detailed descriptions of search engines.
- Search Engine Features Chart
- http://www.searchengineshowdown.com/features/
- A table by Greg Notess, a librarian who writes Search Engine Showdown, comparing the search capabilities and
commands for Ask, Exalead, Gigablast, Google, Live Search, WISEnut, and Yahoo Search.
- Search Engine Watch
- http://www.searchenginewatch.com/
- For Web surfers and Web developers, this provides massive amounts of
information on search engine news, design, tips, statistics, and more.
For added value, true fans may subscribe.
- Search Engines
- http://www.libsuccess.org/index.php?title=Search_Engines
- From Library Success: A Best Practices Wiki, this is an annotated and categorized list of search engines and specialty search engines.
- Thumbshots.com Ranking
- http://ranking.thumbshots.com/
- A tool that allows comparing either results between two engines or two different searches on the same engine.
- Yahoo > Directory > Computers and Internet > Internet > World Wide Web > Searching the Web >
Search Engines and Directories > Comparisons and Reviews
- http://dir.yahoo.com/Computers_and_Internet/Internet/World_Wide_Web/
Searching_the_Web/Search_Engines_and_Directories/Comparisons_and_Reviews/
- At last count, lists five sites with significant information about
search engines and their various functions and features.
WWW Search Engines: Meta-Sites
(more than one search engine listed and accessible)
Meta-Sites That Provide Search Forms & Links; Search One At A
Time
- SEARCH.COM
- http://www.search.com/
- A search engine to find search engines, this is a very professional
meta-index compiled by the staff of CNET: The Computer Network (whose
television show, C/NET Central, airs on USA Network and the Sci-Fi Channel).
Provides subject menus as well as an A-Z listing of some 500 search engines.
- iTools - Search Tools
- http://www.itools.com/search/
- A site with many forms linking to search engines, there is also a
companion site, iTools - Research Tools, where users can
easily use forms to look up language, geographical, financial, and other
questions.
- Scandoo
- http://www.scandoo.com/
- Designed to scan the results of the search (Google, Yahoo!, or Live) for family friendliness prior to displaying them.
-
Meta-Sites That Search
More Than One Engine Simultaneously (also called Clustering Engines)
- Clusty
- http://clusty.com/
- Formerly the Vivísimo search site; developed at Carnegie Mellon University in 2001, this
searches multiple search engines and then organizes the search results
into "document clusters" or meaningful groups. Very easy to restrict search by host. Now available for searching with Wii!
- Dogpile, the Friendly Multi-Engine
Search Tool
- http://www.dogpile.com/
- Dogpile searches for Web documents (using thirteen popular engines),
USENET entries (using four sources), FTP sites (using two engines), plus a
variety of specialized areas, including weather, stock quotes, business news
and other news wires. Results are listed by engine, with a small
proportion of entries [10-20] from each shown (in the style of that
particular engine), but with the option of going to the engine itself for
any entries not listed.
- Don Busca
- http://www.donbusca.com/
- Has some interesting new features, but includes many "pay per inclusion" search engines.
- Exalead
- http://www.exalead.com/
- Gives thumbnail pictures of the search results pages; allows easy refining of search results by related terms, type of site, media, language, file type, and more.
- Ixquick Metasearch
- http://ixquick.com/
- Ixquick, "the world's most powerful metasearch engine,"
searches for Web documents, news, MP3 files, and pictures; combines
results into one list, ranking results based on how many times they appear
in the top ten rankings from the different search engines; allows any type
of search syntax; and is available in over a dozen languages.
- KartOO
- http://kartoo.com/
- Visual metasearch engine. Almost more of a tag cloud result than just thumbnail screenshots.
- Mamma: Mother of All Search Engines
- http://www.mamma.com:80/
- This meta-search engine site uses
Alta Vista, Excite, Infoseek, Lycos, WebCrawler, Yahoo, and HotBot
simultaneously to generate one set of results. Items are ranked by
percentage, identified by engine used, and give summaries if requested.
- MetaCrawler Searching
- http://www.metacrawler.com/index.html
- A sophisticated meta-site, this sends queries to eleven different search
engines simultaneously, compiles the results on one page (attempting to
eliminate duplicates), and provides a "relevance ranking" for each hit.
"Metaspy" is a new feature great for voyeurs that shows ten real-time search
topics; the page refreshes every 15 seconds. Also has added a subject
guide on the home page.
- SurfWax
- http://www.surfwax.com/
- Displays results in a very different format, allowing choice of views. Also gives a "SiteSnap" to see keywords in context and more.
- URL.com
- http://www.url.com/
- A metasearch engine of Google, MSN, and Yahoo!; allows users to vote on search results.
- WebCrawler Searching
- http://webcrawler.com/
- Allows searching for Web sites, images, audio, video, news sites, and more. Includes Google, Yahoo!, Ask, and MSN Search.
-
-
WWW Search Engines: The Largest Single Search Engine
Sites
(Listed by Size as of
12/31/02)
- Google
- http://www.google.com/
- Google uses a technique, "link analysis", which allows it to be
a huge index (over 3 billion pages), not all of which are indexed. While
it is not the same as full-text indexing of all of those pages, if
sheer numbers are desired, Google is the engine. Results are returned on
the basis of link popularity, which works remarkably well, and the engine also allows
searching within the results. The 3 billion figure does not include the Google Groups
[formerly Deja.com] discussion posts (about 700 million), the 425 million image
files, or the 4500 news sources, all of which are separately searchable.
- All the Web Search
- http://www.alltheweb.com/
- Formerly called "Fast", this entered the search engine scene in 1999 and for a good
portion of that year was the largest search engine on the Web by far.
Given that it is a Dell partner and trying to demonstrate its technology and
speed, it is not a fancy search engine, but it is a good alternative if others
are not finding relevant results.
- AltaVista:
The Search Company
- http://www.altavista.com/
- This is an extremely popular search engine, as it includes more documents
and allows for more sophisticated searching than most (including 25
languages). Has BASIC and ADVANCED
search modes, each with a lengthy help file, but no help is available if "Bad
Query" is the system response. Results are not ranked; each lists title,
summary, and URL. Allows turning on a family filter; searching for
images, video, audio, MP3, and news; and searching within results.
- Windows Live
- This is the MSN entry into search, along with its companion Live Search Academic Beta.
-
WWW Search Engines: Selective Smaller Single
Search Engine Sites (Listed Alphabetically)
- accoona
- http://www.accoona.com/
- Displays news, business, and Web search results; allows "supertargeting" results by displaying drop-down menus of included terms, geographic tags, companies, people, languages, and publishers.
- Gigablast
- http://beta.gigablast.com/
- Not a particularly remarkable search engine upon first glance, but contains one of the most useful search filters around, the "GigaBits" box, which allows quickly focusing on themes or subjects that occur frequently in a search result.
- HotBot
- http://www.hotbot.com/
- This search engine, now owned by Lycos, has gone through many changes
since its award-winning entry into the search engine field in 1996.
Still is a good place to try if overwhelmed by one of the larger engines or if
advanced search capabilities are desired.
- IceRocket
- http://www.icerocket.com/index
- Includes channels for blogs, web, myspace, and more. One of the best blog search engines.
- Lycos
- http://www.lycos.com/
- Results show title, summary, and URL. Has a sophisticated way of displaying results, including "basic facts," "advanced reading," and more.
- Mooter
- http://mooter.com/
- A search engine that organizes the results as "a diagram of themes." One of the emerging "visual search engines."
-
-
Back
to top of section. Back to
top of page.
WWW Search Engines:
Specialized Search Engines or Lists
-
- Business.com
- http://business.com
- Though built on a paid search model, this business-focused search engine and
directory is useful, containing "more than 400,000 listings within 25,000 industry, product and service subcategories."
Results show the paid listings first but then go into a strict alphabetical
listing; some entries include additional links.
- Daypop
- http://www.daypop.com
- A current events/weblog/news search engine; updated each hour.
-
- Google
Groups
- http://www.google.com/grphp?hl=en
- This search engine was formerly known as Deja News, then Deja.com, and it began its life
designed to search Usenet postings exclusively. It is now providing
"...complete access to Usenet data since 1995" and includes more
than 700 million messages.
- Health Vault Search
- https://health.live.com/default.aspx
- From Microsoft, this is an unusual site intended to be used as a personal medical portal. Has a search engine targeting medical sites.
- MsDewey
- http://msdewey.com/
- Not the greatest search engine, but a hoot to use, as the Flash Ms. Dewey impatiently taps on the glass and entertains herself waiting for you to respond or responds in part based on the context of your search.
- Northern Light Business Research Engine
- http://www.northernlight.com/library.html
- Once a general search engine, this site now concentrates on business. Some free and some fee.
- Search Engine Colossus
- http://www.searchenginecolossus.com/
- From Canada, this is a list of more than 1,500 search engines and subject
directories from 176 countries, organized by country or broad category. Indicates the
language of the source and includes a brief summary of the resource.
- Truveo
- http://www.truveo.com/
- Search engine and directory for nothing but videos. Includes many sources.
- USAGov: The U.S. Government's Official Web Portal
- http://www.usa.gov/
- Begun in September 2000, this is self-described as "the only official
U.S. Government portal to 30 million pages of government information,
services, and online transactions." Excellent, award-winning
site.
- Wink: Where People Find People
- http://wink.com/
- Searches several social networks looking for people.
-
-
Created 8/23/97; Last revised 3/12/2008
|