You are on page 1of 18

Seminar on

“User-Oriented Evaluation
Methods for Interactive
Web Search Interfaces”

Under the Guidance of: Presented by:


PROF V.V.KONDHALKAR RAHUL. P. GUPTA
AGENDA
 INTRODUCTION
 HISTORY
 TYPES OF SEARCH ENGINE
 WORKING
 METHODS OF TEXT SEARCHING
 RELEVANCE RANKING
 META SEARCH ENGINE
 CONCLUSION
INTRODUCTION
 Web Search Engine is a software program that
searches the Internet (bunch of websites) based
on the words that you designate as search terms
(query words).

 Search engines look through their own databases


of information in order to find what it is that you are
looking for.

 Web Search Engines are a good example for


massively sized Information Retrieval Systems.
HISTORY

Archie – First search tool for the Internet

Gopher – indexed plain text documents

Jughead – searched the files stored in Gopher


index systems

Wandex – first Web search engine


TYPES OF SEARCH ENGINE

 DIRECTORIES
Directories are staffed by human editors who consider every
new website submitted and, if they decide it is acceptable,
assign it to the appropriate category (YAHOO).

 WEB CRAWLER

An automated Web browser which follows every link it sees.


(GOOGLE)
WORKING
A search engine operates in the following 3 steps

1. Web crawling

2. Indexing

3. Searching
1. WEB
CRAWLING
 It is the process of scanning web sites to add new pages
and to update existing one.

 A web spiders is an automated system.

Googlebot is Google’s web


crawling robot. It functions like web
browser, by sending a request to a
web server for a web page ,
downloading the entire page, then
handing it off to Google’s indexer.

Spiders are always crawling


2. INDEXING
 It allows information to be found as quickly as possible

 The most effective ways is to build a hash table

 For example, that the "M" section of the dictionary is much thicker
than the "X" section
 Lycos indexes the title, headings, subheadings and the hyperlinks to
other sites, along with the first 20 lines of text and the 100 words that
occur most often

 Infoseek uses a full-text indexing system, picking up every word in


the text except commonly occurring stop words such as "a," "an,"
"the," "is," "and," "or," and "www."

 AltaVista claims to index all words, even the articles, "a," "an," and "the."
3. SEARCHING
METHODS OF TEXT
SEARCHING
 KEYWORD SEARCHING
Most search engines do their text query and retrieval
using keywords.
search engines have trouble with so-called stemming.

 CONCEPT SEARCHING (CLUSTERING)


Concept-based search systems try to determine what
you mean, not just what you say.
Excite is currently the best-known general-purpose
search engine site on the Web that relies on concept-
based searching
RELEVANCE RANKING

Term Frequency
Locations of Terms
Link Analysis
Popularity
Date of Publication
Proximity of Query Terms
META SEARCH ENGINE
A meta-search engine is a search tool that sends user
requests to several other search engines and/or databases
and aggregates the results into a single list or displays them
according to their source.

Meta-search engines do not


own a database of Web pages

E.g.; DOGPILE
Examples of different Search Engine

SEARCH META-SEARCH DIRECTORY


www.4websearch.com www.Ixquick.com www.yahoo.com

www.altavista.com www.mamma.com www.about.com

www.alltheweb.com www.metacrawler.com www.galaxy.com

www.google.com www.redesearch.com www.goguides.org

www.hotbot.com www.surfwax.com www.looksmart.com

www.lycos.com www.turbo10.com www.zeal.com


CONCLUSION

 Though there are many search engines available on the


web, the searching methods and the engines need to go a
long way for efficient retrieval of information on relevant
topics.

None of the search engines out there today are perfect,


but using the right one at the right time can make all the
difference.

 Use Meta search engines. They minimize your search


to a great extent. The good news is that new search
engines are evolving every day to improve retrieval
efficiency.
REFERENCE

www.howstuffworks.com
www.scribd.com
www.searchenginewatch.com
www.informationplease.com

You might also like