You are on page 1of 38

M.U.I.E.

T, Aligarh

ABSTRACT

A web search engine is designed to search for information on the World Wide
Web and FTP servers. The search results are generally presented in a list of
results and are often called hits. The information may consist of web pages,
images, information and other types of files. Some search engines also mine
data available in databases or open directories. Unlike web directories, which
are maintained by human editors, search engines operate algorithmically or are
a mixture of algorithmic and human input

To engineer a search engine is a challenging task. Search engines index


tens to hundreds of millions of web pages involving a comparable number of
distinct terms. They answer tens of millions of queries every day. Despite the
importance of large-scale search engines on the web, very little academic
research has been done on them. Furthermore, due to rapid advance in
technology and web proliferation, creating a web search engine today is very
different from three years ago.

The web creates new challenges for information retrieval. The amount
of information on the web is growing rapidly, as well as the number of new
users inexperienced in the art of web research. People are likely to surf the
web using its link graph.

CHAPTER – 1

INTRODUCTION
Web Search Engine 1
M.U.I.E.T, Aligarh

A web search engine is designed to search for information on the World Wide Web and
FTP servers. The search results are generally presented in a list of results and are often
called hits. The information may consist of web pages, images, information and other types
of files. Some search engines also mine data available in databases or open directories.
Unlike web directories, which are maintained by human editors, search engines operate
algorithmically or are a mixture of algorithmic and human input.

Search engines are the key to finding specific information on the vast expanse of the World
Wide Web. Without sophisticated search engines, it would be virtually impossible to locate
anything on the Web without knowing a specific URL.

Search engine is a software program that searches for sites based on the words that you
designate as search terms.

Search engines look through their own databases of information in order to find what it is
that you are looking for.

“Search engine” is the popular term for an Information Retrieval (IR) system.

1.1 PROBLEM DEFINITION

As the web evolves towards the direction of providing more and more information, locating
the desired information efficiently becomes a very important issue. Web search engines are
very useful information search tools in the Internet.

1.2 PROJECT SCOPE

The System should be well equipped with many powerful features and content delivery
system that combines various services, making them an integrated unit. Unlimited users can
access the system anytime & anywhere equipped with an internet connection.

Administrator can upload the files and delete the uploaded files on the server and
also search the text and files. User is only allowed to search the text and files.

Web Search Engine 2


M.U.I.E.T, Aligarh

The technology which should be used to develop the Web search engine should
ensure that searching is fast and accurate; operation is hassle free with easy maintenance.

1.3 FUTURE SCOPE

This software can be easily upgraded in the future. And also include many more features for
existing system.

In future this system will provide security all over the network.

In future user can search images, video and other types of content.

This project will provide greater level of security to user’s requirements.

This project will provide highly secure transmission.

CHAPTER – 2

SYSTEM ANALAYSIS

Web Search Engine 3


M.U.I.E.T, Aligarh

System analysis is a software engineering task that bridges the gap between system level
requirements engineering and software design. Requirements engineering activities result in
the specification of software’s operational characteristics (function, data and behaviour),
indicate software’s interface with other system elements, and establish constraints that
software must meet. The most commonly used requirements technique is to conduct a
meeting or interview. The first meeting between a software engineer (the analyst) and the
customer can be likened to the awkwardness of a first date between two adolescents. Data
collection is done by taking the copies of the documents involved in its working from the
organization.

2.1 PROJECT PLANNING

We were supposed develop software that can provide secure transmission. RSA is a
cryptographic algorithm designed to help users to communicate safely and provide a secure
transmission. Working in team reinstates the team for some common guidelines and
standard to be followed by all the team members across all the team. For the optimum use
of practical time it is necessary that every session is planned. Planning of this project will
include the following things:

 Topic understanding.
 Modular break-up of the system.
 Programming of each module.

2.1.1 TASK IDENTIFICATION PLAN

Gantt chart is a project scheduling technique. Progress can be represented easily in Gantt
chart, by colouring each milestone when completed. The project will start in the month of
February and end will at the starting May

Web Search Engine 4


M.U.I.E.T, Aligarh

February March April May

2.2 FEASIBLE STUDY

Feasibility study is conducted to select the best system that meets the performance requirements.
This entails an identification, description, and evaluation of the candidate system, and the section of
the best system for the job. Many feasibility studies are disillusioning for both user and analyst. First
the study often pre-supposes that when feasibility of the documents is being prepared, the analysis is
in position to evaluate solutions. Second most studies tend to overlook the confusion inherent in the
system development … the constraints and the assumed attitudes. If the feasibility study is to
serve as decision document, it must answer three key questions:

• Is there new and better way to do a job that will benefit the user?

• What are the cost and saving of the alternatives?

• What is recommended?

Web Search Engine 5


M.U.I.E.T, Aligarh

The most successful system projects are not necessary or most visible in business but
rather those that truly need user expectations. More projects failed because of inflated
expectations than for any other reasons.

Their main considerations are involved in feasibility analysis:

2.2.1 TECHNICAL FEASIBILITY


Technical feasibility centres on the current system and to what extent it can support the
proposed system, it includes current computer system specifications such as hardware,
software etc. it also involves financial considerations to accommodate the technical
enhancements. If the budget is serious constraint then the project is judged not feasible.
Though the system is developed in the generalized form, which covers all the procedures
and operations carried out in to produce and run the “Web Search Engine”. The version
used in the system is JAVA programming tool, Net beans, Apache Tomcat and My SQL.

The features embedded in the system are latest and according to the need of the
client. The backend used is the latest Java tools and Net beans. Any up gradations need
can be easily made in the source code, thus decreasing the headache of changing each and
every code. If in near future, the latest version of java tools and net beans to be installed
then , the source code handling the connection of the code can easily be modified. The
codes are easily compatible for the changes, as the latest version doesn't affect the core
code.

2.2.2 FINANCIAL FEASIBILITY


The cost to conduct investigation was negligible, as the centre manager and teachers of
centre provided most of information. The cost of essential hardware and software
requirement is not very expensive. Moreover hardware like Pentium I PC and software like
java tools and net beans are easily available in the market.

2.2.3 OPERATIONAL FEASIBILITY

Web Search Engine 6


M.U.I.E.T, Aligarh

In this we determine what change will be brought in system, new skills required and other
human organization and political aspects. Each user can easily use our algo. However it is
desirable that the user has the basic knowledge of the computers.

Without making any changes in the rules and regulations of the existing system
proposed system can easily adopted.

2.2.4 LEGAL FEASIBILITY

Our Project does not infringe with known acts, status or any pending legislation. Hence it is
legally feasible.

2.2.5 SOCIAL FEASIBILITY


Since, the proposed system is inexpensive, easy to use, reduces human errors, fast and
simple, hence socially accepted by all organization and institutes being directly.

CHAPTER – 3

SYSTEM DESIGN

Web Search Engine 7


M.U.I.E.T, Aligarh

Design is an activity of translating the specifications generated in the software requirements


analysis into specific design. The design involves designing a system that satisfies customer
requirements.
In order to transform requirements into a working system, we must satisfy both the
customer and the system builders on development team. The customer understands what the
system is to do. At the same time, the system builders must understand how the system is to
work. For this reason, system design is really a two-part process. First, we produce a system
specification that tells the customer exactly what the system will do. This specification is
sometimes called a conceptual system design.
This mirrors the two parts of requirement description. The conceptual design
concentrates on the function of the system, while the technical design describes the form the
system will take.

3.1 CONCEPTUAL DESIGN

The conceptual design tells ‘what the system will do? The system is described in terms of its
boundary, entities, attributes, and relationships. In the conceptual designing phase we have
considered the following questions: -

• Where will the data come from?


• What will happen to it in the system?
• What will the system look like to user?
• What choices will user are offered?
• What will the reports and screen look like?

Moreover, the system is described in language that the customer can understand,
rather than in computer jargon and technical terms. For example, the customers of the
system have been told that a menu on display screen will give users access to the system
functions.

The system description may even list acceptable user responses and the actions that
may result. However, the customer is not told how the data are manipulated in the system or
what kind of techniques is used for data manipulation. At the time of conceptual design, we

Web Search Engine 8


M.U.I.E.T, Aligarh

have written in the client’s language, which does not contain technicalities. It describes the
functions of the systems and incorporates all requirements in adequate details.

3.2 DATA FLOW DIAGRAM

Table 3.2.1 User DFD

Web Search Engine 9


M.U.I.E.T, Aligarh

Table 3.2.2 Admin DFD

Table 3.2.3 Database DFD

Web Search Engine 10


M.U.I.E.T, Aligarh

3.3 DATABASE DESIGN

Login

Field Type Collation Null Default Privileges


user_name varchar(40 latin_swedish NO select,insert,up
) _ci date
password varchar(20 latin_swedish NO select,insert,up
) _ci date
username varchar(40 latin_swedish NO select,insert,up
) _ci date
address1 varchar(60 latin_swedish YES (NULL) select,insert,up
) _ci date
address2 varchar(60 latin_swedish YES (NULL) select,insert,up
) _ci date
city varchar(20 latin_swedish YES (NULL) select,insert,up
) _ci date
state varchar(15 latin_swedish YES (NULL) select,insert,up
) _ci date
postcode varchar(20 latin_swedish YES (NULL) select,insert,up
) _ci date
country varchar(20 latin_swedish YES (NULL) select,insert,up
) _ci date
tel varchar(15 latin_swedish YES (NULL) select,insert,up
) _ci date
fax varchar(15 latin_swedish YES (NULL) select,insert,up
) _ci date
email varchar(20 latin_swedish NO select,insert,up
) _ci date
mob varchar(15 latin_swedish YES (NULL) select,insert,up
) _ci date
notes varchar(10 latin_swedish YES (NULL) select,insert,up
0) _ci date

Table 3.3.1 Login Database Table

Upload

Field Type Collation Null Default Privileges


author varchar(40 latin_swedish YES (Null) select,insert,up
Web Search Engine 11
M.U.I.E.T, Aligarh

) _ci date
file_name varchar(10 latin_swedish NO select,insert,up
24) _ci date
upload_dat datetime (Null) NO select,insert,up
e date
file_size int(11) (Null) NO select,insert,up
date
file_type varchar(10 latin_swedish NO select,insert,up
) _ci date

Table 3.3.2 Upload Database Table

Upload html

Field Type Collation Null Default Privileges


author varchar(40 latin_swedish YES (Null) select,insert,up
) _ci date
file_name varchar(10 latin_swedish NO select,insert,up
0) _ci date
upload_dat datetime (Null) NO select,insert,up
e date
file_size int(11) (Null) NO select,insert,up
date
file_type varchar(10 latin_swedish NO select,insert,up
) _ci date
meta_cont varchar(10 latin_swedish YES (Null) select,insert,up
ent 0) _ci date

Table 3.3.3 Uploadhtml Database Table

View_search

Field Type Collation Null Defaul Privileges


t
search_text varchar(4 latin1_swedis YES (Null) select,insert,up
0) h_ci date
file_names varchar(2 latin1_swedis YES (Null) select,insert,up
00) h_ci date

Web Search Engine 12


M.U.I.E.T, Aligarh

search_date datetime (Null) YES (Null) select,insert,up


date
viewed_files varchar(1 latin1_swedis YES (Null) select,insert,up
00) h_ci date
boolean_search_t varchar(3) latin1_swedis NO select,insert,up
ype h_ci date

Table 3.3.4 View_search Database Table

3.4 TECHNICAL DESIGN

The technical design explains the system to those hardware and software experts
who will implement it. The design describes the hardware configuration, the software
needs, the communication interfaces, the input and output of the system and anything else
that translates the requirements into a solution to the customer’s problem. The design
description is a technical picture of the system specification. Thus we include the following
items in the technical design:

 The System Architecture: A description of the major hardware components and their
functions.
 The System Software Structure: The hierarchy and function of the software
components.
 The data structure and flow through the system.

3.5 SYSTEM REQUIREMENT SPECIFICATION


PROJECT EVALUATION AND REVIEW TECHNIQUE

Introduction:

Software project managers take the overall responsibility of steering a


project to success. This surely is a very hazy job description. But, it is very difficult to
objectively describe the job responsibilities of a project manager.

Web Search Engine 13


M.U.I.E.T, Aligarh

WORK BREAKDOWN STRUCTURE


Work breakdown structure is used to decompose a given task set recursively into
small activities. WBS provides a notation for representing the major tasks needed to be
carried out in order to solve a problem. The root of the tree is labelled by the problem name.
Each node of the tree is broken down into smaller activities that are made the children of the
node. Each activity is recursively decomposed into smaller sub-activities until at the leaf
level; the activities require approx. two weeks to develop. If a task is broken down into a
large number of very small activities, these can be distributed to a large number of
engineers. If the activity ordering permits, the solutions to these can be carried out
independently. Thus, it becomes possible to develop the product faster.

The following figure represents the WBS of application software:

Web

Figure 3.5.1 WBS

Web Search Engine 14


M.U.I.E.T, Aligarh

CHAPTER – 4

MODELING

 Admin module — Administrator can login and upload the files (such as .txt, .pdf,
.doc, .docx, .html… etc.).

 User module —user can search the files and text.

 A document processor— Process the document to search the content.

 A query processor—processes the query of the user.

 A search and matching function—Search the content that user want to search.

 A ranking capability—it ranks the result according to user preferences.

 Summarizing and presenting documents—it shows the final result to the user.

Web Search Engine 15


M.U.I.E.T, Aligarh

Figure 4.1 Web Search Engine Working

Web Search Engine 16


M.U.I.E.T, Aligarh

CHAPTER – 5

CODING

5.1 HARDWARE REQUIREMENT

• Pentium 4 processor
• 256 MB RAM
• 20 GB Hard drive

5.2 SOFTWARE REQUIREMENT

• JAVA TOOL KIT


• NET BEANS
• My SQL
• Tomcat Apache server

5.3 TECHNOLOGIES USED

JAVA

Web Search Engine 17


M.U.I.E.T, Aligarh

Java is a programming language originally developed by James Gosling at Sun


Microsystems (which is now a subsidiary of Oracle Corporation) and released in 1995 as a
core component of Sun Microsystems' Java platform. The language derives much of its
syntax from C and C++ but has a simpler object model and fewer low-level facilities. Java
applications are typically compiled to byte code (class file) that can run on any Java
Virtual Machine (JVM) regardless of computer architecture. Java is a general-purpose,
concurrent, class-based, object-oriented language that is specifically designed to have as
few implementation dependencies as possible. It is intended to let application developers
"write once, run anywhere". Java is currently one of the most popular programming
languages in use, and is widely used from application software to web applications.

The original and reference implementation Java compilers, virtual machines, and class
libraries were developed by Sun from 1995. As of May 2007, in compliance with the
specifications of the Java Community Process, Sun relicensed most of its Java technologies
under the GNU General Public License. Others have also developed alternative
implementations of these Sun technologies, such as the GNU Compiler for Java, GNU
Class path, and Dalvik.

*Paradigm- Object-oriented, structured, imperative

*Appeared in- 1995

*Designed by -Sun Microsystems (now owned by Oracle Corporation)

*Developer- James Gosling & Sun Microsystems

*Stable release -Java Standard Edition 6 (1.6.0_25) (April 21, 2011; 16 days ago)

*Typing discipline- Static, strong, safe, nominative, manifest

*Major implementations –Open JDK, Hot Spot, many others

*Dialects -Generic Java, Pizza

*Influenced by -Ada 83, C++, C#, Delphi Object Pascal, Eiffel, Generic Java, Mesa,
Modula- 3, Objective-C, UCSD Pascal, Smalltalk

Web Search Engine 18


M.U.I.E.T, Aligarh

* Influenced- Ada 2005, BeanShell, C#, Clojure, D, ECMAScript, Groovy, J#, JavaScript,
PHP, Python, Scala

* OS- Cross-platform (multi-platform)

*License- GNU General Public License / Java Community Process

* Usual file extensions -.java, .class, .jar

TOMCAT-It’s an application server which is mostly used in the web-applications. It


implements the Servlet 2.5 &JSP 2.1 specifications. It’s a cross-platform application Server.

JSP- Java Server Pages (JSP) is a server side Java technology that allows software
developers to create dynamically generated web pages, with HTML, XML or other
document types. JSPs are compiled into Servlets by a JSP compiler.

SERVLET- Servlets are Java programming language objects that dynamically process
requests & construct responses. The Servlet APIs are contained in the javax.servlet &
javax.servlet.http packages. Servlets can be generated automatically by Java server Pages
(JSP) compiler.

Java Script – JavaScript is a programming language that is used to make web pages
interactive. It runs on your visitor's computer and so does not require constant downloads
from your web site.

Ajax – Ajax (sometimes called Asynchronous JavaScript and XML) is a way of


programming for the Web that gets rid of the hourglass. Data, content, and design are

Web Search Engine 19


M.U.I.E.T, Aligarh

merged together into a seamless whole. When your customer clicks on something on an
Ajax driven application, there is very little lag time.

CSS - Cascading Style Sheets (CSS) is a style sheet language used to describe the
presentation semantics (the look and formatting) of a document written in a markup
language. It’s most common application is to style web pages written in HTML and
XHTML, but the language can also be applied to any kind of XML document, including
SVG and XUL.

MySQL- MySQL is one such RDBMS. It provides a set of functional programs that we
use a tool to build structure and performs tasks, in MySQL data is stored and displayed in
tables. A table is a data structure that holds data in a relational database. A table comprises
of rows and columns. Table can also show relationship between entities. The formal name
of table is relation, hence the name Relational Database Management System.

Free_Text_Search_or.java

package searchEngine;

import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.PrintWriter;
import javax.servlet.ServletContext;
import javax.servlet.ServletException;
import javax.servlet.http.HttpServlet;
import javax.servlet.http.HttpServletRequest;
import javax.servlet.http.HttpServletResponse;
import java.sql.*;
import java.text.DateFormat;
import java.text.SimpleDateFormat;
import java.util.Date;
public class FreeText_Search_or extends HttpServlet {

String fileName = null;


String textstring = null;

Web Search Engine 20


M.U.I.E.T, Aligarh

String logfileName = "";


Connection con = null;
ResultSet rst = null;
PreparedStatement pst = null;
PrintWriter out = null;

@Override
protected void doGet(HttpServletRequest request,HttpServletResponse
response) throws ServletException, IOException {

response.setContentType("text/xml");
out = response.getWriter();
textstring = request.getParameter("searchText");//"Hello rit";//

String textarry[]=textstring.split(" ");


int arLen=textarry.length;
System.out.println ("array length "+arLen);
try {
con = Connect.Connect_S_Engine.makeCon();
pst = con.prepareStatement("select file_name from upload" );
rst = pst.executeQuery();
String result="";
while (rst.next()) {

fileName = rst.getString(1);
String filename = "/upload/"+fileName;
ServletContext context = getServletContext();

InputStream inp = context.getResourceAsStream(filename);


if (inp != null) {
InputStreamReader isr = new InputStreamReader(inp);
BufferedReader reader = new BufferedReader(isr);
String text = "";
Boolean lgfile=false;

while ((text = reader.readLine()) != null) {

for(int i=0;i<arLen;i++){
if(text.contains(textarry[i])){

Web Search Engine 21


M.U.I.E.T, Aligarh

if(!lgfile)
{logfileName=logfileName+fileName+"#";lgfile=true;}
String resultarry[]=text.split(textarry[i]);
int len=resultarry.length;
result=result+fileName+"#~#";
System.out.println("result is1 "+result);
for(int j=0;j<len;j++){
result=result+resultarry[j]+"#!!#";
System.out.println("result is2 "+result);
}
result=result+"#~#"+textarry[i]+"#~!#";
System.out.println("result is3 "+result);
}
}
}
}
}
System.out.println("result is "+result);

out.write(result);
con.close();

} catch (Exception e) {
e.printStackTrace();

}
savelog();
}
public void savelog(){
Date curntDate=new Date();
DateFormat dateFormat = new SimpleDateFormat("yyyy/MM/dd
HH:mm:ss");

try {
con = Connect.Connect_S_Engine.makeCon();
pst = con.prepareStatement("insert into
view_search(search_text,file_names,search_date,search_type,boolean_search
_type) values(?,?,?,?,?)");
pst.setString(1, textstring);
pst.setString(2, logfileName);
pst.setString(3,dateFormat.format(curntDate));
pst.setString(4, "free text search");

Web Search Engine 22


M.U.I.E.T, Aligarh

pst.setString(5, "or");
pst.executeUpdate();
con.close();
} catch (Exception e) {
e.printStackTrace();
}
}
}

CHAPTER – 6

TESTING

There are following rules that can serve well as testing objectives:

Testing is a process of executing a program with the intent of finding an error.

A good test case is one that has a high probability of finding an as-yet-undiscovered error.

A successful test is one that uncovers as-yet-undiscovered error.

6.1 TESTING TECHNIQUES

Black-box (Functional) Testing

Testing against specification of system or component. Study it by examining its inputs and
related outputs. Key is to devise inputs that have a higher likelihood of causing outputs that
reveal the presence of defects. Use experience and knowledge of domain to identify such
test cases. Failing this a systematic approach may be necessary. Equivalence partitioning is
where the input to a program falls into a number of classes. E.g. positive numbers vs.
negative numbers. Programs normally behave the same way for each member of a class.
Partitions exist for both input and output. Partitions may be discrete or overlap. Invalid data
(i.e. outside the normal partitions) is one or more partitions that should be tested. Test cases
are chosen to exercise each portion. Also test boundary cases (atypical, extreme, zero) since
Web Search Engine 23
M.U.I.E.T, Aligarh

these frequently show up defects. For completeness, test all combinations of partitions.
Black box testing is rarely exhaustive (because one doesn't test every value in an
equivalence partition) and sometimes fails to reveal corruption defects caused by "weird"
combination of inputs. Black box testing should not be used to try and reveal corruption
defects caused, for example, by assigning a pointer to point to an object of the wrong type.
Static inspection (or using a better programming language!) is preferable for this.

White-box (structural) Testing:

Testing based on knowledge of structure of component (e.g. by looking at source code).


Advantage is that structure of code can be used to find out how many test case need to be
performed. Knowledge of the algorithm (examination of the code) can be used to identify
the equivalence partitions. Path testing is where the tester aims to exercise every
independent execution path through the component. All conditional statements tested for
both true and false cases. If a unit has n control statements, there will be up to 2n possible
paths through it. This demonstrates that it is much easier to test small program units than
large ones. Flow graphs are a pictorial representation of the paths of control through a
program (ignoring assignments, procedure calls and I/O statements). Use flow graph to
design test cases that execute each path. Static tools may be used to make this easier in
programs that have a complex branching structure. Tools support. Dynamic program
analysers instrument a program with additional code. Typically this will count how many
times each statement is executed. At end, print out report showing which statements have
and have not been executed. Problems with flow graph derived testing:

Data complexity not taken into account.


Does not test all paths in combination.
Really only possible at unit and module testing stages because beyond that complexity is
too high.

DEBUGGING:

Debugging is a cycle of detection, location, repair and test. Debugging is a


hypothesis testing process. When a bug is detected, the tester must form a hypothesis about
the cause and location of the bug. Further examination of the execution of the program
Web Search Engine 24
M.U.I.E.T, Aligarh

(possible including many returns of it) will usually take place to confirm the hypothesis. If
the hypothesis is demonstrated to be incorrect, a new hypothesis must be formed.
Debugging tools that show the state of the program are useful for this, but inserting print
statements is often the only approach. Experienced debuggers use their knowledge of
common and/or obscure bugs to facilitate the hypothesis testing process. After fixing a bug,
the system must be reset to ensure that the fix has worked and that no other bugs have been
introduced. In principle, all tests should be performed again but this is often too expensive
to do.

6.2 TEST PLANNING

Testing need to be planned to be cost and time effective. Planning is setting out
standards for tests. Test plans set the context in which individual engineers can place their
own work. Typical test plan contains:
 Overview of Testing Process.
 Recording procedures so that tests can be audited.
 Hardware and Software Requirements.
 Constraints.

OVERVIEW OF TESTING STRATEGIES

A strategy for software testing integrates test case design methods into a well-
planned series of steps that result in the successful construction of software. It provides a
road map for the software developer, the quality assurance organization and the customer- a
road map that describes the steps to be conducted as part of testing, when these steps are
planned and then undertaken, and how much effort, time and resources will be required.
Therefore, any testing strategy must incorporate test planning, test case design, test
execution, and resultant data collection and evaluation.

Large systems are usually tested using a mixture of strategies. Different strategies
may be needed for different parts of the system or at a stage of the process.

Web Search Engine 25


M.U.I.E.T, Aligarh

CHAPTER – 7

SOFTWARE COST ESTIMATE & LIMITATION &


SCOPE OF PROJECT

Software cost:
• Java tool kit (freeware on internet)

• Net beans (freeware on internet)

• Tomcat (freeware on internet)

• My SQL (freeware on internet)

Manpower cost

• Team Cost 36000

• System Cost 40000

• Maintenance cost 25000

• Profit 24000

Total cost: 125000

Web Search Engine 26


M.U.I.E.T, Aligarh

7.1 LIMITATION

• This system cannot search images, video.

• This system cannot store information about the user for user preferences.

• This system does not povide the security for the user.

7.2 FUTURE SCOPE

This software can be easily upgraded in the future. And also include many more features
for existing system.

In future this system will store about the user preference.


• In future this system will provide the facility of user login.

• In future this system will search images and videos.

• It will work all over the networks of any type.

• We can optimize the searching time by other searching algorithms.

Web Search Engine 27


M.U.I.E.T, Aligarh

CONCLUSION

Web Search Engine is an interface that provide user to search files and text over
the network.

• Search engine plays important role in accessing the content over the internet, it
fetches the pages requested by the user.

• It made the internet and accessing the information just a click away.

• The need for better search engines only increases

• The search engine sites are among the most popular websites.

Web Search Engine 28


M.U.I.E.T, Aligarh

REFERENCES

[1] Bruce Eckel, “Thinking in Java”, Pearson Education, 4th Edition

[2] Scott Duffy, “How to Do Everything with Java Script” McGraw-Hill/Osborne

[3] Simon Brown, Sam Dalton, Daniel Jepp, Dave Johnson, Sing Li and Matt Raible, “Pro
JSP 2” Apress, 4th Edition

[4] O’Reilly ,”Head First Servlet and JSP” 2nd Edition

[5] Bing Liu (2007), Web Data Mining: Exploring Hyperlinks, Contents and Usage
Data. Springer, ISBN 3540378812

[6] Bar-Ilan, J. (2004). The use of Web search engines in information science research.
ARIST, 38, 231-288.
[7] “Guidelines From”,

• www.google.com,

• www.wikipedia.com

• www.ieee.org,

• www.sciencedirect.com

[8] Ross, Nancy; Wolfram, Dietmar (2000). "End user searching on the Internet: An
analysis of term pair topics submitted to the Excite search engine". Journal of the American
Society for Information Science 51 (10): 949–958.

[9] Xie, M.; et al. (1998). "Quality dimensions of Internet search engines". Journal of
Information Science 24 (5): 365–372.

[10] Information Retrieval: Implementing and Evaluating Search Engines. MIT Press.
2010.

Web Search Engine 29


M.U.I.E.T, Aligarh

APPENDIX-A USER MANUAL

There is some information about the project for the user.

First : install My SQL

Second: Install Apache Tomcat

Third : Deploy S_Engine.war file using apache tomcat server

Start Apache tomcat

Open web browser and go to http://localhost:8080

Then go to manager tab and deploy the S_Engine.war file

Forth : go to http://localhost:8080/S_Engine

Username: admin

Password: admin

APPENDIX-B SNAP SHOTS OF GUI


Web Search Engine 30
M.U.I.E.T, Aligarh

Admin Login Page

Free Text Search Page


Web Search Engine 31
M.U.I.E.T, Aligarh

Data Searching Page

Web Search Engine 32


M.U.I.E.T, Aligarh

Data Search Order By page

Web Search Engine 33


M.U.I.E.T, Aligarh

Upload File Page

Web Search Engine 34


M.U.I.E.T, Aligarh

Upload Html Page

Web Search Engine 35


M.U.I.E.T, Aligarh

Delete Page

Web Search Engine 36


M.U.I.E.T, Aligarh

Delete Successful Page

Web Search Engine 37


M.U.I.E.T, Aligarh

Web Search Engine 38

You might also like