Sie sind auf Seite 1von 17

WEB TECHNOLOGY - I

PAPER V

CONTENTS
Introduction to Internet ............................................................................................................................................2
History of Internet ...................................................................................................................................................2
Internet Users .........................................................................................................................................................4
Internet Engineering Task Force (IETF) .............................................................................................................4
Internet Research Task Force (IRTF).................................................................................................................4
Internet Architecture Board (IAB) .......................................................................................................................4
Internet Society (ISOC).......................................................................................................................................4
Internet Engineering Steering Group (IESG) ......................................................................................................5
Internet Assigned Numbers Authority (IANA) .....................................................................................................5
World Wide Web Consortium (W3C) ..................................................................................................................5
Internet Working .....................................................................................................................................................6
Information on Internet ...........................................................................................................................................6
Requirements for connecting to Internet ................................................................................................................7
Basic Internet Terms...............................................................................................................................................7
Introduction to World Wide Web .............................................................................................................................8
Evaluation of World Wide Web ...............................................................................................................................9
Basic Features ........................................................................................................................................................9
Information System .............................................................................................................................................9
Graphical Navigation ..........................................................................................................................................9
Platform...............................................................................................................................................................9
Distribution ....................................................................................................................................................... 10
Dynamic ........................................................................................................................................................... 10
Accessing Information ..................................................................................................................................... 10
Interactivness ................................................................................................................................................... 10
Web Browsers ..................................................................................................................................................... 10
Popular Web Browsers ........................................................................................................................................ 11
Web Servers ........................................................................................................................................................ 12
HTTP (HyperText Transfer Protocol) ................................................................................................................... 13
URL ...................................................................................................................................................................... 13
Locating a page by its URL using internet explorer ......................................................................................... 14
Search Engines ................................................................................................................................................... 14
Search Engine Categories ................................................................................................................................... 14
Search Criterion ................................................................................................................................................... 16
How to search using Search Engines?................................................................................................................ 16

INTRODUCTION TO INTERNET
Let us start with a small example, imagine that, you are working as a marketing manager in a
Company ABC, that has branches over various locations, and you want to know the details of the sales
of your company over various branches, one solution is by visiting through various branches and
collecting information about the sales, the second solution is, by connecting the computers at your
branches through a network. This allows you to complete your assignment from your seat itself.
Networking of computers allows sharing of the resources among themselves. When you
connect two computers so that they can communicate, you form a network. You can also connect two
or more networks to form an internet-network or Internet.
The Internet is the word used to describe the massive world-wide network of computers. The
word Internet literally means, Network of networks. Internet is super-network of millions of
interconnected computers and millions of users.
The foundation of the web was laid around the 1960s in USA, when the ARPA (Advanced
Research Projects Agency renamed as DARPA Defense Advanced Research Projects Agency)
researched into a localized networking concept on how they can share text documents from one
computer to another. The technology they researched was aimed at how these computers can
communicate even if a portion of the phone network crashed during say a war or flood or any calamity.
When the then phone network was based on circuit switching technology, this research actually gave
rise to the birth of packet switching in networks. The new model aligned well with the fact with that
information was sent in bursts or packets at some time intervals.
In other words, Internet is a network of networks that consists of millions of private, public,
academic, business, and government networks, of local to global scope, that are linked by a broad
array of electronic, wireless, and optical networking technologies.
FNC (Federal Networking Council) defines term Internet as,
Internet refers to the global information system that
1. Is logically linked together by a globally unique address space based on the Internet
Protocol (IP) or its subsequent extensions/follow-ons;
2. Is able to support communications using the Transmission Control Protocol/Internet
Protocol (TCP/IP) suites or its subsequent extensions/follow-ons, and/or other IPcompatible protocols; and
3. Provides, uses or makes accessible, either publicly or privately, high level services
layered on the communications and related infrastructure described herein.

History of Internet
The history goes back to the packet switching technology first used when two computers talked
to each other. Multiple nodes operate independently thus reducing impact in case of point failures. Thus
TCP/IP

Late 1960's to early 1970's Dept. of Defense Advanced Research Projects Agency (ARPA)
ARPANET served as basis for early networking research as well as a central backbone during the

development of the Internet. TCP/IP evolved as the standard networking protocol for exchanging data
between computers on the network.
Mid-To-Late 1970's Basic services were developed that make up the Internet: - Remote
connectivity - File Transfer - Electronic mail
1979-80 Usenet systems for newsgroups
1982 Internet gopher
1991 Public introduction to World Wide Web (mostly text based) - In the early 1990s, the
developers at CERN spread word of the Web's capabilities to scientific audiences worldwide. - By
September 1993, the share of Web traffic traversing the NSFNET Internet backbone reached 75
gigabytes per month or one percent. By July 1994 it was one terabyte per month.
1994 Prior to this time the WWW was not used for commercial business purposes - The Internet
is one-third research and education network - Commercial communications begin to take over the
majority of Internet traffic

Private network.
-FTP
-SMTP
-Email

1960

70

WWW
-Search
-Charts
-HTTP
-HTML
-Graphical

Public Networks.
-Newsgroups
-IRC

80

90

2000

Social Networks
-RIA
-Wiki
-Blog
-Web 2.0
-Web 3.0

10

In Short,

DARPA creates ARPANET 1968


First set of nodes connecting universities among UCLA, Stanford, and University of Utah
1970
First E-Mail sent 1972
Development on TCP/IP 1973
USENET decentralized news group
NSF creates CSFNET which is a 56 kbps network within institute
TCP/IP made used in ARPANET
IPv4 formal introduction 1980
IBM PC 1981
DNS 1983
Internet, WWW 1992

1000s of new hosts added to internet 1994


The Federal Networking Council (FNC) defines the term Internet 1995

Internet Users
In fact, nobody runs or owns it. Every network that is connected to the internet is responsible for
its own part. However, there are 3 main organizations which are supposed to guide and coordinate the
working of Internet. They are:
INTERNET ENGINEERING TASK FORCE (IETF)
The Internet Engineering Task Force is an international, self-organized group of people who
contribute to the engineering and evaluation of Internet technologies. It is the principal body engaged in
the development of new Internet standard specifications. IETF are aligned or structured as per area
groups. Each area is managed by an Area Director, who is a member of IESG. The following are
examples of some of the broad areas,
APP applications concerned with protocols for E-Mail, etc.
INT Internet concerned with ways of moving IP packets and DNS information
SEC Security concerned with authentication, privacy, etc.
The first step in the process of a W3C technology specification is the formation of a working
group, comprising members from W3C and the industry. This group develops the standards required
for the technology to work, passing through a number of intermediate stages consisting of the following.
Working draft
Final working draft
Recommendation
INTERNET RESEARCH TASK FORCE (IRTF)
IRTFs mission is to promote research of importance to the evolution of the future Internet. This
is done by creating focused, long-term, and small research groups working on topics related to Internet
protocols, applications, architecture, and technology. Some of the topics span security, anti-spam,
network management, name space management, etc. One of the important architectural components
include SNMP.
INTERNET ARCHITECTURE BOARD (IAB)
The IAB was earlier called the Internet Advisory Board, changed to current one in 1989. The
internet is vast and hence there is every possibility than a particular standard or policy loss sight of the
global picture. The IAB is responsible to ensure that the big picture is intact and focus on long-term
planning and coordination among the board areas of the IETF. IAB thus reviews the activities (and
information of new) working groups for integrity and architectural consistency. Another offshoot of an
important activity is the IRTF.
INTERNET SOCIETY (ISOC)
ISOC is an international, nonprofit membership-based organization founded in 1992 to provide
leadership in Internet-related standards, education and policy. It provides financial and/or legal support
to other groups and task forces related to the internet. Most of the Internet standards with respect to
technology are developed and set by the Internet Society (ISOC) and the units operating under its
auspices.

INTERNET ENGINEERING STEERING GROUP (IESG)


The IESG is responsible for technical management of IETF activities and the internet standards
process. It administers the process according as per ISOC procedures. The IESG ratifies or corrects
the output from the IETFs working groups, gets them started and finished, and ensures that nonworking-groups drafts that are about to become RFCs (request for comments) are correct.
Working groups are how the vast majority of IETFs done. There are hundreds of working

groups at a time and anyone can participate in their discussions.


INTERNET ASSIGNED NUMBERS AUTHORITY (IANA)
Some of the activities need to be monitored even after the protocol is finalized. Examples
include registry activities for MIME, Port number, etc. IANA is the central coordinator for the assignment
of unique parameter values for Internet protocols. The IANA is chartered by the Internet Society (ISOC)
to act as the clearinghouse to assign and coordinate the use of numerous Internet protocol parameters.
It is responsible for Domain names, IP address, port numbers (default port numbers like 80). It
delegates the administration to other bodies like the following:

RIPE NCC (Rseaux IP Europeans Network Coordination Centre)


ARIN (American Registry for Internet Numbers)
APNIC (Asia Pacific Network Information Centre)
LACNIC (Latin American and Caribbean IP address Regional Registry)
AfriNIC (African Regional Registry for Internet Number Resources)

In 1998, another organization called ICANN (The Internet Corporation for Assigned Names and
Numbers) was created as a private sector, not-for-profit Internet stakeholder aimed at administering
policy for Internet name and address system. ICANN operates under the US Department of the
Commerce. As a result, it performs part of the previously assigned IANA functions IP address
allocation, top level domain name management (like .com, .net, .org), etc. It ensures that every IP
address is unique and can be found on the Internet.
WORLD WIDE WEB CONSORTIUM (W3C)
The W3C was founded in 1994, by Tim Berners-Lee, in collaboration with CERN. Its mission is
centered on the web and most of its work revolves around the standardization of web technologies to
ensure interoperability. It achieves this by working on recommendations which are generated by
members.

Internet Working
All computers and other equipment within any given network are basically connected to each
other with the help of cables. The messages travel across the network with the help of networking
protocols. The protocols used over the Internet provide addresses for the computers attached to the
physical network. In this way, different types of networks communicate with each other using the same
protocol. To interpret the information being transmitted it is essential that the right software and
hardware be in place.
The commonly used protocols are:
Internet Protocols (IP)
Transmission Control Protocol (TCP)
Together they are called as TCP/IP protocol. If offers a simple naming and addressing
schemes, whereby resources on the Internet can be easily located.
Internet is dependent on the existing infrastructure developed by telephone and other
telecommunication companies, for transmission. Internet Service Providers (ISP) lease data circuits
form the telephone networks, and have dedicated computers at the end points or nodes. They rely on
the distributed intelligence of networking equipment known as routers, thus bypassing the telephone
companys expensive switching computers, while using their telephone lines. It can also be said that
the networks on the Internet use a router to communicate with other networks. The router on a network
accepts packets addressed to it and passes them as addressed to other networks. Each computer
system with a direct connection to the Internet, has to have the necessary hardware and software to
allow it to work with these packets of information.
All the contents of the Internet are held by computers knows as severs which are owned by
organizations or companies who want to distribute the information. Most of the Internet services
operate according to a scheme called client and server. Most of you who are familiar with networking,
know these words. Server is the main computer which controls all the other computers known as
clients. When a request is made to these servers for information, they bundle the requested information
in small packets with the address as to where it is to be sent, and send them down to the nearest
connection to the Internet. The packets are read by the router and then are sent down in the same
general directions as the address. A similar thing happens, at the next junction on the Internet. This
goes on till the packet is delivered to the right address, where it is put together again with other packets
to make up the original information.

Information on Internet
To understand how the information is transferred through the Internet, we will have to study
more about the Internet technology. After all it is very strange that it allows you share information not in
the form of written words but also allows you to have the information in the vocal form.

Remember what I had said earlier, about Packet Switching. This packet switching technology is
still used to transfer the data. Digital data made up of a series of 0s (OFFs) and 1s (ONs) are grouped
in unique sequences. Each sequences of 0s (OFFs) and 1s (ONs) have a particular meaning, which is
translated by the computers to enable you to view the matter on your computers screen. To understand

it more clearly, do remember that computers do not understand languages in the manner in which we
speak and write. Instead, they understand languages based on electrical impulses that go ON and
OFF. Speaking scientifically, such a system is called the Binary System in which specific combinations
of ON and OFF.
Thus, a message, sent by you through the Internet, first gets converted (translated) by your
computer in to a digital format, made up of a series of ONs (1s) and OFFs (0s) that are grouped in
specific and unique sequences. Each sequence of 1s and 0s has a particular meaning. At the final
receiving point, these sequences are reconverted (re-translated) by your friends computer so that your
friend can read your message on his computers screen.

Requirements for connecting to Internet


To connect to Internet, you need to have the followings:
1. A computer system with a software like Microsoft Internet Explorer or Netscape Navigator
loaded into it.
2. A telephone line. It will be better to have a dedicated telephone line, i.e., a line exclusively for
your Internet connection only. It is because, with the passage of time, you may find that you are
spending most of your time on the Net. This will prevent others from using your telephone or
calling you, as they will often get an engaged tone. However, if you wish to use the Internet
occasionally, then your existing telephone line will be sufficient.
3. A modem (modular-demodulator) is an electronic device that converts digital data from
computers into signals. These signals can then be transmitted over a normal telephone line. At
the receiving end, another modem converts the signals back into digital data understood by
computers. Modems can be internal, i.e., inserted in a slot on your computers motherboard or
external, i.e., fitted externally. Irrespective of whether your modem is internal or external, you
will need a jack to connect your telephone line to your computer.
4. As mentioned earlier, there are companies who provide you with the services of providing you
with the Internet Services. They are called Internet Service Provider (ISP). You have to open an
account with ISP to have the connection. An ISP is a company that gives you access to the
Internet for a fee. Presently, a number of ISPs are available in India. These include VSNL
(Videsh Sanchar Nigam Limited); Satyam Online; Mantra Online; Tata Nova; etc. each one of
these allow you to open an account with them and they would give an e-mail address too.

Basic Internet Terms


Before start using (browsing/surfing) the Web, it is necessary for you to understand the following
terms and their meanings especially in connection with the Internet:
Home page

Hypertext and Hyperlinks

It is the first page that you would see on the website. Also known as
the Welcome page. It is from here that you would start the navigation
of various other page of the site.
Information on the Web is made available in the form of Hypertext. It
is a method of presenting information wherein some portion is
highlighted. When this highlighted portion is selected, it displays more
information on the topic/s that you choose. The highlighted items

Internet Information Server


Internet Protocol
Internet Service Provider
Multimedia

Transmission Control
Protocol
Web Browser

Web Server

Web Site

Web Page

selected by you are technically called Hyperlinks. In fact, they allow


you to navigate from one Web document to another on the same
computer or on a different computers in your own city, country or
anywhere else in the world.
It is a group of Internet servers including the additional capabilities of
Windows NT and Windows 2000.
It is responsible for the addressing and sending data from one
computer to another computer.
It is one gateway to the Internet. As mentioned earlier, you need this
service to connect to Internet from your computer.
At the heart of the web is the ability to display multimedia information,
such as images, audio, video, animation, and other multimedia data
types.
It uses a set of rules to exchange messages with other Internet points
at the information packet level.
It is a software application that resides on your PC and can display
text, images, and multimedia data found on different Web pages. It
allows you to specify a Web page, navigate using links, and
bookmark your favorite Web pages. The commonly used Web
browsers are Internet Explorer and Netscape Navigator.
A Web server refers to a location (computer) on the Internet that
contains information in the form of Web pages. Technically speaking,
a Web Server means a computer on the Internet having the capability
to run software. A page stored on a Web server can be accessed by
Web users. It may also be mentioned here that Internet Service
Providers (ISPs) normally offer space on their Web servers on which
their registered users can publish their Web pages free of charge.
A web site comprises of a collection of Web pages that may be
maintained and updated by an organization like a Government or
University department, a business house, a research institution etc.
even a single individual can also create and maintain his/her own
Web site is stored in Hypertext.
A Web page refers to a document on the Web. Web pages can be
used to display written text, show pictures, play music/sound effects
and run video. Do also remember that you need to use Hyper Text
Markup Language (HTML) to create Web pages.

Introduction to World Wide Web


A global hypertext system that uses the Internet as its transport mechanism is called World
Wide Web. In a hypertext system, you navigate by clicking hyperlinks, which display another document
(which also contains hyperlinks). What makes the Web such an exciting and useful medium is that the
next document you see could be housed on a computer next door-or halfway around the world. The
Web makes the Internet easy to use.

Evaluation of World Wide Web


Created in 1989 at the European Laboratory for Particle Physics (CERN), a research institute in
Switzerland, the Web relies upon the Hyper Text Transport Protocol (HTTP), an Internet standard the
specifies how an application can locate and acquire resources (such as document, sound or graphic)
stored on another computer on the Internet. HTTP provides transparent, easy-to-use access to Web
documents, FTP (File Transfer Protocol) file archives, Gopher menus, and even UseNet newsgroups,
users of Web navigation software, called Web browsers, simply click on an underlined word or phrase,
and HTTP takes care of locating and downloading desired documents:
Most Web documents are created using the Hyper Text Markup Language (HTML), a markup language
that is easy to learn and will soon be supplanted by automated tools. Incorporating hypermedia
(graphics, sound, animations, and video), the Web has become the ideal medium for publishing
information on the Internet.
A key 1995 agreement on security protocols ensures that Web will quickly become an important
commercial medium, in which consumers can browse on-line catalog and place orders without worrying
about anyone intercepting the supplied credit card information.

Basic Features
Basics of WWW,
1.
2.
3.
4.
5.
6.

Information System
Graphical Navigation
Platform
Distribution
Accessing Information
Interactive-ness
Detail study of above terms,

INFORMATION SYSTEM
As we know that Internet is full of information. It is up to you to explore it. The information is
available in the form of a book. As you can jump from one page to another, the Web also allows you to
skip from one page to another using the Hyperlink. Once this information is there on the screen, you
can copy, move or transfer it to your hard disk for later reading.
GRAPHICAL NAVIGATION
The best part of Web is that it can display both text and graphic with equal ease. Whats more
there are sites which have sound and video too. So much so that they can be called truly multimedia
applications.
PLATFORM
The World Wide Web is not limited to any one kind of machine, or developed by any one
company. The Web is entirely cross-platform. It can be displayed on any computer hardware using any
operating system.

DISTRIBUTION
When you run the Internet and access any site, everything works in RAM. There is no hard disk
space involved. The information is totally distributive and can be distributed anywhere even to the hard
disk, only if need be.
DYNAMIC
A Web site goes through various changes at various instances. So every time you load the site,
you get the latest site on your screen. You need not go to buy any new software or hardware for that
purpose. The web site developer would probably not come and tell you when the site is updated. So
while getting the data from the site make sure that you are getting the latest data.
ACCESSING INFORMATION
As mentioned earlier, Internet is full of information. You can get the information from the net using any
of the method, may be FTP, Usenet, Telnet or even e-mail. Although the most popular way of getting
the information is by using the software called browsers and the famous ones are Microsoft Internet
Explorer and Netscape Navigator. Web browsers can also read files from other Internet services and
you can create links to information on those systems just as you would create links to information on
Web pages.
INTERACTIVNESS
This means that web sites are not just one way information. There are sites which ask for
information from you, the users too. This information is mostly taken in the form of Forms. This way the
Web Site developers not only get the information but also are able to access and compile the
information in any form.

Web Browsers
A software program that is used to view Web pages is called a Browser. Browser help you to
connect to Web sites. You can surf the Web, view Web sites and/or download files and programs. A
Web browser works by using a protocol called Hypertext Transfer Protocol (HTTP) to request a
specially encoded text document from the web server.
Browsers are the most common interface available to the user to access resources from the web. It is
the thinnest form of the client possible. A thin client does not need any extra installation on the client
machine as a browser is normally available as part of the O/S (IE is available on Windows, but other
browsers like Chrome or Firefox have to be installed in the system). On the other hand a thick client
needs installation on each client. A pure internet architecture for internet based application is supposed
to work on thin clients like thebrowser. However,it is not just a line that separates thick vs. thin clients.
There are a lot of degrees of states in between. Accordingly we see a lot of extensions in the form of
Adobe Flash, Silver light, etc. that enhance the basic functionality. As long as they are standardized,
there is little of an issue otherwise, there will be installation headaches.
The early generation of Web browsers were predominantly text based, so even on low
bandwidth the performance was good. An example of such browsers include Mosaic.

10

The commonly used web browsers are Internet Explorer from Microsoft (IE), Mozilla Firefox
from the Mozilla Foundation and Safari from Apple. All of them are meant to perform basic task of
browsing. They all do it well, but they are far from being standardized.

Popular Web Browsers


To begin with, there was only one browser, named Mosaic, in use and it was available free of
cost. It was written by Marc Andreessen, who later started a new venture for making a new browser
called Netscape. With the passage of time, Microsoft incorporated in its browser Internet Explorer many
of the features for which Netscape used to be preferred by the users. They are:

Lynx
It is the text only browser and works on UNIX platform. It was developed by University of
Kansas. It requires the
emulation of terminal
VT-100.
Designing
pages that work equally
well in Lynx and in
graphical browsers is
one of the more
interesting challenges
of Web page design.

Netscape
Navigator

11

It is available for
Windows, Macintosh
Operating System and
UNIX too. It supports
features like integrated

news and mail reader, etc.

Microsoft Internet Explorer


No doubt the most popular web browser. It also runs on most of the operating systems. Whats
more it is usually free with Windows operating system.

ncSA Mosaic
To begin with this was the most popular browser for net. Slowly giving way to others. It was the
first full-colour graphical browser and is the instrumental in making the Web as popular as it is today. It
was developed by ncSA at the University of Ilinois. ncSA Mosaic is free for personal use and comes in
versions for Windows, Macintosh and UNIX (the X Window System).

Mozilla Firefox
Firefox has become popular because it is fast, full featured Web browser and makes browsing
very efficient. Firefox includes pop-up blocking, tab-browsing, integrated Google searching, simplified
privacy controls. It

Web Servers

12

A Web Server is a program that runs on a Web site and is responsible for replying to Web
browser requests for files. You need a Web server to publish documents on the Web. To avail the
services of Web pages Web sites need a hosta computer and server software that runs on the host.
The host manages communication protocols and related software required to create a web site on the
internet. The server software resides on host and serve up the pages and other wise acts on the
request sent from the client browser software.
In the context, the terms ever applied in different ways and related to a piece of software rather
than hardware. In a generic sense, a server refers to different contexts performing different roles-a
database server primarily host an a RDBMS: a front-end server takes care of routing requests. Likewise
a web server takes care of handling web requests. It does not matter whether these roles are served in
the same physical hardware or distributed across multiple machines or even if a single role requires
multiple machines to perform.
In its simplest sense, a web server is a software program that accepts HTTP requests from
clients, passing it on to another server and serving the response back to the client. The response pages
usually offer specific format called HTML.
A server application can run on same the same computer as the client application using it, or
they can connect through a computer network. Examples, includes file server, database server, backup
server, mail server, print server, web server, ftp server, firewall, etc.
A server operating system intended, enable or better able to run server applications. The
differences between the server version and the workstation version of an operating system vary
sometimes the primary differences is the removal of arbitrary license dependent limits on the number
of networks file share connection accepted. Some server edition includes additional server applications
bundle with the operating system. Some server applications impose arbitrary limits on the number of
HTTPs accepted depending on wheatear they are running under a server operating system or not.
A computer i.e. designated for only one server application is often named for that application.
For example, when Apache HTTP server is a companys web server, the computer running it is also
called the web server. Server applications can be divided among server computer over an extreme
range depending upon the work load. Under light loading, multiple server computers may be required
for each application. Under medium loading, it is common to use one server computer per server
application, in order to limit the amount of damage caused by failure of any single server computer or
security breach of any single server application. Any server computer can also be used as a
workstation, but it is avoided in practice again to contain risk.

To upload a web site a host computer and server software is needed. This manages the
communications protocols and the related software required to create a web site on the internet. The
host machine usually uses UNIX. There are different types of server available that perform different
types of services for different types of clients. Specifically, a web server is an HTTP server and its
function is to send information to the client software using the HyperText Transfer Protocol. The client
browser requests and the server return a Hypertext Markup Language (HTML). Document or MHTML
document the server receives the requests and sends backup response.
A web server mostly performs the following jobs:
1. The server software includes configuration files and utilities to secure and manage the web
site in a variety of ways.
2. It passes requests to run CGI (Common Gateway Interface) scripts to the CGI applications.
These scripts run external mini-programs, such as a database lookup or interactive forms
processing.
Internet browser software sends its request for data to the host, and the Web server software
processes the requests. Included in the browsers requests is the desired information and the file
formats the browser can accept. If the browser asks for an HTML file, the Web server retrieves the file,
attaches a header to the file, and sends it to the browser. If the browser has asked for specific database
information, the web server will pass a request through CGI to the application, which performs a data
base lookup. The CGI scripts returns the result to the web server which in turn attaches a header to the
data and sends it to the browsers.

HTTP (HyperText Transfer Protocol)


It is used as the communication protocol to transport hypertext documents over the Internet. It
tells the server what to send to the client, so that the client can view web pages, ftp sites or another
area on the Net. It is the mechanism that opens the related documents when you select the HyperText
link, no matter where that document resides on the web. It is non-proprietary, platform independent,
open document architecture which works on most of the computers.
It is a method of preparing and publishing text ideally suited to the computers in which readers
can choose their own paths through the material. To prepare hypertext you first divide the information
into small, manageable units, such as single pages of text. These units are called nodes. You then
embed hyperlinks in the text. When the reader clicks on hyperlink, the hypertext software displays a
different node. The process of navigating among the nodes link in this way is called browsing. A
collection of nodes that are interconnected by hyperlinks is called a web. The World Wide Web is hyper
text system on a global scale.
Hypertext transfer protocol is a protocol that utilizes TCP to transfer information between
computers connected on a web-this includes web server and client as well. This is the communication
mechanism that browsers used to exchange data from clients and servers. The client makes an HTTP
to a web server using a web browsing, and the web server sends the requested information to the
client. This standard port used for connection between the client and server is 80.

URL
A URL describe the location and the method of accessing a resource on the Internet. For
example, the URL http://www.msn.com/ describes the name of the site of MSN. The term URL is also
used to describe the address system used on the web.

13

Each URL is unique, identifying a particular point on the internet. It consists of a string that
supplies the Internet address of a resource on the WWW along with the protocol by which the resource

is access. The most common URL is http which gives the internet address of a World Wide Web
page. Some other URL is types are gopher which gives the internet address of a Gopher directory, and
ftp which gives the address of FTP resource. All the web sites have URLs. One could say a URL is
what a telephone number is to a telephone or a street address is to a house.

LOCATING A PAGE BY ITS URL USING INTERNET EXPLORER


Every page on the Web has a unique address, called a Uniform Resource Locator shows the
components that make up a URL. The URL used as the example in the following explanation is:
http://www.entertainment.satyamonline.com/contents/horoscope.asp
http://

HTTP is the protocol used to transmit web pages for this URL

www.entertainment.satyamonline.com
The server where the page is located. Every domain name
is unique. Here internet service Provider is Satyam Online.
contents/

the directory where the page is located.

horoscope.asp

the filename of the web page.

Search Engines
They are program that locate needed information in a database, but specially an internet
accessible search service that enables you to search for information on the internet. To use a search
engine, you type one or more keywords: the result is a list of documents or files that contain one or
more of these words in their titles, descriptions, or text. The database of most internet search engines
contain World Wide Web documents: some also contain items found in Gopher menus and file transfer
protocol file archives.
Compiling the data base requires an automated search engine called a spider, forms filled out
by web authors, or a search of other database of internet documents. the effectiveness of search
engine can be measured in terms of indexing and term specify. Since the internet is a vast collection of
information, it is difficult to find specific information you specially need. Therefore, search engines are
more than a boom.
the World Wed Web is a huge repository of distributed information, but can be search as if the
information is present in a single repository. A web search engine is used to search for information
based on keywords entered by the user. Google, Yahoo!, and Bing are the most popular search
engines today and need of introduction about the way they is used and the way results are presented.

Search Engine Categories

14

Some of the popular search engines are:

Yahoo!
Lycos
Web Crawler
Excite
Google
Bing

Yahoo!
No doubt, the most popular and commonly used search engine. It is very simple to use and
mostly gives you the best results. It indexes web pages, Usenet and email address. The search has 14
categories listed on its home page. Each of these categories is divided into several sub categories. If
the search fails, it is automatically passed on to Alta vista for more searches. Yahoo provides other
services to like email accounts, sites, NEWS, searches to locate people, etc.
Lycos
This one has robot to help the software locate the needed in search. For each document index
the robot keeps the outgoing links in a queue and selects URL from it. Lycos indexes, titles, headings,
and sub headings of HTML, FTP and Gopher documents.
Web Crawler
It has a web robot called the Webbot that creates a daily index of keywords from documents all
over the web. The search engine directs the navigation in a modified breadth first node. It indexes both
the title and the full text of HTML documents.
Excite
It uses a spider and indexer for the full text search of documents. The spider retrieves only the
web and Usenet, NEWS Group documents. It offers services like search that are case sensitive. It uses
the various Boolean operators.
Google
Google's search engine is a powerful tool. Google uses a special algorithm to generate search
results. Google uses automated programs called spiders or crawlers, just like most search engines.
Also like other search engines, Google has a large index of keywords and where those words can be
found. What sets Google apart is how it ranks search results, which in turn determines the order
Google displays results on its search engine results page (SERP). Google uses a trademarked
algorithm called PageRank, which assigns each Web page a relevancy score.
Bing
Bing's results win in terms of the smoothness of its social integrations. The company's contracts
with both Facebook and Twitter give it access to more social data than Google, which must rely on the
lesser-used Google+ network. The way it integrates social recommendations into its SERPs is also
much less cluttered than Google's results. Bing's search results pages have a certain visual appeal.

15

Search Criterion
So far we have learn about the various search engines. Let us know in brief know how these
search engines work. As was mentioned in the case of various search engines above, each one of
them uses either the directories, spider, and even the robots for searches. A spider strips a way many
other markup features so that it simply sees the pure HTML source. Search sites are basically are of
two types,
Search Directories
They contain a list of websites organized hierarchically into categories and sub categories.
These are created manually.
Search Engines
It continuously sends out the so called spiders which stats on a home page of a server and
pursue all links stepwise.
When you open a search engine, it asks you what search. If it is to be case sensitive, make sure that
you are using the right case. Click OK to begin the search and the results is shown in the form of URLs
leading to the Pages you need.

How to search using Search Engines?


For searching for any information, just type the data to be searched, in the space provided at
the search engines current page, and click search. The results will be displayed with information
corresponding to the search in the form of clickable URLs leading to the pages you seek. In some
search engines, the data is related only after additional processing.

16

Das könnte Ihnen auch gefallen