What is HTTP?
HTTP, the Hypertext Transfer Protocol, is the application-level protocol that is used to transfer data on the Web. HTTP comprises the rules by which Web browsers and servers exchange information.
HTTP is a set of standards that allow users of the World Wide Web to exchange information found on web pages. When wanting to access any web page enter http:// in front of the web address, which tells the browser to communicate over HTTP. For example, the full URL for Computer Hope is http://www.computerhope.com. Today’s modern browsers no longer require HTTP in front of the URL since it is the default method of communication. However, it is still used in browsers because of the need to access other protocols such as FTP through the browser.
Although most people think of HTTP only in the context of the World-Wide Web, it can be, and is, used for other purposes, such as distributed object management systems.
How Does HTTP Work?
HTTP Is a request-response protocol. For example, a Web browser initiates a request to a server, typically by opening a TCP/IP connection. The request itself comprises
- a request line,
- a set of request headers, and
- an entity.
The server sends a response that comprises
- a status line,
- a set of response headers, and
- an entity.
The entity in the request or response can be thought of simply as the payload, which may be binary data. The other items are readable ASCII characters. When the response has been completed, either the browser or the server may terminate the TCP/IP connection, or the browser can send another request.
As an illustration of HTTP, here is an example exchange between a Web browser and the BBC server, www.bbc.com. In response to a user request to go to the URL ;
the browser sends the following HTTP request to www.bbc.com:
GET / HTTP/1.1
User-Agent: Mozilla/5.0 (compatible; Konqueror/2.2-11; Linux)
Accept: text/*, image/jpeg, image/png, image/*, */*
Accept-Encoding: x-gzip, gzip, identity
Accept-Charset: Any, utf-8, *
Accept-Language: en, en_US
-- blank line –
A brief explanation:
- The first line is the request line that comprises three fields;
- a method: The GET method indicates that the server is supposed to return an entity.
- a request – URI (Universal Resource Identifier). The / indicates the root of the document system on the server, and
- HTTP protocol version: 1.1 in this case.
- The second line is the optional Connection header informs the server that the browser would like to leave the connection open after the response.
- The third line is the optional User-Agent header that identifies the kind of browser that is sending the request, its version, and its operating system.
- The Accept headers specify the type, language, and encoding for the returned entity that the browser would prefer to receive from the server.
Responding to the browser, the www.bbc.com server sends the following response:
HTTP/1.1 200 OK
Date: Thu, 24 Jan 2002 17:33:52 GMT
Last-Modified: Mon, 21 Jan 2002 22:08:33 GMT
-- blank line--
-- HTML entity --
A brief explanation:
The first line is the status line consisting of three fields:
- HTTP protocol version of the response: 1.1 in this case,
- a three-digit numeric status code, and
- a short description of the status code.
The Content-Length, Content-Type, Etag, and Last-Modified header lines describe the entity returned.
HTTPS (Secure HTTP)
HTTPS denotes the use of HTTP with SSL (Secure Socket Layer) protocol or its successor protocol Transport Layer Security (TLS), a transport-layer protocol. Either of these protocols, which use encryption, can be used to create a secure connection between two machines. The browser uses SSL or TLS when connecting to a secure part of a website indicated by an HTTPS URL, that is, a URL with the prefix https://. The browser then uses HTTP to send and receive requests over this secure connection.