31
CS 898N – Advanced World Wide Web Technologies Lecture 3: The Internet and World Wide Web Chin-Chih Chang [email protected]

The Internet and World Wide Web

Embed Size (px)

DESCRIPTION

 

Citation preview

Page 1: The Internet and World Wide Web

CS 898N – Advanced World Wide Web Technologies

Lecture 3: The Internet and World Wide Web

Chin-Chih [email protected]

Page 2: The Internet and World Wide Web

A Network of Networks

• The Internet has been made possible by the use of standard data communications protocols. Every computer on the Internet understands this specific set of protocols.

• A communication protocol is a standardized method for transmitting data between computers in a way that it can be sent, received, and processed without error.

Page 3: The Internet and World Wide Web

World Wide Web

• The World Wide Web (WWW, Web) was originally designed by Tim Berners-Lee as a global hypertext project in 1990.

• Hypertext is a method of linking text together.

• Hypertext Markup Language (HTML) is a language for hypertext layout.

• The purpose of the original Web browser, Mosaic, was to display formatted hypertext.

Page 4: The Internet and World Wide Web

World Wide Web

• The theory is the hypertext can create a unified knowledge base that united all the information in the universe into an interlinked whole.

• If we were to cross-reference every relevant piece of information with every other, our Web documents would represent a complete and formidable knowledge engine.

Page 5: The Internet and World Wide Web

World Wide Web• When the Web documents can be further

forms of data storage, such as audio and video, we come up with a larger concept called hypermedia, all kept in a world called hyperspace.

• HTTP is the primary protocol that all Web browsers are programmed to use.

• HTTP is HyperText Transfer Protocol. This is the protocol for transferring hypertext information on the Internet.

Page 6: The Internet and World Wide Web

Domain• A domain was one of the main hosts or

subnetworks of the Internet, a domain name was a way to access that specific host or network.

• A domain name is the central part of the Internet address.

• Domain names are split into two parts: the first (or top) level and second level.

• The second-level domain is the name you choose.

Page 7: The Internet and World Wide Web

Domain Name• The first-level domain is the extension. The

first-level domain is assigned according to what kind of domain it represents.

• You can check the up-to-the-minute status of all of the top-level domains at www.iana.org/domain-names.html

• You can register the domain name of your choice at Network Solutions, the registration arm of InterNIC (Internet Network Information Center).

Page 8: The Internet and World Wide Web

URL

• The Uniform Resource Locator (URL) is used to find an exact target within a domain.

• URL can be broken down into five parts:– the protocol designator such as http:// or ftp://,– the subdomain name,– the actual domain name,– the port number,– the path of a specific file to access.

Page 9: The Internet and World Wide Web

The Internet and URL

• IP means Internet Protocol. Every domain on the Internet is assigned a unique number. This number is 12 digitals long (four sets of 3 digits each) and is called the IP address.

• When you type in a domain this is translated into the 12-digital number.

• The organizations maintaining the IP address list publish an Internet phone book.

Page 10: The Internet and World Wide Web

From Browser to Server

Browser

ProviderInternetRouters

Server

Page 11: The Internet and World Wide Web

From Browser to Server• The browser calls a program to make a dial-

up connection to your local ISP access number.

• The provider’s end runs a program that constantly checks for incoming calls for the connection.

• Routers use the numeric addresses to route traffic from source to destination and back.

• The server runs a program awaiting an incoming request.

Page 12: The Internet and World Wide Web

Server

• A server is a computer with two features: – It’s hardwired into the Internet,– It has a great deal of specialized server software.

• To set up a Web server, you need a server software.

• The Apache server is available to download without cost at www.apache.org.

• You can have a series of options of services.

Page 13: The Internet and World Wide Web

Internet Architecture• The architecture is a specification that

defines exactly how electronic communication will occur between computers on the Internet.

• The Internet architecture is based on the network architecture.

• The OSI 7-Layer Reference Model [ISO,1984] is a guide that specifies what each layer should do, but not how each layer is implemented.

Page 14: The Internet and World Wide Web

Internet Architecture• OSI 7-Layer Reference Model

– Application layer: various applications (ftp, http)

– Presentation layer: present data in a meaningful format

– Session layer: provide session semantics (RPC)– Transport layer: reliable end-to-end byte stream

(TCP)– Network layer: unreliable end-to-end

transmission of packets

Page 15: The Internet and World Wide Web

Internet Architecture• OSI 7-Layer Reference Model (continued)

– Data link layer: reliable transmission of frames– Physical layer: unreliable transmission of raw

bits

• The conceptual intention here is that each the software which implements each layer communicates with its Peer Layer software, using services provided by the lower layers.

Page 16: The Internet and World Wide Web

Layered Architecture• TCP/IP stands for a combination of

Transport Control Protocol/Internet Protocol.

• The TCP layer takes responsibility for ensuring the communication is completed.

• The TCP layer converts messages that are handed to it by the application layer into TCP format by adding the TCP control information to the front of message, now called a TCP header.

Page 17: The Internet and World Wide Web

Layered Architecture• The TCP layer then hands the whole

message over the IP layer.• The IP layer takes responsibility for

ensuring that the communications are correctly routed.

• The physical layer performs the transmission of the data.

• At each level, the protocol handling a data block either adds its protocol-specific information or removes it from that data.

Page 18: The Internet and World Wide Web

Communications Protocols

• The connection between browser and provider is accomplished in four steps:– The modem connection,– Login– IP connection– HTTP connection

• Refer to these sites for more information: www.internic.net, www.iana.org, www.arin.net, www.nic.gov

Page 19: The Internet and World Wide Web

The HTTP Connection

• The HTTP protocol is text-based.

• HTTP headers:– GET: identifies the request as HTTP version

1.1.– Accept: identifies what image formats are

accepted.– Accept-Language: specifies the language used.– Accept-Encoding: specifies the data

compression.

Page 20: The Internet and World Wide Web

The HTTP Connection• HTTP headers (continued):

– User-Agent: identifies the user agent.– Host: requests the homepage.– Connection: specifies to keep the connection

open.– Extension: Something about security.

Page 21: The Internet and World Wide Web

The Domain Name Server• The provider’s end convert the domain

name for the Web page requested into an IP address.

• The originating server calls a program called a name resolver. This program accesses a table on the server with the addresses of the local name server.

• The name server will either have the IP address on the requested DNS (Domain Name Server) or query a remote name server.

Page 22: The Internet and World Wide Web

The Domain Name Server• The domain name system is set up in a

hierarchical fashion.• The application eventually looks up for the

root name server. The root name server will replay the address resolution request to the appropriate server of the requested domain.

• The scheme follows the network numbering scheme, also called dotted decimal notation.

• The ping command checks if a machine responds.

Page 23: The Internet and World Wide Web

The Domain Name Server

• IP addresses are handed out according to the size of the network.

• The actual number handed out is called the network (or subnet) mask, because the network addresses will have that part as a fixed value with the rest of the address variable.

Page 24: The Internet and World Wide Web

Packet Switching

• The Internet is a packet-switched network. All data is packaged in TCP and IP headers and sent through routers.

• A packet is a block of data packaged for transmission. Data packets are smaller pieces of a larger block of data that is broken down and sent in the individual packets, then received and reassembled.

Page 25: The Internet and World Wide Web

Communication Cycle

• In packet switching, individual packets of data may go one way or another, their route switched according to what is most efficient at that time.

• In page 48 an example of the communication cycle is illustrated.

Page 26: The Internet and World Wide Web

The Internet as a Managed Network

• There are two categories of organizations trying to keep the Internet in order:– The Internet Society (ISOC) consisting mostly

of individual members.– The W3C (World Wide Web Consortium)

consisting entirely of corporate memberships.

• ISOC is also at the top level of a hierarchy of Internet organizations.

www.isoc.org

Page 27: The Internet and World Wide Web

Internet Organizations• ISOC provides leadership in addressing

issues that confront the future of the Internet, and is the home for the groups responsible for Internet infrastructure standards, including the Internet Engineering Task Force (IETF) and the Internet Architecture Board (IAB).

• The IAB is a technical advisory group of the Internet Society.

www.isi.edu/iab

Page 28: The Internet and World Wide Web

Internet Organizations

• The IETF is engaged in the development of new Internet standard specifications.

www.ieft.org

• The Internet Engineering Steering Group (IESG) is part of IETF and is responsible for technical management of IETF activities and the Internet standards process.

www.ieft.org/ietf

Page 29: The Internet and World Wide Web

Internet Organizations

• The Internet Research Task Force (IRTF) is also a part of ISOC. Its purpose is a more farsighted version of IESG.

www.irtf.org

• The Internet Assigned Numbers Authority (IANA) is responsible for assigning a unique identifier to everything involving a standard or protocol that needs one.

www.iana.org

Page 30: The Internet and World Wide Web

Internet Organizations

• The World Wide Web Consortium (W3C) is to develop common protocols to enhance the interoperability and lead the evolution of the World Wide Web.

www.w3c.org

• RFC means Request for Comment. RFCs contain all of the protocols in use throughout the Internet.

Page 31: The Internet and World Wide Web

RFC

• The IETF recommends and approves working group that are run by the IESG under the IETF.

• These working groups tackle the task of putting together a specification:– Internet draft– Proposed standard– Draft standard– Internet Standard