This is a little background information on HTML and URLs. For more information type mosaic http://fire.clarkson.edu/doc/html/htut.html ============================================================= HTML HTML documents consists of headers, paragraphs, lists, tables and figures. A simple example of an HTML document is: -------------------------------------------------------------
This angle-bracket-p represents a new paragraph. Note that unlike title and header tags the matching end tag is not needed.
-------------------------------------------------------------
This would look like (as best can be illustrated in this email at least)
as follows where my comments are enclosed below in parentheses():
-------------------------------------------------------------
HTML Document Title (appears on separate line in Mosaic viewer)
First level heading (24 point bold)
(blank line inserted above)
Second level heading (18 point bold)
(blank line inserted above)
Text lines that exceed the window width are automatically wrapped around
by the viewer. Note italicized can be embedded within (begin italicized
section) parts of a line of text (end italicized section) if you wish.
(blank line inserted above)
This angle-bracket-p represents a new paragraph. Note that unlike title and header tags the matching end tag is not needed.
-------------------------------------------------------------
The document text includes tags which are labels giving formatting
instructions to a viewer. Tags are represented as enclosed in . When a matching end tag is needed it is preceded by the "/"
character as in the case for headers where start new paragraph
.....
. Tag and
attribute names are not case sensitive. Unrecognized tags are
ignored.
There are tags for paragraphing, lists, headers, lists, tables, images, and
figures. Tags may also be accompanied by attributes, e.g. in some tags the
name destinations for hypertext links may be used.
General HTML Structural tags
... encloses the entire html file
... encloses a header, where i=1,...,6 (levels of headers)
Paragraphing tags
note: Blank lines, tabs, newlines, and extra spaces are ignored in an HTML.
carriage return
draw a long horizontal line
....
text is written verbatim
Lists
note: Nesting is allowed with lists.
unnumbered enumeration lists of items following each
numbered lists
definition lists
Font tags
... boldface
... italics
... fixed-width font
... underline
Escapes (a space must come after each escape sequence)
< for <
> for >
& for &
" for "
Unlike the fine control provided (e.g.) by LaTeX, HTML does not provide
fine precise layout instructions; hence the viewer must make many
decisions about exact layout and presentation. Different viewers will
often render documents slightly differently because of the HTML imprecision.
Sometime this is annoying when trying to specify exact font names and
point size, the margins, tab settings and how much white space to leave before
and after things. Subscripts and superscripts are not currently part of
HTML but may be in the near future.
-------------------------------------------------------------
URL
URL stands for uniform resource locator. URL is a pointer to a file.
There are many variations and subtleties about its format and interpretation.
Of particular important and interest is that it can point to a file on
a remote machine that has a world-wide-web node. One needs only to
specify an access protocol, a machine name, and a relative file
pathname. For example, http://www.eit.com/xyz/abc would mean: an
access protocol http (hypertext transfer protocol), site name
www.eit.com, and xyz/abc as the relative path to the file abc. For
example, the URL to an anonymous ftp site, ftp.cs.brown.edu, and the
file README would be: ftp://ftp.cs.brown.edu/README
Interestingly Mosaic/WWW can transparently handle many different protocols
including: ftp, gopher, and http.
Links to Other Documents
... fname will be displayed, when text inbetween
bracketted tags (which appears underlined)
on the screen is clicked with mouse
where the file extension on fname could be one of the following
.txt plain text
.html html markup text
.gif image text
.ps postscript
Also, fname is one of two forms,
1. relative
ex. When viewing the CS department home page //cs.nyu.edu/,
the current directory is
/usr/httpd/htdocs
Within this home page, a link to Bob Paige's home page, which is
stored in file /usr/httpd/htdocs/cs/faculty/paige/index.html, could
be stored as a relative link in which "fname" would be
cs/faculty/paige/index.html
Within Bob Paige's home page directory, a relative link to ps file
atlantique.ps, which is stored in that directory would just be
atlantique.ps
2. absolute
Here, fname has the form
scheme: //host.domain/path/filename
where scheme is,
file file is on an anoymous ftp server
http file is on a web server
gopher file is on a gopher server
-------------------------------------------------------------