Hypertext markup language. Hypertext Markup Languages SGML
In 1989, hypertext represented a promising new technology that had a relatively large number of implementations, on the one hand, and on the other hand, attempts were made to build formal models of hypertext systems that were more descriptive in nature and were inspired by the success of the relational approach to describing data.
HTML is a hypertext markup language used to encode documents. The HTML language is a set of commands according to which the browser displays the contents of a document; HTML commands are not displayed. The HTML language implements a hypertext linking mechanism that allows one document to be linked to others. These documents may be located on the same server as the page from which they are linked, or they may be hosted on a different server.
The HTML idea is an example of an extremely successful solution to the problem of building a hypertext system using a special display control tool.
Contextual hypertext links were recognized as the most effective form of hypertext organization, and in addition, the division into links associated with the entire document as a whole and with its individual parts was recognized.
All HTML documents have the same structure, defined by a fixed set of structure tags. An HTML document should always start with a tag< HTML >and end with the appropriate closing tag ( HTML>). There are two main sections within a document: the headings section and the body of the document, in that order. The headers section contains information that describes the document as a whole and is limited by tags<НЕАD>AndНЕАD>. In particular, the headings section should contain the general title of the document, delimited by the paired tag<ТITLE>.
ТITLE>). However, it is not recommended to omit structure tags when creating an HTML document. The simplest valid HTML document containing all the tags that define the structure might look like this:
< TITLE >Document title< /TITLE >Document text
HTML elements.
For paired tags, the scope is defined by the portion of the document between the opening and closing tags. This part of the document is considered an element of the HTML language. So, we can talk about a “BODY element” that includes the tag, the body of the document and the closing tag BODY >. The entire HTML document. can be thought of as an "HTML element." For unpaired tags, the element is the same as the tag that defines it.
Most elements of the HTML language. describes parts of the document's content and is placed between tags
. And, that is, inside the BODY structural element. Such elements are divided into block and text. Block elements refer to paragraph-level pieces of text. Text elements describe the properties of individual phrases and even smaller parts of text.Now we can formulate rules for nesting elements.
Elements must not intersect. In other words, if the opening tag is located inside an element, then the corresponding closing tag must be located inside the same element.
Block elements can contain nested block and text elements.
Text elements can contain nested text elements.
Text elements cannot contain nested block elements.
Functional block elements.
In most documents, the main functional elements are headings and paragraphs. HTML language. supports six levels of headings. They are specified using paired tags from<Н1>before<Н6>. When displayed, Web documents are displayed using this method; tag (document on the computer screen, these elements are shown using fonts of different sizes.
Regular paragraphs are specified using a paired tag<Р>. HTML language. does not contain a means for creating a paragraph indent (“red line”), so when displayed on a computer screen, paragraphs are separated by a blank line. Closing tagР>is considered optional. It is understood that it comes before the tag, which specifies the beginning of the next paragraph of the document. For example:
Heading
<Р>First paragraph<Р>Second paragraph
Second level headingН2>
A consequence of having a special tag that defines a paragraph is that the usual end-of-line character entered by pressing the ENTER key is not enough to create a paragraph indentation. HTML language. treats end-of-line characters and spaces in a special way. Any sequence; consisting only of spaces and end-of-line characters, is treated as a single space when the document is displayed. This, in particular, means that the end-of-line character does not even lead to a new line (a text element specified by an unpaired tag is used for this purpose
.
A horizontal ruler can also be used as a paragraph delimiter. This element is specified by an unpaired tag
. When a document is displayed on the screen, a ruler separates parts of the text from each other. Its length and thickness are specified by the tag attributes
.
This tag creates a 10 pixel wide horizontal ruler that takes up half the width of the window and is positioned to the right.
Website creation is one of the widely available opportunities in the modern Internet industry. The actual creation of websites is, in principle, not much more difficult than creating personal email accounts and electronic business cards.
To create a website, first of all, you need a server connected to the Internet on which you can place the necessary hypertexts. In addition, it is necessary to register the saiga name with the provider serving the selected server.
On the Internet you can find providers offering free opening of websites on their servers. Free sites can be opened on domestic servers narod.ru, boom.ru, hotmail.ru and on foreign servers, for example geocities.com, tripod.com.
On these servers you can register domain names like:
<имя>. narod.ru
name>.boom.ru,
Examples of registered domain names:
wdu.da.ru - website of the electronic university;
wduniv.newmail.ru - website of a distributed university.
After registering a site's domain name, you can host hypertexts on it. Hypertexts are placed on the site using special programs that allow you to create, edit, accumulate and copy a wide variety of hypertexts. Immediately after the placement of the very first (main) hypertext page, its information can be read using a browser in any country from any computer connected to the Internet. To do this, enter the website address on the Internet in the browser window. For example: http://bak.boom.ru
All posted files must be hypertexts, written in HTML format and having identifiers of the form<имя>.html.
HTML is a hypertext markup language.
By structure, hypertext is text with links to other hypertexts located on this server or on other servers. When you click on such a link, the browser automatically loads a hypertext page onto your computer screen, regardless of what server it is on and in what country it is located.
Using these tools and programs on the Internet, a wide variety of information sites and systems can be created - personal sites, company sites, electronic newspapers, magazines, electronic books, encyclopedias, as well as electronic archives and libraries.
The difference between sites is the amount of information, their structure and updating procedures. In general, for Internet sites, as for any organization, we can talk about the life cycles of their creation, development, modernization and liquidation.
The volume of information is determined by the owners - people or organizations that created sites and post their information on them. The amount of information on websites can range from several kilobytes to several gigabytes (millions of kilobytes).
The structure of sites can be very diverse. The simplest structure is a main page with links to a set of texts. These links can be in the text of the main page or highlighted in the table of contents at the beginning of it.
Each page of the site can be provided with a title, which appears on the top line of the screen when the site is loaded by the browser.
In addition, on the main page of the site you can specify a list of keywords for search engines.
Search engines weekly scan all servers on the Internet and record the addresses of all sites and hypertexts found along with the keywords highlighted in them. For these reasons, no later than a week later, any information published on the Internet can be found using the keywords contained in them.
Hypertext markup language (HTML) is most often used to create electronic training programs.
This choice is due to the fact that, along with the ease of creating this type of document, hypertext markup language has enormous capabilities, such as outputting formatted text, using graphic objects of almost all known formats, using a background image, inserting objects such as background sound, video and etc.
In addition, HTML makes it easy to organize links to other objects or fragments of text in the document itself.
The great advantage of HTML is that most modern tools (such as text and graphic editors, visual programming languages, Internet Explorer...) support working and saving documents in HTML format.
Therefore, HTML is often used to create such software products. However, creating various types of demos, testing procedures and surveys, in my opinion, is still made easier with the help of visual programming languages.
Therefore, this thesis examines the integration of various tools for creating training, testing programs and electronic textbooks.
However, the use of HTML documents greatly facilitates the writing of the theoretical part of the program and makes it more lively. Let's look at a few issues related to creating HTML documents. You can work on the Web without knowing the HTML language, since HTML texts can be created by various special editors and converters.
However, it is better to write directly in HTML, or at least monitor and modify the HTML code occasionally. Writing directly in HTML is not difficult. It may even be easier than learning an HTML editor or converter, which are often limited in their capabilities, buggy, or produce bad HTML that doesn't work on various platforms.
The first version of HTML was developed in the early 90s by Tim Beners-Lee for the formerly popular Mosaic browser. But in those days, neither the browser nor the language itself had yet found a worthy use. HTML+ appeared in 1993, and this version also went virtually unnoticed. The widespread use of hypertext began with version 2.0, which appeared in June 1994.
This was the moment when WWW began to grow in popularity around the world. The elements included in version 2 are, for the most part, still in use today.
Version 3.0 of HTML, which appeared a year later, introduced the ability to draw mathematical symbols (integral signs, infinity signs, fractions, parentheses, etc.) using language elements. Browsers (Arena) were also developed for this version. But this project turned out to be a dead end and did not receive further distribution.
In 1996, HTML version 3.2 appeared. This was an innovative solution; it is enough to mention that frames were introduced into the language specification, which have now become very popular among Web page developers.
Even now, very good design solutions can be implemented based on this specification. Almost all modern browsers fully support version 3.2, so the authors have no doubts about the functionality of the declared elements.
Along with the official language specifications that were developed by the W3C organization (W3 Consortium), browser manufacturing companies created their own elements (extensions).
Subsequently, some of these elements, after gaining general acceptance, were included in the specification of the next version of the language. It is interesting, for example, that an innovative solution - frames - that many developers loved, was not included in the 3.2 specification.
But browsers supported frames, and many books on HTML included descriptions of frames without mentioning that they were non-standard elements. And it was right, because frames became the de facto standard. They were already included in language version 4 for good reason.
Conversely, the APPLET and SCRIPT elements needed to extend HTML with other code did not play the role they were intended to play in version 3.2.
This was explained by the fact that browsers of different versions interpreted programs in Java, JavaScript, and Visual Basic VBScript differently. As a result, it was not possible to obtain code that worked reliably enough, and these languages were used by HTML enthusiasts mainly for experiments.
The official HTML 4 (Dynamic HTML) specification appeared in 1997. At this time, it was already obvious that the further development of hypertext would be carried out through script programming. This turned out to be much more effective than introducing new elements into the language.
The browsers that appeared at that time (Netscape Navigator 4, Microsoft Internet Explorer 4, etc.) already interpreted the program code quite reliably (a certain level of standardization was achieved). However, the developers still have problems. As an example, it can be noted that many scripts begin by determining the browser version in order to then use this or that piece of code.
Obviously, the programmer is responsible for testing pages on all currently popular browsers. In addition, the problem of using old or not very popular programs remains relevant. Microsoft and Netscape are rightfully considered the leaders in browser development, but there are also other companies.
As a result, using all the capabilities of Dynamic HTML has become the responsibility of programmers in fairly large organizations, where there are conditions for developing complex programs and their comprehensive testing. Creators of personal Web pages sometimes have to compromise between reliability and innovation in order to obtain sufficiently competent HTML code.
Anatomy of a Web Page
Below is a sample of a typical Web document. In this example, we will look at the structure of HTML pages.
Example (template) of a Web page
<Т1Т1Е>Web page structure
If you look at the source texts of various Web pages, you can easily see the similarity of their structures. This is explained by the fact that documents are created according to certain rules.
The syntax of the HTML language is based on the ISO 8879:1986 standard "Information processing. Text and office systems. Standard Generalized Markup Language (SGML)". True, there is a big difference between the official standard and the actual standard. HTML is constantly evolving, supplemented with new elements, and it should be studied not from official primary sources, but in practice, turning to the latest developments of leading companies and specialists.
To understand the structure of a Web page, you need to consider the weight of the elements included in the listing above. When considering language elements, we will use both tags: start and end.
For example: . This can be emphasized that in most cases the developer should use two tags for each element. The number of cases where only a start tag is allowed (some elements have no end tag at all) is small, and they are specifically specified. For tag names, you can use both uppercase and lowercase letters of the Latin alphabet.
Some users write starting tags in uppercase letters and ending tags in lowercase letters. This helps you understand the source text of a Web page.
HTML syntax.
HTML document notation. It was mentioned above that one of the principles of the language is multi-level nesting of elements. This element is the outermost one, since the entire Web page must be located between its start and end tags.
In principle, this element can be considered as a formality. It has the version, lang and dir attributes, which are rarely used in this case, and allows the nesting of HEAD, BODY, FRAMESET and other elements that determine the overall structure of the Web page. Naturally, the end tag