Want to create interactive content? It’s easy in Genially!

Get started free

Infographic about Markup Languages

Nicolás Navarro Weis

Created on November 28, 2023

Start designing with a free template

Discover more than 1500 professional designs like these:

Transcript

Infographic about Markup Languages

01

03

What is a markup language?

UNICODE

02

04

ASCII

KINDS OF MARKUPS

Infographic about Markup Languages

05

07

09

SGML

XML AND XHTML

POST SCRIPT AND JSON

06

08

HTML

tex y latex

UNICODE

Unicode defines three encoding forms under the name UTF: UTF-8: It uses variable-length symbols (ranging from 1 to 4 bytes per character depending on the symbol being represented). It includes the 7-bit ASCII specification, ensuring that any ASCII message is represented without changes. UTF-16: In contrast to UTF-8, it uses 2 or 4 bytes to represent each character. UTF-32: It uses 32 bits (4 bytes) to represent each character.

SGML

In the 1970s, IBM faced the need to store vast amounts of diverse information. To address this, IBM experts led by Charles Goldfarb developed GML (Generalized Markup Language), a language designed to classify information comprehensively and write documents for proper processing. GML gained popularity, leading to its standardization by ISO in 1986 as SGML (Standard Generalized Markup Language). Essentially, SGML was the standardized version of the original GML, established through ISO 8879. SGML (Standard Generalized Markup Language) is a text-based language used for marking data, i.e., adding self-descriptive metadata. These markings enable the logical structuring of text and specify how it should be presented on the screen.

XML AND XHTML

XML, or eXtensible Markup Language, introduced in 1996 and adopted by the W3C in 1998, is a standardized language that emphasizes extensibility by allowing the creation of unlimited tags. Serving as a metalanguage, XML doesn't prescribe tag usage but provides syntactic rules for creating documents. Its main advantage lies in data independence from presentation, making it suitable for various applications beyond the web, such as databases and text editors. XML facilitates self-description of data, as demonstrated in an example of representing book information. Its versatility extends to website development, application communication, and program configuration, making it a fundamental technology for secure and reliable information sharing across systems. XHTML, a reformulation of HTML as an XML application according to W3C's recommendation, maintains HTML 4.01 features but introduces XML restrictions. It addresses compatibility issues across platforms and browsers by being both a content language and XML-compliant. XHTML allows effective separation of content and presentation, enhancing flexibility across different platforms through technologies like CSS.

What is a markup language?

A markup language, is a way to encode a document that, along with the text, includes tags or marks containing additional information about the structure of the text or its presentation.

POST SCRIPT AND JSON

PostScript, initiated by John Warnock in 1976 and further developed at Xerox and Adobe Systems, is a page description language widely used for instructing how to display information on the final device. Functioning as a programming language, it specifies the presentation of information, encompassing text, font types, individual pixels, and vector shapes (lines, curves). Its capabilities are extensive. JSON, short for JavaScript Object Notation, originated from the JavaScript standard and gained browser support from 2002. Evolving into an independent notation, JSON competes with XML. Unlike markup languages, JSON separates data and metadata using colons, while braces and brackets facilitate proper data grouping.

KINDS OF MARKUPS

Procedural Markup Language: The annotations or marks in procedural markup languages describe the form and meaning of typographic operations to be applied to each element of the document. Structural or Descriptive Markup Language: The structural description is based on specifying the logical structure of content (headings, normal text paragraphs, enumerations, definitions, quotes, etc.).

TeX y LaTeX and RTF

In the 1970s, Donald Knuth developed TeX to generate scientific documents with consistent typography across different computers, emphasizing high-quality results. TeX employs commands, necessitating a program to convert TeX files into a printable format. The success of TeX gave rise to various derivatives, with LaTeX being the most popular. LaTeX simplifies TeX by using fewer commands while preserving its typographic structure, widely adopted for producing high-quality scientific documents. The concept is to enable scientists to focus on content rather than presentation. An example of LaTeX code demonstrates its ability to express scientific formulas and text formatting. Additionally, RTF (Rich Text Format), conceived by Microsoft in 1987, is utilised in Windows environments to create text documents with formatting annotations, being the native format for Word Pad.

HTML

HTML, a language derived from SGML, is pivotal for crafting hypertext on the World Wide Web (WWW), facilitating non-sequential information organization. Developed by Tim Berners-Lee in the 90s, the WWW employs hypertext as its foundational structure, enabling the linkage of digital documents through hyperlinks. HTML, a straightforward language, delineates the logical structure of content and desired effects, with the final presentation handled by browsers. It has undergone continual changes, overseen by the World Wide Web Consortium (W3C) to ensure consistent interpretation across various browsers. The HTML5 version, established in 2017, stands as the current recommendation, defining the structure and presentation of content on the web.

ASCII

The ASII code (American Standard Code for Information Interchange).A character encoding is a character set based on the Latin alphabet to represent characters of the English language. Created in 1963 by the American National Standards Institute (ANSI) as a revision or evolution of the codes used at that time in telegraphy, aiming to bring order to the various encoding systems of the era.

EXTENDED ASCII

The term "Extended ASCII" refers to an 8-bit character set (as opposed to the standard 7-bit ASCII). In this set, codes 32 to 126 align with the printable characters in standard ASCII. With 256 available combinations (128 more than standard ASCII), it becomes possible to encode characters from the majority of Western languages.