HTML has been in use since
1991, but HTML 4.0 (December 1997) was the first standardized version where international
characters were given reasonably complete treatment. When an HTML document includes special characters outside the range of seven-bit
ASCII two goals are worth considering: the information's
integrity, and universal
browser display.
See more at Wikipedia.org...
In
SGML and its derived
markup languages HTML and
XML, an entity is a named body of data associated with a document, or the unnamed document entity itself. Once defined, a named entity can be referenced any number of times within the document, via entity references.An entity typically consists of a sequence of one or more
characters, either encapsulated within a declaration of the entity's existence, or
encoded and
stored externally in a
text file. Such entities and their references are comparable to
macros and
include files in
programming languages. However, entities don't necessarily have to contain
parseable text; they may also be
binary files that are included in a document only by reference, with the entity name as the value of an ENTITY-type attribute.
See more at Wikipedia.org...