In
computer science, Unicode is an
industry standard allowing
computers to consistently represent and manipulate
text expressed in any of the world's
writing systems. Developed in tandem with the
Universal Character Set standard and published in book form as The Unicode Standard, Unicode consists of a repertoire of about 100,000
characters, a set of code charts for visual reference, an encoding methodology and set of standard
character encodings, an enumeration of character properties such as upper and lower
case, a set of reference data
computer files, and a number of related items, such as character properties, rules for
text normalization, decomposition,
collation, rendering and bidirectional display order (for the correct display of text containing both right-to-left scripts, such as
Arabic or
Hebrew, and left-to-right scripts).
See more at Wikipedia.org...