ASCII and Unicode
ASCII and Unicode
What are character sets: A character set is a collection of characters
that a computer system can recognize and use. Example includes
ASCII
Unicode
Character set were created as shortcuts to commonly used characters
to save programming time.
In a character set, each letter, number or symbol is represented by a
short binary sequence.
What is ASCII?
ASCII stands for American Standard Code for Information
Interchange.
What is Unicode?
Is a character set designed to cover virtually a characters used in
writing systems around the world supporting a wide varieties of
languages and symbols.
The ASCII table
Below is a short extract of original 7-bit ASCII table. It follows a logical
sequence
ASCII Binary Character
number(Denary)
64 1000000 @
65 1000001 A
66 1000010 B
67 1000011 C
The table is broken down into the following sections:
Non-printed control characters such as return key and enter key
Symbols such as # and @
The numbers 0 -9
Capital letters
Lower-case letters.
Encoding messages
This is the conversion of data from one form to another for correct
processing using ASCII. Example
Text: Hello world!
ASCII:72 101 108 108 111 32 119 111 114 108 100 33
Binary:1001000 11001011 1101100 1101100 1101111 0100000
1110111 1101111 1110010 1101100 1100100 0100001