ASCII
The American Standard Code for Information Interchange (ASCII) is a 7-bit character-encoding scheme for the Western lower- and uppercase letters (letters without accents only), digits, and punctuation marks. In addition it defines 33 control characters (see Table 1), many of which are obsolete. In total ASCII has space for 2^7 = 128 codes. The ASCII codes from 32 to 126 represent printable characters in binary form (see Table 2), such as are used in computers, telecommunication equipment, and other electronic and mechanical devices. The last ASCII code (127) is DEL(ete) (111 1111) that, being in the last position, has all bits up. Although a control character, it is last in the ASCII table for the historic reason that it enabled the rubout of an incorrect 7-bit paper-tape character (7 holes in the tape) without requiring a complete copy of the tape.
Historically, ASCII developed from telegraph codes. Work on the ASCII standard formally began on October 6, 1960, with the first meeting of the American Standards Association's (ASA) X3.2 subcommittee. The first edition of the standard was published in 1963, [1] a major revision (1967) included the the lowercase letters,[2] and the most recent update was in 1986.[3]
The order of the printable characters in ASCII is such that alphabetic sorting coincides to a large extent with numeric sorting.
As stated, many of the control characters are obsolete. However, ASCII 9 (tab stop), ASCII 10 (line feed), and ASCII 13 (carriage return) are still in common use. The tab character has the same function as on typewriters and lets the cursor jump from tab stop to tab stop. The pair ASCII 13 and ASCII 10 is used under MS-Windows to indicate end of line ("hard return"), whereas under Unix type operating systems only linefeed (ASCII 10) is used to give a hard return.
Later encoding standards, such as ISO 8859, ANSI 1252 and Unicode UTF-8, are compatible with ASCII, that is, their codes for the first 128 characters are identical to the ASCII codes.
Dec | Hex | Binary | Abbr | Description |
---|---|---|---|---|
0 | 00 | 000 0000 | NUL | Null character |
1 | 01 | 000 0001 | SOH | Start of Header |
2 | 02 | 000 0010 | STX | Start of Text |
3 | 03 | 000 0011 | ETX | End of Text |
4 | 04 | 000 0100 | EOT | End of Transmission |
5 | 05 | 000 0101 | ENQ | Enquiry |
6 | 06 | 000 0110 | ACK | Acknowledgment |
7 | 07 | 000 0111 | BEL | Bell |
8 | 08 | 000 1000 | BS | Backspace |
9 | 09 | 000 1001 | HT | Horizontal Tab |
10 | 0A | 000 1010 | LF | Line feed |
11 | 0B | 000 1011 | VT | Vertical Tab |
12 | 0C | 000 1100 | FF | Form feed |
13 | 0D | 000 1101 | CR | Carriage return |
14 | 0E | 000 1110 | SO | Shift Out |
15 | 0F | 000 1111 | SI | Shift In |
16 | 10 | 001 0000 | DLE | Data Link Escape |
17 | 11 | 001 0001 | DC1 | Device Control 1 |
18 | 12 | 001 0010 | DC2 | Device Control 2 |
19 | 13 | 001 0011 | DC3 | Device Control 3 |
20 | 14 | 001 0100 | DC4 | Device Control 4 |
21 | 15 | 001 0101 | NAK | Negative Acknowledgement |
22 | 16 | 001 0110 | SYN | Synchronous idle |
23 | 17 | 001 0111 | ETB | End of Transmission Block |
24 | 18 | 001 1000 | CAN | Cancel |
25 | 19 | 001 1001 | EM | End of Medium |
26 | 1A | 001 1010 | SUB | Substitute |
27 | 1B | 001 1011 | ESC | Escape |
28 | 1C | 001 1100 | FS | File Separator |
29 | 1D | 001 1101 | GS | Group Separator |
30 | 1E | 001 1110 | RS | Record Separator |
31 | 1F | 001 1111 | US | Unit Separator |
127 | 7F | 111 1111 | DEL | Delete |
|
|
|
[edit] References
- ↑ American Standard Code for Information Interchange, ASA X3.4-1963, American Standards Association, June 17, 1963. Copy of text
- ↑ USA Standard Code for Information Interchange, ANSI X3.4-1967, United States of America Standards Institute, July 7, 1967 Review of 1967 revision
- ↑ American National Standard for Information Systems — Coded Character Sets — 7-Bit American National Standard Code for Information Interchange (7-Bit ASCII), ANSI X3.4-1986, American National Standards Institute, Inc., March 26, 1986