其他分享
首页 > 其他分享> > ASCII Unicode UTF-8 编码

ASCII Unicode UTF-8 编码

作者:互联网

编码

ASCII

一个字节可表示256个字符,ASCII只用了128个字符

ASCII打印字符对照表

DECOCTHEXBIN缩写/符号HTML实体描述
00000000000000NUL�Null char (空字符)
10010100000001SOHStart of Heading (标题开始)
20020200000010STXStart of Text (正文开始)
30030300000011ETXEnd of Text (正文结束)
40040400000100EOTEnd of Transmission (传输结束)
50050500000101ENQEnquiry (请求)
60060600000110ACKAcknowledgment (收到通知)
70070700000111BELBell (响铃)
80100800001000 BSBack Space (退格)
90110900001001 HT	Horizontal Tab (水平制表符)
100120A00001010 LF
Line Feed (换行键)
110130B00001011 VTVertical Tab (垂直制表符)
120140C00001100 FFForm Feed (换页键)
130150D00001101 CR
Carriage Return (回车键)
140160E00001110 SOShift Out / X-On (不用切换)
150170F00001111 SIShift In / X-Off (启用切换)
160201000010000DLEData Line Escape (数据链路转义)
170211100010001DC1Device Control 1 (设备控制1)
180221200010010DC2Device Control 2 (设备控制2)
190231300010011DC3Device Control 3 (设备控制3)
200241400010100DC4Device Control 4 (设备控制4)
210251500010101NAKNegative Acknowledgement (拒绝接收)
220261600010110SYNSynchronous Idle (同步空闲)
230271700010111ETBEnd of Transmit Block (传输块结束)
240301800011000CANCancel (取消)
250311900011001 EMEnd of Medium (介质中断)
260321A00011010SUBSubstitute (替补)
270331B00011011ESCEscape (溢出)
280341C00011100 FSFile Separator (文件分割符)
290351D00011101 GSGroup Separator (分组符)
300361E00011110 RSRecord Separator (记录分离符)
310371F00011111 USUnit Separator (单元分隔符)
320402000100000  Space (空格)
330412100100001!!Exclamation mark
340422200100010""Double quotes
350432300100011##Number
360442400100100$$Dollar
370452500100101%%Procenttecken
380462600100110&&Ampersand
390472700100111''Single quote
400502800101000((Open parenthesis
410512900101001))Close parenthesis
420522A00101010**Asterisk
430532B00101011++Plus
440542C00101100,,Comma
450552D00101101--Hyphen
460562E00101110..Period, dot or full stop
470572F00101111//Slash or divide
48060300011000000Zero
49061310011000111One
50062320011001022Two
51063330011001133Three
52064340011010044Four
53065350011010155Five
54066360011011066Six
55067370011011177Seven
56070380011100088Eight
57071390011100199Nine
580723A00111010::Colon
590733B00111011;&#59;Semicolon
600743C00111100<&#60;Less than
610753D00111101=&#61;Equals
620763E00111110>&#62;Greater than
630773F00111111?&#63;Question mark
641004001000000@&#64;At symbol
651014101000001A&#65;Uppercase A
661024201000010B&#66;Uppercase B
671034301000011C&#67;Uppercase C
681044401000100D&#68;Uppercase D
691054501000101E&#69;Uppercase E
701064601000110F&#70;Uppercase F
711074701000111G&#71;Uppercase G
721104801001000H&#72;Uppercase H
731114901001001I&#73;Uppercase I
741124A01001010J&#74;Uppercase J
751134B01001011K&#75;Uppercase K
761144C01001100L&#76;Uppercase L
771154D01001101M&#77;Uppercase M
781164E01001110N&#78;Uppercase N
791174F01001111O&#79;Uppercase O
801205001010000P&#80;Uppercase P
811215101010001Q&#81;Uppercase Q
821225201010010R&#82;Uppercase R
831235301010011S&#83;Uppercase S
841245401010100T&#84;Uppercase T
851255501010101U&#85;Uppercase U
861265601010110V&#86;Uppercase V
871275701010111W&#87;Uppercase W
881305801011000X&#88;Uppercase X
891315901011001Y&#89;Uppercase Y
901325A01011010Z&#90;Uppercase Z
911335B01011011[&#91;Opening bracket
921345C01011100\&#92;Backslash
931355D01011101]&#93;Closing bracket
941365E01011110^&#94;Caret - circumflex
951375F01011111_&#95;Underscore
961406001100000`&#96;Grave accent
971416101100001a&#97;Lowercase a
981426201100010b&#98;Lowercase b
991436301100011c&#99;Lowercase c
1001446401100100d&#100;Lowercase d
1011456501100101e&#101;Lowercase e
1021466601100110f&#102;Lowercase f
1031476701100111g&#103;Lowercase g
1041506801101000h&#104;Lowercase h
1051516901101001i&#105;Lowercase i
1061526A01101010j&#106;Lowercase j
1071536B01101011k&#107;Lowercase k
1081546C01101100l&#108;Lowercase l
1091556D01101101m&#109;Lowercase m
1101566E01101110n&#110;Lowercase n
1111576F01101111o&#111;Lowercase o
1121607001110000p&#112;Lowercase p
1131617101110001q&#113;Lowercase q
1141627201110010r&#114;Lowercase r
1151637301110011s&#115;Lowercase s
1161647401110100t&#116;Lowercase t
1171657501110101u&#117;Lowercase u
1181667601110110v&#118;Lowercase v
1191677701110111w&#119;Lowercase w
1201707801111000x&#120;Lowercase x
1211717901111001y&#121;Lowercase y
1221727A01111010z&#122;Lowercase z
1231737B01111011{&#123;Opening brace
1241747C01111100|&#124;Vertical bar
1251757D01111101}&#125;Closing brace
1261767E01111110~&#126;Equivalency sign (tilde)
1271777F01111111&#127;Delete

Unicode

将世界所有的符号都纳入其中,每一个符号都有其自己的编码,因此使用Unicode编码没有乱码问题

但是使用Unicode编码,每一个英文字母或汉字都占有2个字节,2个字节最多可编码 2 ^ 16 = 65536 个字符

Unicode兼容ASCII,'a’在ASCII码表是0x61,在Unicode码是ox0061,都对应着97

UTF-8

UTF-8是在互联网使用最广的一种Unicode码的实现

UTF-8是一种变长的编码方式,它可以使用1-6个字节来表示一个符号,根据不同的符号而变化字节的长度

UTF-8字母占一个字节,汉字占3个字节

标签:Lowercase,UTF,字节,Uppercase,&#,Unicode,ASCII
来源: https://blog.csdn.net/qq_44737446/article/details/121078518