Unicode character set which is also called UTF-8 is developed by the Unicode Consortium. The usual for the Unicode charset and its subsets is called Unicode customary. Over time quite a lot of completely different character units are developed by completely different entities. This has created a large number the place compatibility and portability are very low. So the primary purpose of the Unicode Consortium is to unify the character units in a single umbrella. The umbrella is called “Unicode Transformation Format” or just UTF. The UTF-8 is a subset of the UTF. The UTF-8 is the preferred subset of the UTF character set household. When the phrases “Unicode”, “UTF” are expressed they merely used to seek advice from the UTF-8.
Unicode (UTF-8) Charset Use Instances
As a preferred character set Unicode Charset is utilized in completely different conditions for various circumstances.
- HTML Internet functions Growth
- XML Internet Companies and Purposes Growth
- Java Desktop and Internet Utility Growth
- JavaScript Internet Utility Growth
- E-Mail Content material
- ASP.NET Internet Utility Growth
- PHP Internet Utility Growth
UTF-8 vs UTF-16 Character Units
As acknowledged beforehand there are completely different character units below the UTF. Truly, they’re the identical the place solely the encoding measurement is completely different. UTF-8 shops a single character between 1 and 4 bytes. UTF-8 can be appropriate with the previous ASCII character set. UTF-8 is the popular encoding for the UTF household due to its efficiency and backward compatibility with the ASCII. UTF-16 is a variable-length character set the place a single character will be saved in variable sizes. The character storage measurement is said to the character. UTF-16 is utilized by main working techniques like Microsoft Home windows, Linux, Java programming language, .NET Framework.
Quantity | ASCII | ANSI | 8859 | UTF-8 | Description | ||||
---|---|---|---|---|---|---|---|---|---|
32 | area | ||||||||
33 | ! | ! | ! | ! | exclamation mark | ||||
34 | “ | “ | “ | “ | citation mark | ||||
35 | # | # | # | # | quantity signal | ||||
36 | $ | $ | $ | $ | greenback signal | ||||
37 | % | % | % | % | p.c signal | ||||
38 | & | & | & | & | ampersand | ||||
39 | ‘ | ‘ | ‘ | ‘ | apostrophe | ||||
40 | ( | ( | ( | ( | left parenthesis | ||||
41 | ) | ) | ) | ) | proper parenthesis | ||||
42 | * | * | * | * | asterisk | ||||
43 | + | + | + | + | plus signal | ||||
44 | , | , | , | , | comma | ||||
45 | – | – | – | – | hyphen-minus | ||||
46 | . | . | . | . | full cease | ||||
47 | / | / | / | / | solidus | ||||
48 | 0 | 0 | 0 | 0 | digit zero | ||||
49 | 1 | 1 | 1 | 1 | digit one | ||||
50 | 2 | 2 | 2 | 2 | digit two | ||||
51 | 3 | 3 | 3 | 3 | digit three | ||||
52 | 4 | 4 | 4 | 4 | digit 4 | ||||
53 | 5 | 5 | 5 | 5 | digit 5 | ||||
54 | 6 | 6 | 6 | 6 | digit six | ||||
55 | 7 | 7 | 7 | 7 | digit seven | ||||
56 | 8 | 8 | 8 | 8 | digit eight | ||||
57 | 9 | 9 | 9 | 9 | digit 9 | ||||
58 | : | : | : | : | colon | ||||
59 | ; | ; | ; | ; | semicolon | ||||
60 | < | < | < | < | less-than signal | ||||
61 | = | = | = | = | equals signal | ||||
62 | > | > | > | > | greater-than signal | ||||
63 | ? | ? | ? | ? | query mark | ||||
64 | @ | @ | @ | @ | business at | ||||
65 | A | A | A | A | Latin capital letter A | ||||
66 | B | B | B | B | Latin capital letter B | ||||
67 | C | C | C | C | Latin capital letter C | ||||
68 | D | D | D | D | Latin capital letter D | ||||
69 | E | E | E | E | Latin capital letter E | ||||
70 | F | F | F | F | Latin capital letter F | ||||
71 | G | G | G | G | Latin capital letter G | ||||
72 | H | H | H | H | Latin capital letter H | ||||
73 | I | I | I | I | Latin capital letter I | ||||
74 | J | J | J | J | Latin capital letter J | ||||
75 | Ok | Ok | Ok | Ok | Latin capital letter Ok | ||||
76 | L | L | L | L | Latin capital letter L | ||||
77 | M | M | M | M | Latin capital letter M | ||||
78 | N | N | N | N | Latin capital letter N | ||||
79 | O | O | O | O | Latin capital letter O | ||||
80 | P | P | P | P | Latin capital letter P | ||||
81 | Q | Q | Q | Q | Latin capital letter Q | ||||
82 | R | R | R | R | Latin capital letter R | ||||
83 | S | S | S | S | Latin capital letter S | ||||
84 | T | T | T | T | Latin capital letter T | ||||
85 | U | U | U | U | Latin capital letter U | ||||
86 | V | V | V | V | Latin capital letter V | ||||
87 | W | W | W | W | Latin capital letter W | ||||
88 | X | X | X | X | Latin capital letter X | ||||
89 | Y | Y | Y | Y | Latin capital letter Y | ||||
90 | Z | Z | Z | Z | Latin capital letter Z | ||||
91 | [ | [ | [ | [ | left square bracket | ||||
92 | reverse solidus | ||||||||
93 | ] | ] | ] | ] | proper sq. bracket | ||||
94 | ^ | ^ | ^ | ^ | circumflex accent | ||||
95 | _ | _ | _ | _ | low line | ||||
96 | ` | ` | ` | ` | grave accent | ||||
97 | a | a | a | a | Latin small letter a | ||||
98 | b | b | b | b | Latin small letter b | ||||
99 | c | c | c | c | Latin small letter c | ||||
100 | d | d | d | d | Latin small letter d | ||||
101 | e | e | e | e | Latin small letter e | ||||
102 | f | f | f | f | Latin small letter f | ||||
103 | g | g | g | g | Latin small letter g | ||||
104 | h | h | h | h | Latin small letter h | ||||
105 | i | i | i | i | Latin small letter i | ||||
106 | j | j | j | j | Latin small letter j | ||||
107 | ok | ok | ok | ok | Latin small letter ok | ||||
108 | l | l | l | l | Latin small letter l | ||||
109 | m | m | m | m | Latin small letter m | ||||
110 | n | n | n | n | Latin small letter n | ||||
111 | o | o | o | o | Latin small letter o | ||||
112 | p | p | p | p | Latin small letter p | ||||
113 | q | q | q | q | Latin small letter q | ||||
114 | r | r | r | r | Latin small letter r | ||||
115 | s | s | s | s | Latin small letter s | ||||
116 | t | t | t | t | Latin small letter t | ||||
117 | u | u | u | u | Latin small letter u | ||||
118 | v | v | v | v | Latin small letter v | ||||
119 | w | w | w | w | Latin small letter w | ||||
120 | x | x | x | x | Latin small letter x | ||||
121 | y | y | y | y | Latin small letter y | ||||
122 | z | z | z | z | Latin small letter z | ||||
123 | { | { | { | } | } | } | proper curly bracket | ||
126 | ~ | ~ | ~ | ~ | tilde | ||||
127 | DEL | ||||||||
128 | € | euro signal | |||||||
129 | | | | NOT USED | |||||
130 | ‚ | single low-9 citation mark | |||||||
131 | ƒ | Latin small letter f with hook | |||||||
132 | „ | double low-9 citation mark | |||||||
133 | … | horizontal ellipsis | |||||||
134 | † | dagger | |||||||
135 | ‡ | double dagger | |||||||
136 | ˆ | modifier letter circumflex accent | |||||||
137 | ‰ | per mille signal | |||||||
138 | Š | Latin capital letter S with caron | |||||||
139 | ‹ | single left-pointing angle citation mark | |||||||
140 | Œ | Latin capital ligature OE | |||||||
141 | | | | NOT USED | |||||
142 | Ž | Latin capital letter Z with caron | |||||||
143 | | | | NOT USED | |||||
144 | | | | NOT USED | |||||
145 | ‘ | left single citation mark | |||||||
146 | ’ | proper single citation mark | |||||||
147 | “ | left double citation mark | |||||||
148 | ” | proper double citation mark | |||||||
149 | • | bullet | |||||||
150 | – | en sprint | |||||||
151 | — | em sprint | |||||||
152 | ˜ | small tilde | |||||||
153 | ![]() |
commerce mark signal | |||||||
154 | š | Latin small letter s with caron | |||||||
155 | › | single right-pointing angle citation mark | |||||||
156 | œ | Latin small ligature oe | |||||||
157 | | | | NOT USED | |||||
158 | ž | Latin small letter z with caron | |||||||
159 | Ÿ | Latin capital letter Y with diaeresis | |||||||
160 | no-break area | ||||||||
161 | ¡ | ¡ | ¡ | inverted exclamation mark | |||||
162 | ¢ | ¢ | ¢ | cent signal | |||||
163 | £ | £ | £ | pound signal | |||||
164 | ¤ | ¤ | ¤ | forex signal | |||||
165 | ¥ | ¥ | ¥ | yen signal | |||||
166 | ¦ | ¦ | ¦ | damaged bar | |||||
167 | § | § | § | part signal | |||||
168 | ¨ | ¨ | ¨ | diaeresis | |||||
169 | © | © | © | copyright signal | |||||
170 | ª | ª | ª | female ordinal indicator | |||||
171 | « | « | « | left-pointing double angle citation mark | |||||
172 | ¬ | ¬ | ¬ | not signal | |||||
173 | | | | gentle hyphen | |||||
174 | ® | ® | ® | registered signal | |||||
175 | ¯ | ¯ | ¯ | macron | |||||
176 | ° | ° | ° | diploma signal | |||||
177 | ± | ± | ± | plus-minus signal | |||||
178 | ² | ² | ² | superscript two | |||||
179 | ³ | ³ | ³ | superscript three | |||||
180 | ´ | ´ | ´ | acute accent | |||||
181 | µ | µ | µ | micro signal | |||||
182 | ¶ | ¶ | ¶ | pilcrow signal | |||||
183 | · | · | · | center dot | |||||
184 | ¸ | ¸ | ¸ | cedilla | |||||
185 | ¹ | ¹ | ¹ | superscript one | |||||
186 | º | º | º | masculine ordinal indicator | |||||
187 | » | » | » | right-pointing double angle citation mark | |||||
188 | ¼ | ¼ | ¼ | vulgar fraction one quarter | |||||
189 | ½ | ½ | ½ | vulgar fraction one half | |||||
190 | ¾ | ¾ | ¾ | vulgar fraction three quarters | |||||
191 | ¿ | ¿ | ¿ | inverted query mark | |||||
192 | À | À | À | Latin capital letter A with grave | |||||
193 | Á | Á | Á | Latin capital letter A with acute | |||||
194 | Â | Â | Â | Latin capital letter A with circumflex | |||||
195 | Ã | Ã | Ã | Latin capital letter A with tilde | |||||
196 | Ä | Ä | Ä | Latin capital letter A with diaeresis | |||||
197 | Å | Å | Å | Latin capital letter A with ring above | |||||
198 | Æ | Æ | Æ | Latin capital letter AE | |||||
199 | Ç | Ç | Ç | Latin capital letter C with cedilla | |||||
200 | È | È | È | Latin capital letter E with grave | |||||
201 | É | É | É | Latin capital letter E with acute | |||||
202 | Ê | Ê | Ê | Latin capital letter E with circumflex | |||||
203 | Ë | Ë | Ë | Latin capital letter E with diaeresis | |||||
204 | Ì | Ì | Ì | Latin capital letter I with grave | |||||
205 | Í | Í | Í | Latin capital letter I with acute | |||||
206 | Î | Î | Î | Latin capital letter I with circumflex | |||||
207 | Ï | Ï | Ï | Latin capital letter I with diaeresis | |||||
208 | Ð | Ð | Ð | Latin capital letter Eth | |||||
209 | Ñ | Ñ | Ñ | Latin capital letter N with tilde | |||||
210 | Ò | Ò | Ò | Latin capital letter O with grave | |||||
211 | Ó | Ó | Ó | Latin capital letter O with acute | |||||
212 | Ô | Ô | Ô | Latin capital letter O with circumflex | |||||
213 | Õ | Õ | Õ | Latin capital letter O with tilde | |||||
214 | Ö | Ö | Ö | Latin capital letter O with diaeresis | |||||
215 | × | × | × | multiplication signal | |||||
216 | Ø | Ø | Ø | Latin capital letter O with stroke | |||||
217 | Ù | Ù | Ù | Latin capital letter U with grave | |||||
218 | Ú | Ú | Ú | Latin capital letter U with acute | |||||
219 | Û | Û | Û | Latin capital letter U with circumflex | |||||
220 | Ü | Ü | Ü | Latin capital letter U with diaeresis | |||||
221 | Ý | Ý | Ý | Latin capital letter Y with acute | |||||
222 | Þ | Þ | Þ | Latin capital letter Thorn | |||||
223 | ß | ß | ß | Latin small letter sharp s | |||||
224 | à | à | à | Latin small letter a with grave | |||||
225 | á | á | á | Latin small letter a with acute | |||||
226 | â | â | â | Latin small letter a with circumflex | |||||
227 | ã | ã | ã | Latin small letter a with tilde | |||||
228 | ä | ä | ä | Latin small letter a with diaeresis | |||||
229 | å | å | å | Latin small letter a with ring above | |||||
230 | æ | æ | æ | Latin small letter ae | |||||
231 | ç | ç | ç | Latin small letter c with cedilla | |||||
232 | è | è | è | Latin small letter e with grave | |||||
233 | é | é | é | Latin small letter e with acute | |||||
234 | ê | ê | ê | Latin small letter e with circumflex | |||||
235 | ë | ë | ë | Latin small letter e with diaeresis | |||||
236 | ì | ì | ì | Latin small letter i with grave | |||||
237 | í | í | í | Latin small letter i with acute | |||||
238 | î | î | î | Latin small letter i with circumflex | |||||
239 | ï | ï | ï | Latin small letter i with diaeresis | |||||
240 | ð | ð | ð | Latin small letter eth | |||||
241 | ñ | ñ | ñ | Latin small letter n with tilde | |||||
242 | ò | ò | ò | Latin small letter o with grave | |||||
243 | ó | ó | ó | Latin small letter o with acute | |||||
244 | ô | ô | ô | Latin small letter o with circumflex | |||||
245 | õ | õ | õ | Latin small letter o with tilde | |||||
246 | ö | ö | ö | Latin small letter o with diaeresis | |||||
247 | ÷ | ÷ | ÷ | division signal | |||||
248 | ø | ø | ø | Latin small letter o with stroke | |||||
249 | ù | ù | ù | Latin small letter u with grave | |||||
250 | ú | ú | ú | Latin small letter u with acute | |||||
251 | û | û | û | Latin small letter with circumflex | |||||
252 | ü | ü | ü | Latin small letter u with diaeresis | |||||
253 | ý | ý | ý | Latin small letter y with acute | |||||
254 | þ | þ | þ | Latin small letter thorn | |||||
255 | ÿ | ÿ | ÿ | Latin small letter y with diaeresis |