Multilingual SEO Forums by WebCertain

Go Back   Multilingual SEO Forums by WebCertain > Multilingual SEO Issues > Languages and Character Sets
Register FAQ Members List Calendar Search Today's Posts Mark Forums Read

Languages and Character Sets Challenges faced when handling different characters sets

Reply

 

LinkBack Thread Tools Display Modes
  #1 (permalink)  
Old 01-10-07, 09:07 AM
shasha95 shasha95 is offline
New Member
 
Join Date: Sep 2007
Posts: 15
shasha95 is on a distinguished road
Default What is the different between UTF8 and UTF16 (UTF32)?

What is the different between UTF8 and UTF16 (UTF32)?
Reply With Quote
  #2 (permalink)  
Old 01-10-07, 09:41 AM
Peter Kersbergen's Avatar
Peter Kersbergen Peter Kersbergen is offline
 
Join Date: May 2007
Native Country: The Netherlands
Posts: 40
Peter Kersbergen is on a distinguished road
Default

Code unit size of 8-bit, 16-bit and 32-bit.

The Maximal bytes/character is 4 for all of the above but the Minimal bytes/character are respectively 1, 2 and 4.

UTF-8 is most common on the web. UTF-16 is used by Java and Windows. UTF-32 is used by various Unix systems. The conversions between all of them are algorithmically based, fast and lossless. This makes it easy to support data input or output in multiple formats, while using a particular UTF for internal storage or processing.

For further details and information:
http://en.wikipedia.org/wiki/UTF-8
http://en.wikipedia.org/wiki/UTF-16
http://unicode.org/
http://www.utf-8.com/
Reply With Quote
  #3 (permalink)  
Old 27-06-08, 10:46 AM
priji priji is offline
New Member
 
Join Date: Jun 2008
Posts: 3
priji is on a distinguished road
Default

UTF-8 uses a byte as its atomic unit while UTF-16 uses a 16-bit word which is generally represented by a pair of bytes.
Reply With Quote
Reply



Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On


All times are GMT +1. The time now is 12:32 AM.


Copyright © 2008 Web Certain Europe Ltd

Search Engine Friendly URLs by vBSEO 3.0.0