Recommended: UTF-8
UTF-8 is a variable-length character encoding for Unicode. It is able to represent any
universal character in the Unicode standard, yet still consistent with ASCII. For these
reasons, it is steadily becoming the preferred encoding for e-mail, web pages, etc. The
Internet Engineering Task Force (IETF) requires all Internet protocols to identify the
encoding used for character data with UTF-8 as at least one supported encoding. The
Internet Mail Consortium (IMC) recommends that all email programs be able to display and
create mail using UTF-8. UTF-8 is the only Unicode encoding fully supported by all
browsers.
KSW is a
native UTF-8 application
KSW was built from scratch for UTF-8. It was also crafted to support all other page
encodings in general use today.
When you start from scratch,
use UTF-8 only
When you start a new KB web from scratch, always set the default encoding in the page
editor to UTF-8.
Using other encodings
When you bring pages from other sources into your web, or when you start from an existing,
older web, it is likely page encoding is based on a Windows code page (ANSI). KSW does
support such pages as they are, without conversion. But you need to make sure it's the
default code page of your machine, not another (files encoded by a Windows code page are
not signed; characters are assigned to the same code range in all such encodings; thus,
technically, it is required that when using a Windows cod page encoding, it is your
machine's default, not another).
ASCII and ANSI
UTF-8 encoding is identical to ASCII for characters in the ASCII range. Thus, ASCII pages
(codes 0-127) are, implicitly, valid UTF-8 pages. If such a page is added extended ASCII
(codes 128-255), it becomes ANSI, and is interpreted by a Windows code page.
The 'List ANSI Topics' button
After a project is loaded, and before it is opened, under the right corner of the report window, a button 'List
ANSI Topics' is located. Press it to see a list of topics that are not UTF-8, but ANSI.
See also:
UTF-8 - With Signature or Without?
Converting Legacy Pages to UTF-8