WikklyText A wikitext server and rendering library


MultiLingual
frank, 18 June 2008 (created 20 October 2007)
Tags: -rss
WikklyText fully supports multilingual content in both traditional multibyte character sets and Unicode. The encoding of a source file can be specified in two ways:
  1. Files written in UTF-8, UTF-8-SIG, UTF-16LE, and UTF-16BE formats are automatically detected from their byte order mark (BOM).
  2. Non-UTF multibyte formats are declared by placing the following comment somewhere in the wikitext:
    /% encoding: ENCODING %/

    Where ENCODING is any Python-supported encoding.
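
The two detection rules above can be sketched in Python. This is only an illustration and not WikklyText's actual implementation: the names guess_encoding and decode_wikitext, the UTF-8 fallback, and the exact tag pattern are all assumptions. It also assumes the declared encoding is ASCII-compatible, so the tag can be found in the raw bytes.

```python
import codecs
import re

# BOM prefixes to check. The codec named alongside each one consumes the
# BOM itself, so the decoded text does not start with U+FEFF.
_BOMS = [
    (codecs.BOM_UTF8, "utf-8-sig"),
    (codecs.BOM_UTF16_LE, "utf-16"),
    (codecs.BOM_UTF16_BE, "utf-16"),
]

# Matches a comment such as: /% encoding: koi8-r %/
# (assumed pattern; searched in the raw bytes before decoding)
_ENCODING_TAG = re.compile(rb"/%\s*encoding\s*:\s*([A-Za-z0-9_.\-]+)\s*%/")


def guess_encoding(raw, default="utf-8"):
    """Return a codec name for raw bytes: BOM first, then the encoding tag."""
    for bom, name in _BOMS:
        if raw.startswith(bom):
            return name
    match = _ENCODING_TAG.search(raw)
    if match:
        name = match.group(1).decode("ascii")
        codecs.lookup(name)  # raises LookupError if Python lacks this codec
        return name
    return default  # hypothetical fallback, not taken from WikklyText


def decode_wikitext(raw):
    """Decode raw wikitext bytes to str using the guessed encoding."""
    return raw.decode(guess_encoding(raw))
```

For example, bytes produced with "/% encoding: koi8-r %/ ...".encode("koi8-r") would be decoded as KOI8-R, while a file starting with a UTF-16 BOM would be decoded as UTF-16 regardless of any tag.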

First, a sample showing multilingual content taken directly from this source file (this source file is encoded in UTF-8, as are all the samples in this first table):

Language | Sample
Bulgarian | Здравей, свят!
Chinese, simplified | 世界你好!
Chinese, traditional | 哈囉,大家好!
English | Hello world!
Estonian | Tere kõik!
Georgian | სალამი მსოფლიოს!
Greek | Γεια σου, κόσμε!
Hebrew | םלועה לכל םולש
Japanese | 世界よ、こんにちは!
Korean | 안녕하세요, 여러분!
Persian | سلام بر هم
Polish | Witaj świecie!
Russian | Здравствуй, мир!
Serbian | Поздрав свима!
Turkish | Merhaba, dünya!
Ukrainian | Привіт світ!
Vietnamese | Chào thế giới !
If the characters are not displayed correctly in your browser, you need to install the appropriate font.

It is possible to mix content encodings within a single document by using the <<include ...>> macro to import other files. The table below combines files in a variety of traditional multibyte encodings. WikklyText handles them using a combination of BOM detection and /% encoding ... %/ tags.

Filename | Language (Encoding): Sample
bulgarian.txt | Bulgarian (UTF-8): Здравей, свят!
chinese-simp.txt | Chinese, simplified (GB2312): 世界你好!
chinese-trad-big5.txt | Chinese, traditional (Big5): 哈囉,大家好!
chinese-trad-utf8.txt | Chinese, traditional (UTF-8): 哈囉,大家好!
english.txt | English (ASCII): Hello world!
estonian.txt | Estonian (ISO-8859-15): Tere kõik!
georgian.txt | Georgian (UTF-8): სალამი მსოფლიოს!
greek.txt | Greek (ISO-8859-7): Γεια σου, κόσμε!
hebrew.txt | Hebrew (ISO-8859-8): !םלועה לכל םולש
japanese-eucjp.txt | Japanese (EUC-JP): 世界よ、こんにちは!
japanese-shiftjis.txt | Japanese (Shift-JIS): 世界よ、こんにちは!
korean.txt | Korean (EUC-KR): 안녕하세요, 여러분!
persian.txt | Persian (UTF-8): سلام بر هم
polish.txt | Polish (ISO-8859-2): Witaj świecie!
russian.txt | Russian (KOI8-R): Здравствуй, мир!
serbian.txt | Serbian (UTF-8): Поздрав свима!
turkish.txt | Turkish (UTF-8): Merhaba, dünya!
ukrainian.txt | Ukrainian (KOI8-U): Привіт світ!
vietnamese.txt | Vietnamese (UTF-8): Chào thế giới !
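
The idea behind the table above, several byte streams in different encodings ending up in one page, can be sketched in plain Python. This is a rough illustration under stated assumptions, not how the <<include ...>> macro is implemented: to_utf8_page is a hypothetical helper, the file contents are simulated in memory, and each encoding is passed in explicitly rather than detected.

```python
def to_utf8_page(sources):
    """Decode each (raw_bytes, encoding) pair and join the text as one UTF-8 page."""
    lines = [raw.decode(encoding) for raw, encoding in sources]
    return "\n".join(lines).encode("utf-8")


# A few samples from the table, paired with their traditional encodings.
samples = [
    ("Здравствуй, мир!", "koi8-r"),
    ("Γεια σου, κόσμε!", "iso-8859-7"),
    ("Witaj świecie!", "iso-8859-2"),
    ("안녕하세요, 여러분!", "euc-kr"),
]

# Simulate the included files: each one holds bytes in its own encoding.
sources = [(text.encode(encoding), encoding) for text, encoding in samples]

# Once decoded, every sample is ordinary Unicode text and can share one page.
page = to_utf8_page(sources)
print(page.decode("utf-8"))
```

The point of the sketch is that mixing only works after decoding: the raw KOI8-R and EUC-KR bytes cannot be concatenated directly, but their decoded text can.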


(The text samples were taken from GNU hello, which is Copyright (C) 2006 Free Software Foundation, Inc.)