[w3m-dev-en 00043] Re: w3m-i18n/m17n

From: Christian Weisgerber (w3m-dev-en@mips.rhein-neckar.de)
Date: Wed Jan 26 2000 - 16:47:14 CST


Hironori Sakamoto wrote:

> I have tried the internationalizaion / muntilingualization of w3m.
> The patch for w3m-0.1.4 is available on the following site.

YES! YES!
w3m has been sorely lacking in that area.

I'm looking at w3m-0.1.6-i18n right now.

I tried to compile it on Red Hat Linux 6.0/alpha, and it failed to
link with "undefined reference to `addeffect'". terms.c:addeffect()
is only compiled in if ANSI_COLOR is defined, which it apparently
isn't by default.

I have added "#define ANSI_COLOR" to config.h and it now builds--and
runs.

I've set the display code to ISO 8859-2 and displayed a page which
contains the character entity ł. This is handled correctly.

BUG! In the options panel, I set

Display code Latin 2 (ISO-8859-2)
Preferred document code Latin 1 (ISO-8859-1)

and click [OK]. If I then return to the options panel, the preferred
document code has switched to "Arabic (ISO-8859-6)". In fact, this
options always seems to set itself to Arabic.

I tried <URL:http://www.msk.ru/> (Moscow, Russia), which is quite
interesting since that page is offered in several encodings. Alas,
w3m always insists on setting itself to Arabic (although the server
sends the correct encoding).

I'm now running w3m-i18n inside an xterm with KOI-8R (Cyrillic)
font. The default page at the URL above is in KOI-8R. If I manually
set the document encoding to KOI-8R, the page is displayed correctly.
(I don't know Russian, but I can read the letters, and the title
looks in transcription like "Moskovskaya alternativa", which appears
sensible.) Now, if I go to the CP1251-encoded page, set the document
encoding to CP1251, and enable Unicode code conversion... Yes! It
works. This is great!

> NOTE:
> * UTF-8 is not used as display coding system.

This should be added.

Are you aware that the current version of plain xterm now supports
UTF-8? (No combining and double width characters yet, but it's a
start.) This will be part of XFree86 4.0 to be released soon. As
far as I know, Markus Kuhn's basic Unicode fonts will also be
included. This means effectively that within the next few months,
most X11 desktops in the world will be UTF-8 capable.

-- 
Christian "naddy" Weisgerber                  naddy@mips.rhein-neckar.de



This archive was generated by hypermail 2b29 : Wed Jul 19 2000 - 10:30:43 CDT