To create new wiki account, please join us on #znc at Libera.Chat and ask admins to create a wiki account for you. You can say thanks to spambots for this inconvenience.

Charset: Difference between revisions

From ZNC
Jump to navigation Jump to search
Gavin (talk | contribs)
restored from archive.org
Gavin (talk | contribs)
page fix
 
Line 1: Line 1:
{{Core Module | version = 1.0}}
ZNC has supported character encodings in core since [[ChangeLog/1.6.0|1.6.0]]. If you want to look at the earlier "charset" module, it's located at [[Charset/old]]. ZNC uses UTF-8 internally. You can configure charset used between ''clients and ZNC'', and charset used between ''ZNC and the IRC server''. Likely values are <code>UTF-8</code> and <code>ISO-8859-1</code>, but you might need other [[wikipedia:Character encoding#Common character encodings|common character encodings]].


Normalizes (i.e. converts) character encodings.
= Prerequisites =
[http://site.icu-project.org/ ICU library] does the actual conversion. If it's available, <code>./configure</code> will find it.


== Usage ==
= Configure via Web Admin =
=== Arguments ===
[[Image:Charset2.png|frame]][[Image:Charset1.png|frame]]
[-force] <client_charset1[,client_charset2[,...]]>  <server_charset1[,server_charset2[,...]]>
GUI is available in [[webadmin]] on both ''User Settings'' pages (''Client Encoding'', between the user's client and ZNC) and ''Network Settings'' pages (''Server Encoding'', between ZNC and the IRC server).


<code>-force</code> makes the module convert ALL messages instead of probing whether they are already encoded using the target charset (<code>client_charset</code>). This is useful for encodings with byte data that is also valid in the target charset. <code>client_charset2</code> (and so forth, in prioritized order) is the charset used if the message couldn't be encoded to charset1.
If one of the "Try to parse as UTF-8 and ___" options is selected, when ZNC receives a line from socket (either client, or server), at first it checks whether it's valid UTF-8, or not. If it's not, the fallback charset is used to decode the line.


=== Supported charsets ===
* After changing the ''Client Encoding'' on the user settings page, the user must reconnect their client to ZNC for settings to take effect.
All charsets supported by iconv are also supported by the module. At least in theory. Use <code>iconv --list</code> on your shell to get a list.
* After changing the ''Server Encoding'' on the network settings page, ZNC must be reconnected to the IRC server for settings to take effect, for example by typing <code>/msg *status Connect</code>.


=== Examples ===
<div style="clear:both"></div>
If your client understands and/or receives UTF-8 only, but some weird server you are on wants ISO-8859-1 exclusively:
/znc loadmod charset UTF-8 ISO-8859-1


If people on that server confuse basic charsets:
= Configure via Control Panel module =
/znc loadmod charset UTF-8 ISO-8859-1,ISO-8859-15,WINDOWS-1252
Make sure the [[controlpanel]] module is enabled. The following commands are available:
This will make ZNC send ISO-8859-1 to the server, but convert ISO-8859-1, ISO-8859-15 and WINDOWS-1252 to UTF-8 in the other direction.
:<code>/msg *controlpanel Set ClientEncoding <user> <value></code>
:<code>/msg *controlpanel SetNetwork Encoding <user> <network> <value></code>


If your client sends ISO-8859-1 but you want UTF-8 to be sent to your channels:
The <code><value></code> comes in 4 flavors, which map to 4 options in webadmin:
/znc loadmod charset ISO-8859-1 UTF-8
* <code></code> (empty): Don't ensure any encoding at all (legacy mode, not recommended)
You should also use this one if your clients sends ISO-8859-1 and you heavily use webadmin. Webadmin only displays UTF-8 (weird characters appear for other encodings).
* <code>^foo</code>: Try to parse as UTF-8 and as <code>foo</code>, send as UTF-8 (recommended)
* <code>*foo</code>: Try to parse as UTF-8 and as <code>foo</code>, send as <code>foo</code>
* <code>foo</code>: Parse and send as <code>foo</code> only.
If one of the "Try to parse as UTF-8 and ___" options is used (by prepending '''^''' or '''*''' to the value), when ZNC receives a line from socket (either client, or server), at first it checks whether it's valid UTF-8, or not. If it's not, the fallback charset is used to decode the line.


If your client only speaks UTF-8, but the server wants CP1251 exclusively:
==Examples==
/znc loadmod charset -force UTF-8 CP1251
; To set the encoding between user ''Kathryn'''s client and ZNC to "''Try to parse as UTF-8 and as <code>ISO-8859-15</code>, send as UTF-8 (recommended)''," you would use:
: <code>/msg *controlpanel Set ClientEncoding Kathryn ^ISO-8859-15</code>
: …and then user ''Kathryn'' would have to reconnect her client to ZNC.


== Warning ==
; To set user ''Kathryn'''s encoding between ZNC and the IRC network named ''freenodeIRC'' to "''Try to parse as UTF-8 and as <code>UTF-8</code>, send as UTF-8 (recommended)''," you would use:
If you connect multiple clients, and send a message from client A, client B will receive the ''converted'' version of the message. This is just how ZNC works and cannot be changed by a module.
: <code>/msg *controlpanel SetNetwork Encoding Kathryn freenodeIRC ^UTF-8</code>
: …and then ZNC would have to reconnect to network ''freenodeIRC'', for example by typing <code>/msg *status Connect</code>

Latest revision as of 23:29, 3 April 2017

ZNC has supported character encodings in core since 1.6.0. If you want to look at the earlier "charset" module, it's located at Charset/old. ZNC uses UTF-8 internally. You can configure charset used between clients and ZNC, and charset used between ZNC and the IRC server. Likely values are UTF-8 and ISO-8859-1, but you might need other common character encodings.

Prerequisites

ICU library does the actual conversion. If it's available, ./configure will find it.

Configure via Web Admin

GUI is available in webadmin on both User Settings pages (Client Encoding, between the user's client and ZNC) and Network Settings pages (Server Encoding, between ZNC and the IRC server).

If one of the "Try to parse as UTF-8 and ___" options is selected, when ZNC receives a line from socket (either client, or server), at first it checks whether it's valid UTF-8, or not. If it's not, the fallback charset is used to decode the line.

  • After changing the Client Encoding on the user settings page, the user must reconnect their client to ZNC for settings to take effect.
  • After changing the Server Encoding on the network settings page, ZNC must be reconnected to the IRC server for settings to take effect, for example by typing /msg *status Connect.

Configure via Control Panel module

Make sure the controlpanel module is enabled. The following commands are available:

/msg *controlpanel Set ClientEncoding <user> <value>
/msg *controlpanel SetNetwork Encoding <user> <network> <value>

The <value> comes in 4 flavors, which map to 4 options in webadmin:

  • (empty): Don't ensure any encoding at all (legacy mode, not recommended)
  • ^foo: Try to parse as UTF-8 and as foo, send as UTF-8 (recommended)
  • *foo: Try to parse as UTF-8 and as foo, send as foo
  • foo: Parse and send as foo only.

If one of the "Try to parse as UTF-8 and ___" options is used (by prepending ^ or * to the value), when ZNC receives a line from socket (either client, or server), at first it checks whether it's valid UTF-8, or not. If it's not, the fallback charset is used to decode the line.

Examples

To set the encoding between user Kathryn's client and ZNC to "Try to parse as UTF-8 and as ISO-8859-15, send as UTF-8 (recommended)," you would use
/msg *controlpanel Set ClientEncoding Kathryn ^ISO-8859-15
…and then user Kathryn would have to reconnect her client to ZNC.
To set user Kathryn's encoding between ZNC and the IRC network named freenodeIRC to "Try to parse as UTF-8 and as UTF-8, send as UTF-8 (recommended)," you would use
/msg *controlpanel SetNetwork Encoding Kathryn freenodeIRC ^UTF-8
…and then ZNC would have to reconnect to network freenodeIRC, for example by typing /msg *status Connect