Charset module
The charset module contains the list of all the encoding schemes supported by LocalSolver and the methods required to query that list.
A charset is not useful on its own, but it can be used in conjunction with the I/O module to read or write files with the appropriate encoding.
Complete list of supported encodings
The following table lists all the encodings supported by the modeler of LocalSolver with their common aliases and the languages for which the encoding is usually employed.
Encoding name | Aliases | Languages |
---|---|---|
ascii | us-ascii | English |
cp437 | pc-8, ibm437 | English |
cp1250 | windows-1250 | Central and Eastern Europe |
cp1251 | windows-1251 | Bulgarian, Serbian and Macedonian |
cp1252 | windows-1252 | Western Europe |
cp1253 | windows-1253 | Modern Greek |
cp1254 | windows-1254 | Turkish |
cp1255 | windows-1255 | Hebrew |
cp1256 | windows-1256 | Arabic |
cp1257 | windows-1257 | Estonian, Latvian and Lithuanian |
cp1258 | windows-1258 | Vietnamese |
iso-8859-1 | latin, latin1 | Western Europe |
iso-8859-2 | latin2 | Central and Eastern Europe |
iso-8859-3 | latin3 | Esperanto, Maltese, Turkish |
iso-8859-4 | latin4 | Estonian, Latvian, Lithuanian, Sami |
iso-8859-5 | cyrillic | Bulgarian, Serbian, Russian |
iso-8859-6 | arabic | Arabic |
iso-8859-7 | greek | Greek |
iso-8859-8 | hebrew | Hebrew |
iso-8859-9 | latin5 | Turkish |
iso-8859-10 | latin6 | Nordic languages |
iso-8859-11 | thai | Thai |
iso-8859-13 | latin7 | Baltic languages |
iso-8859-14 | latin8 | Celtic languages |
iso-8859-15 | latin9 | Western Europe |
iso-8859-16 | latin10 | South-eastern Europe |
utf-8 | | All languages |
utf-16 | | All languages |
utf-16be | | All languages |
utf-16le | | All languages |
utf-32be | | All languages |
utf-32le | | All languages |
Note
To use the features of this module, you must put a special import statement at the beginning of your LSP file: use charset;
Constants
This module instantiates two encodings by default: utf-8 and iso-8859-1. You can use them directly through the following built-in variables, without calling the charset.lookup() method.
- charset.ISO_8859_1
  Latin-1 encoding. This encoding is the most widely used for Western Europe. It is also the default encoding of LocalSolver.
- charset.UTF_8
  UTF-8 encoding. This encoding should be the default one for your new projects with LocalSolver. The parser of LocalSolver uses it automatically if a byte order mark (BOM) is detected at the beginning of your files.
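
As an illustration of how one of these constants might be combined with the I/O module, here is a minimal sketch. It assumes that io.openRead accepts a charset as an optional second argument and that the returned reader offers readln() and close(); none of this is described on this page, so check it against the I/O module reference. The file name instance.txt is a placeholder.

```lsp
use io;
use charset;

function main(args) {
    // "instance.txt" is a hypothetical file name.
    // Assumption: io.openRead takes the charset as an optional second argument.
    local reader = io.openRead("instance.txt", charset.UTF_8);

    // Read and print the first line, decoded as UTF-8.
    local firstLine = reader.readln();
    println(firstLine);

    reader.close();
}
```

Passing charset.UTF_8 explicitly avoids depending on the default encoding (iso-8859-1) when the file has no BOM.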
Functions
- charset.default()
  Returns the default encoding used by LocalSolver. In the current version of LocalSolver, this method returns charset.ISO_8859_1.
- charset.lookup(name)
  Returns the encoding corresponding to the given name, which can be either the name of the encoding or one of its aliases. The lookup is not case sensitive. If no encoding exists with the given name, an exception is thrown.
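
The two functions above can be sketched as follows. The names and aliases come from the table of supported encodings; the variable names are illustrative only, and the snippet assumes the file is run with a main function as its entry point.

```lsp
use charset;

function main(args) {
    // The encoding name and one of its aliases designate the same encoding.
    local byName = charset.lookup("iso-8859-15");
    local byAlias = charset.lookup("latin9");

    // The lookup is not case sensitive, so this call also succeeds.
    local upperCase = charset.lookup("LATIN9");

    // Encoding used by LocalSolver when none is specified.
    local fallback = charset.default();
}
```

A name that appears neither as an encoding name nor as an alias in the table makes charset.lookup() throw an exception.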