
BXjatoucs Package

LaTeX: To convert Japanese character code to Unicode

This package provides function-like (fully-expandable) macros that convert a character code value in several Japanese encodings to a Unicode value. Supported source encodings are: ISO-2022-JP (jis), EUC-JP (euc), Shift_JIS (sjis) and the Adobe-Japan1 glyph set.

System requirement

  • TeX format: LaTeX and plain TeX.
  • TeX engine: Any engine that supports the e-TeX extensions.
  • Dependent packages:
    • infwarerr
    • ltxcmds

Installation

  • *.sty → $TEXMF/tex/latex/bxjatoucs
  • *.tfm → $TEXMF/fonts/tfm/public/bxjatoucs

License

This package is distributed under the MIT License.

The bxjatoucs Package ― main

Package Loading

In plain TeX:

\input bxjatoucs.sty

In LaTeX:

\usepackage{bxjatoucs}

Usage

Each macro takes one argument, which must form a valid TeX number representing the input code value, and expands (in two steps) to a string giving the output Unicode scalar value in decimal.

  • \bxjaJisToUcs{<value>}: converts from a jis scalar value.
  • \bxjaEucToUcs{<value>}: converts from an euc scalar value.
  • \bxjaSjisToUcs{<value>}: converts from a sjis scalar value.
  • \bxjaCidToUcs{<value>}: converts from an AJ1 CID value.
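
Because the macros are fully expandable, their result can be captured with \edef or used wherever TeX scans a number. The following is a minimal sketch for LaTeX, assuming the package is installed; the macro name \kotobukiUcs is hypothetical and chosen only for this illustration.

```latex
\documentclass{article}
\usepackage{bxjatoucs}
\begin{document}
% \edef expands the conversion completely, so \kotobukiUcs
% (a hypothetical name) ends up holding only decimal digits.
\edef\kotobukiUcs{\bxjaJisToUcs{"3C77}}
% According to the Example section, this writes 23551 to the log.
\typeout{JIS 3C77 maps to Unicode \kotobukiUcs}
% The digits can be used directly where TeX scans a number.
\ifnum\kotobukiUcs>255 \typeout{Outside the Latin-1 range}\fi
Done.
\end{document}
```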

There are variants that return in hexadecimal (zero-padded to at least four digits):

  • \bxjaJisToUcsHex{<value>}
  • \bxjaEucToUcsHex{<value>}
  • \bxjaSjisToUcsHex{<value>}
  • \bxjaCidToUcsHex{<value>}
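
The hexadecimal variants are handy for writing code points in the usual "U+XXXX" notation. A short sketch, using only input values taken from the Example section:

```latex
\documentclass{article}
\usepackage{bxjatoucs}
\begin{document}
% Per the Example section, both conversions yield 5BFF,
% so each line should write U+5BFF to the log.
\typeout{U+\bxjaSjisToUcsHex{"8EF5}}
\typeout{U+\bxjaCidToUcsHex{2339}}
Done.
\end{document}
```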

Note: These macros return zero (decimal 0 or hexadecimal 0000) if the input number is outside the valid range of the source encoding. If the input is malformed, the macros raise the TeX error "Missing number" and then return zero.
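
Since an unmappable input yields zero, a caller can test the result with \ifnum. The sketch below defines \IfJisMapped, a hypothetical helper that is not part of the package, and assumes that "FFFF lies outside the valid jis range.

```latex
\documentclass{article}
\usepackage{bxjatoucs}
% \IfJisMapped{<value>}{<true>}{<false>} is a hypothetical helper,
% not part of the package: it runs <true> when the conversion
% yields a nonzero value and <false> when it yields zero.
\newcommand*\IfJisMapped[3]{%
  \ifnum\bxjaJisToUcs{#1}=0 #3\else #2\fi}
\begin{document}
\IfJisMapped{"3C77}{\typeout{mapped}}{\typeout{not mapped}}
% "FFFF is assumed here to be outside the valid jis range.
\IfJisMapped{"FFFF}{\typeout{mapped}}{\typeout{not mapped}}
Done.
\end{document}
```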

Additional variants:

  • \bxjaFastCidToUcs{<value>}: same as \bxjaCidToUcs, except that the argument is assumed to be a valid decimal string.
  • \bxjaFastCidToUcsHex{<value>}: the same, for \bxjaCidToUcsHex.
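
The "valid decimal string" requirement means that a hexadecimal literal or a count register should not be passed directly to the fast variants. One possible way to feed a register value is to convert it to digits with \number first, as sketched below; the register name \mycid is hypothetical.

```latex
\documentclass{article}
\usepackage{bxjatoucs}
\newcount\mycid % hypothetical register holding a CID
\begin{document}
\mycid=2339
% A literal decimal string can be passed as is.
\typeout{\bxjaFastCidToUcs{2339}}
% A register is not a decimal string, so it is first turned into
% digits with \number before the macro sees its argument.
\typeout{\expandafter\bxjaFastCidToUcs\expandafter{\number\mycid}}
Done.
\end{document}
```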

Conversions from Unicode

Starting from v0.3, macros for some conversions from Unicode (rather than to Unicode) are available. The input and output conventions are the same as for the macros above.

  • \bxjaUcsToJis{<value>}: converts to a jis scalar value.
  • \bxjaUcsToEuc{<value>}: converts to an euc scalar value.
  • \bxjaUcsToSjis{<value>}: converts to a sjis scalar value.
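
A sketch of the reverse direction, assuming v0.3 or later is installed. It converts U+5BFF to each legacy encoding and then round-trips the jis result; \tmpjis is a hypothetical scratch macro. If the reverse mapping is the inverse of the one shown in the Example section, the last line would be expected to print 23551.

```latex
\documentclass{article}
\usepackage{bxjatoucs}
\begin{document}
% Convert U+5BFF to each legacy encoding; results are decimal.
\typeout{jis: \bxjaUcsToJis{"5BFF}}
\typeout{euc: \bxjaUcsToEuc{"5BFF}}
\typeout{sjis: \bxjaUcsToSjis{"5BFF}}
% Round trip: store the jis value, then expand \tmpjis to its
% digits before handing it back to \bxjaJisToUcs.
\edef\tmpjis{\bxjaUcsToJis{"5BFF}}
\typeout{back: \expandafter\bxjaJisToUcs\expandafter{\tmpjis}}
Done.
\end{document}
```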

Example

The following \message lines all display 23551:21496.

\message{\bxjaJisToUcs{"3C77}:\bxjaJisToUcs{"3B4A}}
\message{\bxjaEucToUcs{"BCF7}:\bxjaEucToUcs{"BBCA}}
\message{\bxjaSjisToUcs{"8EF5}:\bxjaSjisToUcs{"8E69}}
\message{\bxjaCidToUcs{2339}:\bxjaCidToUcs{2200}}

The following \message lines all display 5BFF:53F8.

\message{\bxjaJisToUcsHex{"3C77}:\bxjaJisToUcsHex{"3B4A}}
\message{\bxjaEucToUcsHex{"BCF7}:\bxjaEucToUcsHex{"BBCA}}
\message{\bxjaSjisToUcsHex{"8EF5}:\bxjaSjisToUcsHex{"8E69}}
\message{\bxjaCidToUcsHex{2339}:\bxjaCidToUcsHex{2200}}

Revision History

  • Version 0.3a ‹2024/07/22›
    • Renamed TFM 'bxjatoucs-fromjis' to 'bxjatoucs-tojis'.
  • Version 0.3 ‹2024/07/20›
    • Added “Unicode to JIS” type conversions.
  • Version 0.2 ‹2019/10/20›
    • The first public version.

Takayuki YATO (aka. "ZR")
https://github.com/zr-tex8r
