Conversion between Traditional and Simplified Chinese
Open Chinese Convert (OpenCC, 開放中文轉換) is an opensource project for conversions between Traditional Chinese, Simplified Chinese and Japanese Kanji (Shinjitai). It supports character-level and phrase-level conversion, character variant conversion and regional idioms among Mainland China, Taiwan and Hong Kong. This is not translation tool between Mandarin and Cantonese, etc.
中文簡繁轉換開源項目,支持詞彙級別的轉換、異體字轉換和地區習慣用詞轉換(中國大陸、臺灣、香港、日本新字體)。不提供普通話與粵語的轉換。
Discussion (Telegram): https://t.me/openchineseconvert
See Download.
Warning: This is NOT an API. You will be banned if you make calls programmatically.
https://opencc.byvoid.com/
npm install opencc
const OpenCC = require('opencc'); const converter = new OpenCC('s2t.json'); converter.convertPromise("汉字").then(converted => { console.log(converted); // 漢字 });
import { OpenCC } from 'opencc'; async function main() { const converter: OpenCC = new OpenCC('s2t.json'); const result: string = await converter.convertPromise('汉字'); console.log(result); }
See demo.js and ts-demo.ts.
pip install opencc(Windows, Linux, Mac)
import opencc converter = opencc.OpenCC('s2t.json') converter.convert('汉字') # 漢字
#include "opencc.h"int main() { const SimpleConverter converter("s2t.json"); converter.Convert("汉字"); // 漢字 return 0; }
#include "opencc.h"int main() { opencc_t opencc = opencc_open("s2t.json"); const char* input = "汉字"; char* converted = opencc_convert_utf8(opencc, input, strlen(input)); // 漢字 opencc_convert_utf8_free(converted); opencc_close(opencc); return 0; }
Document 文檔: https://byvoid.github.io/OpenCC/
opencc --help
opencc_dict --help
opencc_phrase_extract --help
s2t.jsonSimplified Chinese to Traditional Chinese 簡體到繁體
t2s.jsonTraditional Chinese to Simplified Chinese 繁體到簡體
s2tw.jsonSimplified Chinese to Traditional Chinese (Taiwan Standard) 簡體到臺灣正體
tw2s.jsonTraditional Chinese (Taiwan Standard) to Simplified Chinese 臺灣正體到簡體
s2hk.jsonSimplified Chinese to Traditional Chinese (Hong Kong variant) 簡體到香港繁體
hk2s.jsonTraditional Chinese (Hong Kong variant) to Simplified Chinese 香港繁體到簡體
s2twp.jsonSimplified Chinese to Traditional Chinese (Taiwan Standard) with Taiwanese idiom 簡體到繁體(臺灣正體標準)並轉換爲臺灣常用詞彙
tw2sp.jsonTraditional Chinese (Taiwan Standard) to Simplified Chinese with Mainland Chinese idiom 繁體(臺灣正體標準)到簡體並轉換爲中國大陸常用詞彙
t2tw.jsonTraditional Chinese (OpenCC Standard) to Taiwan Standard 繁體(OpenCC 標準)到臺灣正體
hk2t.jsonTraditional Chinese (Hong Kong variant) to Traditional Chinese 香港繁體到繁體(OpenCC 標準)
t2hk.jsonTraditional Chinese (OpenCC Standard) to Hong Kong variant 繁體(OpenCC 標準)到香港繁體
t2jp.jsonTraditional Chinese Characters (Kyūjitai) to New Japanese Kanji (Shinjitai) 繁體(OpenCC 標準,舊字體)到日文新字體
jp2t.jsonNew Japanese Kanji (Shinjitai) to Traditional Chinese Characters (Kyūjitai) 日文新字體到繁體(OpenCC 標準,舊字體)
tw2t.jsonTraditional Chinese (Taiwan standard) to Traditional Chinese 臺灣正體到繁體(OpenCC 標準)
g++ 4.6+ or clang 3.2+ is required.
make
build.cmd
make test
test.cmd
make benchmark
Example results (from Github CI):
1: ------------------------------------------------------------------ 1: Benchmark Time CPU Iterations 1: ------------------------------------------------------------------ 1: BM_Initialization/hk2s 1.56 ms 1.56 ms 442 1: BM_Initialization/hk2t 0.144 ms 0.144 ms 4878 1: BM_Initialization/jp2t 0.260 ms 0.260 ms 2604 1: BM_Initialization/s2hk 23.8 ms 23.8 ms 29 1: BM_Initialization/s2t 25.6 ms 25.6 ms 28 1: BM_Initialization/s2tw 24.0 ms 23.9 ms 30 1: BM_Initialization/s2twp 24.6 ms 24.6 ms 28 1: BM_Initialization/t2hk 0.052 ms 0.052 ms 12897 1: BM_Initialization/t2jp 0.141 ms 0.141 ms 5012 1: BM_Initialization/t2s 1.30 ms 1.30 ms 540 1: BM_Initialization/tw2s 1.39 ms 1.39 ms 529 1: BM_Initialization/tw2sp 1.69 ms 1.69 ms 426 1: BM_Initialization/tw2t 0.089 ms 0.089 ms 7707 1: BM_Convert2M 582 ms 582 ms 1 1: BM_Convert/100 1.07 ms 1.07 ms 636 1: BM_Convert/1000 11.0 ms 11.0 ms 67 1: BM_Convert/10000 113 ms 113 ms 6 1: BM_Convert/100000 1176 ms 1176 ms 1
Apache License 2.0
All these libraries are statically linked by default.
Please update this list you have contributed OpenCC.