What are the best open-source alternatives to ToolGood.Words?

30 open-source projects similar to toolgood/toolgood.words, ranked by shared features. Top picks: mozillazg/python-pinyin, zh-lx/pinyin-pro, byvoid/opencc, isnowfy/snownlp, overtrue/pinyin, hotoo/pinyin, houbb/sensitive-word, codemayq/chinese-chatbot-corpus, hoothin/userscripts, protectai/llm-guard.

Is mozillazg/python-pinyin a good alternative to ToolGood.Words?

python-pinyin is a Python library for transliterating simplified and traditional Chinese characters into phonetic pinyin. It functions as a transliteration system that converts text while supporting tone sandhi and providing utilities to transform pinyin between different formats, such as numeric t…

Is zh-lx/pinyin-pro a good alternative to ToolGood.Words?

pinyin-pro is a Chinese pinyin transcription library and text segmentation tool. It converts Chinese characters into pinyin with support for tones, initials, and finals, while resolving polyphonic characters based on context. The project includes a pinyin pattern matching engine that enables searc…

Is byvoid/opencc a good alternative to ToolGood.Words?

OpenCC is a library and command-line tool for converting text between Simplified Chinese, Traditional Chinese, and Japanese Kanji. It operates at both the individual character and multi-character phrase levels, and applies region-specific vocabulary choices for Mainland China, Taiwan, and Hong Kong…

Is isnowfy/snownlp a good alternative to ToolGood.Words?

SnowNLP is a Python library for Chinese natural language processing. It provides tools for text segmentation, sentiment analysis, document classification, and phonetic transliteration. The library includes capabilities for training and saving custom machine learning models for tokenization and sen…

Is overtrue/pinyin a good alternative to ToolGood.Words?

This is a dictionary-based Chinese Pinyin transliteration library used to convert Chinese characters into Pinyin with support for various tone styles and formats. It provides specialized utilities for polyphonic character resolution to manage multiple pronunciations and a generator for extracting t…

Is hotoo/pinyin a good alternative to ToolGood.Words?

This is a Chinese text segmentation library that converts Chinese characters into their phonetic pinyin representation. It functions as a polyphone disambiguation tool, resolving ambiguous pronunciations for multi-sound characters using word segmentation and context analysis, and also serves as a p…

Is houbb/sensitive-word a good alternative to ToolGood.Words?

This project is a high-performance Java library and content moderation framework designed to detect and mask prohibited words in text. It utilizes a Deterministic Finite Automaton (DFA) scanner to implement efficient longest-match word detection. The engine distinguishes itself through a text norm…

Is codemayq/chinese-chatbot-corpus a good alternative to ToolGood.Words?

This project provides a collection of processed Chinese conversational datasets and preprocessing workflows designed for training and instruction tuning of large language models. It functions as a training corpus of cleaned, standardized Chinese text formatted as query-answer pairs. The repository…

Is hoothin/userscripts a good alternative to ToolGood.Words?

UserScripts is a collection of JavaScript browser userscripts designed to modify website behavior and add custom functionality to web browsers. It serves as a multi-purpose toolset for web page content automation, web interface enhancement, and specialized web scraping and downloading. The project…

Is protectai/llm-guard a good alternative to ToolGood.Words?

LLM Guard is a security firewall and guardrail framework designed to scan and sanitize inputs and outputs for large language models. It functions as a proxy gateway and security layer to block prompt injections, toxicity, and sensitive data leakage while ensuring that model interactions remain comp…

Back to toolgood/toolgood.words

Open-source alternatives to ToolGood.Words

30 open-source projects similar to toolgood/toolgood.words, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best ToolGood.Words alternative.

mozillazg/python-pinyin
mozillazg/python-pinyin
5,325View on GitHub
python-pinyin is a Python library for transliterating simplified and traditional Chinese characters into phonetic pinyin. It functions as a transliteration system that converts text while supporting tone sandhi and providing utilities to transform pinyin between different formats, such as numeric tones, accent marks, or phonetic initials. The library features a polyphonic character resolver that analyzes surrounding word context to select the correct pronunciation for characters with multiple sounds. It also includes a customizable dictionary system that allows the extension of default transl
Pythonchinesehanzihanzi-pinyin
View on GitHub5,325
zh-lx/pinyin-pro
zh-lx/pinyin-pro
4,646View on GitHub
pinyin-pro is a Chinese pinyin transcription library and text segmentation tool. It converts Chinese characters into pinyin with support for tones, initials, and finals, while resolving polyphonic characters based on context. The project includes a pinyin pattern matching engine that enables searching Chinese text using full spellings, initials, or hybrid phonetic patterns. It also features a pinyin HTML generator that wraps characters and their transcriptions in markup tags for styled web display. The library provides capabilities for Chinese text segmentation, surname pronunciation priorit
TypeScripthanzihanzi-pinyinhanzi2pinyin
View on GitHub4,646
byvoid/opencc
BYVoid/OpenCC
9,772View on GitHub
OpenCC is a library and command-line tool for converting text between Simplified Chinese, Traditional Chinese, and Japanese Kanji. It operates at both the individual character and multi-character phrase levels, and applies region-specific vocabulary choices for Mainland China, Taiwan, and Hong Kong during conversion. The conversion engine resolves ambiguous character mappings using semantic and contextual rules, normalizes variant character forms for consistent orthography, and sequences multiple dictionary files into a configurable pipeline. It supports embedding custom conversion rules dire
C++chinesechinese-conversionchinese-translation
View on GitHub9,772

Open-source alternatives to ToolGood.Words

mozillazg/python-pinyin

zh-lx/pinyin-pro

BYVoid/OpenCC

isnowfy/snownlp

overtrue/pinyin

hotoo/pinyin

houbb/sensitive-word

codemayq/chinese-chatbot-corpus

hoothin/UserScripts

protectai/llm-guard

xiaoyifang/goldendict-ng

nalgeon/sqlean

konsheng/Sensitive-lexicon

HIT-SCIR/ltp

chatopera/Synonyms

huyingxi/Synonyms

brightmart/nlp_chinese_corpus

esbatmop/MNBVC

amzxyz/rime_wanxiang

zongzibinbin/MallChat

ownthink/KnowledgeGraphData

ymcui/Chinese-BERT-wwm

649453932/Bert-Chinese-Text-Classification-Pytorch

osfans/trime

chmln/sd

folke/todo-comments.nvim

fcitx5-android/fcitx5-android

itorr/nbnhhsh

bensadeh/tailspin

dwisiswant0/apkleaks