What are the best open-source alternatives to Pkuseg Python?

30 open-source projects similar to lancopku/pkuseg-python, ranked by shared features. Top picks: nlpchina/ansj_seg, fxsjy/jieba, isnowfy/snownlp, hit-scir/ltp, baidu/lac, hankcs/hanlp, stanfordnlp/stanza, chatopera/synonyms, spencermountain/compromise, infinilabs/analysis-ik.

Is nlpchina/ansj_seg a good alternative to Pkuseg Python?

ansj_seg is a Java NLP toolkit and segmentation library designed for processing Chinese text. It functions as a word segmenter, part-of-speech tagger, and named entity recognizer to divide continuous Chinese characters into meaningful words and tokens. The library utilizes statistical models for t…

Is fxsjy/jieba a good alternative to Pkuseg Python?

This project is a Chinese text segmentation library and tokenizer designed to split Chinese sentences into individual words. It serves as a natural language processing tool for splitting characters into words, tagging parts of speech, and extracting keywords using statistical analysis. The library…

Is isnowfy/snownlp a good alternative to Pkuseg Python?

SnowNLP is a Python library for Chinese natural language processing. It provides tools for text segmentation, sentiment analysis, document classification, and phonetic transliteration. The library includes capabilities for training and saving custom machine learning models for tokenization and sen…

Is hit-scir/ltp a good alternative to Pkuseg Python?

This is a Chinese natural language processing toolkit providing a suite of tools for word segmentation, part-of-speech tagging, and named entity recognition. It includes a neural dependency parser for analyzing syntactic and semantic relationships between words and a machine learning training suite…

Is baidu/lac a good alternative to Pkuseg Python?

LAC is a Chinese lexical analysis engine and toolkit designed for joint word segmentation, part-of-speech tagging, and named entity recognition. It functions as a high-performance system that identifies word boundaries and grammatical categories using trained machine learning models. The project f…

Is hankcs/hanlp a good alternative to Pkuseg Python?

HanLP is a natural language processing library and deep learning framework specifically optimized for the Chinese language, while also functioning as a multilingual text processor. It serves as a toolkit for performing linguistic analysis, semantic understanding, and script conversion. The project…

Is stanfordnlp/stanza a good alternative to Pkuseg Python?

Stanza is a Python natural language processing library designed for tokenization, lemmatization, and dependency parsing across many human languages using neural models. It provides a neural processing pipeline that converts raw text into structured linguistic data objects, alongside a specialized a…

Is chatopera/synonyms a good alternative to Pkuseg Python?

Synonyms is a natural language processing library and semantic similarity engine specifically designed for Chinese text. It functions as a word embedding toolkit and tokenizer that extracts semantic meaning and identifies synonyms by calculating the conceptual closeness between words and sentences.…

Is spencermountain/compromise a good alternative to Pkuseg Python?

Compromise is a natural language processing library and rule-based text parser designed to analyze unstructured text. It functions as a toolkit for identifying parts of speech, linguistic patterns, and semantic meaning, while providing specialized engines for named entity recognition and the parsin…

Is infinilabs/analysis-ik a good alternative to Pkuseg Python?

Analysis-ik is a Chinese text segmenter and analysis plugin for Lucene-based search engines. It provides a specialized analyzer for splitting Chinese sentences into meaningful words to improve indexing and search accuracy within Elasticsearch and OpenSearch. The project features a dynamic dictiona…


Open-source alternatives to Pkuseg Python

30 open-source projects similar to lancopku/pkuseg-python, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Pkuseg Python alternative.

NLPchina/ansj_seg
6,528 stars
ansj_seg is a Java NLP toolkit and segmentation library designed for processing Chinese text. It functions as a word segmenter, part-of-speech tagger, and named entity recognizer to divide continuous Chinese characters into meaningful words and tokens. The library utilizes statistical models for text segmentation and provides capabilities for identifying and extracting person names from unstructured documents. It also assigns grammatical categories to tokens to determine their linguistic roles within a sentence. The toolkit supports domain-specific text processing through the use of custom d…
Language: Java. Topics: ansj, chinese, java.
fxsjy/jieba
35,027 stars
This project is a Chinese text segmentation library and tokenizer designed to split Chinese sentences into individual words. It serves as a natural language processing tool for splitting characters into words, tagging parts of speech, and extracting keywords using statistical analysis. The library distinguishes itself through support for custom dictionary configuration and vocabulary file management, allowing users to override default segmentation rules for domain-specific accuracy. It also includes a TF-IDF keyword extractor to identify significant words and core topics within documents. Th…
Language: Python.
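As a rough illustration of why a custom dictionary can change segmentation output, the sketch below uses greedy forward maximum matching over a toy word list. This is a deliberately simple stand-in technique, not how jieba or pkuseg work internally, and the function name `segment`, the `max_len` parameter, and the sample dictionary are all hypothetical.

```python
def segment(text, dictionary, max_len=4):
    """Greedy forward maximum matching (toy illustration only).

    At each position, take the longest substring (up to max_len characters)
    found in the dictionary; fall back to a single character otherwise.
    """
    tokens = []
    i = 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in dictionary:
                tokens.append(piece)
                i += size
                break
    return tokens


# Hypothetical base vocabulary and sample sentence.
base_dict = {"北京", "大学", "生物", "研究"}
sentence = "北京大学生物研究"

# Without the domain term, "北京大学" is split into two words.
default_tokens = segment(sentence, base_dict)

# Adding the domain term as a custom entry keeps it as one token.
custom_tokens = segment(sentence, base_dict | {"北京大学"})

print(default_tokens)
print(custom_tokens)
```

Real segmenters combine dictionaries with statistical or neural models, so treat this only as a picture of the dictionary-override idea.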
isnowfy/snownlp
6,631 stars
SnowNLP is a Python library for Chinese natural language processing. It provides tools for text segmentation, sentiment analysis, document classification, and phonetic transliteration. The library includes capabilities for training and saving custom machine learning models for tokenization and sentiment analysis using raw training datasets. It covers a range of linguistic processing areas, including parts of speech tagging, sentence splitting, and text similarity measurement. The toolkit also provides utilities for extracting key information through text summarization and calculating word im…
Language: Python.

Open-source alternatives to Pkuseg Python

NLPchina/ansj_seg

fxsjy/jieba

isnowfy/snownlp

HIT-SCIR/ltp

baidu/lac

hankcs/HanLP

stanfordnlp/stanza

chatopera/Synonyms

spencermountain/compromise

infinilabs/analysis-ik

huyingxi/Synonyms

wainshine/Chinese-Names-Corpus

ownthink/KnowledgeGraphData

Embedding/Chinese-Word-Vectors

ymcui/Chinese-BERT-wwm

zalandoresearch/flair

nlpxucan/WizardLM

nltk/nltk

stanfordnlp/CoreNLP

flairNLP/flair

zjunlp/DeepKE

clips/pattern

mozilla/TTS

sloria/TextBlob

JohnSnowLabs/spark-nlp

SCIR-HI/Huatuo-Llama-Med-Chinese

huggingface/tokenizers

languagetool-org/languagetool

fastai/course-v3

duanhongyi/genius