hankcs
UserOn the leaderboard
| Rank | Repository | Stars |
|---|---|---|
| 712 | hankcs/HanLP | 36,236 |
Top repositories by stars
- hankcs/HanLP(on leaderboard)
中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理
Python36,137 - hankcs/pyhanlp
中文分词
Python3,211 - hankcs/AhoCorasickDoubleArrayTrie
An extremely fast implementation of Aho Corasick algorithm based on Double Array Trie.
Java1,012 - hankcs/CS224n
CS224n: Natural Language Processing with Deep Learning Assignments Winter, 2017
Python684 - hankcs/Viterbi
An implementation of HMM-Viterbi Algorithm 通用的维特比算法实现
Java379 - hankcs/multi-criteria-cws
Simple Solution for Multi-Criteria Chinese Word Segmentation
Python303 - hankcs/hanlp-lucene-plugin
HanLP中文分词Lucene插件,支持包括Solr在内的基于Lucene的系统
Java298 - hankcs/TextRank
TextRank算法提取关键词的Java实现
Java205 - hankcs/LDA4j
A Java implemention of LDA(Latent Dirichlet Allocation)
Java197 - hankcs/aho-corasick
Aho-Corasick的Java实现,针对Ascii优化,支持Unicode。
Java191 - hankcs/TreebankPreprocessing
Python scripts preprocessing Penn Treebank and Chinese Treebank
Python161 - hankcs/MainPartExtractor
主谓宾提取器的Java实现(对斯坦福的代码失去兴趣,不再维护)
Java142 - hankcs/ID-CNN-CWS
Source codes and corpora of paper "Iterated Dilated Convolutions for Chinese Word Segmentation"
Python133 - hankcs/neural_net
反向传播神经网络及应用
Python86 - hankcs/udacity-deep-learning
Assignments for Udacity Deep Learning class with TensorFlow in PURE Python, not IPython Notebook
Python65 - hankcs/AveragedPerceptronPython
Clone of "A Good Part-of-Speech Tagger in about 200 Lines of Python" by Matthew Honnibal
Python49 - hankcs/text-classification-svm
The missing SVM-based text classification module implementing HanLP's interface
Java46 - hankcs/MaxEnt
这是一个最大熵的简明Java实现,提供提供训练与预测接口。训练算法采用GIS训练算法,附带示例训练集和一个天气预测的Demo。
Java45 - hankcs/hidden-markov-model
First order HMM with Viterbi, Forward-Backward and Baum-Welch implementations
Python33 - hankcs/IceNAT
IceNAT
Java33 - hankcs/Coursera_NLP_MC
Coursera Natural Language Processing by Michael Collins Columbia University
JavaScript29 - hankcs/BERT-token-level-embedding
Generate BERT token level embedding without pain
Python27 - hankcs/HanLPAndroidDemo
HanLP Android Demo
Java27 - hankcs/sub-character-cws
Sub-Character Representation Learning
Python25 - hankcs/coursera-neural-net
Assignments for Geoffrey Hinton's Neural Net Course on Coursera, translated from (gross)Matlab into (beautiful)Python.
Python23 - hankcs/maxent_iis
最大熵-IIS(Improved Iterative Scaling)训练算法的Java实现
Java18 - hankcs/gohanlp
Golang RESTful Client for HanLP
Go14 - hankcs/distributed-bert
TensorFlow code and pre-trained models for BERT
Python11 - hankcs/iparser
Yet another dependency parser, integrated with tokenizer, tagger and visualization tool.
Python11 - hankcs/DeepBiaffineParserMXNet
An experimental implementation of biaffine parser using MXNet
Python10 - hankcs/OpenCC-to-HanLP
无损转换OpenCC词典为HanLP格式
Python9 - hankcs/chinese-corpus
中文相关词典和语料库。
9 - hankcs/libsvm
A Java Maven port of libsvm
9 - hankcs/RenrenAlbumDownloader
人人网全部好友相册下载器
Python7 - hankcs/word2vec-lucene
This tool extracts word vectors from Lucene index.
Java3 - hankcs/gluon-nlp
NLP made easy
Python2 - MATLAB2
- hankcs/DeepLearning
Deep Learning (Python, C, C++, Java, Scala, Go)
Java2 - Java2
- hankcs/word2vec
word2vec的Java并行实现
Java2 - hankcs/MA-FSA
This is a minimal acyclic finite-state automata algorithm in Java based on the paper, "Incremental Construction of Minimal Acyclic Finite-State Automata".
Java2 - hankcs/dsa-java
Data Structures and Algorithms in Java
Java1 - hankcs/bolt_splits
Split Broad Operational Language Translation corpus into train/dev/test set
Python1 - hankcs/keras-rl2
Reinforcement learning with tensorflow 2 keras
Python1 - hankcs/web-data
The repo to host all the web data including images for documents in dmlc projects.
Jupyter Notebook1 - hankcs/stanford_dl_ex-solutions
Deep neural network and deep convolutional network with AdaGrad and AdaDec
MATLAB1 - hankcs/zanata-platform
Zanata is a web-based system for translators to translate documentation and software online using a web browser.
Java1 - hankcs/word2vec-google
Google's word2vec with CMake configuration
C1 - hankcs/hmm
First commit
Python1 - hankcs/sumy
Module for automatic summarization of text documents and HTML pages.
Python1 - hankcs/word2vec-sentiments
Tutorial for Sentiment Analysis using Doc2Vec in gensim (or "getting 87% accuracy in sentiment analysis in under 100 lines of code")
Jupyter Notebook1 - hankcs/datumbox-framework
Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statistical applications.
Java1 - hankcs/TrieHard
A generic compact Trie implementation in Java. Built for high-performance applications.
Java1 - Python1
- hankcs/myutil
hello
Java1 - hankcs/Dictionaries
Java implementation of a DFA dictionary data structure
Java1 - hankcs/appworld
🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Paper.
Python0 - hankcs/OpenDF
Code to reproduce LREC Paper Simplifying Semantic Annotations of SMCalFlow
Python0 - hankcs/swne
Switchboard Named Entity Corpus
Python0 - hankcs/mini_racer
Minimal embedded v8
JavaScript0 - hankcs/discourse-elasticsearch
discourse plugin to support elasticsearch
JavaScript0 - hankcs/data-science
Practical Approaches to Data Science with Text
TeX0 - hankcs/elit
Evolution of Language and Information Technology
Python0 - hankcs/tree-viewer
A d3.js syntax tree viewer
HTML0 - hankcs/UFLDL-tutorial
Deep Learning and Unsupervised Feature Learning Tutorial Solutions
Jupyter Notebook0 - hankcs/ufldl_tutorial
Stanford Unsupervised Feature Learning and Deep Learning Tutorial
Python0 - Python0
- hankcs/Large-Margin-Structured-Perceptron
Implementation of a large margin structured perceptron, including instances for sequence labeling, quotation extraction and dependency parsing.
Java0 - hankcs/aptagger
An implementation of Matthew Honnibal's fast and accurate part-of-speech tagger based on the Averaged Perceptron
Java0 - hankcs/pylbfgs
the python implementation of L-BFGS
Python0 - hankcs/python-crf
Python implementation of linear-chain conditional random fields.
Python0 - hankcs/nndl
Another Chinese Translation of Neural Networks and Deep Learning
TeX0 - hankcs/bayon
a simple and fast clustering tool
C++0 - hankcs/cmake-swig-java
Minimal example of using SWIG to call C++ code from Java, with a CMake cross-platform build system.
CMake0 - hankcs/word2vec-c11
Word2Vec in C++ 11
C++0 - hankcs/word2vec-doc2vec
An extension of word2vec to efficiently represent new text as vectors. New text can be query, sentence and paragraph.
C0 - hankcs/commons
Common classes, mostly pertaining to concurrency.
Java0 - hankcs/pymining
python data mining platform
Python0 - hankcs/maxent-1
A simple maximum entropy implementation with IIS learning algorithm.
Java0 - hankcs/Java-BloomFilter
A stand-alone Bloom filter implementation written in Java
Java0 - hankcs/Txt2Vec
Txt2Vec is a toolkit to represent text by vector. It's based on Google's word2vec project.
C#0 - hankcs/CRFSharp
CRFSharp is Conditional Random Fields implemented by .NET(C#), a machine learning algorithm for learning from labeled sequences of examples.
C#0 - hankcs/Contest
Contest
C++0 - hankcs/darts-clone-java
A Java port of darts-clone.
Java0 - hankcs/pcrf
Python Linear CRF
Python0