nlp.md

July 15, 2021

Bookmarks tagged [nlp]

www.codever.land/bookmarks/t/nlp

getlang

https://github.com/rylans/getlang

Fast natural language detection package.


go-eco

https://github.com/ThePaw/go-eco

Similarity, dissimilarity and distance matrices; diversity, equitability and inequality measures; species richness estimators; coenocline models.


go-i18n

https://github.com/nicksnyder/go-i18n/

Package and an accompanying tool to work with localized text.


go-mystem

https://github.com/dveselov/mystem

CGo bindings to Yandex.Mystem, a Russian morphological analyzer.


go-nlp

https://github.com/nuance/go-nlp

Utilities for working with discrete probability distributions and other tools useful for doing NLP work.


go-pinyin

https://github.com/mozillazg/go-pinyin

Chinese Hanzi to Hanyu Pinyin converter.


go-stem

https://github.com/agonopol/go-stem

Implementation of the Porter stemming algorithm.


go-unidecode

https://github.com/mozillazg/go-unidecode

ASCII transliterations of Unicode text.


go2vec

https://github.com/danieldk/go2vec

Reader and utility functions for word2vec embeddings.


gojieba

https://github.com/yanyiwu/gojieba

A Go implementation of jieba, a Chinese word segmentation algorithm.


golibstemmer

https://github.com/rjohnsondev/golibstemmer

Go bindings for the Snowball libstemmer library, including Porter 2.


gotokenizer

https://github.com/xujiajun/gotokenizer

A tokenizer for Go based on a dictionary and bigram language models. (Currently supports only Chinese segmentation.)


gounidecode

https://github.com/fiam/gounidecode

Unicode transliterator (also known as unidecode) for Go.


gse

https://github.com/go-ego/gse

Efficient text segmentation in Go; supports English, Chinese, Japanese, and other languages.


icu

https://github.com/goodsign/icu

Cgo binding for the detection and conversion functions of the icu4c C library. Guaranteed compatibility with version 50.1.


kagome

https://github.com/ikawaha/kagome

Japanese morphological analyzer written in pure Go.


libtextcat

https://github.com/goodsign/libtextcat

Cgo binding for the libtextcat C library. Guaranteed compatibility with version 2.2.


MMSEGO

https://github.com/awsong/MMSEGO

A Go implementation of MMSEG, a Chinese word segmentation algorithm.


nlp

https://github.com/Shixzie/nlp

Extract values from strings and fill your structs with nlp.


nlp

https://github.com/james-bowman/nlp

Go Natural Language Processing library supporting LSA (Latent Semantic Analysis).


paicehusk

https://github.com/rookii/paicehusk

Golang implementation of the Paice/Husk Stemming Algorithm.


petrovich

https://github.com/striker2000/petrovich

Petrovich is a library that inflects Russian names to a given grammatical case.


porter

https://github.com/a2800276/porter

This is a fairly straightforward port of Martin Porter's C implementation of the Porter stemming algorithm.


porter2

https://github.com/zhenjl/porter2

Really fast Porter 2 stemmer.


prose

https://github.com/jdkato/prose

Library for text processing that supports tokenization, part-of-speech tagging, named-entity extraction, and more.


RAKE.go

https://github.com/Obaied/RAKE.go

Go port of the Rapid Automatic Keyword Extraction Algorithm (RAKE).


segment

https://github.com/blevesearch/segment

Go library for performing Unicode Text Segmentation as described in Unicode Standard Annex 29.


sentences

https://github.com/neurosnap/sentences

Sentence tokenizer: converts text into a list of sentences.


shamoji

https://github.com/osamingo/shamoji

shamoji is a word filtering package written in Go.


snowball

https://github.com/goodsign/snowball

Snowball stemmer port (cgo wrapper) for Go. Provides word stem extraction using the native Snowball library.


stemmer

https://github.com/dchest/stemmer

Stemmer packages for the Go programming language. Includes English and German stemmers.


textcat

https://github.com/pebbe/textcat

Go package for n-gram based text categorization, with support for UTF-8 and raw text.


whatlanggo

https://github.com/abadojack/whatlanggo

Natural language detection package for Go. Supports 84 languages and 24 scripts (writing systems such as Latin and Cyrillic).


when

https://github.com/olebedev/when

Natural language date/time parser for English and Russian, with pluggable rules.


retext

https://github.com/wooorm/retext

An extensible natural language system.


franc

https://github.com/wooorm/franc

Detect the language of text.


leven

https://github.com/sindresorhus/leven

Measure the difference between two strings using the Levenshtein distance algorithm.


natural

https://github.com/NaturalNode/natural

Natural language facility.


CogCompNLP

https://github.com/CogComp/cogcomp-nlp

Provides common annotators for plain text input.


CoreNLP

https://nlp.stanford.edu/software/corenlp.shtml

Provides a set of fundamental tools for tasks like tagging, named entity recognition, and sentiment analysis.


DKPro

https://dkpro.github.io

Collection of reusable NLP tools for linguistic pre-processing, machine learning, lexical resources, etc.


Lingua

https://github.com/pemistahl/lingua

Natural language detection library, especially suited for short paragraphs of text.


LingPipe

http://alias-i.com/lingpipe

Toolkit for tasks ranging from POS tagging to sentiment analysis.


awesome-speech-language-processing

https://github.com/edobashira/speech-language-processing#readme

A curated list of speech and natural language processing resources.


awesome-spanish-nlp

https://github.com/dav009/awesome-spanish-nlp#readme

Curated list of linguistic resources for doing NLP and CL on Spanish.


awesome-nlp-with-ruby

https://github.com/arbox/nlp-with-ruby#readme

Curated list: practical Natural Language Processing done in Ruby.


awesome-qa

https://github.com/seriousmac/awesome-qa#readme

A curated list of Question Answering (QA) resources.