상세 컨텐츠

본문 제목

한국어 임베딩 책에 소개된 오픈소스들

카테고리 없음

by 그라치에 2020. 7. 30. 15:16

본문

한국어 임베딩 책에서 발췌

- TensorFlow : https://www.tensorflow.org

TensorFlow

모두를 위한 엔드 투 엔드 오픈소스 머신러닝 플랫폼입니다. 도구, 라이브러리, 커뮤니티 리소스로 구성된 TensorFlow의 유연한 환경입니다.

www.tensorflow.org

- Gensim : https://radimrehurek.com/gensim

gensim: topic modelling for humans

Efficient topic modelling in Python

radimrehurek.com

- FastText : https://fasttext.cc

fastText

Library for efficient text classification and representation learning

fasttext.cc

- GloVe : https://nlp.stanford.edu/projects/glove

GloVe: Global Vectors for Word Representation

GloVe is an unsupervised learning algorithm for obtaining vector representations for words. Training is performed on aggregated global word-word co-occurrence statistics from a corpus, and the resulting representations showcase interesting linear substruct

nlp.stanford.edu

- Swivel : https://github.com/tensorflow/models/tree/master/research/swivel

tensorflow/models

Models and examples built with TensorFlow. Contribute to tensorflow/models development by creating an account on GitHub.

github.com

- ELMo : https://allennlp.org/elmo

AllenNLP

AllenNLP is a free, open-source natural language processing platform for building state of the art models.

allennlp.org

- BERT : https://github.com/google-research/bert

google-research/bert

TensorFlow code and pre-trained models for BERT. Contribute to google-research/bert development by creating an account on GitHub.

github.com

- Scikit-Learn : https://scikit-learn.org

scikit-learn: machine learning in Python — scikit-learn 0.16.1 documentation

scikit-learn.org

- KoNLPy : https://konlpy.org/en/latest

KoNLPy: Korean NLP in Python — KoNLPy 0.5.2 documentation

KoNLPy: Korean NLP in Python KoNLPy (pronounced “ko en el PIE”) is a Python package for natural language processing (NLP) of the Korean language. For installation directions, see here. For users new to NLP, go to Getting started. For step-by-step instr

konlpy.org

- Mecab : https://eunjeon.blogspot.com

          은전한닢 프로젝트

은전한닢 프로젝트: 오픈 소스 한국어 / 한글 형태소 분석기 Lucene/Solr, ElasticSearch 플러그인

eunjeon.blogspot.com

- soynlp : https://github.com/lovit/soynlp

lovit/soynlp

한국어 자연어처리를 위한 파이썬 라이브러리입니다. 단어 추출/ 토크나이저 / 품사판별/ 전처리의 기능을 제공합니다. - lovit/soynlp

github.com

- Khaiii : https://tech.kakao.com/2018/12/13/khaiii

kakao의 오픈소스 Ep9 - Khaiii : 카카오의 딥러닝 기반 형태소 분석기

“카카오의 오픈소스를 소개합니다” 아홉 번째는 jamie.lim과 자연어 처리 파트 동료들이 함께 개발한 khaiii(Kakao Hangul Analyzer III)입니다. khaiii는 세종 코퍼스를 이용하여 CNN(Convolutional Neural Network,

tech.kakao.com

- Bokeh : https://bokeh.pydata.org

Bokeh 2.1.1 Documentation

docs.bokeh.org

- sentencepiece : https://github.com/google/sentencepiece

google/sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation. - google/sentencepiece

github.com

댓글 영역