![](https://github.com/paithiov909/gibasa/raw/HEAD/man/figures/tidytext_fig5_1_mod.drawio.png)
gibasa - An Alternative 'Rcpp' Wrapper of 'MeCab'
A plain 'Rcpp' wrapper of 'MeCab' that can segment Chinese, Japanese, and Korean text into tokens. The main goal of this package is to provide an alternative to 'tidytext' using morphological analysis.
Last updated 22 days ago
mecab, pos-tagging, rcpp
14 stars 2.61 score 33 dependencies
![](https://github.com/paithiov909/audubon/raw/HEAD/man/figures/logo.png)
audubon - Japanese Text Processing Tools
A collection of Japanese text processing tools for filling Japanese iteration marks, converting Japanese character types, segmenting text by phrase, and normalizing text based on the rules for the 'Sudachi' morphological analyzer and 'NEologd' (Neologism dictionary for 'MeCab'). These features are specific to Japanese and are not implemented in 'ICU' (International Components for Unicode).
Last updated 2 months ago
japanese, javascript
8 stars 2.21 score 38 dependencies 2 dependents
![](https://github.com/paithiov909/pipian/raw/HEAD/man/figures/logo.png)
pipian - Tiny Interface to CaboCha for R
A tiny interface to 'CaboCha', a Japanese dependency structure parser. The main goal of this package is to implement a parser for its XML output.
Last updated 3 months ago
cabocha
4 stars 1.08 score 33 dependencies
ldccr - Utilities for Various Japanese Corpora
The goal of the ldccr package is to make Japanese language resources easy to use. This package provides parsers for several Japanese corpora that are free or openly licensed, and a downloader for zipped text files published on Aozora Bunko.
Last updated 21 days ago
1 star 1.00 score 35 dependencies
jprailway - Dataset of Japanese Railway
Provides an extended dataset of Japanese railway revised from <https://github.com/Seo-4d696b75/station_database>. The original dataset is sourced from <https://www.ekidata.jp/>, the digital national land information download site, or other resources, and licensed under 'CC BY 4.0' <https://creativecommons.org/licenses/by/4.0/>.
Last updated 21 days ago
1 star 1.00 score 13 dependencies
baritsu - Wrappers for 'mlpack'
A collection of wrappers for the 'mlpack' package that accept a formula as an argument.
Last updated 2 months ago
tidymodels
3 stars 1.00 score 45 dependencies
jisx0402 - Datasets Related to 'JIS X 0402:2020'
Provides datasets for handling Japanese municipality codes defined in 'JIS X 0402' and 'JIS X 0401'.
Last updated 6 months ago
3 stars 1.00 score 12 dependencies
![](https://github.com/paithiov909/apportita/raw/HEAD/man/figures/logo.png)
apportita - Utility for Handling 'magnitude' Word Embeddings
A partial R port of 'magnitude', a fast, simple utility library for handling vector embeddings. The main goal of this package is to enable access to a user's local magnitude data store.
Last updated 6 months ago
embeddings
1 star 0.71 score 39 dependencies
RNGT - Wrappers for 'NGT'
Wrappers for 'NGT' (Neighborhood Graph and Tree for indexing high-dimensional data) which performs high-speed approximate nearest neighbor searches against a large volume of data in high dimensional vector data space.
Last updated 6 months ago
approximate-nearest-neighbor-search, rcpp
1 star 0.61 score 13 dependencies
RcppMeCab - 'Rcpp' Wrapper for 'MeCab' Library
An R package based on 'Rcpp' for 'MeCab': Yet Another Part-of-Speech and Morphological Analyzer. The purpose of this package is to provide a seamless environment for developing with and analyzing CJK texts. It uses parallel programming to provide a highly efficient text preprocessing function, 'posParallel()'.
Last updated 8 months ago
0.61 score 34 dependencies
![](https://github.com/paithiov909/tangela/raw/HEAD/man/figures/logo.png)
tangela - rJava Interface to Kuromoji
An rJava wrapper of atilika/kuromoji (v0.7.7).
Last updated 8 months ago
kuromoji, rjava
1 star 0.36 score 40 dependencies
kelpbeds - Dictionary Tool for 'MeCab'
Provides the source 'IPAdic' for 'MeCab'.
Last updated 3 months ago
0.09 score 25 dependencies
![](https://github.com/paithiov909/sudachir/raw/HEAD/man/figures/logo.png)
sudachir - R Interface to 'Sudachi'
Interface to 'Sudachi' <https://github.com/WorksApplications/sudachi.rs>, a Japanese morphological analyzer. This is a port of what is available in Python.
Last updated 1 year ago
0.09 score 46 dependencies