paithiov909

audubon - Japanese Text Processing Tools

A collection of Japanese text processing tools for filling Japanese iteration marks, Japanese character type conversions, segmentation by phrase, and text normalization which is based on rules for the 'Sudachi' morphological analyzer and the 'NEologd' (Neologism dictionary for 'MeCab'). These features are specific to Japanese and are not implemented in 'ICU' (International Components for Unicode).

japanese javascript

5.47 score 11 stars 1 dependent 3 scripts 885 downloads

gibasa - An Alternative 'Rcpp' Wrapper of 'MeCab'

A plain 'Rcpp' wrapper for 'MeCab' that can segment Chinese, Japanese, and Korean text into tokens. The main goal of this package is to provide an alternative to 'tidytext' using morphological analysis.

mecab pos-tagging rcpp cpp

4.77 score 17 stars 3 scripts 618 downloads

skiagd - Creative Coding Pipeline for R

A toy R wrapper for 'rust-skia' <https://github.com/rust-skia/rust-skia> (the Rust crate 'skia_safe' <https://rust-skia.github.io/doc/skia_safe/>, a binding for 'Skia' <https://skia.org/>).

graphics savvy rust cargo quarto fontconfig freetype

4.66 score 4 stars 10 scripts

RMeCab - Interface to 'MeCab'

Parses Japanese texts with 'MeCab'. The original 'MeCab' is licensed under the BSD 3-Clause "New" or "Revised" License. See the "LICENSE.note" file for its license notice.

mecab cpp

3.59 score 156 scripts

aznyan - Image Filters with 'OpenCV'

Offers image filters wrapping 'OpenCV' <https://opencv.org/>, ported from <https://github.com/5PB-3-4/AviUtl_OpenCV_Scripts>.

opencv cpp

3.20 score 2 stars 1 script

rasengan - Generation of Geometric Curves

Provides functions to generate and sample geometric curves. Each function returns a data frame of 2D coordinates, suitable for visualization or further geometric processing.

cpp

2.85 score 1 script

rravif - AVIF Image Encoder

Encodes images in AVIF format with the 'ravif' Rust crate.

rust cargo

2.70 score 1 star 2 scripts

jprailway - Dataset of Japanese Railway

Provides an extended dataset of Japanese railways, revised from <https://github.com/Seo-4d696b75/station_database>. The original dataset is sourced from <https://www.ekidata.jp/>, the digital national land information download site, or other resources, and is licensed under 'CC BY 4.0' <https://creativecommons.org/licenses/by/4.0/>.

2.70 score 2 stars

convlog - Read Mahjong Logs From 'tenhou.net/6' Format

Offers wrappers for the 'convlog' crate from 'mjai-reviewer' <https://github.com/Equim-chan/mjai-reviewer> that can directly read mahjong logs from 'tenhou.net/6' format into tibbles.

rust cargo

2.60 score 2 stars 3 scripts

pipian - Tiny Interface to CaboCha for R

A tiny interface to 'CaboCha', a Japanese dependency structure parser. The main goal of this package is to implement a parser for its XML output.

cabocha cpp

2.60 score 4 stars 1 script

ldccr - Utilities for Various Japanese Corpora

The goal of the ldccr package is to make Japanese language resources easy to use. It provides parsers for several Japanese corpora that are free or openly licensed, and a downloader for zipped text files published on Aozora Bunko.

cpp

2.54 score 1 star 1 script

pnglitchr - PNG Glitching in R

Offers a thin wrapper around <https://github.com/chikoski/png-glitch>, a library to glitch PNG images.

rust cargo

2.48 score 1 script

sudachir2 - R Wrapper for 'sudachi.rs'

Offers bindings to 'sudachi.rs' <https://github.com/WorksApplications/sudachi.rs>, a Rust implementation of the 'Sudachi' Japanese morphological analyzer.

pos-tagging rust cargo

2.48 score 3 stars 3 scripts

nativeshadr - Introduction of 'HLSL' Syntax to 'Rcpp'

Brings 'HLSL'-like syntax to 'Rcpp' code via the 'HLSL++' library <https://github.com/redorav/hlslpp>.

rcpp shaders cpp

2.48 score 2 stars 3 scripts

shikakusphere - Miscellaneous Functions for Japanese Mahjong

A collection of miscellaneous functions for Japanese mahjong that wraps C++ sources of 'shanten-number' <https://github.com/tomohxx/shanten-number> and 'cmajiang' <https://github.com/TadaoYamaoka/cmajiang>.

mahjong rcpp cpp

2.30 score 4 stars 5 scripts

pipopaplot - Seamless Data Sonification with 'ggplot2'

Provides a simple framework for data sonification in R, mapping 'ggplot2'-style aesthetics to MIDI events. Designed for creative exploration rather than strict auditory data analysis.

midi sonification cpp

2.30 score 1 star 5 scripts

mixboxr - Color Blending with 'Mixbox'

Offers a blending method for natural color mixing, using the C/C++ implementation of the 'Mixbox' library <https://github.com/scrtwpns/mixbox>.

cpp

2.18 score 1 star 1 script

jisx0402 - Datasets Related to 'JIS X 0402:2020'

Provides datasets for handling Japanese municipality codes defined in 'JIS X 0402' and 'JIS X 0401'.

2.18 score 3 stars

vibrrt - An R Wrapper for 'vibrato'

An R wrapper for 'vibrato' <https://github.com/daac-tools/vibrato>, a Rust reimplementation of 'MeCab' for fast tokenization.

pos-tagging rust cargo

2.00 score 1 script

kelpbeds - Dictionary Tool for 'MeCab'

Provides the source 'IPAdic' for 'MeCab'.

1.70 score

gsdmm - GSDMM Short Text Clustering via Dirichlet Mixture Models

This package implements a Dirichlet Mixture Model and an accompanying Gibbs sampler for short text clustering, as proposed by Yin and Wang (2014).

cpp

1.70 score 1 script