Package: gibasa 1.1.1

gibasa: An Alternative 'Rcpp' Wrapper of 'MeCab'

A plain 'Rcpp' wrapper of 'MeCab' that can segment Chinese, Japanese, and Korean text into tokens. The main goal of this package is to provide an alternative to 'tidytext' using morphological analysis.

Authors:Akiru Kato [aut, cre], Shogo Ichinose [aut], Taku Kudo [aut], Jorge Nocedal [ctb], Nippon Telegraph and Telephone Corporation [cph]

gibasa_1.1.1.tar.gz
gibasa_1.1.1.zip(r-4.5)gibasa_1.1.1.zip(r-4.4)gibasa_1.1.1.zip(r-4.3)
gibasa_1.1.1.tgz(r-4.4-arm64)gibasa_1.1.1.tgz(r-4.4-x86_64)gibasa_1.1.1.tgz(r-4.3-arm64)gibasa_1.1.1.tgz(r-4.3-x86_64)
gibasa_1.1.1.tar.gz(r-4.5-noble)gibasa_1.1.1.tar.gz(r-4.4-noble)
gibasa_1.1.1.tgz(r-4.4-emscripten)gibasa_1.1.1.tgz(r-4.3-emscripten)
gibasa.pdf |gibasa.html
gibasa/json (API)
NEWS

# Install 'gibasa' in R:
install.packages('gibasa', repos = c('https://paithiov909.r-universe.dev', 'https://cloud.r-project.org'))

Peer review:

Bug tracker:https://github.com/paithiov909/gibasa/issues

Uses libs:
  • c++– GNU Standard C++ Library v3
Datasets:
  • ginga - Whole text of 'Ginga Tetsudo no Yoru' written by Miyazawa Kenji from Aozora Bunko

On CRAN:

mecabpos-taggingrcpp

19 exports 14 stars 2.61 score 33 dependencies 213 downloads

Last updated 22 days agofrom:357b0f64db

Exports:as_tokensbind_lrbind_tf_idf2build_sys_dicbuild_user_diccollapse_tokensdictionary_infogbs_tokenizeget_dict_featuresget_transition_costis_blanklex_densitymute_tokensngram_tokenizerpackposDebugRcppposParallelRcppprettifytokenize

Dependencies:bitbit64clicliprcpp11crayondplyrfansigenericsgluehmslatticelifecyclemagrittrMatrixpillarpkgconfigprettyunitsprogresspurrrR6RcppRcppParallelreadrrlangstringitibbletidyselecttzdbutf8vctrsvroomwithr