Package: gibasa 1.1.2
gibasa: An Alternative 'Rcpp' Wrapper of 'MeCab'
A plain 'Rcpp' wrapper for 'MeCab' that can segment Chinese, Japanese, and Korean text into tokens. The main goal of this package is to provide an alternative to 'tidytext' using morphological analysis.
Authors:
gibasa_1.1.2.tar.gz
gibasa_1.1.2.zip (r-4.5), gibasa_1.1.2.zip (r-4.4), gibasa_1.1.2.zip (r-4.3)
gibasa_1.1.2.tgz (r-4.5-x86_64), gibasa_1.1.2.tgz (r-4.5-arm64), gibasa_1.1.2.tgz (r-4.4-x86_64), gibasa_1.1.2.tgz (r-4.4-arm64), gibasa_1.1.2.tgz (r-4.3-x86_64), gibasa_1.1.2.tgz (r-4.3-arm64)
gibasa_1.1.2.tar.gz (r-4.5-noble), gibasa_1.1.2.tar.gz (r-4.4-noble)
gibasa_1.1.2.tgz (r-4.4-emscripten), gibasa_1.1.2.tgz (r-4.3-emscripten)
gibasa.pdf | gibasa.html
gibasa/json (API)
NEWS
# Install 'gibasa' in R:
install.packages('gibasa', repos = c('https://paithiov909.r-universe.dev', 'https://cloud.r-project.org'))
Bug tracker: https://github.com/paithiov909/gibasa/issues
Pkgdown site: https://paithiov909.github.io
Datasets:
- ginga: Whole text of 'Ginga Tetsudo no Yoru' written by Miyazawa Kenji from Aozora Bunko
Last updated 4 days ago from: c5f4dfd53d. Checks: 5 OK, 6 NOTE. Indexed: yes.
Target | Result | Latest binary |
---|---|---|
Doc / Vignettes | OK | Feb 16 2025 |
R-4.5-win-x86_64 | OK | Feb 16 2025 |
R-4.5-mac-x86_64 | OK | Feb 16 2025 |
R-4.5-mac-aarch64 | OK | Feb 16 2025 |
R-4.5-linux-x86_64 | OK | Feb 16 2025 |
R-4.4-win-x86_64 | NOTE | Feb 16 2025 |
R-4.4-mac-x86_64 | NOTE | Feb 16 2025 |
R-4.4-mac-aarch64 | NOTE | Feb 16 2025 |
R-4.3-win-x86_64 | NOTE | Feb 16 2025 |
R-4.3-mac-x86_64 | NOTE | Feb 16 2025 |
R-4.3-mac-aarch64 | NOTE | Feb 16 2025 |
Exports: as_tokens, bind_lr, bind_tf_idf2, build_sys_dic, build_user_dic, collapse_tokens, dictionary_info, gbs_tokenize, get_dict_features, get_transition_cost, is_blank, lex_density, mute_tokens, ngram_tokenizer, pack, posDebugRcpp, posParallelRcpp, prettify, tokenize
Dependencies: bit, bit64, cli, clipr, cpp11, crayon, dplyr, fansi, generics, glue, hms, lattice, lifecycle, magrittr, Matrix, pillar, pkgconfig, prettyunits, progress, R6, Rcpp, RcppParallel, readr, rlang, stringi, tibble, tidyselect, tzdb, utf8, vctrs, vroom, withr
Readme and manuals
Help Manual
Help page | Topics |
---|---|
Create a list of tokens | as_tokens |
Bind importance of bigrams | bind_lr |
Bind term frequency and inverse document frequency | bind_tf_idf2 |
Build system dictionary | build_sys_dic |
Build user dictionary | build_user_dic |
Collapse sequences of tokens by condition | collapse_tokens |
Get dictionary information | dictionary_info |
Tokenize sentences using 'MeCab' | gbs_tokenize |
Get dictionary features | get_dict_features |
Get transition cost between pos attributes | get_transition_cost |
Whole text of 'Ginga Tetsudo no Yoru' written by Miyazawa Kenji from Aozora Bunko | ginga |
Check if scalars are blank | is_blank |
Calculate lexical density | lex_density |
Mute tokens by condition | mute_tokens |
Ngrams tokenizer | ngram_tokenizer |
Pack a data.frame of tokens | pack |
Prettify tokenized output | prettify |
Tokenize sentences using 'MeCab' | tokenize |