Package: gibasa 1.1.1.9002
gibasa: An Alternative 'Rcpp' Wrapper of 'MeCab'
A plain 'Rcpp' wrapper of 'MeCab' that can segment Chinese, Japanese, and Korean text into tokens. The main goal of this package is to provide an alternative to 'tidytext' using morphological analysis.
Authors:
gibasa_1.1.1.9002.tar.gz
gibasa_1.1.1.9002.zip(r-4.5)gibasa_1.1.1.9002.zip(r-4.4)gibasa_1.1.1.9002.zip(r-4.3)
gibasa_1.1.1.9002.tgz(r-4.4-x86_64)gibasa_1.1.1.9002.tgz(r-4.4-arm64)gibasa_1.1.1.9002.tgz(r-4.3-x86_64)gibasa_1.1.1.9002.tgz(r-4.3-arm64)
gibasa_1.1.1.9002.tar.gz(r-4.5-noble)gibasa_1.1.1.9002.tar.gz(r-4.4-noble)
gibasa_1.1.1.9002.tgz(r-4.4-emscripten)gibasa_1.1.1.9002.tgz(r-4.3-emscripten)
gibasa.pdf |gibasa.html✨
gibasa/json (API)
NEWS
# Install 'gibasa' in R: |
install.packages('gibasa', repos = c('https://paithiov909.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/paithiov909/gibasa/issues
- ginga - Whole text of 'Ginga Tetsudo no Yoru' written by Miyazawa Kenji from Aozora Bunko
Last updated 2 hours agofrom:7918648291. Checks:OK: 3 NOTE: 6. Indexed: yes.
Target | Result | Date |
---|---|---|
Doc / Vignettes | OK | Nov 17 2024 |
R-4.5-win-x86_64 | OK | Nov 17 2024 |
R-4.5-linux-x86_64 | OK | Nov 17 2024 |
R-4.4-win-x86_64 | NOTE | Nov 17 2024 |
R-4.4-mac-x86_64 | NOTE | Nov 17 2024 |
R-4.4-mac-aarch64 | NOTE | Nov 17 2024 |
R-4.3-win-x86_64 | NOTE | Nov 17 2024 |
R-4.3-mac-x86_64 | NOTE | Nov 17 2024 |
R-4.3-mac-aarch64 | NOTE | Nov 17 2024 |
Exports:as_tokensbind_lrbind_tf_idf2build_sys_dicbuild_user_diccollapse_tokensdictionary_infogbs_tokenizeget_dict_featuresget_transition_costis_blanklex_densitymute_tokensngram_tokenizerpackposDebugRcppposParallelRcppprettifytokenize
Dependencies:bitbit64clicliprcpp11crayondplyrfansigenericsgluehmslatticelifecyclemagrittrMatrixpillarpkgconfigprettyunitsprogressR6RcppRcppParallelreadrrlangstringitibbletidyselecttzdbutf8vctrsvroomwithr
Readme and manuals
Help Manual
Help page | Topics |
---|---|
Create a list of tokens | as_tokens |
Bind importance of bigrams | bind_lr |
Bind term frequency and inverse document frequency | bind_tf_idf2 |
Build system dictionary | build_sys_dic |
Build user dictionary | build_user_dic |
Collapse sequences of tokens by condition | collapse_tokens |
Get dictionary information | dictionary_info |
Tokenize sentences using 'MeCab' | gbs_tokenize |
Get dictionary features | get_dict_features |
Whole text of 'Ginga Tetsudo no Yoru' written by Miyazawa Kenji from Aozora Bunko | ginga |
Check if scalars are blank | is_blank |
Calculate lexical density | lex_density |
Mute tokens by condition | mute_tokens |
Ngrams tokenizer | ngram_tokenizer |
Pack a data.frame of tokens | pack |
Prettify tokenized output | prettify |
Tokenize sentences using 'MeCab' | tokenize |