Package: audubon 0.6.3

audubon: Japanese Text Processing Tools

A collection of Japanese text processing tools for filling Japanese iteration marks, Japanese character type conversions, segmentation by phrase, and text normalization which is based on rules for the 'Sudachi' morphological analyzer and the 'NEologd' (Neologism dictionary for 'MeCab'). These features are specific to Japanese and are not implemented in 'ICU' (International Components for Unicode).

Authors:Akiru Kato [cre, aut], Koki Takahashi [cph], Shuhei Iitsuka [cph], Taku Kudo [cph]

audubon_0.6.3.tar.gz
audubon_0.6.3.zip(r-4.7)audubon_0.6.3.zip(r-4.6)audubon_0.6.3.zip(r-4.5)
audubon_0.6.3.tgz(r-4.6-any)audubon_0.6.3.tgz(r-4.5-any)
audubon_0.6.3.tar.gz(r-4.7-any)audubon_0.6.3.tar.gz(r-4.6-any)
audubon_0.6.3.tgz(r-4.6-emscripten)
manual.pdf |manual.html✨
DESCRIPTION |NEWS
card.svg |card.png
audubon/json (API)

# Install 'audubon' in R:

install.packages('audubon', repos = c('https://paithiov909.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker:https://github.com/paithiov909/audubon/issues

Pkgdown/docs site:https://paithiov909.github.io

Datasets:

hiroba - Whole tokens of 'Porano no Hiroba' written by Miyazawa Kenji from Aozora Bunko
polano - Whole text of 'Porano no Hiroba' written by Miyazawa Kenji from Aozora Bunko

On CRAN:

japanese javascript

5.47 score 11 stars 1 packages 3 scripts 492 downloads 17 exports 32 dependencies

Last updated from:bc553b3c38. Checks:9 OK. Indexed: yes.

Target	Result	Time
linux-devel-x86_64	OK	138
source / vignettes	OK	193
linux-release-x86_64	OK	141
macos-release-arm64	OK	84
macos-oldrel-arm64	OK	91
windows-devel	OK	83
windows-release	OK	72
windows-oldrel	OK	75
wasm-release	OK	125

Exports:default_format label_date_jp label_date_jp_gen label_wrap_jp label_wrap_jp_gen read_rewrite_def strj_fill_iter_mark strj_hiraganize strj_katakanize strj_normalize strj_parse_date strj_rewrite_as_def strj_romanize strj_segment strj_tinyseg strj_tokenize strj_transcribe_num

Dependencies:bit bit64 cli clipr cpp11 crayon curl dplyr generics glue hms jsonlite lifecycle magrittr pillar pkgconfig prettyunits progress purrr R6 Rcpp readr rlang stringi tibble tidyselect tzdb utf8 V8 vctrs vroom withr

Citation

Development and contributors

Readme and manuals

Help Manual

Help page	Topics
Default Japanese date format	default_format
Whole tokens of 'Porano no Hiroba' written by Miyazawa Kenji from Aozora Bunko	hiroba
Japanese date labeller for ggplot2	label_date_jp label_date_jp_gen
Japanese word-wrapping labeller for ggplot2	label_wrap_jp label_wrap_jp_gen
Whole text of 'Porano no Hiroba' written by Miyazawa Kenji from Aozora Bunko	polano
Read rewrite definition file	read_rewrite_def
Fill Japanese iteration marks	strj_fill_iter_mark
Convert text following the rules of 'NEologd'	strj_normalize
Parse Japanese calendar dates	strj_parse_date
Rewrite Japanese text using normalization rules	strj_rewrite_as_def
Romanize Japanese text	strj_romanize
Tokenize Japanese text	strj_segment strj_tinyseg strj_tokenize
Transcribe integers into Japanese kanji numerals	strj_transcribe_num
Convert Japanese kana characters	strj-hira-kana strj_hiraganize strj_katakanize