Package: ldccr 2024.10.10

ldccr: Utilities for Various Japanese Corpora

The goal of ldccr package is to make easy to use Japanese language resources. This package provides parsers for several Japanese corpora that are free or open licensed and a downloader of zipped text files published on Aozora Bunko.

Authors:Akiru Kato [aut, cre]

ldccr_2024.10.10.tar.gz
ldccr_2024.10.10.zip(r-4.5)ldccr_2024.10.10.zip(r-4.4)ldccr_2024.10.10.zip(r-4.3)
ldccr_2024.10.10.tgz(r-4.4-any)ldccr_2024.10.10.tgz(r-4.3-any)
ldccr_2024.10.10.tar.gz(r-4.5-noble)ldccr_2024.10.10.tar.gz(r-4.4-noble)
ldccr_2024.10.10.tgz(r-4.4-emscripten)ldccr_2024.10.10.tgz(r-4.3-emscripten)
ldccr.pdf |ldccr.html
ldccr/json (API)

# Install 'ldccr' in R:
install.packages('ldccr', repos = c('https://paithiov909.r-universe.dev', 'https://cloud.r-project.org'))

Peer review:

Bug tracker:https://github.com/paithiov909/ldccr/issues

Datasets:
  • AozoraBunkoSnapshot - Meta data of text files published on Aozora Bunko
  • NekoText - Whole text of ‘Wagahai Wa Neko Dearu’ written by Natsume Souseki from Aozora Bunko

On CRAN:

2.30 score 1 stars 1 scripts 13 exports 35 dependencies

Last updated 3 months agofrom:6b79ddaf25. Checks:3 OK, 4 NOTE. Indexed: yes.

TargetResultLatest binary
Doc / VignettesOKJan 08 2025
R-4.5-winOKJan 08 2025
R-4.5-linuxOKJan 08 2025
R-4.4-winNOTEJan 08 2025
R-4.4-macNOTEJan 08 2025
R-4.3-winNOTEJan 08 2025
R-4.3-macNOTEJan 08 2025

Exports:clean_emojiclean_urldownload_unidicis_within_erajrte_rte_filesldnws_categoriesparse_jrte_reasoningparse_to_jdateread_aozoraread_ja_text8read_jrteread_ldnwsunidic_availables

Dependencies:bitbit64cachemclicliprcpp11crayondplyrfansifastmapgenericsgluehmslifecyclemagrittrmemoisepillarpkgconfigprettyunitsprogresspurrrR6RcppRcppSimdJsonreadrrlangstringitibbletidyselecttzdbutf8vctrsvroomwithryesno