Package: ldccr 2024.07.07

ldccr: Utilities for Various Japanese Corpora

The goal of ldccr package is to make easy to use Japanese language resources. This package provides parsers for several Japanese corpora that are free or open licensed and a downloader of zipped text files published on Aozora Bunko.

Authors:Akiru Kato [aut, cre]

ldccr_2024.07.07.tar.gz
ldccr_2024.07.07.zip(r-4.5)ldccr_2024.07.07.zip(r-4.4)ldccr_2024.07.07.zip(r-4.3)
ldccr_2024.07.07.tgz(r-4.4-any)ldccr_2024.04.24.tgz(r-4.4-any)ldccr_2024.07.07.tgz(r-4.3-any)ldccr_2024.04.24.tgz(r-4.3-any)
ldccr_2024.07.07.tar.gz(r-4.5-noble)ldccr_2024.07.07.tar.gz(r-4.4-noble)
ldccr_2024.07.07.tgz(r-4.4-emscripten)ldccr_2024.07.07.tgz(r-4.3-emscripten)
ldccr.pdf |ldccr.html
ldccr/json (API)

# Install 'ldccr' in R:
install.packages('ldccr', repos = c('https://paithiov909.r-universe.dev', 'https://cloud.r-project.org'))

Peer review:

Bug tracker:https://github.com/paithiov909/ldccr/issues

Datasets:
  • AozoraBunkoSnapshot - Meta data of text files published on Aozora Bunko
  • NekoText - Whole text of ‘Wagahai Wa Neko Dearu’ written by Natsume Souseki from Aozora Bunko

On CRAN:

13 exports 1 stars 1.00 score 35 dependencies

Last updated 21 days agofrom:52c44cee7d

Exports:clean_emojiclean_urldownload_unidicis_within_erajrte_rte_filesldnws_categoriesparse_jrte_reasoningparse_to_jdateread_aozoraread_ja_text8read_jrteread_ldnwsunidic_availables

Dependencies:bitbit64cachemclicliprcpp11crayondplyrfansifastmapgenericsgluehmslifecyclemagrittrmemoisepillarpkgconfigprettyunitsprogresspurrrR6RcppRcppSimdJsonreadrrlangstringitibbletidyselecttzdbutf8vctrsvroomwithryesno