Package: ldccr 2026.05.23
ldccr: Utilities for Various Japanese Corpora
The goal of the ldccr package is to make Japanese language resources easy to use. It provides parsers for several free or openly licensed Japanese corpora, and a downloader for the zipped text files published on Aozora Bunko.
Downloads:
- Source: ldccr_2026.05.23.tar.gz
- Windows binaries: ldccr_2026.05.23.zip (r-4.7), ldccr_2026.05.23.zip (r-4.6), ldccr_2026.05.23.zip (r-4.5)
- macOS binaries: ldccr_2026.05.23.tgz (r-4.6-x86_64), ldccr_2026.05.23.tgz (r-4.6-arm64), ldccr_2026.05.23.tgz (r-4.5-x86_64), ldccr_2026.05.23.tgz (r-4.5-arm64)
- Linux binaries: ldccr_2026.05.23.tar.gz (r-4.7-arm64), ldccr_2026.05.23.tar.gz (r-4.7-x86_64), ldccr_2026.05.23.tar.gz (r-4.6-arm64), ldccr_2026.05.23.tar.gz (r-4.6-x86_64)
- WebAssembly binary: ldccr_2026.05.23.tgz (r-4.6-emscripten)
- Manual: manual.pdf | manual.html
- Package card: card.svg | card.png
- Metadata: ldccr/json (API)
Install 'ldccr' in R:

    install.packages('ldccr', repos = c('https://paithiov909.r-universe.dev', 'https://cloud.r-project.org'))
Bug tracker: https://github.com/paithiov909/ldccr/issues
Pkgdown/docs site: https://paithiov909.github.io
Datasets:
- AozoraBunkoSnapshot - Metadata of text files published on Aozora Bunko
- NekoText - Whole text of ‘Wagahai Wa Neko Dearu’ written by Natsume Souseki from Aozora Bunko
Last updated from: b6e8af704d. Checks: 13 OK. Indexed: yes.
| Target | Result | Time |
|---|---|---|
| linux-devel-arm64 | OK | 174 |
| linux-devel-x86_64 | OK | 201 |
| source / vignettes | OK | 175 |
| linux-release-arm64 | OK | 156 |
| linux-release-x86_64 | OK | 151 |
| macos-release-arm64 | OK | 104 |
| macos-release-x86_64 | OK | 236 |
| macos-oldrel-arm64 | OK | 104 |
| macos-oldrel-x86_64 | OK | 165 |
| windows-devel | OK | 127 |
| windows-release | OK | 113 |
| windows-oldrel | OK | 116 |
| wasm-release | OK | 123 |
Exports: clean_emoji, clean_url, download_unidic, jrte_rte_files, ldnws_categories, parse_jrte_reasoning, read_aozora, read_ja_text8, read_jrte, read_ldnws, sqids, unidic_availables, unsqids
Dependencies: bit, bit64, cachem, cli, clipr, cpp11, crayon, dplyr, fastmap, generics, glue, hms, lifecycle, magrittr, memoise, pillar, pkgconfig, prettyunits, progress, purrr, R6, Rcpp, RcppSimdJson, readr, rlang, stringi, tibble, tidyselect, tzdb, utf8, vctrs, vroom, withr
Readme and manuals
Help Manual
| Help page | Topics |
|---|---|
| Metadata of text files published on Aozora Bunko | AozoraBunkoSnapshot |
| Data for Textual Entailment | jrte_rte_files |
| List of categories of the Livedoor News Corpus | ldnws_categories |
| Whole text of ‘Wagahai Wa Neko Dearu’ written by Natsume Souseki from Aozora Bunko | NekoText |
| Parse reasoning column of 'rte.*.tsv' | parse_jrte_reasoning |
| Download text file from Aozora Bunko | read_aozora |
| Read the ja.text8 corpus | read_ja_text8 |
| Read the JRTE Corpus | read_jrte |
| Read the Livedoor News Corpus | read_ldnws |
| Generate random-looking IDs from integer ranks | sqids unsqids |
| Download and unzip 'UniDic' | download_unidic unidic_availables |
| Utility functions | clean_emoji clean_url utils |
