audubon 0.6.2
- Modified some examples to address additional issues on CRAN. There are no user-facing changes.
audubon 0.6.1
CRAN release: 2025-12-21
New features
- Added `label_wrap_jp()` and `label_wrap_jp_gen()` for Japanese word wrapping in ggplot2 labellers.
- Added `label_date_jp()` and `label_date_jp_gen()` for Japanese calendar date labels in ggplot2.
- Added `strj_parse_date()` to parse Japanese calendar date strings into POSIXct values.
Changes
- Removed the `mecab` and `sudachipy` engines and related arguments from `strj_tokenize()`.
- Removed functions overlapping with those provided by the gibasa package. Users requiring morphological analysis or related features should use gibasa.
audubon 0.5.2
CRAN release: 2024-04-27
- Corrected the probabilistic IDF calculation in `global_idf3`.
- Refactored `bind_tf_idf2`.
  - Changed behavior when `norm=TRUE`. Cosine normalization is now performed on `tf_idf` values as in the RMeCab package.
  - Added `tf="itf"` and `idf="df"` options.
- Refactored `pack` for performance.
audubon 0.5.0
CRAN release: 2023-03-04
- Added `bind_lr` function, which can calculate the ‘LR’ value of bigrams.
- `pack` now always returns a tibble, not a data.frame.
audubon 0.4.0
CRAN release: 2022-12-15
- Added some new functions.
  - `bind_tf_idf2` can calculate and bind the term frequency, inverse document frequency, and tf-idf of a tidy text dataset.
  - `collapse_tokens`, `mute_tokens`, and `lexical_density` can be used for handling a tidy text dataset of tokens.
- `strj_tokenize` now preserves the original order of text names.
- `prettify` now accepts a `delim` argument.
audubon 0.3.0
CRAN release: 2022-07-22
- Updated `strj_fill_iter_mark` function.
  - `strj_fill_iter_mark` now replaces a sequence of iteration marks recursively.
- Updated `strj_tokenize` function.
  - `strj_tokenize` now accepts an `engine` argument to switch tokenizers for splitting text into tokens.
audubon 0.2.0
CRAN release: 2022-05-24
- Updated `ngram_tokenizer` function.
- Added a wrapper function for the ‘TinySegmenter’ written by Taku Kudo.
