Package: textdata 0.4.5.9000
textdata: Download and Load Various Text Datasets
Provides a framework to download, parse, and store text datasets on disk and load them when needed. Includes various sentiment lexicons and labeled text datasets for classification and analysis.
textdata_0.4.5.9000.tar.gz
textdata_0.4.5.9000.zip (r-4.5) | textdata_0.4.5.9000.zip (r-4.4) | textdata_0.4.5.9000.zip (r-4.3)
textdata_0.4.5.9000.tgz (r-4.4-any) | textdata_0.4.5.9000.tgz (r-4.3-any)
textdata_0.4.5.9000.tar.gz (r-4.5-noble) | textdata_0.4.5.9000.tar.gz (r-4.4-noble)
textdata_0.4.5.9000.tgz (r-4.4-emscripten) | textdata_0.4.5.9000.tgz (r-4.3-emscripten)
textdata.pdf | textdata.html
textdata/json (API)
NEWS
# Install 'textdata' in R:
install.packages('textdata', repos = c('https://emilhvitfeldt.r-universe.dev', 'https://cloud.r-project.org'))
Bug tracker: https://github.com/emilhvitfeldt/textdata/issues
Last updated 6 months ago from: 7a99a97b4e. Checks: OK: 7. Indexed: yes.
Target | Result | Date |
---|---|---|
Doc / Vignettes | OK | Oct 25 2024 |
R-4.5-win | OK | Oct 25 2024 |
R-4.5-linux | OK | Oct 25 2024 |
R-4.4-win | OK | Oct 25 2024 |
R-4.4-mac | OK | Oct 25 2024 |
R-4.3-win | OK | Oct 25 2024 |
R-4.3-mac | OK | Oct 25 2024 |
Exports: cache_info, catalogue, dataset_ag_news, dataset_dbpedia, dataset_imdb, dataset_sentence_polarity, dataset_trec, embedding_glove27b, embedding_glove42b, embedding_glove6b, embedding_glove840b, lexicon_afinn, lexicon_bing, lexicon_loughran, lexicon_nrc, lexicon_nrc_eil, lexicon_nrc_vad, load_dataset
Dependencies: bit, bit64, cli, clipr, cpp11, crayon, fansi, fs, glue, hms, lifecycle, magrittr, pillar, pkgconfig, prettyunits, progress, R6, rappdirs, readr, rlang, tibble, tidyselect, tzdb, utf8, vctrs, vroom, withr
Readme and manuals
Help Manual
Help page | Topics |
---|---|
List folders and their sizes in cache | cache_info |
Catalogue of all available data sources | catalogue |
AG's News Topic Classification Dataset | dataset_ag_news |
DBpedia Ontology Dataset | dataset_dbpedia |
IMDB Large Movie Review Dataset | dataset_imdb |
v1.0 sentence polarity dataset | dataset_sentence_polarity |
TREC dataset | dataset_trec |
Global Vectors for Word Representation | embedding_glove embedding_glove27b embedding_glove42b embedding_glove6b embedding_glove840b |
AFINN-111 dataset | lexicon_afinn |
Bing sentiment lexicon | lexicon_bing |
Loughran-McDonald sentiment lexicon | lexicon_loughran |
NRC word-emotion association lexicon | lexicon_nrc |
NRC Emotion Intensity Lexicon (aka Affect Intensity Lexicon) v0.5 | lexicon_nrc_eil |
The NRC Valence, Arousal, and Dominance Lexicon | lexicon_nrc_vad |
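
The help topics above can be tried out with a short session like the one below. This is a minimal sketch, not the package's documented workflow: `load_afinn_safely` is a hypothetical helper introduced here for illustration, and the sketch assumes that the first call to `lexicon_afinn()` asks you to accept the dataset's license interactively, which is why it is skipped in non-interactive sessions or when textdata is not installed.

```r
# Hypothetical helper (not part of textdata): returns the AFINN lexicon,
# or NULL when textdata is not installed or the session is non-interactive.
load_afinn_safely <- function() {
  if (!requireNamespace("textdata", quietly = TRUE) || !interactive()) {
    return(NULL)
  }
  # Assumption: downloads on first use, then reads from the local cache.
  textdata::lexicon_afinn()
}

afinn <- load_afinn_safely()

if (!is.null(afinn)) {
  # Peek at the first few rows of the lexicon.
  print(head(afinn))
  # List the cached folders and their sizes (see the cache_info help page).
  print(textdata::cache_info())
}
```

The guard keeps the script from stopping in batch jobs, where a license prompt cannot be answered. The other `lexicon_*`, `dataset_*`, and `embedding_*` functions listed in the table can be substituted in the same place, though some of them download considerably larger files.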