yardstick - Tidy Characterizations of Model Performance
Tidy tools for quantifying how well a model fits a data set, such as confusion matrices, class probability curve summaries, and regression metrics (e.g., RMSE).
Last updated 24 days ago
15.44 score 369 stars 56 packages 2.1k scripts 36k downloads
paletteer - Comprehensive Collection of Color Palettes
The choice of color palettes in R can be quite overwhelming, with palettes spread over many packages with many different APIs. This package aims to collect all color palettes across the R ecosystem under the same package with a streamlined API.
Last updated 5 months ago
color-palette, palettes
13.94 score 938 stars 21 packages 6.6k scripts 186k downloads
prismatic - Color Manipulation Tools
Manipulate and visualize colors in an intuitive, low-dependency, and functional way.
Last updated 5 days ago
color, color-manipulation, colour
11.74 score 138 stars 28 packages 384 scripts 163k downloads
lime - Local Interpretable Model-Agnostic Explanations
When building complex models, it is often difficult to explain why the model should be trusted. While global measures such as accuracy are useful, they cannot be used for explaining why a model made a specific prediction. 'lime' (a port of the 'lime' 'Python' package) is a method for explaining the outcome of black box models by fitting a local model around the point in question and perturbations of this point. The approach is described in more detail in the article by Ribeiro et al. (2016) <arXiv:1602.04938>.
Last updated 2 years ago
caret, model-checking, model-evaluation, modeling
11.28 score 484 stars 1 package 720 scripts 3.0k downloads
textrecipes - Extra 'Recipes' for Text Processing
Converting text to numerical features requires specifically created procedures, which are implemented as steps according to the 'recipes' package. These steps allow for tokenization, filtering, counting (tf and tfidf), and feature hashing.
Last updated 13 days ago
10.77 score 160 stars 1 package 1.0k scripts 1.0k downloads
textdata - Download and Load Various Text Datasets
Provides a framework to download, parse, and store text datasets on the disk and load them when needed. Includes various sentiment lexicons and labeled text data sets for classification and analysis.
Last updated 6 months ago
text-datasets
9.96 score 75 stars 1 package 1.1k scripts 7.3k downloads
themis - Extra Recipes Steps for Dealing with Unbalanced Data
A dataset with an uneven number of cases in each class is said to be unbalanced. Many models perform poorly on unbalanced datasets. A dataset can be balanced by increasing the number of minority cases using SMOTE 2011 <arXiv:1106.1813>, BorderlineSMOTE 2005 <doi:10.1007/11538059_91> and ADASYN 2008 <https://ieeexplore.ieee.org/document/4633969>, or by decreasing the number of majority cases using NearMiss 2003 <https://www.site.uottawa.ca/~nat/Workshop2003/jzhang.pdf> or Tomek link removal 1976 <https://ieeexplore.ieee.org/document/4309452>.
Last updated 14 days ago
9.72 score 141 stars 1 package 756 scripts 7.2k downloads
rules - Model Wrappers for Rule-Based Models
Bindings for additional models for use with the 'parsnip' package. Models include prediction rule ensembles (Friedman and Popescu, 2008) <doi:10.1214/07-AOAS148>, C5.0 rules (Quinlan, 1992 ISBN: 1558602380), and Cubist (Kuhn and Johnson, 2013) <doi:10.1007/978-1-4614-6849-3>.
Last updated 1 month ago
9.62 score 40 stars 1 package 20k scripts 1.4k downloads
embed - Extra Recipes for Encoding Predictors
Predictors can be converted to one or more numeric representations using a variety of methods. Effect encodings using simple generalized linear models <arXiv:1611.09477> or nonlinear models <arXiv:1604.06737> can be used. There are also functions for dimension reduction and other approaches.
Last updated 1 day ago
9.23 score 142 stars 900 scripts 1.5k downloads
sparsevctrs - Sparse Vectors for Use in Data Frames
Provides sparse vectors powered by ALTREP (Alternative Representations for R Objects) that behave like regular vectors, and can thus be used in data frames. Also provides tools to convert between sparse matrices and data frames with sparse columns and functions to interact with sparse vectors.
Last updated 1 month ago
8.17 score 13 stars 30 packages 21 scripts 470 downloads
discrim - Model Wrappers for Discriminant Analysis
Bindings for additional classification models for use with the 'parsnip' package. Models include flavors of discriminant analysis, such as linear (Fisher (1936) <doi:10.1111/j.1469-1809.1936.tb02137.x>), regularized (Friedman (1989) <doi:10.1080/01621459.1989.10478752>), and flexible (Hastie, Tibshirani, and Buja (1994) <doi:10.1080/01621459.1994.10476866>), as well as naive Bayes classifiers (Hand and Yu (2001) <doi:10.1111/j.1751-5823.2001.tb00465.x>).
Last updated 1 month ago
7.90 score 28 stars 924 scripts 2.5k downloads
emoji - Data and Function to Work with Emojis
Contains data about emojis with relevant metadata, and functions to work with emojis when they are in strings.
Last updated 26 days ago
7.72 score 28 stars 2 packages 314 scripts 614 downloads
ggpage - Creates Page Layout Visualizations
Facilitates the creation of page layout visualizations in which words are represented as rectangles with sizes relating to the length of the words. The words are then divided into lines and pages, giving an easy overview of even quite large texts.
Last updated 5 years ago
data-visualization, datavisualization, dataviz, ggplot2
7.53 score 341 stars 66 scripts 171 downloads
modelenv - Provide Tools to Register Models for Use in 'tidymodels'
A developer-focused, low-dependency package in 'tidymodels' that provides functions to register how models are to be used. Functions to register models are complemented with accessor functions to retrieve registered model information to aid in model fitting and error handling.
Last updated 1 month ago
7.35 score 4 stars 39 packages 1 script 32k downloads
tidyclust - A Common API to Clustering
A common interface to specifying clustering models, in the same style as 'parsnip'. Creates a unified interface across different functions and computational engines.
Last updated 5 months ago
7.17 score 108 stars 125 scripts 1.4k downloads
orbital - Predict with 'tidymodels' Workflows in Databases
Turn 'tidymodels' workflows into objects containing the sequential equations sufficient to perform predictions. These smaller objects allow for low-dependency prediction locally or directly in databases.
Last updated 25 days ago
5.95 score 18 stars 8 scripts 157 downloads
fastTextR - An Interface to the 'fastText' Library
An interface to the 'fastText' library <https://github.com/facebookresearch/fastText>. The package can be used for text classification and to learn word vectors. An example of how to use 'fastTextR' can be found in the 'README' file.
Last updated 12 months ago
5.79 score 4 stars 2 packages 43 scripts 436 downloads
friends - The Entire Transcript from Friends in Tidy Format
The complete scripts from the American sitcom Friends in tibble format. Use this package to practice data wrangling, text analysis and network analysis.
Last updated 3 years ago
4.97 score 62 stars 30 scripts 207 downloads
modeldatatoo - More Data Sets Useful for Modeling Examples
More data sets used for demonstrating or testing model-related packages are contained in this package. The data sets are downloaded and cached, allowing for more and bigger data sets.
Last updated 8 months ago
4.85 score 7 stars 34 scripts 181 downloads
hcandersenr - H.C. Andersens Fairy Tales
Texts of H.C. Andersen's fairy tales, ready for text analysis. Fairy tales in German, Danish, English, Spanish and French.
Last updated 5 years ago
andersens-fairy-tales, text-mining
4.60 score 10 stars 79 scripts 215 downloads
walmartAPI - Walmart Open API Wrapper
Provides API access to the Walmart Open API <https://developer.walmartlabs.com/>, which contains data about stores, the Value of the day, and products, including names, sale prices, shipping rates and taxonomies.
Last updated 5 years ago
walmart-api
4.37 score 18 stars 13 scripts 108 downloads
extrasteps - More Miscellaneous Steps for the 'recipes' Package
Contains additional miscellaneous steps for the 'recipes' package. These steps are useful, but don't have a good home in other 'recipes' packages or its extensions.
Last updated 2 months ago
4.32 score 10 stars 14 scripts 249 downloads
wordsalad - Provide Tools to Extract and Analyze Word Vectors
Provides access to various word embedding methods (GloVe, fasttext and word2vec) to extract word vectors using a unified framework to increase reproducibility and correctness.
Last updated 4 years ago
3.60 score 8 stars 9 scripts 182 downloads
methcon5 - Identify and Rank CpG DNA Methylation Conservation Along the Human Genome
Identify and rank CpG DNA methylation conservation along the human genome. Specifically, it includes bootstrapping methods to provide rankings that adjust for differences in region length; without this adjustment, short regions tend to get higher conservation scores.
Last updated 5 years ago
2.70 score 6 scripts 134 downloads