Basic concepts

Basic concepts

Get more information

Apply learned concepts

Get more in-depth information on

Learn new concepts

Get expert information

Learn new concepts

Apply your skills

Where to go next

Text Mining, Business Analytics

Weka: open-source Java tools for data pre-processing, classification, regression, clustering, association rules, and visualization

Principles of popular algorithms

scikit-learn: classification, regression, clustering, dimensionality reduction, model selection, and preprocessing; built on NumPy, SciPy, and matplotlib

Linear regression: a linear approach to modelling the relationship between a scalar response (the dependent variable) and one or more explanatory variables (the independent variables).
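
As a minimal sketch of the idea above, assuming a single explanatory variable and an ordinary least-squares fit, the slope and intercept can be computed in closed form. The function name and the toy data are made up for illustration.

```python
def fit_simple_linear_regression(xs, ys):
    """Return (slope, intercept) that minimize squared error for y ~ slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums (the 1/n factors cancel in the ratio).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical toy data lying exactly on the line y = 2x + 1.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]
slope, intercept = fit_simple_linear_regression(xs, ys)
print(slope, intercept)
```

With several explanatory variables the same least-squares idea applies, but the fit is usually solved as a matrix problem rather than with these scalar sums.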

Classification: the problem of identifying which of a set of categories a new observation belongs to, based on a training set of observations whose category membership is known.
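
One simple way to make this concrete is a 1-nearest-neighbour classifier, chosen here only because it is short; it is one of many possible classifiers, not the method the definition prescribes. The training points and labels are invented for the sketch.

```python
import math


def nearest_neighbor_predict(train_points, train_labels, query):
    """Predict the label of `query` as the label of the closest training point."""
    distances = [math.dist(point, query) for point in train_points]
    closest = distances.index(min(distances))
    return train_labels[closest]


# Hypothetical training set: observations with known category membership.
train_points = [(1.0, 1.0), (1.5, 2.0), (8.0, 8.0), (9.0, 7.5)]
train_labels = ["small", "small", "large", "large"]

prediction = nearest_neighbor_predict(train_points, train_labels, (2.0, 1.0))
print(prediction)
```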

Bias-variance tradeoff: the conflict in trying to simultaneously minimize two sources of error, bias and variance, that prevent supervised learning algorithms from generalizing beyond their training set.
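
The two sources of error can be written out explicitly. Under the usual assumptions (not stated in the text above) that the data follow y = f(x) + ε with zero-mean noise of variance σ², and that f̂ is the model learned from a random training set, the expected squared error at a point x decomposes as:

```latex
\mathbb{E}\big[(y - \hat{f}(x))^2\big]
  = \underbrace{\big(\mathbb{E}[\hat{f}(x)] - f(x)\big)^2}_{\text{bias}^2}
  + \underbrace{\mathbb{E}\big[(\hat{f}(x) - \mathbb{E}[\hat{f}(x)])^2\big]}_{\text{variance}}
  + \sigma^2
```

The σ² term is irreducible noise; making a model more flexible typically lowers the bias term while raising the variance term, which is the tradeoff.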

k-means clustering: partitions n observations into k clusters so that each observation belongs to the cluster with the nearest mean, which serves as the prototype of the cluster.
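
A minimal sketch of this procedure, assuming Lloyd's algorithm (alternating assignment and mean-update steps), Euclidean distance, and caller-supplied starting centroids. The data and starting centroids are hypothetical.

```python
import math


def k_means(points, centroids, iterations=10):
    """Lloyd's algorithm with a fixed number of iterations."""
    for _ in range(iterations):
        # Assignment step: put each point in the cluster of its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)), key=lambda i: math.dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster
        # (an empty cluster keeps its previous centroid).
        centroids = [
            tuple(sum(coord) / len(cluster) for coord in zip(*cluster)) if cluster else old
            for cluster, old in zip(clusters, centroids)
        ]
    # `clusters` reflects the last assignment step; once the algorithm has
    # converged it is consistent with the returned centroids.
    return centroids, clusters


points = [(1, 1), (1, 2), (2, 1), (8, 8), (9, 8), (8, 9)]
centroids, clusters = k_means(points, centroids=[(1, 1), (8, 8)])
print(centroids)
```

Real implementations usually add a convergence check and several random restarts, since the result depends on the starting centroids.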

Cluster analysis: the task of grouping a set of objects so that objects in the same group (called a cluster) are more similar, in some sense, to each other than to those in other groups.

Principal component analysis (PCA): a dimensionality reduction technique. It is a statistical procedure that uses an orthogonal transformation to convert a set of observations of possibly correlated variables into a set of values of linearly uncorrelated variables called principal components.
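
To illustrate, here is a small sketch that finds only the first principal component, assuming power iteration on the sample covariance matrix. This is a swapped-in shortcut for brevity; library implementations typically use an eigendecomposition or SVD. The function name and data are hypothetical.

```python
import math


def first_principal_component(rows, iterations=100):
    """Return (direction, scores) for the leading principal component."""
    n, dims = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(dims)]
    centered = [[r[j] - means[j] for j in range(dims)] for r in rows]
    # Sample covariance matrix of the centered data.
    cov = [
        [sum(c[i] * c[j] for c in centered) / (n - 1) for j in range(dims)]
        for i in range(dims)
    ]
    # Power iteration: repeatedly multiply by cov and renormalize.
    # Assumes the start vector is not orthogonal to the leading component.
    v = [1.0] * dims
    for _ in range(iterations):
        w = [sum(cov[i][j] * v[j] for j in range(dims)) for i in range(dims)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    # Project each centered observation onto the component.
    scores = [sum(c[j] * v[j] for j in range(dims)) for c in centered]
    return v, scores


# Two perfectly correlated variables collapse onto a single direction.
rows = [(1.0, 1.0), (2.0, 2.0), (3.0, 3.0), (4.0, 4.0)]
direction, scores = first_principal_component(rows)
print(direction, scores)
```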

condense the information of a large set of correlated variables into a few variables

Decision tree: a flowchart-like structure in which each internal node represents a test on an attribute, each branch represents an outcome of the test, and each leaf node represents a class label. The paths from root to leaf represent classification rules.
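
The structure described above can be sketched with nested dictionaries: internal nodes name the attribute to test, branches are keyed by test outcome, and leaves are plain class labels. The tree below is hand-written and hypothetical, not learned from data.

```python
# Internal node: {"attribute": ..., "branches": {outcome: subtree_or_label}}
# Leaf node: a plain string holding the class label.
tree = {
    "attribute": "outlook",
    "branches": {
        "sunny": {
            "attribute": "windy",
            "branches": {True: "stay in", False: "play"},
        },
        "rainy": "stay in",
        "overcast": "play",
    },
}


def classify(node, example):
    """Follow one root-to-leaf path by testing attributes of `example`."""
    while isinstance(node, dict):
        outcome = example[node["attribute"]]
        node = node["branches"][outcome]
    return node


label = classify(tree, {"outlook": "sunny", "windy": False})
print(label)
```

Each root-to-leaf path reads as a rule, for example: if outlook is sunny and it is not windy, then play.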

Association rule learning: a rule-based machine learning method for discovering interesting relations between variables in large databases.
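
How "interesting" a relation is gets measured in practice with statistics such as support and confidence. The sketch below assumes those two standard measures and a made-up list of shopping baskets; it scores a single rule rather than searching for rules.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Estimate of P(consequent | antecedent) from the transactions."""
    both = set(antecedent) | set(consequent)
    return support(transactions, both) / support(transactions, antecedent)


# Hypothetical market-basket data.
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]

rule_support = support(transactions, {"bread", "milk"})
rule_confidence = confidence(transactions, {"bread"}, {"milk"})
print(rule_support, rule_confidence)
```

Rule-mining algorithms such as Apriori search for all rules whose support and confidence exceed chosen thresholds instead of checking one rule at a time.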

Data Mining