View Source API Reference Scholar v0.3.1

Modules

Scholar.Cluster.AffinityPropagation

Model representing affinity propagation clustering. The first dimension of :clusters_centers is set to the number of samples in the dataset. The artificial centers are filled with :infinity values. To fillter them out use prune function.

Scholar.Cluster.DBSCAN

Perform DBSCAN clustering from vector array or distance matrix.

Scholar.Cluster.GaussianMixture

Gaussian Mixture Model.

Scholar.Cluster.Hierarchical

Performs hierarchical, agglomerative clustering on a dataset.

Scholar.Cluster.KMeans

K-Means Algorithm.

Scholar.Decomposition.PCA

Principal Component Analysis (PCA).

Scholar.Impute.SimpleImputer

Univariate imputer for completing missing values with simple strategies.

Scholar.Integrate

Module for numerical integration.

Scholar.Interpolation.BezierSpline

Cubic Bezier Spline interpolation.

Scholar.Interpolation.CubicSpline

Cubic Spline interpolation.

Scholar.Interpolation.Linear

Linear interpolation.

Scholar.Linear.BayesianRidgeRegression

Bayesian ridge regression: A fully probabilistic linear model with parameter regularization.

Scholar.Linear.IsotonicRegression

Isotonic regression is a method of fitting a free-form line to a set of observations by solving a convex optimization problem. It is a form of regression analysis that can be used as an alternative to polynomial regression to fit nonlinear data.

Scholar.Linear.LinearRegression

Ordinary least squares linear regression.

Scholar.Linear.LogisticRegression

Logistic regression in both binary and multinomial variants.

Scholar.Linear.PolynomialRegression

Least squares polynomial regression.

Scholar.Linear.RidgeRegression

Linear least squares with $L_2$ regularization.

Scholar.Linear.SVM

Support Vector Machine linear classifier.

Scholar.Manifold.MDS

Multidimensional scaling (MDS) seeks a low-dimensional representation of the data in which the distances respect well the distances in the original high-dimensional space.

Scholar.Manifold.TSNE

t-SNE (t-Distributed Stochastic Neighbor Embedding) is a nonlinear dimensionality reduction technique.

Scholar.Manifold.Trimap

TriMap: Large-scale Dimensionality Reduction Using Triplets.

Scholar.Metrics.Classification

Classification Metric functions.

Scholar.Metrics.Clustering

Metrics related to clustering algorithms.

Scholar.Metrics.Distance

Distance metrics between multi-dimensional tensors. They all support distance calculations between any subset of axes.

Scholar.Metrics.Neighbors

Metrics for evaluating the results of approximate k-nearest neighbor search algorithms.

Scholar.Metrics.Ranking

Provides metrics and calculations related to ranking quality.

Scholar.Metrics.Regression

Regression Metric functions.

Scholar.Metrics.Similarity

Similarity metrics between multi-dimensional tensors.

Scholar.ModelSelection

Module containing cross validation, splitting function, and other model selection methods.

Scholar.NaiveBayes.Complement

The Complement Naive Bayes classifier.

Scholar.NaiveBayes.Gaussian

Gaussian Naive Bayes algorithm for classification.

Scholar.NaiveBayes.Multinomial

Naive Bayes classifier for multinomial models.

Scholar.Neighbors.BruteKNN

Brute-Force k-Nearest Neighbor Search Algorithm.

Scholar.Neighbors.KDTree

Implements a k-d tree, a space-partitioning data structure for organizing points in a k-dimensional space.

Scholar.Neighbors.KNNClassifier

K-Nearest Neighbors Classifier.

Scholar.Neighbors.KNNRegressor

K-Nearest Neighbors Regressor.

Scholar.Neighbors.LargeVis

LargeVis algorithm for approximate k-nearest neighbor (k-NN) graph construction.

Scholar.Neighbors.NNDescent

Nearest Neighbors Descent (NND) is an algorithm that calculates Approximated Nearest Neighbors (ANN) for a given set of points[1].

Scholar.Neighbors.RadiusNearestNeighbors

The Radius Nearest Neighbors.

Scholar.Neighbors.RandomProjectionForest

Random Projection Forest for k-Nearest Neighbor Search.

Scholar.Preprocessing

Set of functions for preprocessing data.

Scholar.Preprocessing.MaxAbsScaler

Scales a tensor by dividing each sample in batch by maximum absolute value in the batch

Scholar.Preprocessing.MinMaxScaler

Scales a tensor by dividing each sample in batch by maximum absolute value in the batch

Scholar.Preprocessing.Normalizer

Implements functionality for rescaling tensor to unit norm. It enables to apply normalization along any combination of axes.

Scholar.Preprocessing.OneHotEncoder

Implements encoder that converts integer value (substitute of categorical data in tensors) into 0-1 vector. The index of 1 in the vector is aranged in sorted manner. This means that for x < y => one_index(x) < one_index(y).

Scholar.Preprocessing.OrdinalEncoder

Implements encoder that converts integer value (substitute of categorical data in tensors) into other integer value. The values assigned starts from 0 and go up to num_classes - 1.They are maintained in sorted manner. This means that for x < y => encoded_value(x) < encoded_value(y).

Scholar.Preprocessing.StandardScaler

Standardizes the tensor by removing the mean and scaling to unit variance.

Scholar.Stats

Statistical functions

Next Page → README