View Source Explorer.Series (Explorer v0.7.1)

The Series struct and API.

A series can be of the following data types:

:binary - Binaries (sequences of bytes)
:boolean - Boolean
:category - Strings but represented internally as integers
:date - Date type that unwraps to Elixir.Date
{:datetime, precision} - DateTime type with millisecond/microsecond/nanosecond precision that unwraps to Elixir.NaiveDateTime
{:duration, precision} - Duration type with millisecond/microsecond/nanosecond precision that unwraps to Explorer.Duration
:float - 64-bit floating point number
:integer - 64-bit signed integer
:string - UTF-8 encoded binary
:time - Time type that unwraps to Elixir.Time

A series must consist of a single data type only. Series may have nil values in them. The series dtype can be retrieved via the dtype/1 function or directly accessed as series.dtype. A series.name field is also available, but it is always nil unless the series is retrieved from a dataframe.

Many functions only apply to certain dtypes. These functions may appear on distinct categories on the sidebar. Other functions may work on several datatypes, such as comparison functions. In such cases, a "Supported dtypes" section will be available in the function documentation.

Creating series

Series can be created using from_list/2, from_binary/3, and friends:

Series can be made of numbers:

iex> Explorer.Series.from_list([1, 2, 3])
#Explorer.Series<
  Polars[3]
  integer [1, 2, 3]
>

Series are nullable, so you may also include nils:

iex> Explorer.Series.from_list([1.0, nil, 2.5, 3.1])
#Explorer.Series<
  Polars[4]
  float [1.0, nil, 2.5, 3.1]
>

Any of the dtypes above are supported, such as strings:

iex> Explorer.Series.from_list(["foo", "bar", "baz"])
#Explorer.Series<
  Polars[3]
  string ["foo", "bar", "baz"]
>

Summary

Types

datetime_dtype()

dtype()

duration_dtype()

inferable_scalar()

lazy_t()

non_finite()

t()

time_unit()

Functions: Conversion

from_binary(binary, dtype, opts \\ [])

Builds a series of dtype from binary.

from_list(list, opts \\ [])

Creates a new series from a list.

from_tensor(tensor, opts \\ [])

Converts a Nx.Tensor.t/0 to a series.

replace(series, tensor_or_list)

Replaces the contents of the given series by the one given in a tensor or list.

to_binary(series)

Returns a series as a fixed-width binary.

to_enum(series)

Converts a series to an enumerable.

to_iovec(series)

Returns a series as a list of fixed-width binaries.

to_list(series)

Converts a series to a list.

to_tensor(series, tensor_opts \\ [])

Converts a series to a Nx.Tensor.t/0.

Functions: Aggregation

argmax(series)

Gets the index of the maximum value of the series.

argmin(series)

Gets the index of the minimum value of the series.

correlation(left, right, ddof \\ 1)

Compute the Pearson's correlation between two series.

count(series)

Counts the number of elements in a series.

covariance(left, right)

Compute the covariance between two series.

cut(series, bins, opts \\ [])

Bins values into discrete values.

frequencies(series)

Creates a new dataframe with unique values and the frequencies of each.

max(series)

Gets the maximum value of the series.

mean(series)

Gets the mean value of the series.

median(series)

Gets the median value of the series.

min(series)

Gets the minimum value of the series.

n_distinct(series)

Returns the number of unique values in the series.

nil_count(series)

Counts the number of null elements in a series.

product(series)

Reduce this Series to the product value.

qcut(series, quantiles, opts \\ [])

Bins values into discrete values base on their quantiles.

quantile(series, quantile)

Gets the given quantile of the series.

skew(series, opts \\ [])

Compute the sample skewness of a series.

standard_deviation(series)

Gets the standard deviation of the series.

sum(series)

Gets the sum of the series.

variance(series)

Gets the variance of the series.

Functions: Element-wise

abs(series)

Gets the series absolute values.

add(left, right)

Adds right to left, element-wise.

all_equal(left, right)

Checks equality between two entire series.

left and right

Returns a boolean mask of left and right, element-wise.

cast(series, dtype)

Cast the series to another type.

categorise(series, categories)

Categorise a series of integers or strings according to categories.

clip(series, min, max)

Clip (or clamp) the values in a series.

coalesce(list)

Finds the first non-missing element at each position.

coalesce(s1, s2)

Finds the first non-missing element at each position.

divide(left, right)

Divides left by right, element-wise.

equal(left, right)

Returns boolean mask of left == right, element-wise.

exp(series)

Calculates the exponential of all elements.

greater(left, right)

Returns boolean mask of left > right, element-wise.

greater_equal(left, right)

Returns boolean mask of left >= right, element-wise.

left in right

Checks if each element of the series in the left exists in the series in the right, returning a boolean mask.

is_nil(series)

Returns a mask of nil values.

is_not_nil(series)

Returns a mask of not nil values.

less(left, right)

Returns boolean mask of left < right, element-wise.

less_equal(left, right)

Returns boolean mask of left <= right, element-wise.

log(s)

Calculates the natural logarithm.

log(series, base)

Calculates the logarithm on a given base.

mask(series, mask)

Filters a series with a mask.

multiply(left, right)

Multiplies left and right, element-wise.

not series

Negate the elements of a boolean series.

not_equal(left, right)

Returns boolean mask of left != right, element-wise.

left or right

Returns a boolean mask of left or right, element-wise.

peaks(series, max_or_min \\ :max)

Returns a boolean mask with true where the 'peaks' (series max or min, default max) are.

pow(left, right)

Raises a numeric series to the power of the exponent.

quotient(left, right)

Element-wise integer division.

rank(series, opts \\ [])

Assign ranks to data with appropriate handling of tied values.

remainder(left, right)

Computes the remainder of an element-wise integer division.

select(predicate, on_true, on_false)

Returns a series from two series, based on a predicate.

strftime(series, format_string)

Converts a datetime series to a string series.

strptime(series, format_string)

Converts a string series to a datetime series with a given format_string.

subtract(left, right)

Subtracts right from left, element-wise.

transform(series, fun)

Returns an Explorer.Series where each element is the result of invoking fun on each corresponding element of series.

Functions: Datetime ops

day_of_week(series)

Returns a day-of-week number starting from Monday = 1. (ISO 8601 weekday number)

hour(series)

Returns the hour number from 0 to 23.

minute(series)

Returns the minute number from 0 to 59.

month(series)

Returns the month number starting from 1. The return value ranges from 1 to 12.

second(series)

Returns the second number from 0 to 59.

year(series)

Returns the year number in the calendar date.

Functions: Float ops

acos(series)

Computes the the arccosine of a number. The resultant series is going to be of dtype :float, in radians, with values between 0 and pi.

asin(series)

Computes the the arcsine of a number. The resultant series is going to be of dtype :float, in radians, with values between -pi/2 and pi/2.

atan(series)

Computes the the arctangent of a number. The resultant series is going to be of dtype :float, in radians, with values between -pi/2 and pi/2.

ceil(series)

Ceil floating point series to highest integers smaller or equal to the float value.

cos(series)

Computes the the cosine of a number (in radians). The resultant series is going to be of dtype :float, with values between 1 and -1.

floor(series)

Floor floating point series to lowest integers smaller or equal to the float value.

is_finite(series)

Returns a mask of finite values.

is_infinite(series)

Returns a mask of infinite values.

is_nan(series)

Returns a mask of nan values.

round(series, decimals)

Round floating point series to given decimal places.

sin(series)

Computes the the sine of a number (in radians). The resultant series is going to be of dtype :float, with values between 1 and -1.

tan(series)

Computes the tangent of a number (in radians). The resultant series is going to be of dtype :float.

Functions: String ops

contains(series, pattern)

Detects whether a string contains a substring.

downcase(series)

Converts all characters to lowercase.

lstrip(series)

Returns a string series where all leading Unicode whitespaces have been removed.

lstrip(series, string)

Returns a string series where all leading examples of the provided string have been removed.

replace(series, pattern, replacement)

Replaces all occurences of pattern with replacement in string series.

rstrip(series)

Returns a string series where all trailing Unicode whitespaces have been removed.

rstrip(series, string)

Returns a string series where all trailing examples of the provided string have been removed.

strip(series)

Returns a string series where all leading and trailing Unicode whitespaces have been removed.

strip(series, string)

Returns a string series where all leading and trailing examples of the provided string have been removed.

substring(series, offset)

Returns a string sliced from the offset to the end of the string, supporting negative indexing

substring(series, offset, length)

Returns a string sliced from the offset to the length provided, supporting negative indexing

upcase(series)

Converts all characters to uppercase.

Functions: Introspection

categories(series)

Return a series with the category names of a categorical series.

dtype(series)

Returns the data type of the series.

iotype(series)

Returns the type of the underlying fixed-width binary representation.

size(series)

Returns the size of the series.

Functions: Shape

argsort(series, opts \\ [])

Returns the indices that would sort the series.

at(series, idx)

Returns the value of the series at the given index.

at_every(series, every_n)

Takes every nth value in this series, returned as a new series.

concat(series)

Concatenate one or more series.

concat(s1, s2)

Concatenate two series.

distinct(series)

Returns the unique values of the series.

first(series)

Returns the first element of the series.

format(list)

Returns a string series with all values concatenated.

head(series, n_elements \\ 10)

Returns the first N elements of the series.

last(series)

Returns the last element of the series.

reverse(series)

Reverses the series order.

sample(series, n_or_frac, opts \\ [])

Returns a random sample of the series.

shift(series, offset)

Shifts series by offset with nil values.

shuffle(series, opts \\ [])

Change the elements order randomly.

slice(series, indices)

Slices the elements at the given indices as a new series.

slice(series, offset, size)

Returns a slice of the series, with size elements starting at offset.

sort(series, opts \\ [])

Sorts the series.

tail(series, n_elements \\ 10)

Returns the last N elements of the series.

unordered_distinct(series)

Returns the unique values of the series, but does not maintain order.

Functions: Window

cumulative_max(series, opts \\ [])

Calculates the cumulative maximum of the series.

cumulative_min(series, opts \\ [])

Calculates the cumulative minimum of the series.

cumulative_product(series, opts \\ [])

Calculates the cumulative product of the series.

cumulative_sum(series, opts \\ [])

Calculates the cumulative sum of the series.

ewm_mean(series, opts \\ [])

Calculate the exponentially weighted moving average, given smoothing factor alpha.

fill_missing(series, value)

Fill missing values with the given strategy. If a scalar value is provided instead of a strategy atom, nil will be replaced with that value. It must be of the same dtype as the series.

window_max(series, window_size, opts \\ [])

Calculate the rolling max, given a window size and optional list of weights.

window_mean(series, window_size, opts \\ [])

Calculate the rolling mean, given a window size and optional list of weights.

window_median(series, window_size, opts \\ [])

Calculate the rolling median, given a window size and optional list of weights.

window_min(series, window_size, opts \\ [])

Calculate the rolling min, given a window size and optional list of weights.

window_standard_deviation(series, window_size, opts \\ [])

Calculate the rolling standard deviation, given a window size and optional list of weights.

window_sum(series, window_size, opts \\ [])

Calculate the rolling sum, given a window size and optional list of weights.

Functions

to_date(series) deprecated

to_time(series) deprecated

Types

datetime_dtype()

@type datetime_dtype() :: {:datetime, time_unit()}

dtype()

@type dtype() ::
  :binary
  | :boolean
  | :category
  | :date
  | :time
  | datetime_dtype()
  | duration_dtype()
  | :float
  | :integer
  | :string

duration_dtype()

@type duration_dtype() :: {:duration, time_unit()}

inferable_scalar()

@type inferable_scalar() ::
  number()
  | non_finite()
  | boolean()
  | String.t()
  | Date.t()
  | Time.t()
  | NaiveDateTime.t()

lazy_t()

@type lazy_t() :: %Explorer.Series{
  data: Explorer.Backend.LazySeries.t(),
  dtype: dtype(),
  name: term()
}

non_finite()

@type non_finite() :: :nan | :infinity | :neg_infinity

t()

@type t() :: %Explorer.Series{
  data: Explorer.Backend.Series.t(),
  dtype: dtype(),
  name: term()
}

time_unit()

@type time_unit() :: :nanosecond | :microsecond | :millisecond

Functions: Conversion

from_binary(binary, dtype, opts \\ [])

@spec from_binary(
  binary(),
  :float
  | :integer
  | :boolean
  | :date
  | :time
  | datetime_dtype()
  | duration_dtype(),
  keyword()
) :: t()

Builds a series of dtype from binary.

All binaries must be in native endianness.

Options

:backend - The backend to allocate the series on.

Settings View Source Explorer.Series (Explorer v0.7.1)

datetime_dtype()

dtype()

duration_dtype()

inferable_scalar()

lazy_t()

non_finite()

t()

time_unit()

from_binary(binary, dtype, opts \\ [])

from_list(list, opts \\ [])

from_tensor(tensor, opts \\ [])

Warning

replace(series, tensor_or_list)

to_binary(series)

to_enum(series)

Warning

to_iovec(series)

to_list(series)

Warning

to_tensor(series, tensor_opts \\ [])

Warning

argmax(series)

argmin(series)

correlation(left, right, ddof \\ 1)

count(series)

covariance(left, right)

cut(series, bins, opts \\ [])

frequencies(series)

max(series)

mean(series)

median(series)

min(series)

n_distinct(series)

nil_count(series)

product(series)

qcut(series, quantiles, opts \\ [])

quantile(series, quantile)

skew(series, opts \\ [])

standard_deviation(series)

sum(series)

variance(series)

abs(series)

add(left, right)

all_equal(left, right)

left and right

cast(series, dtype)

categorise(series, categories)

clip(series, min, max)

coalesce(list)

coalesce(s1, s2)

divide(left, right)

equal(left, right)

exp(series)

greater(left, right)

greater_equal(left, right)

left in right

is_nil(series)

is_not_nil(series)

less(left, right)

View Source Explorer.Series (Explorer v0.7.1)