Getting Started
Copy MarkdownThis guide walks you through setting up Milvex and performing basic vector operations.
Prerequisites
- Elixir 1.19 or later
- A running Milvus instance (local or cloud)
Installation
Add milvex to your dependencies in mix.exs:
def deps do
[
{:milvex, "~> 0.1.0"}
]
endThen fetch dependencies:
mix deps.get
Connecting to Milvus
Basic Connection
{:ok, conn} = Milvex.Connection.start_link(host: "localhost", port: 19530)Named Connection
For use throughout your application, start a named connection:
{:ok, _} = Milvex.Connection.start_link([host: "localhost"], name: :milvus)
# Use the named connection
Milvex.search(:milvus, "collection", vectors, vector_field: "embedding")Under a Supervisor
The recommended approach for production:
defmodule MyApp.Application do
use Application
def start(_type, _args) do
children = [
{Milvex.Connection, [host: "localhost", port: 19530, name: MyApp.Milvus]}
]
opts = [strategy: :one_for_one, name: MyApp.Supervisor]
Supervisor.start_link(children, opts)
end
endConnection Options
Milvex.Connection.start_link(
host: "localhost", # Milvus server hostname
port: 19530, # gRPC port (default: 19530, or 443 for SSL)
database: "default", # Database name
user: "root", # Username (optional)
password: "milvus", # Password (optional)
token: "api_token", # API token (alternative to user/password)
ssl: true, # Enable SSL/TLS
ssl_options: [], # SSL options for transport
timeout: 30_000 # Connection timeout in ms
)You can also use a URI:
{:ok, config} = Milvex.Config.parse_uri("https://user:pass@milvus.example.com:443/mydb")
{:ok, conn} = Milvex.Connection.start_link(config)Creating a Collection
Define the Schema
Use the fluent builder API to define your collection schema:
alias Milvex.Schema
alias Milvex.Schema.Field
schema = Schema.build!(
name: "movies",
fields: [
Field.primary_key("id", :int64, auto_id: true),
Field.varchar("title", 512),
Field.vector("embedding", 128)
],
enable_dynamic_field: true
)Create Collection and Index
alias Milvex.Index
# Create the collection
:ok = Milvex.create_collection(conn, "movies", schema)
# Create an HNSW index for vector search
index = Index.hnsw("embedding", :cosine, m: 16, ef_construction: 256)
:ok = Milvex.create_index(conn, "movies", index)
# Load collection into memory for searching
:ok = Milvex.load_collection(conn, "movies")Inserting Data
Insert data as a list of maps:
{:ok, result} = Milvex.insert(conn, "movies", [
%{title: "The Matrix", embedding: generate_embedding("The Matrix")},
%{title: "Inception", embedding: generate_embedding("Inception")}
])
# result.ids contains the auto-generated IDs
IO.inspect(result.ids)Searching Vectors
Perform similarity search:
query_vector = generate_embedding("science fiction action")
{:ok, results} = Milvex.search(conn, "movies", [query_vector],
vector_field: "embedding",
top_k: 10,
output_fields: ["title"],
filter: "title like \"The%\""
)
# Access results
for hit <- results.hits do
IO.puts("#{hit.id}: #{hit.fields["title"]} (score: #{hit.score})")
endQuerying by Expression
Query records using filter expressions:
{:ok, results} = Milvex.query(conn, "movies", "id > 0",
output_fields: ["id", "title"],
limit: 100
)Index Types
Choose the right index for your use case:
# HNSW - best for high recall with good performance
Index.hnsw("field", :cosine, m: 16, ef_construction: 256)
# IVF_FLAT - good balance for medium datasets
Index.ivf_flat("field", :l2, nlist: 1024)
# AUTOINDEX - let Milvus choose optimal settings
Index.autoindex("field", :ip)
# IVF_PQ - memory efficient for large datasets
Index.ivf_pq("field", :l2, nlist: 1024, m: 8, nbits: 8)
# DiskANN - for datasets that don't fit in memory
Index.diskann("field", :l2)Supported metric types: :l2, :ip, :cosine, :hamming, :jaccard
Using Partitions
Organize data into partitions for efficient querying:
# Create partition
:ok = Milvex.create_partition(conn, "movies", "movies_2024")
# Insert into partition
{:ok, _} = Milvex.insert(conn, "movies", data, partition_name: "movies_2024")
# Search specific partitions
{:ok, _} = Milvex.search(conn, "movies", vectors,
vector_field: "embedding",
partition_names: ["movies_2024", "movies_2023"]
)
# Load/release partitions
:ok = Milvex.load_partitions(conn, "movies", ["movies_2024"])
:ok = Milvex.release_partitions(conn, "movies", ["movies_2024"])Next Steps
- Read the Architecture guide to understand how Milvex works internally
- Learn about Error Handling patterns
- Explore the
Milvexmodule documentation for all available operations