Kreuzberg.ExtractionResult (kreuzberg v4.9.5)

Structure representing the result of a document extraction operation.

Matches the Rust ExtractionResult struct.

Fields

:content - The main extracted text content
:mime_type - The MIME type of the processed document
:metadata - Metadata struct with document information
:tables - List of extracted tables
:detected_languages - List of detected language codes
:chunks - Optional list of text chunks with embeddings
:images - Optional list of extracted images
:pages - Optional list of per-page content
:elements - Optional list of semantic elements
:ocr_elements - Optional list of OCR elements with positioning and confidence
:djot_content - Optional rich Djot content structure
:document - Optional hierarchical document structure
:extracted_keywords - Optional list of extracted keywords with scores
:quality_score - Optional quality score for the extraction (0.0 to 1.0)
:processing_warnings - Optional list of warnings generated during processing
:annotations - Optional list of PDF annotations (text, highlight, link, etc.)
:uris - Optional list of URIs extracted from the document
:children - Optional list of child extraction results (e.g., from archive entries)

Summary

Types

t()

Functions

new(content, mime_type, metadata \\ %Kreuzberg.Metadata{}, tables \\ [], opts \\ [])

Creates a new ExtractionResult from extracted data.

to_map(result)

Converts an ExtractionResult struct to a map.