API Reference
- Top
- Modules
Changelog
- Top
- 2.4.0 — 2021-01-26
- 2.3.4 — 2020-12-02
- 2.3.3 — 2020-11-10
- 2.3.2 — 2020-11-06
- 2.3.1 — 2020-11-06
- 2.3.0 — 2020-11-06
- 2.2.0 — 2020-10-12
- 2.1.0 — 2020-08-21
- 2.0.0 — 2020-07-10
- 2.0.0-rc.3 — 2020-07-01
- 2.0.0-rc.2 — 2020-06-23
- 2.0.0-rc.1 — 2020-06-12
- 2.0.0-rc.0 — 2020-06-03
Guides
Installation
- Top
Troubleshooting
- Top
- Querying the Jobs Table
- Heroku
- PG Bouncer
Release Configuration
- Top
- Using Config Providers
Writing Plugins
- Top
- Example Plugin
- Calling Interface Functions
- Caveats
Recipes
Recursive Jobs
- Top
- Use Case: Backfilling Timezone Data
- Building On Recursive Jobs
Reliable Scheduled Jobs
- Top
- Use Case: Delivering Daily Digest Emails
- More Flexible Than CRON Scheduling
Reporting Job Progress
- Top
- Use Case: Exporting a Large Zip File
- Coordinating Processes
- Made Possible by Unlimited Execution
Handling Expected Failures
- Top
- Use Case: Silencing Initial Notifications for Flaky Services
Splitting Queues Between Nodes
- Top
- Use Case: Isolating Video Processing Intensive Jobs
- Flexible Across all Environments
Upgrade Guides
Upgrading to v2.0
- Top
- Bump Your Deps
- Oban.Worker Changes
- Update Your Config
- Update Your Tests
- Update Telemetry
- Update ObanWeb (Optional)
Oban Pro
Overview
- Top
- Plugins
- Workers
Installation
- Top
Lifeline Plugin
- Top
- Using and Configuring
- Rescuing Exhausted Jobs
- Implementation Notes
- Instrumenting with Telemetry
Dynamic Cron Plugin
- Top
- Installation
- Using and Configuring
- Runtime Updates
- Isolation and Namespacing
- Instrumenting with Telemetry
Dynamic Pruner Plugin
- Top
- Using and Configuring
- Providing Overrides
- Keeping Up With Inserts
- Implementation Notes
- Instrumenting with Telemetry
Reprioritizer Plugin
- Top
- Using and Configuring
- Providing Overrides
- Instrumenting with Telemetry
Batch Worker
- Top
- Using and Configuring
- Inserting Batches
- Handler Callbacks
- Generating Batch IDs
- Inserting Large Batches
- Implementation Notes
Workflow Worker
- Top
- Using and Configuring
- Visualizing Workflows
- Workflow Options
- Generating Workflow IDs
- Implementation Notes
Changelog
- Top
- v0.6.0 — 2021-01-26
- v0.5.3 — 2020-12-09
- v0.5.2 — 2020-11-27
- v0.5.1 — 2020-11-06
- v0.5.0 — 2020-11-06
- v0.4.2 — 2020-10-19
- v0.4.1 — 2020-10-11
- v0.4.0 — 2020-10-05
- v0.3.2 — 2020-08-28
- v0.3.1 — 2020-08-05
- v0.3.0 — 2020-07-10
- v0.2.1 — 2020-07-01
- v0.2.0 — 2020-06-12
- v0.1.0 — 2020-06-03
Oban Web
Overview
- Top
- Features
Installation
- Top
- Running Multiple Dashboards
- Using LongPolling
- Customizing with a Resolver Callback Module
- Integrating with Telemetry
Customizing the Dashboard
- Top
- Current User
- Action Controls
- Default Refresh
Telemetry
- Top
- Action Events
- Action Logging
Troubleshooting
- Top
Changelog
- Top
- v2.5.0 — 2021-01-15
- v2.4.0 — 2020-12-11
- v2.3.1 — 2020-11-27
- v2.3.0 — 2020-11-06
- v2.2.3 — 2020-10-15
- v2.2.2 — 2020-10-11
- v2.2.1 — 2020-09-29
- v2.2.0 — 2020-09-11
- v2.1.1 — 2020-08-24
- v2.1.0 — 2020-08-06
- v2.0.0 — 2020-07-10
- v1.5.0 — 2020-04-27
- v1.4.0 — 2020-03-24
- v1.3.1 — 2020-03-18
- v1.3.0 — 2020-03-10
- v1.2.0 — 2020-02-07
- v1.1.2 — 2020-02-07
- v1.1.1 — 2020-02-06
- v1.1.0 — 2020-02-06
- v1.0.1 — 2020-01-29
- v1.0.0 — 2020-01-29
- v0.8.0 — 2020-01-23
- v0.7.0 — 2020-01-08
- v0.6.3 — 2019-12-15
- v0.6.2 — 2019-12-05
- v0.6.1 — 2019-11-22

Lifeline Plugin

🌟 This plugin is available through Oban.Pro

The Lifeline plugin records queue activity as heartbeats and periodically rescues orphaned jobs, i.e. jobs that are stuck in the executing state because the node was shut down before the job could finish. Without the Lifeline plugin you need to manually rescue jobs stuck in the executing state.

Lifeline must be included used in order for Oban.Web to run properly.

Using and Configuring

To use the Lifeline plugin add the module to your list of Oban plugins in config.exs:

config :my_app, Oban,
  plugins: [Oban.Pro.Plugins.Lifeline]
  ...

There isn't any configuration necessary. By default the plugin will record heartbeats every 1 second, prune old heartbeats every 5 minutes, and rescue orphaned jobs every 1 minute. If necessary you can configure any or all of those intervals:

plugins: [{
  Oban.Pro.Plugins.Lifeline,
  delete_interval: :timer.minutes(10),
  record_interval: :timer.seconds(10),
  rescue_interval: :timer.minutes(5)
}]

This configuration will record 10x fewer heartbeats per minute, retain them 2x as long and attempt to rescue 5x less frequently. It optimizes for less database activity, at the expense of fidelity and recovery speed.

Note that rescuing orphans relies on recent heartbeats. Be sure that the delete_interval is always longer than the rescue_interval or it will look like all executing jobs are orphaned.

Rescuing Exhausted Jobs

When a job's attempt matches its max_attempts its retries are considered "exhausted". Normally, the Lifeline plugin transitions exhausted jobs to the discarded state and they won't be retried again. It does this for a couple of reasons:

To ensure at-most-once semantics. If a long running job interacted with a non idempotent service and was shut down while waiting for a reply you may not want that jot to retry.
To prevent infinitely crashing BEAM nodes. Poorly behaving jobs may crash the node (through NIFs, memory exhaustion, etc.) We don't want to repeatedly rescue and rerun a job that repeatedly crashes the entire node.

Discarding exhausted jobs may not always be desired. Use the retry_exhausted option if you'd prefer to retry exhausted jobs when they are rescued, rather than discarding them:

plugins: [{Oban.Pro.Plugins.Lifeline, retry_exhausted: true}]

During rescues, with retry_exhausted: true, a job's max_attempts is incremented and it is moved back to the available state.

Implementation Notes

Some additional notes about heartbeat records and how you can expect Lifeline to operate:

Orphan rescuing is guaranteed to only rescue jobs that belong to dead queue processes or nodes.
Heartbeat records are written to the oban_beats table very efficiently, as a single batch.
Heartbeat records are only retained for five minutes, by default. This prevents bloat or exhausting the row limit on free tier databases.
Only a single node will delete heartbeats or rescue orphans at any given time, which prevents potential deadlocks and churn.

Instrumenting with Telemetry

The Lifeline plugin adds the following metadata to the [:oban, :plugin, :stop] event:

:action — one of :delete, :record or :rescue.
:deleted_count — the number of jobs deleted for the :delete action
:rescued_count — the number of jobs rescued for the :rescue action
:recorded_count — the number beats inserted for the :record action

See the docs on Plugin Events for details.