DocumentationPricingJoin Waitlist

Getting Started

  • Introduction
  • Installation & Setup

Features

  • Modules
  • Platform
  • Recipes
    • Tasks
    • Actions
    • Agents

Cluster

  • Details & Setup
  • Tasks, Actions & Agents
  • Assets & Files

API Reference

  • Command-Line Interface
  • Recipe Development
  • Recipes API

Recipes

Recipes are functions that define workflows for annotation, model training, data analysis, automated actions and more. Ellf comes with a range of built-in workflows for different use cases and also lets you implement your own custom recipes that run on your cluster.

Use Ellf to configure recipes for you

If you’ve connected Ellf to your coding assistant, it will be able to create and start tasks, actions and agents for you. You can also use the in-app chat and reference resources via @, for example to start a task using a data source, train from a dataset or assign an agent to a running task.

Tasks Annotation and review

Tasks are workflows that preprocess and queue up data for annotation or review and start the annotation server. You can view and create them in the UI via Tasks or using the CLI commands under ellf tasks.

Example Task

Annotate 'hello world'

example_task

Named Entity Recognition

Annotate labeled text spans representing real-world objects like names, persons, countries or products.

ner

Span Categorization

Annotate potentially overlapping and nested spans in the data.

spans

Text Classification

Assign categories to whole documents or sentences.

textcat

Relation Extraction

Annotate relations between tokens and spans. Also supports joint span and relation annotation.

relations

Coreference Resolution

Annotate coreference, i.e. links of ambiguous mentions like "her" or "the woman" back to an antecedent providing more context about the entity in question

coref

Dependency Parsing

Annotate syntactic dependencies.

dep

Part of Speech tagging recipe

Annotate word types.

pos

Terminology List

Bootstrap a terminology list from word vectors. Terminology lists can be converted into patterns to help pre-select entity spans during annotation.

terms

Image Annotation & Classification

Annotate bounding boxes and segments, or assign categories to images.

image

Annotate Audio

Annotate regions, assign categories to audio content or transcribe audio files.

audio

Annotate Video

Annotate regions, assign categories to video content or transcribe video files.

video

Curate and Explore

View what's in your data and accept or reject examples

curate

Review Annotations

Review existing annotations created by multiple annotators and resolve potential conflicts by creating one final annotation.

review

Secrets Example

Annotate 'hello world'

secrets_example

Sentence Segmentation

Create gold data for sentence boundaries by correcting a model's predictions

sent

Debug Task

Task with tunable delays and errors for debugging.

debug_task

Actions Training, evaluation and more

Actions are workflows that execute any logic and exit, similar to jobs running in a CI system. You can view and create them in the UI via Actions or using the CLI commands under ellf actions.

Download AG News data

Download AG News, filter for quality, sentence-segment with spaCy, and register train/eval assets on the cluster.

download_data_recipe

Download AG News

Download AG News texts and register train/eval input assets.

download_ag_news

Dataset operations

Merge, copy and export annotated data

db_actions

Migrate dataset to structured

Convert an unstructured dataset to the structured format

migrate_to_structured

Hello world

Print 'hello world'

hello_world

Wait and exit

Wait and exit with a given code

wait_and_exit

Print file length

print_file_length

Print dataset or file length

print_dataset_or_file_length

Call PAM with dummy metrics data

send_dummy_metrics

Download spaCy models

Download and install one or more spaCy models to shared storage so they can be loaded with spacy.load()

download_spacy_models

Train a spaCy pipeline

Train a spaCy model with one or more components on annotated data

train

Textcat LLM fetch

Gather text categorization predictions from an LLM

llm_fetch_textcat

Agents Auto-annotation and automation

Agents are autonomous workers and annotators that can be assigned to tasks. They’re typically powered by LLMs and can use models running on the cluster or via APIs. You can view and create them in the UI via Agents or using the CLI commands under ellf agents.

Gemini Annotation Agent

Autonomous annotation agent powered by Google Gemini

gemini_agent

spaCy Test Agent

Deterministic local annotation agent for tests and development

spacy_test_agent

Community Recipes Third-party and other plugins

These recipes can be installed to your cluster separately and are provided by other packages by us or the developer community. If you want to contribute a recipe you’ve built, get in touch! For more details on custom recipes, see the recipe development guide.

Coming soon: This section is still under construction.

from the makers of spaCy and Prodigy

Navigation

  • Home
  • Documentation

Platform

  • Pricing
  • Waitlist

Resources

  • Case Studies
  • Blog
Terms & ConditionsPrivacy PolicyImprint© 2026 Explosion