Recipes
Recipes are functions that define workflows for annotation, model training, data analysis, automated actions and more. Ellf comes with a range of built-in workflows for different use cases and also lets you implement your own custom recipes that run on your cluster.
Use Ellf to configure recipes for you
If you’ve connected Ellf to your coding assistant, it will be able to create and start tasks, actions and agents for you. You can also use the in-app chat and reference resources via @, for example to start a task using a data source, train from a dataset or assign an agent to a running task.
Tasks Annotation and review
Tasks are workflows that preprocess and queue up data for annotation or review and start the annotation server. You can view and create them in the UI via Tasks or using the CLI commands under ellf tasks.
Example Task
Annotate 'hello world'
example_taskNamed Entity Recognition
Annotate labeled text spans representing real-world objects like names, persons, countries or products.
nerSpan Categorization
Annotate potentially overlapping and nested spans in the data.
spansText Classification
Assign categories to whole documents or sentences.
textcatRelation Extraction
Annotate relations between tokens and spans. Also supports joint span and relation annotation.
relationsCoreference Resolution
Annotate coreference, i.e. links of ambiguous mentions like "her" or "the woman" back to an antecedent providing more context about the entity in question
corefDependency Parsing
Annotate syntactic dependencies.
depPart of Speech tagging recipe
Annotate word types.
posTerminology List
Bootstrap a terminology list from word vectors. Terminology lists can be converted into patterns to help pre-select entity spans during annotation.
termsImage Annotation & Classification
Annotate bounding boxes and segments, or assign categories to images.
imageAnnotate Audio
Annotate regions, assign categories to audio content or transcribe audio files.
audioAnnotate Video
Annotate regions, assign categories to video content or transcribe video files.
videoCurate and Explore
View what's in your data and accept or reject examples
curateReview Annotations
Review existing annotations created by multiple annotators and resolve potential conflicts by creating one final annotation.
reviewSecrets Example
Annotate 'hello world'
secrets_exampleSentence Segmentation
Create gold data for sentence boundaries by correcting a model's predictions
sentDebug Task
Task with tunable delays and errors for debugging.
debug_taskActions Training, evaluation and more
Actions are workflows that execute any logic and exit, similar to jobs running in a CI system. You can view and create them in the UI via Actions or using the CLI commands under ellf actions.
Download AG News data
Download AG News, filter for quality, sentence-segment with spaCy, and register train/eval assets on the cluster.
download_data_recipeDownload AG News
Download AG News texts and register train/eval input assets.
download_ag_newsDataset operations
Merge, copy and export annotated data
db_actionsMigrate dataset to structured
Convert an unstructured dataset to the structured format
migrate_to_structuredHello world
Print 'hello world'
hello_worldWait and exit
Wait and exit with a given code
wait_and_exitPrint file length
print_file_lengthPrint dataset or file length
print_dataset_or_file_lengthCall PAM with dummy metrics data
send_dummy_metricsDownload spaCy models
Download and install one or more spaCy models to shared storage so they can be loaded with spacy.load()
download_spacy_modelsTrain a spaCy pipeline
Train a spaCy model with one or more components on annotated data
trainTextcat LLM fetch
Gather text categorization predictions from an LLM
llm_fetch_textcatAgents Auto-annotation and automation
Agents are autonomous workers and annotators that can be assigned to tasks. They’re typically powered by LLMs and can use models running on the cluster or via APIs. You can view and create them in the UI via Agents or using the CLI commands under ellf agents.
Gemini Annotation Agent
Autonomous annotation agent powered by Google Gemini
gemini_agentspaCy Test Agent
Deterministic local annotation agent for tests and development
spacy_test_agentCommunity Recipes Third-party and other plugins
These recipes can be installed to your cluster separately and are provided by other packages by us or the developer community. If you want to contribute a recipe you’ve built, get in touch! For more details on custom recipes, see the recipe development guide.
Coming soon: This section is still under construction.