Pipeline Service
Central orchestrator for Haystack pipeline management and execution.
The PipelineService provides:
Pipeline CRUD - Create, read, update, delete pipeline definitions
Validation - Validates pipeline definitions against
ComponentRegistryBuilding - Constructs Haystack
Pipelineobjects from YAML/dict definitionsExecution - Asynchronous pipeline execution
Caching - Version-based pipeline caching for performance
Key Dependencies
ComponentRegistry- Whitelist of allowed Haystack componentsPipelineValidator- Validates pipeline structure and connectionsDocumentContract- Enforces required metadata (celex,eId)PipelineRepository- Database persistence for pipelines and executions
Execution Flow
Pipeline definition validated via
PipelineValidatorDocuments validated against
DocumentContractHaystack
Pipelinebuilt from component definitionsPipeline executed with input documents
Results persisted via repository layer