# advanced-need-help
Inspired by the spaceflights modular pipelines code, I'd like to do something like this. But how do I get those `train_data` and `validation_data` outputs to create the subsequent pipelines with?
```python
from kedro.pipeline import Pipeline, node, pipeline

cv_split_pipe = Pipeline(
    [
        node(
            func=nodes.cv_split,
            inputs=["data", "params:fold_config"],
            outputs=["train_data", "validation_data"],  # train_data and validation_data are lists, one index per fold
        )
    ]
)

# get the train_data and validation_data outputs somehow
train_data = []
validation_data = []

# build modular pipeline
pipelines = []
# enumerate yields (index, (train_set, validation_set)), so the zipped pair must be unpacked in parentheses
for i, (train_set, validation_set) in enumerate(zip(train_data, validation_data)):
    pipelines.append(
        pipeline(
            pipe=new_inference_pipeline(),
            inputs=[train_set, validation_set],
            outputs={"y_pred": f"y_pred_{i}"},
        )
    )
final_pipeline = sum(pipelines)
```
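
For context on the empty placeholder lists: Kedro assembles the pipeline before any node runs, so the loop can only work with dataset names, not with the lists that `cv_split` returns at run time. A minimal sketch, assuming the number of folds is known when the pipeline is built, of generating one pair of names per fold. The helper `fold_dataset_names`, its `n_folds` argument, and the `train_data_{i}` / `validation_data_{i}` naming pattern are hypothetical and not part of Kedro or the spaceflights starter.

```python
# Hypothetical helper: the function name and naming pattern are assumptions
# made for illustration, not part of Kedro or the original code.
def fold_dataset_names(n_folds: int) -> list[tuple[str, str]]:
    """Return one (train, validation) dataset-name pair per fold."""
    return [(f"train_data_{i}", f"validation_data_{i}") for i in range(n_folds)]


# enumerate yields (index, (train_name, validation_name)),
# so the pair has to be unpacked inside parentheses.
for i, (train_name, validation_name) in enumerate(fold_dataset_names(3)):
    print(i, train_name, validation_name)
```

Names like these could, in principle, be mapped into each copy of the inference pipeline, but whether `cv_split` can emit one output per fold this way depends on how that node is written.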