Goss
09/26/2022, 8:56 PMkedro catalog create --pipeline __default__
on the space tutorial, it generates a bunch of datasets not in the catalog:
data_science.active_modelling_pipeline.X_test:
type: MemoryDataSet
data_science.active_modelling_pipeline.X_train:
type: MemoryDataSet
data_science.active_modelling_pipeline.y_test:
type: MemoryDataSet
data_science.active_modelling_pipeline.y_train:
type: MemoryDataSet
data_science.candidate_modelling_pipeline.X_test:
type: MemoryDataSet
data_science.candidate_modelling_pipeline.X_train:
type: MemoryDataSet
data_science.candidate_modelling_pipeline.y_test:
type: MemoryDataSet
data_science.candidate_modelling_pipeline.y_train:
type: MemoryDataSet
Why aren't these included in conf/base/catalog.yml
when their absence causes errors like ValueError: Pipeline input(s) {'data_science.active_modelling_pipeline.y_train', 'data_science.active_modelling_pipeline.X_train'} not found in the DataCatalog
???datajoely
09/27/2022, 3:58 AMGoss
09/27/2022, 11:49 AM