Hello guys I would like to know if it is possible to extract Kedro #beginners-need-help

mulajumento

04/05/2022, 1:15 AM

Hello guys! I would like to know if it is possible to "extract" the file path of a partitioned dataset from the catalog when it is used as an input in a node. I searched the internet for alternatives but I couldn't find a solution for it.

datajoely

04/05/2022, 4:32 AM

So this has come up before

datajoely

04/05/2022, 4:34 AM

This thread has come up before https://discord.com/channels/778216384475693066/846330075535769601/956485672930275388

datajoely

04/05/2022, 4:34 AM

Maybe useful?

datajoely

04/05/2022, 12:55 PM

The other option is to use hooks that have access to the catalog at runtime

mulajumento

04/05/2022, 2:58 PM

Hello @User! First of all, thanks for your suggestion. Unfortunately I didn't manage to get it working. However, I came up with two possible solutions for my problem:

mulajumento

04/05/2022, 2:59 PM

First, through the catalog's attributes.

mulajumento

04/05/2022, 3:00 PM

Second one, creating a new catalog and overriding it for its path.

datajoely

04/05/2022, 4:05 PM

so I'm not sure either is 'Kedrific'

datajoely

04/05/2022, 4:05 PM

the best avenue we have for this sort of thing is to define a before_node_run hook

datajoely

04/05/2022, 4:05 PM

https://kedro.readthedocs.io/en/latest/kedro.framework.hooks.specs.NodeSpecs.html#kedro.framework.hooks.specs.NodeSpecs.before_node_run

datajoely

04/05/2022, 4:06 PM

you have access to the catalog, inputs and more
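
A minimal sketch of what such a hook could look like, under several assumptions. The class name `PartitionPathHooks`, the dataset name `meshes` and the input name `params:mesh_path` are hypothetical. `catalog._get_dataset` and the dataset's `_path` attribute are private Kedro internals as found in the 0.17/0.18 releases, so they may differ or disappear in other versions. The sketch relies on the documented behaviour that a dictionary returned from before_node_run is used to update the node's inputs.

```python
try:
    from kedro.framework.hooks import hook_impl
except ImportError:
    # Fallback so the sketch can be read and exercised without Kedro installed.
    def hook_impl(func):
        return func


class PartitionPathHooks:
    """Hypothetical hook that hands a dataset's base path to a node input."""

    def __init__(self, dataset_name, path_input):
        self.dataset_name = dataset_name  # catalog entry whose path we want
        self.path_input = path_input      # node input to overwrite with the path

    @hook_impl
    def before_node_run(self, node, catalog, inputs):
        # Only act on nodes that actually declare the placeholder input.
        if self.path_input not in inputs:
            return None
        # ASSUMPTION: private API, not a stable public interface.
        dataset = catalog._get_dataset(self.dataset_name)
        # ASSUMPTION: partitioned datasets keep their base path in `_path`.
        path = getattr(dataset, "_path", None)
        if path is None:
            return None
        # The returned mapping updates the inputs passed to the node.
        return {self.path_input: str(path)}
```

In a Kedro 0.17+/0.18 project this would typically be registered in `settings.py` with something like `HOOKS = (PartitionPathHooks("meshes", "params:mesh_path"),)`, with a placeholder `mesh_path` entry in the parameters file so the node can declare it as an input.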

mulajumento

04/05/2022, 5:48 PM

Just to contextualise: my last node writes some information calculated in the pipeline to a SQL data set, and it also needs to write the path where an STL (also generated by the pipeline) is stored. The path in my database is important because the mesh is not stored directly in a SQL table. Since this mesh has to be loaded in the front-end, we need its path as a reference. From my point of view, a hook is rather a way to observe the pipeline's flow and, as I understand it, in your suggestion the hook plays the role of a node that transforms data. In that case, wouldn't it be more "kedrific" to use a node instead of a hook? Or did I understand it wrong?

datajoely

04/05/2022, 5:49 PM

We like to ensure that the nodes have no knowledge of IO
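
As a rough illustration of that principle, a node can stay a pure function that receives the mesh location as a plain string and returns a record, leaving the actual SQL write to whatever dataset the catalog maps the output to. The function name `build_mesh_record` and the field names are hypothetical, and how `mesh_path` reaches the node (for example as a parameter) is left open here.

```python
def build_mesh_record(metrics: dict, mesh_path: str) -> dict:
    """Combine computed metrics with the mesh location.

    The function neither opens files nor talks to a database; it only
    builds the row that a catalog dataset would later persist.
    """
    record = dict(metrics)            # copy so the caller's dict is not mutated
    record["mesh_path"] = mesh_path   # hypothetical column name
    return record
```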

datajoely

04/05/2022, 5:49 PM

This is a good example of a bit of a grey area

datajoely

04/05/2022, 5:49 PM

In truth we should think of hooks as the main way to extend kedro

datajoely

04/05/2022, 5:49 PM

And if you ever find yourself creating a context, that's a code smell telling you that you've gone too far

mulajumento

04/05/2022, 5:57 PM

Hmm, now I understand why the solutions aren't kedrific. I will give it some thought and find a way to solve my problem "kedrificly".

mulajumento

04/05/2022, 5:57 PM

Thank you very much, @User, for the answers and for being so fast 😄

datajoely

04/05/2022, 5:57 PM

Pleasure
