Roger 12/17/2021, 9:53 PM
) is dependent on a previous node (node
) having uploaded to a database (e.g.
) and I use
as the input for
automatically try to download
to memory (if not already in memory)? I would not want the data downloaded if:
- the data is large, so most of the pipeline time would be spent on downloading
- `B`'s code only runs SQL queries and never requires the data locally