datajoely
12/09/2021, 3:02 PM
**_fs_args in our read/write options
datajoely
12/09/2021, 3:02 PM
self._fs = fsspec.filesystem(self._protocol, **_credentials, **_fs_args)
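A minimal sketch of what that call shape implies, assuming nothing beyond standard Python keyword unpacking. `fake_filesystem` is a hypothetical stand-in for `fsspec.filesystem`, used so the snippet runs without any dependencies, and the dictionary values are illustrative only. Because both dicts are unpacked into one call, a key that appears in both would be rejected by Python.

```python
def fake_filesystem(protocol, **storage_options):
    """Hypothetical stand-in for fsspec.filesystem: echoes what it received."""
    return {"protocol": protocol, **storage_options}


# Illustrative values only; not taken from any real configuration.
_credentials = {"username": "my_username", "password": "password"}
_fs_args = {"port": 22}

# Same call shape as the quoted line: both dicts are unpacked into one call.
fs = fake_filesystem("sftp", **_credentials, **_fs_args)
print(fs)

# A key present in both dicts is passed twice, which Python rejects.
try:
    fake_filesystem("sftp", **_credentials, **{"username": "other"})
except TypeError as exc:
    duplicate_error = str(exc)
    print(duplicate_error)
```

In practice this suggests keeping connection secrets in the credentials and everything else in the filesystem arguments, so the two never share a key.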
datajoely
12/09/2021, 3:13 PM
datajoely
12/09/2021, 3:13 PM
Isaac89
12/09/2021, 3:14 PM
datajoely
12/09/2021, 3:14 PM
datajoely
12/09/2021, 3:14 PM
credentials stuff that Kedro does by default in the catalog
datajoely
12/09/2021, 3:15 PM
fsspec is a cool project - big fans of how they've approached this
Isaac89
12/09/2021, 3:17 PM
datajoely
12/09/2021, 3:17 PM
datajoely
12/09/2021, 3:18 PM
datajoely
12/09/2021, 3:18 PM
Isaac89
12/09/2021, 3:51 PM
# requires paramiko -> pip install paramiko
# in conf/local/catalog.yml
test_fsspec:
  type: pandas.CSVDataSet
  filepath: "sftp:///path/to/remote_cluster/cool_data.csv"
  credentials: cluster_credentials
  load_args:
    sep: ","
    index_col: 0
  save_args:
    index: True
    encoding: "utf-8"

# in conf/local/credentials.yml
cluster_credentials:
  username: my_username
  host: host_address
  port: 22
  password: password

# in jupyter lab
catalog.load("test_fsspec")
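A rough sketch of why the host lives in the credentials rather than in the filepath, using only the standard library. The triple slash in the `sftp:///...` URL leaves the host part empty, so the connection details would have to come from the `cluster_credentials` entry. How Kedro actually splits the filepath internally is an assumption here; `connection` is a hypothetical name used for illustration.

```python
from urllib.parse import urlsplit

filepath = "sftp:///path/to/remote_cluster/cool_data.csv"
parts = urlsplit(filepath)

protocol = parts.scheme      # scheme before "://"
host_in_url = parts.netloc   # empty because of the triple slash
remote_path = parts.path     # absolute path on the remote machine

# Values copied from the cluster_credentials entry in the message.
cluster_credentials = {
    "username": "my_username",
    "host": "host_address",
    "port": 22,
    "password": "password",
}

# Assumption: since the URL names no host, the credentials supply it.
connection = {"protocol": protocol, **cluster_credentials}

print(protocol, repr(host_in_url), remote_path)
print(connection["host"])
```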
datajoely
12/09/2021, 4:21 PM
datajoely
12/09/2021, 4:21 PM
datajoely
12/09/2021, 4:21 PM
. character on GitHub
Isaac89
12/09/2021, 4:26 PM
datajoely
12/09/2021, 4:28 PM
Zemeio
12/10/2021, 9:37 AM
datajoely
12/10/2021, 9:37 AM
datajoely
12/10/2021, 9:38 AM
Zemeio
12/10/2021, 9:39 AM
Zemeio
12/10/2021, 9:39 AM
Zemeio
12/10/2021, 9:39 AM
datajoely
12/10/2021, 9:39 AM
datajoely
12/10/2021, 9:42 AM
datajoely
12/10/2021, 9:43 AM
Zemeio
12/10/2021, 9:45 AM
Zemeio
12/10/2021, 9:52 AM
Zemeio
12/10/2021, 9:53 AM
Zemeio
12/10/2021, 9:53 AM