datajoely
04/28/2022, 9:45 AM
Zemeio
04/28/2022, 9:46 AM
Zemeio
04/28/2022, 9:46 AM
datajoely
04/28/2022, 9:46 AM
Zemeio
04/28/2022, 9:47 AM
datajoely
04/28/2022, 9:47 AM
Zemeio
04/28/2022, 11:49 AM
s3fs==2022.1.0
boto3==1.20.24
aiobotocore==2.1.2
And the catalog:
yaml
my.sample.image.int:
  type: PartitionedDataSet
  dataset: kedro.extras.datasets.pillow.ImageDataSet
  path: ${my.bucket}/${folder.int}/sample-images/
  filename_suffix: ".jpg"
Zemeio
04/28/2022, 11:50 AM
datajoely
04/28/2022, 11:50 AM
datajoely
04/28/2022, 11:51 AM
Zemeio
04/28/2022, 11:51 AM
datajoely
04/28/2022, 11:51 AM
yaml
type: PartitionedDataSet
path: ${my.bucket}/${folder.int}/sample-images/
dataset:
  type: kedro.extras.datasets.pillow.ImageDataSet
filename_suffix: '.jpg'
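To illustrate what the `filename_suffix` setting in the catalog entry above is meant to do, here is a minimal sketch. It assumes the path of each partition is built by joining the dataset `path` with the partition key and then appending the suffix. The helper name `partition_to_path` and the bucket path are hypothetical and are not Kedro's API.

```python
import os


def partition_to_path(base_path: str, partition_id: str,
                      filename_suffix: str = "", sep: str = "/") -> str:
    """Hypothetical helper: join base path and partition key, then append the suffix."""
    joined = sep.join([base_path.rstrip(sep), partition_id.lstrip(sep)])
    return joined + filename_suffix


base = "s3://example-bucket/int/sample-images/"  # placeholder, not the real bucket

with_suffix = partition_to_path(base, "img_001", ".jpg")
without_suffix = partition_to_path(base, "img_001")

# The extension a downstream writer could infer from each path.
print(with_suffix, repr(os.path.splitext(with_suffix)[1]))
print(without_suffix, repr(os.path.splitext(without_suffix)[1]))
```

Under this assumption, a partition key without the suffix yields a path with an empty extension, so anything that infers the image format from the filename has nothing to work with.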
datajoely
04/28/2022, 11:51 AM
Zemeio
04/28/2022, 11:59 AM
datajoely
04/28/2022, 12:00 PM
datajoely
04/28/2022, 12:00 PM
Zemeio
04/28/2022, 12:00 PM
Exception has occurred: DataSetError (note: full exception trace is shown but execution is paused at: _run_module_as_main)
Failed while saving data to data set ImageDataSet(filepath=<hidden>.jpg, protocol=s3, save_args={}).
unknown file extension:
  File "\usr\local\lib\python3.8\site-packages\PIL\Image.py", line 2278, in save
    format = EXTENSION[ext]

The above exception was the direct cause of the following exception:

  File "\usr\local\lib\python3.8\site-packages\kedro\io\core.py", line 210, in save
    self._save(data)
  File "\usr\local\lib\python3.8\site-packages\kedro\extras\datasets\pillow\image_dataset.py", line 120, in _save
    data.save(fs_file, **self._save_args)
  File "\usr\local\lib\python3.8\site-packages\PIL\Image.py", line 2280, in save
    raise ValueError(f"unknown file extension: {ext}") from e
Zemeio
04/28/2022, 12:02 PM
Traceback (most recent call last):
  File "/usr/local/lib/python3.8/site-packages/PIL/Image.py", line 2278, in save
    format = EXTENSION[ext]
KeyError: ''
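The `KeyError: ''` above means the extension being looked up was an empty string. The lookup pattern visible in the traceback can be sketched with the standard library alone. The `EXTENSION` dict and the `infer_format` helper below are illustrative stand-ins, not Pillow's actual registry or code.

```python
import os

# Stand-in registry mapping extensions to format names.
# The entries are illustrative and not copied from Pillow.
EXTENSION = {".jpg": "JPEG", ".jpeg": "JPEG", ".png": "PNG"}


def infer_format(filename: str) -> str:
    """Hypothetical helper: map a filename's extension to a format name."""
    ext = os.path.splitext(filename)[1].lower()
    try:
        return EXTENSION[ext]
    except KeyError as e:
        # Same shape as the error in the trace: the message ends with the
        # extension, so an empty extension leaves nothing after the colon.
        raise ValueError(f"unknown file extension: {ext}") from e


print(infer_format("sample-images/cat.jpg"))

try:
    infer_format("")  # no usable filename, so the extension is ''
except ValueError as err:
    print(repr(str(err)))
```

This would explain why the message in the log ends right after the colon: the extension that failed the lookup was empty, even though the dataset's `filepath` ends in `.jpg`.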
datajoely
04/28/2022, 12:51 PM
\usr\local\lib\python3.8\site-packages\PIL\Image.py ?
datajoely
04/28/2022, 12:53 PM
Image.py in PIL
Zemeio
04/28/2022, 12:55 PM
Zemeio
04/28/2022, 12:56 PM
datajoely
04/28/2022, 12:56 PM
${my.bucket}/${folder.int}/sample-images/ ?
Zemeio
04/28/2022, 12:58 PM
datajoely
04/28/2022, 12:58 PM
PartitionedDataSet
https://github.com/kedro-org/kedro/blob/676600c4b63eec53c13fc4e2536d0a990dac77ce/kedro/io/partitioned_dataset.py#L244
datajoely
04/28/2022, 12:59 PM
. in the suffix just in case
Zemeio
04/28/2022, 1:00 PM
datajoely
04/28/2022, 1:00 PM
datajoely
04/28/2022, 1:00 PM
datajoely
04/28/2022, 1:00 PM