# beginners-need-help
n
Hello, everybody! It's my first day playing with Kedro and I'm getting the following error when I run
kedro run
during the "Extend the data processing pipeline" step of the spaceflights tutorial (that's where you add the
pandas.ParquetDataset
to the catalog):
kedro.io.core.DataSetError: Class `pandas.ParquetDataset` not found or one of its dependencies has not been installed.
Any ideas on what the issue is? I've spent quite a few hours playing around with conda environments, tried both on Windows and WSL, but I keep getting the same error. Googling around has not helped either.
d
Have you done these steps of the tutorial?
Kedro ships the smallest possible version of itself
And we need you to install the extra parts as and when it's needed
n
I've done these. I've also tried
pip install "kedro[pandas]"
pip install "kedro[all]"
pip install "kedro[pandas.ParquetDataset]"
none made any difference
d
Without the quotes?
n
i did try that but let me give it another shot
d
The other option is
pip install kedro["pandas"]
n
nah, all requirements already satisfied
n
Can you check if you have pyarrow installed?
d
pip freeze
n
pyarrow is installed--i think it comes as a pandas dependency?
d
When this comes up it's usually one of two things: 1) the dependencies are not installed, or 2) they are installed, but in the wrong environment
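Both causes can be checked from Python itself. The following is a minimal sketch, not part of Kedro: `check_environment` is a hypothetical helper name, and the package list is an assumption based on the packages mentioned in this thread. It prints the interpreter that is running and whether each package can be imported from it.

```python
import importlib.util
import sys


def check_environment(packages):
    """Return {package: bool}, True if the package is importable
    from the interpreter that is currently running.

    Hypothetical helper for illustration; pass top-level package names only.
    """
    return {name: importlib.util.find_spec(name) is not None for name in packages}


if __name__ == "__main__":
    # The interpreter path should sit inside the environment you expect,
    # for example the conda env created for Kedro.
    print("Interpreter:", sys.executable)
    for name, found in check_environment(["kedro", "pandas", "pyarrow"]).items():
        print(f"{name}: {'found' if found else 'MISSING'}")
```

If the interpreter path points outside the intended environment, or a package shows as missing, the problem is one of the two cases above. If everything is found, as turned out to be the case here, the cause lies elsewhere.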
n
i have created a new conda env specifically for playing around with kedro
i'm pretty sure this is not the issue
d
If requirements are installed to the environment kedro shouldn't have an issue
Can you post the full stack trace?
All we're doing here is a thin wrapper on top of pandas
n
what does "full stack trace" refer to?
d
The full error that you get when you try and run
Also the results of pip freeze would be helpful
n
the output of
kedro run
d
If you type
which kedro
what do you get?
n
it's the path to kedro in the current conda env:
/home/thanos/miniconda3/envs/test-kedro/bin/kedro
d
This is very weird
n
we agree on that!
n
I think it was a typo
Try
pandas.ParquetDataSet
d
With the space?
n
The error shows you have ParquetDataset instead
It needs a capital "S"
d
Ah gotcha
It's in your catalog entry!
Good spot young @noklam
Yeah, it's DataSet with a capital S
n
it should be
pandas.ParquetDataSet
right?
d
Yes
n
wow, good catch!
thanks, guys!
i've wasted more than 4 hours on this
d
No worries, shout if you have any other problems
n
😂
d
I'm going to put some thought into how we could give you a better error message
Good luck!
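One way such a message could work is to compare the misspelled name against the valid ones and suggest the closest match. This is only a sketch under stated assumptions, not how Kedro resolves dataset types: `suggest_type` and `KNOWN_TYPES` are hypothetical names, and the short list of types is illustrative rather than complete.

```python
import difflib

# Hypothetical list of valid names, for illustration only. A real check would
# need the names that the installed Kedro version actually provides.
KNOWN_TYPES = ["pandas.CSVDataSet", "pandas.ExcelDataSet", "pandas.ParquetDataSet"]


def suggest_type(name, known=KNOWN_TYPES):
    """Return a hint for an unknown dataset type, or None if the name is valid."""
    if name in known:
        return None
    # First look for a name that differs only by letter case.
    for candidate in known:
        if candidate.lower() == name.lower():
            return (
                f"Class `{name}` not found. Did you mean `{candidate}`? "
                "Names are case-sensitive."
            )
    # Otherwise fall back to the closest fuzzy match, if there is one.
    close = difflib.get_close_matches(name, known, n=1)
    if close:
        return f"Class `{name}` not found. Did you mean `{close[0]}`?"
    return f"Class `{name}` not found."


if __name__ == "__main__":
    print(suggest_type("pandas.ParquetDataset"))
```

With the catalog entry from this thread, the hint would point at `pandas.ParquetDataSet` instead of suggesting a missing dependency.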
n
actually, it failed again
d
What error?
n
in a different way though
d
Different is good
n
true that
a KeyError
i think i can look into that
d
👌
n
if i get stuck i'll reach out
@datajoely and @noklam, i really appreciate the help
what a cool community!
wow, it's been a very long time
n
Glad it helps 😉
n
haven't had this since the time when I was playing around with Julia and those guys were all over the place to help people
again, thank you both!