Powered by Linen
beginners-need-help
  • b

    Barros

    09/30/2022, 7:01 PM
    In the base package
  • d

    datajoely

    09/30/2022, 7:11 PM
    @noklam any idea?
  • n

    noklam

    09/30/2022, 7:59 PM
    The workaround posted there should work. With 0.18.3 we already disable it by default on Databricks. https://github.com/Textualize/rich/issues/2461
  • a

    antheas

    10/01/2022, 5:22 PM
    Kedro does some things when it hooks into logging that it shouldn't, and that messes up my setup. For example, it loads a hardcoded default config before initialization and only then loads my own logging.yml. This causes the loggers to print useless config messages with the wrong formatting for numexpr and for the Kedro project dir message. It's the same with the rich pretty hook and the callbacks. I have to redo all the rich initialization in my settings.py file:
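    import logging

    from rich.traceback import install

    from .progress import RICH_TRACEBACK_ARGS  # project-local module holding the install() kwargs

    # Route warnings.warn() messages through the logging system
    logging.captureWarnings(True)
    # TODO: verify this works
    # Remove handlers added by the default config
    logging.getLogger("kedro").handlers = []
    logging.root.handlers = []
    # Reinstall the rich traceback handler with the project's own settings
    install(**RICH_TRACEBACK_ARGS)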
    I also don't like how rich decides to use HTML elements for logging and formatting in Jupyter, because it's clunky, larger (at least in VS Code), and doesn't have a stable appearance when I commit my notebooks. So I need to run the following botch function after reload_kedro:
    def _reconfigure_rich():
        from rich import get_console, reconfigure

        # PBAR_JUP_NCOLS is a constant defined elsewhere in the project
        _rich_console_args = {
            "width": PBAR_JUP_NCOLS,
            "height": 100,
        }

        reconfigure(**_rich_console_args)
        # Fetch the global console after reconfigure so it is never None
        _console = get_console()

        # Disable html rendering when using jupyter
        # force_jupyter=False messes with pretty print
        _console_check_buffer = _console._check_buffer

        def non_html_check_buffer(self):
            # Pretend to be outside jupyter only while the buffer is flushed
            tmp = self.is_jupyter
            self.is_jupyter = False
            try:
                _console_check_buffer()
            finally:
                self.is_jupyter = tmp

        _console._check_buffer = non_html_check_buffer.__get__(_console)
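The handler reset described in this message can be sketched with the standard library alone. This is a minimal sketch, not Kedro's own behaviour: `LOGGING_CONFIG` and `reset_logging` are hypothetical names introduced here for illustration, and a real project would load its own logging.yml instead of the inline dictionary.

```python
import logging
import logging.config

# Hypothetical minimal config; a real project would load its own logging.yml.
LOGGING_CONFIG = {
    "version": 1,
    "disable_existing_loggers": False,
    "formatters": {"plain": {"format": "%(levelname)s %(name)s: %(message)s"}},
    "handlers": {
        "console": {"class": "logging.StreamHandler", "formatter": "plain"}
    },
    "root": {"level": "INFO", "handlers": ["console"]},
}


def reset_logging(config, logger_names=("kedro",)):
    """Drop handlers attached earlier, then apply ``config`` (hypothetical helper)."""
    for name in logger_names:
        # Remove handlers that an earlier default config attached to this logger.
        logging.getLogger(name).handlers.clear()
    logging.root.handlers.clear()
    # Apply the project's own configuration on a clean slate.
    logging.config.dictConfig(config)


reset_logging(LOGGING_CONFIG)
logging.getLogger("kedro").info("this record is handled by the root console handler")
```

Clearing the handlers before calling `dictConfig` means records from the "kedro" logger propagate to the single root handler rather than being emitted twice with two different formats.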
  • l

    LeonG

    10/03/2022, 3:08 PM
    Started working with versioned datasets using the catalog.yml. I am now trying to define which version of the dataset my pipeline should load by using a config.yml file and passing that config to the run command. But whatever format I try in the config it only loads the latest version of the dataset. My current config file looks like this:
    run:
      load-version: "dataset_name:2022-09-30T10.05.39.612Z"
    Am I formatting my config file/parameters wrong?
  • n

    noklam

    10/03/2022, 3:10 PM
    @LeonG Can you try load_version instead? Please vote or drop a comment on the issue if you can: https://github.com/kedro-org/kedro/issues/1791
  • l

    LeonG

    10/03/2022, 3:19 PM
    That worked, thank you! I also had to make sure the params were in a list; the following works:
    load_version: ['dataset_name:2022-09-30T10.05.39.612Z']
  • n

    noklam

    10/03/2022, 3:40 PM
    I think a native YAML list would work equally well, though I may be wrong:
    load_version:
      - dataset_name:2022-09-30T10.05.39.612Z
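Both list formats above carry entries of the form dataset_name:version. As a rough illustration of how such entries could be turned into a mapping, here is a minimal sketch. `parse_load_versions` is a hypothetical helper written for this note and is not Kedro's implementation.

```python
from typing import Dict, Iterable


def parse_load_versions(entries: Iterable[str]) -> Dict[str, str]:
    """Turn 'dataset_name:version' strings into a {dataset_name: version} dict."""
    versions = {}
    for entry in entries:
        # Split on the first colon only, so the version may itself contain colons.
        name, sep, version = entry.partition(":")
        if not sep or not name or not version:
            raise ValueError(f"Expected 'dataset_name:version', got {entry!r}")
        versions[name.strip()] = version.strip()
    return versions


print(parse_load_versions(["dataset_name:2022-09-30T10.05.39.612Z"]))
# prints {'dataset_name': '2022-09-30T10.05.39.612Z'}
```

In this sketch a bare string passed instead of a list would be iterated character by character and rejected, which is one reason to keep the entries in a list.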
  • s

    sjster

    10/03/2022, 5:30 PM
    Hello, I am trying to set up a pipeline in a Kedro project and I get this error on the command line: 'No such command: pipeline'
  • a

    Arnaldo

    10/03/2022, 11:42 PM
    @sjster You probably either don't have Kedro installed in your environment or you're not running the command from the root of your project
  • s

    sjster

    10/03/2022, 11:44 PM
    @Arnaldo - my setup is not the most ideal - I am using Kedro from within a poetry environment
  • a

    Arnaldo

    10/03/2022, 11:52 PM
    you can try running
    poetry run kedro pipeline --help
  • s

    sjster

    10/03/2022, 11:59 PM
    That gives me the error I listed above
  • s

    sjster

    10/04/2022, 12:05 AM
    message has been deleted
  • s

    sjster

    10/04/2022, 12:06 AM
    That is the result of kedro --help
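One way to check the "root of your project" suggestion from this thread is to look for the project marker from the current directory upwards. This is a minimal sketch under an assumption: that the project root is the directory whose pyproject.toml contains a [tool.kedro] section, which is how projects generated around this Kedro version are laid out as far as I know. `find_kedro_project_root` is a hypothetical helper, not part of Kedro.

```python
from pathlib import Path
from typing import Optional


def find_kedro_project_root(start: Path) -> Optional[Path]:
    """Return the nearest directory at or above ``start`` whose pyproject.toml
    has a [tool.kedro] section, or None if there is none (assumed marker)."""
    start = start.resolve()
    for candidate in (start, *start.parents):
        pyproject = candidate / "pyproject.toml"
        if pyproject.is_file() and "[tool.kedro]" in pyproject.read_text(encoding="utf-8"):
            return candidate
    return None


root = find_kedro_project_root(Path.cwd())
if root is None:
    print("No Kedro project found here or in any parent directory.")
elif root != Path.cwd().resolve():
    print(f"Project root is {root}; run kedro commands from there.")
else:
    print("Current directory looks like a Kedro project root.")
```

If the helper reports a different directory, running poetry run kedro pipeline --help from that directory is the next thing to try.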
  • l

    LeonG

    10/04/2022, 7:44 AM
    Just tested it, that works too and is more readable. I'll use this format from now on, thank you!
  • n

    noklam

    10/04/2022, 12:48 PM
    Awesome
  • t

    TESSA_BEIJLOOS

    10/04/2022, 1:05 PM
    Hi, I have a question about the tracking.MetricsDataSet from Kedro. I tried to load a metrics dataset through kedro viz but I got a file not found error. I can open it in a Jupyter notebook (with catalog.load), so the file is there. It also works when I declare the dataset as a tracking.JSONDataSet. Is this a bug in Kedro or am I doing something wrong?
  • n

    noklam

    10/04/2022, 1:34 PM
    Could you share the error stack trace?
  • t

    TESSA_BEIJLOOS

    10/04/2022, 1:39 PM
    message has been deleted
  • n

    noklam

    10/04/2022, 1:41 PM
    Are you running from the root directory?
  • t

    TESSA_BEIJLOOS

    10/04/2022, 1:42 PM
    Yes
  • n

    noklam

    10/04/2022, 1:44 PM
    Could you share the catalog entry for this dataset?
  • n

    noklam

    10/04/2022, 1:45 PM
    I ask because the error shows that your data is in certifai/data, whereas data usually sits at the top level of the project directory
  • n

    noklam

    10/04/2022, 1:46 PM
    Does the error message make sense to you? Is that where your data are?
  • t

    TESSA_BEIJLOOS

    10/04/2022, 1:47 PM
    Yes, it makes sense. And if the filepath were wrong, then it wouldn't work with tracking.JSONDataSet, right?
  • t

    TESSA_BEIJLOOS

    10/04/2022, 1:48 PM
    message has been deleted
  • t

    TESSA_BEIJLOOS

    10/04/2022, 1:50 PM
    Also, if I open this in a notebook with catalog.load("test_attribute_metrics"), I can see the data that is in it, and it also says "Loading data from 'test_attribute_metrics' (MetricsDataSet)"
  • n

    noklam

    10/04/2022, 1:51 PM
    So you're saying tracking.JSONDataSet works but tracking.MetricsDataSet causes the error?
  • t

    TESSA_BEIJLOOS

    10/04/2022, 1:52 PM
    Yes, so if I put "type: tracking.JSONDataSet" instead of "type: tracking.MetricsDataSet" it works.