9/11/2023 0 Comments The homework clubSo what's the size of the saved DictVectorizer file? Tip: go to 02-experiment-tracking/homework/ folder before executing the command and change the value of to the location where you saved the data. Python preprocess_data.py -raw_data_path -dest_path. Your task is to download the datasets and then execute this command: save the preprocessed datasets and the DictVectorizer to disk.fit a DictVectorizer on the training set (January 2022 data),.load the data from the folder (the folder where you have downloaded the data),.Use the script preprocess_data.py located in the folder homework to preprocess the data. We'll use the Green Taxi Trip Records dataset to predict the amount of tips for each trip.ĭownload the data for January, February and March 2022 in parquet format from here. Once you installed the package, run the command mlflow -version and check the output. To get started with MLflow you'll need to install the appropriate Python package.įor this we recommend creating a separate Python environment, for example, you can use conda environments,Īnd then install the package there with pip or conda. The goal of this homework is to get familiar with tools like MLflow for experiment tracking and
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |