In a typical machine learning project, what is the main purpose of the data folder?
Think about where you keep the information your model learns from.
The data folder is where all raw and processed datasets are stored. This includes training, validation, and test data. It is essential for the model to access this data during training and evaluation.
You have written Python code that defines your machine learning model architecture and training functions. Where should you place this code in a well-organized ML project?
Think about where code related to model design logically belongs.
The models folder or a similar src/models path is the best place to keep model architecture and training code. This keeps code organized and separate from data and experiments.
Where in an ML project structure should you place code that calculates and logs evaluation metrics like accuracy or loss during model training?
Think about where you keep code related to checking how well your model performs.
Evaluation metric code is best kept in a dedicated evaluation or metrics folder or module. This separates concerns and makes it easier to update or add new metrics.
You find that your ML project has training scripts inside the data folder and dataset files inside the models folder. What is the main problem caused by this structure?
Think about how clear and easy it is to find things in a project.
Misplacing files causes confusion and makes the project difficult to maintain. Clear separation of data, code, and models helps teamwork and debugging.
In a well-structured ML project, where should you store hyperparameters like learning rate, batch size, and number of epochs to ensure easy tuning and reproducibility?
Think about how to change settings without editing code.
Storing hyperparameters in separate config files like YAML or JSON allows easy tuning without changing code. It also helps track experiments and reproduce results.