# Data Preparation

![](https://4117164708-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F-MFCNLySTC0Jf6imOp3y%2Fuploads%2Fgit-blob-38085d21ea60c0d42f70eab10b18e90a8801531f%2FLifecycle\(1\).png?alt=media)

As mentioned in the [Data Access](https://docs.productizeml.com/productize-ml/data-management/untitled) lecture, getting access to data that matches your problem domain and curating it is time-consuming and demanding work. For this reason, it can introduce risk factors like those listed in the *ML risk impact assessment* table from [Business Objectives](https://docs.productizeml.com/productize-ml/productize-it/business-objectives).

Data and ML technologies always live in **symbiosis**. Think of ML as the hammer and data as the nail: the hammer is useless without the nail. Moreover, there are many types of nails and hammers, and only the right match will work.

Keeping this in mind, we can think about three common scenarios:

1. **We have data, but the technology is not ready**. In this case you must spend time understanding your data in order to know which technology is best to use. Having plenty of data does not necessarily mean that it can feed your ML algorithm. Once you have *learned* your data, it is time to hire data scientists to come up with the most suitable technology.
2. **We have (or think we have) the technology ready, but we lack data**. The solution seems well thought out and we know which direction to take, but we still need to collect the right data.
3. **Data and technology are available**. This is the clearest case: you should start developing and training your models, deploy your product, and iterate on it to clearly understand your customers' needs.
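
For the first scenario, "understanding your data" can start with a very small audit. The sketch below is only an illustration, not a prescribed method: the function name `summarize_dataset`, the record layout (a list of dictionaries), and the toy records are all assumptions made for this example. It reports how many records exist, what fraction of each field is missing, and how many examples each label has.

```python
from collections import Counter


def summarize_dataset(rows, label_key):
    """Return basic readiness signals for a list of dict records (hypothetical helper)."""
    n_rows = len(rows)
    # Collect every field name that appears in at least one record.
    fields = sorted({key for row in rows for key in row})
    # Fraction of records where each field is absent or None.
    missing = {}
    if n_rows:
        missing = {
            field: sum(1 for row in rows if row.get(field) is None) / n_rows
            for field in fields
        }
    # Number of examples per label; unlabeled records are skipped.
    labels = Counter(
        row[label_key] for row in rows if row.get(label_key) is not None
    )
    return {
        "n_rows": n_rows,
        "missing_fraction": missing,
        "label_counts": dict(labels),
    }


# Toy records invented for illustration only.
records = [
    {"text": "late delivery", "label": "complaint"},
    {"text": "great service", "label": "praise"},
    {"text": None, "label": "complaint"},
    {"text": "where is my order"},
]
summary = summarize_dataset(records, label_key="label")
print(summary)
```

A high missing fraction or a heavily skewed label count is an early hint that plenty of data may still not be enough to feed an ML algorithm, and that more collection or curation is needed first.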

