Associate
October 17, 2024
Question

data collection


How many data points are necessary to build a reliable machine learning model (with mlc)? 

Is there a point where adding more data no longer significantly improves the model's performance?

1 reply

Federica Bossi
Technical Moderator
October 23, 2024

Hi @SamuPre,

There is no fixed rule for how many data points you need. Standard ML practice is to divide the dataset into three parts (training, validation and test): the model is built on the training data, tuned to maximise accuracy on the validation data, and finally evaluated on the test data, which the model has never seen.
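As a rough illustration of the three-way split described above, here is a minimal Python sketch. The function name `split_dataset` and the 70/15/15 proportions are assumptions chosen for the example; they are not a recommendation from this thread or a feature of any particular tool, so adjust them to your own dataset.

```python
import random


def split_dataset(samples, train_frac=0.7, val_frac=0.15, seed=0):
    """Shuffle samples and split them into (train, validation, test) lists.

    The fractions are illustrative assumptions; whatever is left after
    the training and validation parts becomes the test set.
    """
    if train_frac <= 0 or val_frac < 0 or train_frac + val_frac >= 1:
        raise ValueError("fractions must leave room for a test set")

    # Copy before shuffling so the caller's data is left untouched.
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)

    n_train = round(len(shuffled) * train_frac)
    n_val = round(len(shuffled) * val_frac)

    train = shuffled[:n_train]
    val = shuffled[n_train:n_train + n_val]
    test = shuffled[n_train + n_val:]  # never used while building the model
    return train, val, test


if __name__ == "__main__":
    data = list(range(100))  # stand-in for 100 labelled samples
    train, val, test = split_dataset(data)
    print(len(train), len(val), len(test))
```

The test part is only looked at once, at the end, so that the reported accuracy reflects data the model has never seen.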
