Skip to main content
Explorer
October 17, 2024
Question

data collection

  • October 17, 2024
  • 1 reply
  • 1471 views

How many data points are necessary to build a reliable machine learning model (with mlc)? 

Is there a point where adding more data no longer significantly improves the model's performance?

    This topic has been closed for replies.

    1 reply

    Technical Moderator
    October 23, 2024

    Hi @SamuPre ,

    There is no rule. ML theory says to divide the dataset into three parts (training, validation and test) and the model is built with the training data, maximising the accuracy on validation and then seeing how it performs on the test (data the model has never seen).