-to test it, we need to split the data into training set (70) and testing set (30), but that’s not where splitting stops, we split it more to work on more parameters

(confusing ? don’t worry, i got the bullet points)

image.png

1) Training set

2) Development (Validation) set / Holdout validation

3) Testing set

4) Train-Dev set (optional)

image.png

image.png