IBM Watson Data & AI - Structured Ideas

This Ideas portal is being closed. Please enter new ideas at http://ibm.biz/IBMAnalyticsIdeasPortal

More control over model train/test/holdout data cuts

Given that a model can be affected by the data used to train it, it would be nice to have better control over how the data is split up.

A fairly easy improvement is to allow the user to set a seed for controlling the splits, so that the same split can be reproduced in the future.
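To illustrate what a seeded split could look like, here is a minimal sketch in plain Python. It is not how the product does its splitting; the function name `split_indices`, its parameters, and the 60/20/20 default proportions are all assumptions made up for this example.

```python
import random

def split_indices(n_rows, seed, train_frac=0.6, test_frac=0.2):
    """Hypothetical helper: shuffle row indices with a fixed seed and
    cut them into train / test / holdout lists."""
    rng = random.Random(seed)      # local generator; global random state is untouched
    indices = list(range(n_rows))
    rng.shuffle(indices)           # same seed -> same shuffle order
    n_train = round(n_rows * train_frac)
    n_test = round(n_rows * test_frac)
    train = indices[:n_train]
    test = indices[n_train:n_train + n_test]
    holdout = indices[n_train + n_test:]   # whatever remains
    return train, test, holdout

# Calling the helper twice with the same seed reproduces the split.
first = split_indices(100, seed=42)
second = split_indices(100, seed=42)
print(first == second)
```

Storing the seed alongside the trained model would then be enough to recreate the exact train/test/holdout cut later, provided the row order of the input data has not changed.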

A more complex change is to allow the user to specify how many different variations of the train/test split to generate and run models against, either combining the results into an ensemble or selecting the best of the variations as the "final" result.
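The multi-split idea could be sketched roughly as follows. This is only an illustration under stated assumptions: the model is a toy one-variable least-squares line standing in for whatever the product trains, and every name here (`split`, `fit_line`, `run_variations`, `select_best`, `ensemble_predict`) is hypothetical.

```python
import random
from statistics import mean

def split(n_rows, seed, train_frac=0.7):
    """Seeded train/test split over row indices (hypothetical helper)."""
    rng = random.Random(seed)
    idx = list(range(n_rows))
    rng.shuffle(idx)
    cut = round(n_rows * train_frac)
    return idx[:cut], idx[cut:]

def fit_line(xs, ys):
    """Least-squares fit of y = a*x + b; a toy stand-in for a real model.
    Assumes the training xs are not all identical."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    a = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    return a, my - a * mx

def run_variations(xs, ys, n_variations, base_seed=0):
    """Train one model per split variation and score it on that split's test rows."""
    results = []
    for seed in range(base_seed, base_seed + n_variations):
        train, test = split(len(xs), seed)
        a, b = fit_line([xs[i] for i in train], [ys[i] for i in train])
        mse = mean((ys[i] - (a * xs[i] + b)) ** 2 for i in test)
        results.append({"seed": seed, "model": (a, b), "test_mse": mse})
    return results

def select_best(results):
    """Option 1: keep the variation with the lowest test error."""
    return min(results, key=lambda r: r["test_mse"])

def ensemble_predict(results, x):
    """Option 2: average the predictions of all variations."""
    return mean(r["model"][0] * x + r["model"][1] for r in results)

# Usage on made-up data: a noisy line, five split variations.
noise = random.Random(123)
xs = list(range(20))
ys = [2 * x + 1 + noise.gauss(0, 0.5) for x in xs]
results = run_variations(xs, ys, n_variations=5)
print(select_best(results)["seed"], ensemble_predict(results, 10))
```

One caveat with the "select the best" option: the winning variation's test score is optimistically biased, since it was chosen because it scored well, so a separate holdout set that no variation ever sees is still useful for the final evaluation.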

 

 

  • Guest
  • Dec 14 2018
  • Under Consideration