Our model training data sets consist of multiple files (typically 5-10 very large files). We would like the ability to treat these individual files as a single data set so that it can be cataloged and tagged, and so that access controls can be managed at the folder or data set level. When a user discovers or views this data set, we want them to be able to see the group of files that make up the data set and to use one or all of these files in their experiments.
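To make the request concrete, here is a minimal sketch of what such a grouping could look like. It is only an illustration: the `DataSet` class, its fields, and the `list_files` / `select_files` methods are hypothetical names, not part of any existing product API, and access control is reduced to a simple set of allowed user names.

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass
class DataSet:
    """Hypothetical catalog entry that groups several files into one data set."""

    name: str
    files: List[str]
    tags: Set[str] = field(default_factory=set)
    # Assumption: access is granted per data set, not per individual file.
    allowed_users: Set[str] = field(default_factory=set)

    def list_files(self, user: str) -> List[str]:
        """Return every file in the data set if the user has access."""
        if user not in self.allowed_users:
            raise PermissionError(f"{user} cannot access data set {self.name!r}")
        return list(self.files)

    def select_files(self, user: str, wanted: List[str]) -> List[str]:
        """Return a chosen subset of the files for use in an experiment."""
        available = self.list_files(user)
        missing = [f for f in wanted if f not in available]
        if missing:
            raise KeyError(f"not part of data set {self.name!r}: {missing}")
        return list(wanted)
```

With something along these lines, a user who discovers the data set could call `list_files` to see all of its member files, or `select_files` to pick only the ones an experiment needs, while tags and permissions are managed once for the whole group.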
Why is it useful?
|Who would benefit from this IDEA?|Content Provider, Content Consumer, Data Scientist, Data Engineer|
How should it work?