When I build models in Watson, the very first step is to upload my data. I wish I could upload the data as a zip archive and then unzip it in the cloud, so that I have less to upload. For example, a 743MB data file can be compressed to 18MB (http://kdd.ics.uci.edu/databases/kddcup99/kddcup99.html). This would save a lot of time for data scientists, and especially for data engineers, who deal with big data.
I know we can still unzip files in notebooks, but the integration between notebooks and COS is not good. If I try to save the data to Assets from a notebook, the data ends up in the COS bucket, but I cannot find it anywhere in the project.
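For reference, the notebook workaround mentioned above can be sketched with Python's standard `zipfile` module. This is only a minimal illustration of the unzip step: the function name `extract_archive` and the file names in the usage note are hypothetical, and it does not address the COS or project Assets problem described above.

```python
import zipfile
from pathlib import Path


def extract_archive(archive_path, target_dir):
    """Extract every member of a zip archive and return the extracted file paths."""
    target = Path(target_dir)
    # Create the destination folder if it does not exist yet.
    target.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(archive_path) as archive:
        archive.extractall(target)
        names = archive.namelist()
    # Skip directory entries, which end with a slash inside the archive.
    return [target / name for name in names if not name.endswith("/")]
```

In a notebook, this would be called as something like `extract_archive("kddcup.zip", "data")`, where both names are placeholders for an archive that has already been uploaded.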
Why is it useful?
|Who would benefit from this IDEA?|Users who want to use features other than notebooks (SPSS Modeler)|
How should it work?
|Submitting Organization|With Watson|