Milano is a tool for automating hyper-parameters search for your models on a backend of your choice.
(This is a research project, not an official NVIDIA product.)
Milano (Machine learning autotuner and network optimizer) is a tool for enabling machine learning researchers and practitioners to perform massive hyperparameters and architecture searches.
You can use it to: * Tune your model on a cloud backend of your choice * Benchmark Auto-ML algorithms (see how to add new search algorithm)
Your script can use any framework of your choice, for example, TensorFlow, PyTorch, Microsoft Cognitive Toolkit etc. or no framework at all. Milano only requires minimal changes to what your script accepts via command line and what it returns to stdout.
Currently supported backends: * Azkaban - on a single multi-GPU machine or server with Azkaban installed * AWS - Amazon cloud using GPU instances * SLURM - any cluster which is running SLURM
We provide a script to convert the csv file output into two kinds of graphs:
To run the script, use:
python3 visualize.py --file [the name of the results csv file] --n [the number of samples to visualize] --subplots [the number of subplots to show in a plot] --max [the max value of benchmark you care about]