submit.md
June 7, 2024 · View on GitHub
Submit Parameter
Using the command $HBOX_HOME/bin/hbox-submit [[--property value]...] command [argument...] to submit the application to Cluster at the Hbox client.
Please see the example in the part of README Quick Start. The following is more details of the parameter.
| Property Name | Meaning |
|---|---|
| app-name | application name |
| app-type | application type, default as the "Hbox", can set as "TensorFlow", "Caffe" according to the deeplearning framework |
| input | input file path in the format of "the HDFS path"#"local path" |
| output | output file path in the format of "the HDFS path"#"local path" |
| files | the required local files of the application |
| cacheArchive | the required compressed files in the HDFS path |
| cacheFile | the required files in the HDFS path |
| user-path | the append for the environment variable $PATH |
| jars | the required jar files |
| user-classpath-first | whether user job jar should be the first one on class path or not, default as the configure of hbox.user.classpath.first |
| conf | set the configuration |
| driver-cores | number of cores to use for the AM process, default as the configure of hbox.driver.cores |
| driver-memory | amount of memory to use for the AM process (in MB),default as the configure of hbox.driver.memory |
| ps-num | number of ps containers to use for the application, default as the configure of hbox.ps.num |
| ps-cores | number of cores to use for the ps process, default as the configure of hbox.ps.cores |
| ps-memory | amount of memory to use for the ps process (in MB), default as the configure of hbox.ps.memory |
| worker-num | number of worker containers to use for the application, default as the configure of hbox.worker.num |
| worker-cores | number of cores to use for the worker process, default as the configure of hbox.worker.cores |
| worker-memory | amount of memory to use for the worker process(in MB), default as the configure of hbox.worker.memory |
| chiefworker-memory | amount of memory for the chief worker, especially for the index 0 worker of the TensorFlow application, default as the worker-memory |
| evaluatorworker-memory | amount of memory for the estimator worker, especially for the TensorFlow Estimator application, default as the worker-memory |
| queue | the queue of application submitted to, default as the configure of hbox.app.queue |
| priority | the priority of application, default as the configure of hbox.app.priority |
| board-enable | whether to start the service of Board, default as the configure of hbox.tf.board.enable |
| board-index | specify the index of worker which start the Board, default as the configure of hbox.tf.board.worker.index |
| board-logdir | the directory save Board event log, default as the configure of hbox.tf.board.log.dir |
| board-reloadinterval | how often the backend should load more data of event log for tensorboard, default as the configure of hbox.tf.board.reload.interval |
| board-historydir | specify the HDFS path which the Board event log upload to, default as the configure of hbox.tf.board.history.dir |
| board-modelpb | model proto in ONNX format for VisualDL, default as the configure of hbox.board.modelpb |
| board-cacheTimeout | memory cache timeout duration in seconds for VisualDL,default as the configure of hbox.board.cache.timeout |
| input-strategy | the strategy of the input file, default as the configure of hbox.input.strategy |
| inRenameInputFile | whether to rename the download file when input-strategy is "DOWNLOAD", default as the configure of hbox.inputfile.rename |
| stream-epoch | specify the epoch num of the input file read when input-strategy is "STREAM", default as the configure of hbox.stream.epoch |
| inputformat | specify the class of the inputformat when input-strategy is "STREAM", default as the configure of hbox.inputformat.class |
| inputformat-shuffle | whether to shuffle the input splits when input-strategy is "STREAM", default as the configure of hbox.input.stream.shuffle |
| output-strategy | the strategy of the output file, default as the configure of hbox.output.strategy |
| outputformat | specify the class of outputformat when output-strategy is "STREAM", default as the configure of hbox.outputformat.class |
| tf-evaluator | whether to set the last worker as evaluator of distributed TensorFlow job type, default as the configure of hbox.tf.evaluator |
| output-index | specify the index of the worker which to upload the output, default upload the output of all the workers. |
| archiveFiles | Location of local archive files will be uploaded to container and be decompressed. use comma as separator, # with alias name |