submit.md

June 7, 2024 · View on GitHub

Submit Parameter

中文文档

Using the command $HBOX_HOME/bin/hbox-submit [[--property value]...] command [argument...] to submit the application to Cluster at the Hbox client. Please see the example in the part of README Quick Start. The following is more details of the parameter.

Property NameMeaning
app-nameapplication name
app-typeapplication type, default as the "Hbox", can set as "TensorFlow", "Caffe" according to the deeplearning framework
inputinput file path in the format of "the HDFS path"#"local path"
outputoutput file path in the format of "the HDFS path"#"local path"
filesthe required local files of the application
cacheArchivethe required compressed files in the HDFS path
cacheFilethe required files in the HDFS path
user-paththe append for the environment variable $PATH
jarsthe required jar files
user-classpath-firstwhether user job jar should be the first one on class path or not, default as the configure of hbox.user.classpath.first
confset the configuration
driver-coresnumber of cores to use for the AM process, default as the configure of hbox.driver.cores
driver-memoryamount of memory to use for the AM process (in MB),default as the configure of hbox.driver.memory
ps-numnumber of ps containers to use for the application, default as the configure of hbox.ps.num
ps-coresnumber of cores to use for the ps process, default as the configure of hbox.ps.cores
ps-memoryamount of memory to use for the ps process (in MB), default as the configure of hbox.ps.memory
worker-numnumber of worker containers to use for the application, default as the configure of hbox.worker.num
worker-coresnumber of cores to use for the worker process, default as the configure of hbox.worker.cores
worker-memoryamount of memory to use for the worker process(in MB), default as the configure of hbox.worker.memory
chiefworker-memoryamount of memory for the chief worker, especially for the index 0 worker of the TensorFlow application, default as the worker-memory
evaluatorworker-memoryamount of memory for the estimator worker, especially for the TensorFlow Estimator application, default as the worker-memory
queuethe queue of application submitted to, default as the configure of hbox.app.queue
prioritythe priority of application, default as the configure of hbox.app.priority
board-enablewhether to start the service of Board, default as the configure of hbox.tf.board.enable
board-indexspecify the index of worker which start the Board, default as the configure of hbox.tf.board.worker.index
board-logdirthe directory save Board event log, default as the configure of hbox.tf.board.log.dir
board-reloadintervalhow often the backend should load more data of event log for tensorboard, default as the configure of hbox.tf.board.reload.interval
board-historydirspecify the HDFS path which the Board event log upload to, default as the configure of hbox.tf.board.history.dir
board-modelpbmodel proto in ONNX format for VisualDL, default as the configure of hbox.board.modelpb
board-cacheTimeoutmemory cache timeout duration in seconds for VisualDL,default as the configure of hbox.board.cache.timeout
input-strategythe strategy of the input file, default as the configure of hbox.input.strategy
inRenameInputFilewhether to rename the download file when input-strategy is "DOWNLOAD", default as the configure of hbox.inputfile.rename
stream-epochspecify the epoch num of the input file read when input-strategy is "STREAM", default as the configure of hbox.stream.epoch
inputformatspecify the class of the inputformat when input-strategy is "STREAM", default as the configure of hbox.inputformat.class
inputformat-shufflewhether to shuffle the input splits when input-strategy is "STREAM", default as the configure of hbox.input.stream.shuffle
output-strategythe strategy of the output file, default as the configure of hbox.output.strategy
outputformatspecify the class of outputformat when output-strategy is "STREAM", default as the configure of hbox.outputformat.class
tf-evaluatorwhether to set the last worker as evaluator of distributed TensorFlow job type, default as the configure of hbox.tf.evaluator
output-indexspecify the index of the worker which to upload the output, default upload the output of all the workers.
archiveFilesLocation of local archive files will be uploaded to container and be decompressed. use comma as separator, # with alias name