Intel Deep Learing Training Tool Error

Intel Deep Learing Training Tool Error

    When the installing went to 71%, it happened a Error:503 Docker containers cannot be started.

     I checked the log and found this message"Error response from daemon: Get https://gcr.io/v2/: net/http: request canceled while waiting for connection (Client.Timeout exceeded while awaiting headers)."

​    I wonder if that means that there are something worng with the https://gcr.io/v2/: . And I found this below on the page:

     

      404. That’s an error.

      The requested URL /v2/: was not found on this server. That’s all we know.

     Below are the logs includes the error messages

  • 20:18:58 error docker containers cannot be started
  • 20:18:58 error error: <password>Error: Command failed: tools\win\plink.exe 172.23.19.128 -l <password> -pw mayi -batch -t -ssh "chmod +x dlsdk_install_scripts/*.sh && echo 'mayi' | sudo -S -E -k dlsdk_install_scripts/install_training_tool.sh -stage 5 -type multi -username 'mayi' -toolpassword '<password>' -startport 31000 -selfsigncertificate 'undefined' -volume ~/ "
  • 20:18:58 info
  • 20:18:58 info Error on or near line 87; exiting with status 1
  • 20:18:58 info !!! [1102 05:17:19] etcd failed to start. Exiting...
  • 20:18:58 info See 'docker run --help'.
  • 20:18:58 info docker: Error response from daemon: Get https://gcr.io/v2/: net/http: request canceled while waiting for connection (Client.Timeout exceeded while awaiting headers).
  • 20:18:58 info Unable to find image 'gcr.io/google_containers/etcd-amd64:3.0.4' locally
  • 20:18:58 info +++ [1102 05:16:43] Launching etcd...
  • 20:18:58 info +++ [1102 05:16:41] Launching docker bootstrap...
  • 20:18:58 info +++ [1102 05:16:41] Killing all kubernetes containers...
  • 20:18:58 info +++ [1102 05:16:41] --------------------------------------------
  • 20:18:58 info +++ [1102 05:16:41] USE_CONTAINERIZED is set to: false
  • 20:18:57 info +++ [1102 05:16:41] USE_CNI is set to: false
  • 20:18:57 info +++ [1102 05:16:41] IP_ADDRESS is set to: 172.23.19.128
  • 20:18:57 info +++ [1102 05:16:41] ARCH is set to: amd64
  • 20:18:57 info +++ [1102 05:16:41] MASTER_IP is set to: localhost
  • 20:18:57 info +++ [1102 05:16:41] RESTART_POLICY is set to: unless-stopped
  • 20:18:57 info +++ [1102 05:16:41] FLANNEL_BACKEND is set to: udp
  • 20:18:57 info +++ [1102 05:16:41] FLANNEL_NETWORK is set to: 172.16.0.0/16
  • 20:18:57 info +++ [1102 05:16:41] FLANNEL_IPMASQ is set to: true
  • 20:18:57 info +++ [1102 05:16:41] FLANNEL_VERSION is set to: v0.6.1
  • 20:18:57 info +++ [1102 05:16:41] ETCD_VERSION is set to: 3.0.4
  • 20:18:57 info +++ [1102 05:16:41] K8S_VERSION is set to: v1.5.2
  • 20:18:57 info curl: (35) gnutls_handshake() failed: Error in the pull function.
  • 20:18:57 info +++ [1102 05:15:55] Done.
  • 20:18:57 info +++ [1102 05:15:55] Killing all kubernetes containers...
  • 20:18:57 info curl: (7) Failed to connect to storage.googleapis.com port 443: Connection refused
  • 20:18:57 info 0 upgraded, 0 newly installed, 0 to remove and 416 not upgraded.
  • 20:18:57 info curl is already the newest version (7.47.0-1ubuntu2.4).
  • 20:18:57 info Reading state information... 0% Reading state information... 2% Reading state information... Done
  • 20:18:57 info Building dependency tree... 0% Building dependency tree... 0% Building dependency tree... 50% Building dependency tree... 50% Building dependency tree... 72% Building dependency tree
  • 20:18:57 info Reading package lists... 0% Reading package lists... 100% Reading package lists... Done
  • 20:18:57 info COMMANMD=master
  • 20:18:57 info Linux distribution: ubuntu
  • 20:18:57 info multi node installation
  • 20:18:57 info chown -R dlsdk-user:dlsdk-group /home/mayi//dlsdk/
  • 20:18:57 info mkdir -p /home/mayi//dlsdk//dlsdk/security
  • 20:18:57 info mkdir -p /home/mayi//dlsdk/
  • 20:18:57 info volume=/home/mayi/
  • 20:18:57 info selfsigncertificate = undefined
  • 20:18:57 info etcd_port=4001
  • 20:18:57 info tf_rest_port=8010
  • 20:18:57 info caffe_jupyter_port=8001
  • 20:18:57 info caffe_rest_port=8000
  • 20:18:57 info js_jupyter_tf_port=31002
  • 20:18:57 info js_jupyter_caffe_port=31001
  • 20:18:57 info js_ui_port=31000
  • 20:18:57 info username = mayi
  • 20:18:57 info type = multi
  • 20:18:57 info stage = 5
  • 20:18:57 info Version: 16.04
  • 20:18:57 info OS: ubuntu
  • 20:18:57 info DLSDK_ETCD_CONTAINER: intelcorp/dl-training-tool:etcd
  • 20:18:57 info DLSDK_JS_CONTAINER: intelcorp/dl-training-tool:js-release5-latest
  • 20:18:57 info DLSDK_TF_CONTAINER: intelcorp/dl-training-tool:tf-release5-latest
  • 20:18:57 info DLSDK_CAFFE_CONTAINER: intelcorp/dl-training-tool:caffe-release5-mlsl-latest
  • 20:18:57 info [sudo] password for mayi: Installer version: 1.0.1143
  • 20:18:57 info output:
  • 20:17:05 info run: <password>tools\win\plink.exe 172.23.19.128 -l mayi -pw <password> -batch -t -ssh chmod +x dlsdk_install_scripts/*.sh && echo '<password>' | sudo -S -E -k dlsdk_install_scripts/install_training_tool.sh -stage 5 -type multi -username 'mayi' -toolpassword '<password>' -startport 31000 -selfsigncertificate 'undefined' -volume ~/
  • 20:17:05 info ---------------------------------------- Run docker image and install other dependencies

    Can you help me solve this problem?

 

1 post / 0 new
For more complete information about compiler optimizations, see our Optimization Notice.