Skip to content

Instantly share code, notes, and snippets.

@danehans
Last active October 14, 2015 19:38
Show Gist options
  • Save danehans/26993e6818a94699bcbc to your computer and use it in GitHub Desktop.
Save danehans/26993e6818a94699bcbc to your computer and use it in GitHub Desktop.
swarm-troubleshooting
#paste in here
$ git log --oneline
adc909a Updates Swarm Heat Templates for Container Network Model
fd2e760 Baymodel create should allow no network_driver specified
98f915a Merge "Pull metrics by using TLS enabled client"
# systemctl status swarm-manager.service -l
● swarm-manager.service - Swarm Manager
Loaded: loaded (/etc/systemd/system/swarm-manager.service; enabled)
Active: active (running) since Tue 2015-10-13 16:45:57 UTC; 15s ago
Process: 1238 ExecStartPost=/usr/bin/curl -sf -X PUT -H Content-Type: application/json --data-binary {"Status": "SUCCESS", "Reason": "Setup complete", "Data": "OK", "UniqueId": "00000"} http://172.29.74.86:8000/v1/waitcondition/arn%3Aopenstack%3Aheat%3A%3Ae2d247525bc147a297d985167519c95e%3Astacks%2Fswarm-yrevtnbldroo-swarm_master-442fywvkr4ko-0-pmjzdczrpuvr%2F005157e2-a5e5-4d55-9028-f62e18897d6b%2Fresources%2Fmaster_wait_handle?Timestamp=2015-10-13T16%3A44%3A55Z&SignatureMethod=HmacSHA256&AWSAccessKeyId=45386951c8784f8aa6fb80dc8c8e390d&SignatureVersion=2&Signature=XcG0EF%2BRQUwp5zP6RyjjMMmV7Zz2N%2BpQkk3AWZU1xfA%3D (code=exited, status=0/SUCCESS)
Process: 1093 ExecStartPre=/usr/bin/docker pull swarm:0.2.0 (code=exited, status=0/SUCCESS)
Process: 1083 ExecStartPre=/usr/bin/docker rm swarm-manager (code=exited, status=1/FAILURE)
Process: 1074 ExecStartPre=/usr/bin/docker kill swarm-manager (code=exited, status=1/FAILURE)
Main PID: 1235 (docker)
CGroup: /system.slice/swarm-manager.service
└─1235 /usr/bin/docker run --name swarm-manager -v /etc/docker:/etc/docker -p 2376:2375 -e http_proxy= -e https_proxy= -e no_proxy= swarm:0.2.0 manage -H tcp://0.0.0.0:2375 --tlsverify --tlscacert=/etc/docker/ca.crt --tlskey=/etc/docker/server.key --tlscert=/etc/docker/server.crt etcd://10.0.0.3:2379/v2/keys/swarm/
Oct 13 16:45:54 sw-fywvkr4ko-0-pmjzdczrpuvr-swarm-master-agstohdgxsxl.novalocal docker[1093]: 57ec2f5f3e06: Pulling fs layer
Oct 13 16:45:55 sw-fywvkr4ko-0-pmjzdczrpuvr-swarm-master-agstohdgxsxl.novalocal docker[1093]: 57ec2f5f3e06: Download complete
Oct 13 16:45:55 sw-fywvkr4ko-0-pmjzdczrpuvr-swarm-master-agstohdgxsxl.novalocal docker[1093]: 73504b2882a3: Pulling metadata
Oct 13 16:45:55 sw-fywvkr4ko-0-pmjzdczrpuvr-swarm-master-agstohdgxsxl.novalocal docker[1093]: 73504b2882a3: Pulling fs layer
Oct 13 16:45:56 sw-fywvkr4ko-0-pmjzdczrpuvr-swarm-master-agstohdgxsxl.novalocal docker[1093]: 73504b2882a3: Download complete
Oct 13 16:45:56 sw-fywvkr4ko-0-pmjzdczrpuvr-swarm-master-agstohdgxsxl.novalocal docker[1093]: 73504b2882a3: Download complete
Oct 13 16:45:56 sw-fywvkr4ko-0-pmjzdczrpuvr-swarm-master-agstohdgxsxl.novalocal docker[1093]: Status: Downloaded newer image for swarm:0.2.0
Oct 13 16:45:57 sw-fywvkr4ko-0-pmjzdczrpuvr-swarm-master-agstohdgxsxl.novalocal curl[1238]: <resource>master_wait_handle</resource>
Oct 13 16:45:57 sw-fywvkr4ko-0-pmjzdczrpuvr-swarm-master-agstohdgxsxl.novalocal systemd[1]: Started Swarm Manager.
Oct 13 16:45:59 sw-fywvkr4ko-0-pmjzdczrpuvr-swarm-master-agstohdgxsxl.novalocal docker[1235]: time="2015-10-13T16:45:59Z" level=info msg="Listening for HTTP" addr="0.0.0.0:2375" proto=tcp
# cat /var/log/cloud-init-output.log
Cloud-init v. 0.7.5 running 'init-local' at Tue, 13 Oct 2015 16:45:26 +0000. Up 9.57 seconds.
Cloud-init v. 0.7.5 running 'init' at Tue, 13 Oct 2015 16:45:27 +0000. Up 10.93 seconds.
ci-info: +++++++++++++++++++++++++Net device info+++++++++++++++++++++++++
ci-info: +--------+------+-----------+---------------+-------------------+
ci-info: | Device | Up | Address | Mask | Hw-Address |
ci-info: +--------+------+-----------+---------------+-------------------+
ci-info: | lo: | True | 127.0.0.1 | 255.0.0.0 | . |
ci-info: | eth0: | True | 10.0.0.4 | 255.255.255.0 | fa:16:3e:bc:72:27 |
ci-info: +--------+------+-----------+---------------+-------------------+
ci-info: ++++++++++++++++++++++++++++++Route info++++++++++++++++++++++++++++++
ci-info: +-------+-------------+----------+---------------+-----------+-------+
ci-info: | Route | Destination | Gateway | Genmask | Interface | Flags |
ci-info: +-------+-------------+----------+---------------+-----------+-------+
ci-info: | 0 | 0.0.0.0 | 10.0.0.1 | 0.0.0.0 | eth0 | UG |
ci-info: | 1 | 10.0.0.0 | 0.0.0.0 | 255.255.255.0 | eth0 | U |
ci-info: +-------+-------------+----------+---------------+-----------+-------+
Cloud-init v. 0.7.5 running 'modules:config' at Tue, 13 Oct 2015 16:45:29 +0000. Up 12.53 seconds.
Cloud-init v. 0.7.5 running 'modules:final' at Tue, 13 Oct 2015 16:45:30 +0000. Up 13.45 seconds.
removing docker key
Generating RSA private key, 4096 bit long modulus
...............................................++
.................................................++
unable to write 'random state'
e is 65537 (0x10001)
creating /usr/local/bin/flannel-config
Created symlink from /etc/systemd/system/multi-user.target.wants/flannel-config.service to /etc/systemd/system/flannel-config.service.
activating service flanneld
Created symlink from /etc/systemd/system/docker.service.wants/flanneld.service to /usr/lib/systemd/system/flanneld.service.
File descriptor 3 (socket:[18105]) leaked on pvcreate invocation. Parent PID 807: /bin/sh
Physical volume "/dev/vdb" successfully created
File descriptor 3 (socket:[18105]) leaked on vgcreate invocation. Parent PID 807: /bin/sh
Volume group "docker" successfully created
configuring swarm ...
stopping docker
Cannot find device "docker0"
starting services
activating service etcd
Created symlink from /etc/systemd/system/multi-user.target.wants/etcd.service to /usr/lib/systemd/system/etcd.service.
activating service docker
Failed to execute operation: File exists
activating service docker.socket
Created symlink from /etc/systemd/system/sockets.target.wants/docker.socket to /etc/systemd/system/docker.socket.
activating service swarm-manager
Created symlink from /etc/systemd/system/multi-user.target.wants/swarm-manager.service to /etc/systemd/system/swarm-manager.service.
notifying heat
<resource>cloud_init_wait_handle</resource>Cloud-init v. 0.7.5 finished at Tue, 13 Oct 2015 16:45:33 +0000. Datasource DataSourceConfigDriveNet [net,ver=2][source=/dev/sr0]. Up 16.33 seconds
# systemctl status swarm-manager-failure.service -l
● swarm-manager-failure.service - swarm-manager Failure Notifier
Loaded: loaded (/etc/systemd/system/swarm-manager-failure.service; static)
Active: inactive (dead)
$ magnum bay-list
+--------------------------------------+-------+------------+--------------+-----------------+
| uuid | name | node_count | master_count | status |
+--------------------------------------+-------+------------+--------------+-----------------+
| 7c62d0f0-f598-4bfb-b4d9-c8d5ddae970e | swarm | 1 | 1 | CREATE_COMPLETE |
+--------------------------------------+-------+------------+--------------+-----------------+
$ magnum bay-show swarm
+--------------------+------------------------------------------------------------+
| Property | Value |
+--------------------+------------------------------------------------------------+
| status | CREATE_COMPLETE |
| uuid | 7c62d0f0-f598-4bfb-b4d9-c8d5ddae970e |
| status_reason | Stack CREATE completed successfully |
| created_at | 2015-10-13T16:44:44+00:00 |
| updated_at | 2015-10-13T16:47:40+00:00 |
| bay_create_timeout | 0 |
| api_address | None |
| baymodel_id | 71d3b607-d6b4-4450-989e-cfe58dee69e1 |
| node_count | 1 |
| node_addresses | [] |
| master_count | 1 |
| discovery_url | https://discovery.etcd.io/8089b837de11b01182d5c5b9979a7495 |
| name | swarm |
+--------------------+------------------------------------------------------------+
# ifconfig eth0
eth0: flags=4163<UP,BROADCAST,RUNNING,MULTICAST> mtu 1500
inet 10.0.0.4 netmask 255.255.255.0 broadcast 10.0.0.255
....
-bash-4.3# docker -H tcp://10.0.0.4:2376 --tlsverify --tlscacert=/etc/docker/ca.crt --tlskey=/etc/docker/server.key --tlscert=/etc/docker/server.crt run -d --name test1 -p 8080:80 larsks/thttpd
8c3468e11e71b7cd80362e95d6c6400f23255576cc8ae37a83ac013066018600
-bash-4.3#
-bash-4.3# docker -H tcp://10.0.0.4:2376 --tlsverify --tlscacert=/etc/docker/ca.crt --tlskey=/etc/docker/server.key --tlscert=/etc/docker/server.crt info
Containers: 2
Strategy: spread
Filters: affinity, health, constraint, port, dependency
Nodes: 1
sw-6hx3gcgzrmb-0-wnxejc2jfupd-swarm-node-5xij5f2leoh2.novalocal: 10.0.0.5:2375
└ Containers: 2
└ Reserved CPUs: 0 / 1
└ Reserved Memory: 0 B / 2.052 GiB
-bash-4.3# docker -H tcp://10.0.0.4:2376 --tlsverify --tlscacert=/etc/docker/ca.crt --tlskey=/etc/docker/server.key --tlscert=/etc/docker/server.crt ps
CONTAINER ID IMAGE COMMAND CREATED STATUS PORTS NAMES
8c3468e11e71 larsks/thttpd:latest "/thttpd -D -l /dev/ 16 seconds ago Up Less than a second 10.0.0.5:8080->80/tcp sw-6hx3gcgzrmb-0-wnxejc2jfupd-swarm-node-5xij5f2leoh2.novalocal/test1
-bash-4.3# curl 10.0.0.5:8080
<!DOCTYPE html>
<html>
<head>
<title>Your web server is working</title>
<style type="text/css">
body {
text-align: center;
font-family: Arial,"Helvetica Neue",Helvetica,sans-serif;
}
pre {
border: thin solid black;
padding: 1em;
background-color: #c0c0c0;
}
#summary {
max-width: 40em;
margin: auto;
text-align: left;
}
</style>
</head>
<body>
<div id="header">
<pre>
____ _ _ _ _
/ ___|___ _ __ __ _ _ __ __ _| |_ _ _| | __ _| |_(_) ___ _ __ ___
| | / _ \| '_ \ / _` | '__/ _` | __| | | | |/ _` | __| |/ _ \| '_ \/ __|
| |__| (_) | | | | (_| | | | (_| | |_| |_| | | (_| | |_| | (_) | | | \__ \
\____\___/|_| |_|\__, |_| \__,_|\__|\__,_|_|\__,_|\__|_|\___/|_| |_|___/
|___/
</pre>
<p><strong>You have a web server.</strong></p>
</div>
<div id="summary">
<p>This is a statically compiled version of <a href="http://acme.com/software/thttpd/">thttpd</a>
put together to build a demonstration container for my
<a href="https://github.com/larsks/heat-kubernetes">Heat templates for Kubernetes</a>. But maybe
you'll find it useful for other things.</p>
</div>
</body>
</html>
@danehans
Copy link
Author

I posted a new patch. However, the patch you are using should work. It looks like maybe something is wrong with your docker daemon.

@danehans
Copy link
Author

It also looks like you have applied the patch on an older tip of master.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment