[Planetlab-users] suspended slices
Larry Peterson
llp at CS.Princeton.EDU
Wed Jan 24 09:15:25 EST 2007
Many of you are now (painfully) aware that your slices
are suspended if your nodes have been down for an extended
period of time, and the PlanetLab support staff has had
no luck in prodding someone at your site to fix the problem.
To help you understand what state PLC thinks your nodes are
in, so you can be proactive in keeping your nodes healthy,
you just need to look at the same information we do, which
means querying CoMon. CoMon is generally available at
http://comon.cs.princeton.edu
but for specific queries...
Nodes that are up:
http://summer.cs.princeton.edu/status/tabulator.cgi?
table=table_nodeviewshort&select='resptime%20%3E%200'
Nodes are sick (we nag, but don't suspend -- yet):
Bad DNS:
http://summer.cs.princeton.edu/status/tabulator.cgi?
table=table_nodeview&select='dns1udp%20%3E%2080%20&&%20dns2udp%20%3E%
2080'
Clock drift:
http://summer.cs.princeton.edu/status/tabulator.cgi?
table=table_nodeview&select='drift%20%3E%201m'
Bad disks:
http://summer.cs.princeton.edu/status/tabulator.cgi?
table=table_nodeview&select='filerw%3E0'
If your nodes aren't on the above lists, then we treat
them as down. To see the set of down nodes:
http://summer.cs.princeton.edu/status/tabulator.cgi?
table=table_nodeviewshort&select='resptime%20==%200'
Larry
More information about the Users
mailing list