Time Nick Message
14:25 pdurbin are the cool kids using http://prometheus.io these days? please see my comment at https://github.com/IQSS/dataverse/issues/2595#issuecomment-148065637
14:27 pdurbin or should I use Ganglia?
14:54 bene what are you trying to do?
14:54 bene just graph stats from the jvm?
14:57 pdurbin stats from glassfish
14:57 pdurbin which runs on the jvm
14:58 pdurbin and in addition to graphing, perhaps an email, an alert, would be sent when a critical threshold is reached
14:58 pdurbin standard monitoring stuff
15:28 bene is this just for you or something you want to bundle with dvn?
15:43 pdurbin we could bundle it. I'll go add it to this: Dataverse Installation Monitoring Functional Requirements Document (FRD): https://docs.google.com/document/d/11YDzhuilIXktld6PSTv3hgcdJyenEBUO-ooUqJlRpbc/edit?usp=sharing
16:20 dotplus ganglia is an older project, which might mean "feels old fashioned" or "mature" depending on your point of view. Comments: 1) it *wants* multicast but doesn't require it 2) it's aimed at collecting standard (system) metrics from clusters (think "grid computing" (remember that?) and/or the HPC world) and it feels like it. 3) It's coming from the metrics end of the spectrum rather than the alerting end, although like most of the tools it can ...
16:20 dotplus ... be made to do the other end than its primary focus.
16:21 dotplus if this is your thing, go for it, it's good stuff. If that doesn't sound like your use case/world, you might find you're trying to use a screwdriver to drive a nail.
16:22 bene yeah, ganglia is probably not a good choice
16:22 dotplus 4) The docs are hidden: https://github.com/ganglia/monitor-core/wiki
16:22 bene grid computing is still very much a thing
16:24 dotplus yeah, that was a joke, because arguably grid computing was never really anything but a marketing term. Until last year, I was supporting various HPC clusters and the country's largest supercomputer
16:25 bene what country was that?
16:25 dotplus usa
16:25 bene i think the grid vs cluster descriptors are somewhat useful
16:25 dotplus titan (ornl.gov)
16:26 bene i.e. single vs multi admin, relative hardware homogeneity vs very diverse, predictable networking vs internet-connected madness
16:26 bene cool
16:27 bene when i worked at HMS we had one of the first biomed applications running on open science grid
16:29 dotplus perhaps they're useful terms. I don't pretend to be a Real HPC person, I was supporting the standard infrastructure there. There were (small) teams for the Cray and for the clusters, so I got exposure to some HPC things, but in many ways it was not that much different any other similarly sized environment from my perspective.
16:39 bene monit looks interesting as a baked-into-the-application-environment monitoring tool
16:39 bene do you provide AMIs or other machine images for dvn?