Brandon

Members
  • Content count

    69
  • Joined

  • Last visited

  • Days Won

    24

Community Reputation

31 Excellent

About Brandon

  • Rank
    Community All Star

Recent Profile Visitors

616 profile views
  1. Cluster Alert Routing

    It would be immensely helpful if I could see and test alert routing from the Cluster Alerts page at the device group level similar to the existing Alert Routing button on the Alert Tuning tab. As we begin to more heavily utilize this functionality, it's critical that we can verify that alerts are routed correctly wherever we set it up.
  2. Wow @mnagel! Thanks so much for this! I'm going to look into running this so I can get that list put together. I don't have any plans to systematically delete the datasources - I'm just wanting to compile a list so I can review them. I'll feed the obvious ones into a script as a one-time purge and once I've done that, I can take a closer look at those that should be working, but aren't for whatever reason.
  3. HI @Sarah Terry, Thanks for the tip! I also saw that as a workaround, but unfortunately it wouldn't really help with what I'm attempting to accomplish. What I'm trying to do is find datasources with applies-to functions like "isWindows()" or "isLinux()" that aren't actually discovering any instances. It's almost always because the datasource is built to monitor a product or service that we don't use and likely never will. I'd like our datasource list to only contain datasources that are actually in use and applicable to our systems/services. Similarly, there are datasources that apply to specific hardware that we don't own. I'm currently going through manually and removing them so we don't have to scroll past them when browsing our list of datasources. If and when we ever deploy new hardware/software, I'll go and re-import those (updated) datasources from the LM Repository. I hope this makes sense. Thanks again for your response!
  4. I'm trying to clean up datasources that are in our account that do not have any instances associated with them and likely never will. Currently I have to do this manually by inspecting each datasource in the GUI. It would be really great if the datasource instance count was returned as a property. Even better would be if the instances and associated device ID's were returned as well, but for now I'd be happy with just the device/instance counts.
  5. SDT EventSources

    Have you tried enabling rate-limiting? At least until it all gets sorted out - I'd consider setting up a duplicate escalation chain and an alert rule specifically for some of your syslog eventsources and enable rate-limiting on them. Before my LogicMonitor days, I had this happen a few times and it sucks dealing with a crippled Exchange server while also trying to work out a firewall issue. Syslog is unpredictable sometimes.
  6. We have several clustered devices where metrics are gathered on each node. However, the instances across each node are identical. When attempting to graph this data, this means that I would need to add a new datapoint for each instance and use a glob pattern to select the devices from which to pull those instances. This can mean that a lot of time goes into creating these graphs if there are several instances to monitor. Examples: Three Solr nodes - each servicing search requests for the same 10 collections. In order to see the total number of GET requests for each of those collections, I would need to create a graph that has 10 individual datapoints. Instead, I would like to add one datapoint and have the graph intelligently aggregate all instances that have the same name, regardless of the node. Several device groups exist under a parent group. If I want to see the average CPU utilization across each of these groups on a single graph, I would need to add a separate datapoint for each group. A potential solution could be to allow the integration of regex instead of glob patterns to allow for capture groups. Otherwise a simple checkbox for "aggregate instances by device group" and "aggregate instances by instance names" when selecting aggregated graph types would be extremely useful and time-saving.
  7. FYI: LM can trigger ESXi 6.5 hostd to crash

    @Eric Singer - Any chance VMWare provided you with a KB that documents this as a known issue / bug? I'd like to provide as much context as possible to our ESX admins. Thanks!
  8. FYI: LM can trigger ESXi 6.5 hostd to crash

    DO NOT comment out the applies to field on the datasource! This will remove all historical data - which I can only imagine most of us want to keep. You can disable the datasource by creating a device group (if you don't have one already) and populating it with all of the ESX hosts. Then, at the group level, select the alert tuning tab and uncheck the box next to the datasource. This disables polling and alerting, but allows you to keep historical data.
  9. I have a device property that I would like to update every 15 minutes or so. This is because I have groups with auto-include rules that are looking for that property. I need to have the device move in and out of the groups on the fly. It would be great if we could set individual custom propertysources to update on a more frequent basis. Currently I'm achieving this using the LogicMonitor Rest API which I have baked right into a datasource as a workaround - but I think this solution is messy. Thanks!
  10. Hey Joe, I'm not sure that this exactly meets your needs, but I think it's a good start. Basically, you can call hostProps.toProperties() method which spits out an array that you can now dig through and filter using regex. Something like this: def allProps = hostProps.toProperties() allProps.each{ if(it ==~ /.*\.databases=/){ println it } } Let me know if this doesn't address what you're trying to accomplish.
  11. SOLR JVM Stats (non-JMX)

    Do the instances at least get discovered? If so, could you do a poll-now and shoot me the error? I'm curious as to what the issue is.
  12. SOLR JVM Stats (non-JMX)

    Hey @George Bica! Did you get any of the other datasources to work?
  13. Apache Solr datasources need scrubbing

    @George Bica - Last one. This is an eventsource, so no pretty graphs here. It makes an API call to pull all of the events in the solr log and then alerts on Error and Severe events only. It doesn't apply to servers by default because it can be quite noisy if you don't have it tuned properly. Once you're sure it's not going to blow up after you've applied it, go ahead and change the Applies To rules and you should be good to go. Hope these help!
  14. SOLR Error Logs

    W9PN3Y I thought I had already posted this one, but regardless - here it is. This does not apply to any servers by default as it can be extremely noisy if you don't have it tuned. This makes an API call to solr to pull error and severe logs and then formats them so that LogicMonitor can understand them. Before applying this, it's not a bad idea to review those logs manually to make sure something isn't repeatedly triggering (as is common with SOLR). Still - it's helped us detect and diagnose a range of issues that would have otherwise been difficult to see.
  15. Apache Solr datasources need scrubbing

    @Michael Rodrigues - Thanks! I've got more on the way. @George Bica - Here's another one. It might not work for your version of SOLR, but it has the exact same requirements otherwise. Add solr.port and solr.append to your solr servers and this datasource will provide lots of useful JVM metrics without the need for enabling JMX. I'm still going through my datasources, so I might still have one or two to post. I'll keep replying to this thread with whatever I've got.