Andy Domeier

Everything posted by Andy Domeier

  1. I also very much agree! I really need this added. Without being able to see on the report which row belongs to which datapoint, I have to clone data sources and give them custom names, all doing the same thing, just so I can differentiate each datapoint/row within the SLA report. I would be happy to chat about it more if an example or more info would help.
  2. We are a global company with resources in Minnesota, New Jersey, Australia, Ukraine, and India all using the Logic Monitor tool set. It would be incredibly useful to be able to set the timezone at a user level instead of only at the company level.
  3. It would be super powerful to be able to compare historical data quickly. The two examples I can think of: (1) Data Source X, today vs. last Monday; (2) Web server 1 CPU vs. Web server 2 CPU. I view this as an investigative tool that lets you quickly see whether something looks out of sorts. Being able to put last Monday and this Monday on the same graph makes for a very easy comparison. This would be used a lot in the heat of the moment or for post-change validation of performance.
  4. Creating a rule is straightforward, but it becomes risky and difficult to manage as we grow our organization or implement new systems and services. It can also be time consuming for someone not familiar with the "Alert Rules" process; it can get messy and easy to break very quickly! I think it would be awesome and super handy to be able to adjust, from the Host tab and with a few clicks, the alert routing for a host, a group, or a data source in a host or group. An example: if I am working on a new host, I'm on the Host tab to get it added to LM and put into the right groups. If I want those alerts to go to a different team for a while until we get the server ready for prime time, I have to go to the Alerts tab and figure out how to make a rule that doesn't break other rules. But if I could just set, from that host or data source, that all these alerts go to the "X" escalation policy, that would be super slick, saving us time and being easy to manage.
  5. This is becoming more of a challenge for us as well. We have a large number of hosts and they don't all need the same level of monitoring. It would be ideal if we could set the alert trigger interval (consecutive polls) at a host level when we want to. That would help us manage and mitigate noisy alerts in a way that fits the situation, without having to create new data sources that monitor the same thing with a different alert trigger interval (consecutive polls) than the global threshold. More people should vote for this, this is a cool idea!
  6. Any target date on this one? This summer, this year?
  7. The Service monitors you have are great! They are easy to use and straightforward. It would be awesome if we could also call those from a collector so that we can use Logic Monitor to monitor UIs that are internal to our company's users. Since there isn't a way for LM to hit those URLs externally, we have to monitor their availability in other, more "hacky" ways. Thanks!
  8. It would be awesome if, when you're on the main Alerts tab, the detail associated with an alert had a column showing which escalation chain was used. This would make it simpler and clearer to understand who was notified of which alerts.
  9. There is a lot of value in making the data we build out in Logic Monitor visible to the whole business. That being said, there are also a lot of other tools and services in the same situation. Geckoboard does an excellent job of centralizing KPIs from a wide variety of service providers. This helps us keep key metrics transparent to everyone in the business. I think adding integration with Geckoboard would be an awesome feature for Logic Monitor as you both continue growing.
  10. If you are asking for the ability to create subfolders on the Services tab, I agree! It would be ideal to have the same structure we create on the Hosts tab. We have different teams managing different services, and we are able to divide the hosts up clearly, but we can't for services.
  11. Hey Steve, great point. I did a terrible job of articulating that this is when I'm consolidating multiple hosts into one graph. When I have a cluster of servers, it is really valuable to have them all on the same graph. An example would be the consolidated CPU and Mem graphs out in our dashboard on THE Dashboard in our account. Does that make sense? It's absolutely possible I'm missing something, but it looked like Support didn't have a good solution for getting multiple hosts consolidated onto one graph for that data.
  12. Hey Steve, I will look into that further. I don't think that's going to quite get us what we need, but I will circle back with my team members who are smarter than me to see if they can make that work :) Thanks for your response!
  13. This actually got more complicated recently: because of daylight saving time, my scripts that clean up "CST" in the date stamp failed, since it now prints "CDT".
  14. If this gets worked on it would also be valuable to have the ack work back and forth between systems.
  15. Today, creating a graph that shows trends for overall memory and CPU utilization as a percent of total available takes a lot of custom work. I think this is a very standard and important datapoint that should be a default for all hosts. An example: on our Orchestration app servers we have to use the following datapoints to produce a graph that shows % used for memory and/or CPU (this is just for one host in the cluster). 3 normal datapoints (all NetSNMPMem): AvailableReal, Cached, Total Real. 2 virtual datapoints: "Used" = Total - Avail - Cached, and "% Used" = 100*"Used"/Total. I think it would be really useful to make this a default datapoint so that you can trend total used over time as a percent without having to do this math for every host. I looked around to see if I was doing this in an awkward way, and it looks like this is the right way to do it. Thanks!
  16. Hi, within our architecture we have a lot of important real-time information in our databases. A lot of the time we are doing "counts" of specific processes or rows. It's difficult to ensure we are collecting all the data since the last run of a datapoint, for a few reasons. It would be really valuable to have Logic Monitor maintain a time variable on a datapoint so that we can collect data "since the last run". That would give us a more accurate picture than running a query for the previous 10 minutes and hoping it runs exactly every 10 minutes. Thanks!!!
  17. When reports are generated, the data fields aren't valid data fields in Excel (or other standards), and as a result you can't systematically import the data into other systems without first modifying the data columns to be a standard data format. The timezone is helpful, but maybe moving it to a different column would solve this problem? Thanks!
  18. For reports, adding which group is included in the "alert" report would be really helpful. We have our hosts organized by different service zones, and being able to look at the type of alert by service zone would be really valuable. Right now I can trend total alerts by group, but it doesn't look like I can add "group" as a column on the alert report. I'm working with David Lee to validate, but it doesn't look like it's going to work.
  19. This is really important to us as well. Logic Monitor is our primary monitoring tool, but we have a large variety of other tools that we need to use as well, and PagerDuty is the service we use to consolidate all of the alerting. More flexibility to use webhooks to resolve alerts, or a more pure integration with PD, is very important to us. I can help facilitate communication with PD if it's needed. Thanks!
  20. This is also a really important feature to us. We have a large team working out of Logic Monitor, and if a host is down but in SDT it still shows up in the Alerts tab without much clarity about the fact that it's in SDT.
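
The virtual-datapoint math described in post 15 ("Used" = Total - Avail - Cached, "% Used" = 100*"Used"/Total) can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not how Logic Monitor evaluates virtual datapoints; the function and parameter names are hypothetical, and the sample values are made up.

```python
def memory_used_percent(total_real, available_real, cached):
    """Return memory used as a percent of total.

    Hypothetical helper mirroring the two virtual datapoints in post 15:
      Used   = Total - Avail - Cached
      % Used = 100 * Used / Total
    All three inputs must be in the same unit (for example kB).
    """
    if total_real <= 0:
        raise ValueError("total_real must be positive")
    used = total_real - available_real - cached
    return 100.0 * used / total_real


# Made-up sample values, all in the same unit.
percent = memory_used_percent(total_real=8000, available_real=2000, cached=1000)
print(f"{percent:.1f}% used")
```

Having to repeat this calculation per host is the overhead the post asks to avoid with a default datapoint.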