Leaderboard


Popular Content

Showing content with the highest reputation since 02/26/2016 in all areas

  1. 11 points
    Allow devices to be dependent on one another. If a router goes down, the switch behind it will most likely go down or have an error as well.
  2. 8 points
    Hello, We currently use a third party (OpsGenie) for alerting, and currently we have a custom integration configured in LogicMonitor to send alert information to OpsGenie. Although there has been significant improvements to integrations over the past few months, one feature that lacks significantly for us is a supported two way integration between OpsGenie and LogicMonitor. This would be very similar to the partnership/integration that LogicMonitor has already built with PagerDuty. As LogicMonitor releases new integration features, it tends to break current workflows with alert creation in OpsGenie. A supported integration would give us more confidence that as LogicMonitor continues to release new features, that our alert functionality would continue to operate as expected.
  3. 7 points
    We are a global company with resources in Minnesota, New Jersey, Australia, Ukraine, and India all using the Logic Monitor tool set. It would be incredibly useful to be able to set the timezone at a user level instead of only at the company level.
  4. 5 points
    I would love to see LM implement a new feature for taking a built-in, self prescribed, action on an alert. To minimize any exposure that LM might have in an action gone awry, the actions taken could occur as the result of a script that one could upload into the Escalation Chain. Ideally you could define multiple actions or multiple retries on an action and whether that occurred before or after the recipient notification in the notification chain. This would allow for very basic alerts (disk, service restarts, etc) to be resolved programatically. Also being able to support various scripting languages such as PowerCLI, Ansible, etc would allow for some very creative ways to integrate with solutions such as VMWare or Ansible Tower for very complex actions to be crafted by more expert skill level folks.
  5. 5 points
    I have run into too many cases now where a new but slightly different DS is setup due to LM support actions, upgrades, etc. and the result is lost data or noncontinuous data. A good example I recently encountered is with NTP. The standard DS was not working in all cases. I was given a new DS that uses Groovy, and it works (which I appreciate!). But the datapoint list and names have changed, and even if they had not, there is no way to maintain data history from the old DS to the new DS. My recommendation is to add a migrate function so you can indicate how to map old to new datapoints in such a situation and thus avoid data loss. Building in a default migration ruleset into a new DS would be a bonus -- this could allow for zero-touch data migrations in at least some cases. Thanks, Mark
  6. 5 points
    It would be nice to have an export button on any alert table. When we're doing research on an issue and we've finally narrowed the criteria to see the info we need, it helps to have an export button right there, rather than having to go to reports and reconfigure all the parameters to hopefully get the same data.
  7. 5 points
    When there is a legitimate reason for disabling alerts for a device, it would be very useful to be able to leave a note as to why (and by whom). This would prevent confusion with teams, where the case of "why would this be disabled" would come up frequently. For example, there is a known bug with a certain version combination of ESXi and HPE servers that triggers a false-positive hardware alert internally, so we disable alerts for that instance on servers that meet the criteria as we encounter them. Or, some QNAPs will give false-positive alerts that their disk is full when in fact it is "full" due to a RAIN configured as a LUN (we thus rely on the server alerting when the iSCSI volume is actually full). However, another technician may log in and flip alerting for these instances back on, assuming it was a mistake or something, and then we would get flooded with these false-positive alerts, prompting technicians to look into them; as you can see, this causes a loop of wasted time. Simply putting a note associated with the "Alerting Off / On" switch and tagging it with the user invoking it would easily solve issues like this. Something like what is shown for Acknowledgements would be adequate. Perhaps even an admin option to require a note or not?
  8. 5 points
    I have published a PowerShell module, which refactors part of the REST API, to the PowerShell gallery. Please feel free to make requests (or send me cmdlets you want added). https://www.powershellgallery.com/packages/LogicMonitor/
  9. 4 points
    The ability to drag and drop items in the device tree in order to reorganize them is a powerful feature, but the staggered manner in which the tree loads and expands on page load "encourages" a user to accidentally move folders and devices around in the tree, which leads to numerous detrimental effects (not the least is having to try and figure out where you accidentally moved them to). Request is to either introduce a safeguard prompt "Are you sure you wish to move X from Y to Z?" for all drag and drop actions within the device tree, or, alternately, introduce a configuration item on folders and devices allowing them to be "protected" against moves --- either adding a safeguard "are you sure...?" prompt for those specific items, or prevent moving them entirely without first taking the action to "unprotect" them.
  10. 4 points
    We've recently run into issues with users accidentally changing a setting or deleting a device and would like the ability to allow users to Create new devices, but not be able to delete anything or change alert settings. I'd like to either split Manage into Write/Delete groups or add a deny action role that would allow me to give users manage access with a deny delete:*
  11. 4 points
    Useful for inventory, auditing, and auto-grouping. Displays the a list of all installed Windows Features separated by commas. Example below. auto.winfeatures [Active Directory Lightweight Directory Services, .NET Framework 3.5.1 Features, Telnet Client, Remote Server Administration Tools, .NET Framework 3.5.1, Role Administration Tools, AD LDS Snap-Ins and Command-Line Tools, AD DS and AD LDS Tools, Active Directory module for Windows PowerShell] WMN9DN
  12. 4 points
    As we move towards a DevOps model, we increasingly have a need for small teams to have full admin access to the tools they use to manage their IT services. When it comes to LogicMonitor, this has proven difficult with the existing role permission model. DevOps pods would like to be able to manage their own datasources, alerts, and escalation chains but this isn't possible unless we give them broad access rights to those areas, which could cause significant harm to other groups of monitoring users. For example, an inexperienced DevOps user could inadvertently modify a datasource that applies to all Windows devices or they could create an alert rule that causes alerts not to be delivered to other users. To solve this problem, I'd propose that LogicMonitor offer alert groups, escalation chain groups, along with the existing datasource groups. Then, LogicMonitor could provide the ability to restrict roles to manage these specific groups. DevOps pods could be given the ability to manage their own custom subset of datasources and set up their own alerts in a rule range after the main set of rules.
  13. 4 points
    Currently the graph options for Widget includes three was to "Limit Results". The are "Top 10", Aggregate, All. There are times when I want to display the "Bottom 10". For example the lowest 10 CPU devices, or the 10 least available devices, etc. It would be GREAT to have a Bottom 10" option in the Limit Results parameter. Thanks.
  14. 4 points
    As mentioned to Haniz in ticket number 82871, it would be very handy to have a button that can expand all the graphs for you on a device view. It would save quite a bit of time
  15. 3 points
    We have a multi-tenant MSP environment, We find that we cannot use the same display name for our clients even though the similarly named systems are in different Child folders. So \'DC01\' for Domain Controller has to be unique across all of our clients. Please consider changing this.
  16. 3 points
    Our CTO is asking me to provide a daily email report of some key metrics to our clients and internal stakeholders. I'm able to create a suitable report and email it, however his concern is the number of clicks to view the report. The belief is that the email will just go straight to trash if our clients can't see the report in the email body. For me, using gmail, I have to download the attachment first, then open it from my laptop. Two options we talked about were: 1. Use the HTML report as the body of the email rather than an attachment (surely the days of text-only email clients are gone) 2. A configurable/dynamic subject to provide some hint of what the report actually contains, e.g. the uptime for a particular service over the last 24 hours. If the subject said "100% Uptime", there is no reason to open the report, if the subject said "98% Uptime" I can open to see more detail. Does anyone else have any ideas on how to make reports a little more effective for a somewhat apathetic audience?
  17. 3 points
    We use the Debug Console a lot, and usually to debug an issue on a particular device, which means more often than not, we access the Debug Console from the device in question's Raw Data view. We would like for there to be a token (##THIS## or ##DEVICE##) that could be used with Debug Console commands. So if we were to issue the command !ping ##THIS##, the console would automatically interpolate ##THIS## with the IP address of the device on which we opened the Debug Console. It's a small productivity gain, but when you're doing this a lot it makes a big difference.
  18. 3 points
    I'm currently new to LogicMonitor. I think LM has done a pretty good job on their monitoring tool. I love how we could manage our network devices thru SSH remote access from the management console. Without LM, we would have to VPN to our internal network, and then SSH to the network devices from there. One downside about LM is that we couldn't manage network devices thru HTTPS (GUI). Nowadays, new technologies like firewalls are managed by HTTPS/SSH. I would love to have HTTPS remote access right from the management console. It would be much easier and faster to gain access to the GUI interface. I have tested Auvik monitoring software on the HTTPS, it was nice and smooth. This would be one of the nice feature that LM should provide. Thanks, Pao
  19. 3 points
    Don't know if anyone else noticed, but MS released a pretty slick script that enables WMI access remotely without admin rights. I have done a brief test with LM and it seems to be working well. https://blogs.technet.microsoft.com/askpfeplat/2018/04/30/delegate-wmi-access-to-domain-controllers/ That's the article. I created an AD group instead of a user to delegate, and I put the LM collector service in that group. Everything else I've followed as documented. I haven't tested anything else, but this alone is a huge step in the right direction.
  20. 3 points
    We urgently need a way for our regional operations teams to be able to see alert date times in local time. This a critical requirement as we are a global business and for people outside of the account timezone to be constantly trying to work out the local time of an alert is ridiculous.
  21. 3 points
    It would be great to have the granularity to be able to change the Alert Trigger Interval on an instance in the same way a threshold can be modified on an instance. Example: Disk Usage on a netapp There are 100 volumes but one of them needs to be over the threshold for a longer amount of time to cause concern. So the archive log volume will fill up no matter what space is allocated but it is not a concern as long as it is back below the threshold within 1 hour. A custom datasource will not work for this because we would like to alert immediately on all other volumes on this device when they go above the threshold.
  22. 3 points
    Please add the option to alert on "no data" condition to the instance level Alert Tuning configuration dialog. We don't want to generate "no data" alerts for everything and we don't want to split the data sources (extra maintenance when updating), so it would be easier to have this as a instance level override.
  23. 3 points
    Hi All, We have really been enjoying the Remote Management feature of logic monitor. For sites that we don't have a direct interconnect with its great being able to quickly SSH onto our devices to make adjustments or check config without having to open up a separate VPN tunnel. However with HTTP/HTTPS management becoming common with Firewalls, Controllers, Routers etc... I feel there is a huge opportunity to have logic monitor be able to fit almost every management use case by implementing an HTTP/HTTPS remote session functionally in the same way RDP and SSH remote sessions work. We as a company would primarily use this feature for help managing networking Equipment, but functionality would extend to Printers, IPCameras, Security Systems, Phone systems, UPS and many more. Let me know your thoughts, Thanks, Will.
  24. 3 points
    We request another way for Logic Monitor to gather information from Linux based systems besides SNMP. Using SNMP involves installing NET-SNMP so taking up additional storage space. It would be better if Logic Monitor had an option for monitoring Linux systems like we had using Sitescope. In Sitescope all it needed was SSH and a dedicated login credential for the system. The Monitor would connect to the Linux based system through SSH and run scripts pertaining to what aspects were monitoring on that system. Then stores the results to be displayed on the console. Similar to using WMI in Windows.
  25. 3 points
    There are currently far too many opportunities to commit errors in LM from which is is difficult to recover since there is no version tracking. Ideally, it would be possible to revert to a previous version of any object, but especially very sensitive objects like logicmodules, alert policies, etc. I have created my own method of dealing with this, which leverages the API to store JSON streams of all critical elements regularly, changes committed via git (certain adjustments to the original results are needed to avoid a constant update stream). Recovery would be very manual, but at least possible. This would be far more useful within the system itself. Thanks, Mark
  26. 3 points
    I see a need in the design to alert on deviation from rolling average: example 1: Temperature in hardware is based on fixed baseline (default or manual adjusted) or based on fixed Delta. In real world application it would Make a LOT more sense to alert on Deviation from a 5 day or 30 day rolling average Temp of the box. Reason is, units alarm on the weekends because the office shuts off the AC during the summer. or they alert During the week 9-5 because in the winter the offices crank the heat. All of these ignore nuance of RANGE and Average expectation for the location...The alerting should just be how FAR outside the average Range for the site is. My Nashville facility hovers from 56 to 59 all week. I have it set on 57 so I get alerts at least once a weekend. I could move it to 59...but that's a band-aid. The REAL solution would be to have the software TRACK the last 30 days, and alert when we're outside the NORM for that location. furthermore....with hardware it is not the specific temps that kill the hardware....its the RATE at which the temp changes. so, the alerts SHOULD be based on the average range the system has seen in the last 30 days, and alert ONLY when the rate of change accelerates...but I imagine THAT request would be more challenging to reduce to an algorithm. Example 2: PING times.....I have sites where the Latency range is EXTREME (Mumbai, Johannesburg, Taipei etc...) I'd wished the PING would track the 30 day range and common deviation from norm and alert when the sites see latency that is way outside the expected fluctuation range. 30ms typical 90% of the time + 200-500ms spikes 10% of the time. when Ping times hit 300 ms for more then 10% of the last hour of sampling....then notify warning to inform of change in TREND....not fixed threshold in immediate sample
  27. 3 points
    Could we have an option on the various SLA reports to exclude scheduled downtime? It would allow us to give the customer and accurate SLA report, as the period that we have scheduled is excluded from their SLA with us.
  28. 3 points
    I would appreciate it if datapoints marked for alerts on No Data were indicated in the Alert Tuning page with the designated alert level displayed. Right now, to know this, you have to dive into the datasource definition to find out. Thanks, Mark
  29. 3 points
    Currently, a table must have static columns and rows defined before the widget will display data. It would be great to be able to dynamically build a table's rows based on * To expand on this, it would be great for the table to have the option to exclude instances with zero/no data from the list. For example, I would like a table that displays all MSMQ queue names and the number of messages in each queue - but not display anything if the current queue length = 0
  30. 3 points
    I have customers who really need this feature, and they are quite upset to learn the throttling stand-in could cause loss of knowledge about the actual root cause. This thread has been open since 2013. Exactly where on the roadmap is this? Mark
  31. 3 points
    Brandon, I have modified the code slightly to get rid of the annoying open and close brackets(my OCD kicked in hard). groovy script: hostname=hostProps.get("system.hostname") my_query="Select NAME from Win32_serverfeature" def session = WMI.open(hostname); import com.santaba.agent.groovyapi.win32.WMI def result = session.queryAll("CIMv2", my_query, 15); def strResult = result.NAME.toString(); def finalOutput = strResult.replace("["," ").replace("]"," "); println "WindowsFeatures=" + finalOutput P.S. - this script is awesome!
  32. 3 points
    P2HEZR - Public IP property source Kind regards, Matthew Kerfoot
  33. 3 points
    I completely agree with this. Different vendors use TRAPs differently and from my experience may send the same trap multiple times, sometimes several times a minute or more even. The TRAP functionality in LogicMonitor will not be useable in these cases because the noise it will create will be a huge distraction for any NOC to be able to handle. It takes their eyes off of other possibly critical events because of multiple duplicate alerts for the same issue. Let's take Barracuda for instance. Their NextGen Firewalls have a TRAP for HA Partner Unreachable. We received a trap every 5 minutes for about 2 hours while this situation was occurring. From Barracuda's standpoint, this was a single event with notifications that go out every 5 minutes until the error goes away. They don't have a pollable "GET" MIB to track this scenario either. I would propose the logic this way: LM receives a trap that matches an EventSource criteria and triggers the configured alert. That eventsource is configured with a timeout value (let's say 60 minutes). If another Trap from the same device with the same content comes in before the timeout value, don't create a new alert, but rather increase a "count" counter on that alert AND RESET THE TIMER. As long as no new traps come in within the configured timeout (60 minutes in this example), the alert will clear like normal. If a new trap comes in after the timer, a new alert is generated. You may need to provide an interface to view all the Trap data associated with that one Event Alert since there will now be multiple. This issue is plaguing our company right now and we are a very large MSP. We are at the mercy of vendors who don't provide polling MIBs for some critical actions like this, hence why SNMP Traps become more of a necessity.
  34. 3 points
    It is currently impossible to detect certain conditions without having to be bombarded by noise alerts, which I am told is against the philosophy of Logic Monitor. Consider a few cases: * interface flaps a few times versus more frequently -- how do you tell the difference? right now, you have no choice other than perhaps to construct an API script (not tested). A better solution in this example would be to count the number of flaps over a period of time, and use that as your alert trigger. As it stands right now, there is not even a method to select the top 10 most unstable interfaces since it is literally a yes or no value and top 10 makes no sense. * resource utilization (bandwidth, CPU etc.) is sometimes much better checked over a period of time than just a single interval. the answer I have received on that is "require N checks to fail", and this works if the resource is pegged, but not if it is spiky. As it stands now, the longer of a period you want to simulate via "N checks", the higher the chance one check will reset the alert but the overall result is clearly bad on inspection. Please note this problem has been solved long ago by other tools, like Zabbix (https://www.zabbix.com/documentation/3.4/manual/config/triggers/expression), so hopefully this can be added to LM in the near future as well.
  35. 3 points
    We would like to see means to create groups in order to add VM guest instances into. We need the ability to better organize alerts that would be constrained to these specific instance groups rather than all VM instances for a given data source. For example, under a vmware host there is 'VMware VM Disk Capacity'. Within or organization, certain groups care about certain servers disk capacity but don't want to be alerted about another groups server. It seems that today there is no way to achieve this. Support suggested we post this as a feature request. Thank you Aaron
  36. 3 points
    Hello, We'd like to request some more usage for instance groups. Right now, it's just not very useful to group instances on a datasource. We have shared devices with datasources belonging to different teams and we have to create dashboards and alarm rules regarding those. Right now, we have to use the wildcard filter in a "creative" way to have shared device alerts and dashboards from different teams configured. It would be really helpful if the instance-group name could be used in Filters. Use-Case: * To configure alert rules for shared devices for different teams, we can group all datasource-instances in instance groups named "teamname" and then filter on "teamname", this works even when we use "*" for device/devicegroup, as long as instancegroup "teamname" is persistent over multiple shared devices. * To have dashboards for shared devices on a per-team base, we can filter for the teamname when creating those dashboards. This also works with "*" as device/devicegroup query, so instances on new devices will be added automatically. Regards, Bastian
  37. 3 points
    WARNING - This propertysource pulls a list of all Windows services installed. This does not filter the services to only show running or auto-starting services. Useful for auditing, auto-grouping, and inventory. Example below Displays the a list of all installed Windows Services. auto.winservices [AeLookupSvc, ALG, AppIDSvc, Appinfo, AppMgmt, aspnet_state, AudioEndpointBuilder, AudioSrv, BESClient, BESClientHelper, BFE, BITS, Browser, CertPropSvc, clr_optimization_v2.0.50727_32, clr_optimization_v2.0.50727_64, clr_optimization_v4.0.30319_32, clr_optimization_v4.0.30319_64, COMSysApp, CryptSvc, DcomLaunch, defragsvc, Dhcp, DiagTrack, Dnscache, dot3svc, DPS, EapHost, EFS, eventlog, EventSystem, FCRegSvc, fdPHost, FDResPub, FontCache, FontCache3.0.0.0, gpsvc, hidserv, hkmsvc, idsvc, IEEtwCollectorService, IKEEXT, IPBusEnum, iphlpsvc, KeyIso, KtmRm, LanmanServer, LanmanWorkstation, lltdsvc, lmhosts, MMCSS, MpsSvc, MSDTC, MSiSCSI, msiserver, MSSQL$SVSSDB, MSSQLFDLauncher$SVSSDB, MSSQLServerADHelper100, napagent, Netlogon, Netman, NetMsmqActivator, NetPipeActivator, netprofm, NetTcpActivator, NetTcpPortSharing, NlaSvc, nsi, PerfHost, pla, PlugPlay, PolicyAgent, Power, ProfSvc, ProtectedStorage, RasAuto, RasMan, RemoteAccess, RemoteRegistry, RpcEptMapper, RpcLocator, RpcSs, RSoPProv, sacsvr, SamSs, SCardSvr, Schedule, SCPolicySvc, seclogon, SENS, SessionEnv, SharedAccess, ShellHWDetection, SNMPTRAP, Spooler, sppsvc, sppuinotify, SQLAgent$SVSSDB, SQLBrowser, SQLWriter, SSDPSRV, SstpSvc, swprv, TapiSrv, TermService, THREADORDER, TrkWks, TrustedInstaller, UI0Detect, UmRdpService, upnphost, UxSms, VaultSvc, vds, VGAuthService, VMTools, vmvss, VMware Physical Disk Helper Service, VSS, W32Time, WcsPlugInService, WdiServiceHost, WdiSystemHost, Wecsvc, wercplsupport, WerSvc, WinHttpAutoProxySvc, Winmgmt, WinRM, wmiApSrv, WPDBusEnum, WRSVC, wuauserv, wudfsvc] Z4LHDZ
  38. 3 points
    What I'm picturing is a hybrid of the NOC widget and big number widget. The big number (text color or background color) would be in green when not triggering any alert conditions and yellow, orange, or red to match alert level. Alternatively, adding a value column to the NOC widget would accomplish the same goal. Either of these allows the eye to be drawn to the most important things on a dashboard without having to dig for the associated value.
  39. 3 points
    Please add a Settings option to allow users to set the colors for alert severities. This could be as simple as setting the CSS hex values/RGB values via a Settings UI. Reason is some people have color blindness and would prefer to select their own color values for severities.
  40. 3 points
    With today's technology it's easily possible to remove the LME#########, LMI######## and LMC####### from the subject line of email alerts. To keep replies working for ACK/SDT it can be moved to the message body, hidden it in properties or any method that other companies use to accomplish the same thing. Even moving it to the end of the subject line would help. It takes up the majority of the visible subject line when viewing your emails.
  41. 3 points
    I'd like to have additional visual filter options for the big number widget configuration to drive the background color of the widget: Red value expression Amber value expression Otherwise Green or default background color.
  42. 3 points
    This is great, but can it be implemented via a Service Check?
  43. 2 points
    This one goes into some additional detail but hasn't been completely cleaned up for debugging purposes. The ones that have switch stacks pull all the stack serials and model numbers. Work in progress was the versioning. We have about 1500+ network devices across many many different models and versions so this has taken a little bit of work to get to work across all. MWXMXZ - Cisco-IOS FKA79M - Cisco IOS XE 9LF63N - NXOS G366DD - Cisco ASA
  44. 2 points
    The alert "template" system has no way to reference another instance datapoint value currently. Why would this be good? If you have a datapoint that alerts for status, you want to insert the value of the thing you care about, not the status value. I see the contortions LM datasource developers have gone through to workaround this (e.g., Cisco Nexus Temperature). Please make it possible to reference datapoints at least within the same datasource within alert templates. There are many other issues with alerts, but I will stop here for now :).
  45. 2 points
    We would like be able to query historic SDTs to determine if a device,group, or instance has recently come out of SDT via API.
  46. 2 points
    967XA7 This datasource queries the new Solr metrics API to gather JVM performance data without having to actually enable jmx on the solr node. This will only work on Solr nodes running version 6.6 or higher. Feedback always appreciated.
  47. 2 points
    It would be nice if we could group up our AppliesTo functions, similar to how you group them with DataSources, PropertySources, ConfigSources, PropertySources, and JobMonitors. Thanks!
  48. 2 points
    PAGA9W This is a standard healthcheck as outlined in the RabbitMQ API documentation. Returns a value of 1 if the healthcheck is passed and 0 if there is a failure. Runs basic healthchecks in the current node. Checks that the rabbit application is running, channels and queues can be listed successfully, and that no alarms are in effect. If everything is working correctly, will return HTTP status 200 with body: {"status":"ok"} If something fails, will return HTTP status 200 with the body of {"status":"failed","reason":"string"}
  49. 2 points
    Hello, It'd be really great to be able to run datasources by scheduler (Collect every). For example, at 3 am every day or 2 pm every Monday. I'm aware of pollinginterval but even if i set it to run every 3600 min (every day), i can't set time when it should be executed.
  50. 2 points
    Hello, I just wanted to quickly share a library I wrote today with the LM community. I hope this post is not misplaced. The library eases submitting GET and POST calls to the Logicmonitor REST API from C# code. The library also contains additional classes to de-serialize data returned from the REST API into usable objects in your code. NuGet Package is here https://www.nuget.org/packages/Logicmonitor.REST It is also find-able on nuget.org or through Visual Studio's NuGet package manager. Happy Monitoring!