Leaderboard


Popular Content

Showing content with the highest reputation since 02/26/2016 in all areas

  1. 12 points
    Allow devices to be dependent on one another. If a router goes down, the switch behind it will most likely go down or have an error as well.
  2. 8 points
    Hello, We currently use a third party (OpsGenie) for alerting, and currently we have a custom integration configured in LogicMonitor to send alert information to OpsGenie. Although there has been significant improvements to integrations over the past few months, one feature that lacks significantly for us is a supported two way integration between OpsGenie and LogicMonitor. This would be very similar to the partnership/integration that LogicMonitor has already built with PagerDuty. As LogicMonitor releases new integration features, it tends to break current workflows with alert creation in OpsGenie. A supported integration would give us more confidence that as LogicMonitor continues to release new features, that our alert functionality would continue to operate as expected.
  3. 7 points
    We are a global company with resources in Minnesota, New Jersey, Australia, Ukraine, and India all using the Logic Monitor tool set. It would be incredibly useful to be able to set the timezone at a user level instead of only at the company level.
  4. 6 points
    When there is a legitimate reason for disabling alerts for a device, it would be very useful to be able to leave a note as to why (and by whom). This would prevent confusion with teams, where the case of "why would this be disabled" would come up frequently. For example, there is a known bug with a certain version combination of ESXi and HPE servers that triggers a false-positive hardware alert internally, so we disable alerts for that instance on servers that meet the criteria as we encounter them. Or, some QNAPs will give false-positive alerts that their disk is full when in fact it is "full" due to a RAIN configured as a LUN (we thus rely on the server alerting when the iSCSI volume is actually full). However, another technician may log in and flip alerting for these instances back on, assuming it was a mistake or something, and then we would get flooded with these false-positive alerts, prompting technicians to look into them; as you can see, this causes a loop of wasted time. Simply putting a note associated with the "Alerting Off / On" switch and tagging it with the user invoking it would easily solve issues like this. Something like what is shown for Acknowledgements would be adequate. Perhaps even an admin option to require a note or not?
  5. 5 points
    I would love to see LM implement a new feature for taking a built-in, self prescribed, action on an alert. To minimize any exposure that LM might have in an action gone awry, the actions taken could occur as the result of a script that one could upload into the Escalation Chain. Ideally you could define multiple actions or multiple retries on an action and whether that occurred before or after the recipient notification in the notification chain. This would allow for very basic alerts (disk, service restarts, etc) to be resolved programatically. Also being able to support various scripting languages such as PowerCLI, Ansible, etc would allow for some very creative ways to integrate with solutions such as VMWare or Ansible Tower for very complex actions to be crafted by more expert skill level folks.
  6. 5 points
    I have run into too many cases now where a new but slightly different DS is setup due to LM support actions, upgrades, etc. and the result is lost data or noncontinuous data. A good example I recently encountered is with NTP. The standard DS was not working in all cases. I was given a new DS that uses Groovy, and it works (which I appreciate!). But the datapoint list and names have changed, and even if they had not, there is no way to maintain data history from the old DS to the new DS. My recommendation is to add a migrate function so you can indicate how to map old to new datapoints in such a situation and thus avoid data loss. Building in a default migration ruleset into a new DS would be a bonus -- this could allow for zero-touch data migrations in at least some cases. Thanks, Mark
  7. 5 points
    It would be nice to have an export button on any alert table. When we're doing research on an issue and we've finally narrowed the criteria to see the info we need, it helps to have an export button right there, rather than having to go to reports and reconfigure all the parameters to hopefully get the same data.
  8. 5 points
    I have published a PowerShell module, which refactors part of the REST API, to the PowerShell gallery. Please feel free to make requests (or send me cmdlets you want added). https://www.powershellgallery.com/packages/LogicMonitor/
  9. 4 points
    Can you please add an Inventory\List tab on the RHS (right hand side) when we have a group selected. Its very annoying that I cannot just get a full list of all the objects in a group/folder quickly and easily thru the main GUI and have to do this thru a "Report" When I click on a group I just want to see all the devices\items in that group on the RHS and have all the properties I choose to select to view for example: name,ip,OS,mac,class type(computer/router/switch), etc.. Why is there no List/Inventory view OOTB in the GUI? I also need to be able to export the list quickly to any format, HTML, CSV, PDF so I can send a quick list report to whoever is requesting this. I don't want to have to go into Reports and then have to choose the group to run the report on. This is basic what's in this folder/group behavior.
  10. 4 points
    We request another way for Logic Monitor to gather information from Linux based systems besides SNMP. Using SNMP involves installing NET-SNMP so taking up additional storage space. It would be better if Logic Monitor had an option for monitoring Linux systems like we had using Sitescope. In Sitescope all it needed was SSH and a dedicated login credential for the system. The Monitor would connect to the Linux based system through SSH and run scripts pertaining to what aspects were monitoring on that system. Then stores the results to be displayed on the console. Similar to using WMI in Windows.
  11. 4 points
    Hello, As an MSP we've the need to monitor a lot of directories/shares for the same client. Some of those shares are accessible with the collector user, however, we've some clients that restrict their share to specific users (not the ones running the collector service). I've tried to create a datasource that's a simple runas where we can pass the user/passwd as a parameter, however, that isn't possible to run from the collector level (confirmed by LM staff in a case that I've raised). Can this be implemented? This feature would be very important (since we monitor +100 clients).
  12. 4 points
    Hello, As an MSP we've the need to group multiple interface instances (from different devices & different clients as well) in order to set common thresholds, reports, etc... From my research that isn't possible within LM. Anyone able to do that? This would be very useful from the monitoring/management perspective. Regards,
  13. 4 points
    Can you please add the AUDIT tab on the RHS (right hand side) for any device you have selected from under Resources/Websites. I want to quickly check who/what updates were made to the device I selected and why do I have to go into an entirely different section (Settings > Audit) to then check and search for the audit history on the device I selected. Please add this to the GUI so I can just quickly check the history on the device selected.
  14. 4 points
    Hi All, We have really been enjoying the Remote Management feature of logic monitor. For sites that we don't have a direct interconnect with its great being able to quickly SSH onto our devices to make adjustments or check config without having to open up a separate VPN tunnel. However with HTTP/HTTPS management becoming common with Firewalls, Controllers, Routers etc... I feel there is a huge opportunity to have logic monitor be able to fit almost every management use case by implementing an HTTP/HTTPS remote session functionally in the same way RDP and SSH remote sessions work. We as a company would primarily use this feature for help managing networking Equipment, but functionality would extend to Printers, IPCameras, Security Systems, Phone systems, UPS and many more. Let me know your thoughts, Thanks, Will.
  15. 4 points
    Are there any plans to have linked clones for dashboards? At the moment we create a template dashboard using tokens and then clone them and then update the token to match what we need to. The problem is if we make a change to the format of a dashboard, we either have to do it on all of them, or delete all the cloned ones and then clone them out again. It would make things a lot easier if all the clones were linked back to the master, and an update to layout or widgets on the master was automatically replicated to the cloned templates.
  16. 4 points
    It is currently impossible to detect certain conditions without having to be bombarded by noise alerts, which I am told is against the philosophy of Logic Monitor. Consider a few cases: * interface flaps a few times versus more frequently -- how do you tell the difference? right now, you have no choice other than perhaps to construct an API script (not tested). A better solution in this example would be to count the number of flaps over a period of time, and use that as your alert trigger. As it stands right now, there is not even a method to select the top 10 most unstable interfaces since it is literally a yes or no value and top 10 makes no sense. * resource utilization (bandwidth, CPU etc.) is sometimes much better checked over a period of time than just a single interval. the answer I have received on that is "require N checks to fail", and this works if the resource is pegged, but not if it is spiky. As it stands now, the longer of a period you want to simulate via "N checks", the higher the chance one check will reset the alert but the overall result is clearly bad on inspection. Please note this problem has been solved long ago by other tools, like Zabbix (https://www.zabbix.com/documentation/3.4/manual/config/triggers/expression), so hopefully this can be added to LM in the near future as well.
  17. 4 points
    The ability to drag and drop items in the device tree in order to reorganize them is a powerful feature, but the staggered manner in which the tree loads and expands on page load "encourages" a user to accidentally move folders and devices around in the tree, which leads to numerous detrimental effects (not the least is having to try and figure out where you accidentally moved them to). Request is to either introduce a safeguard prompt "Are you sure you wish to move X from Y to Z?" for all drag and drop actions within the device tree, or, alternately, introduce a configuration item on folders and devices allowing them to be "protected" against moves --- either adding a safeguard "are you sure...?" prompt for those specific items, or prevent moving them entirely without first taking the action to "unprotect" them.
  18. 4 points
    We've recently run into issues with users accidentally changing a setting or deleting a device and would like the ability to allow users to Create new devices, but not be able to delete anything or change alert settings. I'd like to either split Manage into Write/Delete groups or add a deny action role that would allow me to give users manage access with a deny delete:*
  19. 4 points
    Useful for inventory, auditing, and auto-grouping. Displays the a list of all installed Windows Features separated by commas. Example below. auto.winfeatures [Active Directory Lightweight Directory Services, .NET Framework 3.5.1 Features, Telnet Client, Remote Server Administration Tools, .NET Framework 3.5.1, Role Administration Tools, AD LDS Snap-Ins and Command-Line Tools, AD DS and AD LDS Tools, Active Directory module for Windows PowerShell] WMN9DN
  20. 4 points
    As we move towards a DevOps model, we increasingly have a need for small teams to have full admin access to the tools they use to manage their IT services. When it comes to LogicMonitor, this has proven difficult with the existing role permission model. DevOps pods would like to be able to manage their own datasources, alerts, and escalation chains but this isn't possible unless we give them broad access rights to those areas, which could cause significant harm to other groups of monitoring users. For example, an inexperienced DevOps user could inadvertently modify a datasource that applies to all Windows devices or they could create an alert rule that causes alerts not to be delivered to other users. To solve this problem, I'd propose that LogicMonitor offer alert groups, escalation chain groups, along with the existing datasource groups. Then, LogicMonitor could provide the ability to restrict roles to manage these specific groups. DevOps pods could be given the ability to manage their own custom subset of datasources and set up their own alerts in a rule range after the main set of rules.
  21. 4 points
    Currently the graph options for Widget includes three was to "Limit Results". The are "Top 10", Aggregate, All. There are times when I want to display the "Bottom 10". For example the lowest 10 CPU devices, or the 10 least available devices, etc. It would be GREAT to have a Bottom 10" option in the Limit Results parameter. Thanks.
  22. 4 points
    As mentioned to Haniz in ticket number 82871, it would be very handy to have a button that can expand all the graphs for you on a device view. It would save quite a bit of time
  23. 3 points
    It would be useful to have SNMP traps that trigger within a specific timeframe to be considered the same alert. We have a few cases where devices start throwing traps every minute and by the time we react to fix we already have dozens of alerts. It would be better to consider the same trap within a time frame to be the same alert to avoid this alert flood.
  24. 3 points
    When you use alert tunning a the host/group level for a particular data source, I would like the ability to select to alert on No-Data when creating custom alert thresholds.
  25. 3 points
    From my understanding, the native WMI-based checks will make a new WMI call for each instance, so 1 WMI call for each windows service and process, hence why you see 12k of them. There are a lot of types of checks that work that way, but there is one option that will let you make one WMI call per device (if you can get all the data in one call) and extract in bulk for all instances at once: BATCHSCRIPT. I'm not sure if it would completely help in your situation, but if you switch from native WMI to using something like a PowerShell or Groovy BatchScript, you can send one WMI query to the server and get data for all services/processes at once. Scripts do cause more load on the collector than most native checks, but 150 script instances (75*2) are likely less load then 12k WMI instances.. Actually I think the collector does WMI queries via powershell anyway, not 100% sure about that, so even less of a concern. You can still keep the old WMI AD method and just move Collector Attributes to use batchscript.
  26. 3 points
    Please add a GUI settings based on user profile that would allow me to disable the auto-expanding of and object in the Resources tree when I click on it Currently any item in the navigation tree when you click on it auto opens and expands to show all its sub items. Why? The tree should just expand if I click on the > next to the items name. Why do you automatically show everything under the resource. If there's 20+ items I have to then scroll down past this to get to the next item. Why are you using different behavior in your GUI? Asking for a setting that controls this so I can disable this behavior.
  27. 3 points
    https://www.logicmonitor.com/support/datasources/creating-managing-datasources/creating-datasources/ We'd like to be able to set "Collect every" to be longer than 1 day, like 1 week.
  28. 3 points
    Rather than have Websites as separate section in the product with a separate hierarchy to manage; how about making all of the Websites stuff part of the Device Tree and rename the Devices section to something that covers both. Then if I want to add a website or service check I simply do it against the "group". This way I wouldn't have to maintain two hierarchies of business services. What do other folks think of this?
  29. 3 points
    We have a use case to show "Response Times" from a subset of configured Websites. Ideally I'd like this to be in the Big Number widget. We also want to able to chart a subset of my Websites' response times over time in the Chart widget. Anyone found a useful workaround to achieve this? Would LM consider "upgrading" widgets to allow the presentation of Website data? Currently only the SLA widget seems capable of handling Website data.
  30. 3 points
    It will be great for the MAP widget to be able to display website. We monitor over 2000 devices and would be good to be able to display the geographical outage of our devices.
  31. 3 points
    We have a multi-tenant MSP environment, We find that we cannot use the same display name for our clients even though the similarly named systems are in different Child folders. So \'DC01\' for Domain Controller has to be unique across all of our clients. Please consider changing this.
  32. 3 points
    Our CTO is asking me to provide a daily email report of some key metrics to our clients and internal stakeholders. I'm able to create a suitable report and email it, however his concern is the number of clicks to view the report. The belief is that the email will just go straight to trash if our clients can't see the report in the email body. For me, using gmail, I have to download the attachment first, then open it from my laptop. Two options we talked about were: 1. Use the HTML report as the body of the email rather than an attachment (surely the days of text-only email clients are gone) 2. A configurable/dynamic subject to provide some hint of what the report actually contains, e.g. the uptime for a particular service over the last 24 hours. If the subject said "100% Uptime", there is no reason to open the report, if the subject said "98% Uptime" I can open to see more detail. Does anyone else have any ideas on how to make reports a little more effective for a somewhat apathetic audience?
  33. 3 points
    Hi, with Meraki enabling Webhooks, can LogicMonitor receive Alerts for any of the events you enable on the dashboard? https://meraki.cisco.com/blog/2018/10/real-time-alerting-with-webhooks/ Additionally, is it any different to poll devices Meraki devices directly versus receiving information from the dashboard?
  34. 3 points
    We use the Debug Console a lot, and usually to debug an issue on a particular device, which means more often than not, we access the Debug Console from the device in question's Raw Data view. We would like for there to be a token (##THIS## or ##DEVICE##) that could be used with Debug Console commands. So if we were to issue the command !ping ##THIS##, the console would automatically interpolate ##THIS## with the IP address of the device on which we opened the Debug Console. It's a small productivity gain, but when you're doing this a lot it makes a big difference.
  35. 3 points
    Don't know if anyone else noticed, but MS released a pretty slick script that enables WMI access remotely without admin rights. I have done a brief test with LM and it seems to be working well. https://blogs.technet.microsoft.com/askpfeplat/2018/04/30/delegate-wmi-access-to-domain-controllers/ That's the article. I created an AD group instead of a user to delegate, and I put the LM collector service in that group. Everything else I've followed as documented. I haven't tested anything else, but this alone is a huge step in the right direction.
  36. 3 points
    We urgently need a way for our regional operations teams to be able to see alert date times in local time. This a critical requirement as we are a global business and for people outside of the account timezone to be constantly trying to work out the local time of an alert is ridiculous.
  37. 3 points
    It would be great to have the granularity to be able to change the Alert Trigger Interval on an instance in the same way a threshold can be modified on an instance. Example: Disk Usage on a netapp There are 100 volumes but one of them needs to be over the threshold for a longer amount of time to cause concern. So the archive log volume will fill up no matter what space is allocated but it is not a concern as long as it is back below the threshold within 1 hour. A custom datasource will not work for this because we would like to alert immediately on all other volumes on this device when they go above the threshold.
  38. 3 points
    There are currently far too many opportunities to commit errors in LM from which is is difficult to recover since there is no version tracking. Ideally, it would be possible to revert to a previous version of any object, but especially very sensitive objects like logicmodules, alert policies, etc. I have created my own method of dealing with this, which leverages the API to store JSON streams of all critical elements regularly, changes committed via git (certain adjustments to the original results are needed to avoid a constant update stream). Recovery would be very manual, but at least possible. This would be far more useful within the system itself. Thanks, Mark
  39. 3 points
    I see a need in the design to alert on deviation from rolling average: example 1: Temperature in hardware is based on fixed baseline (default or manual adjusted) or based on fixed Delta. In real world application it would Make a LOT more sense to alert on Deviation from a 5 day or 30 day rolling average Temp of the box. Reason is, units alarm on the weekends because the office shuts off the AC during the summer. or they alert During the week 9-5 because in the winter the offices crank the heat. All of these ignore nuance of RANGE and Average expectation for the location...The alerting should just be how FAR outside the average Range for the site is. My Nashville facility hovers from 56 to 59 all week. I have it set on 57 so I get alerts at least once a weekend. I could move it to 59...but that's a band-aid. The REAL solution would be to have the software TRACK the last 30 days, and alert when we're outside the NORM for that location. furthermore....with hardware it is not the specific temps that kill the hardware....its the RATE at which the temp changes. so, the alerts SHOULD be based on the average range the system has seen in the last 30 days, and alert ONLY when the rate of change accelerates...but I imagine THAT request would be more challenging to reduce to an algorithm. Example 2: PING times.....I have sites where the Latency range is EXTREME (Mumbai, Johannesburg, Taipei etc...) I'd wished the PING would track the 30 day range and common deviation from norm and alert when the sites see latency that is way outside the expected fluctuation range. 30ms typical 90% of the time + 200-500ms spikes 10% of the time. when Ping times hit 300 ms for more then 10% of the last hour of sampling....then notify warning to inform of change in TREND....not fixed threshold in immediate sample
  40. 3 points
    Currently, a table must have static columns and rows defined before the widget will display data. It would be great to be able to dynamically build a table's rows based on * To expand on this, it would be great for the table to have the option to exclude instances with zero/no data from the list. For example, I would like a table that displays all MSMQ queue names and the number of messages in each queue - but not display anything if the current queue length = 0
  41. 3 points
    I have customers who really need this feature, and they are quite upset to learn the throttling stand-in could cause loss of knowledge about the actual root cause. This thread has been open since 2013. Exactly where on the roadmap is this? Mark
  42. 3 points
    Brandon, I have modified the code slightly to get rid of the annoying open and close brackets(my OCD kicked in hard). groovy script: hostname=hostProps.get("system.hostname") my_query="Select NAME from Win32_serverfeature" def session = WMI.open(hostname); import com.santaba.agent.groovyapi.win32.WMI def result = session.queryAll("CIMv2", my_query, 15); def strResult = result.NAME.toString(); def finalOutput = strResult.replace("["," ").replace("]"," "); println "WindowsFeatures=" + finalOutput P.S. - this script is awesome!
  43. 3 points
    I completely agree with this. Different vendors use TRAPs differently and from my experience may send the same trap multiple times, sometimes several times a minute or more even. The TRAP functionality in LogicMonitor will not be useable in these cases because the noise it will create will be a huge distraction for any NOC to be able to handle. It takes their eyes off of other possibly critical events because of multiple duplicate alerts for the same issue. Let's take Barracuda for instance. Their NextGen Firewalls have a TRAP for HA Partner Unreachable. We received a trap every 5 minutes for about 2 hours while this situation was occurring. From Barracuda's standpoint, this was a single event with notifications that go out every 5 minutes until the error goes away. They don't have a pollable "GET" MIB to track this scenario either. I would propose the logic this way: LM receives a trap that matches an EventSource criteria and triggers the configured alert. That eventsource is configured with a timeout value (let's say 60 minutes). If another Trap from the same device with the same content comes in before the timeout value, don't create a new alert, but rather increase a "count" counter on that alert AND RESET THE TIMER. As long as no new traps come in within the configured timeout (60 minutes in this example), the alert will clear like normal. If a new trap comes in after the timer, a new alert is generated. You may need to provide an interface to view all the Trap data associated with that one Event Alert since there will now be multiple. This issue is plaguing our company right now and we are a very large MSP. We are at the mercy of vendors who don't provide polling MIBs for some critical actions like this, hence why SNMP Traps become more of a necessity.
  44. 3 points
    WARNING - This propertysource pulls a list of all Windows services installed. This does not filter the services to only show running or auto-starting services. Useful for auditing, auto-grouping, and inventory. Example below Displays the a list of all installed Windows Services. auto.winservices [AeLookupSvc, ALG, AppIDSvc, Appinfo, AppMgmt, aspnet_state, AudioEndpointBuilder, AudioSrv, BESClient, BESClientHelper, BFE, BITS, Browser, CertPropSvc, clr_optimization_v2.0.50727_32, clr_optimization_v2.0.50727_64, clr_optimization_v4.0.30319_32, clr_optimization_v4.0.30319_64, COMSysApp, CryptSvc, DcomLaunch, defragsvc, Dhcp, DiagTrack, Dnscache, dot3svc, DPS, EapHost, EFS, eventlog, EventSystem, FCRegSvc, fdPHost, FDResPub, FontCache, FontCache3.0.0.0, gpsvc, hidserv, hkmsvc, idsvc, IEEtwCollectorService, IKEEXT, IPBusEnum, iphlpsvc, KeyIso, KtmRm, LanmanServer, LanmanWorkstation, lltdsvc, lmhosts, MMCSS, MpsSvc, MSDTC, MSiSCSI, msiserver, MSSQL$SVSSDB, MSSQLFDLauncher$SVSSDB, MSSQLServerADHelper100, napagent, Netlogon, Netman, NetMsmqActivator, NetPipeActivator, netprofm, NetTcpActivator, NetTcpPortSharing, NlaSvc, nsi, PerfHost, pla, PlugPlay, PolicyAgent, Power, ProfSvc, ProtectedStorage, RasAuto, RasMan, RemoteAccess, RemoteRegistry, RpcEptMapper, RpcLocator, RpcSs, RSoPProv, sacsvr, SamSs, SCardSvr, Schedule, SCPolicySvc, seclogon, SENS, SessionEnv, SharedAccess, ShellHWDetection, SNMPTRAP, Spooler, sppsvc, sppuinotify, SQLAgent$SVSSDB, SQLBrowser, SQLWriter, SSDPSRV, SstpSvc, swprv, TapiSrv, TermService, THREADORDER, TrkWks, TrustedInstaller, UI0Detect, UmRdpService, upnphost, UxSms, VaultSvc, vds, VGAuthService, VMTools, vmvss, VMware Physical Disk Helper Service, VSS, W32Time, WcsPlugInService, WdiServiceHost, WdiSystemHost, Wecsvc, wercplsupport, WerSvc, WinHttpAutoProxySvc, Winmgmt, WinRM, wmiApSrv, WPDBusEnum, WRSVC, wuauserv, wudfsvc] Z4LHDZ
  45. 3 points
    What I'm picturing is a hybrid of the NOC widget and big number widget. The big number (text color or background color) would be in green when not triggering any alert conditions and yellow, orange, or red to match alert level. Alternatively, adding a value column to the NOC widget would accomplish the same goal. Either of these allows the eye to be drawn to the most important things on a dashboard without having to dig for the associated value.
  46. 3 points
    Please add a Settings option to allow users to set the colors for alert severities. This could be as simple as setting the CSS hex values/RGB values via a Settings UI. Reason is some people have color blindness and would prefer to select their own color values for severities.
  47. 3 points
    With today's technology it's easily possible to remove the LME#########, LMI######## and LMC####### from the subject line of email alerts. To keep replies working for ACK/SDT it can be moved to the message body, hidden it in properties or any method that other companies use to accomplish the same thing. Even moving it to the end of the subject line would help. It takes up the majority of the visible subject line when viewing your emails.
  48. 3 points
    I'd like to have additional visual filter options for the big number widget configuration to drive the background color of the widget: Red value expression Amber value expression Otherwise Green or default background color.
  49. 3 points
    This is great, but can it be implemented via a Service Check?