Hi,
Â
just another idea to address the problem for getting an overview of outages in bigger environments:
Â
For a customer who monitors a network with around 8000 devices and many locations (has nothing to do with the Minion location concept), we placed each node in a surveillance category named "Location-<location-name>" (e.g. Location-Fulda, Location-Stuttgart, ...). For the OpenNMS start page, we implemented a box which shows the top 10 locations with outages.
Â
Example Box:
Â
Location with Device ouatges (top 10):
- Fulda (2/21 nodes)
- Stuttgart (1/40 nodes)
Â
Â
For them it is very useful to see the outages by location, as in a network of this size there will always be an outage. We build some other pages to see the nodes in each locations with status and so they have a drill down view.
Â
This was a very specific solution for this environment, but maybe the idea can be used to build a more generic approach. For example to build groups of nodes and show a box with outages by group:
Â
Example Box:
Â
Group outages (top 10):
- Web-Servers (2/21 nodes)
- Mail-Servers (1/3 nodes)
Â
Best regards
Michael
Â
Michael Batz - Professional Services & Solutions
Â
NETHINKS GmbH | BahnhofstraÃe 16 | 36037 Fulda
T +49 661 25 000 0 | F +49 661 25 000 49 | ***@nethinks.com
GeschÀftsfÌhrer: Uwe Bergmann | Vorsitzender des Aufsichtsrats: Garry Glendown | AG Fulda HRB 2546
Â
Â
-----UrsprÃŒngliche Nachricht-----
Von:Markus von RÃŒden <***@opennms.com>
Gesendet:Mi 10.05.2017 21:16
Betreff:Re: [opennms-discuss] OpenNMS Easterhack Review / Status Box
Anlage:signature.asc
An:General OpenNMS Discussion <opennms-***@lists.sourceforge.net>;
Hey guys,
 I reverted the changes to leave the âXYZ with Problemsâ boxes as they were before.
 Instead of showing 4 donut charts by default, only 3 are shown:
 * Node status based on alarms
 * Node status based on outages
 * Business Service statusÂ
 The order and what to show is configurable in opennms.properties.
 TL;DR:
No change to the Top N boxes.
In addition, status boxes (donut charts) will be shown by default.
Which charts and the position of the chart is configurable via opennms.properties.
 Cheers
- Markus
On 9. May 2017, at 22:22, Ronny Trommer <***@opennms.org <mailto:***@opennms.org> > wrote:
Hi guys,
 talking about the status box feature, I would like focus just on the two options, the whole âMaintenance Modeâ vs. "Scheduled Outages" is completely different story and out of scope for the status box enhancement.
 @Michael
What do you think about the âMouse over Top N tablesâ instead of having them as a box on the start page. Would something like that work for you?
Â
On 9. May 2017, at 19:52, Norbert Steinhoff <***@herr-der-mails.de <mailto:***@herr-der-mails.de> > wrote:
Hi Ronny
 how about a combination of both ?
 The status box should have some mouse over / hover in the "rings" which opens a list with the latest outages/events
from the hovered section with links to the affected nodes.
 This links should not open in the same window but in a popup  or in a new tab. So you donât loose  focus on the dashboard.
 And in the node details  -> a button for Maintenance mode (like Michael described before) till Service / Node comes back and Â
with configurable max.maintenance.duration  (defined in  opennms.properties). This timeframe prevents node from being forgotten ;)
During maintenance, no new events, no notifications.
 If maintenance mode is active it should be shown on the nodes page like the outages, but with date/time when the maintenance
modes ends automatically.
Best
NorbertÂ
Â
Am 09.05.2017 um 19:02 schrieb Ronny Trommer <***@opennms.org <mailto:***@opennms.org> >:
Hi Michael,
 thanks a lot for your detailed explanation and this helps a lot.
 First:
The old list boxes are still available and are not removed. We just donât have included them but the functionality is still there.
I would suggest make them fit for the use case you have described. We could call them âTop N {Alarms/Outages/Applications/Business Services}â, where N is the number you have configured in the opennms.properties as the limit.
 Second:
I definitely like your description "maintenance mode" vs. "scheduled outagesâ. I can help with working out the use cases and get it transformed into JIRA.
 @Reader
What would you like to see as the default view when you have the choice between the âTop Nâ boxes and the new âStatus Overviewâ box?
 * Just the "Top N Boxes"
* Just the Status Overview Box
* Both
 Thanks  Ronny
On 9. May 2017, at 17:16, Seibold, Michael <***@gkvi.de <mailto:***@gkvi.de> > wrote:
Hi Ronny and "the gang",
 first: great work and thanks for that!
 second: I want to give my opinion about the status box on the main page. Yes, I agree that in large environments the list is to short (in ours too). But compared with the donut-design it has also one big advantage:
 -       in "Nodes with pending problems" you see the newest unack'ed alarms first, as they are ordered "new ones on top". If you get a call "since about 10 Minutes we have following problem..." you can see at the first glance (if the list is long enough...) if there were alarms at this time which are still unack'ed. You don't have to click around to get there.
-Â Â Â Â Â Â Â same for "Nodes with outages".
-Â Â Â Â Â Â Â systems with longer during outages probably won't be responsible for problems that just popped up, so in a first glance you probably can ignore them.Â
 Unfortunally I don't have a "great solution" available for this. It's just a hint that there are good functionalities that I will miss with the donut solution.
 One thing that might help: to my experience in every large environment you will have (sometimes a lot of...)  systems that are unavailable for some unspecified time. They will appear in those lists mentioned above filling them up. A functionality like putting them into "maintenance mode" with the (configurable?) possibility not to show them in those lists on the start page might be a help to keep the lists smaller.
 The difference between the "maintenance mode" I know from other monitoring systems and scheduled outages or setting them to unmanaged is that
-Â Â Â Â Â Â Â planned outages have a limited time span
-Â Â Â Â Â Â Â setting them to unmanaged has an unlimited time span (and will probably be forgotten until the system realy fails without an alarm)
 whereas "maintenance mode" is true until the service/system is responding again. If it is responding, the maintenance mode should be cleared automatically and everything like service checks, collecting data, thresholding etc. is up again without additional human actions.
 So if there would be the possibility of defining maintenance mode for objects and not to show the services/systems/alarms for objects in maintenance mode the lists might be a lot shorter.  I thought I opened an enhancement request for that about 8 years ago, but I can't find it any more. Maybe it was just a discussion between David and me.
 -Michael
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org <http://slashdot.org/> ! http://sdm.link/slashdot_______________________________________________ <http://sdm.link/slashdot_______________________________________________>
Please read the OpenNMS Mailing List FAQ:
http://www.opennms.org/index.php/Mailing_List_FAQ <http://www.opennms.org/index.php/Mailing_List_FAQ>
opennms-discuss mailing list
To *unsubscribe* or change your subscription options, see the bottom of this page:
https://lists.sourceforge.net/lists/listinfo/opennms-discuss
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org <http://slashdot.org/> ! http://sdm.link/slashdot_______________________________________________ <http://sdm.link/slashdot_______________________________________________>
Please read the OpenNMS Mailing List FAQ:
http://www.opennms.org/index.php/Mailing_List_FAQ <http://www.opennms.org/index.php/Mailing_List_FAQ>
opennms-discuss mailing list
To *unsubscribe* or change your subscription options, see the bottom of this page:
https://lists.sourceforge.net/lists/listinfo/opennms-discuss
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org <http://slashdot.org/> ! http://sdm.link/slashdot_______________________________________________ <http://sdm.link/slashdot_______________________________________________>
Please read the OpenNMS Mailing List FAQ:
http://www.opennms.org/index.php/Mailing_List_FAQ <http://www.opennms.org/index.php/Mailing_List_FAQ>
opennms-discuss mailing list
To *unsubscribe* or change your subscription options, see the bottom of this page:
https://lists.sourceforge.net/lists/listinfo/opennms-discuss
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org <http://Slashdot.org> ! http://sdm.link/slashdot_______________________________________________ <http://sdm.link/slashdot_______________________________________________>
Please read the OpenNMS Mailing List FAQ:
http://www.opennms.org/index.php/Mailing_List_FAQ <http://www.opennms.org/index.php/Mailing_List_FAQ>
opennms-discuss mailing list
To *unsubscribe* or change your subscription options, see the bottom of this page:
https://lists.sourceforge.net/lists/listinfo/opennms-discuss
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Please read the OpenNMS Mailing List FAQ:
http://www.opennms.org/index.php/Mailing_List_FAQ
opennms-discuss mailing list
To *unsubscribe* or change your subscription options, see the bottom of this page:
https://lists.sourceforge.net/lists/listinfo/opennms-discuss
__
SAVE THE DATE! Unsere kommenden Veranstaltungen:
************************************************
12. Mai 2017 | fibit 2017 - Technologie- und IT-Messe Fulda
17. Mai 2017 | Webinar: Verteiltes Monitoring mit OpenNMS Minions - Eine Einfuehrung
Eine Übersicht über unsere Veranstaltungen erhalten Sie unter: www.nethinks.com/events