How to improve the customer service experience through status pages
With the 2021 holiday season right around the corner and the COVID-19 pandemic still prevalent, businesses are being conducted online now more than ever. The holiday rush also comes with incidents like websites going down, slow load times, and even possible hacking attempts. While planning to tackle the sudden increase in website traffic during the festive season, businesses must have an incident response plan in place to handle unexpected outages and the consequent surge in customer inquiries. With
Site24x7 now suggests the right Instance Type that can cut your costs and optimize utilization
Hi all, You're all aware that Site24x7 has a Guidance Report that suggests recommendations for different AWS services. It provides information on overutilization and underutilization of resources with suggestions to optimize them. Why more than the Guidance Report? With the Guidance Report, you can identify if your instances are highly utilized or underutilized. However, you can't obtain insights on which particular Instance Type needs to be optimized or which may be a better fit. Instance Type recommendations
Docker containers, healthcheck
Hi, The docker container monitors are always green and up, even when the container are failed a healthcheck and is in state unhealthy? Is this a bug or missing functionallity?
Better User Grouping and Access Control
Was wondering if there is a way to add this to the roadmap... Can we get User Groups and Alerting endpoints as distinct items? Currently Usergroups are used for alerting endpoints but it not always true that everyone in a group needs or wants to get alerts. Would be nice to have User-Groups be for segmenting users into roles (job functions) and have alert endpoints be specific to who should get alerting for specific monitors. For example, in our environment we have our Networking group but alerts
VMware Dashboards - Heat Maps and summary dashboards
Dear site24x7 team, please create dashboards for VMware metrics. 1. Heat Maps sorted by CPU, Memory or Disks are very helpful in big environments. This kind of dashboard should also useful for k8s oder legacy server environments to identify the biggest consumers. 2. A summary dashboard for each vCenter. It should display overall CPU, Memory (active, shared, ballooned, swapped), Disk I/O (read and write rate) and Network I/O Utilization (send, receive, including a multiline graph for each distributed
5 lessons from the October 2021 Facebook outage
On October 4, 2021, Facebook services went off the grid gradually, and then suddenly at 15:39 UTC. It took nearly six hours to restore service to normal. With over 3.5 billion users facing a lengthy downtime using one or multiple products from Facebook, Inc. (now known as Meta Platforms, Inc.) conversations flooded the internet about what caused the downtime issues on the American social networking service. This article attempts to outline the events that led to the outage, and help organizations
VMware change tracking
Dear site24x7 team, please implement an automatic VMware change tracking. It should record all changes for VMs and ESX Servers and it should be visible in a table for example in the vCenter Dashboard. For example: Date and Time, Server (ESX or VM), Type (i.e. CPU/Memory/Disk added or removed, on/off/reboot, DRS, VMotion etc.), Before value, After value, Change by (Username) That would be very helpful! Thanks and regards, Torsten
Introducing monitor-group-level permissions for admin users
As a super admin, have you ever wanted to allow user to make modifications, but restrict them to only monitors? If yes, then this feature is for you. With permissions at the monitor-group-level, you can provide admins with full permissions for a particular group of monitors. This allows the super admin to restrict access to resources created by others. This also helps with channelling and gaining control over what monitor groups each admin can access. Where are monitor-group-level permissions useful?
Manage logs from incident management platforms like Opsgenie and PagerDuty
Collect and manage log events from incident management platforms like Opsgenie and PagerDuty using Site24x7 AppLogs. Why is it important to manage logs from incident management platforms? Incident management tools offer solutions to respond to, report on, and investigate incidents online. They help operations teams access, communicate, and deliver information on time. These tools also analyze your IT environment before, during, and after an incident so IT teams get a clear picture on the cause, impact,
Server AI-based Thresholds
I am excited to start testing the server AI-based Thresholds under Threshold and Availability. Is there a list of available AI-based thresholds we can enable and is there more information about this feature posed anywhere? Thanks Eric
Can I get file along with extension in traces or any is there any API which returns file with extension?
Hi Team, Please check the below screenshot. My Actual file with extension is CmnTestReleaseController.java but I am getting traces without extension of the file as CmnTestReleaseController. Is there any way where I can get traces with the file extension or any API to get files with extension. Thanks in advance. ibb.co/zZwvJn5
Add Site24x7 - Sumologic integration with uptime and transaction time transaction metrics
I used to work with Datadog and pingdom where there is a readily integration between the two that provides uptime and transaction time metrics that I can use in Datadog. I'm now using Sumologic and Site24x7, but in the absence of a readily integration, I resorted to writing a script that calls Site24x7 APIs to obtain the performance metrics and then ingest them to Sumo. This solution works but doesn't scale up well a large number of monitors (in our case ~5000). I hope you can look seriously into
SLA Report that captures $ lost in report
It would be great if we had a SLA report that we could put a $ amount in it and when I get the report to an executive then it shows how much money we lost with the endpoint went down.
IP malfunctioned error - AWS SG
Hi all, Did anyone face IP malfunction error while adding "2604:e100:1:0:f816:3eff:fe52:163e" in security group in AWS in Canada Central region?
New monitoring location request
Please add Niger
Associate alert user group to monitor group
Now site24x7 does not support associate alert user group to monitor group. Users must associate user alert group to the monitor one by one.
Plugin Integration for SAP Hanna DB
Sería deseable tener un plugin para monitorear SAP Hanna DB.
Configuration association during PHP - APM Installation
Is there any way to alter the default configurations like associating with the correct monitor group/user alert group etc during APM Insight PHP agent install?
Powershell equivalent of curl to fetch monitor current status
Hi, I am fetching the json response via curl www.site24x7.com/api/current_status/<<monitor_id>> -X GET -H "Accept: application/json; version=2.0" -H "Authorization: Zoho-oauthtoken <<access_token>>" I am not able to make the poweshell equivalent of that.. i Can someone help ?
[Service Update] Planned maintenance of our CN data center: 14 November 2021, 12:00pm CST to 21 November, 12:00pm CST
Dear customers, As part of the regular maintenance at our China data center, we will be switching our apps from our primary data center in Shanghai to our secondary (disaster recovery) data center in Beijing on 14 November, 2021 from 12:00pm CST (04:00am UTC). We'll switch back to our primary data center in Shanghai on 21 November, 2021 by 12:00pm CST (04:00am UTC). We do not expect any major interruption, and you should be able to continue using all the Site24x7 services during this maintenance
Operator User Role access and Admin Role
Hi Please see below feature requests 1. It's good to have Operator Role to have access to Stop, Start and Reboot instances 2. Provide access to Admin Role to an specific Monitor Groups as of the moment it's restricted. Please see attached screen shots
StatusIQ : Building Application Status Page
Hello All, We are evaluating site24x7 for all monitoring needs of it. We are also exploring ways by which we can use the StatusIQ page to host the status of our application to our customers and rest of the world. I understand that we can add Monitors /Monitor groups as part of Status IQ page. But this would directly talk about exposing all backend components which is not right. We have 20 odd microservices and additional frontend applications. Want to know how best we can provide the status page
Feature Request - Sub-Admin
Our Operations teams are divided by several products. Each of these products has their own monitor creation and deletion requirements as well as teams responsible for creation/deletion. We have a unified NOC that monitors all products. Feature Request - I would like to see a "sub-admin" role that would have similar permissions to an "operator" but with the addition of add/delete monitors. This would allow the user to be scoped to a group or individually selected number of monitors that they would
More options for User Roles
We currently use the following roles in Site24x7: Super Admin Admin Operator The problem for us lies in that we have users that fall between Operator and Admin. These users should not be Admins but need to have the following roles that an Admin has: Add/Edit/Suspend/Activate/Delete Monitor Reports, Schedule Reports, Report Settings Custom Dashboard Schedule Maintenance It would be nice if there was a Jr Admin role created that allowed for a user to not be a full admin but still be able to perform
Splunk Integration
Hi folks, we recently started using 24*7 for monitoring our external facing sites and we are looking at options to onboard the data to on prem splunk. We want to collect only availability related data and do not want to onboard everything available. Anyone successfully did splunk integration can share any useful link/doc to start with?
check and alert if the agent stops to send data
How can I detect and trigger an alarm when no more data (Processes, CPU, Memory, Disks, ...) sent from the agent? Example: When no discs utilization data from agent is sent for > 5 minutes alert me Greets Lukas
Monitor Oracle DB in Solaris Servers SPARC
Hi Guys: Do you know if it's possible to monitor oracle Database that exists in various Solaris Servers Sparc?
Feature Request: Alert if long maintenance is still active
Hello, we are heavy users of the site24x7 API and we use for example the maintenance mode for monitors if we do deployments. But - the reason is unknown - sometimes monitors are hanging in maintenance mode. I have to check once a week if monitors are still in maintenance and i snooze the maintanence manually. It would be helpfull - until we have fixed our automatism via the API - if we can get or configure an alert if monitors are in maintenance for a longer or a specific time. (send to admin group
Introducing the dark theme for Site24x7, StatusIQ, and CloudSpend
Hi there, In our quest to avoid monotony and to provide a better UI experience, we've introduced a new theme for our product UIs. We're introducing a new dark theme for three products, Site24x7, StatusIQ, and CloudSpend. You'll experience a fresh look that provides improved viewability options, including greater contrast in the UI for your custom dashboards, Alarms tabs, as well as the Monitor Status, StatusIQ, and CloudSpend Budget or Account pages, and the overall web client. How to enable the
Drawback in Site24X7 bulk installation
Hello Team , I have found a bug/drawback in Bulk Installation of site 24X7 .When we try to deploy a bulk installation there is no way to distinguish which servers we have already installed and which servers we need to install the agents .It is a drawback .Can you address this . Thanks Sujith
Add a feature to alert when multiple requests are sent to the same server from any unknown IP/IPs
Please add a feature so that site24x7 can send alerts through email or/and calls when multiple requests (say more than 200 or any unexpected number) are sent to the same server from any unknown IP/IPs, or from any particular IP and also site24x7 can block them from sending further requests to the server. Please add this feature for both unknown IPs and known IPs so that these can be monitored and managed smoothly.
Bulk update Timeout
Please add the ability to bulk update the timeout for basic monitors. Thank you
How to alert user via phone/sms text message
I have setup a User Group, and configured given users to be notified via Email/SMS/Phone but, when the alert goes out to the group with only EMAIL notification to the other users. None of the other users besides me are also getting the SMS/text msg alert. How do I ensure all users in the group assigned to the Monitor get BOTH email and text/sms message alert?
Device Template for Aruba instant-on 1930 switch
Dear all, I think this is my first post here even though I wanted to get involved for a long time now. Anyway, I´m struggling once again with a device that currently hasn´t a device template present in S24x7. It´s an Aruba instans on 1930 switch (SKU: JL686A) Now, I do have the MIB files but in the MIB directory on my disk there are 121 files. So, when creating a custom device template and want to upload the MIB file, how can I figure out which is the right one? As of now, I haven´t discovered a
StatusIQ - Turkish Language Support
Hello, Can you provide Turkish Language for Status IQ site please? We may help you for translation. Emre Y.
Digital experience and digital experience platforms, defined
Usually, people want a seamless experience when they interact with any organization. Whether a B2B or B2C interaction, the expectation is the same. In our world today, it's nearly impossible to interact with an organization without using technology. When a person interacts with an organization via a digital medium, it can be termed digital experience. In this post, we'll be learning about what digital experience is and digital experience platforms, including some best practices and pitfalls of implementing
AppLogs - Custom Logs retrieval
Hi, I have a custom app which runs once a day and pushes loggin information out to a logfile called proces.log in the format $Datetime:date$ $Message$ I have setup a log profile for this alert and configured the server to send the log file to Site24x7 which it did... once. I have an alert setup to identify if a 500 error is returned by searching the log for 500, however when I attempted to test this by adding 500 to the log nothing happens - then when i go back to search the log it gives me a message
IT Automation - send an SNMP string to reboot a network device
I am needing an option to send an SNMP string to reboot a network device. Is this a possibility to add to the IT automation? Originally posted by ebduncan in this post
Configure notification tones for status alerts in your Site24x7 iOS or Android app
Hi there, Receiving an alert in the middle of something important, like while handling a critical IT task, can hamper your concentration and compel you to check the notification. Each time you get a notification related to a monitor’s status, you'll have to check the alert to understand the status change, whether the monitor is back in Up status or has gone to Down or Trouble. What if you could decipher the status change by setting specific notification tones for each status? We're happy to inform
Any way to change check frequency or notification profile based on business hours?
I have a ping monitor for our business's primary Internet connection. The check frequency is 1 minute and the notification is immediately, because I need to know RIGHT NOW if that connection goes down during business hours. But outside of normal business hours, it's not as critical. I'm tired of getting notifications when the connection goes down for 3 minutes at 2am on a Sunday. Is there some way to have a notification change notification profile or check frequency based on the time or some other
Next Page