[New feature] Investigate graph spikes using APM Insight
At Site24x7, we have always developed~ tools which allow DevOps in getting to the root cause of an issue in the most efficient way. To increase this efficiency further, we have introduced a new feature in our application monitoring tool APM Insight called graph spike. Introducing graph spike Graph spike is a feature that enhances application monitoring capabilities to the next level. The feature~allows users to select a time period during which they had experienced application performance lag and
Windows Agent Seems to un-register it self
I have a few windows servers that seem to have an issue with the agent. It appears (from looking at the logs, included below) that maybe an update failed? The result is that the software is still installed but the agent service it self is gone. I can see it down in the task bar but it is not running and there is no way to start it as the service has been removed. STARTING ReadRegSZ.........Registry Key is IsAgentUpgradeInProgress :: RET VAL0 ReadRegSZ.........Registry Key is ServerProtocol :: RET
Monitor the performance of your Riak server
Configure Riak plugin to monitor the performance metrics of your Riak server. Use these key indicators to ensure continuous functioning of your Riak data store. Take informed troubleshooting decisions by keeping track of critical metrics including: Total allocated memory Total amount of memory allocated for Erlang processes Number of active protocol/protocol buffers connections Total amount of memory currently allocated/used for atom storage, Erlang code and Erlang Term Storage Number of active GET
Site24x7's all new revamped Status Page—Achieve Powerful, Accurate and Transparent Status Reporting
It’s true that business transparency goes a long way in helping you build trust with your customers and colleagues. Thus, it’s always important to keep your customers in the loop by offering them a timely, accurate and transparent communication channel, which would then facilitate prompt sharing of your services' status, related issues and resolutions with your customers. Since we understand this from the ground up, we’ve reworked on our current status page to bring in some cool enhancements to it.
Revamped Site24x7 Status Page - Boosting business transparency
With Site24x7 Status Page, our motto has always been to communicate downtime in an open and transparent way. However, we also understand the ever increasing importance of a Status Page in fostering a stronger relationship with customers. Thus, we've imbibed our learnings from the past and applied it to redesign our Status Page to give it an all new look and feel. To top it all, we've introduced some cool new enhancements too. Let's discuss a scenario to help you understand about the various feature
Alert Escalations
When adding a notification profile to a monitor, do the selected user groups get notified first, and then if the issue persists past the configured period it gets escalated to the specified group? I am not clear if a notification profile overrides the user alerts groups selected?
Monitor the performance of your CouchDB database servers
Configure CouchDB plugin to monitor the performance metrics of the open source database, CouchDB. Use these key indicators to ensure continuous functioning of your CouchDB database. Take informed troubleshooting decisions by keeping track of critical metrics including: Number of authentication cache hits and misses Number of open databases Number of bulk requests Number of HTTP requests Number of times a document was read from a database Number of times a database was changed Number of view reads
Monitor the performance of your RabbitMQ servers
Configure RabbitMQ plugin to monitor the performance metrics of your RabbitMQ servers. Use these key indicators to ensure continuous functioning of your RabbitMQ server. Take informed troubleshooting decisions by keeping track of critical metrics including: Memory used by the server Number of messages ready to be delivered to clients Number of consumers Total number of messages in the queue Used file descriptors Average number of Erlang processes waiting to run shown as process Number of file descriptors
Monitor the performance of your Redis database servers
Configure Redis plugin to monitor the performance metrics of your Redis databases. Identify and resolve issues with Redis-based apps before end users are affected. Take informed troubleshooting decisions by keeping track of critical metrics including: Used Memory Used Peak Memory Used System CPU Keyspace Hits and Misses Total Connections Received and Rejected Connected Clients Connected Slaves System CPU consumed by the background processes User CPU consumed by the background processes Learn how
Introducing the all-new Site24x7 Alarms View - Incident management made easy
At Site24x7, we've embarked on a journey to provide you the most flexible monitoring and alerting solution for your entire IT stack. Thus, we understand the importance of delivering actionable alerts that would inform and empower you to assess the severity and the urgency of the issue at hand. Today, we're glad to introduce you to one of our coolest new features: Alarms View! With this new feature, you can gain better control over your alerts and effectively trim down redundant noises that cause
Question about resource checks
I'm trying to setup a resource check on some of our server monitors. What I want to do in particular is check that a file exists (alert if it doesn't exist) -- this file would be c:\inetput\wwwroot\loadbalancer.html. I would like also like to check the contents of the file for "OK" (alert if OK is not in file). I have tried a Content Check and that check doesn't see to be affected if the actual file doesn't exist. For example, I renamed the file on the server and the check continues to be displayed
Monitor description field.
It would be really useful if we could have a notes/comments/description field associated to each monitor. Is this something that you are still not considering as a feature? Best, Rafael
Web Client Auto Refresh
Hi, I notice that Monitors, Status Page and Operations Dashboard pages do not auto refresh. Is this expected behavior? I am running the client in the latest Chrome browser. Regards, Ole Petter
Acknowledge issues in bulk
It would be highly beneficial to have a bulk acknowledge feature. This would be useful in instances of multiple site/service outage and you want to suppress notifications. One should be able to bulk acknowledge outages, and acknowledging should suppress notifications for some period of time. This should also take into account escalation rules; I believe that if problems have been acknowledged then the escalation notification should not happen either, or it should be configurable on how escalation
Notify users via messaging API
Hi, We are using a messaging API for alerting from our in-house systems, and would like to use this for Site24x7 notifications as well if possible. We send the API the following XML: <soapenv:Envelope xmlns:soapenv=\\"http://schemas.xmlsoap.org/soap/envelope/\\" xmlns:api=\\"http://api.messagenet.com.au/\\"> <soapenv:Header/> <soapenv:Body> <api:LodgeSMSMessage> <api:Username>##########</api:Username> <api:Pwd>##########</api:Pwd> <api:PhoneNumber>${recipient}</api:PhoneNumber> <api:PhoneMessage>${message}</api:PhoneMessage>
Templates for monitors, sorting, bulk moving between groups, filtering and reporting, private dashboards, auto refresh
Hello, We are using Site24x7 for monitoring of 100+ web urls and a bunch of servers, in total we have 230+ monitors. I'm not sure if someone already asked you, but Site24x7 should have better support for management of high amount of monitors I've being using Zabbix for many years, now we are using Site24x7. Site24x7 is a nice product which has a lot of preconfigured checks and multiple locations, but in comparison to Zabbix (and actually to Nagios) it is missing quite a few important features which
support for
You will have support for cloud services as azure? other databases such as Informix ibm? IBM AS400 platforms? HP operating systems like UNIX? storage monitoring as HP 3PAR? monitoring security platforms as PALO ALTO and FORTINET?
Never Give Up: Redesigning APM Insight .NET agent
Prologue I am part of the APM Insight team a.k.a. application scientists (that's how we're referred within the team, and we pride over it) focusing on the .NET side~of the agent. For the one's who are new here, APM Insight is the application performance monitoring tool from Site24x7 that provides end-to-end visibility into the way web transactions behave. We support monitoring of Java, .NET, PHP and Ruby web transactions. Is this blog for everyone? My answer would be NO? It's~a technical blog and
password invalid with my own domain name email
hi there!! Im trying to create new email account from my owndomain name and obsolly i just made it! but every single email account what i created the password is not working! could somobody you pleasee give a hand with this?
Custom WMI Monitors
Hi All! How can i create a custom WMI monitor (site 24x7 plugin right?) to monitor one of the several WMI options that a Windows Server has?, can you give me an example for the average disk queue length of a hard diks? https://www.site24x7.com/help/admin/adding-a-monitor/plugins/custom-plugins.html I just cant make it work. Thanks!
Sort Threshold and Availability Profiles
It be nice if the list of Threshold and Availability Profiles under Admin-->Configuration Profiles-->Threshold and Availability was sorted alphanumerically. It would also be nice when creating a new monitor that the Threshold and Availability drop-down was sorted alphanumerically. Unless I'm missing something, those two areas do not currently appear to be sorted.
On hover, show what metric is being shown in the "Performance" column.
On the "Monitor Status" interface, we have the Monitor Name along with the Performance being shown. The problem is, it isn't overly clear what metric the Performance column is showing for a given metric. My proposal: show the name of the metric being displayed when hovering over the performance value. For example: Hovering over "15%" would show "CPU Usage" in a tooltip.
Question about "Auto upgrade agent when new version available"
I have "Auto upgrade agent when new version available" in the admin settings set to Yes. Can someone explain what that actually does? What action is required by me as the admin when a new windows server agent is released?
Notifications in the public status page
Sometimes we need to post a notification/message to our customers in our status page. It wouldn't be related to an outage or maintenance. Could be something like "Critical bug found in v3.4, upgrade to v3.5 as soon as possible". Statuspage.io allows for this but overall site24x7 works better for us at the moment so I don't really want to switch. -Russ
Advanced Graphs
When monitoring a site it would be awesome to be able to see the average response and as well as load time for the page; and in addition be able to see the difference between each location in the graph. Having advanced graphs would be an awesome improvement to track site performance: 1.) On the existing graph, keep average response time as is but add a line graph for each location to show how they compare and make the graph zoomable but show one hour by default. In Monitor.us they show you response
Threshold and availablity settings hierarchy
I'm in the middle of trying out Site24X7 with a trial account and I had a question about how the Threshold and Availability settings work. Specifically, I'm wondering if there's any hierarchy to which ones take precedence. For example, I have a server monitor ("Server Monitor") that is configured to use a Threshold and Available profile called "Web Server". That threshold profile has "Notify when process is down" set to Down. Now, I can go to that particular server monitor and setup a Windows service
Resource Check failures should be configurable -- either Trouble *or* Down
Would it be possible to allow us to configure Resource Check failures so that they produce either Trouble *or* Down – depending on the choice of the user? At the moment, you only support Trouble for Resource Check failures. In our case, we have set up a Resource Check that is very critical to the server monitor – if the resource check fails, we definitely want the server monitor to indicate Down (not just Trouble). Thanks.
More detailed usage info
I asked about this via email but thought I'd also put it here so others can see it and note if it would also be useful for them. It would be nice to have a report that shows more detailed (last 12 months) RUM page hits so customers can track their usage should they need to plan for expansion or just tracking to see which check is using how many hits.
Sub-Grouping
Was thinking that being able to create sub-groups would be an awesome thing. I read another user's idea of creating dependencies. This could play into that as well and it will help with organization as well. For example, you can have a data center and then break it up into sub groups or you could have a group that classifies a service and then sub groups to the different things that make up that service. Example: sample.com Master Group --+--------------- + Databases Sub Group (db001, db002, db003,
BigPanda Integration
We were looking for Alert co-relation and found BigPanda and thought it would be great to have integration with this tool. https://bigpanda.io/
False alerts
In the past week we have been receiving false alerts intermittently on two different locations for our linux servers We had to disable the alerts The message we get is: Agent service node_x could not establish communication with the Server. Please check if there is a problem with the Network Communication. This could also happen if the Agent Service or the host itself is down. Is this a bug as when you check the server and services there are up and running for months It cant be a network issue either
API for bulk monitoring
HI , I am looking for API to import bulk monitoring .or can I use the same create monitor API for bulk monitoring .Basically I want to create multiple monitoring at the same time.
Disk I/O load monitoring
In the server monitoring we can monitor the load and the cpu usage. But when the the load or CPU usage are high, it is not always clear what the culprit is in case of high disk I/O. To be able to quickly see that the disk I/O is the problem I would like to see the different categories of CPU load Linux has (user, system, nice, iowait etc) stacked in the CPU graph. This way when I/O is the problem, the iowait percentage is high. For even better visibility of disk I/O load problems it would be great
Site24x7 Rolls Out PHP Monitoring
As one of the success stories of the open source initiative, PHP, in recent years, has carved a niche for itself with more and more developers moving toward the open source platform. This increase in usage means that there needs to be effective monitoring and reporting of issues in applications running on PHP. Read more about Site24x7~PHP~Monitoring. With Site24x7 PHP monitoring, DevOps teams get deep visibility into the performance of applications and can trace application threads to pinpoint the
Meet us at Jenkins World 2016
Site24x7 is exhibiting at booth no. K18 at Jenkins World 2016, the annual gathering of DevOps practitioners using open source Jenkins. The conference held at the Santa Clara Convention Center is the largest gathering of Jenkins users in the world and is a multi-day event comprised of sessions, workshops, training and other learning opportunities.~ [caption id="attachment_3610" align="alignnone" width="800"] Meet us at Jenkins World 2016, Booth No. K18[/caption] This is a great opportunity for you
Capturing end user IP address in RUM
Would it be possible to capture the end users IP address in RUM?
Monitor JavaScript errors using Site24x7
JavaScript errors can be critical and can affect the end-user experience. It's important to analyze for JavaScript error before end users are affected. Monitoring JavaScript errors helps ensure that users experience seamless web application performance.
Pause Rum Monitors
I want the ability to pause page reads for a RUM script..... pulling code from production servers is not easy. I would be nice to pause from the client side.
Extract response body and use it in following requests
Hello, What I am trying to achieve is retrieving the response body and use a variable to store this token. The goal is to use this token in the following request header for authorization. My response is not XML nor JSON but Mime Type: application/octet-stream. Is it possible to use an extractor or should I find an other way to do it? Best Regards,
status available for uptime in site24x7
Is there any possibility that I can see uptime of a particular website for a period of time.
Next Page