Splunk ITSI - Getting to the heart of the problem...

Somerford Blog

Its 3 o’clock in the morning and you are unceremoniously awoken by your smartphone. That beautiful dream you were just having dissolves into a stark reality – you’re being beckoned onto a priority 1 emergency incident call. Critical business services have been down for the past half-hour, nobody seems to know why, and customers are ringing the phones off the hook. Your boss has just sent you a very curt text demanding a status update in the next 15 minutes. That’s the third major incident this week!

It’s the start of yet another long and tortuous day, much of which you’ll spend gathering incident updates from the various IT teams. This wouldn’t be so bad if they all spoke the same language, but each team seems to have its own technobabble, and each points the blame at someone else for the issue. Identifying root-cause will be a nightmare.

Symptoms of the issue sound familiar. They are just like an outage you had last Tuesday. You wish you could remember how you’d resolved that one. You recall being told it was something to do with “… an application lock-up due to a database table-space issue because a caching error resulting from a lack of fault tolerant infrastructure when a disk-drive failed” or something like that – phew!

You Might Also Like:

Someone somewhere must surely be monitoring this stuff – so why aren’t they taking action before these issues occur? You also remember the flood of customers complaining that their services had just stopped without warning – no error messages – just a hung screen. Oh, and the service desk going crazy, with over 400 calls queued and 700 abandoned at one point. You just hope your customers will be a little more understanding than last time. Your boss certainly won’t be!

Is this a typical day at the office for your business operations?

Are you struggling to meet your service KPIs due to operational underperformance?

Is technical complexity and lack of visibility into the moving parts of your organisation causing real service performance issues?

In our experience at Somerford, many organisations find this is an all too familiar picture, particularly for those whose businesses who have significant service delivery and operational commitments. Many have already invested in event monitoring tools, but often discover these are point-solutions which will only help identify localised failures. With the ever growing complexity in the technology ‘stack’ used for service operations, it’s difficult to get a holistic picture of the full delivery platform. Keeping track of all the interdependencies between the moving parts within the organisation can be a nightmare, and identifying the root-cause of issues can be complex and time-consuming.

How can Splunk ITSI help?

Splunk IT Service Intelligence (ITSI) is a monitoring and analytics tool for Service and IT Operations. It enhances business effectiveness by improving the efficiency, reliability and cost-effectiveness of key services. Powered by machine learning, it provides visibility into the health state and performance of an organisation’s critical business services, and the underlying IT infrastructure upon which these depend. Splunk ITSI provides a core platform for service operations yielding a number of key benefits:

It provides real-time insights to understand how key services are performing
It uses advanced analytics to identify patterns, anomalies and trends,
It simplifies service monitoring and analytics, allowing faster, more informed decision making
It supports in-depth analysis of service issues and helps identify root-cause and reduce resolution times
It helps automate operations, saving effort on laborious, repetitive operational tasks
It allows organisations to pre-empt and avoid service failure

Where Splunk ITSI really wins out is through its use of Predictive Analytics and Machine Learning capabilities. By looking at patterns of behaviour over time and based on past experience, it can alert and respond to conditions that have previously led to adverse events or service failures. These ‘actionable events’ can be used as a catalyst for deeper investigation and preventative action.

Over time, operations teams can move from a reactive to a predictive state whereby adverse events can be detected and corrected before they have the opportunity to affect live services. Importantly, this saves the cost of incident management and the associated business disruption. Time is returned to the business by avoiding a service failure, and the customer benefits from service continuity, while you maintain your business reputation.

Using the predictive capabilities of Splunk, customers are claiming some significant operational benefits such as:

70-90% reduction in incident investigation time
30-45% reduction in outages
Ability to predict imminent outages 30-45 minutes in advance
Reduced alert noise by 90+%

Cookie	Duration	Description
ADRUM_BT1	past	This cookie is used to optimize the visitor experience on the website by detecting errors on the website and share the information to support staff.
ADRUM_BTa	past	This cookie is used to optimize the visitor experience on the website by detecting errors on the website and share the information to support staff.

Cookie	Duration	Description
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_gat_gtag_UA_1170872_23	1 minute	Set by Google to distinguish users.
_gat_gtag_UA_99925054_1	1 minute	Set by Google to distinguish users.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
CONSENT	16 years 2 months 24 days 11 hours 26 minutes	YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.
vuid	2 years	Vimeo installs this cookie to collect tracking information by setting a unique ID to embed videos to the website.

Cookie	Duration	Description
IDE	1 year 24 days	Google DoubleClick IDE cookies are used to store information about how the user uses the website to present them with relevant ads and according to the user profile.
test_cookie	15 minutes	The test_cookie is set by doubleclick.net and is used to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt.innertube::nextId	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.

Cookie	Duration	Description
cookie-test	past	No description
cookielawinfo-checkbox-functional	1 year	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
guest	1 month 1 hour	No description available.
jcm	past	No description
jcmc	past	No description
JOTFORM_SESSION	1 month	No description available.
SameSite	past	No description available.
theme	1 month 1 hour	No description available.
userReferer	1 month 1 hour	No description available.

Somerford Blog

ITSI - Getting to the Heart of the Problem

You Might Also Like:

Is this a typical day at the office for your business operations?

Are you struggling to meet your service KPIs due to operational underperformance?

Is technical complexity and lack of visibility into the moving parts of your organisation causing real service performance issues?

How can Splunk ITSI help?

More Resources like this one:

Somerford's Added Value Explained
Partner & Customer Testimonials |
Business Value Panel Discussion

What is Splunk ITSI?
Splunk's Solution for ITOps Explained:
Demonstration & Introduction

Get in Touch

Somerford Blog

ITSI - Getting to the Heart of the Problem

You Might Also Like:

Is this a typical day at the office for your business operations?

Are you struggling to meet your service KPIs due to operational underperformance?

Is technical complexity and lack of visibility into the moving parts of your organisation causing real service performance issues?

How can Splunk ITSI help?

More Resources like this one:

Somerford's Added Value Explained Partner & Customer Testimonials | Business Value Panel Discussion

What is Splunk ITSI?Splunk's Solution for ITOps Explained:Demonstration & Introduction

Get in Touch

Somerford's Added Value Explained
Partner & Customer Testimonials |
Business Value Panel Discussion

What is Splunk ITSI?
Splunk's Solution for ITOps Explained:
Demonstration & Introduction