How are you tracking reliability? "Reliability's not a binary thing. You're not reliable or unreliable, right? Like we always are chasing that next nine of availability, and we know it takes a lot, but that's not how the business sees it. Like the business is like, 'Hey, I'm paying money for something, and what is the return I'm getting on my investment?' If you're just tracking things like outage minutes, that's really a lagging indicator. It's a sign of where your reliability was previously. Because outages are things that happened in the past. Maybe you even resolved them, right? And so it's not something that's showing you how you're winning here. And so the real question that you know is belying this conversation is: How do we show—how do we communicate to the business how we're making improvements? How do we show that we're winning?" —Samuel Rossoff, Gremlin Principal Engineer
Gremlin
Software Development
San Jose, California 11,287 followers
The Reliability Management Platform for high-velocity engineering teams
About us
Gremlin’s Reliability Management Platform enables high-velocity engineering teams to standardize and automate reliability across their organizations without slowing down software delivery. Gremlin's Reliability Score sets the standard for reliability so there's no guesswork, and an automated suite of Reliability Management tools makes it easy to integrate reliability throughout the software lifecycle so there's no slowdown.
- Website
-
http://www.gremlin.com
External link for Gremlin
- Industry
- Software Development
- Company size
- 51-200 employees
- Headquarters
- San Jose, California
- Type
- Privately Held
- Founded
- 2016
- Specialties
- Distributed Systems, Resilience, Failures as a Service, DevOps, and Chaos Engineering
Locations
-
Primary
55 S Market St
Ste 1205
San Jose, California 95113, US
-
555 Montgomery St
Ste 811
San Francisco, California 94111, US
Employees at Gremlin
-
Josh Leslie
CEO of Gremlin | Making applications more reliable | GTM-focused investor & advisor
-
Dave Darwin
Helping Deliver Highly Resilient Digital Services
-
Jason Heller
Helping teams build more reliable systems
-
Stefano Pirovano
IT Sales leader with a special focus on Startups - 4x IPOs
Updates
-
⚡️ NEW: Want to see how Gremlin can help you improve reliability? Our brand new interactive product tours can show you how! See how easy it is to run a Chaos Engineering experiment or reliability test in just a few clicks using Gremlin: https://lnkd.in/givm_bcc
-
Join Andre Newman and Dylan S. for an upcoming webinar on November 7th: "Building Resilience from Architecture to Production with AWS Partners & Gremlin” What they’ll cover: 🚀 Common reliability pitfalls when building on AWS 🚀 How to design resiliency and reliability into your AWS architecture 🚀 How to test resilience throughout your SDLC to uncover issues before they cause outages. Register here: https://lnkd.in/gT4g538m
-
Thinking about integrating Gremlin into your existing pipeline? Look no farther than the Gremlin API. "The next step then was to build the right tooling such that the resiliency tests can be run from a pipeline. Gremlin's API first approach made it possible to do this in a very easy manner because everything that we could do from the UI and manually, we could replicate all of that through the API as well. Gremlin's APIs make it super easy to set up your attacks, scenarios, and experiments and invoke them at the right time." —Kaushal Dalvi, Sr. Principal Engineer, UKG
-
🚀 New at Gremlin: Self-guided product tours! Have you ever wanted to see how easy it is to run a Chaos Engineering experiment, onboard your AWS services, and get a reliability score using Gremlin? Learn how by taking our interactive, self-guided tours of the Gremlin web app—no registration required. https://lnkd.in/gH2JM4mX
-
Come here how Gremlin & AWS are working together to make the world more reliable!
How can you be sure your systems are resilient to failure when they’re based on complex architecture, built by hundreds of teams, and are being updated almost constantly? Join us on November 7 for our latest webinar, "Building Resilience from Architecture to Production with AWS Partners & Gremlin." What you’ll learn: ➡️ Common reliability pitfalls when building on AWS ➡️ How to design resiliency and reliability into your AWS architecture ➡️ How to test resilience throughout your SDLC to uncover issues before they cause outages. Register here: https://lnkd.in/gT4g538m
-
How can you be sure your systems are resilient to failure when they’re based on complex architecture, built by hundreds of teams, and are being updated almost constantly? Join us on November 7 for our latest webinar, "Building Resilience from Architecture to Production with AWS Partners & Gremlin." What you’ll learn: ➡️ Common reliability pitfalls when building on AWS ➡️ How to design resiliency and reliability into your AWS architecture ➡️ How to test resilience throughout your SDLC to uncover issues before they cause outages. Register here: https://lnkd.in/gT4g538m
-
You're going to spend time fixing reliability—but it's your choice whether it's during an outage or ahead of time on your schedule and for less costs. Which will you choose? "We all know when things go wrong, it cost us a million dollars and it was really bad. Let's have that never happen again. But when we say, I need every engineering team to spend one hour, one day a week on reliability, does everyone lose their mind, or is that a reasonable request? Can we amortize out the cost of that? And that's actually how I view it is we're amortizing the cost. We can spend one hour a week up front, or once a month, we could lose two days to dealing with it. That's a net loss in time. So I think giving teams the ability to plan this work, to do this work, to invest in it, and then as we discussed, you have to reward teams for the right behavior." —Kolton Andrus, Gremlin CTO
-
More and more large enterprises are asking for fully on-premise solutions. Because of this clear market signal, after seven years of running exclusively as a public SaaS, Gremlin will soon offer a fully private and self-hosted version of our platform to customers who prefer to run on-prem. Most customers cite security as the driver for this requirement. Customers are concerned about the security of their data, communication between our public SaaS and their infrastructure, and integrations between Gremlin and other vendor services. Personally, I don’t buy this reasoning. My view: most security teams have more internal power than IT operations teams. They have the power to slow down or shut down important projects. Operations teams have therefore concluded it’s easier to build or run a service than to seek approval from their security colleagues. What is the lesser evil: the marginal security risk or getting back into the business of managing on-premise software? (you probably know what I think) Anyways, we are doing it. #Customerchoice.