27Global introduced Site Reliability Engineering (SRE) as a key business pillar, and leverages this capability for its internal DevOps teams and for service offering to customers. The company traditionally used a mix of tools, including Grafana, Graylog, and Zabbix, and the SRE team needed more consistent, consolidated observability across multiple development pipelines on-premises and in the cloud.
27Global has an offshore team of engineers in Vietnam in addition to its U.S. team, and found it difficult to communicate complex operational issues, such as performance problems, between the two teams. The SRE team lacked cohesive metrics as evidence for existence of performance issues. Assembling operational data—such as events, logs and traces, to build end-of-day dashboards—for the teams took 45 minutes each night.
With technology changing at a rapid pace, 27Global’s SRE team needed a solution to provide accurate, consolidated, and easy-to-share measurement data to improve DevOps efficiency and deliver value to SRE customers.