Shell/Bash Script Validation Engineer Job at Openkyber, Georgia

ZHBjMDMxTmhaTXFhRklRT0FJWm9iVkRIMlE9PQ==
  • Openkyber
  • Georgia

Job Description

# SRE Lead & Monitoring Consultant

## Key Responsibilities

SRE Practice Development

Assess operational maturity and build SRE transformation roadmap
Establish SLOs, SLIs, and error budgets for critical services
Design incident management processes and on-call strategies
Implement chaos engineering and resilience testing
Mentor teams on SRE principles and best practices

Monitoring & Observability

Deploy and configure Datadog, Splunk, Grafana, and Prometheus
Implement metrics collection, log aggregation, and APM
Build custom dashboards and alerting configurations
Set up anomaly detection and intelligent alerting
Configure automated health checks and remediation
Establish golden signals monitoring (latency, traffic, errors, saturation)

Reliability & Compliance

Conduct reliability reviews and performance optimization
Design disaster recovery and failover procedures
Implement security monitoring and audit logging
Configure fraud detection and transaction monitoring
Create runbooks and operational documentation

## Required Qualifications

Experience

7+ years in Site Reliability Engineering, DevOps, or infrastructure engineering
3+ years in SRE leadership roles
3+ years hands-on experience with Datadog, Splunk, Grafana, and Prometheus
Previous experience in fintech or regulated industries
Proven track record building SRE practices from scratch

Technical Skills

Deep understanding of SRE principles, error budgets, and SLO/SLI frameworks
Expertise with cloud platforms (AWS, Azure, or Google Cloud Platform)
Proficiency with Kubernetes, Docker, and infrastructure as code (Terraform, Ansible)
Strong programming/scripting skills (Python, Go, Bash)
Experience with incident management and post-mortem culture
Knowledge of compliance requirements (SOC 2, PCI-DSS, ISO 27001)

Soft Skills

Exceptional leadership and mentoring abilities
Strong communication and stakeholder management
Data-driven decision-making approach
Collaborative mindset with ability to drive cultural change

## Preferred Qualifications

Cloud certifications (AWS, Google Cloud Platform, Azure) or Kubernetes certifications (CKA/CKAD)
Experience with ELK stack
Background in cloud cost optimization
Multi-cloud or hybrid cloud experience

## Deliverables

SRE maturity assessment and transformation roadmap
Fully configured monitoring stack with Datadog, Splunk, Grafana, and Prometheus
SLO/SLI definitions and error budgets
Custom dashboards, alerting, and automated remediation
Incident management framework and runbooks
Chaos engineering test suite

Job Tags

Similar Jobs

Delta Dental Of Idaho

Dental Underwriter Job at Delta Dental Of Idaho

 ...committed to better oral health for all Idahoans. TheDental Underwriterat Delta Dental of Idahois responsible forexecuting a wide...  ...reasoning/decision-makingand communicationappropriate toposition/level Benefits Delta Dental of Idahooffers a competitive... 

Local Big Brothers

Experienced Roofer & Siding Installer Job at Local Big Brothers

 ...home repair. We specialize in roofing, siding, door and window installation, and comprehensive handyman services. What sets us apart? We...  ...roofing systems (shingles, Rubber, TPO, underlayment, flashing, gutters) Install vinyl, fiber cement, and wood siding on residential... 

Intercontinental Exchange Holdings, Inc.

Summer Internship Program 2026 - Systems Analyst, Release Engineering Intern Job at Intercontinental Exchange Holdings, Inc.

Overview: Job Purpose The ICE Internship Program offers a dynamic opportunity to combine...  ...projects as assigned Knowledge and Experience Must be currently enrolled in a...  ...year of study Familiarity with common Software Development Life Cycle (SDLC) tools... 

Yale New Haven Health

Lactation Consultant Registered Nurse Job at Yale New Haven Health

 ...patient-centered, respect, accountability, and compassion - must guide what we do, as individuals and professionals, every day. The Lactation Coordinator is a RN certified in Lactation and is a role model in patient education and support. Working collaboratively with the... 

University System of New Hampshire

Enrollment Events and Marketing Coordinator Job at University System of New Hampshire

Enrollment Events and Marketing Coordinator Location Plymouth, NH : Summary Operating Title Enrollment Events and Marketing Coordinator...  ...Events and Marketing Coordinator will focus on the creation, design, and management of all oncampus recruitment events. The ideal candidate...