Site Reliability Engineer (SRE) Job at IT America Inc, Plano, TX

ZXBZOTIxWnRhTWlYR0lJSkRJeHVabG5J
  • IT America Inc
  • Plano, TX

Job Description

Position: Site Reliability Engineer (SRE)

Location: Richmond, VA or Plano, TX

Work Model: Hybrid 3 days onsite per week

Duration: Long term contract

Job Summary:

We are seeking an experienced Site Reliability Engineer (SRE) to support cloud-native platforms and production systems for a large enterprise environment. This role will focus on ensuring high availability, reliability, performance, and scalability of mission-critical applications running on AWS.

Strong Preference: Former Capital One engineers. Candidates must be able to provide verifiable Capital One credentials and be eligible for rehire.

Key Responsibilities:

  • Design, build, and maintain highly reliable, scalable, and resilient systems in AWS
  • Monitor system health, performance, and availability using SRE best practices
  • Implement automation to reduce manual operational work
  • Troubleshoot production incidents and perform root cause analysis (RCA)
  • Develop and maintain scripts and tools to improve system reliability and efficiency
  • Partner with application development, platform, and infrastructure teams
  • Support on-call rotations and incident response as required
  • Enforce operational excellence, security, and compliance standards

Required Skills & Qualifications:

  • Former Capital One experience HIGHLY preferred
  • Must provide credentials for rehire eligibility verification
  • Strong hands-on experience with AWS (EC2, EKS, Lambda, CloudWatch, IAM, etc.)
  • Python scripting experience strongly preferred
  • Bash or Shell scripting experience will also be considered
  • Experience with Linux-based systems and troubleshooting
  • Understanding of SRE concepts: SLIs, SLOs, error budgets, monitoring, and alerting
  • Experience supporting production environments at scale

Preferred Qualifications:

  • Experience with CI/CD pipelines
  • Infrastructure as Code (Terraform, CloudFormation)
  • Containerization and orchestration (Docker, Kubernetes)
  • Observability tools (Prometheus, Grafana, Datadog, CloudWatch)
  • Experience working in highly regulated enterprise environments

Job Tags

Long term contract, 3 days per week,

Similar Jobs

Mayo Clinic

Histology Technician (HT) or Histotechnologist (HTL) - Anatomic Pathology Core Job at Mayo Clinic

 ...advancement opportunities at every turn, you can build a long, successful career with Mayo Clinic. Responsibilities The Histology Laboratory processes over 400,000 paraffin-embedded blocks and 1,500,00 slides per year. Specimens handled in this laboratory include... 

Intercontinental Exchange Holdings, Inc.

Summer Internship Program 2026 - C++ Developer Intern Job at Intercontinental Exchange Holdings, Inc.

Overview: Job Purpose The ICE Internship Program offers a dynamic opportunity to combine...  ..., and mortgage industries. The C++ Developer Intern will work on Curve Engine...  ...projects as assigned Knowledge and Experience Must be currently enrolled in a Master... 

UTMB Health

Patient Care Technician II (PCT) - Med/Surg - CLC (Days) Job at UTMB Health

Minimum Qualifications: High school or equivalent, one year of hospital experience AND completion of one of the following technical programs: Completion of a recognized Nurse Assistant program OR Completion of Medical Corpsman program OR Completion of EMT program OR Is... 

Final Clean

Cleaning/Cleaner Job at Final Clean

 ...We Clean new homes after construction for Home Builders. Immediate Openings. Looking for a few Good Employees, that are detail orientated. This job will entail cleaning of new homes before the home owner moves in. All employees arrive at the office and company... 

ICF

Software developer Job at ICF

 ...Description Were currently hiring a Software Developer Intern to join our team remotely in...  ...is an entry-level, 10-week, full-time internship expected to begin in June and end in August...  ...technical specifications Experience with hands-on development, including an...