Location: Hybrid (Alpharetta, GA – 3 days/week in office)
Type: Full-Time
We are seeking a Site Reliability Engineer to join our team and play a key role in enhancing the stability, performance, and reliability of our production systems. You’ll work closely with development, DevOps, and security teams to improve observability, optimize system performance, and ensure production readiness. From monitoring to automation, you’ll make a direct impact on our cloud infrastructure and service reliability.
In this role, you will work hand-in-hand with our development, operations, and security teams worldwide to implement best practices, automate deployments, and ensure our platforms are reliable, secure, and scalable. Troubleshooting in Kubernetes requires a deep understanding of pods, nodes, networking, scaling, logs, and service-to-service communication.
This role requires a deep understanding of SRE best practices and a strong ability to troubleshoot complex issues.
Your responsibilities in this role will include:
Maintain and enhance monitoring tools (New Relic, Graylog) for service health and performance metrics.
Implement and maintain high-availability systems with capacity planning, performance optimization, and fault tolerance.
Define and monitor Service Level Indicators, Objectives, and Agreements with teams.
Deploy and manage Kubernetes workloads to AWS EKS(A) using Helm and ArgoCD.
Automate operational processes to reduce manual interventions.
Manage Kubernetes workloads on AWS EKS for secure and stable deployments.
Participate in on-call rotation, troubleshoot production issues, and implement permanent fixes.
Work with DevOps to improve CI/CD pipelines and with development teams to embed resilience and observability.
Document operational runbooks, escalation procedures, and production playbooks.
We are looking for you to have the following skills and experience:
Nice to Have
This is a full-time role and we are unable to sponsor visas, so you must be a U.S. citizen or a Green Card holder. We work onsite a few days each week in our Alpharetta offices, so you must live in the Atlanta area, within commuting distance of our office. If you thrive on solving complex technical challenges, have a passion for automation, and want to influence how enterprise platforms evolve and modernize, this is an ideal opportunity for you.
Ready to take the next step in your SRE career? Apply now and help us build the future of reliable systems!