Site Reliability Engineer (SRE) Job at Kanshe Infotech, Alpharetta, GA

S2dGQ2RFR1loV0RKZ2NJOXdhTk9MNGhLU2c9PQ==
  • Kanshe Infotech
  • Alpharetta, GA

Job Description

Job Title: Site Reliability Engineer (SRE)

Location: Alpharetta, GA- Only Local

Job Description:

We are looking for an experienced Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a strong background in DevOps, cloud infrastructure, automation, monitoring, and system reliability . You will be responsible for ensuring high availability, scalability, and performance of production systems while driving operational excellence through automation.

Key Responsibilities:

  • Design, build, and maintain scalable and reliable infrastructure on AWS / Azure / GCP .

  • Develop automation for deployment, monitoring, and incident response.

  • Implement CI/CD pipelines using tools like Jenkins, GitHub Actions, or GitLab CI.

  • Monitor system performance and ensure uptime, latency, and capacity optimization .

  • Build and maintain infrastructure as code using Terraform, Ansible, or CloudFormation.

  • Collaborate with development teams to improve system reliability and deployment processes.

  • Implement robust monitoring, alerting, and logging using Prometheus, Grafana, ELK, or Datadog.

  • Participate in on-call rotations , incident response, and root cause analysis.

Required Skills:

  • 10+ years of experience as an SRE, DevOps, or Cloud Engineer .

  • Hands-on experience with AWS, Azure, or GCP .

  • Strong scripting skills in Python, Bash, or Go .

  • Proficient with Docker, Kubernetes, Helm .

  • Experience with Terraform, Ansible, or other IaC tools .

  • Expertise in monitoring & observability tools (Prometheus, Grafana, Splunk, ELK, Datadog).

  • Solid understanding of Linux system administration and networking concepts.

  • Strong troubleshooting and problem-solving skills.

Preferred Skills:

  • Experience with microservices and service mesh (Istio/Linkerd) .

  • Familiarity with security best practices and incident management .

  • Experience in performance tuning and capacity planning .

  • Exposure to SLA/SLO/SLI management and reliability metrics

Education:

  • Bachelor's or Master's degree in Computer Science, Information Technology, or related field.

Job Tags

Local area,

Similar Jobs

Delphi Healthcare, PLLC

Emergency Medicine Physician Job at Delphi Healthcare, PLLC

 ...Great Emergency Medicine Opportunity... Delphi Healthcare is well experienced in emergency medicine. Our staff has been providing excellent high-quality care to our patient, long term career satisfaction for our physicians and cost-effective Emergency Department staffing... 

Promoveo Health

Pharmaceutical Sales Representative -Flex Time (12 days/mo) - GI Job at Promoveo Health

 ...Pharmaceutical Sales Representative GI - Flex Time (12 days/mo) Promoveo Health, a leading Pharmaceutical Sales recruiting, and contract sales company has an outstanding position representing one of our strategic clients. Our client is a rapidly growing organization... 

PrincePerelson and Associates

Bilingual Korean - Customer Experience Specialist (Remote- UT ONLY) Job at PrincePerelson and Associates

 ...Bilingual Korean Customer Experience Specialist (Remote- UT ONLY) Compensation: $21$24 per hour DOE Work from Home - UTAH RESIDENTS ONLY Must live within an hour's drive of Lehi Schedule: Full-Time or Part-Time | MondayFriday, 9:00 a.m.5:00 p.m. (occasional... 

MatchBukh Talent Solutions

Physician Assistant Job at MatchBukh Talent Solutions

Licensed Physician Assistant (Pain Management) Northern California Redding region Confidential Healthcare Organization $14...  ...of service, known for providing exceptional pain management care across Northern California. Our integrated care model blends... 

Glidewell Dental

Sr. Network Engineer Job at Glidewell Dental

 ...architects, designs, configures, installs, and manages enterprise network and network equipment. Maintains personal knowledge of...  ...and other WAN circuits and equipment. Mentors other network engineers who are junior to this position. Ensures network meets compliance...