Site Reliability Engineer – ON-SITE

KBC Technologies Group
  • Post Date: January 3, 2025
Job Overview

  • Role: Site Reliability Engineer
  • Type: B2B Contract
  • Location: Warsaw (hybrid, 2 days a week on-site)
  • Experience required: 3 to 5 years in DevOps
Description

The prime responsibility of a Site Reliability Engineer is to make sure that the environment is secure and safe. All security findings should be remediated within the resolution date defined by governance.

We do not tolerate outages, even for a second. If an issue occurs, as owners of the environment we do whatever is needed to keep those environments up and running. Root cause analysis should be completed within hours. Findings are remediated in the Production environment only after all tests and checks have passed in lower environments.

As owners of the environment, we keep track of all activities planned or happening in our environments. We are responsible for deploying new code to the environment.

We review and analyze our environment regularly. If a task is manual, we automate it. We are increasing our self-healing capabilities and will continue to do so until the environments heal automatically.

If a new service is coming under our support, or an old environment is being migrated to new technologies, we engage with the product developers to plan the move to production.

As our business runs around the clock, we work in shifts and synchronize with multiple locations and multiple tracks (sub-teams).

We make sure that every activity is recorded according to the incident or change management process. Technical and related runbooks need to be prepared and shared with the team.
Responsibility

  • Expertise in troubleshooting applications across different middleware stacks such as Tomcat, Apache, Kafka, and MQ, and streaming services such as Flink and Spark.
  • Able to query Big Data systems such as Hadoop for reporting and alerting.
  • Ability to build deployment scripts, build scripts, and automated solutions using scripting languages such as Shell (Bash), JavaScript, Python, or others.
  • Good understanding of monitoring solutions such as Prometheus, Splunk, and Grafana.
  • Good experience with applications written in Go and Rust, and troubleshooting skills covering different system integration issues.
Mandatory Skills

  • Kubernetes, Linux Administration, Network Troubleshooting.
  • Good understanding of DevOps tools such as Jenkins, Ansible, and ArgoCD.

Job Detail
Never pay anyone for a job application, test, or interview.