Job Summary
Our SRE team builds and operates reliable cloud infrastructure and equips the product teams with tools and processes to deliver features as fast as possible. We are looking for a Senior Site Reliability Engineer to help streamline and accelerate cloud infrastructure operations at Raisin.
- Minimum Qualification: Degree
- Experience Level: Entry level
- Experience Length: 2 years
Job Description/Requirements
Responsibilities
- Design, build and operate Kubernetes-based and serverless platforms at large scale on AWS infrastructure.
- Hold yourself and others to a high bar when working with production.
- Debug production issues across services and different levels of the stack.
- Build a great customer experience for engineers that use your infrastructure.
- Continuously re-evaluate and optimize infrastructure decisions to enhance performance and reduce costs.
- Collaborate with multiple teams, within and outside the organization, to build solutions for Raisin.
- Establish processes and workflows for infrastructure operations.
- Invest in automation and build tooling around infrastructure operations.
- Set best practices for the tech organization, advocate for them, and lead by example.
- Set standards and deliver high-quality code.
- Help grow and mentor peers within the team and organization.
- Participate in on-call rotations.
Requirements
- University degree in Computer Science, Engineering, Information Systems or equivalent professional experience.
- You have strong hands-on experience in provisioning infrastructure with Terraform and automation around it.
- You have deep knowledge of a wide range of AWS services and networking in cloud environments.
- You have deep knowledge of Kubernetes and Docker.
- You follow a metrics-driven approach and can make informed decisions using data.
- You have expertise in writing good-quality code in a scripting language (Python and Bash preferred).
- You are passionate about writing and advocating for high-quality code.
- You write high-quality technical proposals and documentation.
- You have experience with monitoring and logging tools like Prometheus, New Relic, Grafana, Splunk, Datadog and AWS CloudWatch.
- You have experience in designing and building automated pipelines with CI tools like GitLab.
- You are familiar with Agile methodologies like Kanban and Scrum.
- You have expertise in working with Linux-based systems in the cloud and on-premises.
- You have expertise in using common Linux and network debugging tools.
- You have good organizational skills.
- You have excellent analytical and conceptual abilities.
- You are a team player with strong English communication skills; German is a plus.
- You have experience with incident management and on-call processes.