Software Site Reliability Engineer

Job description

Stability is key on our environments and we are looking for a Software Site Reliability Engineer who can help us in our efforts to increase stability, also keeping the reliability of our platforms in mind. If you enjoy mitigating risk by quickly detecting issues, fast deployments and building resilient products this could be the job for you. The ideal candidate has a Software Engineering background.


As a Software Engineer in our SRE team you will have the opportunity to handle the complex problems of scale while using your expertise in coding, algorithms, complexity analyses and large scale system design.

We are always on call to keep our networks and applications up and running, ensuring our developers and users have the best and fastest experience possible.


As a Software SRE you will:

  • Design, write and deliver software and tooling to improve the availability, scalability, latency and efficiency of GeriMedica’s products and platforms;
  • Solve problems relating to mission critical services and build automation to prevent problem recurrence;
  • Contribute to the automation of our system maintenance and development procedures;
  • Create and align real-time monitoring and testing approaches with developers and Operations;
  • Provide immediate relief to issues within the GeriMedica hosted environment;
  • Work closely together with our hosting and operations partner, software architects and technical support;
  • Bring an SRE perspective to the development teams.


What we offer:

  • A collaborative, informal and playful work environment with self-organized, multi-disciplinary agile/lean teams;

  • Managers and teammates who are invested in your growth as a technologist and as a person;

  • Work with cool modern technologies such as Kubernetes, Elasticsearch, Docker, RabbitMQ, Openstack Swift, etc.;

  • Access to tools and resources to do your job (best-in-class laptop, IDEs, conferences and meetups, all the coffee and soda you can drink, and much more);

  • Ability to join and participate in internal discussions at GeriMedica on topics like DevOps, Big data, Mobile development, Machine learning and others;

  • Free & healthy daily lunch (yes! There is such a thing)

Requirements

We are looking for someone with the following profile:

  • Expertise in designing, analysing and troubleshooting large-scale distributed systems;

  • Love big scale things and be able to identify risks and recommend actions to mitigate this risk;

  • Ability to work in an Agile cross-functional team (e.g., DevOps, Kanban, Scrum);

  • Easy to communicate with anyone from any discipline;

  • Be coding savvy (Java, Javascript, etc.);

  • Open to learning new skills and technologies;

  • Awareness of security practices in all aspects of systems and software development;

  • Ability to create your own solutions as well as using off-the-shelf ones.


A suitable candidate would have (part of) the following skills:

  • Comfortable working with Linux based systems.

  • Have knowledge of scripting languages like Bash and Python

  • Familiar with basic system and networking concepts.

  • Willing to dive into the world of Docker and Kubernetes. Being savvy is a plus!