Site Reliability Engineer
For a rapidly growing UK digital design and development company that primarily serves the sports betting and gaming industry and specialises in high-traffic applications and websites built with the most advanced technologies, we are looking for:
Site Reliability Engineer
Location: Krakow
Responsibilities:
- Continuous Deployment, automated rollback and failover
- Maintenance and development of product infrastructure strategy
- Continuous Performance Management: measuring performance and working with developers to improve it
- Detect and resolve security, performance and availability issues to ensure maximum uptime and performance
- Investigate, evaluate and recommend new tools and technologies for faster fault finding
- Administration of web servers and load balancers (HAProxy, nginx)
- Network and Linux virtual machines administration
- Containerisation / packaging. Distributed component integration/troubleshooting (Docker, Kubernetes)
- Log aggregation (Kibana / Logstash / Graylog)
- Monitoring (CA Wily Introscope, New Relic)
- Troubleshooting various issues in a cloud environment in a measured, methodical way, often under pressure
We expect:
- experience in an SRE or DevOps/Ops role, administering production software environments
- strong practical knowledge of Linux/Unix, networking/administration
- practical knowledge of the HTTP protocol (caching, debugging, monitoring)
- good understanding of the cloud computing paradigm (distributed logging, service discovery, stateless applications, scaling, HA)
- understanding of the Infrastructure as Code paradigm
- experience in troubleshooting issues in distributed systems
- practical knowledge of Git SCM and good understanding of Git flow concepts
- hands-on experience of writing scripts in at least one scripting language
- experience in building pipelines that automate application scaffolding, testing, building, auto-scaling and integration
- initiative to propose improvements to the development teams (software design patterns, best practices, code styles)
- experience with real-life deployments to any of the major cloud providers (e.g. AWS, Google Cloud)
- understanding of Mesos + Marathon/Kubernetes
- experience with developing an in-house IaaS/PaaS solution
- work experience with configuration management tools such as Ansible, Chef
- understanding of GCP
- experience working with developers on the middleware/frontend tier
- experience in infrastructure provisioning with Terraform
- very good English skills
We offer:
- flexible working hours
- free fruit
- table football
- spacious and cosy office space near the city centre
- regular team building events
We will contact only selected candidates.