S

DevOps Engineer at Savannah Informatics

Savannah Informatics
Full-time
On-site
Responsibilities

The ideal candidate for this position will be working on the following:

Site Reliability Engineering (SRE):


Implement and maintain best practices for ensuring the reliability and availability of web applications and services.
Implement observability tools, perform advanced debugging, and optimise multi-cloud infrastructures.
Set up and maintain Service Level Objectives (SLOs) and Service Level Indicators (SLIs) working with cross-functional teams to develop the documentation
Lead incident management, post-incident reviews, and root cause analysis to continuously improve system reliability and develop strategies to prevent future occurrences.
Monitor and update reliability processes during the systems or products' entire lifecycle for its adherence, improvement and minimizing wastage
Infrastructure:
Build, implement and maintain scalable, resilient cloud-based infrastructure using tools like Terraform and Ansible to optimize deployment workflows.
Scale and optimize resources across multi-cloud environments, ensuring cost-efficiency and performance.
Design robust CI/CD pipelines using tools like GitLab CI/CD, GitHub Actions, or similar platforms.
Participate in the design, implementation, and optimization of infrastructure, emphasizing scalability, security, and performance.
Tackles complex system-level challenges, anticipates future challenges and devise innovative solutions, and drive continuous improvement in infrastructure and processes.


Developer Experience:


Support and enhance the development process by providing tools and practices that improve developer productivity.
Collaborate with software development teams to set up and streamline the CI/CD (Continuous Integration/Continuous Deployment) pipeline.
Create and maintain development environments, including development, staging, and production environments.
Have the expertise and assist developers in debugging, performance optimization, and troubleshooting issues in the development lifecycle.
Implement advanced incident management, post-incident reviews, and proactive improvements to reduce downtime and enhance system reliability.


Tech Financial Operations:


Manage and optimise technology-related financial aspects, including budgeting, cost tracking, and cost control.
Implement and monitor cost-effective solutions for infrastructure and services, optimizing cloud resources.
Work closely with finance and procurement teams to ensure efficient allocation of technology-related budgets.
Implement cost allocation models to attribute technology expenses accurately.
Manage periodic reporting on the FIn Ops progress to the management using advanced cost allocation models and tools to monitor and control technology-related expenses.
Own strategic initiatives, such as cost optimisation, system reliability, and fostering a culture of observability.
Leadership and Mentorship
Actively mentor junior engineers, providing guidance on best practices, technical challenges, and process improvements.
Align technical operations with business goalsby working with cross-functional teams, to achieve overall success


Skills


The ideal candidate for this position will have the following:
Deep knowledge of Linux systems
The candidate must have strong skills in Operating Systems(Linux/Ubuntu/Debian), understand their way around a UNIX shell and believe that where there is a shell, there is a way.
Good computer network skills - He/She/They understands how networks work, the OSI model and protocols including TCP/IP, UDP, ICMP, HTTP(s), DNS, DHCP, SMTP etc.
Apply now
Share this job