U

Data Engineer at University of Cape Town

University of Cape Town
June 04, 2026
Full-time
On-site

The Data Engineer will be responsible for building, maintaining, and optimising the organisation's data and AI infrastructure. In addition to contributing to digital transformation initiatives, the successful candidate will assist in developing reliable platforms that support data processing, analytics, and machine learning operations.
The role sits at the intersection of data engineering, cloud infrastructure, DevOps, and machine learning operations (MLOps), ensuring that systems are scalable, automated, and production-ready for real-world use.


The successful candidate will work closely with software developer, business analyst and Knowledge Translation teams to:


Build and maintain data platforms (warehouses, pipelines)
Develop and optimise large-scale data processing systems
Build and support AI/ML platforms for model training and deployment
Implement CI/CD pipelines for data and AI workflows
Automate infrastructure using Infrastructure as Code (IaC)
Monitor, troubleshoot, and optimise system performance
Ensure reliability, scalability, and security of platforms
The position provides hands-on technical implementation and operational support for data and AI systems across the organisation.


Qualifications:


Bachelor's degree in Computer Science, Information Systems, or a related field


Required Skills:


Proficiency in Python and SQL
Strong command of data engineering concepts, including ETL/ELT pipelines
Hands-on experience with cloud platforms such as Amazon Web Services, Microsoft Azure, or Google Cloud
Practical experience working with big data technologies
Proficiency in Linux environments and scripting
Strong understanding of DevOps principles and automation practices
Proven problem-solving capability with a systems-thinking approach to designing scalable and efficient solutions
Exceptional attention to detail with a strong, reliability-focused mindset


Experience:

Essential:


Minimum of 3 years experience in data engineering, platform engineering, or similar roles
Proven experience building and maintaining data pipelines and platforms
Experience working with cloud-based infrastructure


Advantageous:


Experience with containerisation and orchestration tools (Docker, Kubernetes)
Exposure tMLOps tools and machine learning workflows
Experience implementing CI/CD pipelines
Familiarity with real-time data processing systems
Experience in healthcare, research, or public sector environments


Key Responsibilities:

Data Platform Engineering


Design, build, and maintain scalable data platforms including data lakes, warehouses, and pipelines.
Develop and optimise systems for processing large volumes of structured and unstructured data.
Ensure efficient, reliable, and secure data flow across systems.


AI/ML Platform Development (MLOps)


Build and maintain platforms that support machine learning model development, training, and deployment.
Enable model versioning, monitoring, and lifecycle management.
Support integration of AI models into production environments through APIs and services.


Cloud and Infrastructure Engineering


Develop and manage cloud-based infrastructure using platforms such as AWS, Azure, or Google Cloud.
Implement Infrastructure as Code (IaC) for scalable and repeatable deployments.
Evaluate and implement appropriate tools and technologies for platform development.


Automation and DevOps


Develop and maintain CI/CD pipelines for data and AI systems.
Automate workflows to improve efficiency, consistency, and scalability.
Implement monitoring, logging, and alerting systems to ensure operational visibility.


Performance, Reliability, and Support


Ensure systems are highly available, scalable, and capable of handling large-scale data and real-time processing.
Identify and resolve performance bottlenecks and system failures.
Provide ongoing maintenance and optimisation of platforms to ensure stability and efficiency.


Systems Implementation and Collaboration


Work closely with developers, data scientists, and Knowledge Translation teams to support platform needs.
Provide technical input into system design and implementation decisions.
Continuously identify opportunities to improve platform performance, scalability, and usability.


Closing Date: 15 June 2026