Senior AWS/Databricks Engineer
Absa
R900–1,085 per hour
Johannesburg, City of Johannesburg Metropolitan Municipality
Role
We are seeking a hands-on AWS Platform & Networking Engineer to own the AWS connectivity and operational foundations that enable Databricks on AWS in a regulated enterprise environment. This role is AWS-first: you will focus on the network, security, and connectivity layer between our Databricks AWS VPC (IRE region) and enterprise/bank connectivity, as well as reliable access to core AWS services such as S3.
You will have administrator rights within the Databricks AWS VPC and will implement changes directly where in scope. For changes outside the VPC boundary (e.g., other AWS accounts, shared network services, or enterprise perimeter controls), you will raise, drive, and follow through on change requests with the Absa Cloud team, owning end-to-end resolution and service reliability.
Key Responsibilities
- AWS VPC ownership (Databricks VPC): administer and maintain subnets, route tables, security groups, NACLs, NAT/Internet egress patterns (as applicable), and network segmentation to meet performance and security requirements.
- Connectivity to the bank / enterprise network: troubleshoot and support end-to-end connectivity between the Databricks AWS VPC (IRE) and the bank network across cross-region/cross-account boundaries; coordinate and drive changes with the Absa Cloud team where required.
- Private access to AWS services: design, implement, and operate VPC endpoints and related routing/DNS patterns to enable secure access to services such as S3 while reducing reliance on public internet paths.
- S3 data access enablement (with security controls): partner with platform/security teams to ensure Databricks workloads can reliably read/write required S3 data using appropriate IAM roles/policies and encryption controls; support diagnosis of access failures that present as platform incidents.
- Operational support & reliability: provide production support for the platform connectivity layer (incident response, RCA, preventative actions), maintain runbooks and reference diagrams, and implement improvements to reduce repeat incidents.
- Cross-team change management: raise, manage, and chase change requests with the Absa Cloud team for items outside the Databricks VPC boundary; translate technical needs into clear implementation requirements and validate changes end-to-end.
Required Skills & Experience (Must-have)
- AWS networking: strong hands-on experience with VPC design/operations, routing, security groups/NACLs, and network troubleshooting in production.
- Enterprise cloud operations: experience operating within a regulated/enterprise environment with change management, auditability, and strict security controls.
- Connectivity troubleshooting: ability to diagnose reachability issues across complex boundaries (cross-account/cross-region, enterprise network perimeters) and drive resolution across multiple teams.
- AWS service access patterns: experience enabling secure access to services like S3 (and related IAM policy patterns) in a way that supports production workloads.
- Stakeholder management: proven ability to liaise with a central cloud/network team, raise and drive changes, and communicate clearly during incidents.
Desirable Experience (Nice-to-have)
- Databricks on AWS experience: understanding of Databricks workspace architecture and its connectivity constraints (data plane/control plane concepts, typical network dependencies).
- Private connectivity patterns: experience with private endpoint patterns and enterprise connectivity services (e.g., endpoint-based access, centralised routing constructs).
- Infrastructure-as-Code: Terraform/CloudFormation experience for repeatable, audited changes.
- Security tooling and monitoring: exposure to logging/monitoring approaches used for network and cloud operations.
Ways of Working
- Owns outcomes end-to-end (hands-on fixes inside the VPC; drives changes outside the boundary through the Absa Cloud team).
- Strong operational mindset: prioritises stability, clear communication, and measurable prevention of repeat incidents.
- Documents and standardises: runbooks, network diagrams, and repeatable change patterns.