Senior Data Engineer
Absa
n/a - n/a
R900–950
15 Troye Street, City of Johannesburg Metropolitan Municipality, 2001
Data Engineer
Cloud Engineer
Data Engineering
Databricks
Python
SQL
Apache Spark
Role
Role Overview
We are seeking a Senior Data Engineer (senior individual contributor) to design, build, and operate Databricks & Lakehouse data platforms that support analytics, AI, and Generative AI applications.
This role works within product-aligned squads and focuses on delivering high-quality, governed, and scalable data assets consumed by analytics platforms, machine learning models, and GenAI applications including LLM- and agent-based systems.
Key Responsibilities
- Data Engineering & Lakehouse Delivery
- Build, and maintain data pipelines and lakehouse structures
- Deliver data solutions that support:
- Analytics and BI
- Machine learning workloads
- Generative AI applications and agents
- Apply enterprise data lake and lakehouse principles to ensure data is:
- Reliable
- Well-governed and aligned to Absa’s governance
- Secure
- Fit for downstream consumption
- Translate business and analytical requirements into production-ready data solutions
Databricks & Platform Usage
- Build and operate solutions using Databricks, including:
- Delta Lake
- Databricks Jobs and Workflows
- Unity Catalog
- Notebooks and shared libraries
- Enable data consumption by:
- GenAI use cases (e.g. RAG, AI services, agent workflows)
- Analytics and reporting tools
- Downstream operational systems
- Support feature-style and curated data access patterns required by AI and GenAI workloads
Generative AI Enablement
- Build data pipelines that feed Generative AI applications, including:
- Curated knowledge datasets
- Structured and semi-structured data sources
- Metadata and lineage required for AI consumption
- Enable data patterns commonly used in GenAI, such as:
- Retrieval‑Augmented Generation (RAG)
- Context and prompt data preparation
- Model input, output, and feedback data flows
- Work closely with AI Engineers and Product Owners to align data engineering deliverables to GenAI use cases. Note: you will also be involved in AI Engineer development.
Engineering Practices
- Develop production-grade pipelines using Python, SQL, and Apache Spark
- Implement automated testing and CI/CD practices for data workloads
- Ensure data solutions are:
- Observable
- Resilient
- Performant
- Cost-efficient
- Contribute to improving data quality, reliability, and operational stability
Collaboration & Ways of Working
- Work as a senior engineer within a cross-functional product squad
- Collaborate closely with:
- Product Owners
- AI / ML Engineers
- Analytics teams
- Platform and security teams
- Provide engineering input into design discussions and delivery decisions
- Support peer reviews and shared engineering standards
Risk, Governance & Run
- Ensure data solutions comply with enterprise security, risk, and governance standards
- Support operational stability of data pipelines used by analytics and AI workloads
- Participate in incident resolution and root cause analysis
- Maintain appropriate documentation and runbooks
Required Skills & Experience
- Proven experience as a Senior / Lead Data Engineer
- Hands-on experience working in Databricks environments
- Strong understanding of enterprise data lake and lakehouse architectures
- Proficiency in:
- Python
- SQL
- Apache Spark
- Experience building and operating production-grade data platforms
- Experience working in enterprise or regulated environments
Desirable Experience
- Experience enabling AI, ML, or Generative AI use cases from a data engineering perspective
- Familiarity with:
- RAG data patterns
- Feature-style or AI-serving datasets
- Vector or embedding-ready data workflows
- Experience working in Agile, product-aligned squads
- Exposure to cloud-native data platforms (AWS or Azure)
Role Clarification
- This is a senior individual contributor role
- The role does not include formal people management or technical lead accountability
- The focus is on delivery, quality, and enabling AI and GenAI outcomes
Apply
Share