
Data & Databricks Test Automation Engineer
The Citco Group Limited • Dublin, Leinster, Ireland
Role & seniority: Data & Databricks Test Automation Engineer; mid-level (2+ years hands-on Databricks/Spark experience)
Stack/tools
- Databricks (Spark), Delta Lake, Delta Live Tables, Unity Catalog
- Python (PySpark), SQL
- Data testing frameworks/tools; AWS (S3, Glue, Lambda); Git
- Spark optimization, performance testing; data quality and governance concepts
Top 3 responsibilities
- Design and implement automated testing for Databricks notebooks/workflows, Delta Lake tables, ACID guarantees, structured streaming, and Delta Live Tables
- Automate testing of ETL/ELT pipelines, Spark job validation/optimization, data ingestion and transformation checks, and Unity Catalog governance features
- Develop and execute data quality strategies, automated reconciliation, performance tests, monitoring frameworks, and testing dashboards with documentation
Must-have skills
- 2+ years with Databricks/Spark
- Python (PySpark) and SQL proficiency
- Experience with data testing frameworks/tools
- Knowledge of AWS services (S3, Glue, Lambda)
- Understanding of Delta Lake and Lakehouse architecture
- Version control (Git); strong analytical/problem-solving abilities
Nice-to-haves
- Databricks certifications or data testing credentials
- Experience with Delta Live Tables, Unity Catalog governance testing
- Large-scale data processing experience; Agile methodologies
Location & work type
- Location: Dublin, Leinster, Ireland (from the listing header; not stated in the description); work type not specified
Full Description
Data & Databricks Test Automation Engineer
Company Overview
Citco is a global leader in financial services, delivering innovative solutions to some of the world's largest institutional clients. We are seeking a Test Automation Engineer specializing in Databricks and data platforms to ensure the quality and reliability of our data solutions.
Role Description
As a Data & Databricks Test Automation Engineer, you will be responsible for developing and implementing automated testing frameworks for Databricks-based solutions, data pipelines, and data quality validation. You will work closely with data engineering teams to ensure data accuracy and reliability across our Lakehouse architecture.
Key Responsibilities
- Databricks Testing
  - Design and implement automated testing for Databricks notebooks and workflows
  - Create test frameworks for Delta Lake tables and ACID transactions
  - Develop automated validation for structured streaming pipelines
  - Validate Delta Live Tables implementations
- Data Pipeline Testing
  - Automate testing for ETL/ELT processes in Databricks
  - Implement Spark job testing and optimization validation
  - Create test cases for data ingestion and processing workflows
  - Develop automated checks for data transformations
  - Test Unity Catalog features and access controls
- Quality Assurance
  - Design and execute data quality test strategies
  - Implement automated data reconciliation processes
  - Develop performance testing for large-scale Spark jobs
- Monitoring & Reporting
  - Implement pipeline monitoring test frameworks
  - Create automated test dashboards
  - Generate quality metrics and testing reports
  - Maintain comprehensive test documentation
Requirements & Qualifications
- Educational Background
  - Bachelor’s degree in Computer Science, Data Science, or related field
  - Relevant certifications in Databricks or data testing are a plus
- Technical Experience
  - 2+ years hands-on experience with Databricks (Spark)
  - Strong programming skills in Python (PySpark) and SQL
  - Experience with data testing frameworks and tools
  - Knowledge of AWS services (S3, Glue, Lambda)
  - Understanding of Delta Lake and Lakehouse architecture
  - Experience with version control systems (Git)
- Additional Skills
  - Strong analytical and problem-solving abilities
  - Experience with large-scale data processing
  - Knowledge of data quality best practices
  - Understanding of data governance and compliance requirements
  - Experience with Agile methodologies
- Platform Knowledge
  - Databricks workspace and notebook development
  - Delta Lake and Delta Live Tables
  - Unity Catalog for governance testing
  - Spark optimization and performance testing