Browse
···
Log in / Register

Data Engineer (PySpark) - Leading UAE Bank, Cloudera Data Platform Expert

Negotiable Salary

GSSTech Group

Dubai - United Arab Emirates

Favourites
Share

Description

Job Title: Data Engineer (PySpark) ________________________________________ About the Role We are seeking a highly skilled Data Engineer with deep expertise in PySpark and the Cloudera Data Platform (CDP) to join our data engineering team. As a Data Engineer, you will be responsible for designing, developing, and maintaining scalable data pipelines that ensure high data quality and availability across the organization. This role requires a strong background in big data ecosystems, cloud-native tools, and advanced data processing techniques. The ideal candidate has hands-on experience with data ingestion, transformation, and optimization on the Cloudera Data Platform, along with a proven track record of implementing data engineering best practices. You will work closely with other data engineers to build solutions that drive impactful business insights. Responsibilities Data Pipeline Development: Design, develop, and maintain highly scalable and optimized ETL pipelines using PySpark on the Cloudera Data Platform, ensuring data integrity and accuracy. Data Ingestion: Implement and manage data ingestion processes from a variety of sources (e.g., relational databases, APIs, file systems) to the data lake or data warehouse on CDP. Data Transformation and Processing: Use PySpark to process, cleanse, and transform large datasets into meaningful formats that support analytical needs and business requirements. Performance Optimization: Conduct performance tuning of PySpark code and Cloudera components, optimizing resource utilization and reducing runtime of ETL processes. Data Quality and Validation: Implement data quality checks, monitoring, and validation routines to ensure data accuracy and reliability throughout the pipeline. Automation and Orchestration: Automate data workflows using tools like Apache Oozie, Airflow, or similar orchestration tools within the Cloudera ecosystem. Monitoring and Maintenance: Monitor pipeline performance, troubleshoot issues, and perform routine maintenance on the Cloudera Data Platform and associated data processes. Collaboration: Work closely with other data engineers, analysts, product managers, and other stakeholders to understand data requirements and support various data-driven initiatives. Documentation: Maintain thorough documentation of data engineering processes, code, and pipeline configurations.  Qualifications  Education and Experience Bachelor’s or Master’s degree in Computer Science, Data Engineering, Information Systems, or a related field. 3+ years of experience as a Data Engineer, with a strong focus on PySpark and the Cloudera Data Platform.  Technical Skills PySpark: Advanced proficiency in PySpark, including working with RDDs, DataFrames, and optimization techniques. Cloudera Data Platform: Strong experience with Cloudera Data Platform (CDP) components, including Cloudera Manager, Hive, Impala, HDFS, and HBase. Data Warehousing: Knowledge of data warehousing concepts, ETL best practices, and experience with SQL-based tools (e.g., Hive, Impala). Big Data Technologies: Familiarity with Hadoop, Kafka, and other distributed computing tools. Orchestration and Scheduling: Experience with Apache Oozie, Airflow, or similar orchestration frameworks. Scripting and Automation: Strong scripting skills in Linux.  Soft Skills Strong analytical and problem-solving skills. Excellent verbal and written communication abilities. Ability to work independently and collaboratively in a team environment. Attention to detail and commitment to data quality.


Location
Dubai - United Arab Emirates
Show Map

Workable
3,441listings

You may also like

Dubizzle
Telesales Executive (Software B2B)
X8V5+24 Dubai - United Arab Emirates
Join Aphidas as a Telesales Executive (B2B Software Sales) At **Aphidas**, a leading Salesforce Partner in the region, we empower businesses with innovative CRM and digital transformation solutions. We’re growing fast — and we’re looking for a driven **Telesales Executive** to help us expand our reach and connect with new clients. What You’ll Do * Drive **outbound cold calling campaigns** to generate and qualify leads. * Connect with decision-makers in the **B2B technology and software space**. * Build and nurture a healthy sales pipeline to support Aphidas’ growth. * **Handle objections with resilience**, turning rejections into opportunities. * Collaborate closely with the sales team to achieve and exceed targets. What We’re Looking For * **Telesales or call center background**, ideally in software or B2B sales. * Experience in **cold calling, lead generation, and pipeline building**. * **Resilient and target-driven** — you thrive on challenges and bounce back strong. * Excellent communication and persuasion skills. * A positive, team-oriented mindset with the ability to influence outcomes. Why Aphidas? * Be part of a **Salesforce Summit Partner** shaping digital transformation across industries. * Competitive salary plus performance-based incentives. * A collaborative, growth-focused environment where your success is celebrated. * Opportunities for career advancement within the Salesforce ecosystem. Ready to take your sales career to the next level? Apply now and grow with Aphidas!
AED 4,000-5,999/month
Cookie
Cookie Settings
Our Apps
Download
Download on the
APP Store
Download
Get it on
Google Play
© 2025 Servanan International Pte. Ltd.