Job Role: Data Engineer - Kafka (10+ years)
Location: Irvine, CA
Long-Term Contract
- Expertise in ETL optimization, designing, coding, and tuning big data processes using Apache Spark or similar technologies.
- Experience building robust and scalable data integration (ETL) pipelines using SQL, Python, and Spark or PySpark. Advanced knowledge of at least one of these programming languages is a must.
- Ability to maintain and enhance Confluent Kafka architecture, apply Confluent Kafka design principles, and follow CI/CD deployment procedures
- Experience building streaming applications with Kafka (Confluent Kafka preferred; open-source Kafka acceptable)
- Development experience with Kafka producers, consumers, and streams
- Experience building data pipelines and applications to stream and process datasets at low latency.
- Experience developing real-time, scalable systems using Apache Kafka, Confluent Kafka, or Kafka Streams.
- Efficiency in handling data: tracking data lineage, ensuring data quality, and improving data discoverability.
- Good understanding of AWS technologies (S3, AWS Glue, CDK, ECS, EMR, Redshift, Athena)
- Sound knowledge of distributed systems and data architecture (e.g., the Lambda architecture): able to design and implement batch and stream data processing pipelines, and to optimize the distribution, partitioning, and MPP processing of high-level data structures.
- Knowledge of Engineering and Operational Excellence using standard methodologies.
- Experience with process improvement, workflow, benchmarking, and/or evaluation of business processes.
Thanks & Regards,
Dushyant Som
Lead Technical Recruiter
Phone: (469) 697 2496
Email: dushyant.som@convextech.com
Website: www.convextech.com