Role - SQL Developer
Location - Minneapolis, MN (Remote)
Mandatory Skills - SQL Server, Azure DevOps

Responsibilities:
• Develop, optimize, and maintain ETL/ELT pipelines using PySpark and SQL.
• Work with structured and unstructured data to build scalable data solutions.
• Write efficient and scalable PySpark scripts for data transformation and processing.
• Optimize SQL queries, stored procedures, and indexing strategies to enhance performance.
• Design and implement data models, schemas, and partitioning strategies for large-scale datasets.
• Collaborate with Data Scientists, Analysts, and other Engineers to integrate data workflows.
• Ensure data quality, validation, and consistency in data pipelines.
• Implement error handling, logging, and monitoring for data pipelines.
• Work with cloud platforms (AWS, Azure, or GCP) for data processing and storage.
• Optimize data pipelines for cost efficiency and performance.

Technical Skills Required:
✅ Strong experience in Python for data engineering tasks.
✅ Proficiency in PySpark for large-scale data processing.
✅ Deep understanding of SQL (Joins, Window Functions, CTEs, Query Optimization).
✅ Experience in ETL/ELT development using Spark and SQL.
✅ Experience with cloud data services (AWS Glue, Databricks, Azure Synapse, GCP BigQuery).
✅ Familiarity with orchestration tools (Airflow, Apache Oozie).
✅ Experience with data warehousing (Snowflake, Redshift, BigQuery).
✅ Understanding of performance tuning in PySpark and SQL.
✅ Familiarity with version control (Git) and CI/CD pipelines.
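
For illustration only, the short PySpark sketch below shows the kind of transformation and window-function work described in the responsibilities above; the paths, table, and column names (orders, customer_id, order_ts) are hypothetical placeholders, not part of this requirement.

# Minimal PySpark ETL sketch: read, cleanse, keep latest order per customer, write partitioned output.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F
from pyspark.sql.window import Window

spark = SparkSession.builder.appName("orders_etl_example").getOrCreate()

# Extract: read a raw dataset (hypothetical path)
orders = spark.read.parquet("s3://example-bucket/raw/orders/")

# Transform: basic data-quality filter plus a window function to rank orders per customer
w = Window.partitionBy("customer_id").orderBy(F.col("order_ts").desc())
latest_orders = (
    orders
    .filter(F.col("order_ts").isNotNull())      # simple validation step
    .withColumn("rn", F.row_number().over(w))   # most recent order gets rn = 1
    .filter(F.col("rn") == 1)
    .drop("rn")
)

# Load: write back partitioned by date for downstream consumers
(latest_orders
    .withColumn("order_date", F.to_date("order_ts"))
    .write.mode("overwrite")
    .partitionBy("order_date")
    .parquet("s3://example-bucket/curated/latest_orders/"))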