AWS Glue Developer

Plano, Texas

Our client is seeking an AWS Glue Developer to join their team in Plano, TX.

They are seeking a passionate and intellectually curious Big Data Spark Engineer for their Data Engineering team. The Data Engineering team is responsible for creating data pipelines in the big data space using AWS Glue. This role requires the candidate to analyze requirements and to build and test applications using PySpark that meet the business requirements of a cloud platform-based application.

Qualifications

  • 6+ years of demonstrated experience in software development
  • Experience designing and developing data pipelines for data ingestion and transformation using PySpark
  • 4+ years of demonstrated experience in distributed computing using Python and PySpark
  • Understanding of the Spark framework and Spark architecture
  • Experience working in AWS cloud-based big data infrastructure
  • Development experience using Lambda and Python
  • Excellent at troubleshooting performance and data skew issues
  • Understanding of Spark runtime metrics and the ability to tune applications based on those metrics
  • Knowledge of partitioning and bucketing concepts for data ingestion
  • Understanding of AWS services such as Glue, Athena, S3, Lambda, and CloudFormation
  • Preferred: working knowledge of implementing data lake ETL using AWS Glue
  • Production experience deploying big data applications
  • Knowledge of the challenges associated with big data and how to build systems that scale seamlessly
  • Ability to transform complex analytical models into scalable, production-ready solutions
  • Strong communication skills
  • Mentoring and peer review of designs and coded implementations
  • Experience with source code control systems such as Bitbucket, GitLab, etc.
  • Proactive in solving problems and looking for ways to improve services
