
Python ETL Developer/Data Engineer (Remote)
IPV Holdings Ltd
Remote, Remote, Remote

Job Title

Python ETL Developer/Data Engineer (Remote)

Job Description

Specific Duties

  • Review, design, and develop ETL jobs to ingest data into the Data Lake, load data to data marts, and extract data to integrate with various business applications.
  • Parse unstructured and semi-structured data such as XML.
  • Design and develop efficient mappings and workflows to load data to data marts.
  • Map XML DTD schemas in Python (customized table definitions).
  • Write efficient queries and reports in Hive or Impala to extract data on an ad hoc basis for data analysis.
  • Identify the performance bottlenecks in ETL Jobs and tune their performance by enhancing or redesigning them.
  • Responsible for performance tuning of ETL mappings and queries.
  • Import tables and all necessary lookup tables to facilitate the ETL process required to process daily XML files, in addition to processing the very large (multi-terabyte) historical XML data files.

Compensation: Commensurate with experience.

Restrictions

  • Telecommuting is OK
  • No Agencies Please

Requirements

  • Self-starter
  • Proactive
  • Very organized, with a high aptitude for learning and solving complex problems
  • Proficiency in languages such as SQL, R, and Python
  • Proficiency in using Hive, Hadoop, and Impala
  • Deep knowledge of performance tuning for ETL jobs, Hadoop jobs, and SQL queries, including partitioning, indexing, and various other techniques

About the Company

https://ipvisibility.com/about-us

Contact Info
