Job Summary
Deep understanding and experience with modern Data Platforms – Cloudera Data Platform (CDP)
• Strong hands-on experience with PySpark.
• Experience with Cloudera Data Platform (CDE, CDW, Ozone, Airflow, SDX), Apache Ranger
• Deep understanding of distributed data systems and Hive Metastore
• Experience and understanding of cataloging, lineage, and governance
• Experience / understanding Open Data Contract Standard (ODCS) and its implementation
• Experience working with SQL, file formats (Iceberg/Parquet), and partitioning/bucketing strategies.
Key Responsibilities
Deep understanding and experience with modern Data Platforms – Cloudera Data Platform (CDP)
• Strong hands-on experience with PySpark.
• Experience with Cloudera Data Platform (CDE, CDW, Ozone, Airflow, SDX), Apache Ranger
• Deep understanding of distributed data systems and Hive Metastore
• Experience and understanding of cataloging, lineage, and governance
• Experience / understanding Open Data Contract Standard (ODCS) and its implementation
• Experience working with SQL, file formats (Iceberg/Parquet), and partitioning/bucketing strategies.
Skill Requirements
Deep understanding and experience with modern Data Platforms – Cloudera Data Platform (CDP)
• Strong hands-on experience with PySpark.
• Experience with Cloudera Data Platform (CDE, CDW, Ozone, Airflow, SDX), Apache Ranger
• Deep understanding of distributed data systems and Hive Metastore
• Experience and understanding of cataloging, lineage, and governance
• Experience / understanding Open Data Contract Standard (ODCS) and its implementation
• Experience working with SQL, file formats (Iceberg/Parquet), and partitioning/bucketing strategies.