{"id":5869,"date":"2026-04-09T14:29:50","date_gmt":"2026-04-09T14:29:50","guid":{"rendered":"https:\/\/cephasconsult.biz\/?post_type=job_listing&#038;p=5869"},"modified":"2026-05-10T00:29:45","modified_gmt":"2026-05-10T00:29:45","slug":"data-engineer-96792","status":"expired","type":"job_listing","link":"https:\/\/cephasconsult.biz\/?post_type=job_listing&p=5869","title":{"rendered":"Data Engineer\u00a096792"},"content":{"rendered":"<p><span class=\"flex-shrink-0\">Positions:<span class=\"font-semibold\">1 <\/span><\/span><span class=\"flex-shrink-0 font-semibold\">Full Time<\/span><\/p>\n<div class=\"col-span-1\">Experience<\/div>\n<div class=\"font-inter-semibold-paragraph2  text-cbrex-light-surface-pb col-span-2\">5 &#8211; 12 Years<\/div>\n<div><\/div>\n<div>\n<p><strong><span data-raw-html=\"span\">Role Overview:<\/span><\/strong><\/p>\n<p><span data-raw-html=\"span\">We are looking for a highly skilled\u00a0<\/span><strong><span data-raw-html=\"span\">Data Engineer<\/span><\/strong><span data-raw-html=\"span\">\u00a0with strong exposure to\u00a0<\/span><strong><span data-raw-html=\"span\">Procurement domain and Master Data Management (MDM)<\/span><\/strong><span data-raw-html=\"span\">. 
The ideal candidate will work at the intersection of\u00a0<\/span><strong><span data-raw-html=\"span\">data engineering, procurement analytics, and data governance<\/span><\/strong><span data-raw-html=\"span\">, building scalable data pipelines and ensuring high-quality master data across systems.<\/span><\/p>\n<p><span data-raw-html=\"span\">This role requires expertise in\u00a0<\/span><strong><span data-raw-html=\"span\">open-source technologies, data architecture, and procurement data models (supplier, vendor, material master)<\/span><\/strong><span data-raw-html=\"span\">.<\/span><\/p>\n<p><strong><span data-raw-html=\"span\">Roles and Responsibilities:<\/span><\/strong><\/p>\n<p><strong><span data-raw-html=\"span\">\u00a0<\/span><\/strong><\/p>\n<p><span data-raw-html=\"span\">Data Engineering &amp; Architecture<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00a0<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Design, build, and maintain scalable data pipelines and data platforms for procurement and supply chain data<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Develop and optimize ETL\/ELT pipelines for ingesting large, complex datasets<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Build and manage data lakes \/ warehouses using modern open-source stacks<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Ensure high performance, reliability, and scalability of data systems<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00a0<\/span><\/p>\n<p><span 
data-raw-html=\"span\">Master Data Management (MDM)<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00a0<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Develop and maintain Master Data Management frameworks for supplier, vendor, and material master data<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Ensure data quality, consistency, governance, and standardization across systems<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Implement data validation, cleansing, and enrichment processes<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Define and enforce data governance policies, taxonomies, and naming conventions<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00a0<\/span><\/p>\n<p><span data-raw-html=\"span\">Procurement Data &amp; Domain Integration<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00a0<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Work closely with Procurement, Finance, and Supply Chain teams to understand data requirements<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Manage supplier master data, spend data, contract data, and catalog data<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span 
data-raw-html=\"span\">Enable data-driven procurement insights (spend analytics, vendor performance, risk analysis)<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Integrate data from ERP systems (SAP\/Ariba\/Coupa) and external sources<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00a0<\/span><\/p>\n<p><span data-raw-html=\"span\">Data Governance &amp; Quality<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00a0<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Establish and monitor data quality KPIs and SLAs<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Identify and resolve data inconsistencies, duplication, and integrity issues<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Support data audits, compliance, and regulatory requirements<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00a0<\/span><\/p>\n<p><span data-raw-html=\"span\">Collaboration &amp; Stakeholder Management<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00a0<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Collaborate with data scientists, analysts, and product teams to deliver business insights<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Partner with engineering teams to build self-service data platforms<\/span><\/p>\n<p><span 
data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Communicate complex data concepts to business stakeholders<\/span><\/p>\n<p><strong><span data-raw-html=\"span\">\u00a0<\/span><\/strong><\/p>\n<p><span data-raw-html=\"span\">Technical Skills (Must-Have)<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00a0<\/span><\/p>\n<p><span data-raw-html=\"span\">Core Data Engineering<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00a0<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Strong experience with Python, SQL<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Experience with ETL tools \/ frameworks (Airflow, dbt, etc.)<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Hands-on with data warehousing (Snowflake, BigQuery, Redshift)<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00a0<\/span><\/p>\n<p><span data-raw-html=\"span\">Open Source Technologies (Important)<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00a0<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Apache Spark \/ Hadoop ecosystem<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Kafka \/ streaming frameworks<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Airflow \/ 
Prefect<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Delta Lake \/ Iceberg<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00a0<\/span><\/p>\n<p><span data-raw-html=\"span\">MDM &amp; Data Governance<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00a0<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Experience with Master Data Management tools or frameworks<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Strong understanding of data modeling, data lineage, and metadata management<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Knowledge of data quality frameworks and governance practices<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00a0<\/span><\/p>\n<p><span data-raw-html=\"span\">Procurement Systems (Preferred)<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00a0<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Exposure to SAP (MM), Ariba, Coupa, Ivalua, GEP, Zycus<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Understanding of procurement lifecycle and supplier data models<\/span><\/p>\n<p><strong><span data-raw-html=\"span\">\u00a0<\/span><\/strong><\/p>\n<p><strong><span data-raw-html=\"span\">Required Skills:<\/span><\/strong><\/p>\n<p><strong><span data-raw-html=\"span\">\u00a0<\/span><\/strong><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span 
data-raw-html=\"span\">Bachelor\u2019s or Master\u2019s degree in Computer Science, Data Engineering, Information Systems, or a related field<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">4\u201310 years of relevant experience in Data Engineering, with hands-on exposure to Master Data Management (MDM) and\/or Procurement\/Supply Chain data<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Proven experience in building and managing scalable data pipelines, data lakes, and data warehouses using modern architectures<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Strong understanding of data modeling (dimensional &amp; relational), data governance, and data quality frameworks<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Hands-on experience with open-source data technologies such as Apache Spark, Airflow, Kafka, dbt, etc.<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Exposure to ERP and procurement platforms (SAP MM, Ariba, Coupa, Ivalua, GEP, Zycus) and understanding of supplier\/vendor master data structures<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Experience working on MDM implementations, data standardization, deduplication, and governance frameworks<\/span><\/p>\n<p><span 
data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Strong proficiency in SQL and Python, with experience in handling large-scale datasets<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Familiarity with cloud platforms (AWS, Azure, or GCP) and modern data stack<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Experience working in cross-functional and global stakeholder environments<\/span><\/p>\n<p><span data-raw-html=\"span\">\u00b7<\/span><span data-raw-html=\"span\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/span><span data-raw-html=\"span\">Strong problem-solving skills with the ability to translate business requirements into scalable data solutions<\/span><\/p>\n<\/div>\n","protected":false},"author":1,"featured_media":0,"template":"","meta":{"_job_location":"Mumbai, Maharashtra, 
India","_application":"hrm@cephasconsult.biz","_company_name":"","_company_website":"","_company_tagline":"","_company_twitter":"","_company_video":"","_filled":0,"_featured":0,"_remote_position":0,"_job_salary":"","_job_salary_currency":"","_job_salary_unit":""},"job-types":[],"class_list":["post-5869","job_listing","type-job_listing","status-expired","hentry"],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/cephasconsult.biz\/index.php\/wp-json\/wp\/v2\/job-listings\/5869","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/cephasconsult.biz\/index.php\/wp-json\/wp\/v2\/job-listings"}],"about":[{"href":"https:\/\/cephasconsult.biz\/index.php\/wp-json\/wp\/v2\/types\/job_listing"}],"author":[{"embeddable":true,"href":"https:\/\/cephasconsult.biz\/index.php\/wp-json\/wp\/v2\/users\/1"}],"wp:attachment":[{"href":"https:\/\/cephasconsult.biz\/index.php\/wp-json\/wp\/v2\/media?parent=5869"}],"wp:term":[{"taxonomy":"job_listing_type","embeddable":true,"href":"https:\/\/cephasconsult.biz\/index.php\/wp-json\/wp\/v2\/job-types?post=5869"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}