Data Engineer
HAVI is a global, privately owned company focused on innovating, optimizing and managing the supply chains of leading brands. Offering services in marketing analytics, packaging, supply chain management and logistics, HAVI partners with companies to address challenges big and small across the supply chain, from commodity to customer. Founded in 1974, HAVI employs more than 10,000 people and serves customers in more than 100 countries. HAVI’s supply chain services are complemented by the customer engagement services offered by our affiliated company The Marketing Store. For more information, please visit HAVI.com.
Our Global Business Services (GBS) center was launched in 2019. We handle top-notch financial, planning, and supply chain services, actively supporting core business activities including master data management, end-to-end procurement, and supply chain business intelligence analysis for clients in over 30 countries across Europe and beyond. Our teams in Kraków and Katowice focus on efficiency, innovation, and digital transformation, always aiming to exceed our clients’ expectations.
Join us in our mission to promote sustainable development and reduce our environmental footprint. Work in a diverse and inclusive environment where every voice matters. Discover endless career opportunities with exciting projects, hands-on learning, and professional growth.
Come aboard and help us shape the future of supply chain management!
Data Engineer, Integrated Planning Analytics and Insights
Architect, design, implement, enhance, and maintain highly scalable, available, secure, and elastic cloud-ready data solutions using cutting-edge technologies to support our predictive and prescriptive analytics needs. Be an expert in our data domains, act as a trusted partner and advisor to solutions architects and data scientists, and become a crucial part of the analytics solution lifecycle – from prototype to production and operations of our data science and advanced analytics solutions in areas such as promotions, supply and demand planning, item/menu-level analytics, supply chain simulations and optimization, competitive benchmarking, and root cause analysis. Continuously improve and advance our data solutions.
Responsibilities:
- Works with the data management, data science, decision science, and technology teams to address supply chain data needs in demand and supply planning, replenishment, pricing, and optimization
- Develops/refines the data requirements, designs/develops data deliverables, and optimizes data pipelines in non-production and production environments
- Designs, builds, manages, and monitors data pipelines for data structures encompassing data transformation, data models, schemas, metadata, and workload management, working with both IT and business
- Integrates analytics and data science output into business processes and workflows
- Builds and optimizes data pipelines, pipeline architectures, and integrated datasets. These should include ETL/ELT, data replication/CI-CD, API design, and access
- Works with and optimizes existing ETL processes and data integration and preparation flows, and helps move them to production
- Works with popular data discovery, analytics, BI, and AI tools in semantic-layer data discovery
- Is adept in agile methodologies and capable of applying DevOps and DataOps principles to data pipelines to improve communication, integration, reuse, and automation of data flows between data managers and data consumers across the organization
- Implements agentic AI capabilities to drive efficiency and opportunity
Desired Skills & Experience:
- Bachelor’s degree in computer science, data management, information systems, information science or a related field; advanced degree in computer science, data management, information systems, information science or a related field preferred.
- 3+ years in data engineering building production data pipelines (batch and/or streaming) with Spark on cloud.
- 2+ years of hands-on experience with Azure Databricks (PySpark/Scala, Spark SQL, Delta Lake), including:
  - Delta Lake operations (MERGE/CDC, OPTIMIZE/Z-ORDER, VACUUM, partitioning, schema evolution).
  - Unity Catalog (RBAC, permissions, lineage, data masking/row-level access).
  - Databricks Jobs/Workflows or Delta Live Tables.
- Azure Data Factory for orchestration (pipelines, triggers, parameterization, IRs) and integration with ADLS Gen2, Key Vault.
- Strong SQL across large datasets; performance tuning (joins, partitions, file sizing).
- Data quality at scale (e.g., Great Expectations/Deequ), monitoring and alerting; debug/backfill playbooks.
- DevOps for data: Git branching, code reviews, unit/integration testing (pytest/dbx), CI/CD (Azure DevOps/GitHub Actions).
- Infrastructure as Code (Terraform or Bicep) for Databricks workspaces, cluster policies, ADF, storage.
- Observability & cost control: Azure Monitor/Log Analytics; cluster sizing, autoscaling, Photon; cost/perf trade-offs.
- Proven experience collaborating with cross-functional stakeholders (analytics, data governance, product, security) to ship and support data products.
Benefits:
- Possibility of turning your own ideas into success
- Diverse development opportunities
- Varied and interesting field of work
- Responsible tasks with plenty of autonomy
- Collegial working atmosphere
- Open corporate culture
- Cooperation with a dynamic team
- Attractive remuneration models with performance-related pay
- Flat hierarchies and short decision-making processes
- Successful and rapidly growing employer
- Comprehensive, individual onboarding
- Health promotion offerings
- Modern work equipment
- Diverse development opportunities in an international environment
- Training in the relevant specialist field, in line with the applicable training schedule and regulations
At HAVI GBS, we believe in an inclusive and sustainable workplace where our diverse backgrounds, experiences, characteristics, and traits make us better. As an Equal Opportunity Employer, we are committed to promoting diversity within the organization, as well as an inclusive environment where everyone can feel valued and respected, regardless of their background.