
Your role: Federated Data Steward

Introduction

The Federated Data Steward plays a key supporting role in ensuring that health data used in federated learning (FL) is high quality, well documented, legally compliant, and FAIR (Findable, Accessible, Interoperable and Reusable), even when it remains distributed across multiple institutions.

Building on the evolving profession of research data stewardship, this role adapts to the specific needs of secure, privacy-preserving, and cross-site data sharing. Federated data stewards act as local or central points of contact who bridge research, infrastructure, and governance efforts.

They help translate policies into practice, ensure datasets are harmonised and well-annotated, and support institutions in aligning with federated learning protocols and standards.

Key Responsibilities

  • Assist with mapping and transforming local data into harmonised formats or common data models (e.g. OMOP, FHIR)
  • Ensure metadata completeness and consistency to support interoperability
  • Guide researchers and IT teams on FAIR principles and legal requirements
  • Support the documentation of data flows, wrangling pipelines, and data provenance
  • Coordinate with legal teams on data access permissions, pseudonymisation, and retention policies
  • Help validate and test data readiness for FL training rounds
  • Act as a knowledge hub for tools, standards, and training in FL data stewardship
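
As an illustration of the metadata completeness and data readiness checks listed above, here is a minimal Python sketch. It assumes each site describes its dataset in a simple dictionary; the field names in REQUIRED_FIELDS and the helper names missing_metadata and readiness_report are hypothetical and would be replaced by whatever metadata schema a given federation agrees on.

```python
from typing import Any

# Hypothetical required fields; a real federation would take these
# from its agreed metadata schema or data management plan.
REQUIRED_FIELDS = ("title", "description", "license", "data_model", "contact")


def missing_metadata(record: dict[str, Any], required=REQUIRED_FIELDS) -> list[str]:
    """Return the required fields that are absent or empty in one record."""
    missing = []
    for field in required:
        value = record.get(field)
        # Treat None and whitespace-only strings as "not filled in".
        if value is None or (isinstance(value, str) and not value.strip()):
            missing.append(field)
    return missing


def readiness_report(records: dict[str, dict[str, Any]]) -> dict[str, list[str]]:
    """Map each site to its missing fields; complete sites are omitted."""
    report = {}
    for site, record in records.items():
        gaps = missing_metadata(record)
        if gaps:
            report[site] = gaps
    return report


if __name__ == "__main__":
    # Illustrative records only, not real site data.
    sites = {
        "site_a": {
            "title": "Cohort A",
            "description": "Example cohort",
            "license": "CC-BY-4.0",
            "data_model": "OMOP",
            "contact": "steward@example.org",
        },
        "site_b": {"title": "Cohort B", "data_model": "FHIR"},
    }
    print(readiness_report(sites))
```

A report like this could be shared with each site before a training round so that gaps are fixed locally, without any patient-level data leaving the institution.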

Common Challenges

  • Supporting multiple departments or projects with diverse data formats and quality
  • Translating FAIR and legal principles into operational workflows
  • Working across silos (research, IT, legal) without formal authority
  • Managing uncertainty about roles, responsibilities, and technical expectations in FL
  • Ensuring sustainability and documentation beyond the project lifecycle

Data Models & FAIR Alignment

Data Stewardship Tools

Metadata & Validation

Relevant FLKit Sections

  • Enhance & Wrangle Data: data cleaning, harmonisation, metadata
  • Plan & Govern: local policy support, permissions, documentation
  • Enable Infrastructure: aligning local systems with FL pipelines

Training & Further Reading

More information

FAIR Cookbook is an online, open and live resource for the Life Sciences, with recipes that help you make and keep data Findable, Accessible, Interoperable and Reusable; in one word, FAIR.

With Data Stewardship Wizard (DSW), you can create, plan, collaborate, and bring your data management plans to life with a tool trusted by thousands of people worldwide, from data management pioneers to international research institutes.
