Monday, August 02, 2021

  

Report on RDMF21: Data Stewardship in Research Institutions

 

I attended the RDMF21 virtual workshop on Zoom on 12 and 13 July 2021. The workshop, entitled Data Stewardship in Research Institutions, was hosted by the Digital Curation Centre (DCC). https://www.dcc.ac.uk/

My attendance was supported by a grant from the Digital Preservation Coalition (DPC). https://www.dpconline.org/

 

The two half-day workshops focused on institutional data stewardship roles and on how national-level communities of practice can help institutions coordinate. The overarching theme was coordinating support for research data stewardship across generic and disciplinary roles.

In this blogpost I share some notes and thoughts compiled from my personal notes, collaborative notes, tweets with the hashtag #RDMF21, and presenters' slide decks.

1.    The keynote was delivered by Mijke Jetten from the Dutch Techcentre for Life Sciences, who spoke on professionalising data steward roles in the Netherlands.

Some key points:

·         It is agreed that data stewardship and data management skills are essential in research, but there is no consensus on the responsibilities and tasks of data stewards.

·         Need for skills and capacity building at national and international level

·         Change managers are also needed: people who push culture/policy change

·         Human capacity needed: an OECD report estimates 3 FTE per 100 researchers, and the EC High Level Expert Group on EOSC estimates 5 FTE per 100 researchers

·         National initiative: the Dutch National Programme Open Science (NPOS) developed basic data steward job profile components

·         International initiatives: ELIXIR-CONVERGE (toolkit), RDA Professionalising Data Stewardship IG (topics), and RDA Libraries 4 Research Data IG (23 things for data stewards)

 

2.    Lisa Otty, Centre Manager at the Edinburgh Centre for Data, Culture and Society (CDCS), presented on joining Social Science and Humanities support with Central RDM Services at the University of Edinburgh.

Key points:

·         The Centre for Data, Culture and Society launched in 2019. It supports a network of academics with shared interests, based in the College of Arts, Humanities and Social Sciences (CAHSS) at the University of Edinburgh (UoE).

·         The SHAPE acronym refers to Social Sciences, Humanities and the Arts for People and the Economy, https://thisisshape.org.uk/

·         The CDCS mission is to support applied digital research across the College's research activity and to create a community of practice across CAHSS for data-driven work.

·         Training is a key activity for CDCS. They offer over 50 courses for 600+ people per year. They also offered workshops on how to manage research under pandemic conditions.

·         They offer social events, such as coffee mornings for informal discussions on data management, and monthly data clinics.

 

3.    Paul van Schayck, from the Faculty of Health, Medicine and Life Sciences at Maastricht University, presented on collaborations between Data Stewards and Research Software Engineers on a Life Sciences service.

Key points:

·         Presented from the perspective of a research data manager (RDM) as well as a researcher

·         Data Hub Maastricht is an RDM support provider; it offers the Maastricht Data Repository for high-volume data

·         Support is offered by Disciplinary Data Stewards

·         Prioritising development: the stakeholders are the data stewards, not the individual researchers

·         Developing discipline-specific tools: the data steward provides the domain-specific knowledge and the data hub service provides the IT knowledge

·         Training for researchers on how to use repositories, and better integration between tools such as XNAT/OMERO and institutional IT resources

 

4.    The presentations were followed by a breakout discussion on: Leveraging activities outside your institution to your advantage.

Key points:

·         At institutional level, the different stakeholders involved in RDM have different training needs, e.g. librarians, data stewards, and managers of domain and institutional repositories

·         Need to define roles across the institution for the community/stakeholders involved in RDM

·         Collaboration within an institution takes a lot of work; you need a political decision that this is desired

·         Important to have a coordination group across the institution

·         Working groups are useful; get them started even if attendance isn’t always good, as it is better to have them in place

 

5.    The second day started with a keynote address from Graham Parton, the Senior Data Scientist at the Centre for Environmental Data Analysis (CEDA), on the need to connect data stewardship roles across the research lifecycle and ecosystem. This is a perspective from a domain (subject) repository data scientist.

Some key points:

·         The Professionalising Data Steward Interest Task Group under the Research Data Alliance (RDA) is seeking to explore and model the varying landscape of RDM

·         The diversity of activities, roles and environments can make it difficult for the RDM community to professionalise the work and services

·         Aiming for an openly available collection of models, to serve in establishing and sustaining data stewardship services at different organisations

·         The UK’s Natural Environment Research Council (NERC) has a data lifecycle that involves the various stakeholders, including tracking and reporting from the funder side

·         The data is managed through the lifecycle of a project, from instrument to archive and end-user. The interactions between end-users, instrument scientists, data scientists (i.e. archive) and IT specialists/software engineers are crucial to the success of the project

·         NERC policy: the 2010 revision stipulates that data be made publicly available within 2 years

·         The designated data centre works with the PI of the project to complete the NERC Data Value Checklist (https://nerc.ukri.org/research/sites/environmental-data-service-eds/policy/data-value-checklist/) and establish a full Data Management Plan (DMP). For compliance, this task needs to be completed within 3-6 months of the project start date

 

6.    Myriam Mertens, Open Science Coordinator at Ghent University Library, presented on setting up a data steward team at Ghent University and the national/regional context in Belgium.

Some key points:

·         Small team of data stewards – promote and facilitate, advancing culture change to open data and re-use

·         Started with an initial 3-phased work plan: 1. Onboarding, 2. Start faculty, 3. Launch more proactive support with advisory & training services

·         Government funding for open data and data stewardship in institutions for the next few years

 

7.    The first breakout discussion on day 2 looked at the role of data stewards in delivering services

A question from a member of the group, “How do you measure success as a data steward? Currently? In the future?”, engendered a lot of discussion, but more questions arose than answers. However, the following points are worth noting:

·         People still need humans and human interactions; webpages are nice but people won’t read them

·         ‘Measuring Impact’: JISC undertook a series of studies on data centres and measured ROI (as a domain repo we also seek user impact stories)

·         Is data re-use/reproducibility a good measure? (Citations? Downloads? How do you measure re-use?)

·         Systems that use registration can give you more insights, and you can then follow up with people via a survey

The discussion concluded with different measures of success for different aspects.

 

8.    The second breakout discussion was on a (national) community of practice for data stewards

·         South Africa has had a CoP (a network for digital curation) for about 10 years; it has been unfunded, its initiators are retiring, and the community might be losing its mojo.

(Yes, data stewards and librarians across the world know what mojo means.)

·         Who fills the gap if networks/services retire?

·         International groups are useful for learning about models in other countries, but national context is important to what can be done, e.g. funding, policies

 

9.    Helen Clare, Senior e-infrastructure strategy manager at Open Research Competencies Coalition, JISC Digital Research Community, presented on the UK: a tale of 2 communities, or one community and a coalition (and definitely not the only communities!)

Key points:

·         Data stewardship in SC3: the Scholarly Communication Competencies Coalition was formed in 2017 to explore training and professional development for scholarly communications (and open research)

·         Open Research Competencies Coalition formed in 2020: rebranded during Covid to online research support, for a broader conversation

·         A space for the JISC Digital Research Community to connect, share and collaborate across disciplines and roles, and across open research