Research Data Management Support

IBB PAS provides support in research data management for staff and doctoral students.

We offer assistance with the following issues:

– selection of datasets to be shared openly from a research project,

– selection of an appropriate database / repository for the data,

– determining the legal status of the data,

– decisions on data licensing,

– preparation of a research data management plan for a grant application, as well as its modification during the project and reporting on the implementation of the plan.

Please feel free to contact me:

dr Marta Hoffman               rdm@ibb.waw.pl        phone 3501 (room 401, building C)

 

Resources

1. Helpsheet meant to support the preparation of a Data Management Plan (DMP) for a research project (document in English):

IBB-DMP-HelpSheet-2025-v2

The document is structured in accordance with the scheme used by the National Science Center (NCN). Since NCN requires the DMP only in English, only an English-language version has been prepared. The document uses examples typical for research projects submitted at IBB PAS – if you have a special situation in your project (non-typical data, patient data, complicated legal situation of the data, etc.), you are welcome to contact me directly.

2. List of suggested data repositories and databases:

Available also as pdf for download: Suggested-databases-2025-v4. The list presented below is partly based on the resource ELIXIR Deposition Databases for Biomolecular Data.

Specialized repositories for selected types of life sciences data – please use these as your first choice:

 

ArrayExpress High-throughput functional genomics data (RNA-seq, ChIP-seq, and other types of gene expression and epigenomics datasets).(Also a good alternative: GEO at NCBI).
ENA Nucleotide sequencing information: raw sequencing data, sequence assembly information and functional annotation.(Alternatively: GenBank, SRA – Sequence Read Archive, TSA – Transcriptome Shotgun Assembly, all at NCBI).
EVA Genetic variation data from all species.
IntAct Molecular interaction data.
LIPID MAPS Information on Lipids and their structures, properties and functions in biological processes.
MetaboLights Metabolite structures and their reference spectra as well as their biological roles, locations and concentrations, and experimental data from metabolic experiments.
BioModels Computational models of biological processes.
ModelArchive Theoretical models of macromolecular structures.
PDBe Experimentally obtained structures of biological macromolecules.
PRIDE Mass spectrometry-based proteomics data.
UniProt Protein sequence and function data.
BioImage Archive All biological image data, including light microscopy, 2D-electron microscopy.
EMPIAR Raw cryo-EM data and other 3D-electron microscopy images.
EMDB Processed cryo-EM data and other 3D- electron microscopy images.
ecbd European Chemical Biology Database, run by EU-OPENSCREEN, collects experimental results from biological screening programs. Deposit option possible for EU-OPENSCREEN partner sites.

Domain repositories (accepting all types of research data from a broad but defined field of research) – recommended option for data that does not fit into any specialized repository:

BioStudies All data from Life Sciences research that do not fit in specialized archives. Part of the ecosystem of data resources hosted and managed by EMBL-EBI, it is integrated with many of the specialized resources listed above as well as with the literature repository Europe PMC (PubMedCentral).
Pangaea All data from Earth and Environmental Sciences research. PANGAEA is hosted by the Helmholtz Center for Polar and Marine Research and the University of Bremen.

Generalist repositories for research data (any type of data, any file formats) – for situations where the data is not suitable for domain-specific repositories:

Zenodo All research-related data. EC-funded repository run at CERN in Geneva for researchers from all countries.
RepOD All research-related data. This is a general repository managed and hosted by ICM University of Warsaw, for all researchers in Poland.

3. Presentation – most important aspects of research data management and a Data Management Plan (IBB, June 20, 2024):

RDM-at-IBB-2024-06-20

 

 

Bio

Marta Hoffman is a biologist, she earned her PhD at IBB PAS in the field of yeast genetics. She did a postdoctoral fellowship in the Theoretical Biophysics group at the Humboldt-University in Berlin, and is currently a member of the IBB Laboratory of Lipid Biochemistry .

She gained experience in research data management and issues related to open sharing of data while working (from 2012 to 2017) as a member of the Open Science Platform ICM UW (https://pon.edu.pl/), where she was running the National Open Access Desk of the OpenAIRE (Open Access Infrastructures for Research in Europe, https://openaire.eu/) project, as well as coordinating the Repository for Open Data RepOD (https://repod.icm.edu.pl/) and leading workshops on Research Data Management. She then used and expanded her knowledge of Open Research Data serving in 2019-2020 as a member of the advisory group FAIR Working Group of the EOSC (European Open Science Cloud, https://eosc.eu/), which prepared recommendations for the implementation of FAIR Data principles within the emerging EOSC service, and in 2020-2022 as a member of the advisory group of the project OCRE (Open Clouds for Research Environments, https://www.ocre-project.eu/).