e-IRG Data Management Task Force: Final report and recommendations endorsed by e-IRG and ESFRI

In 2008, the e-IRG decided to launch a task force to investigate the numerous European activities related to the management of scientific data, and to contribute to the definition of common and shared policies in this field. After months of intensive discussion and work, the task force released its final report, which examines the issue comprehensively and sets out a few recommendations. The final report and the recommendations were endorsed by the e-IRG on 30 November 2009 and by ESFRI on 11 December 2009.

A fundamental paradigm shift known as Data Intensive Science is changing the way science and research are conducted in most disciplines. The emerging paradigm is based on access to, and analysis of, large amounts of new and existing data. These challenges were thoroughly discussed during the 2007-2008 e-IRG workshops, with particular focus on data initiatives linked with the new research infrastructures identified by the European Strategy Forum on Research Infrastructures (ESFRI), created in 2002 by the European Council of Ministers. The e-IRG delegates recognised the importance of data management for the future of research infrastructures and, as a result, established in 2008 the e-IRG Data Management Task Force (DMTF), which also received recognition and support from ESFRI. To conduct its work, the DMTF was organised into three sub-task forces, each focusing on a specific task.

Existing Data Management Initiatives

The first group conducted a survey describing the landscape of the existing projects and initiatives related to data management. This survey analysed the opportunities, synergies and gaps presented by these initiatives and their potential impact. The survey is divided into three main fields of science: arts and humanities – social sciences; health sciences; and natural sciences and engineering. The analysis of 18 social sciences, 12 health sciences and 33 natural sciences and engineering initiatives gives a global view of the existing data initiatives in Europe.

Metadata and quality

The second group covered the basic principles and requirements for metadata descriptions and the quality of the resources to be stored in accessible repositories. The principles and requirements specified are considered to be baselines for all research infrastructures, and as such are independent of the scientific research field. Key findings of this part of the report focus on metadata flexibility: allowing the addition of new elements, the use of different types of selections, the combination of elements from different sets, and the re-use of existing elements and sets. Metadata topics such as usage, scope, provenance, persistence, aggregation, standardisation, interoperability, quality, earliness and availability are discussed in detail. Quality of data resources is also covered in the context of sharing data and quality assurance, assessing the quality of research data, and data consumers.

Interoperability issues in data management

The third group sought to propose guidelines for improving interoperability between various archives and repositories. Interoperability is essential to ensure that scientific data is reachable and useful to other scientific fields, i.e. to enable cross-disciplinary Data Intensive Science. At the moment, interoperability-related activities are mainly contained within individual communities, but with the advent of e-Science, data interoperability needs to be extended across groups of different communities. In addition to providing details of these opportunities and challenges, this part of the document presents several levels and types of interoperability: resource-level interoperability, general semantic interoperability, and syntactic versus semantic interoperability. The different layers of resource interoperability, ranging from device level to communications, middleware and deployment, are analysed in detail. Semantic interoperability is also discussed in terms of data integration, ontology support, simplicity, transcoding and metadata, representation information, conceptual modelling, and distributed systems. Use cases from the medical field, linguistics, the e-humanities ecosystem, earth sciences, astronomy and space science, and particle physics are presented and, for each, solutions, tendencies and needs are identified and put forward.

This report is not intended as the final word in the area of data management; rather, it aims to bring together several starting points to encourage future efforts in this domain. The participating authors sincerely hope that interested parties will take on board the findings of this document and craft them into a group of concrete, well-aligned initiatives that fulfil the promises of Data Intensive Research.

Contact:

secretariat@e-irg.eu

http://www.e-irg.eu/

esfri@ec.europa.eu

http://cordis.europa.eu/esfri/

The e-Infrastructure Reflection Group is an inter-governmental policy body comprising national delegates from more than 30 European countries. Its work is supported by the e-IRG Support Programme 2 (e-IRGSP2), a project financed by the European Commission’s 7th Framework Programme, which includes seven partner institutions: CSC, NCF, ETL, GRNET, AUEB-RC, Genias BV and PSNC.

ESFRI, the European Strategy Forum on Research Infrastructures, is a strategic instrument to develop the scientific integration of Europe and to strengthen its international outreach. The mission of ESFRI is to support a coherent and strategy-led approach to policy-making on research infrastructures in Europe, and to facilitate multilateral initiatives leading to the better use and development of research infrastructures, at EU and international level. ESFRI's delegates are nominated by the Research Ministers of the Member and Associate Countries, and include a representative of the Commission, working together to develop a joint vision and a common strategy.



