Data interoperability refers to the functionality of information systems to exchange data and enable information sharing.  More specifically, itis defined as the ability of systems and services that create, exchange, and consume data to have clear, shared expectations for the format, contents, context, and meaning of that data. Thus, it allows to access and process data from multiple sources in diverse formats without losing sense and then to integrate that data for mapping, visualization, and other forms of representation and analysis. Data interoperability enables people to find, explore, and understand the structure and content of heterogeneous data.

In this context, SEDIMARK aims to provide an enriched secure decentralized data and services marketplace where scattered data from various domains and geographical locations within the EU can be easily generated, cleaned, protected, discovered, and enriched with metadata, AI and analytics, and exploited for diverse business and research scenarios. SEDIMARK involves a combination of heterogeneous data, and achieving data interoperability will allow it to maximize the value of the data and overcome the significant challenges posed by distributed assets (heterogeneity, data formats, sources, etc). For this to happen, SEDIMARK will reuse the semantic models developed in previous and ongoing EU initiatives, such as Gaia-X, IDS and NGSI-LD, and propose extensions to them to create one generic semantic model able to annotate and enrich heterogeneous data from multiple domains semantically.

Besides data, interoperability between AI models that emerge from this data is of great interest. In the decentralized environment of SEDIMARK, decentralized training requires that users train their models locally and then exchange model weights for jointly learning a global model. Ensuring that all SEDIMARK users will use the exact same machine learning platform for training the model and the exact same machines is unrealistic. So, SEDIMARK models will be agnostic to underlying platforms and SEDIMARK will provide tools to convert models to various formats and support models to run on machines of various capabilities and on various platforms.

Data interoperability in SEDIMARK 2

This week, SEDIMARK participated in the workshop on ‘Tech Adoption Scenarios and Data AI 2030’ that was organised by the LeADS project (Leading Europe’s Advanced Digital Skills), which is a Coordination and Support Action (CSA) funded by the Digital Europe Programme, and among other objectives, it aims to provide guidance for the deployment of the DIGITAL programme Advanced Digital Skills (ADS) over the next 7 years.

The aim of the workshop was to assess the key predictions developed by LeADS for market adoption within the AI and Data technology areas which included over 80 tech groupings.

From SEDIMARK, University of Surrey (@cvssp_research) contributed to the co-creation exercise facilitated by Martin Robles from BluSpecs, which focused on the definitions of market dynamics in relation to data management and analysis.

The exercise involved assessing:

  1. The applicability of the identified use cases, grouped under AI, BI/Data Science, Cloud, Cybersecurity and IoT, to data management skills relating to data collection, curation, analysis, quality and interoperability.
  2. The hypotheses presented based on several factors, which included legislation, AI automation and Cybersecurity, that could have an impact on these skills.
  3. The relevance and magnitude of the impact that data management skills would have on the different use cases.

Most of the use cases listed were applicable since data management plays a central role in most technologies, which include remote health monitoring, environmental monitoring detection, manufacturing operations and agricultural field monitoring.

In relation to legislation, compliance oversight over different aspects of data management will be expected to increase, and a possible impact of this is that there will be significant reservation among developers to provide decentralised and distributed solutions to address today’s massive energy consumption of cloud-based centralised systems, by handling data closer to the data source, which in many scenarios will require collaborative data sharing between different data providers. And this will inevitably raise alarms regarding data protection, and therefore compliance entities need to cooperate and clarify, rather than police tomorrow’s developers.

With regards to AI automation, the hypothesis presented was that more automation of digital systems will be driven by AI throughout an application’s development lifecycle. This would be the case for well-established uniform processes, but not yet for processes dealing with new or unfamiliar data sources especially when it comes to handling semantic interoperability.

As for Cybersecurity, an increase in experts in this field is expected due to the increase in federation of data flows and models in systems. This will highly likely be the case as there will be a need for auditing mechanisms for checking data integrity and provenance to ensure the correct use of data and AI models.

Finally, when it came to assessing the relevance and magnitude of the impact of data management on today’s main technology, most areas listed were expected to be highly affected by how skills in data management will evolve.

In conclusion, the workshop highlighted possible scenarios on how skills development in the future will be influenced, especially when it comes to balancing innovation in data management and the protection of data.

SEDIMARK knows the importance of regulating data management issues within a context such as the one posed by the project. A solution will be considered where consortium partners will deposit all underlying information on data-related business processes (data storage, data provisioning, processing etc.) of the SEDIMARK solution clearly and transparently.
The purpose of the Data Management Action Plan (DMAP) is to identify the main data management elements that apply to the SEDIMARK project and the consortium. This document is the first version of the DMAP and will be reviewed as soon as there is a clearer understanding of the types of data that will be collected.
Given the wide range of sources from which data will be collected or become available within the project, this document outlines that the consortium partners will consider embracing and applying the Guidelines on FAIR Data Management in Horizon 2020 and Horizon Europe (HE); “In general terms your data should be ‘FAIR’, that is Findable, Accessible, Interoperable and Re-usable”, as information about data to be collected becomes clearer”.

As the name suggest, SEDIMARK will be a Data and Service marketplace. But SEDIMARK focus is not only on data and services assets: Decentralisation also play a key role…

D as Decentralisation

The decentralisation allows to stay away from a single and central authority for control and decision-making, instead it enables the interactions directly among multiple independent parties.

There are several perks in a decentralised system:

  • Reduced Weakness: relying too much on one entity can lead to systemic failures. Multiple entities shield from unfortunate events.
  • Optimization of resources: in a decentralized system, the resources available can be spread among multiple entities to provide better services.
  • Security and Trust: in a decentralized network, security and trust is a must pre-condition.

SEDIMARK Marketplace achieves the security and trust thanks to Distributed Ledger Technologies (DLT).

D as DLT

A DLT is a network composed of several nodes that independently replicate, share, and synchronize the same data spread across many different physical locations without a central administrator.

The most famous example is the Blockchain, today largely employed for financial transactions with bitcoin crypto-currency. However, the SEDIMARK decentralised architecture will be based on a different DLT, that is the IOTA Tangle designed and deployed by the IOTA Foundation. The IOTA Tangle is an open, feeless and highly scalable distributed ledger, designed to support both data and value transfer with a green fashion.

Do you want to know more? Stay tuned for next blog posts by signing up to our newsletter below.

Follow us on LinkedIn Twitter GitHub

* Source image: shutterstock

A digital twin is at the highest level, an architectural construct that is enabled by a combination of technology streams such as IoT (Internet of Things), Cloud Computing, Edge Computing, Fog Computing, Artificial Intelligence, Robotics, Machine Learning, and Big Data Analytics.

The concept of a Digital Twin is based on the fact that every physical part has a virtual counterpart that is conceptual, structural and functional the same as the physical part. The concept of Digital Twins dates in the 1970s used by NASA in the Apollo 13 mission. Nowadays the Digital Twins are used in various industries, being a key concept in realizing the communication mechanism between the physical and the virtual world by using data.

The primary use case for Digital Twin is asset performance, utilization, and optimization. Digital Twin enables monitoring, diagnosing, and prognostics capabilities for a particular use case.

Some applications for Digital Twin are presented below:

Digital Twin for creating 3D modeling of digital objects from physical objects. This use case is a critical success factor for smart manufacturing initiatives.

Digital Twin is used inside factories to identify symptoms with constant monitoring and finding the root causes of production issues.

In healthcare Digital Twin are used for simulation purposes so doctors can do risky operations first in a simulated environment before doing the operation on a real patient.

Town planners use Digital Twin initiatives by using virtual models to improve the city conditions in a proactive manner. This approach can reduce the complexity and simplify the processes for planners. In conclusion using the Digital Twin architecture can help in a lot of industries making the work easier.

SEDIMARK has been present today in the sustainable mobility event with the participation of @unican and Ayuntamiento de Santander, highlighting the importance of fostering the usage of environmental-friendly mobility solutions, like biking, core part of one of our use cases. More information about the event in

First SEDIMARK deliverable submitted! In this deliverable we present the webpage and other dissemination materials, like our first roll-up. Have a look if you want to know more on how this website has been set up.

This document is Deliverable “D6.1 Project website and dissemination material”, which presents the functional of the official website of the SEDIMARK project. More specifically the document describes the portal website and the implemented features available for the public.
The web portal, which can be publicly accessed at the Uniform Resource Locator (URL) address, has been launched in November 2022. It constitutes one of the most relevant tools in the dissemination and communication activities for the SEDIMARK project.
The website has been designed using WordPress. The website layout presents a project overview, including project objectives, consortium partners, a section for news & events relevant in the field of the project and a download section (including dissemination kit, publications, and deliverables).
The current version of the dissemination website follows the project ́s graphic identity. Additionally, the website follows the European Union (EU) recommendation regarding usability and accessibility and has the EU flag and the Horizon Europe logo and grant agreement (GA) disclaimer.

SEDIMARK attended the EBDVF 22 | Data and AI Event in Prague on the 22nd of November, organized by the BDVA - Big Data Value Association. In his pitch within the "Lessons learnt from Data Platforms projects" session, SEDIMARK's technical leader, Luis Sánchez from Universidad de Cantabria, presented the project goals and ambitions in Data Spaces.

On October 26th - 27th SEDIMARK held its Kick-off meeting at fabulous Magdalena Peninsule in Santander. A great way to devirtualize each other, trigger the technical discussions and enjoy some time together!

During the meeting, we held the initial discussions on the project, setting the ground for the future work to meet the ambitious goals foreseen, including the novel use cases that will be carried out throughout the project's lifetime.

Stay tuned to discover more about our thrilling progress!