In recent years, the landscape of ontology storage has been shaped not only by advances in technology but also by evolving standards and legislative frameworks. As organizations increasingly rely on ontologies to represent complex domains, the need for robust, interoperable, and compliant storage solutions has become paramount. The trajectory of these changes is defined by the interplay of technical innovation and legal responsibilities, both of which are critical for the sustainable development and deployment of ontology-based systems.

The Growth of Ontologies and the Demand for Standards

Ontologies serve as a backbone for knowledge representation, enabling systems to process and interpret data with a level of semantic richness that surpasses traditional databases. As their adoption has expanded across domains such as healthcare, finance, government, and artificial intelligence, the question of how ontologies are stored, accessed, and exchanged has become increasingly significant.

Historically, ontology storage was an internal concern, primarily dictated by technical requirements and the idiosyncrasies of specific projects. However, as collaborative and cross-organizational initiatives proliferated, the demand for common formats and interoperable systems grew. This transition has led to the establishment of foundational standards, notably:

  • Resource Description Framework (RDF)
  • Web Ontology Language (OWL)
  • Simple Knowledge Organization System (SKOS)

These standards, each maintained by the World Wide Web Consortium (W3C), have laid the groundwork for a consistent, machine-readable approach to ontology storage and exchange. Their widespread adoption is a testament to the community’s recognition of the need for interoperability.
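At its core, an RDF-based ontology reduces to a set of subject–predicate–object triples. A minimal in-memory sketch in Python (the `ex:` namespace and class names below are illustrative, not drawn from any published vocabulary; only the `rdf:type` and `owl:Class` IRIs are real W3C terms):

```python
# Minimal in-memory triple store: each statement is a
# (subject, predicate, object) tuple of IRI strings.
EX = "http://example.org/onto#"  # illustrative namespace
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
OWL_CLASS = "http://www.w3.org/2002/07/owl#Class"

triples = {
    (EX + "Patient", RDF_TYPE, OWL_CLASS),
    (EX + "Clinician", RDF_TYPE, OWL_CLASS),
    (EX + "treats", RDF_TYPE,
     "http://www.w3.org/2002/07/owl#ObjectProperty"),
}

def classes(store):
    """Return every subject declared as an owl:Class."""
    return {s for (s, p, o) in store
            if p == RDF_TYPE and o == OWL_CLASS}

print(sorted(classes(triples)))
```

Production triple stores add indexing, persistence, and inference on top of exactly this data model, which is what makes the W3C standards so portable across implementations.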

Ontologies are not merely data structures; they are vessels for meaning, bridging the chasm between human understanding and machine interpretation.

Emergence of Linked Data Principles

The introduction of Linked Data principles has been a pivotal moment in the evolution of ontology storage. By emphasizing the use of dereferenceable Uniform Resource Identifiers (URIs) and explicit, typed relationships between entities, Linked Data has enabled the integration of disparate knowledge sources. Storage systems must now support not only local persistence but also the dynamic retrieval and combination of knowledge from distributed resources.


This shift has influenced both the design of storage architectures and the development of new serialization formats such as JSON-LD and Turtle, which facilitate easier data interchange and human-readability while maintaining semantic precision.
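The shape of a compacted JSON-LD document can be sketched with nothing more than the standard-library `json` module: an `@context` maps short property names to full IRIs, so the same triples round-trip between a human-friendly form and the canonical graph. (The `ex:` namespace and entity below are illustrative; the `foaf:name` IRI is a real FOAF term.)

```python
import json

# Sketch: one entity serialized as a compacted JSON-LD document.
doc = {
    "@context": {
        "name": "http://xmlns.com/foaf/0.1/name",  # real FOAF IRI
        "ex": "http://example.org/onto#",          # illustrative prefix
    },
    "@id": "ex:patient42",
    "@type": "ex:Patient",
    "name": "Alice",
}

serialized = json.dumps(doc, indent=2, sort_keys=True)
print(serialized)
```

Because JSON-LD is plain JSON, it travels through existing web APIs unchanged, which is a large part of why it has become a preferred interchange format alongside Turtle.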

Legislative Pressures Shaping Ontology Storage

As ontologies increasingly encode sensitive or regulated information—think healthcare vocabularies, legal taxonomies, or financial models—the implications of legislation have become impossible to ignore. Laws such as the General Data Protection Regulation (GDPR) in the European Union, the Health Insurance Portability and Accountability Act (HIPAA) in the United States, and similar frameworks worldwide have introduced specific requirements for data storage, access, and provenance.

Regulation is no longer a peripheral concern; it is a central pillar of ontology storage strategy, demanding transparency and accountability at every level.

GDPR and Data Provenance

Under the GDPR, organizations must ensure that any personal data is stored under a lawful basis (most commonly explicit consent), with clear tracking of how, where, and why the information is held. For ontologies that contain personally identifiable information (PII) or reference data subjects, this means maintaining detailed provenance records—a non-trivial challenge for dynamic, interconnected ontological systems.

Furthermore, data subjects have the right to access, correct, and erase their information. Ontology storage platforms must therefore feature mechanisms for auditing, versioning, and (when required) selective redaction or deletion of knowledge elements. These demands have encouraged the development of provenance-aware storage models, such as the W3C PROV standard, and the adoption of immutable, append-only stores that facilitate traceability without compromising integrity.
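The append-only, provenance-aware pattern described above can be sketched as a hash-chained log: each entry records who changed which statement and when, and the running hash makes silent tampering detectable. (The field names are illustrative; W3C PROV defines its own vocabulary for agents and activities.)

```python
import hashlib
import json
from datetime import datetime, timezone

class ProvenanceLog:
    """Append-only log; each entry chains the hash of its predecessor,
    so any retroactive edit invalidates every later hash."""

    def __init__(self):
        self.entries = []

    def append(self, agent, activity, triple):
        prev = self.entries[-1]["hash"] if self.entries else "0" * 64
        body = {
            "agent": agent,        # who made the change
            "activity": activity,  # kind of change (insert / retract)
            "triple": triple,      # the affected statement
            "time": datetime.now(timezone.utc).isoformat(),
            "prev": prev,
        }
        digest = hashlib.sha256(
            json.dumps(body, sort_keys=True).encode()).hexdigest()
        self.entries.append({**body, "hash": digest})

    def verify(self):
        """Recompute every hash; False if any entry was altered."""
        prev = "0" * 64
        for e in self.entries:
            body = {k: v for k, v in e.items() if k != "hash"}
            expected = hashlib.sha256(
                json.dumps(body, sort_keys=True).encode()).hexdigest()
            if body["prev"] != prev or e["hash"] != expected:
                return False
            prev = e["hash"]
        return True

log = ProvenanceLog()
log.append("did:example:curator", "insert",
           ["ex:patient42", "ex:treatedBy", "ex:clinician7"])
log.append("did:example:curator", "retract",
           ["ex:patient42", "ex:treatedBy", "ex:clinician7"])
print(log.verify())
```

Note that erasure requests under GDPR sit in tension with such immutable logs; in practice systems often log only hashes or pseudonyms of personal data so the log itself holds no PII.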

Internationalization and Cross-Border Considerations

Ontologies are intrinsically global, yet legislation is often local. As a result, storage solutions must accommodate varying jurisdictional requirements. For example, data localization laws in some countries mandate that certain types of information be stored within national borders. This constraint affects not only the physical location of ontology stores but also the architectural design of distributed and federated systems.

Standardization efforts, such as the ISO/IEC 21838 series on top-level ontologies, are beginning to address these concerns by recommending practices that facilitate compliance across multiple legal frameworks. However, achieving seamless interoperability remains a work in progress.

Technical Standards and Best Practices

Beyond legal compliance, the ontology community has developed a robust ecosystem of technical standards to ensure that storage systems are performant, reliable, and scalable. Among the most influential are:

  • SPARQL Protocol and RDF Query Language: Standardizes querying of ontology stores, enabling federated queries across distributed systems.
  • SHACL and ShEx: Define shapes and constraints for RDF graphs, ensuring data integrity and validating stored ontologies against predefined schemas.
  • Web Annotation Data Model (formerly Open Annotation): Facilitates the linking of ontological concepts to external resources, supporting rich annotation and context preservation.

These standards have encouraged the proliferation of specialized triple stores—databases optimized for RDF data—and hybrid systems that bridge ontological and relational paradigms. The maturation of these technologies has made it possible to store and query vast, complex ontologies at scale, a capability increasingly demanded by AI and data science applications.
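SPARQL query evaluation ultimately reduces to matching basic graph patterns (triple patterns containing variables) against the stored triples. A toy matcher in pure Python conveys the core idea; real stores add indexing, OPTIONAL, FILTER, and federation on top of it. (All prefixed names below are illustrative shorthand, not resolved IRIs.)

```python
# Toy basic-graph-pattern matcher: variables start with "?",
# and a solution is a variable-to-value binding, as in SPARQL.
triples = {
    ("ex:patient42", "rdf:type", "ex:Patient"),
    ("ex:patient42", "ex:treatedBy", "ex:clinician7"),
    ("ex:clinician7", "rdf:type", "ex:Clinician"),
}

def match_pattern(pattern, triple, binding):
    """Try to unify one triple pattern with one concrete triple."""
    binding = dict(binding)
    for p, t in zip(pattern, triple):
        if p.startswith("?"):
            if binding.get(p, t) != t:
                return None  # variable already bound to something else
            binding[p] = t
        elif p != t:
            return None      # constant term does not match
    return binding

def query(patterns, store):
    """All bindings satisfying every pattern (conjunctive query)."""
    solutions = [{}]
    for pattern in patterns:
        solutions = [b2 for b in solutions for t in store
                     if (b2 := match_pattern(pattern, t, b)) is not None]
    return solutions

# "Which clinicians treat patient42?"
print(query([("ex:patient42", "ex:treatedBy", "?who"),
             ("?who", "rdf:type", "ex:Clinician")], triples))
```

Federated SPARQL extends this same matching across multiple endpoints, merging bindings from remote stores instead of one local set.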

Technical standards are the scaffolding upon which the edifice of modern ontology storage is constructed.

Security and Access Control

With the growing importance of ontologies in mission-critical applications, storage systems must provide fine-grained access control. Specifications such as Web Access Control (WAC), developed within the Solid project, define access policies at the level of individual resources, allowing organizations to manage permissions with considerable precision.
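In the spirit of WAC, an access-control layer maps (agent, resource) pairs to permitted modes before any data is served. A minimal sketch under that assumption (WAC itself expresses policies as RDF with modes like `acl:Read` and `acl:Write`; here a plain dict stands in):

```python
# Illustrative ACL: resource -> agent -> set of permitted modes,
# loosely mirroring WAC's acl:Read / acl:Write modes.
acl = {
    "ex:patientGraph": {
        "ex:clinician7": {"read", "write"},
        "ex:auditor1": {"read"},
    },
}

def allowed(agent, resource, mode):
    """Check a request against the policy; deny by default."""
    return mode in acl.get(resource, {}).get(agent, set())

print(allowed("ex:auditor1", "ex:patientGraph", "read"))   # True
print(allowed("ex:auditor1", "ex:patientGraph", "write"))  # False
```

The deny-by-default stance shown here (unknown agents and resources get an empty mode set) is the conservative choice most compliance frameworks expect.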

Encryption, both at rest and in transit, is now a baseline expectation. Compliance with frameworks like NIST SP 800-53 or ISO/IEC 27001 is often required for organizations handling sensitive or classified information. These requirements have driven the integration of robust cryptographic practices into ontology storage products, ensuring that data remains protected throughout its lifecycle.

Interoperability and the Role of Open Data

Interoperability is at the heart of ontology-driven systems. The FAIR principles—Findability, Accessibility, Interoperability, and Reusability—have gained widespread acceptance as a guiding philosophy for data and ontology management. Storage solutions are now evaluated not only on their technical merits but also on their ability to align with FAIR criteria.

Open standards have catalyzed the creation of public ontology repositories such as BioPortal, Linked Open Vocabularies (LOV), and Wikidata. These platforms exemplify the ethos of open science, providing free access to ontological knowledge while enforcing rigorous standards for provenance, licensing, and attribution.

However, the tension between openness and privacy remains. Legislative frameworks often impose restrictions on what can be shared or published, and storage systems must reconcile these constraints with the demands of open data communities. The emergence of data use ontologies—explicit representations of data sharing agreements and licensing terms—has provided a partial solution, enabling fine-grained control over knowledge dissemination.

Versioning and Long-Term Preservation

Ontologies are living artifacts, subject to continual refinement and evolution. Standards for versioning and archival have become essential for ensuring the long-term reliability and utility of stored knowledge. Practices such as semantic versioning, persistent identifiers, and change tracking are now integral to ontology storage workflows.
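Because RDF graphs are sets of triples, change tracking between versions can be expressed as set differences, paired with a semantic-version label. A sketch of that workflow (the compatibility rule shown is a common convention, not a formal standard):

```python
def diff(old, new):
    """Changes between two ontology versions, as triple sets."""
    return {"added": new - old, "removed": old - new}

v1 = {("ex:Patient", "rdf:type", "owl:Class")}
v2 = v1 | {("ex:Clinician", "rdf:type", "owl:Class")}

change = diff(v1, v2)
# A purely additive change is backwards-compatible, so a minor
# version bump suffices under semantic-versioning conventions;
# any removal risks breaking consumers and warrants a major bump.
bump = "minor" if change["added"] and not change["removed"] else "major"
print(bump, change["added"])
```

Persisting these diffs alongside persistent identifiers gives auditors exactly the "what changed, and when" trail that retention legislation asks for.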

Legislation concerning data retention, such as the EU’s eIDAS Regulation or sector-specific mandates, further underscores the need for durable, auditable storage practices. Institutions must be able to demonstrate not only what knowledge they hold, but also how it has changed over time and who has had access to it.

Looking Forward: Challenges and Opportunities

As the standards and legislation governing ontology storage continue to evolve, several challenges loom on the horizon. The increasing complexity of legal frameworks, coupled with the rapid pace of technological innovation, demands a flexible and forward-thinking approach. Emerging domains such as explainable AI, regulatory technology (RegTech), and digital sovereignty are likely to reshape the requirements for ontology storage in ways that are not yet fully understood.

At the same time, opportunities abound. The convergence of semantic technologies with decentralized architectures—such as blockchain and distributed ledgers—offers new models for provenance, trust, and accountability. The ongoing refinement of interoperability standards promises to unlock new levels of collaboration and knowledge sharing, further cementing the centrality of ontologies in the digital ecosystem.

To store an ontology is to preserve a fragment of collective understanding—an act of stewardship in the ever-expanding landscape of human knowledge.

Ultimately, the standards and legislation shaping ontology storage reflect a collective commitment to rigor, transparency, and responsibility. As these frameworks mature, they will continue to guide the development of technologies that honor the complexity and significance of the knowledge they are designed to steward.
