Meta Museum Documentation

Catalog (70)

ID	Document	Updated	Anchors	SHA
agents/ag2-extraction-notes	AG2 Extraction Notes agents/ag2-extraction-notes.md	10/20/2018, 1:46:40 AM	11	e8d0072ebec1
asset-provenance	Asset Provenance asset-provenance.md	10/20/2018, 1:46:40 AM	4	1025c0acc117
closeout-notes	AI-RSI one-click closeout notes closeout-notes.md	10/20/2018, 1:46:40 AM	2	1f560f6a8535
content-credibility-engine	Content Credibility Engine content-credibility-engine.md	10/20/2018, 1:46:40 AM	8	d9aa32358670
demo-script	Demo Video — Shot List & Script (60–90s) demo-script.md	10/20/2018, 1:46:40 AM	2	131ddae42e6e
deployment	Deployment — Vercel + Render deployment.md	10/20/2018, 1:46:40 AM	8	4911b1f459b5
development-roadmap	Meta Museum Development Roadmap development-roadmap.md	10/20/2018, 1:46:40 AM	23	624a8a089d72
development/aidd-tdd	AIDD + TDD Discipline development/aidd-tdd.md	10/20/2018, 1:46:40 AM	5	cd0a0524525a
env	Environment Variables env.md	10/20/2018, 1:46:40 AM	10	9c18634cab1a
evals/golden-museum-questions	Golden Eval Dataset: Complex Museum Questions evals/golden-museum-questions.md	10/20/2018, 1:46:40 AM	6	2876a2b5e78d
linked-art/conformance-matrix	Linked Art 1.0 — Conformance Matrix linked-art/conformance-matrix.md	10/20/2018, 1:46:40 AM	5	53ff87000bf4
linked-art/Linked%20Art%20Notes	Linked Art Notes.md linked-art/Linked Art Notes.md	10/20/2018, 1:46:40 AM	0	aca66d51107b
linked-art/Linked%20Open%20Art%20Data%20Web%20App%20-%20Must-have%20Data%20Sources	Linked Open Art Data Web App (AI) — Must-have Data Sources linked-art/Linked Open Art Data Web App - Must-have Data Sources.md	10/20/2018, 1:46:40 AM	7	7b7d350fe8a0
linked-art/LinkedArtAppFeatures	🏛️ Art Explorer: Linked Art Application & Ecosystem linked-art/LinkedArtAppFeatures.md	10/20/2018, 1:46:40 AM	14	e23b890ecd2a
linked-art/LinkedArtChallenges	LinkedArtChallenges.md linked-art/LinkedArtChallenges.md	10/20/2018, 1:46:40 AM	0	d8c987070277
linked-art/LinkedArtCollaboration	LinkedArtCollaboration.md linked-art/LinkedArtCollaboration.md	10/20/2018, 1:46:40 AM	1	14ccf63edef3
linked-art/LinkedArtDashboard	LinkedArtDashboard.md linked-art/LinkedArtDashboard.md	10/20/2018, 1:46:40 AM	0	6d04d4b2bf79
linked-art/LinkedArtFeatureRoadmap	Feature Roadmap for Linked Open Art Data Apps linked-art/LinkedArtFeatureRoadmap.md	10/20/2018, 1:46:40 AM	8	ac10d8e79c20
linked-art/LinkedArtJobReady	LinkedArtJobReady.md linked-art/LinkedArtJobReady.md	10/20/2018, 1:46:40 AM	0	c60b357bcb87
linked-art/LinkedArtModel1.0-Reference	Linked Art Model 1.0 Reference (Round 1) linked-art/LinkedArtModel1.0-Reference.md	10/20/2018, 1:46:40 AM	344	e6d48d474b3e
linked-art/LinkedArtPatterns	LinkedArtPatterns.md linked-art/LinkedArtPatterns.md	10/20/2018, 1:46:40 AM	0	d45bbbb02d70
linked-art/LinkedArtPRD	🖼️ Product Requirements Document linked-art/LinkedArtPRD.md	10/20/2018, 1:46:40 AM	20	91bc1f37307c
linked-art/LinkedArtRoadmap	LinkedArtRoadmap.md linked-art/LinkedArtRoadmap.md	10/20/2018, 1:46:40 AM	0	e52e71c6bd28
linked-art/LinkedArtSaaS	LinkedArtSaaS.md linked-art/LinkedArtSaaS.md	10/20/2018, 1:46:40 AM	0	3d260738fb29
linked-art/LinkedArtSoftware	Code and Tools linked-art/LinkedArtSoftware.md	10/20/2018, 1:46:40 AM	8	9e8fef24aea9
linked-art/LinkedArtSOTAWebApp	LinkedArt SOTA Web App — Master Build Specification linked-art/LinkedArtSOTAWebApp.md	10/20/2018, 1:46:40 AM	129	a5f0baca89c6
linked-art/LinkedArtUnmetNeeds	LinkedArtUnmetNeeds.md linked-art/LinkedArtUnmetNeeds.md	10/20/2018, 1:46:40 AM	0	cb35fac29cc1
linked-art/LinkedArtUseCases	LinkedArtUseCases.md linked-art/LinkedArtUseCases.md	10/20/2018, 1:46:40 AM	0	5c572ce8e7f3
linked-art/LinkedArtWidgets	LinkedArtWidgets.md linked-art/LinkedArtWidgets.md	10/20/2018, 1:46:40 AM	0	b39911c7d97d
linked-art/LinkedDesign	LinkedDesign.md linked-art/LinkedDesign.md	10/20/2018, 1:46:40 AM	0	0a02240471e5
linked-art/LODEngine	LODEngine.md linked-art/LODEngine.md	10/20/2018, 1:46:40 AM	0	ef73426f80db
linked-art/LODPipeline	LODPipeline.md linked-art/LODPipeline.md	10/20/2018, 1:46:40 AM	0	fe95e61ed9da
linked-art/LODTools	LODTools.md linked-art/LODTools.md	10/20/2018, 1:46:40 AM	0	3167947fc4e4
linked-art/SPARQL	SPARQL.md linked-art/SPARQL.md	10/20/2018, 1:46:40 AM	0	50e00ed51733
linked-art/Vocabularies	Vocabularies.md linked-art/Vocabularies.md	10/20/2018, 1:46:40 AM	0	e0574a338aaa
linked-art/YaleLux	YaleLux.md linked-art/YaleLux.md	10/20/2018, 1:46:40 AM	0	74fd47fae749
meta-wiki-art-bridge	Meta Wiki Art Bridge (MediaWiki + Wikibase) meta-wiki-art-bridge.md	10/20/2018, 1:46:40 AM	7	7a43fb0c48b8
ops/activity-adoption-proof	Activity Feed Adoption Proof Runbook ops/activity-adoption-proof.md	10/20/2018, 1:46:40 AM	5	68a80b43ae58
ops/ag2-worker	AG2 Worker and Bridge Runbook ops/ag2-worker.md	10/20/2018, 1:46:40 AM	9	50efcd4e3318
ops/auth-credential-rotation	Auth credential rotation runbook ops/auth-credential-rotation.md	10/20/2018, 1:46:40 AM	4	449b8b8eecb6
ops/deployment-preflight	Deployment Preflight Runbook ops/deployment-preflight.md	10/20/2018, 1:46:40 AM	5	ac60432d0aed
ops/era-c-exit-gate-evidence	Era C Exit-Gate Evidence Pack ops/era-c-exit-gate-evidence.md	10/20/2018, 1:46:40 AM	6	656b9c7f85c6
ops/go-live-checklist	Go-Live & Evidence-Pipeline Checklist ops/go-live-checklist.md	10/20/2018, 1:46:40 AM	6	ae7f5d71f7dc
ops/k6-slo	k6 SLO Load Test (SOTA §20.4) ops/k6-slo.md	10/20/2018, 1:46:40 AM	4	328b5b3163d4
ops/kpi-evidence	SOTA §26 KPI Evidence Input ops/kpi-evidence.md	10/20/2018, 1:46:40 AM	5	d7b2973d2927
ops/launch-review	Launch Review Packet ops/launch-review.md	10/20/2018, 1:46:40 AM	5	880e41ebcbe3
ops/managed-linked-art-pilot-runbook	Managed Linked Art Pilot Runbook ops/managed-linked-art-pilot-runbook.md	10/20/2018, 1:46:40 AM	11	d4f125c2ddae
ops/otel-local	Local OpenTelemetry Wiring (Tempo / Jaeger) ops/otel-local.md	10/20/2018, 1:46:40 AM	5	1ebbc3b33f92
ops/outbox-projector	Transactional Outbox Projector (Postgres -> Solr/GraphDB) ops/outbox-projector.md	10/20/2018, 1:46:40 AM	5	dc70ad766471
ops/procurement-readiness-packet	Procurement Readiness Packet ops/procurement-readiness-packet.md	10/20/2018, 1:46:40 AM	9	c5685e82cca7
ops/reconciliation-service	Reconciliation Service (C2) ops/reconciliation-service.md	10/20/2018, 1:46:40 AM	6	05162c313ea9
ops/search-graph-provisioning	Solr 9 + GraphDB Provisioning ops/search-graph-provisioning.md	10/20/2018, 1:46:40 AM	6	fc1b15279a84
ops/security-dr-drill	Pen Test Baseline + DR Drill Runbook ops/security-dr-drill.md	10/20/2018, 1:46:40 AM	3	a766ef3e2afc
progress/2026-05-31/era-c-readiness-snapshot	Era C Readiness Snapshot (May 31, 2026) progress/2026-05-31/era-c-readiness-snapshot.md	10/20/2018, 1:46:40 AM	3	9672614ceb53
progress/era-history	Meta Museum — Era Delivery History progress/era-history.md	10/20/2018, 1:46:40 AM	47	cc030755d1e5
providers/harvard-art-museums	Harvard Art Museums API Integration Plan providers/harvard-art-museums.md	10/20/2018, 1:46:40 AM	11	fa8b980154f5
providers/louvre-collections-json	Louvre Collections JSON Integration Plan providers/louvre-collections-json.md	10/20/2018, 1:46:40 AM	11	775f91a8d813
providers/nga-open-data	National Gallery of Art (NGA) Open Data Integration Plan providers/nga-open-data.md	10/20/2018, 1:46:40 AM	11	51c4807c8de0
providers/princeton-art-museum	Princeton University Art Museum API Integration Plan providers/princeton-art-museum.md	10/20/2018, 1:46:40 AM	11	c8823f65ee41
providers/rkd-knowledge-graph	RKD Knowledge Graph Integration Plan providers/rkd-knowledge-graph.md	10/20/2018, 1:46:40 AM	16	2b4b42f2ad42
providers/smithsonian-open-access	Smithsonian Open Access Integration Plan providers/smithsonian-open-access.md	10/20/2018, 1:46:40 AM	12	db1ffa4cab02
providers/vanda-collections-api	Victoria and Albert Museum (V&A) Collections API Integration Plan providers/vanda-collections-api.md	10/20/2018, 1:46:40 AM	11	755d93972233
quality	Quality & Performance quality.md	10/20/2018, 1:46:40 AM	6	174add040960
reconciliation/exhibition-literature-reconciliation	Exhibition + Literature Reconciliation (B6.1) reconciliation/exhibition-literature-reconciliation.md	10/20/2018, 1:46:40 AM	7	293e9d81dd7c
responsible-ai	Responsible AI responsible-ai.md	10/20/2018, 1:46:40 AM	8	f90006650821
risk-register	Risk Register risk-register.md	10/20/2018, 1:46:40 AM	4	becb213d5c5e
roadmap-to-10	Roadmap to 10/10 roadmap-to-10.md	10/20/2018, 1:46:40 AM	15	40a11000dc7e
roadmap	Meta Museum Roadmap roadmap.md	10/20/2018, 1:46:40 AM	18	145d0cbbe54a
rsi-wiki	AI-RSI compounding wiki rsi-wiki.md	10/20/2018, 1:46:40 AM	8	b64914fe6f20
wikibase-cloud-migration-checklist	Wikibase Cloud -> Self-Host Migration Checklist wikibase-cloud-migration-checklist.md	10/20/2018, 1:46:40 AM	12	170657fcbf2b

Current Document: LinkedArtRoadmap.md

Source updated 10/20/2018, 1:46:40 AM · SHA-256 e52e71c6bd28 · 323 lines

Canonical ID: linked-art/LinkedArtRoadmap

JSON for this doc:/api/docs/content?path=linked-art/LinkedArtRoadmap.md

Human link:/docs?doc=linked-art%2FLinkedArtRoadmap.md

Canonical API endpoint:/api/docs/content?path=linked-art%2FLinkedArtRoadmap.md

Sections (stable anchors):

No detectable headings.

The "Middle-Tier" Data Editor (Linked Art Form Builder)

The gap here is the lack of a lightweight entry point. Currently, you either write raw Python code using Crom or deploy a massive enterprise system like Arches. You need a "Goldilocks" solution: a simple web interface that behaves like a standard form but outputs strict Linked Art JSON-LD.

Specific Functionality Details:

• Vocabulary-Driven Input: When a user types "Oil on Canvas," the form should not just save the string; it should query the Getty AAT (Art & Architecture Thesaurus) API in the background to fetch the correct URI (e.g., http://vocab.getty.edu/aat/300015050) and inject it into the classified_as field.

• Pattern-Based Generation: The tool should rely on pre-defined Linked Art patterns (specifically the "Production" and "creation" patterns). The user sees a single field for "Artist," but the backend automatically constructs the complex produced_by → carried_out_by → Person graph structure required by the standard.

• Validation feedback: Instead of generic JSON errors, the editor should provide domain-specific feedback, such as warning the user if an object has a creation date that predates the birth date of the assigned artist.

The "Viewer" Library (Reusable Linked Art UI Components)

Parsing Linked Art is repetitive. Every time you want to display an object's creation date, you have to traverse the same deeply nested path (produced_by → timespan → begin_of_the_begin). A dedicated component library solves this by abstracting the traversal logic into reusable frontend widgets.

Specific Component Details:

• <LinkedArtAttribution />: A component that accepts a HumanMadeObject resource and automatically renders the "Artist Line." It handles the logic to distinguish between carried_out_by (the artist) and technique (the method), ensuring "Oil on Canvas by Van Gogh" renders correctly without manual string formatting.

• <IIIFCanvasViewer />: A wrapper that specifically looks for the digitally_shown_by property in the Linked Art JSON. It should automatically configure a DeepZoom viewer (like OpenSeadragon) using the manifest URL found in that property, seamlessly linking the metadata to the image.

• <ProvenanceStepper />: A visualization component that parses the provenance activity array. It should render a vertical or horizontal timeline of custody transfer, automatically extracting the transferred_title_from (seller) and transferred_title_to (buyer) for each step.

The Visual ETL Mapper (CSV-to-Linked-Art Interface)

Participatory research often involves "messy" data—spreadsheets from community members or legacy CSVs that don't fit a standard schema. Writing custom Python scripts for every new spreadsheet is unsustainable.

Specific Functionality Details:

• Node-Link Mapping Canvas: A UI similar to a flow chart where users drag a column header from their CSV (e.g., "Donor Name") and connect it to a specific node in a visual representation of the Linked Art graph (e.g., connecting it to the transferred_title_from node in an Acquisition event).

• Reconciliation Services: The tool should offer an interface to reconcile string data against authority files. If a CSV contains "V. van Gogh," the tool should present potential matches from ULAN (Union List of Artist Names) and allow the user to select the correct URI before export.

• Template Export: Once a mapping is defined for one spreadsheet (e.g., "19th Century Deeds"), the mapping configuration should be saveable as a JSON template. This allows non-coders to reuse the same mapping logic for future datasets with similar structures.

The Dynamic Graph Explorer (Interactive Debugger)

Static diagrams (like Mermaid) are insufficient for Linked Art because the graphs are too dense; a single object might have hundreds of nodes when fully expanded. You need a tool that allows for "progressive disclosure."

Specific Functionality Details:

• Force-Directed Layouts: Use a library like D3.js or Cytoscape.js to render nodes that float and settle based on their connections. This helps identify "orphan" nodes—data points that exist in the file but aren't properly connected to the main object.

• Drill-Down Expandability: The graph should initially load only the top-level entities (Object, Actor, Place). Clicking an "Event" node should dynamically expand its children (TimeSpan, Sub-Events), preventing information overload.

• Contextual JSON Inspector: When a user clicks a node in the visual graph, a side panel should reveal the raw JSON snippet for that specific entity. This allows developers to instantly verify if a specific property (like id or type) is malformed without searching through a 5,000-line text file.

This outlines the complete technological stack and the correct, phased order of design and development, prioritizing foundation and reusability, which is critical for a modular system.

🛠️ The Technology Stack (Modular Architecture)

The tech stack is broken down by the four core functional areas of your modular system.

Data Processing and Backend (Core Logic)

This layer handles the heavy lifting of converting raw data into valid Linked Art JSON-LD.

• Primary Language: Python (for its data science libraries and the existing Crom library).

• Linked Data Library: Crom (Your existing Python library for creating Linked Art instances).

• ETL Orchestration: Apache Airflow or Prefect (To manage, schedule, and monitor the complex data mapping pipelines defined by the ETL Mapper).

• Data Storage: A Graph Database (e.g., Neo4j, Stardog) or a Document Store (e.g., MongoDB, Elasticsearch) to handle the native JSON-LD/Graph structure of the data, rather than a traditional relational database (SQL).

• Authority Reconciliation: Fast, Cached API Calls to the Getty Vocabularies (AAT, ULAN) and Wikidata for URI lookups during the mapping process.

Frontend and Dashboard (User Interface)

This layer provides the seamless user experience across all four tools.

• Core Framework: React or Vue.js (For building the single-page application (SPA) dashboard shell and the modular components).

• UI Library: Tailwind CSS or Chakra UI (To ensure a unified, modern design language across all modules).

• Graph Visualization: D3.js or Cytoscape.js (Specifically for the Dynamic Graph Explorer to render and manipulate complex node-link diagrams).

• Mapping Interface: ReactFlow or similar library (To enable the drag-and-drop visual connections needed for the ETL Mapping Interface).

API and Communication (The Glue)

This layer allows the decoupled frontend and backend modules to talk to each other securely.

• API Framework: FastAPI (Python) or Node.js/Express (Javascript) (To create lightweight microservices for each tool's backend logic, ensuring low coupling).

• Data Transfer Format: JSON-LD (The native format for all communication between components).

• Containerization: Docker (To package each individual tool—ETL, Editor, API—into deployable, isolated units).

📅 Order of Design and Development (Phased Approach)

The development should follow a three-phase model, prioritizing the foundational data structures and reusable components before tackling the large-scale user interfaces.

Phase 1: Foundation and Core Data Logic (35% Effort)

Goal: Establish the core engine that creates and handles Linked Art data consistently.

Define Core Patterns (Design): Formally document the exact Linked Art patterns (e.g., the Creation pattern, the Acquisition pattern) that the PHR team will support. These patterns become the non-negotiable input/output schema.

Develop Linked Art UI Component Library (Code):

• Prioritize Atomic Components: Build the fundamental parser widgets first (e.g., <LinkedDate />, <LinkedAttribution />).

• Publish as Reusable Package: Ensure this library is published locally (e.g., on a private NPM registry) so all other frontend tools in the stack can import it.

Core Editor Microservice (Code):

• Build the minimal Python/FastAPI microservice that exposes a single endpoint: POST /validate-json. This service uses Crom to check if any incoming JSON is valid Linked Art based on your documented patterns.

Phase 2: User Interfaces and Critical Tools (45% Effort)

Goal: Build the two most critical tools for data creation and ingestion.

Linked Art Form Builder (Design & Code):

• Build the Frontend: Develop the web-based form interface using the React/Vue framework.

• Integrate Validation: Hook up the form fields to the Core Editor Microservice (Phase 1) for real-time validation and URI lookup.

Visual ETL Mapping Interface (Code):

• Develop the drag-and-drop tool for mapping CSV headers to Linked Art nodes.

• This is the heaviest backend component. It must be built as a separate microservice, leveraging Python/Crom/Airflow to handle large, asynchronous transformation jobs.

Dynamic Graph Explorer (Code):

• Develop the standalone visualization tool using D3.js. Its sole initial purpose is to accept a Linked Art JSON-LD file (or URL) and render the collapsible, force-directed graph.

Phase 3: Integration and Deployment (20% Effort)

Goal: Knit the individual tools into the final user-facing system.

PHR Dashboard Shell (Code):

• Build the main SPA Dashboard. This shell contains the navigation, user login, and is responsible for hosting the frontends of the individual tools built in Phase 2.

Data Flow Orchestration (Code):

• Implement the "glue" logic: When the ETL Mapper finishes a job, the dashboard should automatically update its status and present a button that passes the resulting JSON record's ID to the Form Builder.

Final Deployment: Deploy all microservices using Docker containers. This isolation ensures the high-traffic Graph Explorer doesn't bring down the complex ETL Mapper, creating a resilient, scalable system.

This is the complete, high-level project plan structured into sequential phases, designed for the development of modular tools using the Microservices and Micro-Frontends architecture.

🚀 Complete Project Outline: Phased Development

The approach prioritizes building the reusable engine first, then the core tools, and finally assembling the user experience.

Phase 0: Conceptual Foundation and Setup (The Blueprints)

This phase establishes the non-negotiable standards and reusable assets that prevent the modules from becoming inconsistent.

Linked Art Pattern Definition:

• Action: Formally define the exact, strict Linked Art JSON-LD structures (patterns) the PHR system will support for core entities (e.g., 'Creation,' 'Acquisition'). This document is the single source of truth for all validation and generation logic.

• Goal: Enforce data consistency across all four tools.

Shared UI Component Library (React/JS):

• Action: Build a standalone, reusable library of atomic components (e.g., <LinkedDate />, <LinkedAttribution />). These components contain the logic to traverse the Linked Art JSON-LD and render a formatted output string.

• Goal: Guarantee a unified look and feel and drastically reduce development time for the three tool UIs (Phase 2).

Core Infrastructure Deployment:

• Action: Set up the deployment environment, including the API Gateway for routing, Docker for containerization, and the Graph Database (e.g., Neo4j) to host the final, authoritative Linked Art data.

Phase 1: Core Data Engines (The Backends/Microservices)

This phase builds the independent backend services that will power the frontend tools. These must be deployable and testable on their own.

Validation & Production Microservice:

• Action: Create the first Python/FastAPI microservice. It must expose two core endpoints: one to validate incoming JSON-LD against the Phase 0 patterns, and one to produce/save a new, validated Linked Art record to the Graph Database.

• Goal: Provide the core mechanism for the Form Builder.

Authority Reconciliation Microservice:

• Action: Build a second, separate service responsible for communicating with external authority files (Getty, Wikidata). It accepts a string ("Picasso") and returns the authoritative URI and preferred label, which is essential for standardizing data entry.

• Goal: Clean and standardize all entity identifiers before they enter the system.

ETL Transformation Logic:

• Action: Develop the underlying Python code and scripting framework (using Crom) that performs the data mapping—the "engine" for converting CSV data to Linked Art graph structures. This will run as a scheduled job executor (e.g., via Airflow or a dedicated service).

• Goal: Enable bulk data ingestion.

Phase 2: Modular Tool Development (The Micro-Frontends)

This phase builds the user-facing tools as independent web applications that consume the APIs built in Phase 1. Each tool is a complete, self-contained Micro-Frontend.

Linked Art Form Builder (Create/Edit Tool):

• Action: Develop the most complex UI. It uses the Shared UI Component Library for display and relies on the Validation Microservice for real-time form feedback. It must function entirely through its dedicated API endpoints.

Visual ETL Mapping Interface (Ingestion Tool):

• Action: Develop the drag-and-drop web interface. The frontend uses a visual library (like ReactFlow) to let users draw mapping lines, then sends the mapping instructions and the source file to the ETL Transformation Logic service for processing.

Dynamic Graph Explorer (Visualization Tool):

• Action: Develop the interactive graph viewer. It queries the Graph Database directly (or via a dedicated service) and uses D3.js or Cytoscape.js to render the collapsible, force-directed graph. This is the debugging and viewing tool.

Phase 3: Dashboard Integration and Deployment (Assembly)

This phase pieces the independent parts into a single, cohesive system for the end-user.

Dashboard Container Shell:

• Action: Build the extremely lightweight React Single-Page Application (SPA) shell. This application contains only the global navigation, the authentication system, and the routing logic.

• Goal: Provide a consistent user experience and a single point of entry.

Module Orchestration & Hand-off:

• Action: Implement the final routing and communication: use technologies like Module Federation to dynamically load the three completed Micro-Frontends (Form Builder, ETL Mapper, Graph Explorer) into the Dashboard shell at runtime.

• Action: Program the workflow glue: For example, when the ETL job finishes, the Dashboard receives a notification and updates the user's view, allowing them to click an object's status which routes them seamlessly into the Form Builder to review the newly imported record.

Final Deployment:

• Action: Deploy all five components (API Gateway, Form Builder, ETL Mapper, Graph Explorer, Dashboard Shell) as separate Docker containers, ensuring independent versioning and scaling.

This phased approach guarantees that the core data architecture is sound before a single line of integration code is written, ensuring maximum modularity and agility.

Great question, Joseph — your list is already strong, but I see some strategic gaps that, if filled, would make this “Code and Tools” page a more complete ecosystem for your team and project. Right now you’ve got libraries, platforms, validators, and cleaning tools, but you’re missing some categories that are critical for automation, provenance, and interoperability — the things you care most about.

Here’s what I’d suggest adding:

---

🔧 What’s Missing & Useful

Data Transformation & Mapping Tools

• RDF/JSON-LD converters (e.g., PyLD): for expanding/compacting JSON-LD and handling framing.

• Mapping frameworks (e.g., Karma): for mapping tabular/legacy data into Linked Art JSON-LD.

• ETL pipelines (e.g., Apache NiFi, Airflow): to automate ingestion and transformation with provenance tracking.

Ontology & Vocabulary Integration

• SKOS editors (e.g., VocBench): for managing controlled vocabularies like AAT, Wikidata, or custom museum thesauri.

• Ontology alignment tools (e.g., OpenRefine RDF extension): to reconcile local vocabularies with Linked Art or external authorities.

Validation Beyond JSON Schema

• SHACL validators (e.g., TopBraid SHACL API): for semantic validation of Linked Art graphs, not just JSON syntax.

• Linked Art-specific test suites: community-driven validation scripts to check conformance with Linked Art patterns.

Provenance & Workflow Tracking

• Provenance libraries (e.g., PROV Python): to capture and serialize provenance of transformations.

• Workflow visualization (e.g., DVC or Dagster): for tracking data lineage in pipelines.

Narrative & Storytelling Tools

• Story map frameworks (e.g., Knight Lab StoryMapJS): for building multimedia narratives from Linked Art data.

• Graph visualization (e.g., Neo4j Bloom, Cytoscape): to explore connections and generate user-facing stories.

Testing & CI/CD

• Automated test harnesses (e.g., GitHub Actions with JSON-LD validation scripts): to ensure data integrity during migrations.

• Containerization (Docker images for Linked Art pipelines): to make deployments reproducible across institutions.

Community & Collaboration Resources

• Linked Art GitHub repos & examples: curated list of implementations, sample datasets, and mappings.

• Slack/Discourse communities: places where developers share scripts, validators, and workflows.

---

🧩 Why These Matter for Your Team

• They bridge the gap between raw collections data and Linked Art JSON-LD.

• They add provenance guardrails, which you’ve emphasized as essential.

• They scale storytelling by connecting structured data to narrative platforms.

• They future-proof your workflows with validation and CI/CD.

---

Joseph, given your focus on ontology-driven enrichment and provenance-rich storytelling, I’d especially highlight SHACL validators, provenance libraries, and narrative visualization frameworks as the most impactful additions.

Would you like me to sketch a blueprint diagram showing how these missing tools slot into your current categories (Libraries, Platforms, Validators, etc.) so your team sees the ecosystem at a glance?

Linked open data for art has made real progress, but current tools still leave several important gaps. Many of the issues are less about core standards and more about usability, coverage, and equity of representation.[olh.openlibhums +3]

Modeling and expressiveness

Even robust models like Linked Art and CIDOC CRM cannot capture all the subtle, messy realities of art-historical data, especially uncertainty and contested attributions. Edge cases such as ambiguous artist identities, complex authorship, or nuanced curatorial notes often end up in unstructured text instead of rich structured data.[linked +1]

• Limited ways to model uncertainty (e.g., “probably by X, possibly by Y”) in a machine-actionable way.

• Difficulty describing complex events around provenance, restitution, and exhibition histories with sufficient granularity.[olh.openlibhums +1]

Tools for creation and maintenance

Most linked art pipelines still require specialists who understand RDF, JSON‑LD, and graph modeling, so everyday collection managers or scholars often cannot create or maintain data directly. User-friendly interfaces for mapping legacy systems (e.g., MARC, bespoke CMS exports) into Linked Art patterns are still thin and fragile.[chin-rcip.github +2]

• Few “no-code/low-code” tools for mapping, validation, and publishing Linked Art data from existing museum systems.

• Weak support for iterative data cleaning, versioning, and round‑tripping corrections back into source systems.[chin-rcip.github +1]

Discovery, UX, and analysis

Many LOD-driven art applications remain prototypes, offering powerful queries but poor human-facing design for curators, educators, or the public. Cross-collection search often feels like a technical demo rather than an integrated, intuitive research environment.[navigating +2]

• Limited visual exploration tools (network views, timelines, geospatial maps) built natively on Linked Art graphs.

• Few reusable analysis workflows (e.g., for provenance chains, exhibition networks, artist mobility) packaged for non-technical researchers.[sciencedirect +1]

Interoperability and ecosystem

While core vocabularies (Getty AAT, ULAN, TGN) and big hubs like Wikidata exist, aligning diverse institutional data with them remains labor-intensive and inconsistent. Identity resolution (which “Caravaggio” or which “Untitled”) and careful use of equivalence properties such as owl:sameAs are still major pain points.[americanart.si +2]

• Incomplete and uneven linking between institutional graphs, Wikidata, and authority files.

• Lack of shared, widely used reconciliation services specifically tuned to art-domain edge cases and historical naming practices.[euppublishing +1]

Governance, equity, and openness

Linked art infrastructures often replicate existing power imbalances: well-resourced Western museums are overrepresented, while community archives, non-Western collections, and contemporary artist-run spaces are underlinked or absent. Licensing, copyright concerns over images, and institutional risk-aversion also limit how “open” many datasets can actually be.[getty +2]

• Insufficient support and funding models for small institutions to join LOD ecosystems and sustain participation.

• Limited mechanisms to represent contested histories, community-supplied knowledge, or culturally sensitive materials in ways communities control.[euppublishing +1]

If you share which tools or workflows you’re using (e.g., Linked Art in a specific CMS or research platform), a more targeted list of “what’s missing” and practical next steps can be outlined.

AI/agent quick endpoints

/api/docs/manifest — current catalog with IDs, anchors, source timestamps, and checksums.
/api/docs/content?path=roadmap.md — roadmap JSON payload + rendered HTML for downstream tooling.

Meta Museum Docs (Live Markdown Source)

Catalog (70)

Current Document: LinkedArtRoadmap.md

AI/agent quick endpoints