Catalog (70)

ID	Document	Updated	Anchors	SHA
agents/ag2-extraction-notes	AG2 Extraction Notes agents/ag2-extraction-notes.md	10/20/2018, 1:46:40 AM	11	e8d0072ebec1
asset-provenance	Asset Provenance asset-provenance.md	10/20/2018, 1:46:40 AM	4	1025c0acc117
closeout-notes	AI-RSI one-click closeout notes closeout-notes.md	10/20/2018, 1:46:40 AM	2	1f560f6a8535
content-credibility-engine	Content Credibility Engine content-credibility-engine.md	10/20/2018, 1:46:40 AM	8	d9aa32358670
demo-script	Demo Video — Shot List & Script (60–90s) demo-script.md	10/20/2018, 1:46:40 AM	2	131ddae42e6e
deployment	Deployment — Vercel + Render deployment.md	10/20/2018, 1:46:40 AM	8	4911b1f459b5
development-roadmap	Meta Museum Development Roadmap development-roadmap.md	10/20/2018, 1:46:40 AM	23	624a8a089d72
development/aidd-tdd	AIDD + TDD Discipline development/aidd-tdd.md	10/20/2018, 1:46:40 AM	5	cd0a0524525a
env	Environment Variables env.md	10/20/2018, 1:46:40 AM	10	9c18634cab1a
evals/golden-museum-questions	Golden Eval Dataset: Complex Museum Questions evals/golden-museum-questions.md	10/20/2018, 1:46:40 AM	6	2876a2b5e78d
linked-art/conformance-matrix	Linked Art 1.0 — Conformance Matrix linked-art/conformance-matrix.md	10/20/2018, 1:46:40 AM	5	53ff87000bf4
linked-art/Linked%20Art%20Notes	Linked Art Notes.md linked-art/Linked Art Notes.md	10/20/2018, 1:46:40 AM	0	aca66d51107b
linked-art/Linked%20Open%20Art%20Data%20Web%20App%20-%20Must-have%20Data%20Sources	Linked Open Art Data Web App (AI) — Must-have Data Sources linked-art/Linked Open Art Data Web App - Must-have Data Sources.md	10/20/2018, 1:46:40 AM	7	7b7d350fe8a0
linked-art/LinkedArtAppFeatures	🏛️ Art Explorer: Linked Art Application & Ecosystem linked-art/LinkedArtAppFeatures.md	10/20/2018, 1:46:40 AM	14	e23b890ecd2a
linked-art/LinkedArtChallenges	LinkedArtChallenges.md linked-art/LinkedArtChallenges.md	10/20/2018, 1:46:40 AM	0	d8c987070277
linked-art/LinkedArtCollaboration	LinkedArtCollaboration.md linked-art/LinkedArtCollaboration.md	10/20/2018, 1:46:40 AM	1	14ccf63edef3
linked-art/LinkedArtDashboard	LinkedArtDashboard.md linked-art/LinkedArtDashboard.md	10/20/2018, 1:46:40 AM	0	6d04d4b2bf79
linked-art/LinkedArtFeatureRoadmap	Feature Roadmap for Linked Open Art Data Apps linked-art/LinkedArtFeatureRoadmap.md	10/20/2018, 1:46:40 AM	8	ac10d8e79c20
linked-art/LinkedArtJobReady	LinkedArtJobReady.md linked-art/LinkedArtJobReady.md	10/20/2018, 1:46:40 AM	0	c60b357bcb87
linked-art/LinkedArtModel1.0-Reference	Linked Art Model 1.0 Reference (Round 1) linked-art/LinkedArtModel1.0-Reference.md	10/20/2018, 1:46:40 AM	344	e6d48d474b3e
linked-art/LinkedArtPatterns	LinkedArtPatterns.md linked-art/LinkedArtPatterns.md	10/20/2018, 1:46:40 AM	0	d45bbbb02d70
linked-art/LinkedArtPRD	🖼️ Product Requirements Document linked-art/LinkedArtPRD.md	10/20/2018, 1:46:40 AM	20	91bc1f37307c
linked-art/LinkedArtRoadmap	LinkedArtRoadmap.md linked-art/LinkedArtRoadmap.md	10/20/2018, 1:46:40 AM	0	e52e71c6bd28
linked-art/LinkedArtSaaS	LinkedArtSaaS.md linked-art/LinkedArtSaaS.md	10/20/2018, 1:46:40 AM	0	3d260738fb29
linked-art/LinkedArtSoftware	Code and Tools linked-art/LinkedArtSoftware.md	10/20/2018, 1:46:40 AM	8	9e8fef24aea9
linked-art/LinkedArtSOTAWebApp	LinkedArt SOTA Web App — Master Build Specification linked-art/LinkedArtSOTAWebApp.md	10/20/2018, 1:46:40 AM	129	a5f0baca89c6
linked-art/LinkedArtUnmetNeeds	LinkedArtUnmetNeeds.md linked-art/LinkedArtUnmetNeeds.md	10/20/2018, 1:46:40 AM	0	cb35fac29cc1
linked-art/LinkedArtUseCases	LinkedArtUseCases.md linked-art/LinkedArtUseCases.md	10/20/2018, 1:46:40 AM	0	5c572ce8e7f3
linked-art/LinkedArtWidgets	LinkedArtWidgets.md linked-art/LinkedArtWidgets.md	10/20/2018, 1:46:40 AM	0	b39911c7d97d
linked-art/LinkedDesign	LinkedDesign.md linked-art/LinkedDesign.md	10/20/2018, 1:46:40 AM	0	0a02240471e5
linked-art/LODEngine	LODEngine.md linked-art/LODEngine.md	10/20/2018, 1:46:40 AM	0	ef73426f80db
linked-art/LODPipeline	LODPipeline.md linked-art/LODPipeline.md	10/20/2018, 1:46:40 AM	0	fe95e61ed9da
linked-art/LODTools	LODTools.md linked-art/LODTools.md	10/20/2018, 1:46:40 AM	0	3167947fc4e4
linked-art/SPARQL	SPARQL.md linked-art/SPARQL.md	10/20/2018, 1:46:40 AM	0	50e00ed51733
linked-art/Vocabularies	Vocabularies.md linked-art/Vocabularies.md	10/20/2018, 1:46:40 AM	0	e0574a338aaa
linked-art/YaleLux	YaleLux.md linked-art/YaleLux.md	10/20/2018, 1:46:40 AM	0	74fd47fae749
meta-wiki-art-bridge	Meta Wiki Art Bridge (MediaWiki + Wikibase) meta-wiki-art-bridge.md	10/20/2018, 1:46:40 AM	7	7a43fb0c48b8
ops/activity-adoption-proof	Activity Feed Adoption Proof Runbook ops/activity-adoption-proof.md	10/20/2018, 1:46:40 AM	5	68a80b43ae58
ops/ag2-worker	AG2 Worker and Bridge Runbook ops/ag2-worker.md	10/20/2018, 1:46:40 AM	9	50efcd4e3318
ops/auth-credential-rotation	Auth credential rotation runbook ops/auth-credential-rotation.md	10/20/2018, 1:46:40 AM	4	449b8b8eecb6
ops/deployment-preflight	Deployment Preflight Runbook ops/deployment-preflight.md	10/20/2018, 1:46:40 AM	5	ac60432d0aed
ops/era-c-exit-gate-evidence	Era C Exit-Gate Evidence Pack ops/era-c-exit-gate-evidence.md	10/20/2018, 1:46:40 AM	6	656b9c7f85c6
ops/go-live-checklist	Go-Live & Evidence-Pipeline Checklist ops/go-live-checklist.md	10/20/2018, 1:46:40 AM	6	ae7f5d71f7dc
ops/k6-slo	k6 SLO Load Test (SOTA §20.4) ops/k6-slo.md	10/20/2018, 1:46:40 AM	4	328b5b3163d4
ops/kpi-evidence	SOTA §26 KPI Evidence Input ops/kpi-evidence.md	10/20/2018, 1:46:40 AM	5	d7b2973d2927
ops/launch-review	Launch Review Packet ops/launch-review.md	10/20/2018, 1:46:40 AM	5	880e41ebcbe3
ops/managed-linked-art-pilot-runbook	Managed Linked Art Pilot Runbook ops/managed-linked-art-pilot-runbook.md	10/20/2018, 1:46:40 AM	11	d4f125c2ddae
ops/otel-local	Local OpenTelemetry Wiring (Tempo / Jaeger) ops/otel-local.md	10/20/2018, 1:46:40 AM	5	1ebbc3b33f92
ops/outbox-projector	Transactional Outbox Projector (Postgres -> Solr/GraphDB) ops/outbox-projector.md	10/20/2018, 1:46:40 AM	5	dc70ad766471
ops/procurement-readiness-packet	Procurement Readiness Packet ops/procurement-readiness-packet.md	10/20/2018, 1:46:40 AM	9	c5685e82cca7
ops/reconciliation-service	Reconciliation Service (C2) ops/reconciliation-service.md	10/20/2018, 1:46:40 AM	6	05162c313ea9
ops/search-graph-provisioning	Solr 9 + GraphDB Provisioning ops/search-graph-provisioning.md	10/20/2018, 1:46:40 AM	6	fc1b15279a84
ops/security-dr-drill	Pen Test Baseline + DR Drill Runbook ops/security-dr-drill.md	10/20/2018, 1:46:40 AM	3	a766ef3e2afc
progress/2026-05-31/era-c-readiness-snapshot	Era C Readiness Snapshot (May 31, 2026) progress/2026-05-31/era-c-readiness-snapshot.md	10/20/2018, 1:46:40 AM	3	9672614ceb53
progress/era-history	Meta Museum — Era Delivery History progress/era-history.md	10/20/2018, 1:46:40 AM	47	cc030755d1e5
providers/harvard-art-museums	Harvard Art Museums API Integration Plan providers/harvard-art-museums.md	10/20/2018, 1:46:40 AM	11	fa8b980154f5
providers/louvre-collections-json	Louvre Collections JSON Integration Plan providers/louvre-collections-json.md	10/20/2018, 1:46:40 AM	11	775f91a8d813
providers/nga-open-data	National Gallery of Art (NGA) Open Data Integration Plan providers/nga-open-data.md	10/20/2018, 1:46:40 AM	11	51c4807c8de0
providers/princeton-art-museum	Princeton University Art Museum API Integration Plan providers/princeton-art-museum.md	10/20/2018, 1:46:40 AM	11	c8823f65ee41
providers/rkd-knowledge-graph	RKD Knowledge Graph Integration Plan providers/rkd-knowledge-graph.md	10/20/2018, 1:46:40 AM	16	2b4b42f2ad42
providers/smithsonian-open-access	Smithsonian Open Access Integration Plan providers/smithsonian-open-access.md	10/20/2018, 1:46:40 AM	12	db1ffa4cab02
providers/vanda-collections-api	Victoria and Albert Museum (V&A) Collections API Integration Plan providers/vanda-collections-api.md	10/20/2018, 1:46:40 AM	11	755d93972233
quality	Quality & Performance quality.md	10/20/2018, 1:46:40 AM	6	174add040960
reconciliation/exhibition-literature-reconciliation	Exhibition + Literature Reconciliation (B6.1) reconciliation/exhibition-literature-reconciliation.md	10/20/2018, 1:46:40 AM	7	293e9d81dd7c
responsible-ai	Responsible AI responsible-ai.md	10/20/2018, 1:46:40 AM	8	f90006650821
risk-register	Risk Register risk-register.md	10/20/2018, 1:46:40 AM	4	becb213d5c5e
roadmap-to-10	Roadmap to 10/10 roadmap-to-10.md	10/20/2018, 1:46:40 AM	15	40a11000dc7e
roadmap	Meta Museum Roadmap roadmap.md	10/20/2018, 1:46:40 AM	18	145d0cbbe54a
rsi-wiki	AI-RSI compounding wiki rsi-wiki.md	10/20/2018, 1:46:40 AM	8	b64914fe6f20
wikibase-cloud-migration-checklist	Wikibase Cloud -> Self-Host Migration Checklist wikibase-cloud-migration-checklist.md	10/20/2018, 1:46:40 AM	12	170657fcbf2b

Current Document: 🖼️ Product Requirements Document

Source updated 10/20/2018, 1:46:40 AM · SHA-256 91bc1f37307c · 566 lines

Canonical ID: linked-art/LinkedArtPRD

JSON for this doc:/api/docs/content?path=linked-art/LinkedArtPRD.md

Human link:/docs?doc=linked-art%2FLinkedArtPRD.md

Canonical API endpoint:/api/docs/content?path=linked-art%2FLinkedArtPRD.md

Sections (stable anchors):

state-of-the-art-linked-art-web-application **State‑of‑the‑Art Linked Art Web Application** H2
1-product-vision-objectives 🎯 **1. Product Vision & Objectives** H2
2-target-audience-personas 👥 **2. Target Audience & Personas** H2
data-creators-catalogers 🏛️ **Data Creators & Catalogers** H3
developers-data-consumers 💻 **Developers & Data Consumers** H3
researchers-public-users 🔍 **Researchers & Public Users** H3
3-key-features-functionality 🚀 **3. Key Features & Functionality** H2
a-backend-data-management-ingestion A. 🏗️ Backend Data Management & Ingestion H3
b-discovery-search-architecture B. 🔎 Discovery & Search Architecture H3
c-advanced-ui-visualizations C. 🎨 Advanced UI & Visualizations H3
4-data-architecture-modeling 🧩 **4. Data Architecture & Modeling** H2
1-inject-modern-sota-ai-data-capabilities 1. 🤖 Inject Modern "SOTA" AI & Data Capabilities H3
2-strengthen-cultural-heritage-domain-specifics 2. 🏛️ Strengthen Cultural Heritage Domain Specifics H3
3-add-missing-product-management-sections 3. 📊 Add Missing "Product Management" Sections H3
how-the-improved-sections-look-in-the-prd 📝 How the Improved Sections Look in the PRD: H3
5-syndication-interoperability **🆕 5. Syndication & Interoperability** H4
6-modern-ai-machine-learning-enhancements **🆕 6. Modern AI & Machine Learning Enhancements** H4
7-success-metrics-kpis **🆕 7. Success Metrics & KPIs** H4
8-out-of-scope-for-mvp **🆕 8. Out of Scope (For MVP)** H4
where-to-go-next Where to go next? H3

🖼️ Product Requirements Document

State‑of‑the‑Art Linked Art Web Application

---

🎯 1. Product Vision & Objectives

The goal is to build a state‑of‑the‑art cross‑collection discovery platform powered by the Linked Art data model (a profile of CIDOC‑CRM) to unify cultural heritage data across museums, libraries, and archives.

The platform emphasizes:

Maximizing usable open data
Balancing ontological rigor with developer accessibility
Supporting ~90% of common cultural heritage use cases without forcing unnecessary semantic complexity

---

👥 2. Target Audience & Personas

🏛️ Data Creators & Catalogers

Museum and library staff who need intuitive backend interfaces to create semantic relationships without touching raw JSON‑LD.

💻 Developers & Data Consumers

Technologists who require predictable, well‑structured JSON‑LD REST APIs for downstream apps, UI layers, and integrations.

🔍 Researchers & Public Users

End users who want visual, exploratory discovery tools that hide graph complexity while surfacing rich contextual connections across institutions.

---

🚀 3. Key Features & Functionality

A. 🏗️ Backend Data Management & Ingestion

Use an Arches‑based framework (e.g., Ogee) to provide user‑friendly forms that automatically generate Linked Art‑compliant JSON‑LD.

Semantic Abstraction UI(ca://s?q=Explain_Semantic_Abstraction_UI)

Pipelines reconcile legacy strings against Getty AAT/ULAN/TGN, LoC, and Wikidata.

Automated Authority Reconciliation(ca://s?q=Describe_Authority_Reconciliation)

ETL pipelines convert flat CMS exports (XML/JSON/CSV) into structured Linked Art nodes.

Legacy Data Transformation(ca://s?q=Legacy_Data_Transformation_ETL)

---

B. 🔎 Discovery & Search Architecture

Support both:

Hybrid Datastore(ca://s?q=Hybrid_Datastore_Architecture)
JSON‑LD document storage (fast faceted search)
Triple stores (deep graph querying)

Enable multi‑entity, cross‑collection questions such as:

“Find works by European artists depicting the American West in the 19th century.”

Advanced Graph Querying(ca://s?q=Advanced_Graph_Querying)

---

C. 🎨 Advanced UI & Visualizations

First‑class pages for people, places, and concepts enriched with Wikidata/dbPedia context.

Entity Knowledge Panels(ca://s?q=Entity_Knowledge_Panels)

Dynamic browsing of large datasets (e.g., artists by birth year, exhibition histories).

Interactive Concertina Lists & Histograms(ca://s?q=Concertina_Lists_and_Histograms)

Zoomable timelines + map layers linked to gazetteers like PeriodO.

Overlapping Timelines & Interactive Maps(ca://s?q=Linked_Art_Timeline_and_Map_Visualizations)

Deep zoom, high‑resolution viewing, and side‑by‑side comparisons.

IIIF Image Integration(ca://s?q=IIIF_Integration_Overview)

---

🧩 4. Data Architecture & Modeling

Shift from object‑centric to activity‑centric modeling: production, provenance, exhibitions, etc.

Event‑Centric Structure(ca://s?q=Event_Centric_Modeling_Linked_Art)

Distinguish:

Prevents redundant data entry and improves reuse.

Entity Separation(ca://s?q=Entity_Separation_in_Linked_Art)
Physical object
Visual item
Digital object

Enforce hierarchical concept vocabularies for seamless navigation and faceted filtering.

Standardized Vocabulary Integration(ca://s?q=Standardized_Vocabulary_Integration)

---

If you'd like, I can also generate:

a diagrammatic architecture map,
a feature‑to‑persona matrix,
or a roadmap with milestones.

This is a very strong baseline PRD that accurately captures the core philosophy of Linked Art (CIDOC-CRM profile), its dual audience, and the necessary tech stack.

However, to make this truly State-of-the-Art (SOTA) for modern software development and current cultural heritage standards, we need to bridge the gap between traditional semantic web technologies and modern AI/data paradigms, while also filling out standard PRD structural gaps (KPIs, Risks, Out-of-Scope).

Here is how we can improve and elevate this PRD:

1. 🤖 Inject Modern "SOTA" AI & Data Capabilities

The current PRD describes a standard 2020-era Linked Art implementation. To make it cutting-edge, integrate these features:

AI-Assisted Legacy ETL & Mapping: Mapping legacy CMS data (TMS, EMu, Adlib) to CIDOC-CRM is notoriously difficult. Add an LLM-assisted mapping pipeline that analyzes legacy data schemas and suggests Linked Art JSON-LD mappings.

Natural Language to Graph (NL2Graph): Instead of expecting researchers to write SPARQL or complex API queries, integrate an LLM layer that translates natural language questions ("Find works by female European artists exhibiting in Paris before 1900") directly into graph queries.

Graph RAG (Retrieval-Augmented Generation): Combine the structured knowledge graph with vector databases to allow users to "chat with the collection," backed by verifiable CIDOC-CRM citations to prevent hallucinations.

2. 🏛️ Strengthen Cultural Heritage Domain Specifics

Activity Streams for Data Syndication: Linked Art relies heavily on the ActivityStreams specification to sync data between institutions. Your PRD needs a section on how this platform publishes and consumes data updates.

Deep Provenance Modeling: Emphasize the modeling of ownership history. Provenance is a massive use case for Linked Art; the UI must support visualizing chronological chains of ownership, identifying gaps, and flagging red-flag eras (e.g., WWII-era spoliation).

Rights & Licensing Statements: Explicitly include support for RightsStatements.org and Creative Commons in the data model. Discovery is useless if developers don't know if they can legally use the image.

3. 📊 Add Missing "Product Management" Sections

A professional PRD needs to define what success looks like and what could go wrong.

Success Metrics (KPIs):

Data Quality: % of records successfully reconciled to Getty AAT/ULAN.

Performance: API response time < 200ms for hybrid graph queries.

Adoption: Number of external API calls, Monthly Active Users (MAU) on the discovery portal.

Non-Functional Requirements (NFRs): Accessibility (WCAG 2.1 AA compliant—crucial for museums), internationalization (i18n), and mobile-responsive UI.

Out of Scope: Explicitly state what you are not building (e.g., "This is NOT a replacement for the museum's core Collections Management System (CMS). It is a downstream publication layer.")

Risks & Mitigations: Mention the steep learning curve of CIDOC-CRM and mitigate it via the "Semantic Abstraction UI" you outlined.

---

📝 How the Improved Sections Look in the PRD:

Add the following sections to your document to instantly elevate it:

🆕 5. Syndication & Interoperability

ActivityStreams Integration: Implement a W3C ActivityStreams endpoint to broadcast creations, updates, and deletions. This allows external aggregators to "subscribe" to the museum's data changes in real-time.

Jupyter Notebook & Data Dumps: Provide automated, regular JSON-LD data dumps and maintain a repository of Jupyter Notebooks demonstrating how researchers can query the dataset natively in Python.

🆕 6. Modern AI & Machine Learning Enhancements

LLM-Assisted Reconciliation: Utilize fine-tuned models to assist catalogers in reconciling ambiguous entity names against Wikidata and Getty vocabularies, reducing manual data entry.

Semantic Vector Search: Complement the exact-match graph querying with vector search (embeddings of artwork descriptions and visual IIIF features) to enable "find similar artworks" functionality even without perfect metadata.

🆕 7. Success Metrics & KPIs

Data Enrichment: >80% of imported records enriched with at least 3 external URIs (ULAN, AAT, TGN).

API Performance: 95th percentile query response time under 300ms.

System Uptime: 99.9% uptime for the public-facing IIIF and JSON-LD endpoints.

🆕 8. Out of Scope (For MVP)

Replacing the institution's primary CMS (e.g., TMS, EMu, or Qi).

Native ingestion of raw audio/video files (focusing strictly on IIIF image and text metadata for MVP).

---

Where to go next?

I recommend generating the Diagrammatic Architecture Map next.

Because we are talking about a hybrid datastore (Triple Store + Document DB), an ETL pipeline from legacy CMS systems, IIIF servers, and a new AI query layer, visualizing how data flows from the Museum CMS ➡️ ETL Pipeline ➡️ Linked Art Graph ➡️ React/Next.js Frontend is the most crucial next step to get engineering buy-in.

Would you like me to map out that Architecture Diagram detailing the specific modern tech stack (e.g., Elasticsearch + Blazegraph/Neptune + Next.js + IIIF Image Server)?

Here is the upgraded 10/10 PRD.

I have removed the residual citation markers, added a concrete 3-phase rollout roadmap, explicitly defined the reconciliation logic (using external authorities), and established strict API security and rate-limiting requirements to protect the database from heavy graph queries.

!🎨 Linked Art Web Application — Product Requirements Document (PRD)

!🧭 1. Executive Summary

The Linked Art Web Application is a modern, standards‑compliant platform for ingesting, transforming, validating, and publishing cultural heritage data using the Linked Art profile, a JSON‑LD implementation of the CIDOC‑CRM ontology.

Linked Art intentionally reduces the complexity of CIDOC‑CRM while retaining its expressive power — covering 90% of real museum use cases with ~10% of the complexity.

This application enables museums, archives, libraries, and research institutions to publish interoperable, event‑based cultural heritage data for public access, scholarly research, and cross‑institutional aggregation.

!🧩 2. Problem Statement

CIDOC‑CRM is powerful but extremely complex, containing dozens of classes and hundreds of relationships. Most institutions lack the resources to implement it directly.

Linked Art provides a practical, interoperable subset of CRM, but institutions still need:

A turnkey ingestion pipeline

A validation and mapping engine

A searchable public interface

A JSON‑LD API

A graph‑based visualization layer

This PRD defines a complete system that solves these gaps.

!🎯 3. Goals & Non‑Goals

Goals

Publish cultural heritage data in Linked Art JSON‑LD format.

Provide a complete ETL pipeline: ingest → map → validate → publish.

Offer a public search and browsing interface.

Support event‑based modeling (production, acquisition, attribution, etc.).

Ensure interoperability with other Linked Art and IIIF systems.

Provide developer‑friendly APIs for reuse and integration.

Enable research workflows through graph and timeline visualizations.

Non‑Goals

Full implementation of the entire CIDOC‑CRM ontology (intentionally avoided to maintain simplicity).

Acting as a collections management system (CMS).

Providing image hosting (integrates with IIIF instead).

!👥 4. User Personas

!🖼️ Museum Curator

Needs to publish authoritative object metadata and provenance.

!🧪 Digital Humanities Researcher

Needs structured, event‑based data for analysis and visualization.

!🛠️ Software Developer

Needs clean JSON‑LD APIs for integration with external systems.

!🏛️ Cultural Heritage Institution

Needs a standards‑compliant, low‑maintenance publishing platform.

!🔄 5. User Flows

5.1 Data Ingestion Flow

1 Upload source data (CSV, JSON API, XML).

2 System maps fields to Linked Art patterns.

3 Validation engine checks conformance.

4 User reviews and resolves any reconciliation conflicts.

5 Data is published as JSON‑LD.

5.2 Public Browsing Flow

1 User searches for an object, person, or event.

2 Results appear with facets (type, date, location).

3 User opens a detail page.

4 Page displays Linked Art JSON‑LD, images, events, relationships.

5 User explores graph or timeline views.

!🧱 6. Functional Requirements

6.1 ETL Pipeline

Based on real Linked Art workflows, which include source → mapped → Linked Art JSON‑LD stages.

Source ingestion: Accept JSON API, CSV, XML.

Mapping engine: Transform source data into intermediate model.

Linked Art generator: Produce final JSON‑LD.

Validation: Ensure strict conformance with the Linked Art profile.

Reconciliation Engine: Deduplicate agents, places, and concepts by mapping against external authorities (Getty ULAN, AAT, TGN, Wikidata). Flag conflicts for manual review in the Admin UI.

Versioning: Track changes and updates over time.

6.2 JSON‑LD API

REST endpoints for objects, agents, events, places.

JSON‑LD ⁠@context⁠ support.

Pagination, filtering, faceting.

Graph expansion (⁠?expand=events,actors⁠).

IIIF image linking.

6.3 Search & Discovery

Full‑text search.

Faceted navigation.

Autocomplete.

Timeline view.

Network graph view.

6.4 Admin Interface

Upload datasets.

Review validation errors and reconciliation conflicts.

Approve publication.

Manage field mappings.

!⚙️ 7. Non‑Functional Requirements

Performance

Ingest 10k objects under 5 minutes.

API responses < 200ms for common queries.

Reliability

99.9% uptime.

Automatic retries for ingestion jobs.

Security & API Governance

Access Control: Role‑based access control (RBAC) for the Admin interface.

Audit Logs: Track all data ingestion and mapping changes.

Rate Limiting: Throttling policies (e.g., 100 requests/minute per IP) to protect the database from expensive SPARQL/Graph expansion queries.

CORS: Configurable Cross-Origin Resource Sharing policies for external web integrations.

API Keys: Provisioning for developer tiers allowing higher rate limits.

Accessibility

WCAG 2.1 AA compliance for all public UI views.

!🧬 8. Data Model

8.1 Standards

Linked Art: JSON‑LD profile built on CIDOC‑CRM.

CIDOC‑CRM: Event‑based ontology for cultural heritage.

JSON‑LD: Serialization format for Linked Data.

8.2 Core Entities

HumanMadeObject

Person / Group

Production / Acquisition Events

Place

DigitalObject (IIIF)

8.3 Required Patterns

Event‑based modeling.

Identifiers & references.

Multilingual labels.

Provenance chains.

!🔧 9. ETL Requirements (Detailed)

Based on real Linked Art workflows: source → mapped → Linked Art.

Source Data

JSON API (e.g., PIA API).

CSV exports from existing CMS.

XML from legacy systems.

Mapping Layer

Extract relevant fields.

Normalize dates, names, identifiers.

Map to Linked Art patterns.

Linked Art Output

JSON‑LD with correct ⁠@context⁠.

Conformance to Linked Art profile.

Stable, dereferenceable URIs.

!🛠️ 10. API Requirements

Endpoints

⁠/objects⁠

⁠/agents⁠

⁠/events⁠

⁠/places⁠

⁠/search⁠

Features

JSON‑LD framing.

Filtering (⁠?type=painting&date=1500-1600⁠).

Sorting.

Graph expansion.

!🖥️ 11. Frontend Requirements

Search Interface

Keyword search.

Facets: type, date, location, material.

Object Detail Page

Title, description, IIIF images.

Production event.

Attribution.

Provenance.

Related objects.

Visualizations

Timeline.

Network graph.

Map view.

!🗺️ 12. Implementation Roadmap & Phasing

Phase 1: MVP (Core Foundation)

Goal: Establish the underlying data architecture and public read access.

Deliverables: Core JSON-LD API, rigid CSV ingestion (pre-mapped), basic search interface, and IIIF image integration.

Phase 2: The Publishing Engine (ETL Focus)

Goal: Allow non-technical curators to ingest and publish data independently.

Deliverables: Full Admin interface, robust mapping engine, JSON/XML ingestion, automated validation checks, and the reconciliation engine (Getty ULAN/Wikidata).

Phase 3: Research & Discovery (Advanced UI)

Goal: Unlock the full potential of event-based semantic data.

Deliverables: Network graph views, interactive timelines, map views for geographic provenance, and advanced API graph expansion (⁠?expand=⁠).

!🧪 13. TDD Requirements

Test Coverage

ETL mapping tests.

JSON‑LD validation tests.

API contract tests.

UI component tests.

Accessibility tests.

Acceptance Criteria

Every Linked Art entity must validate against the profile.

Every API endpoint must return valid JSON‑LD.

Every ingestion must produce deterministic output.

!📏 14. Success Metrics

Technical

100% JSON‑LD validation pass rate.

<1% ingestion error rate.

<200ms API response time.

User Experience

90% successful search rate.

80% of users find the desired object within 3 clicks.

Adoption

Integration with at least 3 external Linked Art systems.

!⚠️ 15. Risks & Dependencies

Risks

Source data quality from legacy CMS varies widely and requires heavy normalization.

Institutions may lack compliant IIIF servers for image fetching.

CIDOC‑CRM updates may require upstream model changes.

Dependencies

Linked Art specification.

CIDOC‑CRM ontology.

IIIF Image API.

External data sources and authority files (Getty Vocabularies, Wikidata).

AI/agent quick endpoints

/api/docs/manifest — current catalog with IDs, anchors, source timestamps, and checksums.
/api/docs/content?path=roadmap.md — roadmap JSON payload + rendered HTML for downstream tooling.

Meta Museum Docs (Live Markdown Source)