The Datahub

A metadata aggregator for GLAM organisations.


Aggregate. Publish. Build.

Sharing metadata doesn't have to be hard. The Datahub is an application for aggregating and publishing metadata records on the web with technologies like HTTP REST and OAI-PMH. It takes care of the hard parts, so you can focus on building digital experiences for your audience.


Web technologies

The Datahub is built on well-established web technologies such as Symfony, PHP and MongoDB, and can be deployed wherever those are available.

Open Source

The Datahub is an open source project maintained on GitHub by an active community of contributors.

Formats

The Datahub supports standardised formats like LIDO XML, Dublin Core, EAD, MODS and METS.

Get started

Learn how to get started by reading the documentation. Connect your record sources to the Datahub and start consuming metadata in your end-user applications.

Quick installation

Spin up a quick installation of the Datahub:

# Clone the Datahub repository
$ git clone https://github.com/thedatahub/Datahub.git datahub

# Go into the directory
$ cd datahub

# Use Composer to install the application
$ composer install

# Run the setup
$ app/console app:setup
$ app/console doctrine:mongodb:fixtures:load --append

# Start the development server
$ app/console server:run
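
Once the development server is running, records can be harvested over OAI-PMH. The following is a minimal sketch rather than an official recipe: the `/oai` path and the `localhost:8000` address are assumptions about a local setup, and `oai_url` is a hypothetical helper. `ListRecords` and the `oai_dc` prefix come from the OAI-PMH standard itself.

```shell
#!/bin/sh
# Build an OAI-PMH request URL from a base URL, a verb and a metadata prefix.
# oai_url is a hypothetical helper, not part of the Datahub.
oai_url() {
    base="$1"
    verb="$2"
    prefix="$3"
    printf '%s?verb=%s&metadataPrefix=%s\n' "$base" "$verb" "$prefix"
}

# Assumed endpoint of a local development instance; adjust to your setup.
DATAHUB_OAI="http://localhost:8000/oai"

url=$(oai_url "$DATAHUB_OAI" ListRecords oai_dc)
echo "$url"

# Uncomment to fetch the records once a Datahub instance is running:
# curl --silent --fail "$url"
```

Keeping the URL construction in a small function makes it easy to swap the verb (for example `Identify` or `ListMetadataFormats`) or the metadata prefix without touching the rest of a harvesting script.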

The Datahub Factory

Automate exchanging metadata between record sources, the Datahub and your end-user applications with the Datahub Factory.

ETL pipelining and you

The Datahub Factory makes it easy to set up ETL pipelines. Extract metadata from record management systems, collection registration systems or static data sources. Transform your records to a standard metadata model. Load them into the Datahub. Automate your workflow.
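
The extract, transform and load stages described above can be illustrated with a plain shell pipeline. This is only a conceptual sketch and not how the Datahub Factory is implemented: the sample records, the field mapping and the three function names are hypothetical, and the load stage prints records instead of sending them to a Datahub instance.

```shell
#!/bin/sh
# Extract: emit rows from a static CSV source (hypothetical sample data).
extract() {
    cat <<'EOF'
id,title,creator
001,Example object A,Example maker A
002,Example object B,Example maker B
EOF
}

# Transform: skip the header row and map each column to a
# Dublin Core-style field name.
transform() {
    awk -F',' 'NR > 1 {
        printf "dc:identifier=%s|dc:title=%s|dc:creator=%s\n", $1, $2, $3
    }'
}

# Load: stand-in for pushing records to the Datahub; it only prints them.
load() {
    while IFS= read -r record; do
        printf 'LOAD %s\n' "$record"
    done
}

# Chain the three stages into one pipeline.
extract | transform | load
```

Each stage reads from standard input and writes to standard output, so a stage can be replaced (a different source, a different target model) without changing the others.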

The Datahub Factory uses Perl, the Catmandu APIs and the Catmandu Fix language.

The Datahub Factory can be installed from MetaCPAN; its source is available as a GitHub project.

Who uses the Datahub?