In the case of OAI-PMH-based search, administrators must configure a remote system as well as the current system before OAI-PMH-based search is possible.
Thus, this configuration requires at least two large steps:
OAI-PMH-based search is based upon a protocol called OAI-PMH.
OAI-PMH stands for "Open Archives Initiative Protocol for Metadata Harvesting". More information about it can be found here.
This protocol supports data search and exchange which uses a small API (having 6 functions) and a common XML schema for data-sharing. The protocol presumes that each system participating in a network of OAI-PMH data sharing systems supports the ability to perform these functions on demand.
These functions give participating systems a way to ask another system to:
identify
- provide descriptive information about what data they provide, what abilities they have, and what general kinds of information they containget
(i.e, obtain) an individual metadata record from a given data-providing systemlist identifiers
- get abbreviated (header) information for metadata records of a data-providing systemlist metadata formats
- obtain a list of available/supported metadata formats (XML schemas) from a data-providing systemlist records
- obtain (“harvest”) a list of metadata records (XML files) from a data-providing systemlist sets
– obtain the set structure (types of information) available from a data-providing systemThe system implements these functions and supports the OAI-PMH schema.
data provider
(OAI-PMH server) or of a data harvester
(OAI-PMH client) in a network of OAI-PMH-compliant data sharing systems.This type of data sharing and searching is used most typically by registry systems in order to discover, share, and search data located at other OAI-PMH-aware systems.
When a given user initiates an OAI-PMH-based search on the local (current) system, similar to the federated search scenario, the local (current) system runs OAI-PMH-specific search processes across its registered OAI-PMH data providers (and they, likewise, propagate these searches to their providers, respectively).
The OAI-PMH-based search differs from federated search in several important ways:
That is, OAI-PMH:
Whereas, federated searches:
OAI-PMH operates by, first locating data-providers that have information it needs and then copying that information locally from the remote system back to the local (current) system. Then, secondly, applying local search operations (e.g., search by example, keyword, etc.) to the locally-copied OAI-PMH information.
In summary:
The steps below describe what must be done to configure a network of two-or-more OAI-PMH-compliant systems in order to support OAI-PMH-based searches. As with federated search configurations, only administrators are permitted to create and edit OAI-PMH-based search configurations.
OAI-PMH Data Provider Identity configuration page
:top navigation bar
, click the drop-down-menu
(whose title is the user’s username) -> Administration menu-item
Support the various commands/functions that must be implemented by various OAI-PMH-Data Providers
:
Support Identify
command
"Identity"
(on left panel) -> "Edit" button
."Edit Curator OAI-PMH Server" input form
.Name
: set to a unique name (e.g., “Curator OAI-PMH Server”)Repository Identifier
: set to a unique name (e.g., “server-x’)Enable Harvesting
: set to default (default=checked = True)"Save" button
at the bottom of the page.Support List Metadata Formats
command
"Metadata Formats"
(on left panel)"Templates as Metadata Formats"
add one or more templates which will be used to support data sharing under OAI-PMH."Add" button
. This leads to an input form for adding a metadata format template.Metadata Prefix
: (e.g., “oai_dc”)Template
: enter a template name"Add" button
Support List Sets
command
One is not required to specify sets in order for an OAI-PMH data search to become possible. Indeed, one may specify and perform OAI-PMH-based searches without specifying any sets of templates for OAI-PMH at all.
OAI-PMH Data Harvesters
(clients) with a way of organizing the different templates available in a given Data Provider
:"Sets"
(on left panel) -> "Add Sets" button
, which leads to an "Add Sets" input form
for adding set information.Set spec
Set name
Templates
Description
"Add" button
The administrator for the remote system configuration must share the following information with the administrator for the local (current) system:
Remote system IP-address/URL
The remote system is now configured.
Remote system IP-address/URL
Add New Data Provider page
:top navigation bar
, click the drop-down-menu
(whose title is the user’s username) -> Administration menu-item
-> Data Providers link
, which leads to the "Data Providers" page
."Add" button
to open the "Add New Data Provider" input form
.Enter provider URL
– enter remote system (data provider) ip-address/URL
Harvest Rate (seconds)
– (default = 60)Enable automatic harvesting
– (default = checked = True)"Add data provider" button
at the bottom of the page, which updates the OAI-PMH data providers list
.registered data providers
to ensure they are able to provide data by clicking the "Check All" button
.registered data providers
to ensure they are configured and ready to support OAI-PMH-based searches by clicking on the "Update All" button
.registered data providers
by clicking the "Harvest All" button
.The local (current) system is now configured.
If both the remote and local systems are configured, then the users of the local (current) system may now perform OAI-PMH-based searches on this set of registered data providers.
Over time, an administrator may register many such remote systems for carrying out searches initiated by the local (current) system’s users.