Metadata is often described as “data about data”, or the who, what, where, and when.
In the geospatial world, for each dataset we maintain, we should record information about the data such as:
general description
location
usage restrictions
projection
technical contact
time period
date created
date modified
version
Metadata Standards
Metadata standards have been introduced since the mid 1990’s with the goals of:
outlining specific required parameters
common terminology
consistency
interoperability
Metadata Standards
Dublin Core: established a core/common group of 15 metadata elements
FGDC CSDGM: approved by the U.S. Federal Geographic Data Committee originally in 1994 and composed of Sections, Compound Elements, Data Elements
ISO 19115: International Standards Organization’s TC211 committee created this in 2003 and is composed of more than 400 “Core”, “Mandatory”, and “Optional” elements
ISO 19139: The XML implementation schema for ISO 19115 specifying the metadata record format
Catalogue Services for the Web (CSW) is the HTTP protocol binding of the Catalogue Service Implementation Specificaton that allows for publishing and searching of metadata.
CSW Operations
GetCapabilities (mandatory) - allow clients to retrieve information describing the service instance
DescribeRecord (mandatory) - allows a client to discover elements of the information model supported by the target catalogue service
GetRecords (mandatory) - get metadata records
GetRecordById (optional) - get metadata records by ID
GetDomain (optional) - obtain runtime information about the range of values of a metadata record element or request parameter
Harvest (optional) - references the data to be inserted or updated in the catalog
Transaction (optional) - defines an interface for creating, modifying and deleting catalogue records
Apr 2011: First official release (0.1) was released and already passed all CITE tests
Jul 2011: Version 1.0 released
Feb 2012: pycsw included in OSGeoLive 5.5
Jan 2013: pycsw 1.4 certified as OGC Compliant
Apr 2013: pycsw entered OSGeo Incubation
Aug 2013: Latest stable release (1.6.1)
Goals
Lightweight and easy to setup: a standalone catalogue, no GUI or metadata editing front end, designed for the use case of exposing ready-to-go metadata (files or in existing DB) through a CSW interface
Extensible: the ability to add metadata formats and mapping them to a common information model and core/additional queryables
OGC compliant: always pass CITE tests
Use Cases
Case 1: Publishing against established metadata management workflow
Case 2: Publishing against existing metadata database
Features
Harvesting support for WMS, WFS, WCS, WPS, WAF, CSW, SOS
Implements ISO Metadata Application Profile 1.0.0
Implements FGDC CSDGM Application Profile for CSW 2.0
Implements INSPIRE Discovery Services 3.0
Supports ISO, Dublin Core, DIF, FGDC and Atom metadata models
OWSLib is a Python package for client programming with Open Geospatial Consortium (OGC) web service interface standards, and their related content models
pycsw and Open Data
pycsw is embedded in various Open Data portal software