Open Standard for Metadata

Non-profit organization ODPi has announced Egeria, a new project that supports the free flow of metadata between different technologies and vendor offerings. It promises to allow organizations to locate, manage and use their data more effectively.

Last year’s ODPi white paper on “ The Year of Enterprise-wide Production Hadoop ” found that Data Governance and Security were the biggest blocking factors to enabling enterprises to take big data into true production.

Egeria is claimed to be the only open source driven solution designed to set a standard for leveraging metadata in line of business applications, and enabling metadata repositories to federate across the enterprise.

“A consistent view on data across the entire landscape is essential for any organisation that wants to become data driven. Not just where the data is, but also the quality, the ownership, and the full lineage across the entire set of technologies used,” said Ferd Scheepers, chief information architect, ING. “The open metadata standard delivered by Egeria delivers this consistent view across all the technologies, while reducing the cost of metadata capture, and the management challenges of working with various data tool vendors.”

Egeria is built on open standards and delivered via Apache 2.0 open source license. The ODPi Egeria project creates a set of open APIs, types and interchange protocols to allow all metadata repositories to share and exchange metadata.

From this common base, it adds governance, discovery and access frameworks for automating the collection, management and use of metadata across an enterprise. The result is an enterprise catalogue of data resources that are transparently assessed, governed and used in order to deliver maximum value to the enterprise.

“Egeria’s open source metadata management presents an exciting opportunity to rethink both management and governance of data to provide greater trust and flexibility in how we all share and consume data,” said John Mertic, director of program management, ODPi. “Egeria’s open governance model allows our community and practitioners to develop and evolve the base for use in any offerings and deployments.”

IBM and ING, vendors and end users collaborated on the first Egeria release, which was initially incubated as part of the Apache Atlas project (an open source metadata repository designed for the Apache Hadoop ecosystem). IBM and ING jump-started Egeria with a significant code donation. ODPI members and end-users are actively collaborating to expand the Egeria code base with standard integration points between metadata repositories and line of business tools leveraging data.

An Apache Atlas patch is available for immediate use, and an Egeria proof of concept is complete for IBM’s InfoSphere Information Governance Catalog.

Read about Egeria 1.0 on the ODPi Blog or visit