SPADE: Support for Provenance Auditing in Distributed Environments - Middleware 2012
Conference paper, 2012

Abstract

SPADE is an open source software infrastructure for data provenance collection and management. The underlying data model used throughout the system is graph-based, consisting of vertices and directed edges that are modeled after the node and relationship types described in the Open Provenance Model. The system has been designed to decouple the collection, storage, and querying of provenance metadata. At its core is a novel provenance kernel that mediates between the producers and consumers of provenance information, and handles the persistent storage of records. It operates as a service, peering with remote instances to enable distributed provenance queries. The provenance kernel on each host handles the buffering, filtering, and multiplexing of incoming metadata from multiple sources, including the operating system, applications, and manual curation. Provenance elements can be located locally with queries that use wildcard, fuzzy, proximity, range, and Boolean operators. Ancestor and descendant queries are transparently propagated across hosts until a terminating expression is satisfied, while distributed path queries are accelerated with provenance sketches.
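
To make the graph-based data model and the ancestor query concrete, here is a minimal sketch in Python. It is not SPADE's API: the `ProvenanceGraph` class, its method names, and the sample vertices are hypothetical, and a simple depth limit stands in for the terminating expression mentioned in the abstract. The only parts taken from the Open Provenance Model are the vertex types (Artifact, Process, Agent) and the edge names, with each edge pointing from an effect to its cause.

```python
from collections import defaultdict, deque


class ProvenanceGraph:
    """Toy OPM-style graph (hypothetical, not SPADE's implementation)."""

    def __init__(self):
        self.vertices = {}              # vertex id -> (type, annotations)
        self.edges = defaultdict(list)  # source id -> [(relation, destination id)]

    def add_vertex(self, vid, vtype, **annotations):
        self.vertices[vid] = (vtype, annotations)

    def add_edge(self, src, relation, dst):
        # Edges run from effect to cause, e.g. artifact -wasGeneratedBy-> process.
        self.edges[src].append((relation, dst))

    def ancestors(self, start, max_depth=None):
        """Breadth-first walk along dependency edges.

        max_depth is an assumed stand-in for a terminating expression:
        vertices more than max_depth hops away are not returned.
        """
        seen, queue = set(), deque([(start, 0)])
        while queue:
            vid, depth = queue.popleft()
            if max_depth is not None and depth >= max_depth:
                continue
            for _, dst in self.edges[vid]:
                if dst not in seen:
                    seen.add(dst)
                    queue.append((dst, depth + 1))
        return seen


# Illustrative data only: a process, run by a user, turns one file into another.
g = ProvenanceGraph()
g.add_vertex("report.tex", "Artifact")
g.add_vertex("report.pdf", "Artifact")
g.add_vertex("latex", "Process", pid=4242)
g.add_vertex("alice", "Agent")
g.add_edge("report.pdf", "wasGeneratedBy", "latex")
g.add_edge("latex", "used", "report.tex")
g.add_edge("latex", "wasControlledBy", "alice")

full_lineage = g.ancestors("report.pdf")
direct_causes = g.ancestors("report.pdf", max_depth=1)
```

A descendant query would be the same traversal over reversed edges. In a distributed setting, the walk would have to continue on a remote host whenever an edge crosses a host boundary, which this single-process sketch does not attempt.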
Main file
978-3-642-35170-9_6_Chapter.pdf (523.16 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01555544, version 1 (04-07-2017)

Cite

Ashish Gehani, Dawood Tariq. SPADE: Support for Provenance Auditing in Distributed Environments. 13th International Middleware Conference (MIDDLEWARE), Dec 2012, Montreal, QC, Canada. pp.101-120, ⟨10.1007/978-3-642-35170-9_6⟩. ⟨hal-01555544⟩