Summarizing Processes

A large number of processes, of di erent characteristics, are
being created, stored and reused during Services-Oriented Architecture
based implementations. The processes may describe specification of func-
tionality for a to-be built system (e.g., business processes in different
notations), automatically or semi-automatically created compositions/
plans of services to meet some goals (e.g., PDDL plans), or execution
behavior (e.g., web processes). However, a user today can get little in-
sight from such a collection of processes other than to search it with
a keyword based query interface. In this paper, we introduce the prob-
lem of automatically summarizing a collection of processes and propose
a comprehensive solution for it. The solution works on any available
type of process content ranging from metadata, syntactic information
like activities, process features interpreted as semantic annotations, to
multi-dimensional textual content; and lters the analyses based on sig-
ni cance. The summary so created provides insights about what the
repository contains at the aggregate level as well as in subsets (clusters)
of processes, built using an extensible set of process distance measures.
We employ the solution on diverse collections containing hundreds of
processes and demonstrate that the technique can shed correct and valu-
able information where none existed before, while being scalable and

By: Biplav Srivastava

Published in: RI10008 in 2010


This Research Report is available. This report has been submitted for publication outside of IBM and will probably be copyrighted if accepted for publication. It has been issued as a Research Report for early dissemination of its contents. In view of the transfer of copyright to the outside publisher, its distribution outside of IBM prior to publication should be limited to peer communications and specific requests. After outside publication, requests should be filled only by reprints or legally obtained copies of the article (e.g., payment of royalties). I have read and understand this notice and am a member of the scientific community outside or inside of IBM seeking a single copy only.


Questions about this service can be mailed to .