Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Issue title: Dynamic Networks and Knowledge Discovery
Guest editors: Ruggero G. Pensaxy, Francesca Corderoy, Céine Rouveirolz and Rushed Kanawatiz
Article type: Research Article
Authors: Schmidt, Jana | Ghorbani, Asghar | Hapfelmeier, Andreas | Kramer, Stefan; *
Affiliations: Institut für Informatik – I12, TU München, München, Germany | [x] IRPI-CNR, Torino, Italy | [y] University of Torino, Torino, Italy | [z] University of Paris-Nord, Paris, France
Correspondence: [*] Corresponding author: Stefan Kramer, Institut für Informatik, Johannes Gutenberg University Mainz, 55128 Mainz, Germany. Tel.: +49 6131 39 21057; Fax: +49 6131 39 23534; E-mail: kramer@informatik.uni-mainz.de.
Abstract: The growing number of time-labeled datasets in science and industry increases the need for algorithms that automatically induce process models. Existing methods are capable of identifying process models that typically only work on single attribute events. We propose a new model type to address the problem of mining multi-attribute events, meaning that each event is described by a vector of attributes. The model is based on timed automata, includes expressive descriptions of states and can be used for making predictions. A probabilistic real time automaton is created, where each state is annotated by a profile of events. To identify the states of the automaton, similar events are combined by a clustering approach. The method was implemented and tested on a synthetic, a medical and a biological dataset. Its prediction accuracy was evaluated on a medical dataset and compared to a combined logistic regression, which is considered a standard in this application domain. Moreover, the method was experimentally compared to Multi-Output HMMs and Petri nets learned by standard process mining algorithms. The experimental comparison suggests that the automaton-based approach performs favorably in several dimensions. Most importantly, we show that meaningful medical and biological process knowledge can be extracted from such automata.
Keywords: Automata induction, multivariate time series, clustering
DOI: 10.3233/IDA-120569
Journal: Intelligent Data Analysis, vol. 17, no. 1, pp. 93-123, 2013
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
sales@iospress.com
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
info@iospress.nl
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office info@iospress.nl
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
china@iospress.cn
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
如果您在出版方面需要帮助或有任何建, 件至: editorial@iospress.nl