Showing results for: [ machine learning ]
Based on a general definition of a cluster and the quality of a clustering result, this code presents a new method for evaluating existing clustering algorithms, or undertaking clustering, capable of predicting the number and type of clusters and outliers present in a data set, regardless of the complexity of the distribution of points. This algorithm, referred to as iterative label spreading (ILS), can recognize the characteristics expected of a successful clustering result before any clustering algorithm has been applied, providing a type of hyper-parameter optimization for clustering. In this notebook the algorithm is assessed using large benchmark two-dimensional synthetic data sets, with tutorial examples.
Machine Learning Group Operating Cost - Applied Machine Learning - Published 17 Sep 2019
This collection comprises the two synthetic datasets for the assessment of the reliability of predictive process monitoring techniques used in Klinkmüller, C., van Beest, N., Weber, I.: Towards Reliable Predictive Process Monitoring. CAiSE Forum, 2018.
The two datasets are provided as XES-files. For more information on the Extensible Event Stream (XES) standard see http://www.xes-standard.org.
The class of each trace is captured via the "concept:name" sub-element. Additionally, for each trace the "classifiableFrom" sub-element records the minimum number of events that must be observed for the trace to be classifiable. More information regarding the notion of classifiability is provided in the paper.
Legacy data - - Published 13 Apr 2018
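As a rough illustration of the trace-level sub-elements described for the XES datasets above, the sketch below reads "concept:name" and "classifiableFrom" from each trace using Python's standard library. It is only a sketch under assumptions: the in-memory sample log is invented for illustration and is not taken from the datasets, the function name read_trace_labels is hypothetical, and "classifiableFrom" is assumed to hold an integer value.

```python
import xml.etree.ElementTree as ET

# Invented sample log for illustration only; not taken from the datasets.
SAMPLE = """
<log xmlns="http://www.xes-standard.org/">
  <trace>
    <string key="concept:name" value="classA"/>
    <int key="classifiableFrom" value="3"/>
    <event><string key="concept:name" value="step1"/></event>
  </trace>
  <trace>
    <string key="concept:name" value="classB"/>
    <event><string key="concept:name" value="step1"/></event>
  </trace>
</log>
"""


def local_name(tag):
    """Drop an XML namespace prefix such as '{http://...}trace'."""
    return tag.rsplit("}", 1)[-1]


def read_trace_labels(root):
    """Return one (class label, classifiableFrom) pair per trace.

    Hypothetical helper: only direct children of each trace are read,
    so event-level "concept:name" attributes are ignored.
    """
    rows = []
    for element in root.iter():
        if local_name(element.tag) != "trace":
            continue
        attributes = {
            child.get("key"): child.get("value")
            for child in element
            if local_name(child.tag) != "event" and child.get("key") is not None
        }
        raw = attributes.get("classifiableFrom")
        # Assumption: the value is an integer event count when present.
        rows.append((attributes.get("concept:name"),
                     int(raw) if raw is not None else None))
    return rows


rows = read_trace_labels(ET.fromstring(SAMPLE))
print(rows)
```

To read one of the actual files, the root element could be obtained with ET.parse(path).getroot(), where path points to the downloaded XES file.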
Example code for the protein droplet processing algorithms
Legacy data - - Published 01 Mar 2018
Classification pipeline for images produced at the Collaborative Crystallisation Centre