data analytics

11 posts

person holding blue ballpoint pen on white notebook

Unsupervised anomaly detection using an ensemble of one-class random forest classifiers

Introduction Unsupervised anomaly detection with unlabeled data – is it possible to detect outliers when all we have is a set of uncommented, context-free signals? The short answer is, yes – this is the essence of how one deals with network intrusion, fraud, and other types of low-instance anomaly. In

Read more
black blue and red graph illustration

Improving black-box process efficiency using Bayesian Optimization

Instead of randomly or exhaustively iterating through combinations of algorithms and parameters, we can use Bayesian Optimization libraries to build up an in-memory approximation to the process we want to fine-tune. We can then make a our selections on prior knowledge.

Read more
macbook pro on brown table

Predicting hidden events: inverting the time-to-event paradigm: Part 2

Time-to-event (TTE) use-cases crop up in many places across industries. Some examples would be: the prediction of customer churn (the sales domain), remaining-useful-life or time-to-failure TTF (predictive maintenance), or anomaly detection (machine monitoring). Some events are difficult to predict as they are hidden. We can instead try to look for interim events to improve prediction accuracy.

Read more