Anomaly Detection
Updated: June 13, 2017
This topic describes the modules provided in Azure Machine Learning for anomaly detection.
Anomaly detection encompasses many important tasks in machine learning:
- Identifying transactions that are potentially fraudulent
- Learning patterns that indicate a network intrusion has occurred
- Finding abnormal clusters of patients
- Checking values input to a system
Because anomalies are by definition rare events, it can be difficult to collect a representative sample of data that can be used for modeling. The algorithms included in this section have been especially designed to address the core challenges of building and training models using imbalanced datasets.
Azure Machine Learning provides the following modules to use for creating an anomaly detection model. Just drag the module into your experiment to begin working with the model.
After setting model parameters, you must train the model using a labeled dataset and a special training module, Train Anomaly Detection Model. The result is a trained model that you can use to test new data. To do this, use the all-purpose Score Model module.
For an example of how these modules work together, see this experiment in the Cortana Intelligence Gallery: Anomaly Detection: Credit Risk
Related tasks
The Time Series Anomaly Detection is a new module that is a bit different from the other anomaly detection models. It is specifically designed for time series data, and is intended to analyze trends in the data. The algorithm identifies potentially anomalous trends in the time series data and flags deviations from the trend's direction or magnitude.
Azure also provides an anomaly detection service API that you can call as a web service: Machine Learning Anomaly Detection API
If you are not sure whether anomaly detection is the right algorithm to use with your data, see these guides:
|
This category includes the following modules:
| Module | Description |
|---|---|
| One-Class Support Vector Machine | Creates a one-class Support Vector Machine model for anomaly detection |
| PCA-Based Anomaly Detection | Creates an anomaly detection model using Principal Component Analysis |
Regression
Classification
Clustering
Text Analytics
OpenCV Library Modules
Module Categories and Descriptions