Lesson 5: Testing Models (Basic Data Mining Tutorial)
Topic Status: Some information in this topic is preview and subject to change in future releases. Preview information describes new features or changes to existing features in Microsoft SQL Server 2016 Community Technology Preview 2 (CTP2).
Now that you have processed the model by using the targeted mailing scenario training set, you will test your models against the testing set. Validation is an important step in the data mining process. Knowing how well your targeted mailing mining models perform against real data is important before you deploy the models into a production environment.
Because the data in the testing set already contains known values for bike buying, it is easy to determine whether the model's predictions are correct. The model that performs the best will be used by the Adventure Works Cycles marketing department to identify the customers for their targeted mailing campaign.
In this lesson you will validate your models using multiple methods:
You’ll make predictions against the testing set to see how accurate the model is on known results. You’ll use a lift chart to measure its effectiveness.
You will test your models on a filtered subset of the data. You can compare multiple models in the same lift chart.
For more information about how model validation in general, see Data Mining Concepts.