Microsoft R Glossary
Quick Links: A B C D E F G H I J K L M N O P Q R S T U V X Y Z
Didn't find what you were looking for? Suggest a term using the "Is this page helpful?" link below.
For general R language terminology, check out CRAN's R Language Definition.
Also, check out our More Resources for other publications and resources.
Use this glossary to find the definitions to common terms in the Microsoft R documentation.
A feature in RevoScaleR
that lets you define an environment, either local or remote, and then transfer R computations to that environment, typically to get better performance or to minimize data transfer. Local is the default. Remote compute context is available for these platforms: SQL Server, HDInsight, Teradata, Hadoop MR and Spark, and Microsoft R Server (Linux and Windows). Learn more…
Using ScaleR on R Server, this is the ability to partition data into multiple parts for processing, reassembling it later for analysis. Learn more…
Similar to parallel computing, but in Microsoft R, it specifically refers to workload distribution across multiple servers. Learn more…
High-performance analytics (HPA)
In ScaleR, HPA refers to functions such as rxLinMod and other RevoScaleR analytics functions that focus on efficiently feeding data to available cores by means of efficient disk I/O, threading, and data management in memory. Learn more…
High-performance computing (HPC)
In ScaleR, HPC refers to functions such as rxExec, foreach, and rmpi that are CPU-centric, involving tremendous amounts of processing on relatively small amounts of data. HPC functions are optimized to share tasks across available computing resources, but can be slowed if large amounts of data need to be transferred.
Microsoft R Client
Microsoft R Client is a free, community-supported, data science tool for high performance analytics. R Client is built on top of Microsoft R Open and includes RevoScaleR so you can benefit from parallelization and remote computing. Learn more…
Microsoft R Open
Microsoft R Open is the enhanced distribution of open source R from Microsoft that is fully compatibility with all R packages, scripts and applications. Enhancements include multi-core processing, a fixed CRAN repository date, and reproducible R with the checkpoint package. Learn more…
Microsoft R Server
Use R—the powerful, statistical programming language—in an enterprise-class, big data analytics platform. R Server, built on ScaleR technology, is an enterprise class server for hosting and managing parallel and distributed workloads of R processes on servers (Linux and Windows) and clusters (Hadoop and Apache Spark). Learn more…
The Intel® Math Kernel Library (Intel® MKL) is a library of optimized math routines. This library makes it possible for so many common R operations, such as matrix multiply/inverse, matrix decomposition, and some higher-level matrix operations, to compute in parallel and use all of the processing power available to reduce computation times. Learn more…
Configure the operationalization feature for Microsoft R Server to act as a deployment server and host analytic web services. Learn more…
A process that breaks a computing task into segments that can be executed independently on separate threads, cores, or computers. Multiple architectures support parallel computing: SMP means parallel processing on a single computer with multiple processors, each running a different set of commands; MPP means processing a task across multiple computers. In both SMP and MPP, the results from all processes are combined at the end to give a single result. Learn more…
A proprietary R package by Microsoft that provides scalable, fast (multicore), and extensible data analysis for large datasets with R. It offers parallel external memory algorithms that help R break through memory and performance limitations. This package is available with Microsoft R Server, SQL Server R Services, and Microsoft R Client. Learn more…
SQL Server R Services
A feature included in SQL Server 2016 and after that supports execution of R scripts in T-SQL stored procedures. R Services integrates an R run-time with optimizations in SQL Server, including resource management and data security.
An algorithm that takes a given set of values and a chunk of data, and then outputs a revised set of values cumulative for all chunks. The simplest example is an updating sum: sum is computed for the first chunk, followed by a second chunk, which each successive chunk contributing to a revised value until reaching the cumulative sum. Updating algorithms are used in ScaleR.
XDF File Format
The External Data Frame (XDF) file format is a high-performance, binary file format for storing big data sets for use with RevoScaleR. This file format has an R interface and optimizes rows and columns for faster processing and analysis. Learn more…
Suggest terms with
"Is this page helpful?"