
[Figure: flow chart for optimizing performance on parallel hardware]

Optimizing performance on parallel hardware is an iterative process. The flow chart shows the various steps involved.

Intel® Parallel Studio XE Cluster Edition is a suite of software development tools for boosting application performance with less effort.

Launching the tools

After installing Intel Parallel Studio XE, you can launch the tools from Eclipse CDT or from the command line.
See the Getting Started guides, linked from this page, for additional instructions and command lines for the tools you are interested in.

To easily access the standalone or command-line versions of the tools, use the initialization utility provided with Intel Parallel Studio XE to initialize all the tools in one step:

  1. From a command prompt or script, cd to <install_dir>/parallel_studio_xe_2017.<update number>.<package number>/bin
    By default, <install_dir> is:
    • For root installations: /opt/intel
    • For non-root installations: $HOME/intel
  2. Enter source psxevars.sh or source psxevars.csh, depending on your shell.
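
The two steps above can be wrapped in a small helper for Bourne-compatible shells. This is only a sketch under stated assumptions: the function name psxe_init is hypothetical, the Bourne-shell script is assumed to be named psxevars.sh (the counterpart of psxevars.csh), and the version directory must be supplied by you because the update and package numbers differ per installation.

```shell
# Sketch only: source the Parallel Studio XE initialization script.
# psxe_init is a hypothetical helper name, not part of the product.
#
# Usage: psxe_init <install_dir> <version_dir>
#   <install_dir>  e.g. /opt/intel (root install) or $HOME/intel (non-root)
#   <version_dir>  the parallel_studio_xe_2017.<update number>.<package number>
#                  directory name found on your system
psxe_init() {
    # Assumption: the Bourne-shell variant of the script is psxevars.sh.
    psxe_script="$1/$2/bin/psxevars.sh"

    if [ ! -f "$psxe_script" ]; then
        echo "psxevars.sh not found at: $psxe_script" >&2
        return 1
    fi

    # Clear this function's arguments so the sourced script does not
    # mistake them for its own arguments.
    set --

    # Source (not execute) the script so its environment changes persist
    # in the current shell.
    . "$psxe_script"
}
```

Call it as psxe_init /opt/intel "<version_dir>", substituting the directory name that actually exists under your install directory. The function must run in your interactive shell or be sourced from a script; running it in a subshell would discard the environment it sets up.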

Design and build your application

[Figure: tools for high-performance implementation]

Use the following Intel Parallel Studio XE components to build optimized executables and libraries. The flow chart shows which tools can help you at each stage of the flow.

Click the links to see Getting Started information for the tools.

Getting Started Guide: Description
Intel® C++ Compiler - 17.0: C and C++ optimizing compiler that creates fast code for modern processors. It uses the latest instruction sets, auto-vectorizes code to use wider vector registers, and uses highly tuned parallel models such as OpenMP* and Intel® TBB. It’s a drop-in addition for C and C++ development and has broad support for the latest C and C++ standards.
Intel® Fortran Compiler - 17.0: High-performance, optimized Fortran compiler with support for the standards-driven OpenMP* parallel programming model.
Enhanced GNU* Project Debugger (GDB):
  • GDB 7.10 for debugging applications natively on Intel® 64 Architecture systems.
  • GDB 7.10 for debugging applications remotely on Intel® Xeon Phi™ coprocessor systems.
  • Intel® Debugger for Heterogeneous Compute 2017, which enables debugging CPU and GPU simultaneously.
Intel® Integrated Performance Primitives (Intel® IPP) - 2017: Pre-optimized building blocks of computationally intensive functions for processing large datasets and for high-performance computing.
Intel® Math Kernel Library (Intel® MKL) - 2017: A set of highly optimized linear algebra, Fast Fourier Transform (FFT), vector math, and statistics functions.
Intel® Threading Building Blocks (Intel® TBB) - 2017: C++ template library for creating high-performance, scalable parallel applications.
Intel® MPI Library - 2017: High-performance MPI library.
Intel® MPI Benchmarks - 2017: A set of MPI kernel tests to verify the performance of your cluster or MPI implementation.
Intel® Data Analytics Acceleration Library (Intel® DAAL) - 2017: C++ and Java API library of optimized analytics building blocks that improve big data analytics performance from edge devices to servers. It covers all data analysis stages (preprocessing, transformation, analysis, modeling, decision making) for offline, streaming, and distributed analytics usage. Essential for engineering high-performance big data applications.

Increase performance and diagnose errors

[Figure: performance analysis tools for diagnosis]

Use the following components to detect errors and improve the performance of your application. Each tool applies to a different stage of the flow.

Click the links to see Getting Started information for the tools:

Getting Started Guide: Description
Intel® Advisor - 2017: Vectorization optimization and thread prototyping. Use this tool in the vectorization and threading stages of the flow.
Intel® Inspector - 2017: Memory and thread debugger. Use this tool to find races, deadlocks, and illegal memory accesses.
Intel® VTune™ Amplifier XE - 2017: Performance profiler. Use this tool in the threading and bandwidth optimization stages and for advanced vectorization optimization.
Introducing Intel® Performance Snapshot: Use this tool for quick, simple views into performance optimization opportunities.
Intel® Trace Analyzer and Collector - 2017: MPI communications performance profiler and correctness checker. Use this tool in the MPI tuning stage.
Intel® Cluster Checker - 2017: Verification of your system’s Intel® Cluster Ready compliance. To check for Intel Cluster Ready compliance, you can download and install the Intel® Cluster Checker product from the Intel® Registration Center or email

Training and Documentation

Online Training: The online training site is an excellent resource for training materials on Parallel Studio XE tools.
Product Code Samples: Site with samples for Intel® software tools. Filter based on the tools you have installed to see the available samples.
Intel® Parallel Studio XE Release Notes

Contains information about this release of the product. This document includes the following sections:

  • Introduction
  • Product Contents
  • What's New
  • System Requirements
  • Installation Notes
  • Documentation
  • Issues and Limitations
  • Technical Support
  • Legal Information

Release Notes are installed at:


Intel® Parallel Studio XE Cluster Edition Installation Guide

Describes the installation process for Intel® Parallel Studio XE. The document is installed at:


Documentation folder


Folder with all the documentation for the Intel Parallel Studio XE tools. Review the subfolders to see all the available documentation for the installed tools.

Evaluate Intel Parallel Studio XE: Product tour with videos and samples. You can download step-by-step guides.
Intel® Software Documentation Library (Intel® SDL)

This is the online documentation library for Intel software products.

Disclaimer and Legal Information

Optimization Notice
Intel's compilers may or may not optimize to the same degree for non-Intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice. Notice revision #20110804

Cilk, Intel, the Intel logo, Intel Xeon Phi, VTune and Xeon are trademarks of Intel Corporation in the U.S. and/or other countries.
* Other names and brands may be claimed as the property of others.
© 2016 Intel Corporation.