Index

Altix architecture overview
Altix 3000 series systems
Altix 3000 Series Systems
Altix 4000 series systems
Altix 4000 Series Systems

Amdahl's law
Understanding Parallel Speedup and Amdahl's Law
execution time given n and p
Predicting Execution Time with n CPUs
parallel fraction p
Understanding Amdahl's Law
parallel fraction p given speedup( n )
Calculating the Parallel Fraction of a Program
speedup( n ) given p
Understanding Amdahl's Law
superlinear speedup
Understanding Superlinear Speedup

analysis
system configuration
Determining System Configuration

application tuning process
Performance Analysis and Debugging

automatic parallelization
limitations
Use Compiler Options

cache bank conflicts
Tuning the Cache Performance

cache performance
Tuning the Cache Performance

cacheline traffic and CPU utilization
pmshub(1) Command

cluster environment
Scalable Computing

commands
dlook
Using the dlook Command
dplace
Using the dplace Command
topology
topology(1) Command

common compiler options
Compiler Overview

compiler command line
Compiler Overview

compiler libaries
C/C++
C/C++ Libraries
dynamic libraries
Dynamic Libraries
message passing
SHMEM Message Passing Libraries
overview
Library Overview

compiler libraries
static libraries
Static Libraries

compiler options
tracing and porting
Getting the Correct Results

compiler options for tuning
Using Compiler Options Where Possible

compiling environment
The SGI Compiling Environment
compiler overview
Compiler Overview
debugger overview
Other Compiling Environment Features
libraries
Library Overview
modules
Modules

CPU utilization
pmshub(1) Command

CPU-bound processes
Sources of Performance Problems

csrep command
histx Filters

data decomposition
Data Decomposition

data dependency
Identifying Parallel Opportunities in Existing Code

data parallelism
Data Decomposition

data placement tools
Data Placement Tools
cpusets
Data Placement Tools Overview
dplace
Data Placement Tools Overview
overview
Data Placement Tools Overview
taskset
Data Placement Tools Overview

debugger overview
Other Compiling Environment Features

debuggers
Debugging Tools
gdb
Other Compiling Environment Features
idb
Other Compiling Environment Features
TotalView
Other Compiling Environment Features

denormalized arithmetic
Compiler Overview

determining parallel code amount
Parallelizing Your Code

determining tuning needs
tools used
Determining Tuning Needs

dlook command
Using the dlook Command

dplace command
Using the dplace Command

dumppm
histx Filters

Electric Fence debugger
Managing Heap Corruption Problems

Environment variables
Environment Variables for Performance Tuning

explicit data decomposition
Data Decomposition

False sharing
Fixing False Sharing

file limit resources
resetting
Resetting the File Limit Resource Default

Flexible File I/O (FFIO)
Multithreading Considerations
environment variables to set
Environment Variables
operation
FFIO Operation
overview
Flexible File I/O
simple examples
Simple Examples

floating-point programs
Floating-point Programs Performance

Floating-Point Software Assist
Floating-point Programs Performance

FPSWA
See Floating-Point Software Assist

functional parallelism
Data Decomposition

gdb tool
Debugging Tools

GNU debugger
Debugging Tools

gtopology command
gtopology(1) Command

GuideView tool
Using GuideView

histx
Using histx

histx data collection
histx Data Collection

histx filters
histx Filters

hwinfo command
hwinfo(1) Command

idb tool
Debugging Tools

implicit data decomposition
Data Decomposition

I/O-bound processes
Sources of Performance Problems

iostat command
System Usage Commands

iprep command
histx Filters

latency
Scalable Computing

limits
system
Resetting System Limits

linkstat command
linkstat(1) Command

lipfpm command
histx Data Collection

memory management
The Basics of Memory Management
Managing Memory

memory page
The Basics of Memory Management

memory strides
Tuning the Cache Performance

memory-bound processes
Sources of Performance Problems

Message Passing Toolkit
for parallelization
Use MPT
using profile.pl
profile.pl with MPI programs

modules
Modules
command examples
Modules

MPP definition
Scalable Computing

NUMA Tools
command
dlook
dlook Command
dplace
Using the dplace Command
installing
Installing NUMA Tools

OpenMP
Use OpenMP
environment variables
Environment Variables for Performance Tuning
Guide OpenMP Compiler
Other Performance Tools

parallel execution
Amdahl's law
Understanding Parallel Speedup and Amdahl's Law
parallel fraction p
Understanding Amdahl's Law

parallel speedup
Understanding Parallel Speedup

parallelization
automatic
Use Compiler Options
using MPI
Use MPT
using OpenMP
Use OpenMP

performance
Assure Thread Analyzer
Other Performance Tools
Guide OpenMP Compiler
Other Performance Tools
GuideView
Using GuideView
VTune
Using VTune for Remote Sampling

performance analysis
Performance Analysis and Debugging

Performance Co-Pilot monitoring tools
Performance Co-Pilot Monitoring Tools
linkstat
linkstat(1) Command
Other Performance Co-Pilot monitoring tools
Other Performance Co-Pilot Monitoring Tools
pmshub
pmshub(1) Command
shubstats
shubstats(1) Command

performance gains
types of
Performance Analysis and Debugging

performance problems
sources
Sources of Performance Problems

pfmon tool
Profiling with pfmon

pmshub command
pmshub(1) Command

process placement
Determining Process Placement
MPI and OpenMP
Combination Example (MPI and OpenMP)
set-up
Determining Process Placement
using OpenMP
Example Using OpenMP
using pthreads
Example Using pthreads

profile.pl script
Profiling with profile.pl

profiling
pfmon
Profiling with pfmon
profile.pl
Profiling with profile.pl

ps command
System Usage Commands

resetting default system stack size
Resetting the Default Stack Size

resetting file limit resources
Resetting the File Limit Resource Default

resetting system limit resources
Resetting System Limits

resident set size
The Basics of Memory Management

samppm command
histx Data Collection

sar command
System Usage Commands

scalable computing
Scalable Computing

SHMEM
SHMEM Message Passing Libraries

shortening execution time
Adding CPUs to Shorten Execution Time

shubstats command
shubstats(1) Command

SMP definition
Scalable Computing

stack size
resetting
Resetting the Default Stack Size

superlinear speedup
Understanding Superlinear Speedup

swap space
The Basics of Memory Management

system
overview
System Overview

system configuration
Determining System Configuration

system limit resources
resetting
Resetting System Limits

system limits
address space limit
Resetting System Limits
core file siz
Resetting System Limits
CPU time
Resetting System Limits
data size
Resetting System Limits
file locks
Resetting System Limits
file size
Resetting System Limits
locked-in-memory address space
Resetting System Limits
number of logins
Resetting System Limits
number of open files
Resetting System Limits
number of processes
Resetting System Limits
priority of user process
Resetting System Limits
resetting
Resetting System Limits
resident set size
Resetting System Limits
stack size
Resetting System Limits

system monitoring tools
Monitoring Tools
command
hwinfo
hwinfo(1) Command
topology
topology(1) Command

system usage commands
System Usage Commands
iostat
System Usage Commands
ps
System Usage Commands
sar
System Usage Commands
top
System Usage Commands
uptime
System Usage Commands
vmstat
System Usage Commands
w
System Usage Commands

taskset command
taskset Command

tools
Assure Thread Analyzer
Other Performance Tools
Guide OpenMP Compiler
Other Performance Tools
GuideView
Using GuideView
pfmon
Profiling with pfmon
profile.pl
Profiling with profile.pl
profile.pl with MPI programs
VTune
Using VTune for Remote Sampling

top command
System Usage Commands

topology command
topology(1) Command
topology(1) Command

tuning
cache performance
Tuning the Cache Performance
debugging tools
Electric Fence
Managing Heap Corruption Problems
idb
Debugging Tools
dplace
Using dplace and taskset
Electric Fence
Managing Heap Corruption Problems
environment variables
Environment Variables for Performance Tuning
false sharing
Fixing False Sharing
heap corruption
Managing Heap Corruption Problems
managing memory
Managing Memory
multiprocessor code
Multiprocessor Code Tuning
parallelization
Parallelizing Your Code
profiling
GuideView
Using GuideView
histx command
Using histx
mpirun command
profile.pl with MPI programs
pfmon
Profiling with pfmon
profile.pl script
Profiling with profile.pl
VTune analyzer
Using VTune for Remote Sampling
single processor code
Single Processor Code Tuning
using compiler options
Using Compiler Options Where Possible
using dplace
Using dplace and taskset
using math functions
Using Tuned Code
using taskset
Using dplace and taskset
verifying correct results
Getting the Correct Results

uname command
Determining System Configuration

unflow arithmetic
effects of
Compiler Overview

uptime command
System Usage Commands

virtual addressing
The Basics of Memory Management

virtual memory
The Basics of Memory Management

vmstat command
System Usage Commands

VTune performance analyzer
Using VTune for Remote Sampling

w command
System Usage Commands