Performance

Precision and Reproducibility in Computing

Posted on November 16, 2015November 23, 2015 by Paul Shirkey

Run-to-run reproducibility in computing is often assumed as an obvious truth. However software running on modern computer architectures, among many other processes, particularly when coupled with advanced performance-optimized libraries, is often only guaranteed to produce reproducible results only up to a certain precision; beyond that results can and do vary run-to-run. Reproducibility is inte...
Read More

NMath Premium: FFT Performance

Posted on May 28, 2013March 1, 2016 by Paul Shirkey

NMath Premium is our new GPU-accelerated math and statistics library for the .NET platform. The supported NVIDIA GPU routines include both a range of dense linear algebra algorithms and 1D and 2D Fast Fourier Transforms (FFTs). NMath Premium is designed to be a near drop-in replacement for NMath, however there are a few important differences and additional logging capabilities that are specific ...
Read More

Clearing a vector

Posted on November 9, 2011March 1, 2016 by Trevor Misfeldt

A customer recently asked us for the best method to zero out a vector. We decided to run some tests to find out. Here are the five methods we tried followed by performance timing and any drawbacks. The following tests were performed on a DoubleVector of length 10,000,000. 1) Create a new vector. This isn't really clearing out an existing vector but we thought we should include it for complet...
Read More

Initializing NMath

Posted on November 9, 2011July 8, 2013 by Trevor Misfeldt

NMath uses Intel's Math Kernel Library (MKL) internally. This code contains native, optimized code to wring out the best performance possible. There is a one-time delay when the appropriate x86 or x64 native code is loaded. This cost can be easily controlled by the developer by using the NMathKernel.Init() method. Please see Initializing NMath for more details. - Trevor
Read More

Forward Scaling Computing

Posted on January 28, 2010March 2, 2016 by Paul Shirkey

Forward Scaling for Multicore Performance The era of sequential, single-threaded software development deployed to a uniprocessor machine is rapidly fading into history. Nearly all computers sold today have at least two, if not four cores - and will have eight in the near future. Intel announced last month the successful production and testing of a new 48-core research processor which will be ma...
Read More

Top