Friday, December 12, 2014

Digest for comp.programming.threads@googlegroups.com - 3 updates in 3 topics

Ramine <ramine@1.1>: Dec 11 04:50PM -0800

I have correcte some typos, please read again...
 
Hello,
 
 
 
If you have noticed i have just come up with a new algorithm of my
Parallel conjugate gradient solver library that is NUMA aware, this is
really a big improvement over my previous algorithm,my new algorithm
contains two parts that are the most expensive, and those two parts are:
a vector multiplication by a transpose of a matrix, and a vector
multiplication by a matrix, but when i have parallelized my previous
algorithm, i have parallelized just the memory data cache transfer from
the L2 cache-line hit to the CPU that costs around 10 CPU cycles for
every double type, and i have parallelized also the multiplication of
two doubles and addition of two doubles, but this was not enough, cause
what we have to do also is parallelize the memory data transfers from
the memory to the L2 cache , and this is what we call a NUMA aware
algorithm that really scale on NUMA architecture, and this is what i
have done in my new algorithm, the memory data transfers from memory to
the L2 cache was also parallelized and this have made my new algorithm
NUMA aware and really scalable on NUMA architecture. But to become NUMA
aware you need also to allocate memory of the arrays of your Matrix in
different NUMA nodes, but that's easy to do.
 
My Parallel conjugate gradient solver library supports dense matrices
and it is a library that solves linear system of equations and also it
solves large and very large dense linear system of equations.
 
 
You can download Parallel conjugate gradient solver library from:
 
 
https://sites.google.com/site/aminer68/parallel-implementation-of-conjugate-gradient-linear-system-solver
 
 
Thank you,
Amine Moulay Ramdane.
Ramine <ramine@1.1>: Dec 11 04:46PM -0800

Hello,
 
If you have noticed i have just come up with a new algorithm of
my Parallel conjugate gradient solver library that is NUMA aware,
this is really a big improvement over my previous algorithm,
my new algorithm contains two parst that are the most expensive,
and those teo parts are: a vector multiplication by a transpose of a
matrix, and a vector multiplication by a matrix, but when i have
parallelized my previous algorithm, i have parallelized just the memory
data cache transfer from the L2 cache-line hit to the CPU that costs
around 10 CPU cycles for every double type, and i have parallelized also
the multiplication of two doubles and addition of two doubles,
but this was not enough, cause what we have to do also is parallelize
the memory data transfers from the memory to the L2 cache ,
and this is what we call a NUMA aware algorithm that really scale on
NUMA architecture, and this is what i have done in my new algorithm,
the memory data transfers from memory to the L2 cache was also
parallelized and this have made my new algorithm NUMA aware and really
scalable on NUMA architecture. But to become NUMA aware you need also
to allocate memory of the arrays of your Matrix in different NUMA nodes,
but that's easy to do.
 
My Parallel conjugate gradient solver library supports dense matrices
and it is a library that solves linear system of equations and also it
solves large and very large dense linear system of equations.
 
 
You can download Parallel conjugate gradient solver library from:
 
 
https://sites.google.com/site/aminer68/parallel-implementation-of-conjugate-gradient-linear-system-solver
 
 
Thank you,
Amine Moulay Ramdane.
Ramine <ramine@1.1>: Dec 11 03:34PM -0800

Hello,
 
 
Parallel conjugate gradient solver library version 1.04 is here...
 
 
I have updated my parallel conjugate gradient solver library to version
1.04, i have just parallelized my algorithm so that it becomes NUMA
aware so that the memory transfers have become parallelized on NUMA
architecture , so this will make my algorithm really scalable on NUMA
architecture. But to become NUMA aware you have to allocate memory of
the arrays of your Matrix in different NUMA nodes and that's easy to do.
 
Also i have ported my Parallel conjugate gradient solver library to
Delphi XE.
 
You can download my Parallel conjugate gradient solver library from:
 
https://sites.google.com/site/aminer68/parallel-implementation-of-conjugate-gradient-linear-system-solver
 
 
Thank you,
Amine Moulay Ramdane.
You received this digest because you're subscribed to updates for this group. You can change your settings on the group membership page.
To unsubscribe from this group and stop receiving emails from it send an email to comp.programming.threads+unsubscribe@googlegroups.com.

No comments: