FFLAS Matrix multiplication

The core of the FFLAS library is the matrix matrix multiplication routine. It's main characteristics are

The following left figure shows the improvement provided by the BLAS (curve 3 and 4) despite the cost of the conversions to double with comparison to a matrix multiplication without conversion (curve 2). It also shows that the performances over finite fields tends asymptotically to the performances over double (curve 1).
The right figure emphasize the gain of Winograd's fast matrix multiplication algorithm compared to the O(n³) classic algorithm:

Conversion Gain

The combination of these two tunings leads to pretty good performances: for example the speed of 4251.08 Mfops is attained, on a P4-2.4 Ghz for the multiplication of two dense matrices of size 4000*4000 over the finite field with 101 elements (this speed correspond to the number of millions of operations per second that would require a classic matrix-multiplication to compute the same result in the same time).


Jean-Guillaume Dumas

[Contacts]

Please submit your questions, sugestions and bug reports to the discussion group ffpack-devel.

Clément PERNET
Laboratoire de Modélisation et Calcul
B.P. 53 -- 51, av. des Mathématiques,
38041 Grenoble, France.
Clement.Pernet@imag.fr
http://ljk.imag.fr/membres/Clement.Pernet
Pascal Giorgi
Laboratoire de l'Informatique et du parallèlisme,
LIP-ENS Lyon.
46, allée d'Italie,
69364
Pascal.Giorgi@ens-lyon.fr
http://perso.ens-lyon.fr/pascal.giorgi
Jean-Guillaume DUMAS
Laboratoire de Modélisation et Calcul
B.P. 53 -- 51, av. des Mathématiques,
38041 Grenoble, France
Jean-Guillaume.Dumas@imag.fr
http://ljk.imag.fr/membres/Jean-Guillaume.Dumas
Thierry GAUTIER
Projet INRIA-APACHE, Laboratoire Informatique et Distribution
ZIRST - 51, av Jean Kuntzmann
38330 Montbonnot Saint-Martin, France.
Thierry.Gautier@inrialpes.fr
http://www-id.imag.fr/~gautier

 

 




Valid HTML 4.01!