0
Not a bug

ODE's benchmark is slower with MCT

Michal Kvasnicka 6 year бұрын updated by dattorro 1 year бұрын 8

Hello, I just found strange behavior (bug?) of MATLAB R2018a (update 3) with latest version of MCT on the Path (Ubuntu Linux 16.04, 64bit).


If I run the bench() function with MCT on the path I get the following results:


>> bench(5)

ans =

    0.0743    0.1146    0.0462    0.0832    0.4225    0.5393
    0.0750    0.1108    0.0462    0.0845    0.3682    0.4330
    0.0739    0.1176    0.0521    0.0895    0.4052    0.4539
    0.0785    0.1275    0.0473    0.0826    0.3355    0.4691
    0.0744    0.1155    0.0464    0.1066    0.3874    0.4228


The same benchmark run without MCT on the path produce the following results:

>> bench(5)

ans =

    0.0745    0.1184    0.0130    0.0839    0.4043    0.5116
    0.0798    0.1139    0.0133    0.0834    0.3796    0.4141
    0.0809    0.1275    0.0141    0.1051    0.4728    0.4096
    0.0850    0.1182    0.0132    0.0839    0.3576    0.4142
    0.0884    0.1132    0.0134    0.0859    0.4168    0.4460

As you can see, the 3rd column (corresponding to the ODE's part of MATLAB benchmark) shows significantly (~4x) slower runs in a case of MCT on the path.


Could you clarify and/or reproduce this situation, please?

Answer

Answer
Not a bug

Hello Michal,


Toolbox overloads standard MATLAB routines to create elementary arrays - zeros, ones, eye, etc.

This is needed to support 'mp' type as argument in such functions, e.g. eye(100,'mp'). 


If requested type is not 'mp' then toolbox dispatches the call to built-in MATLAB functions (see arrayCreationOverload.m  for more details). Dispatch is quick but I expect it to be slower than direct call to double-precision functions.

MATLAB picks up toolbox overload when it is on the path.


Probably this is what causing slow-down in your tests.

Answer
Not a bug

Hello Michal,


Toolbox overloads standard MATLAB routines to create elementary arrays - zeros, ones, eye, etc.

This is needed to support 'mp' type as argument in such functions, e.g. eye(100,'mp'). 


If requested type is not 'mp' then toolbox dispatches the call to built-in MATLAB functions (see arrayCreationOverload.m  for more details). Dispatch is quick but I expect it to be slower than direct call to double-precision functions.

MATLAB picks up toolbox overload when it is on the path.


Probably this is what causing slow-down in your tests.

I am afraid that you are not right in this case. Please extract from standard matlab function bench.m the following content to the file bench_ode.m



--------------------------bench_ode.m-------------------------------------

function [t, n] = bench_ode
% ODE. van der Pol equation, mu = 1
F = @vanderpol;
y0 = [2; 0];
tspan = [0 eps];
[s,y] = ode45(F,tspan,y0);  %#ok Used  to preallocate s and  y   
tspan = [0 450];
n = tspan(end);
tic
[s,y] = ode45(F,tspan,y0); %#ok Results not used -- strictly for timing
t = toc;
end


function dydt = vanderpol(~,y)
%VANDERPOL  Evaluate the van der Pol ODEs for mu = 1
dydt = [y(2); (1-y(1)^2)*y(2)-y(1)];
end

-----------------------------------------------------------------------------------


In a case of MCT on the path:

>> time = bench_ode

time =

    0.0775

 

but in a case without MCT on the path:

>> time = bench_ode

time =

    0.0238

MCT has some slowdown impact on ODE's functionality in general ... This is not acceptable behavior of presence MCT on path! So, I think that this behavior is a bug.

The ode45 is m-script, it creates a lot of elementary arrays inside - which are dispatched through toolbox overloads (when it is on the path). That is why ode45 is slower.

Run ode45 through performance profiler and check where bottleneck is.


In general toolbox treats ODEs the same way as all other functionality - toolbox doesn't do any intrusive actions (e.g. replace standard scripts or else). MATLAB just picks up its overloads when it is on the path.

I did not observed this problem with any other matlab algorithms (linear algebra, FFT, statistics, etc.). Degradation of ODE's algorithm is so significant (benchmark problem is only very simple example to show this problem in natura), that  is really not acceptable.


Finally, to avoid this problem, you propose only remove MCT from the path? Am I right?  But this is off/on solution. In a case of active MCT user will be always suffered by significant ODE's slowdown.

@"I did not observed this problem with any other matlab algorithms (linear algebra, FFT, statistics, etc.)."

All of these in MATLAB are implemented directly in compiled languages C++/FORTRAN - that is why there is no difference.


On the contrary, ODE-related functionality is implemented in M-language with all the consequences for speed.


@"Finally, to avoid this problem, you propose only remove MCT from the path? Am I right?"

I do not create MATLAB's design rules and constraints. I just follow their rules as we all do.

I would be happy if someone would enlighten me with better workarounds.



Initially MATLAB was designed as procedural language, and OOP constructs were imposed later on to catch with modern trends. As a result, OOP support in MATLAB is very clumsy, slow and not flexible.

For those who are designing a machine (hardware) specifically to exceed Matlab's bench() function, they need to know that merely having Advanpix in Matlab Path causes 2X slowdown in R2023a.  Solution is to take Advanpix out of Path while building.                    

                                                          ADVANPIX NOT IN MATLAB PATH

Image 44


                                                         ADVANPIX IN MATLAB PATH

Image 43

____________________________________________________________________________

There is a second 2X slowdown caused by using Intel (Matlab R2023a default) BLAS and LAPACK on an AMD CPU.

Solution is to install AOCL from AMD:  amd.com/en/developer/aocl.html


Then, in Windows Environment:
_______________________________________
add to Path:
   C:\Program Files\AMD\AOCL-Windows\amd-blis\lib\ILP64 
add System Variable:
   BLAS_VERSION = AOCL-LibBlis-Win-MT-dll.dll

add to Path:
   C:\Program Files\AMD\AOCL-Windows\amd-libflame\lib\ILP64
add System Variable:
   LAPACK_VERSION = AOCL-LibFLAME-Win-MT-dll.dll

_______________________________________

("I" in front of "LP64" identifies 64-bit OS.)

You will know if you did it right when first time invoking bench() echoes AMD BLAS and LAPACK load.

Jon Dattorro

To check which BLAS and LAPACK are loaded, 

at Matlab command prompt enter:

version -blas

version -lapack


A workaround, to avert 2X slowdown in purely double precision programs, is to delete Advanpix from Matlab Path permanently.


In programs requiring Advanpix MCT, lead off with Matlab command:
   addpath('C:\Program Files\Advanpix');
which remains valid for current session.

For purely double precision programs, lead off with Matlab commands:
   warning('off', 'MATLAB:rmpath:DirNotFound');
   rmpath('C:\Program Files\Advanpix');

Jon Dattorro