backslash
Backslash, applied to an overdetermined system, is expected deterministic; i.e, no variability in solution.
But repeated calls to backslash, for data vL provided in the link, is variable and of infinite dynamic range.
clear all load vL % vL = double(vL); % Amat = double(Amat); xi = Amat\vL; %backslash error is nonzero when differenced with same backslash later on for i=1:10 notabug = Amat\vL; sum(abs(xi - notabug)) end
ans = 0
ans = 0
ans = 0
ans = 3.312601991721914126549721394248086e-06
ans = 3.271854404182205550123018444983506e-05
ans = 0
ans = 8.904811628089073569772503832159206e-06
ans = 8.904811628089073569772503832159206e-06
ans = 8.904811628089073569772503832159206e-06
ans = 3.312601991721914126549721394248086e-06
Uncommenting the double cast always produces ans=0 but reveals Amat to be an overdetermined rank deficient matrix.
MATLAB Version: 9.14.0.2489007 (R2023a) Update 6
Operating System: Microsoft Windows 10 Pro Version 10.0 (Build 19043)
CPU: AMD 3995WX
Advanpix Multiprecision Computing Toolbox Version 5.2.8.15537
Customer support service by UserEcho
Jon, thank you very much for the report!
Generally, some degree of variability of the solution is to be expected (due to matrix conditioning, multi-threading, flow of divide-and-conquer/bulge chasing computations, etc.).
The condition number of a matrix is log10(cond(Amat)) = ~21, roughly meaning that last ~21 digits are expected to be lost during computation of the solution.
However, even if we use extended precision, the last ~21 digits of the computed solution are not accurate (=can vary every time we solve the system). What counts is the relative accuracy of the system delivered by the solution.
I added few changes to your script:
Here is its results for precisions: 34, 50, 100 and 500 decimal digits.
First column is the system accuracy w.r.t current solution vector, second - current solution vector difference from the first one. Lastly in square brackets we have the current solution vector element with max difference from the first vector and its relative difference.
The relative accuracy of all the solutions are the same (1.55559e-15), this is clearly the maximum achievable accuracy for the system.
The solution vectors differ from the first one by last ~21 decimal digits. This coincides with the matrix conditioning (as we expected). The solution vector elements with max. difference also differs by the same (very small) margin from the first vector.
Overall, this perfectly matches the expected behavior of matrix solution.
***
Could you please send me references to MATLAB's requirements for solutions of overdetermined system to be bit-by-bit deterministic? In this particular case the MATLAB doesn't use any advanced QR algorithms/multithreading and probably this leads to exactly the same solution each time. But, mathematically it is not guaranteed or expected/justified.
In toolbox we use advanced algorithms for the case (multi-threadfinfg + D&C + bulge chasing with different execution paths each time). Nonetheless the solutions provide the same relative accuracy and differ from each other by expected margin (which is log10(cond(A)) = ~21 digits). This is how it is expected to behave. Let me know your comments.
Accuracy (precision) of solution is not in question. To say that two different multiprecision solutions are equivalent, because they fall within some calculable error tolerance, does not address the reproducibility problem at issue. Mathworks certainly hasn't asserted that a small neighborhood of linear system solutions are equivalent. One could indeed ask: Where does Matlab documentation state that linear system solution by mldivide() is NOT bit by bit deterministic? The fact that double precision solution (of an overdetermined rank deficient linear system) is reproducible leads to an expectation of multiprecision reproducibility for MCT.
MCT can indeed produce an identical result (for the overdetermined rank deficient system in my seminal post) on a subsequent run for the very same problem:
ans = 3.312601991721914126549721394248086e-06 % different results
ans = 0 % identical results
This means that every neighborhood of equivalent solutions must include {0}. "...multi-threading, flow of divide-and-conquer/bulge chasing computations" implies that MCT may have inadvertently introduced stochastics into the mldivide algorithm. Nondeterministic MCT behavior implies departure from Matlab algorithms.
MATLAB’s mldivide is expected deterministic; reproducible in floating point computations under identical conditions. Repeated runs of the same problem in the same environment will yield the same bitwise result unless something in the environment changes; e.g, CPU type, number of threads, BLAS/LAPACK library, or MATLAB release:
mldivide (backslash)
https://www.mathworks.com/help/matlab/ref/mldivide.html
(From “Algorithms” and “Tips” sections):
Although MathWorks does not explicitly say “bitwise identical” in that reference, it emphasizes (under the same hardware and same MATLAB version) that mldivide provides reproducible results; i.e, the same result each time.
Floating Point Results Between Computers
https://www.mathworks.com/support/tech-notes/1100/1108.html
Moore-Penrose Pseudoinverse (pinv)
https://www.mathworks.com/help/matlab/ref/pinv.html
Determinism
x = A\b;
for the same matrix A and vector b, on the same hardware, with the same version of MATLAB, and the same parallelization/threading settings, the results are bit for bit identical each time.As long as all conditions remain fixed, the backslash operator is deterministic for the same input problem—even in the overdetermined rank deficient case—and will yield the exact same solution for each call.
Jon, Happy New Year!
We have just released new version 5.3.6.15902: https://bit.ly/3mvxXqg
Now, its least-squares solver generates bitwise deterministic results every time.
Would appreciate it if you would update me on your test results.
I think with more advanced algorithms and many-core CPUs this will change in the future.
For example. Even though we use the same number of threads in a solver - it might split the overall work in different number of jobs and distribute them differently among cores on each call. This might happen because some CPU cores might be busy with other processes on the computer. More flexible/adaptive algorithms take this into account (as we did in least-square solver before 5.3.6).
Thank you for all your support and contributions!
Best wishes for the New Year!
Pavel