Implement Hardware-Efficient Real Partial-Systolic QR Decomposition

This example shows how to implement a hardware-efficient QR decomposition using the Real Partial-Systolic QR Decomposition block.

Economy Size QR Decomposition

The Real Partial-Systolic QR Decomposition block performs the first step of solving the least-squares matrix equation AX = B: it transforms A in-place to R and B in-place to C = Q'B, where QR is the orthogonal-triangular decomposition of A. The second step is to solve the transformed system RX = C.
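
The two steps can be sketched in double precision with the built-in qr function. This is only a floating-point reference for the math, not the block's fixed-point algorithm; the sizes and the random test data below are illustrative assumptions.

```matlab
% Double-precision sketch of least squares via QR (illustrative data).
rng('default')
m = 10; n = 3; p = 2;
A = 2*rand(m,n) - 1;      % tall matrix with entries in [-1, 1]
B = 2*rand(m,p) - 1;      % right-hand sides

[Q,R] = qr(A,0);          % economy-size QR: Q is m-by-n, R is n-by-n
C = Q'*B;                 % step 1: transform B to C = Q'B
X = R\C;                  % step 2: solve the triangular system RX = C

X_ref = A\B;              % MATLAB least-squares solution for comparison
err = norm(X - X_ref)/norm(X_ref)
```

The value of err should be close to double-precision roundoff, which confirms that solving RX = C gives the same least-squares solution as solving AX = B directly.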

To compute the stand-alone QR decomposition, this example sets B to be the identity matrix so that the output of the Real Partial-Systolic QR Decomposition block is the upper-triangular R and C = Q'.
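
A minimal double-precision sketch of why this works, again using the built-in qr function as a stand-in for the block and random data as an assumption: multiplying Q' by the identity leaves it unchanged, so C' recovers Q.

```matlab
% With B = I, the transformed right-hand side C equals Q'.
rng('default')
m = 10; n = 3;
A = 2*rand(m,n) - 1;      % illustrative full-rank matrix
B = eye(m);

[Q,R] = qr(A,0);          % double-precision reference factors
C = Q'*B;                 % n-by-m

diff_CQ = norm(C - Q')                  % zero or roundoff: C is Q'
recon   = norm(C'*R - A)/norm(A)        % C'*R reconstructs A
```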

Define Matrix Dimensions

Specify the number of rows in matrices A and B, the number of columns in matrix A, and the number of columns in matrix B. This example sets B to be the m-by-m identity matrix, where m is the number of rows of A.

m = 10; % Number of rows in matrices A and B
n = 3;  % Number of columns in matrix A
p = m;  % Number of columns in matrix B

Generate Matrices A and B

Use the helper function realUniformRandomArray to generate a random matrix A such that the elements of A are between -1 and +1, and A is full rank. Matrix B is the identity matrix.
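
The implementation of the helper function is not shown here. As a rough stand-in, assuming only the properties stated above, the following sketch generates a matrix with rand and checks both properties.

```matlab
% Stand-in for the helper (an assumption, not its actual implementation):
% uniform random entries in [-1, 1].
rng('default')
m = 10; n = 3;
A = 2*rand(m,n) - 1;

inRange  = all(abs(A(:)) <= 1)   % every element lies in [-1, 1]
fullRank = (rank(A) == n)        % A has full column rank
```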

rng('default')
A = fixed.example.realUniformRandomArray(-1,1,m,n);
B = eye(m);

Select Fixed-Point Data Types

Use the helper function qrFixedpointTypes to select fixed-point data types for matrices A and B that guarantee no overflow will occur in the transformation of A in-place to R and B in-place to C = Q'B.

max_abs_A = 1;  % Upper bound on max(abs(A(:)))
max_abs_B = 1;  % Upper bound on max(abs(B(:)))
precisionBits = 24;  % Number of bits of precision
T = fixed.qrFixedpointTypes(m,max_abs_A,max_abs_B,precisionBits);
A = cast(A,'like',T.A);
B = cast(B,'like',T.B);
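
The types need headroom because the elements of R and C can grow beyond the input range. Since A = QR and Q has orthonormal columns, each column of R has the same 2-norm as the corresponding column of A, so abs(R(i,j)) <= sqrt(m)*max(abs(A(:))). The same argument bounds C = Q'B by sqrt(m)*max(abs(B(:))). The sketch below checks these bounds in double precision with the built-in qr function. It illustrates the growth only; it is not the rule that fixed.qrFixedpointTypes applies, which may include additional margin.

```matlab
% Check the sqrt(m) growth bounds on R and C = Q'B in double precision.
rng('default')
m = 10; n = 3;
max_abs_A = 1; max_abs_B = 1;
A = 2*rand(m,n) - 1;             % illustrative data in [-1, 1]
B = eye(m);

[Q,R] = qr(A,0);
C = Q'*B;

bound_R  = sqrt(m)*max_abs_A;    % bound on abs(R(i,j))
bound_C  = sqrt(m)*max_abs_B;    % bound on abs(C(i,j))
growth_R = max(abs(R(:)))
growth_C = max(abs(C(:)))
withinBounds = (growth_R <= bound_R) && (growth_C <= bound_C)
```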

Open the Model

model = 'RealPartialSystolicQRModel';
open_system(model);

The Data Handler subsystem in this model takes real matrices A and B as inputs. It sends rows of A and B to the QR block using the AMBA AXI handshake protocol. The validIn signal indicates when data is available. The ready signal indicates that the block can accept the data. Transfer of data occurs only when both the validIn and ready signals are high. To emulate the processing time of an upstream block, you can set a delay between feeding in rows of A and B in the Data Handler. When rowDelay is set to 0, validIn remains high because the Data Handler always has data available.
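
The handshake rule can be illustrated with a small cycle-by-cycle loop. This is a hypothetical behavioral sketch, not the Data Handler implementation: the ready pattern, the number of rows, and the variable names idleLeft and transferCycles are assumptions made for illustration.

```matlab
% Hypothetical cycle-by-cycle model of a valid/ready handshake.
numRows  = 4;                                   % rows to send (illustrative)
rowDelay = 1;                                   % idle cycles between rows
ready    = logical([1 1 0 1 1 1 1 1 1 1 1 1]);  % assumed downstream pattern

rowsSent = 0;
idleLeft = 0;             % idle cycles remaining before the next row is available
transferCycles = [];      % cycles in which a row is actually transferred
for k = 1:numel(ready)
    % Data is available once the delay has elapsed and rows remain.
    validIn = (idleLeft == 0) && (rowsSent < numRows);
    if validIn && ready(k)
        % Transfer happens only when validIn and ready are both high.
        rowsSent = rowsSent + 1;
        transferCycles(end+1) = k;
        idleLeft = rowDelay;
    elseif idleLeft > 0
        idleLeft = idleLeft - 1;
    end
end
transferCycles
```

In cycle 3 the sender has a row available but ready is low, so that row waits and is transferred in cycle 4. Setting rowDelay to 0 in this sketch keeps validIn high until all rows are sent.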

Set Variables in the Model Workspace

Use the helper function setModelWorkspace to add the variables defined above to the model workspace. These variables correspond to the block parameters for the Real Partial-Systolic QR Decomposition block.

numSamples = 1; % Number of sample matrices
rowDelay = 1; % Delay in clock cycles between feeding in rows of A and B
fixed.example.setModelWorkspace(model,'A',A,'B',B,'m',m,'n',n,'p',p,...
    'numSamples',numSamples,'rowDelay',rowDelay);

Simulate the Model

out = sim(model);

Construct the Solution from the Output Data

The Real Partial-Systolic QR Decomposition block outputs matrices R and C at each time step. When valid result matrices are output, the block sets validOut to true.

R = out.R;
C = out.C;

Extract the Economy-Size Q

The block computes C = Q'B. In this example, B is the identity matrix, so Q = C' is the economy-size orthogonal factor of the QR decomposition.

Q = C';

Verify That Q is Orthogonal and R is Upper-Triangular

The columns of the economy-size Q are orthonormal, so Q'Q is the identity matrix within roundoff.

I = Q'*Q

I = 

    1.0000   -0.0000   -0.0000
   -0.0000    1.0000   -0.0000
   -0.0000   -0.0000    1.0000

          DataTypeMode: Fixed-point: binary point scaling
            Signedness: Signed
            WordLength: 62
        FractionLength: 48

R is an upper-triangular matrix.

R = 

    2.2180    0.8559   -0.5607
         0    2.0578   -0.4017
         0         0    1.7117

          DataTypeMode: Fixed-point: binary point scaling
            Signedness: Signed
            WordLength: 29
        FractionLength: 24

isequal(R,triu(R))

ans =

  logical

   1

Verify the Accuracy of the Output

To evaluate the accuracy of the Real Partial-Systolic QR Decomposition block, compute the relative error of the reconstruction Q*R with respect to A.

relative_error = norm(double(Q*R - A))/norm(double(A))

relative_error =

   1.5208e-06
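
To compare R itself against a floating-point reference, note that QR factors are unique only up to the signs of the rows of R. The R displayed above has a positive diagonal, while the built-in qr function can return negative diagonal entries. The following sketch, which uses random double-precision data as a stand-in for the example's A, normalizes the reference factors so that the diagonal of R is nonnegative.

```matlab
% Normalize built-in QR factors so that diag(R_ref) >= 0.
rng('default')
m = 10; n = 3;
A = 2*rand(m,n) - 1;          % stand-in for double(A) from the example

[Q_ref,R_ref] = qr(A,0);
d = sign(diag(R_ref));        % n-by-1 vector of +1/-1
d(d == 0) = 1;                % guard against an exactly zero diagonal entry
R_ref = d.*R_ref;             % flip rows of R_ref
Q_ref = Q_ref.*d';            % flip matching columns; Q_ref*R_ref is unchanged

recon_error = norm(Q_ref*R_ref - A)/norm(A)
```

With A taken from the example instead of the stand-in, an expression such as norm(double(R) - R_ref)/norm(R_ref) would then measure the fixed-point error in R directly, assuming the block output follows the same sign convention.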

Suppress mlint warnings.

%#ok<*NOPTS>