

Enhancing the distance minimization methods of matrix updating within a homothetic paradigm

Abstract

Matrix updating methods are used for constructing a target matrix with prescribed row and column marginal totals that demonstrates the highest possible level of structural similarity to a given initial matrix. The concept of structural similarity has a rather vague framework that can be slightly refined by considering the particular case of strict proportionality between the row and column marginal totals of the target and initial matrices. Here the question arises: can we accept the homothety of the initial matrix as the optimal solution in the proportionality case of the matrix-updating problem? In most practical situations, an affirmative answer to this question is almost obvious. It is natural to call this common notion the homothetic paradigm and to refer to its checking as homothetic testing. Some well-known matrix updating methods serve as additional instrumental confirmation of the validity of the homothetic paradigm. It is shown that the RAS method and Kuroda's method pass the homothetic test successfully. The homothetic paradigm can be helpful for enhancing a collection of matrix updating methods based on constrained minimization of distance functions. Main attention is paid to improving the methods with weighted squared differences (both regular and relative) as an objective function. As an instance of an incorrigible failure of homothetic testing, the GRAS method for updating economic matrices with some negative entries is analyzed in detail. A collection of illustrative numerical examples and some recommendations for choosing a method are given.

1 An introduction to the matrix-updating problems

The general problem of updating a rectangular (or square) matrix can be formulated as follows. Let A be an initial matrix of dimension N × M with row and column marginal totals \({\mathbf{u}}_{{\mathbf{A}}} = {\mathbf{Ae}}_{M}\), \({\mathbf{v}}_{{\mathbf{A}}}^{\prime } = {\mathbf{e}}_{N}^{\prime } {\mathbf{A}}\), where \({\mathbf{e}}_{N}\) and \({\mathbf{e}}_{M}\) are the N × 1 and M × 1 summation column vectors with unit elements. Further, let \({\mathbf{u}} \ne {\mathbf{u}}_{{\mathbf{A}}}\) and \({\mathbf{v}} \ne {\mathbf{v}}_{{\mathbf{A}}}\) be exogenous column vectors of dimension N × 1 and M × 1, respectively. The problem is to estimate a target matrix X of dimension N × M at the highest possible level of structural similarity (or resemblance, likeness, closeness, etc.) to the initial matrix A subject to the N + M equality constraints

$${\mathbf{Xe}}_{M} = {\mathbf{u}},\quad {\mathbf{e}}_{N}^{\prime } {\mathbf{X}} = {\mathbf{v}}^{\prime } ,$$
(1)

and under the consistency condition

$${\mathbf{e}}_{N}^{\prime } {\mathbf{u}} = {\mathbf{e}}_{M}^{\prime } {\mathbf{v}}.$$
(2)

It is assumed that the initial matrix A does not include any zero rows or zero columns, has no fewer than N + M nonzero elements, does not include any rows or columns with a single nonzero element, and does not contain any pairs of rows and columns with four nonzero elements at their intersections. Otherwise, it is advisable to clean matrix A of these undesirable features before applying any matrix updating method in practice.

Clearly, the system of equations (1) is linearly dependent under the consistency condition (2), which ensures the existence of a target matrix X. However, it is easy to show that any N + M − 1 among the N + M constraints (1) are mutually independent. Furthermore, it is evident that any feasible solution X of the matrix-updating problem can be simply transformed into another one by letting, e.g.,

$$x_{nm}^{new} = x_{nm} + \varepsilon ,\;\;x_{nj}^{new} = x_{nj} - \varepsilon ;\quad x_{im}^{new} = x_{im} - \varepsilon ,\;\;x_{ij}^{new} = x_{ij} + \varepsilon$$

where ε is an arbitrary scalar, or

$$x_{nm}^{new} = x_{nm} + \varepsilon ,\;\;x_{nj}^{new} = x_{nj} - {\varepsilon \mathord{\left/ {\vphantom {\varepsilon 2}} \right. \kern-0pt} 2},\;\;x_{nk}^{new} = x_{nk} - {\varepsilon \mathord{\left/ {\vphantom {\varepsilon 2}} \right. \kern-0pt} 2};\quad x_{im}^{new} = x_{im} - \varepsilon ,\;\;x_{ij}^{new} = x_{ij} + {\varepsilon \mathord{\left/ {\vphantom {\varepsilon 2}} \right. \kern-0pt} 2},\;\;x_{ik}^{new} = x_{ik} + {\varepsilon \mathord{\left/ {\vphantom {\varepsilon 2}} \right. \kern-0pt} 2},$$

and so on.
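For concreteness, the following minimal sketch (in Python with NumPy, using an arbitrary placeholder matrix and arbitrarily chosen perturbation indices) verifies that the first ε-transformation above leaves both sets of marginal totals unchanged while producing a different feasible matrix.

```python
import numpy as np

# An arbitrary 3 x 4 matrix playing the role of a feasible solution (placeholder data)
X = np.array([[10.0, 4.0, 6.0, 2.0],
              [ 3.0, 8.0, 5.0, 7.0],
              [ 6.0, 2.0, 9.0, 4.0]])

u = X.sum(axis=1)                    # row marginal totals, X e_M
v = X.sum(axis=0)                    # column marginal totals, e_N' X
assert np.isclose(u.sum(), v.sum())  # consistency condition (2)

# Perturb the four entries at the intersections of rows n, i and columns m, j
# exactly as in the first transformation above; the marginals must stay unchanged.
eps, n, i, m, j = 0.5, 0, 1, 0, 2
X_new = X.copy()
X_new[n, m] += eps; X_new[n, j] -= eps
X_new[i, m] -= eps; X_new[i, j] += eps

assert np.allclose(X_new.sum(axis=1), u)
assert np.allclose(X_new.sum(axis=0), v)
```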

Thus, the general matrix-updating problem, which admits a great variety of feasible solutions, depends significantly on the definition of the measure of structural similarity between the initial and target matrices. Various definitions of this measure generate a great manifold of different methods and techniques for matrix updating. A quite common approach to defining the similarity measure is to use some objective distance function \(f\,\left( {{\mathbf{X}} - {\mathbf{A}}} \right)\) to be minimized subject to the linear constraints (1) under the consistency condition (2). The main purpose of this article is to enhance the methods of this kind by defining and applying a new algebraic concept named the homothetic paradigm.

2 A homothetic paradigm for the matrix updating methods

The notion of structural similarity between initial and target matrices has a vague framework that can be slightly refined in an axiomatic manner. In this context, let us consider the particular case of strict proportionality between the row and column marginal totals \({\mathbf{u}} = k{\mathbf{u}}_{{\mathbf{A}}}\) and \({\mathbf{v}} = k{\mathbf{v}}_{{\mathbf{A}}}\) of the target and initial matrices with the same scalar factor k. Here the main question arises: can we accept the matrix homothety \({\mathbf{X}} = k{\mathbf{A}}\) as the optimal solution in the proportionality case of the general matrix-updating problem? At first sight, this solution appears rather logical and consistent with the famous principle of insufficient reason (also known in decision theory as the Laplace criterion), because there is no way to explain convincingly why X should differ from \(k{\mathbf{A}}\). Moreover, it preserves in X the same location of zeros as in the initial matrix. However, it is to be emphasized that the above question seems neither simple nor evident, and its proposition cannot be proved formally.

Nevertheless, in most practical situations an affirmative answer to this question is almost obvious. In particular, as shown below, the well-known and widely used RAS and Kuroda's methods for matrix updating serve as additional instrumental confirmation of such an answer. In this connection, we will call this rather common notion the homothetic paradigm and will refer to examining the property "if \({\mathbf{u}} = k{\mathbf{u}}_{{\mathbf{A}}}\) and \({\mathbf{v}} = k{\mathbf{v}}_{{\mathbf{A}}}\) then \({\mathbf{X}} = k{\mathbf{A}}\)" as a homothetic test for matrix updating methods. It is advisable to propose that successfully passing the homothetic test be appreciated as a positive feature of any matrix updating method.

3 Homothetic testing of RAS method

The key idea of the RAS method is triple factorization of target matrix

$${\mathbf{X}} = {\mathbf{RAS}} = \left\langle {\mathbf{r}} \right\rangle {\mathbf{A}}\left\langle {\mathbf{s}} \right\rangle = {\hat{\mathbf{r}}}{\mathbf{A}} {\hat{\mathbf{s}}}$$
(3)

where r and s are unknown N × 1 and M × 1 column vectors. Here angled bracketing around a vector's symbol, or putting a "hat" over it, denotes a diagonal matrix with the vector on its main diagonal and zeros elsewhere (see Miller and Blair 2009, p. 697).

Putting (3) into (1), we have the system of nonlinear equations

$${\hat{\mathbf{r}}}{\mathbf{A}}{\hat{\mathbf{s}}}{\mathbf{e}}_{M} = {\hat{\mathbf{r}}}{\mathbf{A}} {\hat{\mathbf{s}}} = \left\langle {{\mathbf{As}}} \right\rangle {\mathbf{r}} = {\mathbf{u}},\quad {\mathbf{e}}_{N}^{\prime } {\hat{\mathbf{r}}}{\mathbf{A}} {\hat{\mathbf{s}}} = {\mathbf{r}}^{\prime } {\mathbf{A}}{\hat{\mathbf{s}}} = {\mathbf{s}}^{\prime } \left\langle {{\mathbf{A^{\prime}r}}} \right\rangle = {\mathbf{v}}^{\prime } .$$

Proper transformations of this system lead to the following pair of iterative processes:

$${\mathbf{r}}_{(i)} = \left\langle {{\mathbf{A}}\left\langle {{\mathbf{A}}^{\prime } {\mathbf{r}}_{(i - 1)} } \right\rangle^{ - 1} {\mathbf{v}}} \right\rangle^{ - 1} {\mathbf{u}},\;\;i = 1 \div I;\quad {\mathbf{s}}_{(I)} = \left\langle {{\mathbf{A}}^{\prime } {\mathbf{r}}_{(I)} } \right\rangle^{ - 1} {\mathbf{v}};$$
(4)
$${\mathbf{s}}_{(j)} = \left\langle {{\mathbf{A}}^{\prime } \left\langle {{\mathbf{As}}_{(j - 1)} } \right\rangle^{ - 1} {\mathbf{u}}} \right\rangle^{ - 1} {\mathbf{v}},\;\;j = 1 \div J;\quad {\mathbf{r}}_{(J)} = \left\langle {{\mathbf{As}}_{(J)} } \right\rangle^{ - 1} {\mathbf{u}}$$
(5)

where i and j are iteration numbers, and the character "÷" between the lower and upper bounds of an index's range means that the index sequentially runs through all integer values in the specified range.
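A minimal computational sketch of the iterative process (4), assuming NumPy and a small placeholder matrix; convergence control is simplified to a fixed iteration count, and the final assertion illustrates the one-step convergence in the homothetic test discussed next.

```python
import numpy as np

def ras(A, u, v, iters=100):
    """Biproportional (RAS) updating via the iterative process (4):
    r_(i) = <A <A' r_(i-1)>^{-1} v>^{-1} u, then s_(I) = <A' r_(I)>^{-1} v."""
    r = np.ones(A.shape[0])          # starting condition r_(0) = e_N
    for _ in range(iters):
        s = v / (A.T @ r)            # intermediate vector implied by the current r
        r = u / (A @ s)              # update r according to (4)
    s = v / (A.T @ r)
    return np.diag(r) @ A @ np.diag(s), r, s

# Homothetic test: with u = k*u_A and v = k*v_A, X = k*A is reproduced in one step.
A = np.array([[4.0, 2.0, 1.0],
              [3.0, 5.0, 2.0]])      # placeholder positive matrix
k = 3.0
X, r, s = ras(A, k * A.sum(axis=1), k * A.sum(axis=0), iters=1)
assert np.allclose(X, k * A)
```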

As concerns the homothetic test for the RAS method at \({\mathbf{u}} = k{\mathbf{u}}_{{\mathbf{A}}}\) and \({\mathbf{v}} = k{\mathbf{v}}_{{\mathbf{A}}}\), it can easily be shown that under the starting condition \({\mathbf{r}}_{(0)} = {\mathbf{e}}_{N}\) or \({\mathbf{s}}_{(0)} = {\mathbf{e}}_{M}\) the iterative process (4) or (5) demonstrates one-step convergence to the pair of vectors \({\mathbf{r}} = {\mathbf{e}}_{N}\), \({\mathbf{s}} = k{\mathbf{e}}_{M}\) or to \({\mathbf{r}} = k{\mathbf{e}}_{N}\), \({\mathbf{s}} = {\mathbf{e}}_{M}\), respectively. Hence, the RAS algorithm's implementation gives \(r_{n} s_{m} = k\) for any n and m, n = 1 ÷ N, m = 1 ÷ M, from which \({\mathbf{X}} = {\mathbf{rs}}^{\prime } \circ {\mathbf{A}} = k{\mathbf{A}}\), where the character "°" denotes Hadamard's (element-wise) product of two matrices of the same dimension. Besides, it is easy to see that replacing the initial matrix A with its homothety k A leaves the RAS iterations (4) and (5) invariant. Thus, the RAS method passes the homothetic test successfully.

For another formal proof of this fact, notice that the RAS method can be associated with a constrained minimization of the nonnegative function called the Kullback–Leibler divergence (see Kullback and Leibler 1951; Kullback 1959), which is used for comparing "true" and "test" probability distributions. This function is sometimes called "information gain," although it measures the information loss from using a "test" distribution to approximate a "true" distribution. The Kullback–Leibler divergence actually expresses the difference between the cross-entropy of two distributions (namely, in the order "true"–"test") and the entropy of the "true" distribution.

Let the “true” distribution be X/x, and let the “test” one be A/a, where \(a = {\mathbf{e}}_{N}^{\prime } {\mathbf{Ae}}_{M}\) and \(x = {\mathbf{e}}_{N}^{\prime } {\mathbf{Xe}}_{M} = {\mathbf{e}}_{N}^{\prime } {\mathbf{u}} = {\mathbf{e}}_{M}^{\prime } {\mathbf{v}}\). (So all elements of X and A are implied to be nonnegative.) In these denotations the Kullback–Leibler divergence can be written as

$$f_{KL} ({{\mathbf{X}} \mathord{\left/ {\vphantom {{\mathbf{X}} x}} \right. \kern-0pt} x};{{\mathbf{A}} \mathord{\left/ {\vphantom {{\mathbf{A}} a}} \right. \kern-0pt} a}) = \sum\limits_{n = 1}^{N} {\sum\limits_{m = 1}^{M} {\frac{{x_{nm} }}{x}\ln \left( {\frac{a}{x}\frac{{x_{nm} }}{{a_{nm} }}} \right)} } = \frac{1}{x}f_{KL} ({\mathbf{X}};{\mathbf{A}}) + \ln \frac{a}{x}$$
(6)

where \(f_{KL} ({\mathbf{X}};{\mathbf{A}})\) is corresponding Kullback–Leibler function for nonnormalized data. Note that values of \(f_{KL} ({{\mathbf{X}} \mathord{\left/ {\vphantom {{\mathbf{X}} x}} \right. \kern-0pt} x};{{\mathbf{A}} \mathord{\left/ {\vphantom {{\mathbf{A}} a}} \right. \kern-0pt} a})\) are dimensionless quantities, whereas \(f_{KL} ({\mathbf{X}};{\mathbf{A}})\) has dimension of x (or of X’s and A’s entries).

The function \(f_{KL} ({\mathbf{X}};{\mathbf{A}})\) for nonnormalized data often serves as an objective function in mathematical programming formulations of the RAS method instead of the function (6), e.g., in Appendix 7.1 "RAS as a Solution to the Constrained Minimum Information Distance Problem" of Miller and Blair (2009). This is possible due to the linear dependency (6) between \(f_{KL} ({{\mathbf{X}} \mathord{\left/ {\vphantom {{\mathbf{X}} x}} \right. \kern-0pt} x};{{\mathbf{A}} \mathord{\left/ {\vphantom {{\mathbf{A}} a}} \right. \kern-0pt} a})\) and \(f_{KL} ({\mathbf{X}};{\mathbf{A}})\) with fixed and known coefficients, so that the minimum of both functions is achieved at the same point. Nevertheless, the Kullback–Leibler function for nonnormalized data cannot be classified as an "information distance," at least from the viewpoint of dimensional analysis. (To be fully fair, the Kullback–Leibler divergence (6) is not really a distance function either, because the symmetry and subadditivity (triangle inequality) conditions do not hold for it.)
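The linear relation (6) between the normalized and nonnormalized Kullback–Leibler functions, as well as their different behavior under the homothety X = kA, can be checked numerically; a minimal sketch with placeholder nonnegative matrices:

```python
import numpy as np

def f_KL(X, A):
    """Kullback-Leibler function for nonnormalized data: sum x_nm ln(x_nm / a_nm)."""
    return np.sum(X * np.log(X / A))

A = np.array([[4.0, 2.0], [1.0, 3.0]])   # placeholder positive matrices
X = np.array([[5.0, 1.0], [2.0, 4.0]])
a, x = A.sum(), X.sum()

lhs = f_KL(X / x, A / a)                 # normalized divergence, left-hand side of (6)
rhs = f_KL(X, A) / x + np.log(a / x)     # right-hand side of (6)
assert np.isclose(lhs, rhs)

# With the homothety X = k*A the normalized divergence vanishes,
# while the nonnormalized function equals k*a*ln(k) != 0 for k != 1.
k = 2.0
assert np.isclose(f_KL(k * A / (k * a), A / a), 0.0)
assert np.isclose(f_KL(k * A, A), k * a * np.log(k))
```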

Consider the conditional minimization problem with objective function (6) for normalized data and linear constraints (1) under the consistency requirement \({\mathbf{e}}_{N}^{\prime } {\mathbf{u}} = {\mathbf{e}}_{M}^{\prime } {\mathbf{v}} = {\mathbf{e}}_{N}^{\prime } {\mathbf{Xe}}_{M} = x\).

The first partial derivatives of the Lagrangian function for this problem with respect to \(x_{nm}\) are

$$\frac{{\partial L_{KL} }}{{\partial x_{nm} }} = \frac{1}{x}\left( {\ln x_{nm} + 1 - \ln a_{nm} } \right) - \lambda_{n} - \mu_{m} = 0,\quad n = 1 \div N,m = 1 \div M,$$

from which it is easy to obtain the RAS triple factorization of target matrix (3) as follows:

$$x_{nm} = a_{nm} e^{{x\left( {\lambda_{n} + \mu_{m} } \right) - 1}} = e^{{x\lambda_{n} - {1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-0pt} 2}}} a_{nm} e^{{x\mu_{m} - {1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-0pt} 2}}} = r_{n} a_{nm} s_{m}$$

where \(\;{\varvec{\uplambda}}\) and \({\varvec{\upmu}}\) are the column vectors of Lagrange multipliers with dimensions N × 1 and M × 1, respectively, and e is the base of natural logarithms.

Thus, it is proved that the RAS method logically emanates from the Kullback–Leibler divergence minimization approach.

Finally, it is easy to see that in homothetic testing with \({\mathbf{X}} = k{\mathbf{A}}\) the nonnegative function (6) for normalized data reaches its global (or absolute) minimum value, since

$${f}_{KL} ({{k{\mathbf{A}}} \mathord{\left/ {\vphantom {{k{\mathbf{A}}} {ka}}} \right. \kern-0pt} {ka}};{{\mathbf{A}} \mathord{\left/ {\vphantom {{\mathbf{A}} a}} \right. \kern-0pt} a}) = \sum\limits_{n = 1}^{N} {\sum\limits_{m = 1}^{M} {\frac{{ka_{nm} }}{ka}\ln \left( {\frac{a}{ka}\frac{{ka_{nm} }}{{a_{nm} }}} \right)} } = \frac{ka}{ka}\ln \left( 1 \right) = 0,$$

although the function \(f_{KL} ({\mathbf{X}};{\mathbf{A}})\) for nonnormalized data does not allow us to consider the matrices \({\mathbf{X}} = k{\mathbf{A}}\) and A as indiscernible, namely

$${f}_{KL} (k{\mathbf{A}};{\mathbf{A}}) = \sum\limits_{n = 1}^{N} {\sum\limits_{m = 1}^{M} {ka_{nm} \ln \left( {\frac{{ka_{nm} }}{{a_{nm} }}} \right)} } = ka \cdot \ln \left( k \right) \ne 0.$$

It means that, from the viewpoint of the Kullback–Leibler divergence minimization approach together with the RAS method, the matrix homothety \({\mathbf{X}} = k{\mathbf{A}}\) can be considered as the optimal solution for the proportionality case of the general matrix-updating problem.

4 Homothetic testing of Kuroda’s method

Kuroda (1988) proposed an original method for matrix updating that reduces to constrained minimization of the twofold-weighted quadratic objective function

$${f}_{K} ({\mathbf{X}};{\mathbf{A}}) = \frac{1}{2}\sum\limits_{n = 1}^{N} {\sum\limits_{m = 1}^{M} {w_{nm}^{1} \left( {\frac{{x_{nm} }}{{u_{n} }} - \frac{{a_{nm} }}{{u_{n}^{{\mathbf{A}}} }}} \right)^{2} } } + \frac{1}{2}\sum\limits_{n = 1}^{N} {\sum\limits_{m = 1}^{M} {w_{nm}^{2} \left( {\frac{{x_{nm} }}{{v_{m} }} - \frac{{a_{nm} }}{{v_{m}^{{\mathbf{A}}} }}} \right)^{2} } } ,$$

which can be rewritten in matrix form as

$${f}_{K} \left( {{\mathbf{x}}_{{\mathbf{u}}} ,\;{\mathbf{x}}_{{\mathbf{v}}} } \right) = \frac{1}{2}{\mathbf{x}}_{{\mathbf{u}}}^{\prime } {\mathbf{W}}_{1} {\mathbf{x}}_{{\mathbf{u}}} + \frac{1}{2}{\mathbf{x}}_{{\mathbf{v}}}^{\prime } {\mathbf{W}}_{2} {\mathbf{x}}_{{\mathbf{v}}}$$
(7)

where \({\mathbf{W}}_{1}\) and \({\mathbf{W}}_{2}\) are the nonsingular diagonal matrices of order NM with the relative reliability or relative confidence factors (weights). Here the NM-dimensional column vectors \({\mathbf{x}}_{{\mathbf{u}}}\) and \({\mathbf{x}}_{{\mathbf{v}}}\) are defined through applying the vectorization operator “vec,” which transforms a matrix into a vector by stacking the columns of the matrix one underneath the other (see, e.g., Magnus and Neudecker 2007), as follows:

$${\mathbf{x}}_{{\mathbf{u}}} = vec\left( {{\hat{\mathbf{u}}}^{ - 1} {\mathbf{X}} - {\hat{\mathbf{u}}}_{{\mathbf{A}}}^{ - 1} {\mathbf{A}}} \right) = {\mathbf{Ux}} - {\mathbf{U}}_{{\mathbf{A}}} {\mathbf{a}},\quad {\mathbf{x}}_{{\mathbf{v}}} = vec\left( {{\mathbf{X}}{\hat{\mathbf{v}}}^{ - 1} - {\mathbf{A}}{\hat{\mathbf{v}}}_{{\mathbf{A}}}^{ - 1} } \right) = {\mathbf{Vx}} - {\mathbf{V}}_{{\mathbf{A}}} {\mathbf{a}},$$

where

$$\begin{aligned} {\mathbf{a}} = vec\,{\mathbf{A}},\quad {\mathbf{U}}_{{\mathbf{A}}} = {\mathbf{E}}_{M} \otimes {\hat{\mathbf{u}}}_{{\mathbf{A}}}^{ - 1} ,\quad {\mathbf{V}}_{{\mathbf{A}}} = {\hat{\mathbf{v}}}_{{\mathbf{A}}}^{ - 1} \otimes {\mathbf{E}}_{N} \hfill \\ {\mathbf{x}} = vec\,{\mathbf{X}},\quad {\mathbf{U}} = {\mathbf{E}}_{M} \otimes {\hat{\mathbf{u}}}^{ - 1} ,\quad {\mathbf{V}} = {\hat{\mathbf{v}}}^{ - 1} \otimes {\mathbf{E}}_{N} , \hfill \\ \end{aligned}$$

\({\mathbf{E}}_{M}\) is an identity matrix of order M, and the character "⊗" denotes the Kronecker product of two matrices.

Within a homothetic test for Kuroda’s method, the row and column marginal totals for target matrix are \({\mathbf{u}} = k{\mathbf{u}}_{{\mathbf{A}}}\) and \({\mathbf{v}} = k{\mathbf{v}}_{{\mathbf{A}}}\), hence

$${\mathbf{x}}_{{\mathbf{u}}} = {\mathbf{Ux}} - {\mathbf{U}}_{{\mathbf{A}}} {\mathbf{a}} = {\mathbf{U}}_{{\mathbf{A}}} \left( {\frac{1}{k}{\mathbf{x}} - {\mathbf{a}}} \right),\quad {\mathbf{x}}_{{\mathbf{v}}} = {\mathbf{Vx}} - {\mathbf{V}}_{{\mathbf{A}}} {\mathbf{a}} = {\mathbf{V}}_{{\mathbf{A}}} \left( {\frac{1}{k}{\mathbf{x}} - {\mathbf{a}}} \right).$$

Therefore, at \({\mathbf{x}} = k{\mathbf{a}}\) both vectors \({\mathbf{x}}_{{\mathbf{u}}}\) and \({\mathbf{x}}_{{\mathbf{v}}}\) vanish, and the quadratic function (7) reaches its global minimum value equal to zero. It means that, from the viewpoint of Kuroda's method, the matrix homothety \({\mathbf{X}} = k{\mathbf{A}}\) provides the optimal solution for the general problem of matrix updating in the case of strict proportionality between the row and column marginal totals of the target and initial matrices. Thus, Kuroda's method passes the homothetic test successfully, as does the RAS method.
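A minimal sketch (NumPy, uniform placeholder weights) that evaluates Kuroda's objective (7) through the vectorization operators defined above and confirms that it vanishes at X = kA:

```python
import numpy as np

def kuroda_objective(X, A, u, v, W1, W2):
    """Twofold-weighted quadratic function (7): 0.5 x_u' W1 x_u + 0.5 x_v' W2 x_v,
    with x_u = U x - U_A a and x_v = V x - V_A a built via Kronecker products."""
    N, M = A.shape
    uA, vA = A.sum(axis=1), A.sum(axis=0)
    a = A.flatten(order='F')                    # a = vec A (column stacking)
    x = X.flatten(order='F')                    # x = vec X
    U_A = np.kron(np.eye(M), np.diag(1.0 / uA))
    V_A = np.kron(np.diag(1.0 / vA), np.eye(N))
    U = np.kron(np.eye(M), np.diag(1.0 / u))
    V = np.kron(np.diag(1.0 / v), np.eye(N))
    xu = U @ x - U_A @ a
    xv = V @ x - V_A @ a
    return 0.5 * xu @ W1 @ xu + 0.5 * xv @ W2 @ xv

A = np.array([[4.0, 2.0, 1.0],
              [3.0, 5.0, 2.0]])                 # placeholder positive matrix
NM = A.size
W1 = W2 = np.eye(NM) / NM                       # uniform relative-confidence weights
k = 3.0
u, v = k * A.sum(axis=1), k * A.sum(axis=0)     # proportional target marginals
assert np.isclose(kuroda_objective(k * A, A, u, v, W1, W2), 0.0)
```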

5 Homothetic test’s failure: the method of weighted squared differences (WSD)

A quite common approach to define a measure of the structural similarity between initial and target matrices is to use some matrix norm for the difference \({\mathbf{X}} - {\mathbf{A}}\) to be minimized subject to linear constraints (1) under the consistency condition (one can find the proper reviews, e.g., in Miller and Blair 2009; Temurshoev et al. 2011), so that the optimal solution can be represented as \({\mathbf{X}}^{ * } = \arg \,\min_{{\mathbf{X}}} \left\| {{\mathbf{X}} - {\mathbf{A}}} \right\|\), or in equivalent vector form as \({\mathbf{x}}^{ * } = \arg \,\min_{{\mathbf{x}}} \left| {{\mathbf{x}} - {\mathbf{a}}} \right|\) where \({\mathbf{a}} = vec\,{\mathbf{A}}\) and \({\mathbf{x}} = vec\,{\mathbf{X}}\), as earlier.

For instance, the choice of Frobenius matrix norm (and compatible Euclidean vector norm) leads to the constrained minimization of weighted quadratic objective function

$${f}_{WSD} ({\mathbf{x}};{\mathbf{a}}) = \sum\limits_{n = 1}^{N} {\sum\limits_{m = 1}^{M} {w_{nm} \left( {x_{nm} - a_{nm} } \right)^{2} } } = \left( {{\mathbf{x}} - {\mathbf{a}}} \right)^{\prime } {\mathbf{W}}\left( {{\mathbf{x}} - {\mathbf{a}}} \right)$$
(8)

where W is a nonsingular diagonal matrix of order NM with the relative reliability or relative confidence coefficients. (Usually the elements of the matrix W are assumed to be normalized by multiplying it by a proper factor, so that \({\mathbf{e}}_{NM}^{\prime } {\mathbf{We}}_{NM} = 1\).) To complete the formulation of the constrained minimization problem for the WSD method, one needs to rewrite the left-hand sides of the constraints (1) in vector notation. Introducing the N × NM matrix \({\mathbf{G}} = {\mathbf{e}}_{M}^{\prime } \otimes {\mathbf{E}}_{N}\), which consists of M identity matrices \({\mathbf{E}}_{N}\) placed horizontally, and the M × NM matrix \({\mathbf{H}} = {\mathbf{E}}_{M} \otimes {\mathbf{e}}_{N}^{\prime }\), which is the N-fold successive replication of each column of the identity matrix \({\mathbf{E}}_{M}\), we have

$${\mathbf{Xe}}_{M} = {\mathbf{Gx}} = {\mathbf{u}},\quad {\mathbf{X}}^{\prime } {\mathbf{e}}_{N} = {\mathbf{Hx}} = {\mathbf{v}}.$$
(9)

Notice that each column of G and H includes exactly one nonzero (unit) element such that \({\mathbf{e}}_{N}^{\prime } {\mathbf{G}} = {\mathbf{e}}_{M}^{\prime } {\mathbf{H}} = {\mathbf{e}}_{NM}^{\prime }\).
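A short sketch (NumPy, placeholder data) constructing the matrices G and H via Kronecker products and verifying both the marginal-total relations (9) and the unit column-sum property just noted:

```python
import numpy as np

N, M = 3, 4
A = np.arange(1.0, N * M + 1).reshape(N, M)     # placeholder positive matrix
a = A.flatten(order='F')                        # a = vec A (column stacking)

G = np.kron(np.ones(M), np.eye(N))              # G = e_M' (x) E_N, size N x NM
H = np.kron(np.eye(M), np.ones(N))              # H = E_M (x) e_N', size M x NM

assert np.allclose(G @ a, A.sum(axis=1))        # G a = u_A (row totals)
assert np.allclose(H @ a, A.sum(axis=0))        # H a = v_A (column totals)
assert np.allclose(np.ones(N) @ G, np.ones(N * M))   # e_N' G = e_NM'
assert np.allclose(np.ones(M) @ H, np.ones(N * M))   # e_M' H = e_NM'
```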

The Lagrangian function for the problem of minimizing the objective function (8) subject to the linear constraints (9) is just

$$L_{WSD} ({\mathbf{x}},{\varvec{\uplambda}},{\varvec{\upmu}}) = \left( {{\mathbf{x}} - {\mathbf{a}}} \right)^{\prime } {\mathbf{W}}\left( {{\mathbf{x}} - {\mathbf{a}}} \right) - {\varvec{\uplambda}}^{\prime } \left( {{\mathbf{Gx}} - {\mathbf{u}}} \right) - {\varvec{\upmu}}^{\prime } \left( {{\mathbf{Hx}} - {\mathbf{v}}} \right)$$

where \(\;{\varvec{\uplambda}}\) and \({\varvec{\upmu}}\) are the column vectors of Lagrange multipliers, as earlier. Taking the first partial derivatives of this function with respect to x, \(\;{\varvec{\uplambda}}\) and \({\varvec{\upmu}}\) gives

$$2{\mathbf{W}}\left( {{\mathbf{x}} - {\mathbf{a}}} \right) - {\mathbf{G}}^{\prime } {\varvec{\uplambda}} - {\mathbf{H}}^{\prime } {\varvec{\upmu}} = {\mathbf{0}}_{NM} ,\quad {\mathbf{Gx}} - {\mathbf{u}} = {\mathbf{0}}_{N} ,\quad {\mathbf{Hx}} - {\mathbf{v}} = {\mathbf{0}}_{M} .$$

Expressing x from the first equation of this system, we obtain the problem solution as a function of the Lagrange multipliers, namely

$${\mathbf{x}} = {\mathbf{a}} + \frac{1}{2}{\mathbf{W}}^{ - 1} \left( {{\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{H}}^{\prime } {\varvec{\upmu}}} \right).$$
(10)

Inserting (10) into the second and third equations leads to the following system of equations with \(\;{\varvec{\uplambda}}\) and \({\varvec{\upmu}}\) as unknowns:

$$\begin{aligned} {\mathbf{GW}}^{ - 1} {\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{GW}}^{ - 1} {\mathbf{H}}^{\prime } {\varvec{\upmu}} = 2\left( {{\mathbf{u}} - {\mathbf{Ga}}} \right), \hfill \\ {\mathbf{HW}}^{ - 1} {\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{HW}}^{ - 1} {\mathbf{H}}^{\prime } {\varvec{\upmu}} = 2\left( {{\mathbf{v}} - {\mathbf{Ha}}} \right) \hfill \\ \end{aligned}$$
(11)

where \({\mathbf{Ga}} = {\mathbf{u}}_{{\mathbf{A}}}\) and \({\mathbf{Ha}} = {\mathbf{v}}_{{\mathbf{A}}}\).

Note that the main matrix of the system (11) is symmetric. Moreover, it is easy to see that \({\mathbf{GW}}^{ - 1} {\mathbf{G}}^{\prime } {\mathbf{e}}_{N} - {\mathbf{GW}}^{ - 1} {\mathbf{H}}^{\prime } {\mathbf{e}}_{M} = {\mathbf{0}}_{N}\) and \({\mathbf{HW}}^{ - 1} {\mathbf{G}}^{\prime } {\mathbf{e}}_{N} - {\mathbf{HW}}^{ - 1} {\mathbf{H}}^{\prime } {\mathbf{e}}_{M} = {\mathbf{0}}_{M}\) because \({\mathbf{G}}^{\prime } {\mathbf{e}}_{N} = {\mathbf{H}}^{\prime } {\mathbf{e}}_{M} = {\mathbf{e}}_{NM}\), i.e., the matrix of the system (11) has linearly dependent columns and so is singular. Thus, the general solution to the corresponding homogeneous system (11) is \({\varvec{\uplambda}}^{\left( 0 \right)} = c{\mathbf{e}}_{N}\), \({\varvec{\upmu}}^{\left( 0 \right)} = - c{\mathbf{e}}_{M}\) with any scalar constant c.

Since a general solution to nonhomogeneous linear system equals the sum of a general solution to corresponding homogeneous system and any particular solution to nonhomogeneous system, let \({\varvec{\uplambda}} = {\varvec{\uplambda}}^{\left( 0 \right)} + {\varvec{\uplambda}}^{\left( 1 \right)}\) and \({\varvec{\upmu}} = {\varvec{\upmu}}^{\left( 0 \right)} + {\varvec{\upmu}}^{\left( 1 \right)}\), where the pair \({\varvec{\uplambda}}^{\left( 1 \right)}\), \({\varvec{\upmu}}^{\left( 1 \right)}\) is a particular solution to system (11). Putting these formulas into round-bracketed expression in the right-hand side of (10) gives

$${\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{H}}^{\prime } {\varvec{\upmu}} = {\mathbf{G^{\prime}}}\left( {c{\mathbf{e}}_{N} + {\varvec{\uplambda}}^{\left( 1 \right)} } \right) + {\mathbf{H}}^{\prime } \left( { - c{\mathbf{e}}_{M} + {\varvec{\upmu}}^{\left( 1 \right)} } \right) = c{\mathbf{e}}_{NM} - c{\mathbf{e}}_{NM} + {\mathbf{G}}^{\prime } {\varvec{\uplambda}}^{\left( 1 \right)} + {\mathbf{H}}^{\prime } {\varvec{\upmu}}^{\left( 1 \right)} = {\mathbf{G}}^{\prime } {\varvec{\uplambda}}^{\left( 1 \right)} + {\mathbf{H}}^{\prime } {\varvec{\upmu}}^{\left( 1 \right)} ,$$

i.e., the optimal solution of minimization problem (8), (9) depends only on a particular solution to nonhomogeneous system (11).

As noted above, any N + M − 1 among the N + M constraints (9) are mutually independent provided that \({\mathbf{e}}_{N}^{\prime } {\mathbf{u}} = {\mathbf{e}}_{M}^{\prime } {\mathbf{v}}\). Therefore, the Lagrange multipliers \(\;{\varvec{\uplambda}}\) and \({\varvec{\upmu}}\) can be determined from the reduced system of linear equations (11) with any one of the constraints eliminated (and, of course, the corresponding multiplier set equal to zero).
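A minimal sketch of the resulting WSD computation (NumPy, uniform placeholder weights and placeholder marginals): the reduced system (11) is solved with the last column constraint dropped and its multiplier set to zero, after which the target matrix is recovered from (10).

```python
import numpy as np

def wsd_update(A, u, v, W=None):
    """WSD solution: solve the reduced system (11) for the Lagrange multipliers
    (last column constraint dropped, its multiplier set to zero), then use (10)."""
    N, M = A.shape
    NM = N * M
    a = A.flatten(order='F')
    W = np.eye(NM) / NM if W is None else W
    Winv = np.linalg.inv(W)
    G = np.kron(np.ones(M), np.eye(N))
    H = np.kron(np.eye(M), np.ones(N))
    C = np.vstack([G, H[:-1]])                    # N + M - 1 independent constraints
    rhs = 2.0 * (np.concatenate([u, v[:-1]]) - C @ a)
    theta = np.linalg.solve(C @ Winv @ C.T, rhs)  # reduced system (11)
    x = a + 0.5 * Winv @ C.T @ theta              # solution (10)
    return x.reshape(M, N).T                      # un-vec (column stacking)

A = np.array([[4.0, 2.0, 1.0],
              [3.0, 5.0, 2.0]])                   # placeholder data
u = np.array([8.0, 12.0])
v = np.array([7.0, 8.0, 5.0])
X = wsd_update(A, u, v)
assert np.allclose(X.sum(axis=1), u) and np.allclose(X.sum(axis=0), v)
```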

For homothetic testing of the WSD method, suppose that \({\mathbf{x}} = k{\mathbf{a}}\). Inserting this homothety into the solution (10) leads to the condition

$${\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{H}}^{\prime } {\varvec{\upmu}} = 2(k - 1){\mathbf{Wa}}$$
(12)

which is not met at N + M > 3 because it actually represents a generally inconsistent system of NM linear equations with N + M − 1 < NM unknown Lagrange multipliers. (Recall that one of them equals zero.) Hence, the vector \({\mathbf{x}} = k{\mathbf{a}}\) cannot be an optimal solution of minimization problem (8), (9) at \({\mathbf{u}} = k{\mathbf{u}}_{{\mathbf{A}}}\) and \({\mathbf{v}} = k{\mathbf{v}}_{{\mathbf{A}}}\).

Thus, the WSD method does not pass the homothetic test, in contrast to the RAS and Kuroda's methods.

6 Applying homothetic paradigm for improving the WSD method

Acceptance of the matrix \({\mathbf{X}} = k{\mathbf{A}}\) as the optimal solution for the proportionality case of the general matrix-updating problem establishes that all matrices from the homothetic family \(k{\mathbf{A}}\) demonstrate excellent structural similarity to each other. The homothetic paradigm can be helpful for enhancing a collection of matrix updating methods based on constrained minimization of distance (or quasi-distance) functions.

Within the homothetic paradigm, we can set the goal of placing the target matrix as close as possible not to the initial matrix A, but to its homothetic family k A. As a result, the optimal solution becomes \(({\mathbf{X}}^{ * } ,k^{ * } ) = \arg \,\min_{{{\mathbf{X}},k}} \left\| {{\mathbf{X}} - k{\mathbf{A}}} \right\|\), or in equivalent vector form \(({\mathbf{x}}^{ * } ,k^{ * } ) = \arg \,\min_{{{\mathbf{x}},k}} \left| {{\mathbf{x}} - k{\mathbf{a}}} \right|\), and, clearly, it cannot be "worse" (in terms of the particular matrix/vector norm chosen) than the original one.

As shown below, the transition from \(\,\left| {\,{\mathbf{x}} - {\mathbf{a}}\,} \right|\) to \(\,\left| {\,{\mathbf{x}} - k{\mathbf{a}}\,} \right|\) leads to the idea of orthogonally projecting an unknown target vector x onto the homothetic ray k a in the NM-dimensional vector space with a scalar product operation.

In the case of WSD method, the improved version of weighted quadratic objective function (8) can be written as

$${f}_{iWSD} ({\mathbf{x}},l;{\mathbf{a}}) = \sum\limits_{n = 1}^{N} {\sum\limits_{m = 1}^{M} {w_{nm} \left( {x_{nm} - la_{nm} } \right)^{2} } } = \left( {{\mathbf{x}} - l{\mathbf{a}}} \right)^{\prime } {\mathbf{W}}\left( {{\mathbf{x}} - l{\mathbf{a}}} \right)$$
(13)

where l is an additional scalar variable.

The Lagrangian function for the problem of minimizing the objective function (13) subject to the linear constraints (9) becomes

$$L_{iWSD} ({\mathbf{x}},l,{\varvec{\uplambda}},{\varvec{\upmu}}) = \left( {{\mathbf{x}} - l{\mathbf{a}}} \right)^{\prime } {\mathbf{W}}\left( {{\mathbf{x}} - l{\mathbf{a}}} \right) - {\varvec{\uplambda}}^{\prime } \left( {{\mathbf{Gx}} - {\mathbf{u}}} \right) - {\varvec{\upmu}}^{\prime } \left( {{\mathbf{Hx}} - {\mathbf{v}}} \right)$$

where, as earlier, \(\;{\varvec{\uplambda}}\) and \({\varvec{\upmu}}\) are the column vectors of Lagrange multipliers. Taking the first partial derivatives of this function with respect to vector x and scalar l gives the following system of NM + 1 equations:

$$2{\mathbf{W}}\left( {{\mathbf{x}} - l{\mathbf{a}}} \right) - {\mathbf{G}}^{\prime } {\varvec{\uplambda}} - {\mathbf{H}}^{\prime } {\varvec{\upmu}} = {\mathbf{0}}_{NM} ,\quad {\mathbf{a}}^{\prime } {\mathbf{W}}\left( {{\mathbf{x}} - l{\mathbf{a}}} \right) = 0$$

where the second equation can be interpreted as an orthogonality condition for the vectors \({\mathbf{x}} - l{\mathbf{a}}\) and a.

Expressing x from the first equation of this system, we obtain the solution of problem (13), (9) as a function of the Lagrange multipliers, namely

$${\mathbf{x}} = l{\mathbf{a}} + \frac{1}{2}{\mathbf{W}}^{ - 1} \left( {{\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{H}}^{\prime } {\varvec{\upmu}}} \right).$$
(14)

Inserting (14) into the second equation of the above system and into the pair of constraint equations (9) leads to the following system of 1 + N + M equations with the scalar l and the vectors \(\;{\varvec{\uplambda}}\), \({\varvec{\upmu}}\) as unknown variables:

$$\begin{aligned} & {\mathbf{a}}^{\prime } {\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{a}}^{\prime } {\mathbf{H}}^{\prime } {\varvec{\upmu}} = 0, \\ & 2l{\mathbf{Ga}} + {\mathbf{GW}}^{ - 1} {\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{GW}}^{ - 1} {\mathbf{H}}^{\prime } {\varvec{\upmu}} = 2{\mathbf{u}}, \\ & 2l{\mathbf{Ha}} + {\mathbf{HW}}^{ - 1} {\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{HW}}^{ - 1} {\mathbf{H}}^{\prime } {\varvec{\upmu}} = 2{\mathbf{v}} \\ \end{aligned}$$
(15)

where \({\mathbf{Ga}} = {\mathbf{u}}_{{\mathbf{A}}}\) and \({\mathbf{Ha}} = {\mathbf{v}}_{{\mathbf{A}}}\). Since any N + M − 1 among the N + M constraints (9) are mutually independent provided that \({\mathbf{e}}_{N}^{\prime } {\mathbf{u}} = {\mathbf{e}}_{M}^{\prime } {\mathbf{v}}\), the scalar l and the Lagrange multipliers \(\;{\varvec{\uplambda}}\), \({\varvec{\upmu}}\) can be determined from the reduced system of linear equations (15) with any one equation (except the first) eliminated and the corresponding multiplier set equal to zero.
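A minimal sketch of the improved WSD computation (NumPy, uniform placeholder weights): the reduced system (15) is assembled as a bordered linear system in l and the remaining multipliers, and the target matrix is recovered from (14); the final assertions illustrate the homothetic test discussed below.

```python
import numpy as np

def iwsd_update(A, u, v, W=None):
    """Improved WSD: solve system (15) for (l, lambda, mu) with one marginal
    constraint dropped (its multiplier set to zero), then recover x from (14)."""
    N, M = A.shape
    NM = N * M
    a = A.flatten(order='F')
    W = np.eye(NM) / NM if W is None else W
    Winv = np.linalg.inv(W)
    G = np.kron(np.ones(M), np.eye(N))
    H = np.kron(np.eye(M), np.ones(N))
    C = np.vstack([G, H[:-1]])                    # N + M - 1 independent constraints
    t = np.concatenate([u, v[:-1]])               # their right-hand sides
    # Bordered system: [[0, (Ca)'], [2Ca, C Winv C']] [l; theta] = [0; 2t]
    K = np.block([[np.zeros((1, 1)), (C @ a)[None, :]],
                  [2.0 * (C @ a)[:, None], C @ Winv @ C.T]])
    sol = np.linalg.solve(K, np.concatenate([[0.0], 2.0 * t]))
    l, theta = sol[0], sol[1:]
    x = l * a + 0.5 * Winv @ C.T @ theta          # solution (14)
    return x.reshape(M, N).T, l

A = np.array([[4.0, 2.0, 1.0],
              [3.0, 5.0, 2.0]])                   # placeholder data
k = 3.0
X, l = iwsd_update(A, k * A.sum(axis=1), k * A.sum(axis=0))
assert np.isclose(l, k) and np.allclose(X, k * A)   # homothetic test passed
```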

Putting the improved WSD method to the homothetic test, suppose that \({\mathbf{x}} = k{\mathbf{a}}\). Inserting this formula into the orthogonality condition gives \(\left( {k - l} \right){\mathbf{a}}^{\prime } {\mathbf{Wa}} = 0\), from which \(l^{ * } = k\). In turn, substituting the homothety into (14) leads to the homogeneous system

$${\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{H}}^{\prime } {\varvec{\upmu}} = {\mathbf{0}}_{NM}$$

that obviously has the solution \({\varvec{\uplambda}}^{ * } = c{\mathbf{e}}_{N}\), \({\varvec{\upmu}}^{ * } = - c{\mathbf{e}}_{M}\), where c is an arbitrary scalar (see Sect. 5). Further, at \({\mathbf{u}} = k{\mathbf{u}}_{{\mathbf{A}}}\), \({\mathbf{v}} = k{\mathbf{v}}_{{\mathbf{A}}}\) and \(l^{ * } = k\) the linear equations (15) are also transformed into the homogeneous system

$$\begin{aligned} & {\mathbf{a}}^{\prime } \left( {{\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{H}}^{\prime } {\varvec{\upmu}}} \right) = 0, \\ & {\mathbf{GW}}^{ - 1} \left( {{\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{H}}^{\prime } {\varvec{\upmu}}} \right) = {\mathbf{0}}_{N} , \\ & {\mathbf{HW}}^{ - 1} \left( {{\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{H}}^{\prime } {\varvec{\upmu}}} \right) = {\mathbf{0}}_{M} \\ \end{aligned}$$

with the same simple solution. Therefore, the vectors \({\mathbf{x}}^{ * } = k{\mathbf{a}}\), \({\varvec{\uplambda}}^{ * } = c{\mathbf{e}}_{N}\), \({\varvec{\upmu}}^{ * } = - c{\mathbf{e}}_{M}\) provide the optimal solution to the problem of minimizing the objective function (13) subject to the linear constraints (9) at \({\mathbf{u}} = k{\mathbf{u}}_{{\mathbf{A}}}\) and \({\mathbf{v}} = k{\mathbf{v}}_{{\mathbf{A}}}\).

Thus, the improved WSD method passes the homothetic test successfully, like the RAS and Kuroda's methods (and in contrast to the regular WSD method).

7 Analyzing and improving the method of weighted squared relative differences (WSRD)

To make the minimization problems (8), (9) and (13), (9) independent of the scale of the initial data, it is expedient to let

$${\mathbf{x}} = {\hat{\mathbf{a}}\mathbf{q}}$$
(16)

where q is an NM-dimensional column vector of unknown relative coefficients, and then to introduce the relative distance functions, namely \(\left| {\,{\mathbf{q}} - {\mathbf{e}}_{NM} \,} \right|\) instead of \(\left| {\,{\mathbf{x}} - {\mathbf{a}}\,} \right|\) for the WSD method and \(\left| {\,{\mathbf{q}} - k{\mathbf{e}}_{NM} \,} \right|\) instead of \(\left| {\,{\mathbf{x}} - k{\mathbf{a}}\,} \right|\) for the improved WSD method. Here the transition from \(\left| {\,{\mathbf{q}} - {\mathbf{e}}_{NM} \,} \right|\) to \(\left| {\,{\mathbf{q}} - k{\mathbf{e}}_{NM} \,} \right|\), as shown below, leads to the idea of orthogonally projecting an unknown target vector q onto the homothetic ray \(k{\mathbf{e}}_{NM}\) in the NM-dimensional vector space with a scalar product operation. Notice that Eq. (16) cannot be resolved with respect to q if the initial matrix A contains at least one zero entry.

In the transition from the WSD to the WSRD method, the quadratic objective function (8) becomes

$$f_{WSRD} \,({\mathbf{q}};{\mathbf{e}}_{NM} ) = \sum\limits_{n = 1}^{N} {\sum\limits_{m = 1}^{M} {w_{nm} \left( {q_{nm} - e_{nm} } \right)^{2} } } = \left( {{\mathbf{q}} - {\mathbf{e}}_{NM} } \right)^{\prime } {\mathbf{W}}\left( {{\mathbf{q}} - {\mathbf{e}}_{NM} } \right)$$
(17)

whereas the improved objective function (13) is transformed into

$$f_{iWSRD} \,({\mathbf{q}},l;{\mathbf{e}}_{NM} ) = \sum\limits_{n = 1}^{N} {\sum\limits_{m = 1}^{M} {w_{nm} \left( {q_{nm} - le_{nm} } \right)^{2} } } = \left( {{\mathbf{q}} - l{\mathbf{e}}_{NM} } \right)^{\prime } {\mathbf{W}}\left( {{\mathbf{q}} - l{\mathbf{e}}_{NM} } \right),$$
(18)

and, finally, inserting (16) into the linear constraints (9) leads to the following modified constraints:

$${\mathbf{Gx}} = {\mathbf{G}}{\hat{\mathbf{a}}}{\mathbf{q}} = {\mathbf{u}},\quad {\mathbf{Hx}} = {\mathbf{H}}{\hat{\mathbf{a}}}{\mathbf{q}} = {\mathbf{v}}.$$
(19)

The objective function (17) was first proposed by Harthoorn and van Dalen (1987), with the relative confidence coefficients matrix factorized as \({\mathbf{W}} = {\hat{\mathbf{a}}}{\hat{\mathbf{w}}}^{ - 1} {\hat{\mathbf{a}}}\), where w is an exogenous vector of reciprocal weights for the elements of the initial vector a. The Lagrangian function for the problem of minimizing the objective function (17) subject to the linear constraints (19) is just

$$L_{WSRD} ({\mathbf{q}};{\varvec{\uplambda}},{\varvec{\upmu}}) = \left( {{\mathbf{q}} - {\mathbf{e}}_{NM} } \right)^{\prime } {\mathbf{W}}\left( {{\mathbf{q}} - {\mathbf{e}}_{NM} } \right) - {\varvec{\uplambda}}^{\prime } \left( {{\mathbf{G}}{\hat{\mathbf{a}}}{\mathbf{q}} - {\mathbf{u}}} \right) - {\varvec{\upmu}}^{\prime } \left( {{\mathbf{H}}{\hat{\mathbf{a}}}{\mathbf{q}} - {\mathbf{v}}} \right),$$

where \(\;{\varvec{\uplambda}}\) and \({\varvec{\upmu}}\) are the column vectors of Lagrange multipliers, as earlier.

By analogy with the derivation of (10) and (11) in Sect. 5, we obtain

$${\mathbf{q}} = {\mathbf{e}}_{NM} + \frac{1}{2}{\mathbf{W}}^{ - 1} {\hat{\mathbf{a}}}\left( {{\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{H}}^{\prime } {\varvec{\upmu}}} \right)$$
(20)

and

$$\begin{aligned} & {\mathbf{G}}{\hat{\mathbf{a}}}{\mathbf{W}}^{ - 1} {\hat{{\mathbf{a}}}}{\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{G}}{\hat{\mathbf{a}}}{\mathbf{W}}^{ - 1} {\hat{\mathbf{a}}}{\mathbf{H}}^{\prime } {\varvec{\upmu}} = 2\left( {{\mathbf{u}} - {\mathbf{Ga}}} \right), \\ & {\mathbf{H}}{\hat{\mathbf{a}}}{\mathbf{W}}^{ - 1} {\hat{\mathbf{a}}}{\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{H}}{\hat{\mathbf{a}}}{\mathbf{W}}^{ - 1} {\hat{\mathbf{a}}}{\mathbf{H}}^{\prime } {\varvec{\upmu}} = 2\left( {{\mathbf{v}} - {\mathbf{Ha}}} \right) \\ \end{aligned}$$
(21)

where, as earlier, \({\mathbf{Ga}} = {\mathbf{u}}_{{\mathbf{A}}}\) and \({\mathbf{Ha}} = {\mathbf{v}}_{{\mathbf{A}}}\). It is easy to see that the system of linear equations (21) is degenerate, resembling (11), and its corresponding homogeneous system also has the general solution \({\varvec{\uplambda}}^{\left( 0 \right)} = c{\mathbf{e}}_{N}\), \({\varvec{\upmu}}^{\left( 0 \right)} = - c{\mathbf{e}}_{M}\) with any scalar constant c. Since any N + M − 1 among the N + M constraints (19) are mutually independent provided that \({\mathbf{e}}_{N}^{\prime } {\mathbf{u}} = {\mathbf{e}}_{M}^{\prime } {\mathbf{v}}\), the Lagrange multipliers \(\;{\varvec{\uplambda}}\) and \({\varvec{\upmu}}\) can be determined from the reduced system of linear equations (21) with any one of the constraints eliminated and the corresponding multiplier set equal to zero.
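A minimal sketch of the WSRD computation (NumPy, uniform placeholder weights and placeholder marginals), solving the reduced system (21), recovering q from (20), and forming the target matrix from (16); it requires a strictly nonzero initial matrix.

```python
import numpy as np

def wsrd_update(A, u, v, W=None):
    """WSRD solution: solve the reduced system (21), recover q from (20),
    and return X with x = <a> q; the initial matrix A must have no zero entries."""
    N, M = A.shape
    NM = N * M
    a = A.flatten(order='F')
    D = np.diag(a)                                 # diagonal matrix <a>
    W = np.eye(NM) / NM if W is None else W
    Winv = np.linalg.inv(W)
    G = np.kron(np.ones(M), np.eye(N))
    H = np.kron(np.eye(M), np.ones(N))
    C = np.vstack([G, H[:-1]])                     # drop one dependent constraint
    rhs = 2.0 * (np.concatenate([u, v[:-1]]) - C @ a)
    theta = np.linalg.solve(C @ D @ Winv @ D @ C.T, rhs)   # reduced system (21)
    q = np.ones(NM) + 0.5 * Winv @ D @ C.T @ theta         # solution (20)
    return (D @ q).reshape(M, N).T                 # X from x = <a> q

A = np.array([[4.0, 2.0, 1.0],
              [3.0, 5.0, 2.0]])                    # placeholder data
u = np.array([8.0, 12.0])
v = np.array([7.0, 8.0, 5.0])
X = wsrd_update(A, u, v)
assert np.allclose(X.sum(axis=1), u) and np.allclose(X.sum(axis=0), v)
```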

For homothetic testing of the WSRD method, suppose that \({\mathbf{q}} = k{\mathbf{e}}_{NM}\). Inserting this homothety into the solution (20) leads to the condition (compare with condition (12) in Sect. 5)

$${\hat{\mathbf{a}}}\left( {{\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{H}}^{\prime } {\varvec{\upmu}}} \right) = 2(k - 1){\mathbf{We}}_{NM}$$

which is not met at N + M > 3 because it actually represents a generally inconsistent system of NM linear equations with N + M − 1 < NM unknown Lagrange multipliers. (One of them equals zero because the matrix of system (21) has nonfull rank.) Hence, the vector \({\mathbf{q}} = k{\mathbf{e}}_{NM}\) cannot be an optimal solution of the minimization problem (17), (19) at \({\mathbf{u}} = k{\mathbf{u}}_{{\mathbf{A}}}\) and \({\mathbf{v}} = k{\mathbf{v}}_{{\mathbf{A}}}\). Thus, the WSRD method, like the WSD method, does not pass the homothetic test.

The objective function (18) is the result of applying the homothetic paradigm to the WSRD objective function (17). The Lagrangian function for the problem of minimizing the improved objective function (18) subject to the linear constraints (19) is just

$$L_{iWSRD} ({\mathbf{q}},l;{\varvec{\uplambda}},{\varvec{\upmu}}) = \left( {{\mathbf{q}} - l{\mathbf{e}}_{NM} } \right)^{\prime } {\mathbf{W}}\left( {{\mathbf{q}} - l{\mathbf{e}}_{NM} } \right) - {\varvec{\uplambda}}^{\prime } \left( {{\mathbf{G}}{\hat{\mathbf{a}}}{\mathbf{q}} - {\mathbf{u}}} \right) - {\varvec{\upmu}}^{\prime } \left( {{\mathbf{H}}{\hat{\mathbf{a}}}{\mathbf{q}} - {\mathbf{v}}} \right),$$

where \(\;{\varvec{\uplambda}}\) and \({\varvec{\upmu}}\) are the column vectors of Lagrange multipliers, and l is an additional scalar variable.

Taking the first partial derivatives of this function with respect to vector q and scalar l gives the following system of NM + 1 equations:

$$2{\mathbf{W}}\left( {{\mathbf{q}} - l{\mathbf{e}}_{NM} } \right) - {\hat{\mathbf{a}}}{\mathbf{G^{\prime}\lambda }} - {\hat{\mathbf{a}}}{\mathbf{H}}^{\prime } {\varvec{\upmu}} = {\mathbf{0}}_{NM} ,\quad {\mathbf{e}}_{NM}^{\prime } {\mathbf{W}}\left( {{\mathbf{q}} - l{\mathbf{e}}_{NM} } \right) = 0$$

where the second equation of the system expresses an orthogonality condition for vectors \({\mathbf{q}} - l{\mathbf{e}}_{NM}\) and \({\mathbf{e}}_{NM}\).

By analogy with the derivation of (14) and (15) in Sect. 6, we obtain

$${\mathbf{q}} = l{\mathbf{e}}_{NM} + \frac{1}{2}{\mathbf{W}}^{ - 1} {\hat{\mathbf{a}}}\left( {{\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{H}}^{\prime } {\varvec{\upmu}}} \right)$$
(22)

and

$$\begin{aligned} & {\mathbf{a}}^{\prime } {\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{a}}^{\prime } {\mathbf{H}}^{\prime } {\varvec{\upmu}} = 0, \\ & 2l{\mathbf{Ga}} + {\mathbf{G}}{\hat{{\mathbf{a}}}}{\mathbf{W}}^{ - 1} {\hat{\mathbf{a}}\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{G}}{\hat{{\mathbf{a}}}}{\mathbf{W}}^{ - 1} {\hat{\mathbf{a}}\mathbf{H}}^{\prime } {\varvec{\upmu}} = 2{\mathbf{u}}, \\ & 2l{\mathbf{Ha}} + {\mathbf{H}}\hat{{\mathbf{a}}}{\mathbf{W}}^{ - 1} {\hat{\mathbf{a}}\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{H}}\hat{{\mathbf{a}}}{\mathbf{W}}^{ - 1} {\hat{\mathbf{a}}\mathbf{H}}^{\prime } {\varvec{\upmu}} = 2{\mathbf{v}} \\ \end{aligned}$$
(23)

where \({\mathbf{Ga}} = {\mathbf{u}}_{{\mathbf{A}}}\) and \({\mathbf{Ha}} = {\mathbf{v}}_{{\mathbf{A}}}\). Since any N + M − 1 among the N + M constraints (19) are mutually independent provided that \({\mathbf{e}}_{N}^{\prime } {\mathbf{u}} = {\mathbf{e}}_{M}^{\prime } {\mathbf{v}}\), the scalar l and the Lagrange multipliers \(\;{\varvec{\uplambda}}\), \({\varvec{\upmu}}\) can be determined from the reduced system of linear equations (23) with any one equation (except the first) eliminated and the corresponding multiplier set equal to zero.
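A compact sketch of the improved WSRD computation, combining the two previous sketches: the reduced system (23) is solved as a bordered linear system, q is recovered from (22), and the homothetic test of the next paragraph is illustrated by the final assertions (NumPy, uniform placeholder weights).

```python
import numpy as np

def iwsrd_update(A, u, v, W=None):
    """Improved WSRD: solve system (23) for (l, lambda, mu) with one marginal
    constraint dropped, recover q from (22), and return X with x = <a> q."""
    N, M = A.shape
    NM = N * M
    a = A.flatten(order='F')
    D = np.diag(a)
    W = np.eye(NM) / NM if W is None else W
    Winv = np.linalg.inv(W)
    G = np.kron(np.ones(M), np.eye(N))
    H = np.kron(np.eye(M), np.ones(N))
    C = np.vstack([G, H[:-1]])
    t = np.concatenate([u, v[:-1]])
    B = C @ D @ Winv @ D @ C.T
    K = np.block([[np.zeros((1, 1)), (C @ a)[None, :]],
                  [2.0 * (C @ a)[:, None], B]])
    sol = np.linalg.solve(K, np.concatenate([[0.0], 2.0 * t]))
    l, theta = sol[0], sol[1:]
    q = l * np.ones(NM) + 0.5 * Winv @ D @ C.T @ theta      # solution (22)
    return (D @ q).reshape(M, N).T, l

A = np.array([[4.0, 2.0, 1.0],
              [3.0, 5.0, 2.0]])                              # placeholder data
k = 3.0
X, l = iwsrd_update(A, k * A.sum(axis=1), k * A.sum(axis=0))
assert np.isclose(l, k) and np.allclose(X, k * A)            # homothetic test passed
```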

Putting the improved WSRD method to the homothetic test, suppose that \({\mathbf{q}} = k{\mathbf{e}}_{NM}\). Inserting this formula into the orthogonality condition gives \(\left( {k - l} \right)\;{\mathbf{e}}_{NM}^{\prime } {\mathbf{We}}_{NM} = 0\), from which \(l^{ * } = k\). In turn, substituting the homothety into (22) leads to the homogeneous system

$${\hat{\mathbf{a}}}\left( {{\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{H}}^{\prime } {\varvec{\upmu}}} \right) = {\mathbf{0}}_{NM}$$

that obviously has the solution \({\varvec{\uplambda}}^{ * } = c{\mathbf{e}}_{N}\), \({\varvec{\upmu}}^{ * } = - c{\mathbf{e}}_{M}\), where c is an arbitrary scalar (see Sect. 6). Further, at \({\mathbf{u}} = k{\mathbf{u}}_{{\mathbf{A}}}\), \({\mathbf{v}} = k{\mathbf{v}}_{{\mathbf{A}}}\) and \(l^{ * } = k\) the linear equations (23) are also transformed into the homogeneous system

$$\begin{aligned} & {\mathbf{a}}^{\prime } \left( {{\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{H}}^{\prime } {\varvec{\upmu}}} \right) = 0, \\ & {\mathbf{G}}{\hat{\mathbf{a}}}{\mathbf{W}}^{ - 1} {\hat{\mathbf{a}}}\left( {{\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{H}}^{\prime } {\varvec{\upmu}}} \right) = {\mathbf{0}}_{N} , \\ & {\mathbf{H}}{\hat{\mathbf{a}}}{\mathbf{W}}^{ - 1} {\hat{\mathbf{a}}}\left( {{\mathbf{G}}^{\prime } {\varvec{\uplambda}} + {\mathbf{H}}^{\prime } {\varvec{\upmu}}} \right) = {\mathbf{0}}_{M} \\ \end{aligned}$$

with the same simple solution. Therefore, the vectors \({\mathbf{q}}^{ * } = k{\mathbf{e}}_{NM}\), \({\varvec{\uplambda}}^{ * } = c{\mathbf{e}}_{N}\), \({\varvec{\upmu}}^{ * } = - c{\mathbf{e}}_{M}\) provide the optimal solution to the problem of minimizing the objective function (18) subject to the linear constraints (19) at \({\mathbf{u}} = k{\mathbf{u}}_{{\mathbf{A}}}\) and \({\mathbf{v}} = k{\mathbf{v}}_{{\mathbf{A}}}\).

Thus, the improved WSRD method passes the homothetic test successfully, like the improved WSD method (and in contrast to the regular WSRD method).

8 Handling the zero entries in the distance minimization methods of matrix updating

In practice, an initial matrix often contains some zero elements. In this context, all matrix updating methods can be divided into two groups: those that preserve the same location of zeros in the target matrix as in the initial one, and those that do not. For example, the RAS method belongs to the first group because of its multiplicative pattern (3), whereas Kuroda's method belongs to the second group because it does not include any mechanism for fitting the proper elements of the target matrix to the required zero level. Analogously, the regular and improved WSRD methods belong to the first group because they are based on the multiplicative pattern (16), in contrast to the WSD method, both regular and improved.

If the zero preservation property is assumed to be desirable, the WSD methods can be slightly redeveloped to provide it by reducing the dimensionalities of the minimization problems (8), (9) and (13), (9). Let J < NM be the number of nonzero elements in the initial matrix A (and in the NM-dimensional vector a). When the zero preservation principle "if \(a_{j} = 0\) then \(x_{j} = 0\) for any j" is applied, the target vector x really contains only J unknown variables, because the other NM − J variables should be set to zero. Hence, it is advisable to reduce the operational dimensions of the initial and target vectors from NM × 1 to J × 1 by eliminating the NM − J zero components.

Let \({\mathbf{E}}_{J \times NM}\) be a rectangular matrix with dimensions J × NM that is obtained from an identity matrix of order NM by deleting the rows corresponding to zero entries in the initial vector. Then the initial and target vectors can be redefined as \({\mathbf{a}}_{J \times 1} = {\mathbf{E}}_{J \times NM} {\mathbf{a}}_{NM \times 1}\) and \({\mathbf{x}}_{J \times 1} = {\mathbf{E}}_{J \times NM} {\mathbf{x}}_{NM \times 1}\). In turn, in order to make the formulas of the objective functions for the WSD and improved WSD methods (8) and (13) operational, one also needs to redefine the weight matrix W by deleting the corresponding rows and columns, i.e., \({\mathbf{W}}_{J \times J} = {\mathbf{E}}_{J \times NM} {\mathbf{W}}_{NM \times NM} {\mathbf{E}}_{J \times NM}^{\prime }\).

Finally, it is necessary to adapt the linear constraints (9) to the reduced dimension of the target vector. To this end, the N × NM matrix G and the M × NM matrix H must be postmultiplied by \({\mathbf{E}}_{J \times NM}^{\prime }\) in order to eliminate their redundant columns. Hence, the constraints (9) are transformed into \({\mathbf{G}}_{N \times NM} {\mathbf{E}}_{J \times NM}^{\prime } {\mathbf{x}}_{J \times 1} = {\mathbf{u}}_{N \times 1}\) and \({\mathbf{H}}_{M \times NM} {\mathbf{E}}_{J \times NM}^{\prime } {\mathbf{x}}_{J \times 1} = {\mathbf{v}}_{M \times 1}\), respectively, where all dimensions are consistent in the sense of matrix products. Notice that, as earlier, each column of the transformed constraint matrices includes exactly one nonzero (unit) element, so that \({\mathbf{e}}_{N}^{\prime } {\mathbf{G}}_{N \times NM} {\mathbf{E}}_{J \times NM}^{\prime } = {\mathbf{e}}_{M}^{\prime } {\mathbf{H}}_{M \times NM} {\mathbf{E}}_{J \times NM}^{\prime } = {\mathbf{e}}_{J}^{\prime }\). Thus, the redevelopment of the regular and improved WSD methods comes down to properly reducing the dimensions of all vectors and matrices in the minimization problems (8), (9) and (13), (9) without reformulating them.
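A minimal sketch (NumPy, placeholder matrix with two zero entries) of constructing the selection matrix E_{J×NM} and the reduced vectors and constraint matrices described above:

```python
import numpy as np

# Placeholder initial matrix with two zero entries
A = np.array([[4.0, 2.0, 0.0],
              [0.0, 5.0, 2.0]])
N, M = A.shape
NM = N * M
a = A.flatten(order='F')                        # a = vec A (column stacking)

keep = np.flatnonzero(a != 0.0)                 # indices of the J nonzero entries
E_J = np.eye(NM)[keep]                          # E_{J x NM}: selected rows of E_NM

a_J = E_J @ a                                   # reduced initial vector of length J
G = np.kron(np.ones(M), np.eye(N))
H = np.kron(np.eye(M), np.ones(N))
G_J = G @ E_J.T                                 # reduced N x J constraint matrix
H_J = H @ E_J.T                                 # reduced M x J constraint matrix

# Each column of the reduced constraint matrices still contains exactly one unit:
assert np.allclose(np.ones(N) @ G_J, np.ones(len(keep)))
assert np.allclose(np.ones(M) @ H_J, np.ones(len(keep)))
# The reduced vector still reproduces the original marginal totals:
assert np.allclose(G_J @ a_J, A.sum(axis=1)) and np.allclose(H_J @ a_J, A.sum(axis=0))
```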

Furthermore, it is easy to see that Kuroda’s method can be modified for preserving the zeros in a similar way.

As noted earlier, the regular and improved WSRD methods provide the same location of zeros in the target matrix as in the initial one. Despite this, the eliminating procedure described above seems helpful for them as well, because excluding the zero elements significantly decreases the dimension of the solution space of the matrix-updating problem. It is to be emphasized that macroeconomic matrices of high dimensions often appear to be very sparse in practice, sometimes with more than 90% zero elements. In such cases, the efficiency of computations within the minimization problems (17), (19) and (18), (19) increases rather noticeably.

9 One more failure of homothetic testing: GRAS method

The regular WSD and WSRD methods do not originally satisfy the homothetic test but can be improved by applying the homothetic paradigm. However, some methods of matrix updating demonstrate an incorrigible failure of homothetic testing. For instance, the generalized RAS (GRAS) method does not pass the homothetic test and, at the same time, cannot be enhanced in this sense because of its nature.

As is well known, "… RAS can only handle non-negative matrices, which limits its application to SUTs that often contain negative entries…"—see Temurshoev et al. (2011, p. 92). The GRAS method was proposed by Junius and Oosterhaven (2003) for initial matrices with some negative entries, and it was later redeveloped by Lenzen et al. (2007). (To be fair, it is appropriate to mention that the mathematical framework of the GRAS method was described earlier by Günlük-Şenesen and Bates 1988.)

Any initial matrix A can be represented as \({\mathbf{A}} = {\mathbf{P}} - {\mathbf{Q}}\) where P is a matrix of positive entries and Q is a matrix containing the absolute values of negative entries. The GRAS method leans on the mixed additive–multiplicative pattern

$${\mathbf{X}} = {\hat{\mathbf{r}}}{\mathbf{P}}{\hat{\mathbf{s}}} - {\hat{\mathbf{r}}}^{ - 1} {\mathbf{Q}}{\hat{\mathbf{s}}}^{ - 1}$$
(24)

where r and s are unknown N × 1 and M × 1 column vectors that need to be estimated subject to the linear constraints (9). Notice that in the absence of negative entries the GRAS method coincides with the RAS method, which is based on the multiplicative pattern (3).

For homothetic testing of the GRAS method, suppose that \({\mathbf{X}} = k{\mathbf{A}} = k{\mathbf{P}} - k{\mathbf{Q}}\). Inserting this homothety into the pattern (24) leads to the following pair of matrix equations:

$${\hat{\mathbf{r}}}{\mathbf{P}}{\hat{\mathbf{s}}} = k{\mathbf{P}},\quad {\hat{\mathbf{r}}}{\mathbf{Q}}{\hat{\mathbf{s}}} = \frac{1}{k}{\mathbf{Q}}.$$

The transition to Hadamard’s products in the left-hand sides of these equations gives

$$\left( {{\mathbf{rs}}^{\prime } } \right) \circ {\mathbf{P}} = k{\mathbf{P}},\quad \left( {{\mathbf{rs}}^{\prime } } \right) \circ {\mathbf{Q}} = \frac{1}{k}{\mathbf{Q}}$$

from which it follows that

$${\mathbf{rs}}^{\prime } = k{\mathbf{e}}_{N} {\mathbf{e}}_{M}^{\prime } ,\quad {\mathbf{rs}}^{\prime } = \frac{1}{k}{\mathbf{e}}_{N} {\mathbf{e}}_{M}^{\prime }$$
(25)

where \({\mathbf{e}}_{N} {\mathbf{e}}_{M}^{\prime }\) is outer product of two summation vectors, i.e., the matrix of dimension N × M with unit elements.

It is easy to see that the conditions (25) are met simultaneously if and only if \(k = \pm 1\), or \({\mathbf{X}} = \pm {\mathbf{A}}\). Hence, the matrix homothety \({\mathbf{X}} = k{\mathbf{A}} = k{\mathbf{P}} - k{\mathbf{Q}}\) cannot be a feasible solution for the GRAS method at any other value of k (\(k \ne \pm 1\)). Thus, the GRAS method does not pass the homothetic test.

The strict structure of the mixed additive–multiplicative pattern (24) does not give any opportunity for improving the GRAS method on the basis of the homothetic paradigm without its considerable redevelopment. Here the main problem seems to be the inexact correspondence between the GRAS method's objective function and the Kullback–Leibler divergence, which Lemelin (2009) has rightly pointed out. Indeed, if the initial matrix A contains some negative entries, then by virtue of the Kullback–Leibler probabilistic interpretation the nonlinear function (6) can be rewritten as

$$f_{KL} ({{{\mathbf{X}}^{ + } } \mathord{\left/ {\vphantom {{{\mathbf{X}}^{ + } } {x^{ + } }}} \right. \kern-0pt} {x^{ + } }};{{{\mathbf{A}}^{ + } } \mathord{\left/ {\vphantom {{{\mathbf{A}}^{ + } } {a^{ + } }}} \right. \kern-0pt} {a^{ + } }}) = \sum\limits_{n = 1}^{N} {\sum\limits_{m = 1}^{M} {\frac{{\left| {\,x_{nm} } \right|}}{{x^{ + } }}\ln \left( {\frac{{a^{ + } }}{{x^{ + } }}\frac{{\left| {\,x_{nm} } \right|}}{{\left| {\,a_{nm} } \right|}}} \right)} } = \frac{1}{{x^{ + } }}\sum\limits_{n = 1}^{N} {\sum\limits_{m = 1}^{M} {\left| {\,x_{nm} } \right|\,\,\ln \left( {\frac{{\left| {\,x_{nm} } \right|}}{{\left| {\,a_{nm} } \right|}}} \right)} } + \ln \frac{{a^{ + } }}{{x^{ + } }}$$
(26)

where \(x^{ + } = {\mathbf{e}}_{N}^{\prime } {\mathbf{X}}^{ + } {\mathbf{e}}_{M} = \sum\nolimits_{n = 1}^{N} {\sum\nolimits_{m = 1}^{M} {\left| {x_{nm} } \right|} }\) and \(a^{ + } = {\mathbf{e}}_{N}^{\prime } {\mathbf{A}}^{ + } {\mathbf{e}}_{M} = \sum\nolimits_{n = 1}^{N} {\sum\nolimits_{m = 1}^{M} {\left| {a_{nm} } \right|} }\). Notice that now the value of \(x^{ + } \ne {\mathbf{e}}_{N}^{\prime } {\mathbf{u}} = {\mathbf{e}}_{M}^{\prime } {\mathbf{v}}\) is not known a priori, in contrast to the case (6) in which all initial matrix entries are implied to be nonnegative.

Factorizing the unknown variables in (26) as \(\left| {\,x_{nm} } \right| = \left| {\,a_{nm} } \right|\;z_{nm}\), one can introduce the new variables \(z_{nm}\) and then obtain the following objective function:

$$f_{KL} ({\mathbf{Z}};{\mathbf{A}}^{ + } ) = \frac{{\sum\nolimits_{n = 1}^{N} {\sum\nolimits_{m = 1}^{M} {\left| {a_{nm} } \right|z_{nm} \ln z_{nm} } } }}{{\sum\nolimits_{n = 1}^{N} {\sum\nolimits_{m = 1}^{M} {\left| {a_{nm} } \right|z_{nm} } } }} - \ln \sum\limits_{n = 1}^{N} {\sum\limits_{m = 1}^{M} {\left| {a_{nm} } \right|z_{nm} } } + \ln \sum\limits_{n = 1}^{N} {\sum\limits_{m = 1}^{M} {\left| {a_{nm} } \right|} }$$
(27)

that should be minimized subject to the linear marginal total constraints

$$\sum\limits_{m = 1}^{M} {a_{nm} z_{nm} } = u_{n} ,\;\;n = 1 \div N,\quad \sum\limits_{n = 1}^{N} {a_{nm} z_{nm} } = v_{m} ,\;\;m = 1 \div M.$$
(28)

It is interesting to compare the three summands on the right-hand side of (27) with the two summands of the GRAS method's objective function

$$f({\mathbf{Z}};{\mathbf{A}}^{ + } ) = \sum\limits_{n = 1}^{N} {\sum\limits_{m = 1}^{M} {\left| {a_{nm} } \right|z_{nm} \ln \frac{{z_{nm} }}{e}} } = \sum\limits_{n = 1}^{N} {\sum\limits_{m = 1}^{M} {\left| {a_{nm} } \right|z_{nm} \ln z_{nm} } } - \sum\limits_{n = 1}^{N} {\sum\limits_{m = 1}^{M} {\left| {a_{nm} } \right|z_{nm} } }$$
(29)

that was considered in Lenzen et al. (2007). Let the target matrix be equal to the initial matrix, i.e., \({\mathbf{X}} = {\mathbf{A}}\), from which it follows that \({\mathbf{Z}} = {\mathbf{e}}_{N} {\mathbf{e}}_{M}^{\prime }\). Then we have \(f({\mathbf{e}}_{N} {\mathbf{e}}_{M}^{\prime } ;{\mathbf{A}}^{ + } ) = - a^{ + }\), whereas \(f_{KL} ({\mathbf{e}}_{N} {\mathbf{e}}_{M}^{\prime } ;{\mathbf{A}}^{ + } ) = - \ln a^{ + } + \ln a^{ + } = 0\). Therefore, the GRAS method's objective function (29) may take negative values, and the identity of indiscernibles is not satisfied for it, in contrast to the function (27). Thus, there is no reason to consider (29) a distance (or even quasi-distance) function.

Note that the first summand on the right-hand side of (29) is just the original GRAS objective function proposed by Junius and Oosterhaven (2003). It coincides with the numerator of the first summand in (27) and, hence, cannot be considered a measure of "information distance" from the viewpoint of dimensional analysis.

It is to be emphasized that the nonlinear minimization problem (27), (28) is computationally much more complicated than the GRAS problem of minimizing the objective function (29) subject to the constraints (28). The problem (27), (28) is likely to deserve further analytical examination, although Lemelin (2009) has proposed to solve it numerically.
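Following the suggestion of a numerical treatment, a minimal sketch of solving (27) subject to (28) with a general-purpose solver (SciPy's SLSQP, placeholder data containing one negative entry; this is not the GRAS algorithm itself, and one redundant marginal constraint is dropped) could look as follows.

```python
import numpy as np
from scipy.optimize import minimize

A = np.array([[ 4.0, 2.0, 1.0],
              [-1.0, 5.0, 2.0]])               # placeholder matrix with a negative entry
u = np.array([8.0, 7.0])                       # target row totals (consistent: sum u = sum v)
v = np.array([3.0, 8.0, 4.0])                  # target column totals
N, M = A.shape
a = A.flatten(order='F')                       # a = vec A (column stacking)
abs_a = np.abs(a)
a_plus = abs_a.sum()

def f_kl(z):
    """Objective (27) in the variables z_nm = |x_nm| / |a_nm|."""
    s = abs_a @ z
    return (abs_a * z) @ np.log(z) / s - np.log(s) + np.log(a_plus)

# Marginal-total constraints (28); the last column constraint is dropped as redundant.
G = np.kron(np.ones(M), np.eye(N)) * a         # rows encoding sum_m a_nm z_nm = u_n
H = np.kron(np.eye(M), np.ones(N)) * a         # rows encoding sum_n a_nm z_nm = v_m
C = np.vstack([G, H[:-1]])
t = np.concatenate([u, v[:-1]])
cons = {"type": "eq", "fun": lambda z: C @ z - t}

res = minimize(f_kl, x0=np.ones(N * M), method="SLSQP",
               bounds=[(1e-9, None)] * (N * M), constraints=[cons])
Z = res.x.reshape(M, N).T
X = A * Z                                      # target matrix with x_nm = a_nm * z_nm
# X should now reproduce u and v up to the solver's tolerance.
```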

In turn, it is easy to see that in homothetic testing with \({\mathbf{X}} = k{\mathbf{A}}\) and \({\mathbf{Z}} = k{\mathbf{e}}_{N} {\mathbf{e}}_{M}^{\prime }\) the nonnegative objective function (27) reaches its absolute minimum value, since

$$f_{KL} (k{\mathbf{e}}_{N} {\mathbf{e}}_{M}^{\prime } ;{\mathbf{A}}^{ + } ) = \frac{{\sum\nolimits_{n = 1}^{N} {\sum\nolimits_{m = 1}^{M} {\left| {a_{nm} } \right|k\ln k} } }}{{\sum\nolimits_{n = 1}^{N} {\sum\nolimits_{m = 1}^{M} {\left| {a_{nm} } \right|k} } }} - \ln \sum\limits_{n = 1}^{N} {\sum\limits_{m = 1}^{M} {\left| {a_{nm} } \right|k} } + \ln a^{ + } = \ln k - \ln ka^{ + } + \ln a^{ + } = 0.$$

It means that, from the viewpoint of the Kullback–Leibler divergence minimization approach, the matrix homothety \({\mathbf{X}} = k{\mathbf{A}}\) can be classified as the optimal solution for the proportionality case of the general matrix-updating problem. Therefore, the modified GRAS (mGRAS) method based on the minimization problem (27), (28) passes the homothetic test successfully.

10 Numerical examples and concluding remarks

Consider the Eurostat input–output data set given in “Box 14.2: RAS procedure” (see Eurostat, 2008, p. 452) as the basis for several numerical examples. The 3 × 4 initial matrix A combines the entries at the intersections of the columns “Agriculture,” “Industry,” “Services,” “Final d.” with the rows “Agriculture,” “Industry,” “Services” in “Table 1: Input–output data for year 0.” Note that all the elements of this matrix are positive. The row marginal total vector u of dimension 3 × 1 is the proper part of the column “Output” in “Table 2: Input–output data for year 1,” and the column marginal total vector \({\mathbf{v}}^{\prime }\) of dimension 1 × 4 consists of the proper entries of the row “Total” in the same data source.

Table 1 Initial matrix and the target marginal totals, Eurostat (2008), p. 452
Table 2 The RAS and KM results for updating the data set from Table 1

The initial matrix A and the target marginal totals u, \({\mathbf{v}}^{\prime }\) are marked by italic font in the left and right halves of Table 1, respectively. The results of handling the data from Table 1 by the RAS method with iterative processes (4) or (5) and by Kuroda’s method (KM) of conditionally minimizing the quadratic objective function (7) with \({\mathbf{W}}_{1} = {\mathbf{W}}_{2} = {{{\mathbf{E}}_{NM} } \mathord{\left/ {\vphantom {{{\mathbf{E}}_{NM} } {{\mathbf{e}}_{NM}^{\prime } {\mathbf{E}}_{NM} {\mathbf{e}}_{NM} }}} \right. \kern-0pt} {{\mathbf{e}}_{NM}^{\prime } {\mathbf{E}}_{NM} {\mathbf{e}}_{NM} }}\) subject to the linear marginal total constraints (9) are grouped in Table 2 (a sketch of the RAS scaling loop is given after the table captions below). The next two numerical examples demonstrate the results of applying the WSD and improved WSD methods as well as the WSRD and improved WSRD methods to the same data set at \({\mathbf{W}} = {{{\mathbf{E}}_{NM} } \mathord{\left/ {\vphantom {{{\mathbf{E}}_{NM} } {{\mathbf{e}}_{NM}^{\prime } {\mathbf{E}}_{NM} {\mathbf{e}}_{NM} }}} \right. \kern-0pt} {{\mathbf{e}}_{NM}^{\prime } {\mathbf{E}}_{NM} {\mathbf{e}}_{NM} }}\) (see Tables 3 and 4, respectively). The calculated target matrices appear close to one another.

Table 3 The WSD and iWSD results for updating the data set from Table 1
Table 4 The WSRD and iWSRD results for updating the data set from Table 1
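
For readers who wish to reproduce results like those in Table 2, a minimal sketch of the classical biproportional (RAS) scaling loop is given below. It assumes the standard alternating row and column scaling behind iterative processes (4) and (5), a strictly positive initial matrix, and marginal totals satisfying the consistency condition (2); the function name `ras` and the tolerance settings are illustrative.

```python
# Minimal sketch of the classical biproportional (RAS) scaling loop.
import numpy as np

def ras(A, u, v, tol=1e-10, max_iter=1000):
    X = A.astype(float).copy()
    for _ in range(max_iter):
        r = u / X.sum(axis=1)          # row multipliers
        X = X * r[:, None]
        s = v / X.sum(axis=0)          # column multipliers
        X = X * s[None, :]
        if (np.abs(X.sum(axis=1) - u).max() < tol
                and np.abs(X.sum(axis=0) - v).max() < tol):
            break
    return X
```

For a small data set such as the 3 × 4 Eurostat example, the loop typically converges within a handful of iterations.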

The following numerical example is intended to verify the response of the WSD and WSRD methods to homothetic testing at k = 5 (see Table 5); a small test harness for such checks is sketched after the table caption below. It is easy to see that the WSD target matrix differs from 5A very significantly, whereas the difference between the WSRD target matrix and 5A is noticeably smaller.

Table 5 The WSD and WSRD homothetic test results for the data set from Table 1 at k = 5
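
Homothetic tests such as the one reported in Table 5 can be automated: scale the initial marginal totals by k, run the updating method, and compare the result with kA. The sketch below assumes an updating routine with the signature `update(A, u, v) -> X` (for example, the RAS sketch above); the function name and tolerance are illustrative.

```python
# Generic homothetic test harness for any matrix-updating routine.
import numpy as np

def homothetic_test(update, A, k=5.0, tol=1e-6):
    u = k * A.sum(axis=1)                  # proportional row totals k * u_A
    v = k * A.sum(axis=0)                  # proportional column totals k * v_A
    X = update(A, u, v)
    return np.abs(X - k * A).max() < tol   # passes if X coincides with kA
```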

Furthermore, several numerical examples are intended to test some methods’ responses to zero entries in the initial matrix. So let us disturb two elements of initial matrix A in Table 1, say (1, 3) and (2, 1), by setting them equal to zero for years 0 and 1. After recalculating the marginal totals, we get the data set in the left and right halves of Table 6. (The initial matrix A and the target marginal totals u, \({\mathbf{v^{\prime}}}\) are marked by italic font; zero entries are underlined.)

Table 6 Initial matrix with zero entries and the target marginal totals

The left parts of Tables 7, 8 and 9 contain the target matrices calculated by methods that are not based on any multiplicative pattern, namely WSD, improved WSD and Kuroda’s method, respectively. It is easy to check that they do not preserve the location of the zeros of the initial matrix in the target matrix. In contrast, the right parts of Tables 7, 8 and 9 present the results of additionally applying the proposed procedure of reducing the dimension of the solution space in the matrix-updating problem described in Sect. 8 (marked by the letters “rd” after the abbreviation of the method’s title); a simple illustration of this idea is sketched after the table captions below.

Table 7 The WSD and WSDrd results for updating the data set from Table 6
Table 8 The iWSD and iWSDrd results for updating the data set from Table 6
Table 9 The KM and KMrd results for updating the data set from Table 6
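
The sketch below only illustrates the general idea of such a dimension reduction under the assumption that the zero cells are simply excluded from the set of unknowns before the constrained minimization is carried out; it is not a transcription of the exact procedure of Sect. 8, and it uses equal weights in the squared-differences objective.

```python
# Illustration only: keep zero cells at zero by dropping them from the
# unknowns and solving the equally weighted squared-differences problem
# over the remaining cells via its KKT system.
import numpy as np

def wsd_zero_preserving(A, u, v):
    N, M = A.shape
    free = np.flatnonzero(A.ravel() != 0)          # indices of nonzero cells
    K = free.size
    C = np.zeros((N + M, K))                       # marginal-total constraints
    for col, idx in enumerate(free):
        n, m = divmod(idx, M)
        C[n, col] = 1.0                            # row-sum constraint n
        C[N + m, col] = 1.0                        # column-sum constraint m
    b = np.concatenate([u, v])
    a_free = A.ravel()[free]
    # KKT system for min ||x - a_free||^2 s.t. C x = b; the constraints are
    # rank-deficient by one, hence lstsq rather than a plain solve
    KKT = np.block([[2 * np.eye(K), C.T],
                    [C, np.zeros((N + M, N + M))]])
    rhs = np.concatenate([2 * a_free, b])
    sol = np.linalg.lstsq(KKT, rhs, rcond=None)[0]
    X = np.zeros(N * M)
    X[free] = sol[:K]                              # zero cells remain zero
    return X.reshape(N, M)
```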

Several other numerical examples are intended to test some methods’ responses to negative entries in the initial matrix. So let us disturb two elements of initial matrix A in Table 1, say the same elements (1, 3) and (2, 1), by reversing their signs for years 0 and 1. After proper recalculation of the marginal totals, we obtain the data set presented in the left and right halves of Table 10. (The initial matrix A and the target marginal totals u, \({\mathbf{v^{\prime}}}\) are marked by italic font; negative matrix elements are underlined.)

Table 10 Initial matrix with negative entries and the target marginal totals

The left and right halves of Tables 11, 12 and 13 contain the target matrices calculated by methods that, in contrast to RAS, can be used in the presence of negative entries, namely the WSD and improved WSD methods, the WSRD and improved WSRD methods, and Kuroda’s method and the GRAS method, respectively. It is easy to see that they are all quite acceptable at small values of the relative difference \({{\left( {x - a} \right)} \mathord{\left/ {\vphantom {{\left( {x - a} \right)} a}} \right. \kern-0pt} a}\); in particular, for the data set from Table 10,

$${{\left( {x - a} \right)} \mathord{\left/ {\vphantom {{\left( {x - a} \right)} a}} \right. \kern-0pt} a} = {{\left( {{\mathbf{e}}_{N}^{\prime } {\mathbf{u}} - {\mathbf{e}}_{N}^{\prime } {\mathbf{u}}_{{\mathbf{A}}} } \right)} \mathord{\left/ {\vphantom {{\left( {{\mathbf{e}}_{N}^{\prime } {\mathbf{u}} - {\mathbf{e}}_{N}^{\prime } {\mathbf{u}}_{{\mathbf{A}}} } \right)} {{\mathbf{e}}_{N}^{\prime } {\mathbf{u}}_{{\mathbf{A}}} }}} \right. \kern-0pt} {{\mathbf{e}}_{N}^{\prime } {\mathbf{u}}_{{\mathbf{A}}} }} = {{\left( {663.4 - 640} \right)} \mathord{\left/ {\vphantom {{\left( {663.4 - 640} \right)} {640 = 3.7\% }}} \right. \kern-0pt} {640 = 3.7\% }}.$$

Table 11 The WSD and iWSD results for updating the data set from Table 10
Table 12 The WSRD and iWSRD results for updating the data set from Table 10
Table 13 The KM and GRAS results for updating the data set from Table 10

Setting the new target marginal totals, say, by doubling them, i.e., at

$${{\left( {x - a} \right)} \mathord{\left/ {\vphantom {{\left( {x - a} \right)} a}} \right. \kern-0pt} a} = {{\left( {2{\mathbf{e}}_{N}^{\prime } {\mathbf{u}} - {\mathbf{e}}_{N}^{\prime } {\mathbf{u}}_{{\mathbf{A}}} } \right)} \mathord{\left/ {\vphantom {{\left( {2{\mathbf{e}}_{N}^{\prime } {\mathbf{u}} - {\mathbf{e}}_{N}^{\prime } {\mathbf{u}}_{{\mathbf{A}}} } \right)} {{\mathbf{e}}_{N}^{\prime } {\mathbf{u}}_{{\mathbf{A}}} }}} \right. \kern-0pt} {{\mathbf{e}}_{N}^{\prime } {\mathbf{u}}_{{\mathbf{A}}} }} = {{\left( {1326.8 - 640} \right)} \mathord{\left/ {\vphantom {{\left( {1326.8 - 640} \right)} {640 = 107.3\% }}} \right. \kern-0pt} {640 = 107.3\% }},$$

we obtain new target matrices calculated by the same methods; they are presented in Tables 14, 15 and 16. It should be emphasized that only the iWSD and iWSRD methods and Kuroda’s method show acceptable results here, whereas the WSD, WSRD and GRAS methods perform poorly. Recall that precisely these methods fail the proposed homothetic test (see Sects. 5, 7 and 9, respectively).

Table 14 The WSD and iWSD results for the doubled marginal totals from Table 10
Table 15 The WSRD and iWSRD results for the doubled marginal totals from Table 10
Table 16 The KM and GRAS results for the doubled marginal totals from Table 10

The next numerical example is intended to verify the response of the GRAS method to homothetic testing at k = 2 and k = 10 (see Table 17). It is easy to see that the GRAS target matrices differ from 2A and 10A very significantly. For instance, \(2a_{13} = - 20\) and \(2a_{21} = - 40\), whereas \(x_{13} = - 5.62\) and \(x_{21} = - 16.86\). In turn, at k = 10 we have \(10a_{13} = - 100\) and \(10a_{21} = - 200\), whereas \(x_{13} = - 1.14\) and \(x_{21} = - 6.13\). Moreover, there are other large discrepancies between the matrix elements in the columns where the negative entries are located, namely columns 1 and 3.

Table 17 The GRAS homothetic test results for the data set from Table 10 at k = 2 and 10
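
For reference, the following sketch reimplements a standard GRAS-type iteration in the spirit of Junius and Oosterhaven (2003), with the sign treatment discussed by Lenzen et al. (2007). It is an illustrative reconstruction rather than the exact code behind Tables 16 and 17; in particular, the handling of rows or columns without positive entries is simplified.

```python
# Sketch of a standard GRAS iteration for matrices with negative entries.
import numpy as np

def gras(A, u, v, tol=1e-10, max_iter=1000):
    P = np.where(A > 0, A, 0.0)          # positive part of A
    Nm = np.where(A < 0, -A, 0.0)        # absolute value of the negative part
    r = np.ones(A.shape[0])
    s = np.ones(A.shape[1])

    def solve_multiplier(pos, neg, target):
        # positive root of pos*x**2 - target*x - neg = 0 (the GRAS multiplier
        # equation); simplified fallbacks for degenerate rows/columns
        with np.errstate(divide="ignore", invalid="ignore"):
            root = (target + np.sqrt(target ** 2 + 4 * pos * neg)) / (2 * pos)
            fallback = -neg / target
        return np.where(pos > 0, root, np.where(neg > 0, fallback, 1.0))

    for _ in range(max_iter):
        r = solve_multiplier(P @ s, Nm @ (1 / s), u)        # row multipliers
        s = solve_multiplier(P.T @ r, Nm.T @ (1 / r), v)    # column multipliers
        X = r[:, None] * P * s[None, :] - Nm / r[:, None] / s[None, :]
        if (np.abs(X.sum(axis=1) - u).max() < tol
                and np.abs(X.sum(axis=0) - v).max() < tol):
            break
    return X
```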

In conclusion, it should be emphasized that the homothetic paradigm expresses an important and helpful property of any matrix updating method. Moreover, a positive response to homothetic testing serves as additional evidence of the plausibility and correctness of the tested method.

The homothetic paradigm leans on the common notion of orthogonal projection, which is arguably the most powerful concept in econometrics. It has an obvious logical interpretation, corresponds to the famous principle of insufficient reason, and remains operational in a range of practical situations.

If a matrix updating method emanates from some distance minimization problem, it can be fully adapted to homothetic testing (see Sect. 6). If, at the same time, the zero-preservation property is considered desirable, the method must be slightly redeveloped to ensure it, with a concomitant reduction of the dimensionality of the underlying minimization problem but without its reformulation (see Sect. 8).

In this article, we have considered four methods of matrix updating that can be used for handling matrices with some negative entries, namely Kuroda’s, iWSD, iWSRD and GRAS (including mGRAS) methods. A critical analysis of the GRAS methodology is given in Sect. 9 (see also Tables 16 and 17).

Because of its additive nature, the iWSD method obviously does not work particularly well if the initial matrix contains some very small entries together with some very large ones (a practically important case!). The final numerical examples are intended to test the response of the above-mentioned methods to the appearance of a small entry in the initial matrix. So let us disturb one element of initial matrix A in Table 10, say entry (3, 4), by multiplying its value by \(10^{ - 4}\) for years 0 and 1. After proper recalculation of the marginal totals, we obtain the data set presented in the left and right halves of Table 18. (The initial matrix A and the target marginal totals u, \({\mathbf{v}}^{\prime }\) are marked by italic font; the small elements (3, 4) are underlined.)

Table 18 Initial matrix with one small entry and the target marginal totals

The results of handling the data from Table 18 by the iWSD, Kuroda’s and iWSRD methods are presented in Tables 19, 20 and 21, where the elements of matrices Q are determined by formula (16).

Table 19 The iWSD results for updating the data set from Table 18
Table 20 The KM results for updating the data set from Table 18
Table 21 The iWSRD results for updating the data set from Table 18

As expected, the iWSD method unjustifiably reverses the sign of \(a_{34}\) in the target element \(x_{34}\) and increases the initial absolute value more than 80-fold, although \({x \mathord{\left/ {\vphantom {x a}} \right. \kern-0pt} a} = {{{\mathbf{e}}_{N}^{\prime } {\mathbf{u}}} \mathord{\left/ {\vphantom {{{\mathbf{e}}_{N}^{\prime } {\mathbf{u}}} {{\mathbf{e}}_{N}^{\prime } {\mathbf{u}}_{{\mathbf{A}}} }}} \right. \kern-0pt} {{\mathbf{e}}_{N}^{\prime } {\mathbf{u}}_{{\mathbf{A}}} }}\) equals only \({{559.09} \mathord{\left/ {\vphantom {{559.09} {542.01 = 1.03}}} \right. \kern-0pt} {542.01 = 1.03}}\) (see Table 19). In this situation, Kuroda’s method works somewhat better, without sign reversal and with an increase of the initial value by a factor of only about 10 (see Table 20). Finally, the iWSRD method appears to be the most preferable in this example, with Q’s elements scattered only from 0.92 to 1.07 (see Table 21).

Thus, one can assert that the improved WSRD method has an advantage over the other matrix updating methods in the practically important situations when the initial matrix has some very small entries and some very large ones simultaneously (because of the multiplicative nature of the iWSRD objective function).

Notice that the minimization problems associated with Kuroda’s and the iWSRD methods have almost the same computational complexity. It can be shown that solving them requires calculating the inverse of a symmetric matrix of order min{N, M} − 1 with diagonal blocks on its main diagonal.

Abbreviations

KM: Kuroda’s method
KMrd: Kuroda’s method with a solution space of reduced dimension
WSD method: method of weighted squared differences
WSDrd: WSD method with a solution space of reduced dimension
iWSD method: improved WSD method
WSRD method: method of weighted squared relative differences
WSRDrd: WSRD method with a solution space of reduced dimension
iWSRD method: improved WSRD method
GRAS method: generalized RAS method
mGRAS method: modified GRAS method

References

  • Eurostat (2008) European manual of supply, use and input–output tables. Methodologies and working papers. Office for Official Publications of the European Communities, Luxembourg

  • Günlük-Şenesen G, Bates JM (1988) Some experiments with methods of adjusting unbalanced data matrices. J R Stat Soc Ser A (Stat Soc) 151(3):473–490


  • Harthoorn R, van Dalen J (1987) On the adjustment of tables with Lagrange multipliers. Occasional paper NA-024. The Netherlands Central Bureau of Statistics, National Accounts Research Division

  • Junius T, Oosterhaven J (2003) The solution of updating or regionalizing a matrix with both positive and negative entries. Econ Syst Res 15(1):87–96


  • Kullback S (1959) Information theory and statistics. Wiley, New York


  • Kullback S, Leibler RA (1951) On information and sufficiency. Ann Math Stat 22(1):79–86


  • Kuroda M (1988) A method of estimation for the updating transaction matrix in the input–output relationships. In: Uno K, Shishido S (eds) Statistical data bank systems. Socio-economic database and model building in Japan. North Holland, Amsterdam, pp 43–56


  • Lemelin A (2009) A GRAS variant solving for minimum information loss. Econ Syst Res 21(4):399–408


  • Lenzen M, Wood R, Gallego D (2007) Some comments on the GRAS method. Econ Syst Res 19(4):461–465


  • Magnus JR, Neudecker H (2007) Matrix differential calculus with applications in statistics and econometrics, 3rd edn. Wiley, Chichester


  • Miller RE, Blair PD (2009) Input–output analysis: Foundations and extensions, 2nd edn. Cambridge University Press, Cambridge


  • Temurshoev U, Webb C, Yamano N (2011) Projection of supply and use tables: methods and their empirical assessment. Econ Syst Res 23(1):91–123



Acknowledgements

The author would like to thank the anonymous referee and the participants of the International Conference on Economic Structures of PAPAIOS in Tokyo, Japan, March 17–19, 2017, and the 19th National Conference of The Input–Output Research Association (IORA) of India in Pune, India, January 11–12, 2017, for their helpful comments and suggestions.

Competing interests

The author declares that he has no competing interests.

Availability of data and materials

The data set supporting the conclusions of this article is available in Eurostat (2008, p. 452).

Consent for publication

Not applicable.

Ethics approval and consent to participate

Not applicable.

Funding

Not applicable.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Author information


Correspondence to Vladimir Motorin.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.


About this article


Cite this article

Motorin, V. Enhancing the distance minimization methods of matrix updating within a homothetic paradigm. Economic Structures 6, 36 (2017). https://doi.org/10.1186/s40008-017-0094-7

