Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

A novel adaptive quasi-Newton-type update and its global convergence without Lipschitz condition for constrained system of nonlinear monotone equations

  • Kabiru Ahmed,

    Roles Investigation, Methodology, Writing – original draft, Conceptualization, Formal analysis

    Affiliations Department of Mathematical Sciences, Bayero University, Kano, Nigeria, Department of Mathematics, Federal University, Dutse, Nigeria

  • Hatem E. Semary,

    Roles Funding acquisition, Resources, Supervision, Validation, Writing – review & editing

    Affiliation Faculty of Informatics and Computing, Universiti Sultan Zainal Abidin, Terengganu, Malaysia

  • Asmaa S. Al-Moisheer,

    Roles Funding acquisition, Project administration, Resources, Validation, Writing – review & editing

    Affiliation Faculty of Informatics and Computing, Universiti Sultan Zainal Abidin, Terengganu, Malaysia

  • Sulaiman M. Ibrahim ,

    Roles Software, Visualization, Writing – review & editing

    sulaimancga@gmail.com

    Affiliations Department of Mathematics and Statistics, Faculty of Science, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh, Saudi Arabia, College of Applied and Health Sciences, A’Sharqiyah University, Ibra, Sultanate of Oman

  • Abubakar Sani Halilu,

    Roles Conceptualization, Formal analysis, Methodology, Writing – original draft

    Affiliation Faculty of Education and Arts, Sohar University, Sohar, Oman

  • Muhammed Yusuf Waziri,

    Roles Investigation, Project administration, Supervision, Validation

    Affiliation Department of Mathematical Sciences, Bayero University, Kano, Nigeria

  • Mohamad Afendee Mohamed,

    Roles Project administration, Supervision, Validation, Writing – review & editing

    Affiliation Faculty of Education and Arts, Sohar University, Sohar, Oman

  • Salisu Murtala

    Roles Data curation, Resources, Software, Validation, Writing – review & editing

    Affiliation Faculty of Education and Arts, Sohar University, Sohar, Oman

Abstract

This paper presents a new iterative method with a restart feature for solving constrained system of nonlinear monotone equations. The scheme, which is a double-parameter method, was initiated by considering a positive-definite adaptation of the quasi-Newton update proposed by Andrei (J. Comput. Appl. Math. 332, 26–44 (2018)) for unconstrained optimization. The two scaling parameters of the method are obtained by employing the measure function by Byrd and Nocedal (SIAM J. Numer. Anal. 26, 727–739 (1989)), which ensures that the condition number of the update is minimized. Another important attribute of the method is that its global convergence analysis is conducted without the Lipschitz assumption, which is a strong condition. Furthermore, the two parameters embedded in the scheme help in maintaining a balance in the distribution of the eigenvalues of its update matrix. The method converges globally regardless of the line search procedure employed. Numerical experiments with some methods in the literature show that the scheme is effective.

1 Introduction

The interest in this paper is solution of the constrained system of nonlinear equations

(1)

where is a nonempty closed convex set and F is a continuous vector-valued mapping, which is also monotone, namely, it satisfies

(2)

Real life applications of (1) can be found in studies such as general equilibrium problems in [1,2], the control systems analyzed in [3], networking and communications [4], modelling and data estimation [5], and in compressed sensing problems [614]. Some of the methods employed in solving (1) include adaptive versions of Newton’s method and its enhanced version, the quasi-Newton (QN) schemes [1518], the derivative-free methods [1924], Levenberg-Marquardt method [25], trust-region method [26] as well as spectral projection methods [27,28]. Despite their fast convergence properties, adaptations of Newton and QN methods have not been widely studied and few of the schemes for solving (1) have been developed. A viable reason for this is the huge memory required at each iteration to store matrices generated by the methods. The classical Newton and QN methods were developed to solve the optimization problem

(3)

with f being a smooth real-valued nonlinear function, with gradient . The sequence of iterates generated by these methods are obtained via the formula

where represents a step-size that is usually calculated using a line search procedure, and is search direction that is computed via

in Newton’s method, or

(4)

in QN method, where is a matrix known as the true Hessian or its approximation Bk in the QN method. The first QN update was developed by Davidon [29] with subsequent analysis by Fletcher and Powell [30], hence the name DFP after the three authors. The update is defined by

(5)

in which , . By setting Hk to be the identity matrix, the memoryless version of (6) is obtained, namely

A DFP scheme with the update (6) exhibit some nice properties, which includes quadratic termination, hereditary property, and generating conjugate directions for a quadratic objective function and an exact line search procedure. It retains positive-definiteness for general nonlinear functions, when the curvature condition is satisfied. Iterations of the scheme require and it has super-linear rate of convergence. Also, for strictly convex functions under the exact line search rule, the DFP scheme converges globally [31]. Notwithstanding these attributes, numerical studies and computations conducted have shown that the DFP update is usually unstable. This is due to the loss of positive-definiteness along some iterative points as well as the distribution of its eigenvalues. The Broyden [32], Fletcher [33], Goldfarb [34], and Shanno [35] (BFGS) update is the most prominent QN scheme available. Being a member of the Broyden family, the BFGS update is given by

(6)

where sk and yk remain as defined in (6). For the purpose of practical computations, search directions of the QN methods (5) are usually expressed in closed form as

(7)

where denotes the inverse approximation to the Hessian of the BFGS method. By applying the popular Woodbury formula [36] twice on (8), the inverse is obtained as

(8)

In order to avoid computing the matrix Hk, which involves huge storage at each iteration, it is replaced with the identity matrix I and the so called memoryless update is obtained, i.e.,

(9)

Further analysis of (8) by Andrei [37] shows that efficiency of the BFGS method depends so much on structure of its eigenvalues. It was also shown by Powell [38] and Byrd et al. [39] that the BFGS method suffers more from large eigenvalues of (8) than small ones. By conducting some numerical experiments on the BFGS method, Gill and Leonard [40] discovered that large number of iterations or functions and gradient evaluations may be required on some problems for the scheme to converge. These shortcomings of the scheme were shown in [40] to mostly emanate from poor initial Hessian approximations or ill-conditioning along the iterations. To address these problems, some scaling procedures were applied on the update in (8). By scaling the third term in (8) with a positive parameter , Biggs [41] gave the modification

Oren and Luenberger [42] scaled the first two terms of the update in (8) to obtain

where is a positive parameter. By adopting the technique of changing structure of eigenvalues Andrei [43] presented a two-parameter BFGS method, with Bk+1 given as

with and being positive parameters. Here, is computed to ensure that eigenvalues of Bk+1 are clustered, while is obtained so that large eigenvalues of Bk+1 are reduced, i.e., the eigenvalues are shifted to the left. This strategy brings about a better distribution of the eigenvalues. A modification of (11) was proposed in [44], namely

(10)

where and are parameters that are determined by employing Byrd and Nocedal’s measure function in [45]. The method proves to be computationally cheap and also has superlinear rate of convergence on practical problems.

In order to address some of the shortfalls of the DFP (6) and BFGS (8) updates and address some of their shortfalls, some variants for solving problem (3) have been proposed in the literature. Based on the matrix norm and an upper bound for condition number of the scaled version of (11), Babaie-Kafaki [46] proposed a BFGS-type method for the unconstrained optimization problem (3). The author proved that the method satisfy the descent condition and conducted numerical experiments to show its effectiveness. Also, Babaie-Kafaki and Aminifard [47] proposed two parameter scaled memoryless BFGS update in (10) for the problem (3) with a monotone choice for the initial step length. The authors proved the sufficient descent condition of the scheme for uniformly convex functions. Aminifard and Babaie-Kafaki [48] developed a DFP-type method for solving (3) by analyzing eigenvalues of a scaled memoryless variant of the DFP update (6), where the scaled parameter is obtained by skipping the direction of maximum magnification. By exploiting a spectral secant condition for the classical DFP update (6), Dargahi et al. [49] proposed a DFP-type method for (3), where the updating matrix is obtained such that its condition number is minimized. Bakhshinejad and Babaie-Kafaki [50] also proposed a DFP-type algorithm as an extension of the search direction obtained by applying the DFP update (6). The method was shown to satisfy the much required sufficient descent condition.

In order to exploit some of the advantages of the DFP and BFGS updates and address their shortcomings, some adaptive versions for solving (1) and its unconstrained form have been proposed in recent years. In line with this, Mamat et al. [51] proposed a DFP-type algorithm for solving the symmetric form of problem (1) by employing a scaled memoryless version of the DFP update (6). The authors proved global convergence of the scheme by employing some basic assumptions. As an enhancement of the classical DFP update (6), Dauda [52] proposed a three-term DFP-type method for solving the symmetric form of (1). The authors proved its global convergence using some mild conditions. Based on a scaled memoryless version of the DFP update (6), Awwal et al. [53] proposed a derivative-free method for solving (1). The authors proved global convergence of the method and compared its performance with some methods for solving (1). Inspired by a scaled version of the memoryless DFP update (6), another derivative-free method was proposed by Rao and Huang [54] for the unconstrained form of (1). Global convergence of the method was shown by employing some mild conditions. By applying a scaling technique and the measure function of Byrd and Nocedal [45] on a scaled memoryless DFP update (6), Ullah et al. [55] proposed a derivative-free DFP-type method for the constrained problem (1). Both global convergence and numerical efficiency were shown by the authors. In a recent development, Rehman et al. [56] proposed another version of a derivative-free DFP method for solving (1). The method converges globally and its effectiveness was illustrated by comparison with some recent methods in the literature. Recently, by computing an optimal value for the scaled memoryless BFGS update (11), Shah et al. [57] proposed a scaled derivative-free BFGS method for solving (1). The authors proved global convergence of the scheme and demonstrated its application to motion control. For more recent methods for solving (1), the reader can refer to works in [5865]. The objectives of this paper are as follows:

  • To develop a QN-type algorithm close to the BFGS method, for solving problem (1), which avoids the explicit computation of any matrix in the algorithm.
  • To develop a scheme that exhibits some nice attributes of the classical BFGS update, addresses some of its shortcomings, and which possess the vital property for analyzing the convergence for problem (1).
  • To develop a globally convergent algorithm without the Lipschitz condition and to analyze its numerical performance in solving problem (1).

The paper is structured as follows: Motivation and details of the method are given in the following section. Analysis of its global convergence is given in Section 3. Results of some numerical experiments of the scheme and some recent methods in the literature are presented in Section 4. Conclusions are given in Section 5.

2 An adaptive QN-type method

An adaptive QN-type scheme will be derived in this section. To achieve that, the following lemma is imperative:

Lemma 2.1. Cauchy Schwarz inequality

Let be an inner product space over the field of complex numbers with inner product . Then, for every pair of vectors , the following inequality holds.

(11)

As stated in the introduction section, adaptations of the iterative methods for the problem in (3) have been developed over the years. However, to the best of our knowledge few QN-type schemes exists in the literature. The ones available include the derivative-free DFP-type method proposed by Awwal et al. [53], where the authors scaled the first term of the classical DFP update (6) with a positive parameter to present

(12)

where represents a positive parameter that is computed such that the condition for analyzing global convergence holds for the scheme. Setting in (14) yields

(13)

where , is a scaling parameter. By substituting (15) into (9), the authors obtained the search direction

with , . The choice of suggested by the authors only guarantees the condition for analyzing global convergence holds, it does not ensure stability of the proposed method. Following the work by Awwal et al. [53], Ullah et al. [55] scaled the third term of the memoryless version of the DFP update (6) with a positive parameter to obtain

(14)

where ,which is derived by employing the measure function of Nocedal and Byrd [45] is obtained as . Using this value of , the authors presented search direction of the scheme as

(15)

The value of the scaling parameter obtained reduces the chances of the DFP updating matrix from being ill-conditioned. Still, it does not ensure that the search direction (18) satisfies the condition for analyzing global convergence. Following this, Shah et. al. [57] proposed another QN-type method for solving (1) by scaling the first and second term of the memoryless version of the Hessian approximation of the BFGS update [66], namely

(16)

The authors minimized the condition number of (19) to obtain the choice of as resulting in the search direction

In (19), the parameter does not always ensure stability of the BFGS method as the third term on the right may grow larger producing a shift of the eigenvalues to the right. In a recent attempt to address the shortfalls of the DFP update (6), Rehman et. al. [56] scaled the first and second terms of the memoryless DFP update (6), namely

(17)

with being a positive parameter whose optimal value was obtained as the minimizer of the condition number of Hk+1 which is given as

(18)

By substituting (22) into the memoryless DFP update (21), the authors obtained

(19)

and the search direction

Now, trace of the matrix Hk+1 in (23) can be computed as

(20)

It can be seen that the third-term on the right of (25) will shift the eigenvalues of Hk+1 towards the right making the matrix to have large eigenvalues. On the other hand, the second term being negative tends to shift the eigenvalues to the left. Even though this effect is countered by the first term, which is positive, it does not ensure proper distribution of the eigenvalues of Hk+1, and by extension stability of the scheme is not guaranteed.

Remark 2.2 As explained above, while the choices of the parameters and in (15), (17), (19), and (21) may enable the scheme generated with the corresponding matrix Hk+1 to satisfy the vital condition for analyzing global convergence, they do not ensure its stability – a condition attributed to uneven distribution of eigenvalues of Hk+1. In addition to this, like most of the published works in the literature for the constrained problem (1), the methods in [5356] depend on the Lipschitz condition for proving their global convergence. And since not all problems satisfy the Lipschitz condition, the number of problems analyzed tend to be limited.

Now, by replacing sk with , yk with Fk−1, Bk+1 with Qk in (8), and motivated by the scaling strategy in (12) and the above remark, we present the following two parameter scaled symmetric rank-two quasi-Newton type update,

(21)

in which and

(22)

Setting Qk−1 as the identity matrix yields the following scaled memoryless version of (26):

(23)

Now, in order to define search direction of the proposed scheme, we require the inverse of the update Qk and appropriate values for the positive parameters and . To achieve that goal, we first show that Qk is positive-definite. Assuming that , with , then we have that and . This further implies that a set of mutually orthonormal vectors exists such that , which determines that

Thus, Qk contains n − 2 eigenvalues equal to each. We label the two eigenvalues left as and .

From (28), the trace of Qk is obtained as

(24)

which then reveals that

(25)

Applying the Sherman-Morrison formula [36], the inverse of Qk is computed as

(26)

Since has n − 2 repeated eigenvalues equal to each, labelling the remaining two as and , we obtain from (31) that

(27)

Also, expressing as the product of two matrices yields

Let and , then

(28)

By applying the formula for determinant of an arbitrary matrix of rank two, namely

and setting , , , and , we obtain

(29)

Therefore, from (33) and (34), we obtain

(30)

Now, combining (32), (35) with some simplification, we get

(31)

Using (30) and (36), we obtain

which can also be expressed as

(32)

Now, by applying the inequality in (13) to (37), we obtain

(33)

Also, by employing the Cauchy Schwarz inequality in (13) and applying similar argument, we have that , which applying (38) consequently yields

(34)

Following (39), and since is a positive parameter, we conclude that the matrix Qk is positive-definite, which implies that is also positive-definite. We are now set to employ the measure function of Byrd and Nocedal [45] to compute approximations of and that will ensure well-conditioning of the matrix Qk as well as better distribution of its eigenvalues. The measure function is formulated as

(35)

From (28) and applying the same approach employed to compute the determinant of earlier, we obtain the determinant of Qk as

(36)

Now, from (29), (40) and (41), we have

(37)

In order to obtain approximations for and , we have to minimize the function with respect to both and . So, from (42), we get

Setting this result to zero, yields

which implies that

or

(38)

Also,

Setting this result to zero yields

which implies that

or

(39)

Following the above analysis and the fact that restarting search directions tend to improve convergence and efficiency of algorithms [67], we substitute (43) and (44) into (31), and propose the following auxiliary search direction:

(40)

Remark 2.3 It should be noted that the inequality , which triggers the restart process was required for two reasons: one, it is essential for analyzing eigenvalues of the matrix Qk, and two, it helps in proving boundedness of the search direction.

Lemma 2.4 The search direction dk in (45) satisfies the inequality

(41)

Proof: First, from (45) for k = 0, we have

Now, we proceed to analyze each of the two cases in (45) for k ≥ 1.

1st Case: If , we have

2nd Case: If , then from (45) and Cauchy Schwarz inequality, we get

By setting , we established the proof.■

For the line search strategy, we employ the adaptive procedure proposed by Yin et. al. [68], where the step-size is defined as , with ik being the smallest nonnegative integer i for which

(42)

for each , is the projection operator defined as

where , and u > 0.

Now, we described algorithm of the method as follows:

Algorithm 1 (MKAS)

1: Initialization. Select , , , , r > 1, , .

Set , k=0.

2: For , stop. Else, move to 3.

3: Compute as in (27), with as defined earlier such that (48) holds.

4: If and , stop . Otherwise, compute

(43)

is the projection operator, which is defined in [69] with the property

(44)

with

(45)

5: Testing , and . Compute , and .

6: Set if , where , otherwise compute , where r > 1.

7: Set k = k+1. Goto 2.

3 Analysis of convergence

We now proceed to analyze global convergence of the MKAS algorithm: First, we give the following assumptions:

Assumption 3.1 The mapping F is monotone.

Assumption 3.2 There exists such that .

Lemma 3.3 The search direction dk generated in step 6 of the MKAS algorithm satisfy the inequality

(46)

Proof. By applying Cauchy-Schwarz inequality and (46), the first inequality is satisfied. Now, from (45), and for k = 0, we have that , which shows that . For k ≥ 1, we consider the case of . From (45) and Cauchy-Schwarz inequality, we have

Setting , we get

For the second case, we have from (45) and Cauchy Schwarz inequality, that

Setting , we get

Therefore, since N1 is the max of (1,N1,N2), we obtain that

(47)

which proves that the second inequality of (52) holds.■

Lemma 3.4 Given that F in (1) is continuous on , then for each k ≥ 0, there exists nonnegative integer ik such that (48) is satisfied.

Proof. Suppose by contradiction that k ≥ 0 exists for which (48) does not hold for any nonnegative integer i, i.e.,

(48)

The second inequality in (54) was obtained from the fact that . By continuity of F, and letting with , we get

which contradicts

Lemma 3.5 Suppose Assumptions hold and the sequences and are generated by Algorithm 1. Then for a solution , the sequence is convergent and and are bounded.

Furthermore

(49)

Proof: Using (48) and definition of , we have

(50)

By (2) and for all , we have

(51)

Combining the inequality (57) above, (49), (50) and (51), we obtain

(52)

where the last inequality indicates that holds. The implication of this is that is a monotone decreasing sequence that is bounded below. So, is convergent and is bounded. Now, since is bounded and F is continuous, we see that there exists a constant such that for all k ≥ 0,

(53)

Combining (59) with (53) shows that is bounded. Furthermore, from (2), Cauchy Schwarz inequality, and (56), we have

which further yields

which clearly shows the boundedness of , which by continuity of F, indicates boundedness of , namely, a constant exists for which

Combining this result with the last inequality of (58) yields

which further implies that . Hence the proof is obtained. ■

Theorem 3.6 The sequence generated by MKAS Algorithm converges globally to a solution of (1).

Proof. Two cases are analyzed for the proof

1st case: Assuming that , then the boundedness of and continuity of F implies that some cluster points exists for which . Also, since is convergent, it implies that converges to .

2nd case: . In this case we have that a constant exists such that

(54)

From first inequality in (52), we have

Combining this result with (55) yields

(55)

By the boundedness of the sequences and , we see that there exists subsequences and for which

with being an infinite indexing set. From (52), we get

(56)

Letting in (63) with continuity of F and (61), we obtain

(57)

On the other hand, from (48), if , then will not satisfy (48), i.e.,

(58)

By taking limit as , with in (65), (62) and continuity of F, we obtain

which shows a contradiction with (64). Hence the proof is established. ■

4 Numerical experiments

To test the MKAS method’s effectiveness, it is compared with the QN-type methods in [55] labelled SMDFP, [57] labelled SMBFGS and [56] labelled PBDFP. To code the four algorithms, MATLAB R2014a was employed with a PC configured as (2.30ghz cpu, 4GB RAM). The criteria for stopping the iterations is or or iterations exceed 1000 without attaining a solution. Parameters values for MKAS are set as , , , , , m = 0.01, r = 1.6, and u = 0.8. Parameter values for the other algorithms are set according to how they are used in the papers.

The following test examples with dimensions 1000, 5000, and 10000, where F is given as: .

Example 1 This is the gradient of the strictly convex test example in [70] where added to yield

, .

Example 2 This is a modified version of the test example in [71] with added to yield

,

, .

Example 3 [72] with

, .

Example 4 [73].

,

,

,

with .

Example 5 A modified version of the test example in [72] with

, .

Example 6 A modification of the test example 3 with

, .

Example 7 A modification of the test example 3 with added to yield

,

,

,.

Example 8 A modification of the test example 3 with added to yield

,

, .

Example 9 A modification of the test example 2 with added to yield

,

,.

The following initial guesses were used:

x1, x2, x3 , x4 ,

x5 , x6 . The numerical result of the experiments for the four algorithms is displayed in three tables, and is accessible from the link https://github.com/hungugida/hungugida/blob/main/Tables1-3MKAS.pdf, where “N” and “Pdim” denotes example of the operator F and dimension, “SP” and “Nit” represents for initial guess and number of iterations. Also, “Fval,” “Ptime(s)” and “Norm” stands for function evaluations, processing time and the residual at stopping point, while “***” denotes failure to obtain a solution.

In order describe the values shown in the above link, we use the evaluation too by Dolan and Moré [74]. Figs 13 were plotted for iterations, function evaluations, and CPU time performance metrics. From Fig 1, we see that MKAS solved about 95% of the test examples with less number of iterations. This values however, include 9% solved by the algorithm together with PBDFP, SMDFP, and SMBFGS. It also showed that SMDFP solved 15% of the test examples with less iterations. Here also, this value includes 12% solved with the other algorithms. Fig 2 showed that MKAS solved 93% of the test examples with less function evaluations. It also showed that PBDFP solved 8%, which includes 2% solved with SMDFP, and SMBFGS algorithms. For least CPU time metric, Fig 3 showed that MKAS perform better than PBDFP, SMDFP, and SMBFGS because it solved 89% while the latter recorded 8%, 1% and 2%. To further validate the results of the experiments, we present the average residual obtained for each of the four algorithms as computed from tables in the above link as follows: MKAS (1.21 × 10−11), PBDFP (4.41 × 10−11), SMDFP (1.83 × 10−10) and SMBFGS (1.75 × 10−10). Hence, the above analysis indicates that MKAS method is better suited for solving (1) than the other schemes.

thumbnail
Fig 1. Performance profile of the four methods for number of iterations.

https://doi.org/10.1371/journal.pone.0344697.g001

thumbnail
Fig 2. Performance profile of the four methods for function evaluations.

https://doi.org/10.1371/journal.pone.0344697.g002

thumbnail
Fig 3. Performance profile of the four methods for CPU time.

https://doi.org/10.1371/journal.pone.0344697.g003

5 Concluding remarks and future research

We have presented a new quasi-Newton-type method for solving constrained system of monotone nonlinear equations in this article. The method is an adaptive modification of the BFGS method for unconstrained optimization. The scheme possesses the vital property for analyzing global convergence as well as the trust region property irrespective of the line search procedure; what’s more, the global convergence was proven without the Lipschitz condition, thereby broadening its applicability to a wider class of problems. Another important finding was the impact of the restart strategy in improving convergence of the scheme, as evident in the best average residual it recorded. Furthermore, numerical experiments with the method and some of the available quasi-Newton type methods for solving constrained system of monotone nonlinear equations showed that it is effective. The focus of a future research will be to develop a variant of the proposed method, where the global convergence will be analyzed without the monotonicity and Lipschitz assumptions. We also hope to include real-life applications of the method in the study.

Acknowledgments

The authors thank Rabiu Bashir Yunus for his initial involvement and preliminary input during the early stages of the study.

References

  1. 1. Dirkse SP, Ferris MC. Mcplib: a collection of nonlinear mixed complementarity problems. Optimization Methods and Software. 1995;5(4):319–45.
  2. 2. Meintjes K, Morgan AP. A methodology for solving chemical equilibrium systems. Applied Mathematics and Computation. 1987;22(4):333–61.
  3. 3. Prajna S, Parrilo PA, Rantzer A. Nonlinear Control Synthesis by Convex Optimization. IEEE Trans Automat Contr. 2004;49(2):310–4.
  4. 4. Luo ZQ, Yui W. An introduction to convex optimization for communications and signal processing. IEEE J Select Areas Commun. 2006;24(8):1426–38.
  5. 5. Evgeniou T, Pontil M, Toubia O. A Convex Optimization Approach to Modeling Consumer Heterogeneity in Conjoint Estimation. Marketing Science. 2007;26(6):805–18.
  6. 6. Xiao Y, Zhu H. A conjugate gradient method to solve convex constrained monotone equations with applications in compressive sensing. Journal of Mathematical Analysis and Applications. 2013;405(1):310–9.
  7. 7. Halilu AS, Majumder A, Waziri MY, Awwal AM, Ahmed K. On solving double direction methods for convex constrained monotone nonlinear equations with image restoration. Comp Appl Math. 2021;40(7).
  8. 8. Ahmed K, Waziri MY, Halilu AS, Murtala S. On two symmetric Dai-Kou type schemes for constrained monotone equations with image recovery application. EURO Journal on Computational Optimization. 2023;11:100057.
  9. 9. Ahmed K, Waziri MY, Murtala S, Halilu AS, Sabi’u J. On a Scaled Symmetric Dai–Liao-Type Scheme for Constrained System of Nonlinear Equations with Applications. J Optim Theory Appl. 2023;200(2):669–702.
  10. 10. Ahmed K, Waziri MY, Halilu AS, Murtala S, Sabi’u J. Another Hager-Zhang-type method via singular-value study for constrained monotone equations with application. Numer Algor. 2023;96(4):1583–623.
  11. 11. Ahmed K, Waziri MY, Halilu AS, Murtala S, Abdullahi H. Signal and image reconstruction with a double parameter Hager–Zhang‐type conjugate gradient method for system of nonlinear equations. Numerical Linear Algebra App. 2024;32(1).
  12. 12. Ahmed K, Waziri MY, Sani Halilu A, Murtala S. Two RMIL-type schemes with compressed sensing applications. Optimization Methods and Software. 2024;40(3):455–86.
  13. 13. Kiri AI, Waziri MY, Ahmed K. A modified Liu-Storey scheme for nonlinear systems with an application to image recovery. Iranian Journal of Numerical Analysis and Optimization. 2023;13:1–21.
  14. 14. Ahmed K, Yusuf Waziri M, Sani Halilu A, Murtala S. Sparse signal reconstruction via Hager–Zhang-type schemes for constrained system of nonlinear equations. Optimization. 2023;73(6):1949–80.
  15. 15. Solodov MV, Svaiter BF. A Globally Convergent Inexact Newton Method for Systems of Monotone Equations. Applied Optimization. Springer US. 1998. 355–69. https://doi.org/10.1007/978-1-4757-6388-1_18
  16. 16. Broyden CG. A class of methods for solving nonlinear simultaneous equations. Math Comp. 1965;19(92):577–93.
  17. 17. Ortega JM, Rheinboldt WC. Iterative solution of nonlinear equations in several variables. New York: Academic Press. 1970.
  18. 18. Dennis JE, Schnabel RB. Numerical Methods for Unconstrained Optimization and Nonlinear Equations. Englewood Cliffs, NJ: Prentice-Hall. 1983.
  19. 19. Gao P, He C. An efficient three-term conjugate gradient method for nonlinear monotone equations with convex constraints. Calcolo. 2018;55(4).
  20. 20. Halilu AS, Majumder A, Waziri MY, Ahmed K, Awwal AM. Motion control of the two joint planar robotic manipulators through accelerated Dai–Liao method for solving system of nonlinear equations. EC. 2021;39(5):1802–40.
  21. 21. Halilu AS, Majumder A, Waziri MY, Ahmed K. Signal recovery with convex constrained nonlinear monotone equations through conjugate gradient hybrid approach. Mathematics and Computers in Simulation. 2021;187:520–39.
  22. 22. Koorapetse M, Kaelo P, Lekoko S, Diphofu T. A derivative-free RMIL conjugate gradient projection method for convex constrained nonlinear monotone equations with applications in compressive sensing. Applied Numerical Mathematics. 2021;165:431–41.
  23. 23. Liu JK, Li SJ. A projection method for convex constrained monotone nonlinear equations with applications. Computers & Mathematics with Applications. 2015;70(10):2442–53.
  24. 24. Liu J, Feng Y. A derivative-free iterative method for nonlinear monotone equations with convex constraints. Numer Algor. 2018;82(1):245–62.
  25. 25. Kanzow C, Yamashita N, Fukushima M. Levenberg–Marquardt methods with strong local convergence properties for solving nonlinear equations with convex constraints. Journal of Computational and Applied Mathematics. 2004;172(2):375–97.
  26. 26. Bellavia S, Macconi M, Morini B. STRSCNE: A Scaled Trust-Region Solver for Constrained Nonlinear Equations. Computational Optimization and Applications. 2004;28(1):31–50.
  27. 27. Liu J, Li S. Multivariate spectral DY-type projection method for convex constrained nonlinear monotone equations. Journal of Industrial & Management Optimization. 2017;13(1):283–95.
  28. 28. Yu G, Niu S, Ma J. Multivariate spectral gradient projection method for nonlinear monotone equations with convex constraints. Journal of Industrial & Management Optimization. 2013;9(1):117–29.
  29. 29. Davidon WC. Variable metric method for minimization. Argonne, IL: Argonne National Laboratory. 1959.
  30. 30. Fletcher R, Powell MJD. A Rapidly Convergent Descent Method for Minimization. The Computer Journal. 1963;6(2):163–8.
  31. 31. Sun W, Yuan YX. Optimization Theory and Methods: Nonlinear Programming. New York: Springer. 2006.
  32. 32. Broyden CG. The convergence of a class of double-rank minimization algorithms. IMA Journal of Applied Mathematics. 1970;6:76–90.
  33. 33. Fletcher R. A new approach to variable metric algorithms. The Computer Journal. 1970;13(3):317–22.
  34. 34. Goldfarb D. A family of variable-metric methods derived by variational means. Math Comp. 1970;24(109):23–6.
  35. 35. Shanno DF. Conditioning of quasi-Newton methods for function minimization. Math Comp. 1970;24(111):647–56.
  36. 36. Sun W, Yuan YX. Optimization Theory and Methods: Nonlinear Programming. New York: Springer. 2006.
  37. 37. Andrei N. Eigenvalues versus singular values study in conjugate gradient algorithms for large-scale unconstrained optimization. Optimization Methods and Software. 2016;32(3):534–51.
  38. 38. Powell MJD. Some global convergence properties of a variable metric algorithm for minimization without exact line search. In: Cottle RW, Lemke CE, editors. Nonlinear Programming. Philadelphia: SIAM. 1976. 53–72.
  39. 39. Byrd RH, Liu DC, Nocedal J. On the Behavior of Broyden’s Class of Quasi-Newton Methods. SIAM J Optim. 1992;2(4):533–57.
  40. 40. Gill PE, Leonard MW. Reduced-Hessian Quasi-Newton Methods for Unconstrained Optimization. SIAM J Optim. 2001;12(1):209–37.
  41. 41. Biggs MC. Minimization Algorithms Making Use of Non-quadratic Properties of the Objective Function. IMA J Appl Math. 1972;9(2):123–123.
  42. 42. Oren SS, Luenberger DG. Self-scaling variable metric algorithms, Part I: Criteria and sufficient conditions for scaling a class of algorithms. Management Science. 1974;20:845–62.
  43. 43. Andrei N. A double parameter scaled BFGS method for unconstrained optimization. Journal of Computational and Applied Mathematics. 2018;332:26–44.
  44. 44. Andrei N. A Double-Parameter Scaling Broyden–Fletcher–Goldfarb–Shanno Method Based on Minimizing the Measure Function of Byrd and Nocedal for Unconstrained Optimization. J Optim Theory Appl. 2018;178(1):191–218.
  45. 45. Byrd RH, Nocedal J. A Tool for the Analysis of Quasi-Newton Methods with Application to Unconstrained Minimization. SIAM J Numer Anal. 1989;26(3):727–39.
  46. 46. Babaie-Kafaki S. A hybrid scaling parameter for the scaled memoryless BFGS method based on the ℓ∞ matrix norm. International Journal of Computer Mathematics. 2018;96(8):1595–602.
  47. 47. Babaie–Kafaki S, Aminifard Z. Two–parameter scaled memoryless BFGS methods with a nonmonotone choice for the initial step length. Numer Algor. 2019;82(4):1345–57.
  48. 48. Aminifard Z, Babaie-Kafaki S. Analysis of the Maximum Magnification by the Scaled Memoryless DFP Updating Formula with Application to Compressive Sensing. Mediterr J Math. 2021;18(6).
  49. 49. Dargahi F, Babaie-Kafaki S, Aminifard Z. Eigenvalue Analyses on the Memoryless Davidon–Fletcher–Powell Method Based on a Spectral Secant Equation. J Optim Theory Appl. 2023;200(1):394–403.
  50. 50. Bakhshinejad A, Babaie–Kafaki S. An extended version of the memoryless DFP algorithm with the sufficient descent property. Boll Unione Mat Ital. 2024;17(4):747–58.
  51. 51. Mamat M, Dauda MK, Mohamed MA, Waziri MY, Mohamad FS, Abdullah H. Derivative free Davidon-Fletcher-Powell (DFP) for solving symmetric systems of nonlinear equations. IOP Conf Ser: Mater Sci Eng. 2018;332:012030.
  52. 52. Dauda MK. Modified Conjugate Gradient Method via DFP Update for Solving Systems of Nonlinear Equations. myjcam. 2020;3(1):30–6.
  53. 53. Awwal AM, Kumam P, Sitthithakerngkiet K, Bakoji AM, Halilu AS. Derivative-free method based on DFP updating formula for solving convex constrained nonlinear monotone equations and application. AIMS Mathematics. 2021;6:8792–814.
  54. 54. Rao J, Huang N. A derivative-free scaling memoryless DFP method for solving large scale nonlinear monotone equations. J Glob Optim. 2022;87(2–4):641–77.
  55. 55. Ullah N, Shah A, Sabi’u J, Jiao X, Awwal AM. A one-parameter memoryless DFP algorithm for solving system of monotone nonlinear equations with application in image processing. Mathematics. 2023;11:1–26.
  56. 56. ur Rehman M, Sabi’u J, Sohaib M, Shah A. A projection-based derivative free DFP approach for solving system of nonlinear convex constrained monotone equations with image restoration applications. J Appl Math Comput. 2023;69(5):3645–73.
  57. 57. Shah A, ur Rehman M, Sabi’u J, Sohaib M, Furati KM. A new scaled BFGS method for convex constraints monotone systems: Applications in motion control. Results in Applied Mathematics. 2025;25:100531.
  58. 58. Liu P, Li J, Shao H, Wu T. An accelerated derivative-free memoryless Davidon-Fletcher-Powell method and its iteration-complexity analysis. J Sci Comput. 2025;105(2).
  59. 59. Liu P, Li J, Shao H, Shao F, Liu M. An accelerated Dai–Yuan conjugate gradient projection method with the optimal choice. Engineering Optimization. 2025;57(12):3648–76.
  60. 60. Liu P, Wu X, Shao H, Zhang Y, Cao S. Three adaptive hybrid derivative-free projection methods for constrained monotone nonlinear equations and their applications. Numerical Linear Algebra with Applications. 2023;30:e2471.
  61. 61. Liu P, Li L, Shao H, Liu M, Fan J. An Inertial-type CG Projection Method with Restart for Pseudo-monotone Costs with Application to Traffic Assignment. Netw Spat Econ. 2024;25(1):147–72.
  62. 62. Rehman M ur, Mahmood S, Sabi’u J, Sohaib M, Shah A, Furati KM. An efficient scaled DFP algorithm based on singular value analysis for solving systems of monotone nonlinear equations. Franklin Open. 2025;13:100422.
  63. 63. Sabi’u J, Saksurakan C, Neamprem K, Sirisubtawee S. An Improved DFP Method for Solving Convex Constrained Monotone Equations with Application in Image Restoration. Int J Comput Methods. 2025;23(02).
  64. 64. Sabi’u J, Shah A. An efficient three-term conjugate gradient-type algorithm for monotone nonlinear equations. RAIRO-Oper Res. 2021;55:S1113–27.
  65. 65. Sabi’u J, Shah A, Stanimirović PS, Ivanov B, Waziri MY. Modified optimal Perry conjugate gradient method for solving system of monotone equations with applications. Applied Numerical Mathematics. 2023;184:431–45.
  66. 66. Nocedal J, Wright SJ. Numerical Optimization. New York: Springer. 2006.
  67. 67. Powell MJD. Restart procedures for the conjugate gradient method. Mathematical Programming. 1977;12(1):241–54.
  68. 68. Yin J, Jian J, Jiang X, Liu M, Wang L. A hybrid three-term conjugate gradient projection method for constrained nonlinear monotone equations with applications. Numer Algor. 2021;88(1):389–418.
  69. 69. Zarantonello EH. Projections on convex sets in Hilbert space and spectral theory. New York: Academic Press. 1971.
  70. 70. Raydan M. The Barzilai and Borwein Gradient Method for the Large Scale Unconstrained Minimization Problem. SIAM J Optim. 1997;7(1):26–33.
  71. 71. Cruz WL, Martinez JM, Raydan M. Spectral residual method without gradient information for solving large-scale nonlinear systems of equations: theory and experiments. 2004.
  72. 72. Cruz WL. A spectral algorithm for large-scale systems of nonlinear monotone equations. Numer Algor. 2017;76(4):1109–30.
  73. 73. La Cruz W, Martínez J, Raydan M. Spectral residual method without gradient information for solving large-scale nonlinear systems of equations. Math Comp. 2006;75(255):1429–48.
  74. 74. Dolan ED, Moré JJ. Benchmarking optimization software with performance profiles. Mathematical Programming. 2002;91(2):201–13.