FAQ: Use of ml for nonlinear model

Home / Resources & support / FAQs / Use of ml for nonlinear model

How do I estimate a nonlinear model using ml?

Title		Use of ml for nonlinear model
Authors		Weihua Guan, Gustavo Sanchez, StataCorp

Consider the model

\(y = f(x) + e\)

where \(y\) is the outcome, \(f(x)\) is a nonlinear form of covariate \(x\), and \(e\) is the random error. The command nl will estimate the parameters of \(f(x)\) by using least squares. Here is an example:

. sysuse auto
(1978 automobile data)

. nl (rep78 = {b0}*(1-exp(-{b1}*headroom))), initial(b0 1 b1 0.2) nolog


      Source        SS            df       MS  
       Number of obs =         69
       Model    800.67378          2  400.336892     R-squared     =     0.9235
    Residual    66.326216         67  .989943524     Adj R-squared =     0.9212
       Root MSE      =   .9949591
       Total          867         69  12.5652174     Res. dev.     =   193.0866




       rep78   Coefficient  Std. err.      t    P>|t|     [95% conf. interval]
   
         /b0      3.43544   .1605208    21.40   0.000     3.115039    3.755841
         /b1     1.945364    1.76202     1.10   0.274    -1.571643    5.462371

We can write an nl program to fit the same model. We may want to do this if we need to be able to fit the model with different variables.

program nlnexpgr, rclass
	version 19 
        syntax varlist(min=2 max=2) if
        local lhs : word 1 of `varlist'
        local rhs : word 2 of `varlist'
        return local eq "`lhs'={b0=1}*(1-exp(-{b1=0.2}*`rhs'))"
end

This program will give the same results as we obtained above:

. sysuse auto, clear
(1978 automobile data)

. nl nexpgr : rep78 headroom, nolog


      Source        SS            df       MS  
       Number of obs =         69
       Model    800.67378          2  400.336892     R-squared     =     0.9235
    Residual    66.326216         67  .989943524     Adj R-squared =     0.9212
       Root MSE      =   .9949591
       Total          867         69  12.5652174     Res. dev.     =   193.0866




       rep78   Coefficient  Std. err.      t    P>|t|     [95% conf. interval]
   
         /b0      3.43544   .1605208    21.40   0.000     3.115039    3.755841
         /b1     1.945364    1.76202     1.10   0.274    -1.571643    5.462371

Now let’s consider how we would fit the same model with ml, which uses maximum likelihood. We can change the equation into

\(e = y - f(x)\)

If we assume that the errors are distributed as \(N(0,\sigma^2\)), the density function is as follows:

\(f(e) = 1/(\sqrt(2\pi)\sigma) exp (-e^2/(2\sigma^2))\)

Then, the log-likelihood function will be

\(lnL \,= -0.5*ln(2*_\pi) - ln(\sigma) - 0.5*e^2/\sigma^2\)
\(\qquad = -0.5*ln(2*_\pi) - ln(\sigma) - 0.5*(y-f(x))^2/\sigma^2\)

In our example, we have

\(f(x) = b0*(1-exp(-b1*x))\)

Considering the formula, we may treat \(b0\) as a parameter and \(b1*x\) as the linear combination \(xb\), which has no constant term. We also need to estimate another parameter, \(\sigma\), the standard deviation of the normal distribution.

program mlnexpgr
	version 19 
        args lnf b1x b0 sigma
        tempvar res
        quietly gen double `res' = $ML_y1 - `b0'*(1-exp(-`b1x'))
        quietly replace `lnf' = -0.5*ln(2*_pi)-ln(`sigma')-0.5*`res'^2/`sigma'^2
end

. ml model lf mlnexpgr (b1: rep78 = headroom, nocons) (b0:) (sigma:)
  
. ml max
(output omitted)

                                                        Number of obs =     69
                                                        Wald chi2(1)  =   2.85
Log likelihood = -96.543274                             Prob > chi2   = 0.0912



       rep78   Coefficient  Std. err.      z    P>|z|     [95% conf. interval]
   
b1                                                                            
    headroom     1.947199   1.152899     1.69   0.091    -.3124418    4.206841
   
b0                                                                            
       _cons     3.435331   .1375274    24.98   0.000     3.165782     3.70488

sigma                                                                         
       _cons     .9804333     .08346    11.75   0.000     .8168547    1.144012

The estimates we obtained from ml are close to those from the nl program. The estimated standard errors are a little different, but we know that they are from different algorithms.

We could extend the ml program to estimate robust variances with the vce(robust) option or with the vce(cluster clustvar) option.

. ml model lf mlnexpgr (b1:rep78=headroom,nocons) (b0:) (sigma:), vce(robust)

. ml max
 (output omitted)


                                                        Number of obs =     69
                                                        Wald chi2(1)  =   3.98
Log pseudolikelihood = -96.543274                       Prob > chi2   = 0.0461



                             Robust                                           
       rep78   Coefficient  std. err.      z    P>|z|     [95% conf. interval]
   
b1                                                                            
    headroom     1.947199   .9764862     1.99   0.046     .0333217    3.861077
   
b0                                                                            
       _cons     3.435331   .1145489    29.99   0.000     3.210819    3.659843

sigma                                                                         
       _cons     .9804333   .0743649    13.18   0.000     .8346808    1.126186

The robust estimation gives the same estimated coefficients, but it gives adjusted standard errors for the three parameters in this model. We may also apply this program for clustered data:

. ml model lf mlnexpgr (b1:rep78=headroom,nocons) (b0:) (sigma:), vce(cluster turn)

We use cookies

We use cookies to ensure that we give you the best experience on our website—to enhance site navigation, to analyze usage, and to assist in our marketing efforts. By continuing to use our site, you consent to the storing of cookies on your device and agree to delivery of content, including web fonts and JavaScript, from third party web services.

Cookie Settings

Last updated: 16 November 2022

StataCorp LLC (StataCorp) strives to provide our users with exceptional products and services. To do so, we must collect personal information from you. This information is necessary to conduct business with our existing and potential customers. We collect and use this information only where we may legally do so. This policy explains what personal information we collect, how we use it, and what rights you have to that information.

Advertising and performance cookies

This website uses cookies to provide you with a better user experience. A cookie is a small piece of data our website stores on a site visitor's hard drive and accesses each time you visit so we can improve your access to our site, better understand how you use our site, and serve you content that may be of interest to you. For instance, we store a cookie when you log in to our shopping cart so that we can maintain your shopping cart should you not complete checkout. These cookies do not directly store your personal information, but they do support the ability to uniquely identify your internet browser and device.

Please note: Clearing your browser cookies at any time will undo preferences saved here. The option selected here will apply only to the device you are currently using.

How do I estimate a nonlinear model using ml?

We use cookies

Privacy policy

Required cookies

Advertising and performance cookies

Source	SS df MS
		Number of obs = 69
Model	800.67378 2 400.336892	R-squared = 0.9235
Residual	66.326216 67 .989943524	Adj R-squared = 0.9212
		Root MSE = .9949591
Total	867 69 12.5652174	Res. dev. = 193.0866


rep78		Coefficient Std. err. t P>\|t\| [95% conf. interval]

/b0		3.43544 .1605208 21.40 0.000 3.115039 3.755841
/b1		1.945364 1.76202 1.10 0.274 -1.571643 5.462371


rep78		Coefficient Std. err. z P>\|z\| [95% conf. interval]

b1
headroom		1.947199 1.152899 1.69 0.091 -.3124418 4.206841

b0
_cons		3.435331 .1375274 24.98 0.000 3.165782 3.70488

sigma
_cons		.9804333 .08346 11.75 0.000 .8168547 1.144012


		Robust
rep78		Coefficient std. err. z P>\|z\| [95% conf. interval]

b1
headroom		1.947199 .9764862 1.99 0.046 .0333217 3.861077

b0
_cons		3.435331 .1145489 29.99 0.000 3.210819 3.659843

sigma
_cons		.9804333 .0743649 13.18 0.000 .8346808 1.126186

Stata/MP4 Annual License (download)

How do I estimate a nonlinear model using ml?

We use cookies

Privacy policy

Required cookies

Advertising and performance cookies