FAQ: Use of ml for nonlinear model

Home / Resources & support / FAQs / Use of ml for nonlinear model

How do I estimate a nonlinear model using ml?

Title		Use of ml for nonlinear model
Authors		Weihua Guan, Gustavo Sanchez, StataCorp

Consider the model

y = f(x) + e

where y is the outcome, f(x) is a nonlinear form of covariate x, and e is the random error. The command nl will estimate the parameters of f(x) by using least squares. Here is an example:

. sysuse auto
(1978 automobile data)

. nl (rep78 = {b0}*(1-exp(-{b1}*headroom))), initial(b0 1 b1 0.1) nolog


      Source        SS            df       MS  
       Number of obs =         69
       Model    800.67379          2  400.336893     R-squared     =     0.9235
    Residual    66.326214         67  .989943487     Adj R-squared =     0.9212
       Root MSE      =    .994959
       Total          867         69  12.5652174     Res. dev.     =   193.0865




       rep78   Coefficient  Std. err.      t    P>|t|     [95% conf. interval]
   
         /b0     3.435331   .1604479    21.41   0.000     3.115075    3.755586
         /b1     1.947203   1.767421     1.10   0.275    -1.580584    5.474989

We can write an nl program to fit the same model. We may want to do this if we need to be able to fit the model with different variables.

program nlnexpgr, rclass

	version 18
	syntax varlist(min=2 max=2) if
	local lhs : word 1 of `varlist'
	local rhs : word 2 of `varlist'

	return local eq "`lhs'={b0=1}*(1-exp(-{b1=0.1}*`rhs'))"

end

This program will give the same results as we obtained above:

. sysuse auto, clear
(1978 automobile data)

. nl nexpgr : rep78 headroom, nolog


      Source        SS            df       MS  
       Number of obs =         69
       Model    800.67379          2  400.336893     R-squared     =     0.9235
    Residual    66.326214         67  .989943487     Adj R-squared =     0.9212
       Root MSE      =    .994959
       Total          867         69  12.5652174     Res. dev.     =   193.0865




       rep78   Coefficient  Std. err.      t    P>|t|     [95% conf. interval]
   
         /b0     3.435331   .1604479    21.41   0.000     3.115075    3.755586
         /b1     1.947203   1.767421     1.10   0.275    -1.580584    5.474989

Now let’s consider how we would fit the same model with ml, which uses maximum likelihood. We can change the equation into

e = y − f(x)

If we assume that the errors are distributed as N(0,σ²), the density function is as follows:

f(e) = 1/(sqrt(2π) σ) exp (− e²/(2σ²))

Then, the log-likelihood function will be

lnL = -0.5*ln(2*_pi) - ln(σ) - 0.5*e²/σ²
    = -0.5*ln(2*_pi) - ln(σ) - 0.5*(y-f(x))²/σ²

In our example, we have

f(x) = b0*(1-exp(-b1*x))

Considering the formula, we may treat b0 as a parameter and b1*x as the linear combination xb, which has no constant term. We also need to estimate another parameter, “sigma”, the standard deviation of the normal distribution.

program mlnexpgr
	version 18
	args lnf b1x b0 sigma
	tempvar res
	quietly gen double `res' = $ML_y1 - `b0'*(1-exp(-`b1x'))
	quietly replace `lnf' = -0.5*ln(2*_pi)-ln(`sigma')-0.5*`res'^2/`sigma'^2
end

. ml model lf mlnexpgr (b1: rep78 = headroom, nocons) (b0:) (sigma:)
  
. ml max
(output omitted)

                                                        Number of obs =     69
                                                        Wald chi2(1)  =   2.85
Log likelihood = -96.543274                             Prob > chi2   = 0.0912



       rep78   Coefficient  Std. err.      z    P>|z|     [95% conf. interval]
   
b1                                                                            
    headroom     1.947203   1.152936     1.69   0.091    -.3125097    4.206915
   
b0                                                                            
       _cons     3.435331   .1375282    24.98   0.000      3.16578    3.704881

sigma                                                                         
       _cons     .9804333     .08346    11.75   0.000     .8168547    1.144012

The estimates we obtained from ml are close to those from the nl program. The estimated standard errors are a little different, but we know that they are from different algorithms.

We could extend the ml program to estimate robust variances with the vce(robust) option or with the vce(cluster clustvar) option.

. ml model lf mlnexpgr (b1:rep78=headroom,nocons) (b0:) (sigma:), vce(robust)

. ml max
 (output omitted)

                                                        Number of obs =     69
                                                        Wald chi2(1)  =   3.98
Log pseudolikelihood = -96.543274                       Prob > chi2   = 0.0462



                             Robust                                           
       rep78   Coefficient  std. err.      z    P>|z|     [95% conf. interval]
   
b1                                                                            
    headroom     1.947203   .9765424     1.99   0.046     .0332149    3.861191
   
b0                                                                            
       _cons     3.435331   .1145494    29.99   0.000     3.210818    3.659843

sigma                                                                         
       _cons     .9804333   .0743649    13.18   0.000     .8346808    1.126186

The robust estimation gives the same estimated coefficients, but it gives adjusted standard errors for the three parameters in this model. We may also apply this program for clustered data:

. ml model lf mlnexpgr (b1:rep78=headroom,nocons) (b0:) (sigma:), vce(cluster turn)

We use cookies

We use cookies to ensure that we give you the best experience on our website—to enhance site navigation, to analyze usage, and to assist in our marketing efforts. By continuing to use our site, you consent to the storing of cookies on your device and agree to delivery of content, including web fonts and JavaScript, from third party web services.

Cookie Settings

Last updated: 16 November 2022

StataCorp LLC (StataCorp) strives to provide our users with exceptional products and services. To do so, we must collect personal information from you. This information is necessary to conduct business with our existing and potential customers. We collect and use this information only where we may legally do so. This policy explains what personal information we collect, how we use it, and what rights you have to that information.

Advertising and performance cookies

This website uses cookies to provide you with a better user experience. A cookie is a small piece of data our website stores on a site visitor's hard drive and accesses each time you visit so we can improve your access to our site, better understand how you use our site, and serve you content that may be of interest to you. For instance, we store a cookie when you log in to our shopping cart so that we can maintain your shopping cart should you not complete checkout. These cookies do not directly store your personal information, but they do support the ability to uniquely identify your internet browser and device.

Please note: Clearing your browser cookies at any time will undo preferences saved here. The option selected here will apply only to the device you are currently using.

How do I estimate a nonlinear model using ml?

We use cookies

Privacy policy

Required cookies

Advertising and performance cookies

Source	SS df MS
		Number of obs = 69
Model	800.67379 2 400.336893	R-squared = 0.9235
Residual	66.326214 67 .989943487	Adj R-squared = 0.9212
		Root MSE = .994959
Total	867 69 12.5652174	Res. dev. = 193.0865


rep78		Coefficient Std. err. t P>\|t\| [95% conf. interval]

/b0		3.435331 .1604479 21.41 0.000 3.115075 3.755586
/b1		1.947203 1.767421 1.10 0.275 -1.580584 5.474989


rep78		Coefficient Std. err. z P>\|z\| [95% conf. interval]

b1
headroom		1.947203 1.152936 1.69 0.091 -.3125097 4.206915

b0
_cons		3.435331 .1375282 24.98 0.000 3.16578 3.704881

sigma
_cons		.9804333 .08346 11.75 0.000 .8168547 1.144012


		Robust
rep78		Coefficient std. err. z P>\|z\| [95% conf. interval]

b1
headroom		1.947203 .9765424 1.99 0.046 .0332149 3.861191

b0
_cons		3.435331 .1145494 29.99 0.000 3.210818 3.659843

sigma
_cons		.9804333 .0743649 13.18 0.000 .8346808 1.126186

Stata/MP4 Annual License (download)

How do I estimate a nonlinear model using ml?

We use cookies

Privacy policy

Required cookies

Advertising and performance cookies