Stata The Stata listserver
[Date Prev][Date Next][Thread Prev][Thread Next][Date index][Thread index]

st: Re: programmers providing keywords


From   Kit Baum <[email protected]>
To   [email protected]
Subject   st: Re: programmers providing keywords
Date   Tue, 14 Dec 2004 08:00:04 -0500

Programmers already provide keywords (or I provide keywords for them) for every routine on the SSC Archive. Have a look at the foot of
http://ideas.repec.org/c/boc/bocode/s439101.html
or
http://ideas.repec.org/c/boc/bocode/s448101.html

These keywords are stored in the Keywords: field of the RePEc template describing those routines, and in IDEAS or EconPapers, you can search on them.

The problem is that the Stata package file format does not presently provide a way to hold these keywords (or to be more precise, you can place anything you want on the d lines in a package file, but no one has suggested that my adding a line

d KW: normal variates, quotient, confidence interval

would result in anything happening regarding findit.

A number of authors propose keywords when they submit materials to SSC. I'm not sure that the keyword information belongs in the .hlp file--since the .hlp file will certainly not be inhaled by findit--but if there is a way to ensure that a modification of the package file for each SSC package would allow keywords to enter the findit index, I could add the existing Keywords: to the package file. In the case cited, "output" was one of the keywords for mat2txt, but of course Stata does not know that.

On this subject, the RePEc template for each package contains the information on the version of Stata needed for the package (which may be viewed by ssc whatsnew). That, also, does not show up in the package file; one might imagine that it is very useful information for those not able to use the latest version of Stata.

Kit Baum, Boston College Economics
http://ideas.repec.org/e/pba1.html

On Dec 14, 2004, at 2:33 AM, statalist-digest wrote:


Would it be worthwhile for programmers to provide key words, perhaps in the
command's help file, to be detected by -search- or -findit-? The key words
could be flagged perhaps in a dedicated smcl format in order
for -search-/-findit- to easily parse, for example, {kw:date}
{kw:conversion}. Maybe it would speed the search if they were set-off in
their own paragraph in the help file, much as for the key words section of
manuscripts.

This was prompted yesterday when Svend Juul pointed out Nick Cox's -todate-
as a much more convenient alternative to a "first principles" approach that
I had suggested earlier. This is certainly true, provided that you know
about its existence. A few experimental tries with -findit- and -search-
(using terms such as "date" "conversion" "string" and so on, variously
singly or in combination) either pulled up hundreds of irrelevant hits or
missed it.

I recall that the problem of avoiding -search-/-findit- overlooking
user-written contributions has come up on the list before, but I don't
recall whether there has been any resolution other than for interested users
to notify StataCorp ad hoc of user-written contributions that seem to be
overlooked or difficult to catch. If that's as it stands, then perhaps a
more systematic approach would be better.

Obviously, the concept of set-off or smcl-identified key words is no
panacea. Its success will still depend upon both programmer and future user
formulating a problem in terms of the same key words, which doesn't always
happen for a variety of reasons. But it seems that a systematized approach
would go a long way toward helping a user quickly determine whether there's
a user-written contribution already available and tailored to the problem at
hand.

Joseph Coveney
*
*   For searches and help try:
*   http://www.stata.com/support/faqs/res/findit.html
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/



© Copyright 1996–2024 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   What's new   |   Site index