# st: Analysis of scoring data???

 From K Jensen To statalist@hsphsun2.harvard.edu Subject st: Analysis of scoring data??? Date Wed, 16 Nov 2005 08:01:31 -0800

I have data involving assessment of the results of different medical imaging techniques, by different specialists.

For a hundred images, each has been assessed on a four point quality scale by six specialists. All the assessors have scored all the images and there are no missing values. The images fall into four types (there are different numbers of each type) and there are two types of clinician (three of each).

So the data looks like:-
Image_ID Type Clinician C_type Score
...

We are particularly interested in making inferences about the utility of the different types of image. One distinction is between the images scored at 1 or 2 (not useful in practice) v. 3 and 4 (useful).

So a summary could look like:-
image | Score
type | 0 or 1 2 or 3 |
-------+----------------------+
A | Na Ya |
-------+----------------------+
B | Nb Yb |
-------+----------------------+
C | Nc Yc |
-------+----------------------+
D | Nd Yd |
-------+----------------------+--------
Total | N Y | N+Y=600

Or we could consider using the average scores across clinicians for each image.

Types A and B use different variants of one imaging method, types C and D another.

We would like to test a priori hypotheses, such as "A and B are better than C and D" or "C is better than D" or "Clinicians of type one score more highly than those of type two".

I was tempted to do simple chi square tests based on the rows in the "tabulate" command, but have realised that that that would in a sense be overestimating the sample size by a factor of six, as we have 100 different images assessed by six clinicians, not 600 different images.

I thought about logistic regression (xi:logit command) on the "score 0 or 1" v. "1 or 2" outcome, but the results (either as beta coefficients or odds ratios) would be less easier to interpret than simple probabilities of falling into different categories.

I also thought about using the glm command and assuming binomial family data (for the dichotomous outcome).

As you will have guessed I am no statistician. How would a professional statistician like to see these data analyzed? I have come to realise as I write that this is a general question rather than a specifically Stata one, so I am sorry if this is an inappropriate query for this list.

Thankyou in advance and in hope,

Karin

----------------------------------------------
Mailblocks - A Better Way to Do Email