Re: interassessor consistency data on TREC 06 Legal track ad hoc topics
- From: "Dave Lewis (address for public mailing lists)" <misclists1@daviddlewis.com>
- Date: Thu, 1 Feb 2007 12:39:27 -0600
>> assessors. The sample consisted of 25 documents judged relevant by
>> the first assessor (or all such documents if fewer than 25), and
>> enough nonrelevant to bring the sample to 50 documents (49 in one
>> case due to a glitch).
>
> I realize this is difficult when the sample is drawn this way, but
> have you tried measuring the runs using this data, and seeing if
> they rank differently?
Ian - No, but if anyone is interested in trying that, I'm happy to
make the data available. (Gordon, I just wrote you separately about
your offer - thanks!)

As you say, it would take some thought to come up with a sensible
measure based on this strangely drawn sample.

Dave