Search tips
Search criteria 


Logo of procascamcLink to Publisher's site
Proc Annu Symp Comput Appl Med Care. 1995 : 32–36.
PMCID: PMC2579050

Sampling strategies in a statistical approach to clinical classification.


This paper studies the sampling strategies for the Expert Network (EexNet), a statistical learning system used for patient record classification at the Mayo Clinic. The goal is to achieve high accuracy classification at an affordable computational cost in very large applications. The learning curves of ExpNet were observed with respect to the choice of training resources, the size, vocabulary coverage and category coverage of a training set, and the category distribution over training instances. A method combining advantages of different sampling strategies is proposed and evaluated using a large training corpus. As a result, Expert Network has achieved its nearly-optimal classification accuracy (measured by average precision) using a relatively small training set, with a fast real-time response which satisfies the needs of human-machine interaction.

Full text

Full text is available as a scanned copy of the original print version. Get a printable copy (PDF file) of the complete article (1.0M), or click on a page image below to browse page by page. Links to PubMed are also available for Selected References.

Selected References

These references are in PubMed. This may not be the complete list of references from this article.
  • Chute CG, Yang Y, Buntrock J. An evaluation of computer assisted clinical classification algorithms. Proc Annu Symp Comput Appl Med Care. 1994:162–166. [PMC free article] [PubMed]
  • Yang Y, Chute CG. An application of Expert Network to clinical classification and MEDLINE indexing. Proc Annu Symp Comput Appl Med Care. 1994:157–161. [PMC free article] [PubMed]

Articles from Proceedings of the Annual Symposium on Computer Application in Medical Care are provided here courtesy of American Medical Informatics Association