PMCCPMCCPMCC

Search tips
Search criteria 

Advanced

 
Logo of procascamcLink to Publisher's site
 
Proc Annu Symp Comput Appl Med Care. 1995: 32–36.
PMCID: PMC2579050
Sampling strategies in a statistical approach to clinical classification.
Y. Yang and C. G. Chute
Section of Medical Information Resources, Mayo Clinic/Foundation, Rochester, Minnesota 55905, USA.
Abstract
This paper studies the sampling strategies for the Expert Network (EexNet), a statistical learning system used for patient record classification at the Mayo Clinic. The goal is to achieve high accuracy classification at an affordable computational cost in very large applications. The learning curves of ExpNet were observed with respect to the choice of training resources, the size, vocabulary coverage and category coverage of a training set, and the category distribution over training instances. A method combining advantages of different sampling strategies is proposed and evaluated using a large training corpus. As a result, Expert Network has achieved its nearly-optimal classification accuracy (measured by average precision) using a relatively small training set, with a fast real-time response which satisfies the needs of human-machine interaction.
Full text
Full text is available as a scanned copy of the original print version. Get a printable copy (PDF file) of the complete article (1.0M), or click on a page image below to browse page by page.
Articles from Proceedings of the Annual Symposium on Computer Application in Medical Care are provided here courtesy of
American Medical Informatics Association