1. Zhang J. Representations of health concepts: A cognitive perspective. Journal of Biomedical Informatics. 2002:3517–24. [PubMed] 2. Hearst M. Automatic acquisition of hyponyms from large text corpora. Proceedings of the 14th conference on Computational linguistics; 1992. pp. 539–545.
3. Friedman C, Kra P, Yu H, Krauthammer M, Rzhetsky A. GENIES: a natural-language processing system for the extraction of molecular pathways from journal articles. Bioinformatics (Oxford, England) 2001;17(Suppl 1):S74–82. [PubMed] 4. Rindflesch TC, Fiszman M. The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text. Journal of Biomedical Informatics. 2003;36:462–477. [PubMed] 5. Lund K, Burgess C. Producing high-dimensional semantic spaces from lexical co-occurrence. Behavior Research Methods, Instruments, & Computers. 1996;28:203–208.
6. Schutze H. Word space. Advances in Neural Information Processing Systems. 1993;5:895–902.
7. Landauer TK, Dumais ST. A solution to Plato’s problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge. Psychological Review. 1997;104:211–240.
8. Hofmann T. Probabilistic Latent Semantic Analysis. Proceedings of Uncertainty in Artificial Intelligence, UAI’99; 1999. pp. 289–296.
9. Blei DM, Ng AY, Jordan MI. Latent Dirichlet allocation. Journal of Machine Learning Research. 2003;3:993–1022.
10. Griffiths T, Steyvers M. A probabilistic approach to semantic representation. Proceedings of the 24th Annual Conference of the Cognitive Science Society; 2002. pp. 381–386.
11. Hersh W, Buckley C, Leone TJ, Hickam D. OHSUMED: an interactive retrieval evaluation and new large test collection for research. Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval; 1994. pp. 192–201.
12. Cederberg S, Widdows D, Peters S. Infomap NLP software: an open-source package for natural language processing. [December 2008]. Webpage. http://infomap-nlp.sourceforge.net/ 13. Schütze H. Automatic Word Sense Discrimination. Computational Linguistics. 1998;24:97–123.
14. Jones MN, Mewhort DJK. Representing word meaning and order information in a composite holographic lexicon. Psychological Review. 2007;114:1–37. [PubMed] 15. Pado S, Lapata M. Dependency-Based Construction of Semantic Space Models. Computational Linguistics. 2007;33:161–199.
16. Kanerva P, Kristofersson J, Holst A. Random indexing of text samples for latent semantic analysis. Proceedings of the 22nd Annual Conference of the Cognitive Science Society; 2000. pp. 10–36.
17. Dumais ST. Improving the retrieval of information from external sources. Behavior Research Methods, Instruments and Computers. 1991;23:229–236.
18. Gorman J, Curran JR. Random Indexing using Statistical Weight Functions. Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing (EMNLP); Sydney, Australia. 2006. pp. 457–464.
19. Strang G. Introduction to Linear Algebra. Wellesley Cambridge Pr: 2003.
20. Karlgren J, Sahlgren M. From words to understanding. Foundations of Real-World Intelligence. 2001:294–308.
21. Pereira F, Tishby N, Lee L. Distributional clustering of English words. Proceedings of the 31st conference on Association for Computational Linguistics; 1993. pp. 183–190.
23. Steyvers M, Griffiths T. Probabilistic Topic Models. In: Landauer T, McNamara D, Dennis S, Kintsch W, editors. Handbook of Latent Semantic Analysis. Mahwah, N.J: Lawrence Erlbaum Associates; 2007.
24. Shannon CE. Prediction and entropy of printed English. Bell System Technical Journal. 1951;30:50–64.
25. Birkhoff G, von Neumann J. ‘The Logic of Quantum Mechanics’ Annals of Mathematics. 1936;37:823–843.
26. Rijsbergen CJV. The Geometry of Information Retrieval. Cambridge University Press; 2004.
27. Salton G, Wong A, Yang CS. A vector space model for automatic indexing. Commun ACM. 1975;18:613–620.
28. Robertson S, Spark-Jones K. Relevance Weighting of Search Terms. J Am Soc Information Sciences. 1976;27:129–146.
29. Quillian MR. In: Semantic memory. Minsky M, editor. Semantic Information Processing; 1968. pp. 216–270.
30. Lesk M. Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone. Proceedings of the 5th annual international conference on Systems documentation; ACM New York, NY, USA. 1986. pp. 24–26.
31. McDonald JE, Plate TA, Schvaneveldt RW. Pathfinder associative networks: studies in knowledge organization. Ablex Publishing Corp; 1990. Using pathfinder to extract semantic information from text; pp. 149–164.
32. Yarowsky D. Proceedings of the 33rd annual meeting on Association for Computational Linguistics. Cambridge, Massachusetts: Association for Computational Linguistics; 1995. Unsupervised word sense disambiguation rivaling supervised methods; pp. 189–196.
33. Belkin NJ, Croft WB. Annual review of information science and technology. Vol. 22. Elsevier Science Inc; 1987. Retrieval techniques; pp. 109–145.
34. Widdows D. Geometry and Meaning. Center for the Study of Language and Information/SRI. 2004
35. Brin S, Page L. The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems. 1998:107–117.
36. Volk M, Ripplinger B, Vintar Š, Buitelaar P, Raileanu D, Sacaleanu B. Semantic annotation for concept-based cross-language medical information retrieval. International Journal of Medical Informatics. 2002;67:97–112. [PubMed] 37. Maedche A, Staab S. Ontology learning for the Semantic Web. Intelligent Systems, IEEE. 2001;16:72–79.
38. Charniak E. Statistical Language Learning. Bradford Books; 1993.
39. Rilo E, Jones R. Learning Dictionaries for Information Extraction by Multi-Level Bootstrapping. Proceedings of AAAI-99; 1999. p. 474.
40. Cederberg S, Widdows D. Using LSA and noun coordination information to improve the precision and recall of automatic hyponymy extraction. Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003; 2003. pp. 111–118.
41. Domingos P. Toward knowledge-rich data mining. Data Mining and Knowledge Discovery. 2007;15(1):21–28.
42. Deerwester S, Dumais ST, Furnas GW, Landauer TK, Harshman R. Indexing by latent semantic analysis. Journal of the American Society for Information Science. 1990;41:391–407.
43. Widdows D, Ferraro K. Semantic Vectors: A Scalable Open Source Package and Online Technology Management Application. To appear in Sixth International Conference on Language Resources and Evaluation (LREC 2008); 2008.
45. Rapp R. Ninth Machine Translation Summit. 2003. Word sense discovery based on sense descriptor dissimilarity; pp. 315–322.
46. Landauer TK. personal communication.
47. Pustejowsky YJ. The Generative Lexicon. Computational Linguistics. 1991;17(4):409–441.
48. Koehn P. MT Summit. 2005. Europarl: A parallel corpus for statistical machine translation.
49. Widdows D, Peters S, Cederberg S, Chan CK, Steffen D, Buitelaar P. Unsupervised monolingual and bilingual word-sense disambiguation of medical documents using UMLS. Natural Language Processing in Biomedicine ACL 2003 Workshop; 2003. pp. 9–16.
50. Landauer TK, Laham D, Rehder B, Schreiner ME. How Well Can Passage Meaning be Derived without Using Word Order. A Comparison of Latent Semantic Analysis and Humans. Proceedings of the Nineteenth Annual Conference of the Cognitive Science Society; August 7–10, 1997; Stanford University. 1997.
51. Landauer TK, Laham D, Foltz PW. The Intelligent Essay Assessor. IEEE Intelligent Systems. 2000;15:27–31.
52. Swayne DF, Lang DT, Buja A, Cook D. GGobi: evolving from XGobi into an extensible framework for interactive data visualization. Computational Statistics and Data Analysis. 2003;43:423–444.
53. Landauer TK, Laham D, Derr M. From paragraph to graph: Latent semantic analysis for information visualization. Proceedings of the National Academy of Sciences; 2004. Apr, pp. 5214–5219.
54. Burgess C, Lund K. The dynamics of meaning in memory. Cognitive dynamics: Conceptual and representational change in humans and machines. 2000:117–156.
55. Widdows D, Cederberg S. Monolingual and bilingual concept visualization from corpora. Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: Demonstrations; 2003. pp. 31–32.
56. Schvaneveldt RW. Pathfinder associative networks: studies in knowledge organization. Ablex Publishing Corp; Norwood, NJ, USA: 1990.
57. Cohen TA. Exploring MEDLINE Space with Random Indexing and Pathfinder Networks. AMIA Annu Symp Proc. 2008:126–30. [PMC free article] [PubMed] 58. Heer J, Card SK, Landay JA. prefuse: a toolkit for interactive information visualization. Conference on Human Factors in Computing Systems; 2005. pp. 421–430.
59. Curran JR. Supersense tagging of unknown nouns using semantic similarity. Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics; 2005. pp. 26–33.
60. Harris ZS. The structure of science information. Journal of Biomedical Informatics. 2002;35:215–221. [PubMed] 61. Homayouni R, Heinrich K, Wei L, Berry MW. Gene clustering by latent semantic indexing of MEDLINE abstracts. Bioinformatics (Oxford, England) 2005;21(1):104–115. [PubMed] 62. Zambrano N, Gianni D, Bruni P, Passaro F, Telese F, Russo T. Fe65 is not involved in the platelet-derived growth factor-induced processing of Alzheimer’s amyloid precursor protein, which activates its caspase-directed cleavage. The Journal of biological chemistry. 2004 Apr;279:16161–16169. [PubMed] 63. Glenisson P, Antal P, Mathys J, Moreau Y, De Moor B. Evaluation of the vector space representation in text-based gene clustering. Pac Symp Biocomput. 2003:391–402. [PubMed] 64. Klein-Seetharaman The Use of Analogies for Interdisciplinary Research in the Convergence of Nano-, Bio- and Information Technology. NSF Report on Societal Implications of Nanoscience and Nanotechnology. 2005:128–133.
65. Ganapathiraju M, Balakrishnan N, Reddy R, Klein-Seetharaman J. TMpro: Transmembrane Helix Prediction Using Amino Acid Properties and Latent Semantic Analysis. BMC Bioinformatics. 2007;8(10) [PMC free article] [PubMed] 66. Stuart GW, Berry MW. A comprehensive whole genome bacterial phylogeny using correlated peptide motifs defined in a high dimensional vector space. Journal of Bioinformatics and Computational Biology. 2003:1475–493. [PubMed] 67. Stuart GW, Berry MW. An SVD-based comparison of nine whole eukaryotic genomes supports a coelomate rather than ecdysozoan lineage. BMC Bioinformatics. 2004;5:204–217. [PMC free article] [PubMed] 68. Widdows D, Cohen T. Semantic Vector Combinations and the Synoptic Gospels. to appear. Proceedings of the Third Quantum Interaction Symposium; March 25–27, 2009; DFKI, Saarbrücken.
69. Gordon MD, Dumais S. Using latent semantic indexing for literature based discovery. Journal of the American Society for Information Science. 1998;49:674–685.
70. Cole RJ, Bruza PD. A Bare Bones Approach to Literature-Based Discovery: An Analysis of the Raynaud?s/Fish-Oil and Migraine-Magnesium Discoveries in Semantic Space. Discovery Science. 2005:84–98.
71. Bruza P, Cole R, Song D, Bari Z. Towards Operational Abduction from a Cognitive Perspective. Oxford Univ Press; 2006.
72. Bath PA, Hersh William. Information retrieval: a health and biomedical perspective. New York, NY: Springer; 2003.
73. Lin J, Wilbur WJ. PubMed related articles: A probabilistic topic-based model for content similarity. BMC Bioinformatics. 2007;8(1):423. [PMC free article] [PubMed] 74. Vanteru BC, Shaik JS, Yeasin M. Semantically linking and browsing PubMed abstracts with gene ontology. BMC Genomics. 2008;(9 Suppl):1S10. [PMC free article] [PubMed] 75. Yang Y, Chute CG. An example-based mapping method for text categorization and retrieval. ACM Transactions on Information Systems (TOIS) 1994;12(3):252–277.
76. Yang Y. An evaluation of statistical approaches to MEDLINE indexing. Proc AMIA Annu Fall Symp. 1996:358–362. [PMC free article] [PubMed] 77. Yang Y, Chute CG. An application of Expert Network to clinical classification and MEDLINE indexing. Proc Annu Symp Comput Appl Med Care. 1994:157–61. [PMC free article] [PubMed] 78. Cooper GF, Miller RA. An Experiment Comparing Lexical and Statistical Methods for Extracting MeSH Terms from Clinical Free Text. Am Med Inform Assoc. 1998 [PMC free article] [PubMed] 79. Aronson AR, Bodenreider O, Chang HF, Humphrey SM, Mork JG, Nelson SJ, et al. The NLM Indexing Initiative. Proc AMIA Symp. 2000:17–21. [PMC free article] [PubMed] 80. Aronson AR. Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program. Proc AMIA Symp. 2001:1717–21. [PMC free article] [PubMed] 81. Cohen A, Hersh W, Dubay C, Spackman K. Using co-occurrence network structure to extract synonymous gene and protein names from MEDLINE abstracts. BMC Bioinformatics. 2005;6:103. [PMC free article] [PubMed] 82. Friedman C. A broad-coverage natural language processing system. Proc AMIA Symp. 2000:19270–4. [PMC free article] [PubMed] 83. Sager N, Lyman M, Bucknall C, Nhan N, Tick LJ. Natural language processing and the representation of clinical data. Journal of the American Medical Informatics Association: JAMIA. 1:142–60. [PMC free article] [PubMed] 84. Harris ZS. Mathematical structures of language. Interscience Publishers; New York: 1968.
85. Chute CG, Yang Y, Evans DA. Latent Semantic Indexing of medical diagnoses using UMLS semantic structures. Proc Annu Symp Comput Appl Med Care. 1991:1859. [PMC free article] [PubMed] 86. Chute CG, Yang Y. An evaluation of concept based latent semantic indexing for clinical information retrieval. Proc Annu Symp Comput Appl Med Care. 1992;639:43. [PMC free article] [PubMed] 87. Yang Y, Chute CG. Proceedings of the 14th conference on Computational linguistics - Volume 2. Nantes, France: Association for Computational Linguistics; 1992. A Linear Least Squares Fit mapping method for information retrieval from natural language texts; pp. 447–453.
88. Pedersen T, Pakhomov SVS, Patwardhan S, Chute CG. Measures of semantic similarity and relatedness in the biomedical domain. Journal of Biomedical Informatics. 2007;40:288–299. [PubMed] 89. Rubenstein H, Goodenough JB. Contextual correlates of synonymy. Commun ACM. 1965;8:627–633.
90. Budanitsky A, Hirst G. Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures. Workshop on WordNet and Other Lexical Resources; 2001.
91. Fan JW, Friedman C. Semantic Classification of Biomedical Concepts Using Distributional Similarity. Journal of the American Medical Informatics Association. 2007;14:467–477. [PMC free article] [PubMed] 92. Grefenstette G. Corpus-derived first, second and third-order word affinities. Proceedings of Euralex; 1994. pp. 279–290.
93. Lin D. Automatic retrieval and clustering of similar words. Proceedings of the 17th international conference on Computational linguistics; 1998. pp. 768–774.
94. Curran JR, Moens M. Scaling context space. Proceedings of the 40th Annual Meeting on Association for Computational Linguistics; 2001. pp. 231–238.
95. Landauer TK, McNamara D, Dennis S, Kintsch W, editors. Lawrence Handbook of Latent Semantic Analysis. Mahwah, N.J: Lawrence Erlbaum Associates; 2007. Handbook of Latent Semantic Analysis.
96. Wiemer-Hastings P, Zipitria I. Rules for syntax, vectors for semantics. Proceedings of the 23rd Annual Conference of the Cognitive Science Society; 2001. pp. 1112–1117.
97. Kanejiya D, Kumar A, Prasad S. Automatic evaluation of students’ answers using syntactically enhanced LSA. Proceedings of the HLT-NAACL 03 workshop on Building educational applications using natural language processing. 2003;2:53–60.
98. Cohen T, Blatter B, Patel V. Exploring dangerous neighborhoods: latent semantic analysis and computing beyond the bounds of the familiar. AMIA ?. Annual Symposium proceedings/AMIA Symposium. AMIA Symposium; 2005. pp. 151–5.
99. Cohen T, Blatter B, Patel V. Simulating expert clinical comprehension: Adapting latent semantic analysis to accurately extract clinical concepts from psychiatric narrative. J of Biomedical Informatics. 2008;41(6):1070–1087. [PMC free article] [PubMed] 100. Sharda P, Das AK, Cohen TA, Patel V. Customizing clinical narratives for the electronic medical record interface using cognitive methods. International Journal of Medical Informatics. 2006;75:346–368. [PubMed] 101. Widdows D, Peters S. Mathematics of Language. Vol. 8. Bloomington, Indiana: 2003. Jun, Word Vectors and Quantum Logic Experiments with negation and disjunction.
102. Elvevaag B, Foltz PW, Weinberger DR, Goldberg TE. Quantifying incoherence in speech: An automated methodology and novel application to schizophrenia. Schizophrenia Research. 2007;93:304–316. [PMC free article] [PubMed] 103. Cline RJ, Haynes KM. Consumer health information seeking on the Internet: the state of the art. Health Educ Res. 2001 Dec;16(6):671–92. [PubMed] 104. Chen G, Warren J, Evans J. Automatically generated consumer health metadata using semantic spaces [Internet]; Proceedings of the second Australasian workshop on Health data and knowledge management; Wollongong, NSW. Australia: Australian Computer Society, Inc.; 2008. pp. 9–15.
105. McArthur R, Bruza P, Warren J, Kralik D. Projecting Computational Sense of Self: A Study of Transition in a Chronic Illness Online Community. System Sciences, 2006; HICSS ’06. Proceedings of the 39th Annual Hawaii International Conference on; 2006. p. 91c.
106. Berry MW, Mezher D, Philippe B, Sameh A. Parallel computation of the singular value decomposition. In: Kontoghiorghes E, editor. Handbook on Parallel Computing and Statistics. 2003. pp. 117–164.
107. Johnson W, Lindenstrauss J. Extension of lipshitz mapping to hilbert space. Contemporary Math. 1984;26:189–206.
108. Sahlgren M, Holst A, Kanerva P. Permutations as a Means to Encode Order in Word Space. Proceedings of the 30th Annual Meeting of the Cognitive Science Society (CogSci’08); July 23–26; Washington D.C., USA.
109. Sahlgren M. PhD Dissertation. Department of Linguistics, Stockholm University; 2006. The Word-Space model: Using Distributional Analysis to Represent Syntagmatic and Paradigmatic Relations Between Words in High-dimensional Vector Spaces.
110. Dennis S. Handbook of Latent Semantic Analysis. 2007. Introducing Word Order Within the LSA Framework.
111. Griffiths TL, Steyvers M, Blei D, Tenenbaum JB. Integrating topics and syntax. Advances in Neural Information Processing Systems. 2005;17:537–544.
112. Widdows D. Unsupervised methods for developing taxonomies by combining syntactic and statistical information. Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology. 2003;1:197–204.
113. Widdows D. Semantic Vector Products: Some Initial Investigations. Proceedings of the Second AAAI Symposium on Quantum Interaction; 2008.
114. Sahlgren M, Coster R. Using Bag-of-Concepts to Improve the Performance of Support Vector Machines in Text Categorization. Proceedings of the 20th International Conference on Computational Linguistics; COLING. 2004.
115. Berry M, Do T, O?Brien G, Krishna V, Varadhan S. University of Tennessee Computer Science Department Technical Report. 1993. SVDPACKC (Version 1.0) user?s guide; pp. CS-93–194.
116. Giles JT, Wo L, Berry MW. Statistical Data Mining and Knowledge Discovery. 2001. GTP (General Text Parser) Software for Text Mining.