BIBLIOGRAPHY

Natural Language Processing


The Linguistic String Project (LSP) at New York University began in 1965 with funding from the National Science Foundation to develop computer methods for structuring and accessing information in the scientific and technical literature. Document processing was to be based on linguistic principles, first to demonstrate the possibility of computerized grammatical analysis (parsing), then to extend to specialized vocabulary and rules for particular scientific domains. Domain specialization led to an elaboration of the methods of sublanguage analysis, in particular as applied to the language of clinical reporting in patient documents. The 30+ year history of the Linguistic String Project and its results are tracked in the following bibliography.

PUBLICATIONS

Major Summary
Other Significant Publication
M Application to Medical Texts

  1. Sager, N. (1967). Syntactic Analysis of Natural Language. Advances in Computers 8, pp. 153-188. Academic Press, NY.
    1. Linguistic basis for language computation (string analysis). 2. Procedure.
    3. Implementations in IPLV & FAP (IBM 7094), organization of grammars as BNF definitions and "restrictions" (= constraints on parse trees), treatment of conjunctions.

  2. Sager, N. (1966). Information Reduction of Texts by Syntactic Analysis. Seminar on Computational Linguistics, (A.W. Pratt, A.H. Roberts, K. Lewis, eds.), pp. 46-56. Division of Computer Science and Technology, National Institutes of Health, Bethesda, MD. (PHS Publication No. 1716.)
    First notion of tabular reduction of text using results of computerized language processing.

  3. Salkoff, M., and Sager, N. (1967). The Elimination of Grammatical Restrictions in a String Grammar of English. 2ème Conference Internationale sur le Traitement Automatique des Langues, Grenoble, August 1967.
    Theoretical paper: relation of string analysis to context-free grammar.

  4. Sager, N. (1970). The Sublanguage Method in String Grammars. Studies in Language and Linguistics, (R.W. Ewton, Jr. and J. Ornstein, eds.), pp. 89-98. University of Texas at El Paso.
    Introduces particular set of tree-navigation routines to implement restrictions.

  5. Sager, N. (1972). A Two-Stage BNF Specification of Natural Language. Journal of Cybernetics 2:3, pp. 39-50.
    Introduces "Restriction Language", a syntax for calling on the tree-navigation routines and tests at parse tree nodes in restrictions.

  6. Sager, N. (1972). Syntactic Formatting of Science Information. AFIPS Conference Proceedings 41, pp. 791-800. AFIPS Press, Montvale, NJ. Reprinted in Sublanguage: Studies of Language in Restricted Semantic Domains (R. Kittredge and J. Lehrberger, eds.), Walter de Gruyter, Berlin (1982), pp. 9-26.
    Introduces "Sublanguage Grammar" as basis for tabular reduction of text.

  7. Sager, N. (1973). The String Parser for Scientific Literature. In Natural Language Processing (R. Rustin, ed.), Algorithmics Press, NY, pp. 61-87.
    Summarizes approach, form of grammar; refers to 3rd implementation of parser (in Fortran) for CDC6600.

  8. Grishman, R. (1973). Implementation of the String Parser of English. In Natural Language Processing (R. Rustin. ed.), Algorithmics Press, NY, pp. 89-109.
    Describes Fortran implementation as compiler-compiler; describes "saving" to speed parsing.

  9. Grishman, R., Sager, N., Raze (Friedman), C., and Bookchin, B. (1973). The Linguistic String Parser. AFIPS Conference Proceedings 42, AFIPS Press, Montvale, NJ, pp. 427-434.
    Overall description of the system with some examples.

  10. Fitzpatrick, E., and Sager, N. (1974). The Lexical Subclasses of the Linguistic String Parser. American Journal of Computational Linguistics, No. 2. (See also S.P.R., item 9 under REPORTS.)
    Describes lexical component, later = Appendix 3 cf #34.

  11. Sager, N. (1975). Sublanguage Grammars in Science Information Processing. Journal of the American Society for Information Science 26, pp. 10-16. Click Wiley Online Library: JASIS.
    Theoretical: How sublanguage grammar provides tool for analyzing scientific literature.

  12. Sager, N., and Grishman, R. (1975). The Restriction Language for Computer Grammars of Natural Language. Communications of the ACM 18, pp. 390-400. Click ACM Digital Library, New York University.
    Describes the Restriction Language (RL) and illustrates how restrictions written in RL operate on parse trees.

  13. Hirschman, L., Grishman, R., and Sager, N. (1975). Grammatically-Based Automatic Word Class Formation. Information Processing and Management ll, pp. 39-57.
    A computational linguistics experiment: Develop semantic word classes by clustering, using parse-trees.

  14. Anderson, B., Bross, I.D.J., and Sager, N. (1975). Grammatical Compression in Notes and Records: Analysis and Computation. American Journal of Computational Linguistics 2, No. 4:68-81.
    First study of clinical narrative in relation to known grammar rules: what is left unsaid.

  15. Sager, N. (1975). Computerized Discovery of Semantic Word Classes in Scientific Fields. In Directions in Artificial Intelligence: Natural Language Processing, Courant Computer Science Report No. 7, Courant Institute of Mathematical Sciences, New York University, pp. 27-48.
    More on clustering, using published scientific articles for data [see #13].

  16. Hobbs, J., and Grishman, R. (1976). The Automatic Transformational Analysis of English Sentences: An Implementation. International Journal of Computer Mathematics, Section A, Vol. 5, pp. 267-283.
    Beginning of transformational component of grammar; description of mechanisms.

  17. Raze (Friedman), C. (1976). A Computational Treatment of Coordinate Conjunctions. American Journal of Computational Linguistics, No. 52.
    Describes treatment of conjunctions in the implemented string grammar.

  18. Hirschman, L., Grishman, R., and Sager, N. (1976). From Text to Structured Information: Automatic Processing of Medical Reports. AFIPS Conference Proceedings 45, AFIPS Press, Montvale, NJ, pp 267-275.
    M First application of implemented "information formatting" - x-ray diagnosis statements.

  19. Sager, N. (1977). Perspective Paper: Computational Linguistics. In Natural Language in Information Science; Perspectives and Directives for Research (D.E. Walker, H. Karlgren and M. Kay, eds.), Skriptor, Stockholm, FID 551, pp. 75-100.
    Overall view of approach - invited paper.

  20. Hirschman, L., and Grishman, R. (1977). Fact Retrieval from Natural Language Medical Records. IFIP World Conference Series on Medical Informatics 2 (D.B. Shires and H. Wolf, eds.), North- Holland, Amsterdam, pp. 247-251.
    M "Question-answering" applied to information-formatted x-ray reports. Results.

  21. Sager, N. (1977). Information Structures in the Language of Science. In The Many Faces of Information Science, AAAS Selected Symposium 3 (E. C. Weiss, ed.), Westview Press, Boulder, Colorado, pp. 53-73.
    Potential applications of Natural Language Processing to science-literature retrieval.

  22. Sager, N., Hirschman, L., Grishman, R., and Insolio, C. (1977). Transforming Medical Records Into a Structured Data Base. In D. Waltz, Natural Language Interfaces, ACM-SIGART Newsletter, No. 61 (Feb. 1977), pp. 38-39.
    One-page update: We have done x-ray reports, on to discharge summaries.

  23. Insolio, C., and Sager, N. (1977), Parsing Free Narrative. Unpublished paper. Presented at Annual Meeting of Association for Computational Linguistics, Georgetown University, Washington, DC, March 17, 1977.
    Some examples of unresolved ambiguity.

  24. Sager, N., Hirschman, L., Grishman, R., and Insolio, C. (1977). Computer Programs for Natural Language Files. In Information Management in the 1980's, Proceedings of the ASIS Annual Meeting 14. Knowledge Industry Publications, White Plains, NY.
    Example of information formatting: radiology reports. Another version of #20.

  25. Sager, N. (1978). Natural Language Analysis and Processing. In the Encyclopedia of Computer Science and Technology (J. Belzer, A.G. Holzman and A. Kent, eds.), Marcel Dekker, NY, Vol. 11. pp. 152-169. (1983 edition available in paperback.) Reprinted in Subject and Information Analysis (E.D. Dym, ed.), Marcel Dekker, NY, 1985.
    Overall view of natural language processing (NLP) field.

  26. Sager, N., and Lyman, M. (1978). Computerized Language Processing: Implications for Health Care Evaluation. Medical Record News 49:3, pp. 20-30.
    M Illustrates by manual example the application of information-formatting in medical audit.

  27. Grishman, R., and Hirschman, L. (1978). Question Answering from Natural Language Medical Data Bases. Artificial Intelligence 11, pp. 25-43.
    A natural language "front end" to query information-formatted x-ray reports.

  28. Sager, N. (1978). Natural Language Information Formatting: The Automatic Conversion of Texts to a Structured Data Base. In Advances in Computers 17 (M.C. Yovits, ed.), Academic Press, NY, pp. 89-162.
    1. Theoretical basis. 2. Methods. 3. Programs. 4. Applications - the full system for converting free-text information to formatted information. Precursor to #34.

  29. Sager, N., Hirschman, L., and Lyman, M. (1978). Computerized Language Processing for Multiple Use of Narrative Discharge Summaries. In Proceedings of the Second Annual Symposium on Computer Applications in Medical Care (F.H. Orthner, ed.), IEEE, New York, pp. 330-343. See also NIH November 9, 1978.
    M Presents information-formatting procedures (overview) with example of retrieval from small set of information-formatted discharge summaries.

  30. Hirschman, L., Sager, N., and Lyman, M. (1979). Automatic Application of Health Care Criteria to Narrative Patient Records. Proceedings of the Third Annual Symposium on Computer Applications in Medical Care (R.A. Dunn, ed.), IEEE, New York, pp. 105-113. See also NIH October 17, 1979.
    M Example of retrieving quality-assurance data from information-formatted discharge summaries.

  31. Sager, N. (1980). Review of Computational Linguistics in Medicine (W. Schneider and A-L. Sagvall Hein, eds.), North-Holland, NY, 1977. In American Journal of Computational Linguistics 6:1 (Jan.-Mar. 1980), pp. 44-47.
    Book review.

  32. Hirschman, L. (1981). Retrieving Time Information from Natural Language Texts. Information Retrieval Research (R.N. Oddy, S.E. Robertson, C.J. Van Rijsbergen and P. Williams, eds.), Butterworths, London, pp. 154-171.
    M Computational treatment of time information in information-formatted documents.

  33. Hirschman, L., and Sager, N. (1982). Automatic Information Formatting of a Medical Sublanguage. Sublanguage: Studies of Language in Restricted Semantic Domains (R. Kittredge and J. Lehrberger, eds.). Walter de Gruyter, Berlin, pp. 27-80.
    M Information formatting applied to clinical documents: methods, processing, results.

  34. Sager, N. (1981). Natural Language Information Processing: A Computer Grammar of English and Its Applications. Addison-Wesley, Reading, Mass.
    BOOK. Documentation (partial listing) of English grammar; summary of information- formatting, examples, use in teaching.
    M Chapter 8: Information Formatting, and Chapter 9: Application of Medical Information Formatting.

  35. Sager, N., Tick, L., Story, G., and Hirschman, L. (1980). A CODASYL-type Schema for Natural Language Medical Records. Proceedings of the Fourth Annual Symposium on Computer Applications in Medical Care 2 (J.T. O'Neill, ed.), IEEE, New York, pp. 1027-1033. NIH November 5, 1980.
    M First use of DBMS (CODASYL type schema) to store and retrieve from information-formatted clinical texts.

  36. Hirschman, L., Story, G., Marsh, E., Lyman, M., and Sager, N. (1981). An Experiment in Automated Health Care Evaluation from Narrative Medical Records. Computers and Biomedical Research 14:5 , pp. 447-463.
    M Full report of study in #30: Quality assurance data extracted from information-formatted discharge summaries; results compared with manual extraction.

  37. Sager, N., Bross, I.D.J., Story, G., Bastedo, P., Marsh, E. and Shedd, D. (1982). Automatic Encoding of Clinical Narrative. Computers in Biology and Medicine 12:1 , pp. 43-56.
    M First attempt at automatic coding using information-formatted radiology reports.

  38. Story, G., and Hirschman, L. (1981). Database Design for Natural Language Medical Data. Proceedings of the 14th Annual Hawaii International Conference on System Sciences III. Reprinted in Journal of Medical Systems 6:1 (1982), pp. 77-88.
    M First use of relational DBMS to store and retrieve from information-formatted clinical text.

  39. Hirschman, L., and Story, G. (1981). Representing Implicit and Explicit Time Relations in Narrative. Proceedings of the 7th International Joint Conference on Artificial Intelligence (IJCAI 81) 1, pp. 289-295.
    M Treatment of temporal relations in clinical narrative by processing time expressions in the text.

  40. Sager, N. (1981). Information Structures in Texts of a Sublanguage. ASIS-81 Conference Proceedings.
    M Summary report of information formatting and database representation of clinical text.

  41. Sager, N., and Grishman, R. (1982). Research in Computational Linguistics at New York University. ACM-SIGART Newsletter. Special Issue on Natural Language Processing.
    Newsletter update on project activities.

  42. Marsh, E., and Sager, N. (1982). Analysis and Processing of Compact Texts. COLING 82: Proceedings of the Ninth International Conference on Computational Linguistics (J. Horecky, ed.), North-Holland, pp. 201-206, Amsterdam.
    Syntactic features special to technical sublanguage reporting.

  43. Grishman, R., Hirschman, L., and Friedman, C. (1982). Natural Language Interfaces Using Limited Semantic Information. COLING 82: Proceedings of the Ninth International Conference on Computational Linguistics (J. Horecky, ed.), North-Holland, Amsterdam, pp. 89-94.
    Research on natural language "front end" for information-formatted text.

  44. Sager, N., Chi, E.C., Tick, L.J., and Lyman, M. (1982). Relational Database Design for Computer-Analyzed Medical Narrative. Proceedings of the Sixth Annual Symposium on Computer Applications in Medical Care (B.I. Blum, ed.), New York, pp. 797-804.
    M More on first relational database design for information-formatted clinical text as in #38.

  45. Marsh, E. (1983). Utilizing Domain-Specific Information for Processing Compact Text. Proceedings of the Conference on Applied Natural Language Processing, Association for Computational Linguistics, Menlo Park, CA, pp. 99-103.
    Compares syntactic features of clinical text with those in Navy messages. Extends #42.

  46. Grishman, R., Hirschman, L., and Friedman, C. (1983). Isolating Domain Dependencies in Natural Language Interfaces. Proceedings of the Conference on Applied Natural Language Processing, Association for Computational Linguistics, Menlo Park, CA, pp. 46-53.
    More on natural language "front end" to information-formatted clinical text. Extends #43.

  47. Chi, E.C., Sager, N., Tick, L.J., and Lyman, M. (1983). Relational Database Modeling of Free-Text Medical Narrative. Medical Informatics 8:3 , pp. 209-223. (Special Issue: New Methods for the Analysis of Clinical Data.) Taylor & Francis Ltd, London.
    M More on the first relational database design for information-formatted clinical text, as in #38, #44.

  48. Lyman, M., Chi, E.C., Sager, N., Tick, L.J., and Story, G.A. (1983). Automated Case Review of Acute Bacterial Meningitis of Childhood. In Proceedings of MEDINFO 83, Amsterdam, August 1983.
    Another demonstration of information-formatting followed by temporal analysis and retrieval program.

  49. Sager, N., and Kosaka, M. (1983). A Database of Literature Organized by Relations. Proceedings of the 7th Annual Symposium on Computer Applications in Medical Care (R. Dayhoff, ed.), IEEE Computer Society, Silver Spring, MD, pp. 692-695.
    An experiment in linguistic reduction of scientific literature to tabular form.

  50. Friedman, C., Sager, N., Chi, E.C., Marsh, E., Christenson, C., and Lyman, M.S., MD (1983). Computer Structuring of Free-Text Patient Data. Proceedings of the 7th Annual Symposium on Computer Applications in Medical Care (R. Dayhoff, ed.), IEEE Computer Society, Silver Spring, MD, pp. 688-691.
    M A progress report on current work on processing clinical text.

  51. Sager, N., and Wong, R., MD (1983). Developing a Database from Free-Text Clinical Data. Journal of Clinical Computing XI: 5 & 6, pp. 184-194.
    M Information formatting of pathology reports: design of an information format.

  52. Sager, N., Tick, L.J., Story, G., Friedman, C., and Christenson, C. (1983). Data Model for Natural Language Information. ACM-SIGART Newsletter, No. 86, Special Issue on A.I. and Database Research, p. 51.
    Another 1 page update on project activities.

  53. White, C. (1983). The Linguistic String Project Dictionary for Automatic Text Analysis. Proceedings of the Workshop on Machine Readable Dictionaries, SRI International, Menlo Park, CA.
    Describes lexical component of system and an experiment in supplementing the dictionary with entries from a computer readable form of Webster's Collegiate Dictionary.

  54. Gordon, D., and Sager, N. (1985). A Method of Measuring Information in Language, Applied to Medical Texts. Information Processing & Management 21:4 , pp. 269-289.
    M An experiment in grading the complexity of texts by a measure based on transformed parse trees of sentences.

  55. Wolff, S. (1984). The Use of Morphosemantic Regularities in the Medical Vocabulary for Automatic Lexical Coding. Methods of Information in Medicine, 23, pp. 195-203.
    M Reports on a program to determine semantic components of the neo-Latin portion of medical vocabulary.

  56. Sager, N. (1984). Sublanguage: Linguistic Phenomenon, Computational Tool. In Analyzing Language in Restricted Domains (R. Grishman and R. Kittredge, eds.), Lawrence Erlbaum, 1986, pp. 1-17.
    Overview paper focusing on sublanguage methodology.

  57. Friedman, C. (1984). Sublanguage Text Processing - Application to Medical Narrative. In Analyzing Language in Restricted Domains (R. Grishman and R. Kittredge, eds.), Lawrence Erlbaum, 1986, pp. 85-102.
    M Overview paper on information-formatting procedures.

  58. Chi, E.C., Friedman, C., Sager, N., and Lyman, M.S., M.D. (1985). Processing Free-Text Input to Obtain a Database of Medical Information. Proceedings of the 8th Annual ACM-SIGIR Conference on Research and Development in Information Retrieval, (June 1985), New York, Association for Computing Machinery.
    M The first relational database design reported to an Information Retrieval audience. Follows #38, #44, #45.

  59. Marsh, E., and Friedman, C. (1985). Transporting the Linguistic String Project System from a Medical to a Navy Domain. ACM Transactions on Office Information Systems 3:2 , pp. 121-140.
    Information-formatting of a non-medical sublanguage. Extends #42, #45.

  60. Sager, N., Chi, E.C., Friedman, C., and Lyman, M.S., MD (1985). Modeling Natural Language Data for Automatic Creation of a Database from Free-Text Input. Database Engineering 8:3, pp. 45-55.
    M Another report on the relational database design for information-formatted text, follows #38, #44, #47, #58.

  61. Sager, N. (1985). Natural Language Processing in Biomedical Laboratory Computing. Frontiers of Engineering and Computing in Health Care--1985 (J.C. Lin and B.N. Feinberg, eds.), vol. 1, 575-8. (Proceedings of the 7th Annual Conference of the IEEE/Engineering in Medicine and Biology Society, Sept. 1985). McGregor & Werner, Washington, DC, vol. 1, pp. 575-578.
    M Brief report of system, suggesting application to narrative laboratory reports.

  62. Chi, E.C., Lyman, M.S., Sager, N., Friedman, C., and Macleod, C. (1985). A Database of Computer-Structured Narrative: Methods of Computing Complex Relations. Proceedings of the 9th Annual Symposium on Computer Applications in Medical Care (M.J. Ackerman, ed.), IEEE Computer Society Press, Washington, DC, pp. 221-226.
    M Another project update for the medical informatics community.

  63. Lyman, M.S., Sager, N., Friedman, C., and Chi, E.C. (1985). Computer-Structured Narrative in Ambulatory Care: Its Use in Longitudinal Review of Clinical Data. Proceedings of the 9th Annual Symposium on Computer Applications in Medical Care (M.J. Ackerman, ed.), IEEE Computer Society Press, Washington, DC, pp. 82-86.
    M Medical applications of system; companion report to #62.

  64. London, J., Wolff, S., and Sager, N. (1985). The LSP Lexicon for Free-Text Information Formatting: Application to Medical Narrative. Workshop on the Lexicon, Parsing, and Semantic Interpretation, Graduate Center, City University of New York, Jan. 1985.
    M Summary paper on lexical component of system for linguistics audience.

  65. Sager, N., Friedman, C., Lyman, M.S., MD, and members of the Linguistic String Project (1987). Medical Language Processing: Computer Management of Narrative Data. Addison-Wesley, Reading, MA.
    M BOOK. Summarizes how system described in #34 (Sager book) is specialized for medical document processing.

  66. Sager, N. (1986). Representing Biomedical Information via Its Linguistic Structure. Proceedings of the AAMSI Congress 86, (A.H. Levy, MD, and B.T. Williams, MD, eds.). American Association for Medical Systems and Informatics, Washington, DC, vol.4, pp. 278-281.
    Another summary of system for a different audience.

  67. Sager, N., Friedman, C., Lyman, M.S., MD, Chi, E.C., Macleod, C., Chen, S., and Johnson, S. (1986). The Analysis and Processing of Clinical Narrative. MEDINFO 86; Proceedings of the Fifth Conference on Medical Informatics (R. Salamon, B. Blum, and M. Jorgensen, eds.). Elsevier Science Publishers B.V. (North Holland), Participants Ed., Part 2, pp. 1101-1105.
    M System described briefly to international medical informatics community.

  68. Sager, N., Mattick, P., Jr., Friedman, C.,and Chi, E.C. (1989). Information Structures in Survey Instruments, Proceedings of the Section on Survey Research Methods, American Statistical Association.
    Experiment in applying information-formatting to questions and answers in survey.

  69. Lyman, M.S. and Sager, N.S. (1989) The New York University Experience in the Computer Processing of Medical Language. In Computerized Natural Medical Language Processing, Elsevier Science Publishers, North Holland.
    M Presents system to international audience at IMIA sponsored symposium.

  70. Sager, N. (1989) Medical Language Processing: Computer Management of Narrative Data. In Computerized Natural Medical Language Processing, Elsevier Science Publishers, North Holland.
    M Describes sublanguage methodology and applicability of information-formatting to French clinical documents at IMIA symposium.

  71. Sager, N., Lyman, M., Tick, L.J., Borst, F., Nhàn, N.T., Reveillard, C., Su,Y., and Scherrer, J-R. (1989) Adapting a Medical Language Processor from English to French. MEDINFO89: Proceedings of the Sixth Conference on Medical Informatics. Elsevier Science Publishers B.F., North Holland.
    M The English medical information formatting system converted to French.

  72. Borst, F., Sager, N., Lyman, M., Nhàn, N.T., Tick, L.J., Reveillard, C., Su, Y., and Scherrer, J-R. (1989) Cost Containment and Quality of Care Assessment: By-Product of a Fully Integrated HIS Handling Free Text Analysis of Discharge Summaries. Poster presentation MEDINFO89.
    M Utility of medical language processing illustrated.

  73. Sager, N. (l989) Book Review: Garside, R., Leech, G., and Sampson, G. (Eds.) The Computational Analysis of English: A Corpus-Based Approach. Longman, London and New York (1987). Information Processing and Management, 25:No. 3, pp. 334-335.
    Book review.

  74. Lyman, M., Sager, N., Chi, E.C., Tick, L.J., Nhàn, N.T., Su, Y., Borst, F., and Scherrer, J-R. (1989). Medical Language Processing for Knowledge Representation and Retrievals. Proceedings of the 13th Annual Symposium on Computer Applications in Medical Care. (L.C. Kingsland, ed.). IEEE Computer Society Press, Washington, D.C., pp. 548-553.
    M Initial results of French medical language processing, based on English medical information formatting, with Geneva collaborators.

  75. Lyman, M. (1989) Applications of Medical Language Processing: The New York University Linguistic String Project. Presentation of paper. Abstract: Proceedings of the 52nd ASIS Annual Meeting. (J. Katzer, G.B. Newby, eds.). Published for the American Society for Information Science by Learned Information, Inc., Medford, NJ., p. 252.
    Abstract.

  76. Borst, F., Sager, N., Nhàn, N.T., Su, Y., Lyman, M., Tick, L.J., Revillard, C., Chi, E., et Scherrer, J.R. (l989) Analyse Automatique de Comptes Rendus D'Hospitalisation. In Informatique et Santé, Informatique et Gestion des Unités de Soins, Comptes Rendus du Colloque AIM-IF, Paris, 1989, Degoulet, P., Stephan, J-C., Venot, A., et Yvon, P-J, Redacteurs. Paris, Springer-Verlag, pp. 246-256.

    M French medical information-formatting, querying, via relational dBMS (in French).

  77. Nhàn, N.T., Sager, N., Lyman, M., Tick, L.J., Borst, F. and Su, Y. (1989) A Medical Language Processor for Two Indo-European Languages. Proceedings of the 13th Annual Symposium on Computer Application in Medical Care (L.C. Kingsland, ed.). IEEE Computer Society Press, Washington, D.C., pp. 554-558.
    M Relation of English and French information-formatting systems.

  78. Sager, N. (1990) Computer Analysis of Sublanguage Information Structures, Annals of NY Academy of Sciences. New York, The New York Academy of Sciences, vol 683, pp. 161-179.
    M Overview of system for general scientific audience.

  79. Lyman, M., Sager, N., Nhàn, N.T., Tick, L.J., Borst, F. and Scherrer, J.R. (1991) The Application of Natural-Language Processing to Healthcare Quality Assessment, Medical Decision Making 11, #4 Suppl., pp. S65-S68.
    M Application of system in medical audit using asthma discharge summaries.

  80. Borst, F., Lyman, M.S., Nhàn,N.T., Tick, L.J., Sager, N., Scherrer, J.-R., (1991) TEXTINFO: A Tool For Automatic Determination of Patient Clinical Profiles Using Text Analysis, Proceedings of the 15th Annual Symposium on Computer Applications of Medical Care, McGraw Hill, NY, pp. 63-67.
    M Further work with the French system, with Geneva collaborators.

  81. Sager, N., Lyman, M.S., Tick, L.J., Nhàn, N.T., Borst, F., Scherrer, J-R., Clinical Knowledge Bases From Natural Language Patient Documents, MEDINFO92: Proceedings of the Seventh Conference on Medical Informatics, Elsevier Science Publishers B.F., North Holland, pp. 1375-1381.
    M Further work on French clinical documents, with Geneva collaborators.

  82. Sager, N., Lyman, M.S., Tick, L.J., Nhàn, N.T., Bucknall, C.E. (1993) Natural Language Processing of Asthma Discharge Summaries for the Monitoring of Patient Care, Proceedings of the 17th Annual Symposium on Computer Applications of Medical Care, McGraw Hill, NY, pp. 265-268.
    M Results of the medical audit experiment using asthma discharge summaries.

  83. Sager, N., Lyman, M.S., Bucknall, C., Nhàn, N.T., Tick, L.J.,(1994) Natural Language Processing and the Representation of Clinical Data, Journal of the American Medical Informatics Association,, vol. 1, no.2, pp.142-160.
    M Overall description of system and results of using it for medical audit.

  84. Sager, N., Lyman, M.S., Nhàn, N.T., Tick, L.J.,(1994) Automatic Encoding into SNOMED III: A Preliminary Investigation, Proceedings of the 18th Annual Symposium on Computer Applications in Medical Care, JAMIA Symposium Supplement pp. 230-234.
    M First report on tying output of medical language processing system to an established medical nomenclature (SNOMED Int).

  85. Sager, N., Lyman, M.S., Nhàn, N.T., Tick, L.J., (1995) Medical Language Processing: Applications to Patient Data and Automatic Encoding, Methods of Information in Medicine, vol. 34, no. 1/2, pp. 140-146.
    M Journal report on automatic encoding of free text into SNOMED Int via medical language processing system, comparison with manual results by Computer-based Patient Records Institute (CPRI).

  86. Sager, N., Nhàn, N.T., Lyman, M.S., Tick, L.J., (1995) Computer Analysis of Clinical Narrative: Why, How, What, When, Proceedings of BIRA95, Gent, Belgium, pp. 22-53.
    M Using SGML to display results of medical language processing system.

  87. Sager, N., Nhàn, N.T., Lyman, M.S., Tick, L.J., (1996) Medical Language Processing with SGML Display. Proceedings of the 1996 AMIA Annual Fall Symposium, Hanley & Belfus, pp. 547-551.
    M Report on using SGML for multiple displays of results of medical language processing system.

  88. Spyns, P., Nhàn, N.T., Baert, E., Sager, N., De Moor, G. (1996) Combining Medical Language Processing and Mark-Up Technology: an experiment applied to Dutch, in C. Sevens & G. De Moor, eds, Proceedings of MIC96, Brussels, pp. 69-77.
    M Dutch parser combined with other components of LSP medical language processing system.

  89. Spyns, P., Nhàn, N.T., Baert, E., Sager, N., De Moor, G. (1997) Dutch Sublanguage Semantic Tagging combined with Mark-Up Technology, Proceedings of Fifth Conference on Applied Natural Language Technology , Morgan Kaufman, San Francisco, pp. 182-189. See also ACM Digital Library.
    M system using SGML display.

  90. Spyns, P., Nhàn, N.T., Baert, E., Sager, N., De Moor, G. (1998) Medical Language Processing applied to extract clinical information from Dutch medical documents, in Cesnik C., Safran C., & Degoulet P. (eds.), Proceedings of the 9th World Congress on Medical Informatics (MEDINFO98). Amsterdam. Pp. 685-689.
    M Dutch medical language processing using in part the LSP MLP, and information extraction with the use of SGML display.

  91. Sager, N. Nhàn, N.T. (2002) The computability of strings, transformations, and sublanguage, in The Legacy of Zellig Harris, eds. by Bruce E. Nevin and Stephen M. Johnson. John Benjamins Publishing Co., Amsterdam/Philadelphia. Volume 2, Chapter 4, pp. 79-120.
    M How the LSP system utilizes the linguistics of Zellig Harris.


STRING PROGRAM REPORTS [SPR]

SPR 1
Sager, N., Salkoff, M., Morris, J., and Raze (Friedman), C. (1966). Report on the String Analysis Programs, Introductory Volume. String Program Reports (S.P.R.) No. 1. Linguistic String Project, New York University and University of Pennsylvania.

SPR 2
Raze (Friedman), C. (1967). The FAP Program for String Decomposition of Scientific Texts. S.P.R. No. 2. NYU Linguistic String Project.

SPR 3
Bookchin, B. (1968). Computer Outputs for Sentence Decomposition of Scientific Texts. S.P.R. No. 3. NYU Linguistic String Project.

SPR 4
Sager, N. (1968). A Computer String Grammar of English. S.P.R. No. 4. NYU Linguistic String Project.

SPR 5
Salkoff, M., and Sager, N. (1969). Grammatical Restrictions on the IPLV and FAP String Programs. S.P.R. No. 5. NYU Linguistic String Project.

SPR 6
Sager, N., Touger, J., Harris, Z.H., Hamann, J., and Bookchin, B. (1970). An Application of Syntactic Analysis to Information Retrieval. S.P.R. No. 6. NYU Linguistic String Project.

SPR 7
Anderson, B. (1970). Transformationally-Based English Strings and Their Word Subclasses. S.P.R. No. 7. NYU Linguistic String Project.

SPR 8
Sager, N., Claris, P., and Clifford, J. (November 1970). French String Grammar. S.P.R. No. 8. NYU Linguistic String Project. Final Report, Contract No. OEC-0-9-097725-4412(014). Office of Education, Institute of International Studies, U.S. Department of Health, Education, and Welfare.

SPR 9
Fitzpatrick, E., and Sager, N. (1974). The Lexical Subclasses of the Linguistic String Parser, American Journal of Computational Linguistics, Microfiche 2. S.P.R. No. 9. NYU Linguistic String Project. (Hardcopy of Publication 10., p. 2.)

SPR 10
Sager, N. (1976). Evaluation of Automated Natural Language Processing in the Further Development of Science Information Retrieval. S.P.R. No. 10. NYU Linguistic String Project. (Final Report on Research Grant GN39879 to the National Science Foundation, Division of Science Information.)

SPR 11
Raze (Friedman), C. (1976). The Parsing and Transformational Expansion of Coordinate Conjunction Strings. S.P.R. No. 11. NYU Linguistic String Project.

SPR 12
Sager, N., and Hirschman, L. (1978). Information Structures in the Language of Science: Theory and Implementation. S.P.R. No. 12. NYU Linguistic String Project. (Final Report on Research Grant SIS75-22945 to the National Science Foundation, Division of Science Information.)

SPR 13
Sager, N., Hirschman, L., White, C., Foster, C., Wolff, S., Grad, R. (1980). Research into Methods for Automatic Classification and Fact Retrieval in Science Subfields. S.P.R. No. 13. NYU Linguistic String Project. (Final Report on Research Grant DSI 77-24530 to the National Science Foundation, Division of Information Science and Technology.)

SPR 14
Sager, N., Tick, L.J., Story, G.A., Chi, E.C., Shneiderman, B. (1982). Database Management Systems for Natural Language Databases. S.P.R. No. 14. NYU Linguistic String Project. (Final report on Research Grant IST 78-13939 to the National Science Foundation, Division of Information Science and Technology.)

SPR 15
Sager, N., Gordon, D., Marsh, E., Hirschman, L., Friedman, C., Kosaka, M., Christenson, C., and White, C. (1982). Computer-Based Investigation Into the Structure of Information. S.P.R. No. 15. NYU Linguistic String Project. (Final Report on Research Grant IST 79-20788 to the National Science Foundation, Division of Information Science and Technology.)

SPR 16
Sager, N., Johnson, S., Macleod, C., Kosaka, M., Friedman, C., and Chi, E. (1984). Computable Models of Time and Quantity in Natural Language Data. S.P.R. No. 16. NYU Linguistic String Project. (Final Report on Research Grant IST 81-15669 to the National Science Foundation, Division of Information Science and Technology.)


Contact Naomi Sager