This course will consider how methods of natural language processing can be used to bridge this gap: to extract information from text, and to answer a user's questions about text and data base information. We will consider several levels of text analysis, including syntactic analysis (grammars and parsing), semantic analysis (word and sentence meaning), and discourse analysis (pronoun resolution and text structure).
During the course you will use and extend a suite of text processing tools, JET, coded in Java, building up all the basic components for an information extraction system. There will be small weekly assignments (some written, some programming), a term project, and a final exam.
Students should have
Textbook: Jurafsky and Martin, Speech and Language Processing (Prentice Hall)
For further information, you can consult the course pages from 2001, when the course was last given. You may also want to look at the pages of our natural language research group, the Proteus Project, and in particular its work on information extraction.
For further information, contact Prof.