CSCI-GA.2590 - Natural Language Processing -- Spring 2013 -- Prof. Grishman
February 12, 2013
You may want to use the Jet parser for the first three exercises,
and particularly for #2. Keep in mind in doing these exercises that many
words have several parts of speech.
- (1 point) Using the tiny grammar below, draw the
two parse trees for the sentence “The fair features live music.”
(If you do this with Jet, note that this grammar is slightly different
from that provided as grammar1.)
Suggest a linguistically-reasonable constraint which would
resolve this ambiguity.
sentence := np v | np v np;
np := n | art n | adj n | art adj n;
- (2 points) Using the same tiny grammar, compare
the efficiency of the top-down backtracking parser, a bottom-up
(immediate constituent) parser, and a top-down chart parser on the sentence “The
fair answers questions.” For the two grammar symbols, sentence and np:
  - compare the backtracking and chart parsers
    with respect to the number of times each symbol is expanded (indicated
    by the message "Seeking ..." on both parsers). For each
    parser, report the count separately for sentence and np.
  - compare all three parsers with respect
    to the number of times a complete constituent is
    generated. (This is indicated by the "Found" message on the top-down
    parser, the "Adding" message on the bottom-up parser, and the "Adding"
    message for a complete (inactive)
    edge for the chart parser.) Again, report the count
    separately for sentence and np.
- (1 point) Modify this grammar to capture subject-verb number
agreement. Does "The fair answers questions." now produce one parse?
- (1 point) J&M exercise 5.1. (practice tagging some text)
  - the Penn tag set is given on p. 131 and on the front cover of
    J&M
  - in the Penn set, the word "to" is always tagged "TO", whether
    it is an infinitival marker or a preposition
  - modal verbs (MD) are those which do not take an "-s" suffix
    in the third person singular present tense; "do" is not a modal
  - the VB (base or infinitive of the verb) and VBP (present
    tense plural of the verb) forms are the same for all verbs except the
    verb "be" (the VB form is "be"; the VBP forms are "are" and
    "am"), so you must distinguish by context:
    - The main clause of a sentence must be tensed, so if a verb
      appears by itself in a main clause it must be the tensed form.
      For example, in "They bake cookies.", "bake" is a VBP. You can
      verify this by changing the subject to a singular and seeing that the
      verb changes: "He bakes cookies." (here "bakes" is a VBZ).
    - On the other hand, a verb following a modal, a form
      of "do", or "to" is an infinitive. In "They want to bake cookies."
      or "They can bake cookies.", "bake" is a VB. You can verify this
      by changing the subject to a singular and seeing that the form of
      "bake" does not change.
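
The VB/VBP rule of thumb in the notes above can be sketched in a few lines of Python. This is only an illustration under simplifying assumptions, not part of Jet or of the Penn tagging guidelines: the function name tag_base_form and the word lists MODALS and DO_FORMS are hypothetical, the modal list is partial, and the check looks only at the word immediately before the verb (so an intervening adverb, as in "can not bake", would defeat it). It also assumes you already know that the word in question is a base-form verb.

```python
# Hypothetical helper illustrating the VB vs. VBP context rule.
# Assumption: tokens[i] is already known to be a base-form verb.

# Partial list of modal verbs (tagged MD); "do" is deliberately not included.
MODALS = {"can", "could", "may", "might", "must",
          "shall", "should", "will", "would"}

# Forms of "do"; a verb directly after one of these is an infinitive.
DO_FORMS = {"do", "does", "did"}


def tag_base_form(tokens, i):
    """Return 'VB' or 'VBP' for the base-form verb at position i.

    A verb directly after a modal, a form of "do", or "to" is treated
    as an infinitive (VB); otherwise it is treated as tensed (VBP).
    """
    if i > 0:
        previous = tokens[i - 1].lower()
        if previous in MODALS or previous in DO_FORMS or previous == "to":
            return "VB"
    return "VBP"


if __name__ == "__main__":
    examples = [
        ("They bake cookies .", 1),
        ("They want to bake cookies .", 3),
        ("They can bake cookies .", 2),
    ]
    for sentence, index in examples:
        words = sentence.split()
        print(words[index], tag_base_form(words, index), "in:", sentence)
```

A heuristic like this is no substitute for the substitution test described in the notes (change the subject to a singular and see whether the verb changes), which remains the more reliable check when tagging by hand.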
Due February 19th.
Assignments may be submitted in hard copy at the beginning of
class, or by email to both
email@example.com and firstname.lastname@example.org
with the subject line NLP - Assignment #2. Email
assignments are due by midnight.