Tokenization Rules
delphia"
<ENAMEX TYPE="LOCATION">Phila-
delphia</ENAMEX>
based"
3.2 If, however, the word is naturally hyphenated and the hyphenated word just happens to be broken at the hyphen at the end of a line, the parts of the word are treated as separate tokens.
"Chicago-<ENAMEX TYPE="LOCATION">Chicago</ENAMEX>-
based
Tokenization Rules - 14 JUN 95
[Next] [Previous] [Top] [Back to MUC-6 main page]
Generated with CERN WebMaker