Tokenization Rules
<ENAMEX TYPE="LOCATION">U.K.</ENAMEX> industry
"Microtest Inc."
<ENAMEX TYPE="ORGANIZATION">Microtest Inc.</ENAMEX>
"Spokane, Wash."
<ENAMEX TYPE="LOCATION">Spokane</ENAMEX>, <ENAMEX TYPE="LOCATION">Wash.</ENAMEX>
"Limousines are manufactured in the U.K."
Limousines are manufactured in the <ENAMEX TYPE="LOCATION">U.K.</ENAMEX>
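In each pair above, the period of an abbreviation falls inside the tagged extent rather than after the closing tag. The following is a minimal sketch for checking this mechanically. It assumes flat (non-nested) elements with a single TYPE attribute, as in these examples; the helper names `extract_entities` and `strip_tags` are hypothetical and are not part of the task definition.

```python
import re

# Matches one flat (non-nested) ENAMEX, NUMEX, or TIMEX element.
# Assumption: the only attribute is TYPE, written exactly as in the examples.
TAG_RE = re.compile(r'<(ENAMEX|NUMEX|TIMEX) TYPE="([A-Z]+)">(.*?)</\1>', re.DOTALL)

def extract_entities(tagged):
    """Return (element, type, text) triples, one per tagged span."""
    return [(m.group(1), m.group(2), m.group(3)) for m in TAG_RE.finditer(tagged)]

def strip_tags(tagged):
    """Recover the untagged text by keeping only each element's content."""
    return TAG_RE.sub(lambda m: m.group(3), tagged)

tagged = 'Limousines are manufactured in the <ENAMEX TYPE="LOCATION">U.K.</ENAMEX>'
print(extract_entities(tagged))  # [('ENAMEX', 'LOCATION', 'U.K.')]
print(strip_tags(tagged))        # Limousines are manufactured in the U.K.
```

Comparing the output of `strip_tags` with the quoted source string is a quick way to confirm that tagging changed nothing but the markup.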
"Prudential-Bache Securities"
"Allen-Bradley Co. and Hewlett-Packard Co. have undertaken a joint marketing and development program linking A-B manufacturing automation equipment with HP Unix-based computers."
"one-hundred percent"
"10/13/89"
"'87"
Generated with CERN WebMaker
2.1.2 A period used used as a decimal marker is considered integral to the number token.
"$5.10"<NUMEX TYPE="MONEY">$5.10</NUMEX>
2.2 Examples with hyphen or dash (see also section 3, below)
"F. Gregory Fitz-Gerald"<ENAMEX TYPE="PERSON">F. Gregory Fitz-Gerald</ENAMEX>
<ENAMEX TYPE="ORGANIZATION">Prudential-Bache Securities</ENAMEX>
<ENAMEX TYPE="ORGANIZATION">Allen-Bradley Co.</ENAMEX> and <ENAMEX TYPE="ORGANIZATION">Hewlett-Packard Co.</ENAMEX> have... <ENAMEX TYPE="ORGANIZATION">A-B</ENAMEX>... <ENAMEX TYPE="ORGANIZATION">HP<ENAMEX>...
<NUMEX TYPE="PERCENT">one-hundred percent</NUMEX>
2.3 Examples with slash
"The venture will be called Quality Spring/Togo Inc."The venture will be called <ENAMEX TYPE="ORGANIZATION">Quality Spring/Togo Inc.</ENAMEX>
<NUMEX TYPE="DATE">10/13/89</NUMEX>
2.4 Examples with other punctuation
"McDonald's burger company"<ENAMEX TYPE="ORGANIZATION">McDonald's</ENAMEX> burger company
<TIMEX TYPE="DATE">'87</TIMEX>
2.5 Examples with special characters
"S&P 500 Index"
<ENAMEX TYPE="ORGANIZATION">S&P</ENAMEX> 500 Index
Tokenization Rules - 14 JUN 95