Jet.HMM
Class BIOWriter
java.lang.Object
Jet.HMM.BIOWriter
- public class BIOWriter
- extends java.lang.Object
Converts an XML-annotated named-entity file (using the ENAMEX tag) into
a BIO-format file. In BIO format, each token is on a separate line. A
token that is not part of a name is tagged "O"; the first token of a
name of type X is tagged "B-X"; each subsequent token of that name is
tagged "I-X". Sentences are separated by a blank line.
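The tagging scheme can be illustrated with a minimal, self-contained sketch. This is not the Jet implementation: the class BioSketch and its method toBio are hypothetical names, the TYPE attribute on ENAMEX and the tab between token and tag are assumptions about the file layout, and the input is assumed to be whitespace-tokenized with whitespace around each ENAMEX element.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration of BIO tagging; not part of Jet.
final class BioSketch {
    // Matches either a whole ENAMEX element (group 1 = type, group 2 = name text)
    // or a single whitespace-delimited token outside any name (group 3).
    // The TYPE="..." attribute form is an assumption about the input.
    private static final Pattern PIECE = Pattern.compile(
        "<ENAMEX TYPE=\"([^\"]+)\">(.*?)</ENAMEX>|(\\S+)");

    // Returns one "token<TAB>tag" line per token of a single sentence.
    static List<String> toBio(String sentence) {
        List<String> lines = new ArrayList<>();
        Matcher m = PIECE.matcher(sentence);
        while (m.find()) {
            if (m.group(1) != null) {
                String type = m.group(1);
                String[] tokens = m.group(2).trim().split("\\s+");
                for (int i = 0; i < tokens.length; i++) {
                    // First token of the name gets B-, the rest get I-.
                    lines.add(tokens[i] + "\t" + (i == 0 ? "B-" : "I-") + type);
                }
            } else {
                // Token outside any name.
                lines.add(m.group(3) + "\tO");
            }
        }
        return lines;
    }

    public static void main(String[] args) {
        String s = "<ENAMEX TYPE=\"PERSON\">John Smith</ENAMEX> visited "
                 + "<ENAMEX TYPE=\"LOCATION\">Paris</ENAMEX> .";
        for (String line : toBio(s)) {
            System.out.println(line);
        }
        System.out.println(); // blank line marks the end of the sentence
    }
}
```

For the sample sentence, "John" is tagged B-PERSON, "Smith" I-PERSON, "Paris" B-LOCATION, and the remaining tokens O.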
Method Summary
static void convertCollection(java.lang.String xmlCollectionName, java.lang.String bioFileName)
Converts the collection of XML-coded files 'xmlCollectionName' and writes
the BIO-format data as a single file to 'bioFileName'.
static void main(java.lang.String[] args)
static void task(java.lang.String[] args)
Command-line callable file conversion (invoked by -Dtask=CorefEval).
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
BIOWriter
public BIOWriter()
main
public static void main(java.lang.String[] args)
throws java.io.IOException
- Throws:
java.io.IOException
task
public static void task(java.lang.String[] args)
- Command-line callable file conversion (invoked by -Dtask=CorefEval).
Takes an array of two file names: the collection of XML files and
the file that will contain the BIO-format data.
convertCollection
public static void convertCollection(java.lang.String xmlCollectionName,
java.lang.String bioFileName)
throws java.io.IOException
- Converts the collection of XML-coded files 'xmlCollectionName' and writes
the BIO-format data as a single file to 'bioFileName'.
- Throws:
java.io.IOException
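As a rough sketch of the single-file output described above, the tagged lines of several sentences can be concatenated with a blank line after each sentence. This is not Jet's code: BioFileSketch, withSentenceBreaks, and write are hypothetical names, and both the UTF-8 encoding and the blank line after the final sentence are assumptions.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration of the output layout; not part of Jet.
final class BioFileSketch {
    // Flattens per-sentence tagged lines into one list,
    // appending an empty line after every sentence.
    static List<String> withSentenceBreaks(List<List<String>> sentences) {
        List<String> out = new ArrayList<>();
        for (List<String> sentence : sentences) {
            out.addAll(sentence);
            out.add("");
        }
        return out;
    }

    // Writes all sentences to one file (UTF-8 is an assumption).
    static void write(List<List<String>> sentences, Path bioFile) throws IOException {
        Files.write(bioFile, withSentenceBreaks(sentences), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws IOException {
        List<List<String>> sentences = Arrays.asList(
            Arrays.asList("John\tB-PERSON", "slept\tO"),
            Arrays.asList("Paris\tB-LOCATION", "woke\tO"));
        write(sentences, Paths.get("sample.bio")); // placeholder file name
    }
}
```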