Jet.HMM
Class BIOWriter

java.lang.Object
  extended byJet.HMM.BIOWriter

public class BIOWriter
extends java.lang.Object

converts an XML-annotated named entity file (with the tag ENAMEX) into a BIO-format file. In BIO format, each token is on a separate line. A token which is not part of a name is tagged "O"; a token which is the first token of a name of type X is tagged "B-X"; a token which is a subsequent token of a name of type X is tagged "I-X". Sentences are separated by a blank line.


Constructor Summary
BIOWriter()
           
 
Method Summary
static void convertCollection(java.lang.String xmlCollectionName, java.lang.String bioFileName)
          converts the collection of XML-coded files 'xmlCollectionName' and writes the BIO format data as a single file on 'bioFileName'.
static void main(java.lang.String[] args)
           
static void task(java.lang.String[] args)
          command-line callable file conversion (invoked by -Dtask=CorefEval).
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

BIOWriter

public BIOWriter()
Method Detail

main

public static void main(java.lang.String[] args)
                 throws java.io.IOException
Throws:
java.io.IOException

task

public static void task(java.lang.String[] args)
command-line callable file conversion (invoked by -Dtask=CorefEval). Passed an array of two file names: the collection of XML files and the file to contain the BIO-format data.


convertCollection

public static void convertCollection(java.lang.String xmlCollectionName,
                                     java.lang.String bioFileName)
                              throws java.io.IOException
converts the collection of XML-coded files 'xmlCollectionName' and writes the BIO format data as a single file on 'bioFileName'.

Throws:
java.io.IOException