Jet (Java Extraction Toolkit)
About JET
JET, the Java Extraction Tool, provides a variety of components
for language analysis, such as sentence segmentation, name
tagging, time expression tagging and normalization, part-of-speech
tagging, partial parsing, and coreference analysis. These
components can be arranged in pipelines for different
applications, and can be used either for interactive analysis of
individual sentences, or 'batch' analysis of complete documents.
Simple tools are provided for annotating documents and displaying
annotated documents. A full set of procedures are also
provided for performing information extraction of entities,
relations, and events following the ACE [Automatic Content
Extraction] specifications.
JET is a work in progress, and continues being regularly expanded
and updated.
License
Jet Copyright ©1999-2012 Ralph Grishman
Licensed under the Apache License, Version 2.0 (the "License"); you
may not use this file except in compliance with the License. You may
obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0
. Unless required by applicable law or agreed to in writing,
software distributed under the License is distributed on an "AS IS"
BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express
or implied. See the License for the specific language governing
permissions and limitations under the License.
Downloading Jet
Release
|
Date
|
Jet distribution
tar
(select to download)
|
Notes
|
1.6.7
|
8 Nov 2012
|
jet-121108.tar.gz
|
|
1.7.2
|
23 Jan 2014
|
jet-140123.tar.gz
|
expand guide; add dependency parser;
add class files; add windows script
|
[An earlier version of Jet is available here.]
If you find Jet useful for your work or incorporate any part of
it into software you distribute, we earnestly request that you
notify Prof. Ralph Grishman.
Installing Jet
You need Java 1.5 or later in order to run Jet. It runs under
Linux, Apple Mac OS X, and Windows through terminal windows.
Download the latest Jet distribution tar file and extract all the
files with tar -xzvf.
The expanded directory will include
- jet-all.jar, the Jet 'executable' (including all libraries)
- bin, a directory with simple scripts for invoking Jet
in Linux and Mac OS
- win, a directory with simple scripts for invoking Jet
in Windows
- docs, a directory of user documentation files
- props, a directory of configuration files
- data, a directory of data files used by Jet
- acedata, a directory of additional data files used by Jet
for Ace information extraction
- example, a directory of files giving an example for
running Jet for Ace information extraction
- runAceExample, a script to run this example
- NOTICE, a copyright / license notice
In addition, the directory will contain the following files and
directories for those who wish to recompile or modify Jet
- src, a directory with the Jet source files
- test, a directory with source files of Jet unit tests
- parser-stub-src, stub sources for a statistical
parser
- lib, a directory containing other jar files required
by Jet
- build.xml, scripts for building Jet using ant
If you plan on using the Tratz dependency parser, you will also need
to download parseModel.gz and put it
in the jet/data directory.
To use Jet,
- for Linux or Mac OS, add the bin directory to your path,
for Windows, add win to your path
- set the environment variable JET_HOME to point to
the top directory into which the Jet files have been unpacked
The documentation for the current release (also included in the
download) is here.