Visualizations
Overview
ANNIS uses pluggable JAVA modules to visualize data. A configurable resolver module determines how each annotation layer is visualized, so that data from different annotation sets can be rendered with either different or the same visualizations (e.g. two separate syntax trees displayed one after the other for the same sentence).
To see some examples of data visualized in ANNIS, take a look at our gallery here. The table below provides a list of the currently available visualizers and summarizes some of their features:
- hit view/doc view - Visualizers are set to either display a hit view (e.g. ±n tokens around the search result) or a doc view to visualize the entire document the results come from. Many visualizers can do both, and in some cases two variants are available, in which case the document visualizer has the suffix 'doc'.
- RTL - Some visualizers also support right-to-left languages such as Arabic or Hebrew, which is shown in the RTL column.
- multi-seg - Some visualizers can use an alternate segmentation layer or tokenization to represent basic word forms instead of the default tokenization. These visualizations are suitable for corpora that contain dialogue data, subtokenization or conflicting tokenizations of the same data.
- parallel - Visualizers specifically equipped to deal with parallel corpora may display text from multiple aligned sources within the same module. For other visualizers, parallel corpora can trigger multiple instances of the same visualizer (e.g. syntax trees for each aligned language), but there will be no single visualization incorporating data from multiple sources at the same time.
- interaction - Visualizers that offer cross-module interactivity can be triggered by other visualizers, such as A/V players that can 'jump' to the correct aligned time position when the corresponding annotation is clicked on in the grid or kwic view.
visualizer | description | hit view | doc view | RTL | multi-seg | parallel | interaction |
---|---|---|---|---|---|---|---|
kwic | classic key-word in context view | yes | no | yes | yes | yes | |
tree1 | constituent syntax tree | yes | no | yes | yes | no | |
grid | grid for annotation spans | yes | no | yes | yes | no | |
grid_tree | grid showing a hierarchy of layered spans | yes | no | yes | yes | no | |
discourse | document view underscoring and co-highlighting linked data | no | yes | no | no | yes | |
arch_dependency2 | linear dependencies with arches between words | yes | no | no | yes | no | |
html / htmldoc | custom annotation-triggered HTML/CSS | yes | yes | yes | yes | no | |
rst / rstdoc | rhetorical structure theory tree | yes | yes | no | no | no | |
pdf / pdfdoc | aligned pdf page or document | yes | yes | -- | -- | -- | grid |
audio / video | aligned A/V data | yes | yes | -- | -- | -- | grid, kwic |
hierarchical_dependency3 | graphviz-based hierarchical dependencies (vertical) | yes | no | no | no | no | |
ordered_dependency3 | graphviz-based linear-hierarchical dependencies (horizontal) | yes | no | no | no | no | |
graph | graphviz-based annotation graph (for debugging) | yes | no | no | yes | yes |
- Coded by Torsten Marek, Institute of Computational Linguistics of the University of Zurich
- Code provided courtesy of Kim Gerdes, ILPGA, Paris. Requires SVG capable browser.
- Developed in conjunction with Dag Haug, PROIEL project, Oslo. Requires Graphviz.
Gallery
Click on the images below to see some applications of the ANNIS visualizer modules
Syntax trees
Token and span annotations
Discourse annotation
Parallel corpora
Aligned multimedia data
Custom HTML visualizations
These visualizations were created using configurable annotation-triggered HTML and linked CSS files. For more information on creating your own custom visualizations see the HTML visualization guide.
Aggregate visualizations
A new form of visualization showing quantitative frequency information using aggregation functionality in the ANNIS back-end.