ANNIS: Visualizations (corpus-tools.org)

Overview

ANNIS uses pluggable JAVA modules to visualize data. A configurable resolver module determines how each annotation layer is visualized, so that data from different annotation sets can be rendered with either different or the same visualizations (e.g. two separate syntax trees displayed one after the other for the same sentence).

To see some examples of data visualized in ANNIS, take a look at our gallery here. The table below provides a list of the currently available visualizers and summarizes some of their features:

hit view/doc view - Visualizers are set to either display a hit view (e.g. ±n tokens around the search result) or a doc view to visualize the entire document the results come from. Many visualizers can do both, and in some cases two variants are available, in which case the document visualizer has the suffix 'doc'.
RTL - Some visualizers also support right-to-left languages such as Arabic or Hebrew, which is shown in the RTL column.
multi-seg - Some visualizers can use an alternate segmentation layer or tokenization to represent basic word forms instead of the default tokenization. These visualizations are suitable for corpora that contain dialogue data, subtokenization or conflicting tokenizations of the same data.
parallel - Visualizers specifically equipped to deal with parallel corpora may display text from multiple aligned sources within the same module. For other visualizers, parallel corpora can trigger multiple instances of the same visualizer (e.g. syntax trees for each aligned language), but there will be no single visualization incorporating data from multiple sources at the same time.
interaction - Visualizers that offer cross-module interactivity can be triggered by other visualizers, such as A/V players that can 'jump' to the correct aligned time position when the corresponding annotation is clicked on in the grid or kwic view.

visualizer	description	hit view	doc view	RTL	multi-seg	parallel	interaction
kwic	classic key-word in context view	yes	no	yes	yes	yes
tree¹	constituent syntax tree	yes	no	yes	yes	no
grid	grid for annotation spans	yes	no	yes	yes	no
grid_tree	grid showing a hierarchy of layered spans	yes	no	yes	yes	no
discourse	document view underscoring and co-highlighting linked data	no	yes	no	no	yes
arch_dependency²	linear dependencies with arches between words	yes	no	no	yes	no
html / htmldoc	custom annotation-triggered HTML/CSS	yes	yes	yes	yes	no
rst / rstdoc	rhetorical structure theory tree	yes	yes	no	no	no
pdf / pdfdoc	aligned pdf page or document	yes	yes	--	--	--	grid
audio / video	aligned A/V data	yes	yes	--	--	--	grid, kwic
hierarchical_dependency³	graphviz-based hierarchical dependencies (vertical)	yes	no	no	no	no
ordered_dependency³	graphviz-based linear-hierarchical dependencies (horizontal)	yes	no	no	no	no
graph	graphviz-based annotation graph (for debugging)	yes	no	no	yes	yes

Coded by Torsten Marek, Institute of Computational Linguistics of the University of Zurich
Code provided courtesy of Kim Gerdes, ILPGA, Paris. Requires SVG capable browser.
Developed in conjunction with Dag Haug, PROIEL project, Oslo. Requires Graphviz.

Gallery

Click on the images below to see some applications of the ANNIS visualizer modules

Syntax trees

Arch dependency visualizer

Tree visualizer

Right-to-left tree

Hierarchical dependency

Ordered dependency

Token and span annotations

KWIC

Standard grid

Hierarchical grid-tree

Right-to-left KWIC

Discourse annotation

Coreference annotation

Rhetorical Structure Theory
annotation

Parallel corpora

Parallel KWIC view

Aligned discourse view

Aligned multimedia data

Aligned A/V data

Page-aligned
PDF embedding

Custom HTML visualizations

These visualizations were created using configurable annotation-triggered HTML and linked CSS files. For more information on creating your own custom visualizations see the HTML visualization guide.

Movie script visualization

Coptic diplomatic manuscript

Information structure visualization

Aggregate visualizations

A new form of visualization showing quantitative frequency information using aggregation functionality in the ANNIS back-end.

Frequencies visualization