Structured Parallel Coordinates: Ranking comparisons

© Copyright 2010-2012 Accademia Europea Bolzano

Structured Parallel Coordinates home page | EURAC LInfoVis page

The purpose of this Structured Parallel Coordinates visualization is to compare rankings, e.g. the top 100 most frequent words in different (sub)corpora, or the top 30 infinitives following different modal verbs, or the top 20 pop songs over the last 3 months. Click to see the modal example. (Data from the UKWAC100M web corpus of British English.)

Click to show instructions

Click to hide instructions

Each set of data is entered in the areas below line by line, in ranked order, from first to last, where each line is in one of these three formats:

  1. numerical value followed by a tab followed by items with that numerical value (same rank), with tabs between them; the numbers can indicate, i.e. frequencies, statistical measures, wave length, size in cm, etc.
  2. a single item followed by a tab followed by a number
  3. items that have the same rank separated by tabs, with no number

As the tab key cannot be used when entering data, it is best to prepare the data in a text editor and copy it to the data field.

"Number first" should be checked for the column in the first case, and it should not be checked in the other two cases. Items with the same number will automatically be treated as having the same rank (as long as they are entered on subsequent lines). Blank lines are ignored. If the numbers you provide are frequencies, you can check "Show frequencies" to have the percentages of the frequencies per data set shown as bars.

You can also provide a label for the data, as well as choose whether the data will be included in the visualization. You can change the order of the data sets by dragging the column headers ("Data 1", etc.) to the desired place. To have each axis have the same size between items, check "Equal sizes". If the numbers you provide are frequencies, you can check "Show frequencies" to have the percentages of the frequencies per data set shown as bars.

Once you have entered your data, click on "Compare ranks" to see the comparison below the data chart. If you checked "Show frequencies", the total number of tokens for a series is included in parentheses after the name of the series. Identical items in different data sets are connected by lines. ("[NA]" means that the relevant item does not occur in the series of that axis.)

Data 1Data 2
Label
Include
Number first
Data

Equal sizes Show frequencies