A block histogram lets you see the distribution of numeric values in a data set. The x-axis is divided into 'bins' that correspond to value ranges. Each item in the data set is drawn as a rectangular block, and the blocks are piled into the bins to show how many values in each range. To see the exact value of a block move your mouse over it. If you are charting more than one dimension use the menu at the bottom of the graph to choose which to show. You can also highlight a block by clicking - use control-click to highlight more than one, and click again to deselect). Highlights are helpful for pointing to particular items when you make a comment or following a particular item as you change the x-axis using the menu at bottom.
Document cloud comparison
The two clouds show the words related to the focus word in both documents in the same manner as for the single Word Association Cloud. The only difference is that colour is used to indicate words that are unique to one document or another. The words in blue on the left are unique to the 2007 SOTU and those in red on the right are unique to the 2008 SOTU. As before, you can click on a word to bring it in focus or click on the top edit box to change it. The clouds are linked in this case so that they always show the same word for both documents.
Double document shared word diagrams
Double Document Shared Word Diagrams compare and contrast two documents by showing both the unique and shared vocabulary and its distribution across the two documents of interest. The two columns of squares represent the two documents. The leftmost column of word circles shows the highest frequency non-trivial words found in document 1 but not document 2. The rightmost column of word circles shows those words unique to 2 and the central column shows the words that are common to both.<br />You can also input your text and generate your own visualisation.
The matrix chart divides the screen into a grid. Rows represent the values in one text column (e.g., political candidate) and columns represent another text column (e.g., states of the US). Each cell then shows a circle or bar that represents the value for its row/column combination (e.g., contribution to Hillary Clinton from New York).
All of the streets in the lower 48 United States: an image of 26 million individual road segments. The density represented corresponds to the density of the street network.
Text visualisation tool
The top left set of connected circles represents a partial view of a graph showing inter-relationships between words. There is a central ring of the primary words of interest and a secondary outer ring of some other words related to the central set. Click on an inner word to remove it from the central ring. Click on an outer word to add it to the central ring. In either case the words on the secondary ring are dynamically adjusted to show the 'most important words' related to the central set. The strength of the connections between the inner words and all the others are shown with simple lines. You can also hold down the number '1' key while clicking to make that word the only central word. <br /><br />The top right shows a collection of bar graphs giving the distribution of the primary words across the entire document. Underneath it is a small map showing the distribution of the words across the entire document. The bottom right gives a list of other interesting words that aren't already in the circle diagram - high frequency but modified so that capitalised words are boosted. These words can be clicked on to add them to the central diagram. The bottom left gives excerpts for the word last hovered over. There are 5 or 6 files you can explore by clicking on the upper left '?' icon.
This is an interactive tool to visualize debate transcripts.<br />Click on the 'Load another transcript' button to choose a different transcript. The top section shows the distribution of some selected words within the text across a 'timeline' which goes from left to right. Each speech segment is the same width and the height of the small white bars show the number of occurrences of that word for that segment. You can add new words with the text box in the top right corner. You can remove existing words by clicking on them.<br /><br />Right below the word distribution graphs is a similar coloured set showing a spectral decomposition of the text based on who spoke and how much was said. In this case the bar heights give the amount of text for each segment. Click and drag the mouse left to right to move along the timeline and show the actual text for 3 consecutive segments.
A word tree is a visual search tool for unstructured text, such as a book, article, speech or poem. It lets you pick a word or phrase and shows you all the different contexts in which it appears. The contexts are arranged in a tree-like branching structure to reveal recurrent themes and phrases. Note: all submitted data remains publicly viewable.
The most frequently used words in a text. The Wordle can visualise the relationship between a column of words and column of numbers. Or, it can strip out the punctuation and show word frequency within a text.