Comparison of Video Deep Tagging Systems

Recently I’ve been looking at and comparing different factors in the design of several deep video tagging applications including Viddler, The Click, Mojiti, and Gotuit’s Scenemaker. Basically, I’m looking at a few different factors in the design space of these systems including the coarse cost of adding annotations (e.g. # of clicks, navigation, typing text etc.), whether or not the video pauses when adding an annotation, how annotations are anchored (e.g. on segments or on points), and if annotation on annotations are possible including whether any notion of threaded responses is supported. Each of these systems touches the design space in a different way. The take away from this exercise is that there are several interconnections between the costs associated with interaction (including video pausing) and the type of anchor used and whether annotations can be added to other annotations. These different design criteria affect the efficiency and user experience of the resulting application.


Cost: 4 clicks + typing (navigation, add annotation, select type, type text, confirm)

After the add annotation button is clicked, the video pauses so that the following actions don’t interfere with watching the video. If you expand a comment the video does not stop.

Anchors: annotations are anchored to a single point in the video, thus both in and out points do not need to be specified. Anchors are shown on the timeline as points.

Annotation on Annotation: People can vote on a comment using a thumbs up / thumbs down metaphor as well as reply to comments (only to 1 layer deep).

The Click

Cost: 2 clicks+ typing (add annotation, type text, confirm annotation)

The video pauses while you are typing a comment, but resumes after confirming the annotation.

Anchors: Annotations are anchored to points within the video but are not shown in the timeline; the only way to navigate is to move to next or previous comment.

Annotation on Annotation: No.


Cost: VERY HIGH. 5 clicks + typing + navigation + positioning.

The video pauses when an annotation is added.

Anchors: Annotations get added to a segment of the video which defaults to 5 seconds long. The in point begins at the click point.

Annotation on Annotation: No


Cost: 3-5 clicks + typing + navigation ([position in], [position out], mark in, mark out, confirmation click), including 0-2 clicks for navigation

The video stops only when the “end segment” button is hit. Thus the video continues playing after the start segment button is hit so that the user can continue watching the video. This seems to be a result of the use of segments in the application since if it were a point marker you would have to pause at that one point. I like the idea of only pausing at the end segment click though.

Anchors: Segments with in / out points.

Annotation on Annotation: No.

Comments are closed.