The Whisper Deck
Overview
The Whisper Deck is a voice-controlled augmented reality data visualization tool that immerses users within a fluid information ecosystem of their own design. The project is an experimental interface that explores new ways in which we can examine the vast amount of data being generated by the world on a daily basis.
Description
Using an off-the-shelf Vuzix Cam-AR head-mounted display, users can look around their local environment and examine the world through the integrated webcam unit on the front of the display. When a pre-defined symbol comes into view, a 3D world instantly appears. As long as this symbol remains in view, the newly created augmented space persists, and the user can examine it from any direction simply by moving around it in real space.
Users can issue requests to the Whisper Deck using a series of voice commands. These commands cause the world to reconfigure itself based on your preferences. For example, to have the world gather information about a topic you are interested in (say, Boston Terrier puppies), simply speak the command “search Boston terrier puppies” followed by the keyword “over” at the end of your sentence. You can search almost any topic that interests you, from ice cream to casinos to cars. The system will go out to the Internet and retrieve information relating to your request, including a spoken definition from Wikipedia as well as a set of images from various publicly accessible image search engines.
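To make the command format concrete, here is a minimal sketch of how a transcript from the speech-to-text package might be split into a command and its argument. It is written in Python for illustration and is not the project's actual code: the function name `parse_command`, the set of known verbs, and the punctuation handling are all assumptions. Only the “search” and “compare” commands and the closing keyword “over” come from the description above.

```python
import re

TERMINATOR = "over"                  # keyword that ends each spoken request
KNOWN_VERBS = {"search", "compare"}  # the two commands named in this post


def parse_command(transcript):
    """Return (verb, argument) for a finished request, or None otherwise.

    Hypothetical helper: the real system's parsing rules are not documented.
    """
    # Lowercase and drop punctuation so "Search puppies, over." still matches.
    words = re.findall(r"[a-z0-9']+", transcript.lower())
    if len(words) < 2 or words[-1] != TERMINATOR:
        return None  # the speaker has not said "over" yet
    verb, args = words[0], words[1:-1]
    if verb not in KNOWN_VERBS:
        return None  # not a command this sketch knows about
    return verb, " ".join(args)


if __name__ == "__main__":
    print(parse_command("search Boston terrier puppies over"))
```

Waiting for the closing keyword means a half-finished sentence is never acted on, which is presumably why the command format asks for it.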
In addition, the Whisper Deck allows users to compare the relative popularity of search terms by interfacing with Google Trends. Speaking the command “compare” lets you name any number of terms, which are then visualized as a 3D bar chart that can be inspected further.
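As a rough illustration of the bar chart step, the sketch below scales a set of popularity scores so that the most popular term gets the tallest bar. It is an assumption about how such a chart could be sized, not the project's actual code: the function `bar_heights`, the `max_height` parameter, and the sample scores are hypothetical, and no real Google Trends data is used.

```python
def bar_heights(scores, max_height=10.0):
    """Map each term's popularity score to a bar height.

    The highest-scoring term gets a bar of max_height; the rest are
    scaled in proportion. Hypothetical helper for illustration only.
    """
    if not scores:
        return {}
    peak = max(scores.values())
    if peak <= 0:
        # Nothing has any measurable popularity, so draw flat bars.
        return {term: 0.0 for term in scores}
    return {term: max_height * value / peak for term, value in scores.items()}


if __name__ == "__main__":
    # Made-up scores, only to show the scaling.
    print(bar_heights({"ice cream": 80, "cars": 40}))
```

Scaling relative to the largest value keeps the chart the same overall size no matter how many terms are compared.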
Technology
The Whisper Deck uses a number of different tools. While most of the technologies described below are web-friendly, the voice-controlled aspect of the system is handled by a desktop speech-to-text package.
- Flash ActionScript 3
  - FLARToolkit (marker detection)
  - Papervision3D (3D rendering)
- Web Services
  - Yahoo! Pipes (Flickr, Picasa & Google Images feed aggregation)
  - Perl + Python (Google Trends integration)
- Voice Recognition / Playback
  - MacSpeech Dictate
  - Perl + integrated Apple OS X text-to-speech engine
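For the playback side, the sketch below shows one way a script could hand text to the OS X text-to-speech engine through the built-in `say` command-line utility. The project itself uses Perl for this step; Python is used here purely for illustration, and the helper names `say_command` and `speak` are assumptions rather than anything from the original code.

```python
import subprocess
import sys


def say_command(text, voice=None):
    """Build the argument list for the macOS `say` utility."""
    cmd = ["say"]
    if voice:
        cmd += ["-v", voice]  # -v selects one of the installed system voices
    cmd.append(text)
    return cmd


def speak(text, voice=None):
    """Speak text aloud on macOS; return False on other platforms."""
    if sys.platform != "darwin":
        return False  # `say` only ships with macOS
    subprocess.run(say_command(text, voice), check=True)
    return True


if __name__ == "__main__":
    speak("The Boston Terrier is a breed of dog.")
```

Passing the text as a separate list item, rather than building a shell string, avoids quoting problems when a definition contains apostrophes or quotation marks.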


December 22nd, 2009 at 2:58 pm
Whoa, nice! Is there any chance that you will share the source code? We will have an ARDevCamp in Germany in Jan 2010, and this is a really nice piece of work.
Maybe we could link you in via video conferencing (don’t know where you are located) and present your work. Please advise. Regards, Marc
December 23rd, 2009 at 1:39 pm
Craig,
I see that you are at NYU, here in the city. I would like to invite you to present at the January New York Augmented Reality Meetup. It will be held at the offices of Porter Novelli at 7PM on Tuesday, January 19th:
Porter Novelli Office
75 Varick Street
New York, NY 10013
Get in touch with me if you are interested in participating. You belong at this meeting!
Kind Regards,
Chris
PS. Your work came to my attention via Tom Carpenter’s blog, The Future Digital Life.
December 28th, 2009 at 9:22 am
[...] to Craig’s website for more information and get him to the next ARDevCamp, [...]
January 18th, 2010 at 4:36 pm
[...] Craig Kapp – NYU – Whisper Deck – will introduce an experimental interface to access information using [...]
January 23rd, 2010 at 10:16 am
[...] demonstration. The demo provided by Craig Kapp, an NYU student, was indeed powerful and impressive, as you can see from his own website. One of the factors I didn’t comprehend was his desire to be away from his computer, yet his [...]