Oculus Audio SDK impressions ces 2015 (2) — See Also: Oculus Rift ‘Crescent Bay’ is Designed for Audiophiles – Here’s Why that’s Important for VR

Oculus Rift ‘Crescent Bay’ is Designed for Audiophiles – Here’s Why that’s Important for VR

Jan 15, 2015

Oculus’ latest feature prototype VR Headset, code-named Crescent Bay, was out in force at this year’s CES, blowing minds left, right and centre. But whilst debate rages over precisely what optics and display it contains, Crescent Bay’s audio has been somewhat sidelined. Here’s why Oculus’ work on building a dedicated audio pipeline with high-end hardware matters.

‘VR Audio’ – Oculus’ Latest Inititive to Improve Immersion

Oculus’ objectives at CES 2015 was fairly clear; 1) Get the new Oculus Rift ‘Crescent Bay’ onto as many people’s heads as possible and 2) Talk up VR Audio, the company’s term for their 3D positional audio pipeline. It seems to have been ‘mission accomplished’ on both counts judging by the sheer amount of mainstream press the company has received this year – all talking about 3D positional audio and their incredible experiences with Oculus’ latest hardware.

But whilst CB’s newly integrated headphones have drawn some superficial attention by the media and community at large, it may not be immediately obvious just how seriously Oculus is taking what you, the player, hear whilst in virtual reality. Also, if you’ve yet to experience truly effective positional audio whilst in VR (or any gaming experience for that matter), you may not realise that attaining ‘presence’, the industry term for psychological immersion, may rely so heavily upon it.

The Hardware

Oculus’ public mission for providing top-notch VR Audio began with their announcement at Oculus Connect that the company was to license Maryland University startup Visisonics‘ Realspace 3D Audio engine for inclusion in their SDK. This means that every developer creating experiences for the Oculus Rift will have immediate access to a set of APIs that allow them to take advantage of 3D positional audio without the need to seek out proprietary solutions. In theory, lowering the barrier of entry for great 3D sound in games.

A Few Words on HRTFs

As with most of the incredible things our brains do for us, the way we perceive the world through sound is taken for granted. But the subtle detection of reflections and distortion that sound suffers on its way to our ears provide us with critical spatial information.

An example of HRTF and ITD in action. [Credit: music.columbia.edu]

HRTF (Head-related transfer functions), is a somewhat unhelpful sounding acronym which refers to methods your brain uses to detect audio delays in the environment and property changes that your brain uses to judge relative distance between it and the sounds source. ITD (Inter-aural Time Delay: the correlation of time delay between sounds reaching each of our ears) and HRTF are used by our brains to build a surprisingly accurate aural landscape of the world. Unsurprisingly, emulating these cues can be extremely effective at convincing the brain it’s somewhere it’s not.

No HRTF is created equal however. As everyone’s head shape is subtly different, your brain is attuned specifically to it. Which raises interesting questions for VR Audio’s implementation. Will there be a calibration step that allows you to provide a 3D model of your head to ensure those audio reflection and occlusion is calculated accurately? I suspect not, but it’s likely that calibrating HRTFs for each user in virtual reality will be important in the future, as every other aspect of VR becomes more and more realistic.

From my own personal experience, I’ve come closer to achieving presence using spatialised 3D audio VR demos than any other so I’m heartened and impressed at the efforts being made by Oculus to ensure its use is not only supported but positively encouraged in future virtual reality content.

Combining high quality, custom components at every stage in Crescent Bay’s audio pipeline, Oculus has provided a way to ensure its vision for compelling and ultimately presence-enhancing 3D positional audio can be delivered to the consumer. Assuming these measures make their way into the consumer version, the Oculus Rift could wind up being the best sounding device in the household.

leoc

Latency has to be a big concern as well. Things like Rocksmith illustrate that audio latency on PC can be a real problem: it can be difficult or impossible to eliminate a noticeable delay between striking the guitar strings and hearing a note play through your speakers or headphone. That kind of lag would really hurt VR positional audio too, especially when the user moves his head.
Neuromute

Those integrated headphones look like the Sennheiser PX-100
- Soleya Williams
  
  The speculation is that they partnered with Koss: http://www.reddit.com/r/oculus/comments/2i5upu/are_the_crescent_bay_headphones_using_parts_from/
  
  I wonder what that will do to the company share-price if it’s true.
  - Jacob Pederson
    
    Yea, they look like Koss to me. Hope they are, I love the Porta-Pro!
Don Gateley

This also implies that so long as the actual transducer in the integrated ‘phones is linear (i.e. no harmonic or IM distortion), which is not difficult to achieve with today’s magnet and materials technology, then with DSP they can make the frequency response of the pipeline all the way to a representative eardrum whatever they choose to design or emulate. For example they could be made to sound identical to a $350 AKG K702 just by some good measurement and the inclusion of a DSP kernel to convolve within the pipeline. (I’ve done and am doing measurement based work that fully demonstrates the efficacy of this.)

Sounding like an existing device, however, is hardly optimal and not what they probably will do. Since they can give the entire path to the eardrum an arbitrary response, why not find the one (or several) that the most critical ears find the most ideal. That’s not an easy research task in and of itself but probably worthwhile in the scheme of things and certainly of great academic interest.

There is even more that can be done with this but I’ll stop here.
japes98

The guys over at Earmark Labs (www.earmarklabs.com) are doing some interesting stuff with personalized HRTFs.
AJ@VRSFX

Great article! HRTFs are a great option for 3D spatial audio. To be honest, though, I was extremely disappointed by Crescent Bay’s audio. Spatial cues can be obliterated from a signal that’s using even the best HRTFs if the dB and EQ aren’t quite right. As a result, external room noise is a big factor. It was fairly quiet in the room where they demoed Crescent Bay at Oculus Connect, but those headphones had sad grip strength. I was hoping it was just because it was a new prototype at the time, but they don’t seem to have fixed it yet as of CES. When I demoed Crescent Bay, I had to press the headphones firmly against my ears to hear the spatial cues at all, and even then they were faint compared to what they could be. The hardware needs to provide more than just flat response. Noise cancellation needs to be their end game.
- Don Gateley
  
  The response that you got when you held them tight can be achieved nearly identically with DSP while they are in normal contact. If that’s the sound they want delivered. My take on your experience is that they simply weren’t loud enough to compete with whatever cues remained incident from the room.
  
  I agree for all kinds of reasons that external sound must be isolated or cancelled but when dealing with things that are so sensitive to differential frequency response it is going to be very hard to get sufficient and consistent feedback control across the whole spectrum of sound and fits of interest. They are going to have to close off the room from both your eyes and your ears and do so physically not electronically if sound levels are to remain comfortable.
  
  Nothing that simply sits on your ear will do it for the difficult job of coordinating visual with auditory stimulus to achieve the immersion we and they want. The job of interpolating within an HRTF space using position data is trivial compared to the physical and empirical problems of ‘phone and coupling design with all the variables involved.
  
  In-ear buds can provide a much easier and more consistent solution but not everyone likes those things in their ears (and of course they would not be suitable for any demo room unless they were use-once and toss.)

Oculus Rift ‘Crescent Bay’ is Designed for Audiophiles – Here’s Why that’s Important for VR

‘VR Audio’ – Oculus’ Latest Inititive to Improve Immersion

The Hardware

A Few Words on HRTFs

Latest Headlines

VR Comfort Settings Checklist & Glossary for Developers and Players Alike

‘Into the Radius 2’ Releases in Early Access on PC VR Today, Including Two-Player Co-op

‘Arken Age’ Release Date is “coming soon,” Promising 10–15 Hour Campaign on PSVR 2 & PC VR

Features & Reviews

‘Bounce Arcade’ is Like VR Pinball for Your Fists—And Exactly the Kind of Creativity VR Needs to Thrive

‘Half-Life: Alyx’ on PSVR 2 Would be a Win-win-win for Valve, Sony, & Players

Hands-on: Sony’s New MR Headset Impresses with Clarity & Ergonomics, But Still Needs Tuning