Oculus Claims Breakthrough in Hand-tracking Accuracy


One of Facebook’s underlying goals for VR is to use it as a means of connecting distant people. While today friends can talk and play in a range of social VR applications, including Facebook Spaces, the representation of users in VR is still a caricature at best. At F8 this week, Oculus showed work being done on hand-tracking to bring more intuitive control and accurate avatars into VR.

Maria Fernandez Guajardo, Head of Product Management on Core Tech at Oculus, this week shared glimpses of her work, which she describes as turning the company’s various research projects into practical technologies for future products. Beyond hand-tracking, she also revealed the Oculus ‘Half Dome’ prototype, a headset with a 140-degree field of view and a varifocal display.

Guajardo detailed a computer-vision based hand-tracking system trained with a self-optimizing machine learning algorithm, which she says achieves tracking that’s “far more accurate than any method before for tracking a single hand, two hands, and hand-object interactions.” Footage that appeared to show the hand-tracking in action also seemed to show detection of snapping gestures.

Image courtesy Oculus

Guajardo explained that the company used a marker-based tracking system to record hand interactions in high fidelity, then condensed the recorded data into 2D imagery. That allowed them to set a convolutional neural network to the task of uniquely identifying the positions of the markers across a large set of hand pose imagery, effectively allowing the system to learn what a hand should look like given an arbitrary set of marker positions. Ostensibly, this trained system can then be fed markerless camera input of a user’s hands and solve for their position.

Image courtesy Oculus

By measure of something Oculus labeled ‘Tracking Success Rate’ (and also described as “accuracy”), the company claims to have achieved a rather astounding 100% success rate with single hand-tracking, up from the claimed 90.49% of other hand-tracking methods. The company claims even bigger leaps compared to other methods for two-handed and hand-object interactions.

Hand-tracking can enable rich, controller-less input, which could be especially suitable for mobile VR headsets where carrying separate controllers would hinder portability. While the accuracy achieved with Oculus’ approach is certainly impressive, it isn’t clear what the computational constraints are at this stage.

  • For serious software, any advancement in natural hand gesture tracking is a great thing to see coming along.

    Games will always have the problem of occlusion and line of sight though: if it can’t see what you are doing with your hands/fingers then it won’t be able to do anything meaningful.

    e.g. grabbing a gun and putting your hands behind your back or to the extreme sides (think The Matrix lobby scene). Your body or even your own hand/fingers could occlude gestures, making it a special-case use only.

    The VR Glove or VR thimble will emerge as the most accurate way to track hands/fingers, even when the HMD does not have line of sight. Hopefully that will be sub-mm accurate and low power too.

    • Mei Ling

      The main problem I see with any sort of full-on hand based tracking (with or without gloves) is how do you abstract your walking movements into the virtual world whilst sitting in place and not using any sort of leg tracking. Essentially how do you enable full “abstracted” locomotion when your hands are doing what they are supposed to be doing.

      If Oculus decide to release a version of their headset with hand gesture tracking then as far as I’m aware one hand would technically still need to hold a controller of some sort with a joystick, with the other hand being free.

      • Well if gesture tracking works as it should then you could find a way to include simple teleport gesture positioning. But then you can’t be shooting a pair of guns at the same time or holding objects in both hands while moving around.

        Maybe a foot rocker pedal will be developed (like car pedals for seated experiences), or for ultimate immersion we would be using a slippery floor and a simplified version of the hand tracking that tracks your feet, allowing both hands and both feet to be tracked / gesture tracked.

        • Mei Ling

          Valve Knuckles seems to be a sort of “best of both worlds” approach in that you strap the device onto your palm allowing your fingers to be free whilst having a track pad on your palm for gripping and abstracted functionality such as locomotion.

  • NooYawker

    They bought Leap Motion?

    • cartweet

      They bought NimbleVR which was similar tech years ago.

  • impurekind

    I’d like to see a system that uses only your hands for general menu interaction and stuff and then also lets you use a proper physical control when actually playing games or doing things where being able to grip something and feel tactile feedback is better. That would be a great dual solution.

    • Laurence Nairne

      Leap Motion handle this in their latest version of the Orion SDK though I’ve not worked on implementing it into any projects so couldn’t tell you how well it works. I also find that the 90.49% success rate of tracked positions supposedly achieved by their system is too prone to failure to be used in an actual product anyway.

      • Andrew Jakobs

        But don’t forget, it’s working with pretty old hardware and doing a pretty good job. They need to have new up-to-date hardware and I think it will go up to 98% or something (100% tracking just isn’t possible, not even a regular controller does 100%).

        • Laurence Nairne

          Aye it is old hardware, but computer vision is mostly a software challenge. It’s always about guesstimating finger positions on the information that the camera can see.

          New hardware won’t solve that. But you’re right, 100% is marketing speak.

  • sebrk

  • Stefan Küppers

    Hmm, all I see is wiggling fingers and stuff Leap Motion showed ages ago. Maybe I am a cynic but please show me someone typing on a keyboard so I can be convinced that precision has improved over the years….

    • Bryan Ischo

      But these wiggling fingers appear to be much more accurate than Leap Motion’s wiggling fingers, which is worth talking about.

    • cartweet

      The difference here is that its trained model can infer what your hand is doing even when your fingers are obscured. Leap Motion does a pretty good job, but Oculus’ charts show that with a single hand they can achieve 100% accuracy, and a very high degree with hand-hand. As an example, their system can identify when you snap your fingers.

  • Brad

    The problem with this kind of optical hand tracking is that it only works when the camera, presumably mounted on the headset, is looking at your hands, so if your hands are by your side, or you look away, it has no idea what your hands are doing.

  • Rob

    That is incredible, but unless they invent some kind of gloves, it won’t feel like you’re holding something…

  • Mike

  • Rolf Adelsberger

    Optical hand tracking certainly offers ease of use. However, there are multiple problems that you cannot solve with a lightweight setup. Haptics, for example. Or occlusions or even self-occlusions…

    Have a look here: https://www.youtube.com/watch?v=Aej-wrgHq6Y

    I’m holding a golden torch for Microsoft’s Mixed Reality. When I picked up a Lenovo headset, zero VR research went into my purchase, and I couldn’t be happier. I know no other technology at the moment has hand tracking “yet”, but until we see software that actually exploits it, it is a sort of useless feature. Furthermore, I think hand tracking could probably work on WMR more effectively than on other VR gear, because of Microsoft’s inside-out cameras that are built into the headsets. Take that with a grain of salt though, as WMR currently does not support hand gestures. What I’m trying to say is I can see it working flawlessly in some future update.

    This will never be a 100% accurate solution because there’s always going to be some position where your fingers get occluded by part of your hand. Probably good enough for most applications but not for anything super accurate like typing.

  • Jason Gorman

    The key here is the CNN (convolutional neural network). We are going to see these kinds of huge advancements popping up everywhere in the next few years. Those who think this is not possible, I think, haven’t quite wrapped their heads around what neural networks are capable of. A quick trip to GitHub and a search for VR CNN or VR Neural Network will give you a glimpse.

  • DanCarmon

    And only 2 years after Intel achieved and showed real hand tracking at CES! Yay Facebook!


  • Rolf Adelsberger

    I already posted it earlier: optical tracking is fine for specific applications. However, as these systems don’t actually measure, but rather “match images” to pre-learned gestures, they will not be able to detect arbitrary gestures. Especially in situations where the cameras cannot see the fingers anymore…

    Have a look: https://youtu.be/Aej-wrgHq6Y