
We obtain most of our knowledge of the world around us 
  through our eyes. Our visual system processes information in two distinct ways 
  -- conscious and preconscious processing. When we are looking at a photograph, 
  or reading a book or map requires conscious visual processing and hence usually 
  requires some learned skill. Preconscious visual processing, however, describes 
  our basic ability to perceive light, color, form, depth and movement. Such processing 
  is more autonomous, and we are less aware that it is happening. 
  Physically our eyes are fairly complicated organs. Specialized cells form structures 
  which perform several functions -- the pupil acts as the aperture where muscles 
  control how much light passes, the crystalline lens performs focusing of light 
  by using muscles to change it's shape, and the retina is the workhorse converting 
  light into electrical impulses for processing by our brains. Our brain performs 
  visual processing by breaking down the neural information into smaller chunks 
  and passing it thoguh several filter neurons. Some of these neurons detect only 
  drastic changes in color, others neurons detect only vertical edges or horizontal 
  edges. 
  Depth information is conveyed in many different ways. Static depth cues include 
  interposition, brightness, size, linear perspective, and texture gradients. 
  Motion depth cues come from the effect of motion parallax, where objects which 
  are closer to the viewer appear to move more rapidly against the background 
  when the head is moved back and forth. Physiological depth cues convey information 
  in two distinct ways -- accommodation, which is how our eyes change their shape 
  when focusing on distant objects, and convergence, which is a measurement of 
  how far our eyes must turn inward when looking at objects closer than 20 feet. 
  We obtain stereoscopic cues by extracting relevant depth information by comparing 
  the left and right views coming each of our eyes. 
  Our sense of visual immersion in VR comes from several factors which include 
  field of view, frame refresh rate, and eye tracking. Limited field of view can 
  result in a tunnel vision feeling. Frame refresh rates must be high enough to 
  allow our eyes to blend together the individual frames into the illusion of 
  motion and limit the sense of latency between movements of the head and body 
  and regeneration of the scene. Eye tracking can solve the problem of someone 
  not looking where their head is oriented. Eye tracking can also help to reduce 
  computational load when rendering frames, since we could render in high resolution 
  only where the eyes are looking. 
  The sense of virtual immersion is usually achieved via some means of position 
  and orientation tracking. The most common means of tracking include optic, ultrasonic, 
  electromagnetic, and mechanical. All of these means have been used on various 
  head mouted display (HMD) devices. HMDs come in three basic varieties including 
  stereoscopic, monocular, and head coupled. The earliest stereoscopic HMD was 
  Ivan Sutherland's Sword of Damocles, which was built in 1968 while he was a 
  student at Harvard. It got its name from the large mechanical position sensing 
  arm which hung frm the ceiling and made the device ungainly to wear. NASA has 
  built several HMDs, chiefly using LCD displays which had poor resolution. The 
  University of North Carolina has also built several HMDs using such items as 
  LCD screens, magnifying optics and bicycle helmets. VPL Research's EyePhone 
  series were the first commercial HMDs. A good example of a monocular HMDs is 
  the Private Eye by Reflection Technologies of Waltham MA. This unit is just 
  1.2 x 1.3 x 3.5 inches and is suspended by a lightweight headband in front of 
  one eye. The wearer sees a 12-inch monitor floating in mid air about 2 feet 
  in front of them. The BOOM is head coupled HMD and was developed at NASA's Ames 
  Research Center. The BOOM uses two 2.5 inch CRTs mounted within a small black 
  box that has two hand grips on each side and is attached to end of articulated 
  counter-balanced arms serving as position sensing.