10
Effect of camera separation on the viewing experience of stereoscopic photographs Mikko Kytö Jussi Hakala Pirkko Oittinen Jukka Häkkinen Downloaded from SPIE Digital Library on 09 Aug 2012 to 130.233.121.10. Terms of Use: http://spiedl.org/terms

Effect of camera separation on the viewing experience of ...jussihakala.fi/wp-content/papercite-data/pdf/kyto2012a.pdf · Kytö et al.: Effect of camera separation on the viewing

  • Upload
    others

  • View
    5

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Effect of camera separation on the viewing experience of ...jussihakala.fi/wp-content/papercite-data/pdf/kyto2012a.pdf · Kytö et al.: Effect of camera separation on the viewing

Effect of camera separation on theviewing experience of stereoscopicphotographs

Mikko KytöJussi HakalaPirkko OittinenJukka Häkkinen

Downloaded from SPIE Digital Library on 09 Aug 2012 to 130.233.121.10. Terms of Use: http://spiedl.org/terms

Page 2: Effect of camera separation on the viewing experience of ...jussihakala.fi/wp-content/papercite-data/pdf/kyto2012a.pdf · Kytö et al.: Effect of camera separation on the viewing

Effect of camera separation on the viewing experienceof stereoscopic photographs

Mikko KytöJussi Hakala

Pirkko OittinenJukka HäkkinenAalto University

Department of Media TechnologyFI-00076 AALTO, FinlandE-mail: [email protected]

Abstract. This study presents a geometric and subjective analysis oftypical mobile stereoscopic 3-D images. The geometry of the stereo-scopic pipeline from the scene to the eyes of the viewer is a highlyrelevant issue in stereoscopic media. One important factor is cameraseparation, because it can be used to control the perceived depth ofstereoscopic images. The geometric analysis included considerationof disparity and roundness factor within typical mobile stereoscopicimaging scenes. These geometric properties of stereoscopic 3-Dimages were compared to subjective evaluations by varying cameraseparation in different scenes. The participants in this study evaluatedthe strength and naturalness of depth sensation and the overall view-ing experience from still images with the single-stimulus method. Theresults showed that participants were able to perceive the change ofdepth range even though the images were shown in random orderwithout a reference depth scale. The highest naturalness of depthsensation and viewing experience were achieved with 2 cm to 6 cmcamera separation in every content. With these preferred cameraseparations, the disparity range was less than�1 deg and cardboardeffect (quantified with roundness factor) did not negatively affectthe naturalness of depth sensation. © 2012 SPIE and IS&T. [DOI:10.1117/1.JEI.21.1.011011]

1 IntroductionIt can be expected that stereoscopic 3-D (S3D) applicationswill be incorporated in mobile devices in the foreseeablefuture with stronger impact. To better define the require-ments for S3D applications, more knowledge about viewingexperience is required. The widespread commercial applica-tion for S3D imaging requires a positive response fromviewers thus the effect of imaging parameters on the viewingexperience should be better understood. The challenge inS3D imaging compared to traditional photography emergesfrom imaging geometry because the perceived depth mustbe controlled to ensure a good level of visual experience.In order to do this, we need to know which depth magnitudesare preferred, what the viewing conditions are, which cameraparameters are used, and which depths occur in imagingscenes. The key component for controlling perceived

depth is camera separation. However, it is usually not adjus-table in mobile devices.

Mobile imaging scenes are diverse, hence defining anoptimal fixed camera separation is difficult. In order to selecta camera separation that is applicable for most uses, we needto know how viewers experience the depth magnitude. Thusthe first answer that must be defined is: Can viewers perceivethe change of disparity range in a set of S3D images? Whencamera separation increases, the disparity range of imagesincreases. The aim is to evaluate how participants perceivethis change. If participants’ depth perception changesaccording to camera separation: Can we resolve the disparityrange where subjective evaluations of viewing experience areat their highest?

Limiting the disparity range is essential for visual com-fort.1 Disparity ranges that are too long cause diplopiaand excessive mismatch between accommodation and con-vergence. The comfortable disparity range varies amongusers, but for most cases the disparity range selected accord-ing to the 1 deg rule can be expected to be small enough forcomfortable viewing.1 In diopters, the mentioned accommo-dation convergence mismatch rule is �0.3D,2 which equals�1.1 deg when interpupillary distance is 6.5 cm. It has beenshown that more than 90% of disparities in natural scenesalong eye level are within�1 deg .3 Thus it may be expectedthat this disparity range is also perceived as natural whenviewing a stereoscopic display.

The length of depth range in the viewer/display spaceis called the depth budget (Fig. 1). The appropriate depthbudget depends on the viewing distance and the viewer’sinterpupillary distance. It is less in front of the screen thanbehind the screen. When the depth budget of a stereoscopicdisplay has been determined, the camera separation can becomputed if the viewing, camera, and scene parameters areknown.4 If that is the case, the camera separation can becomputed to fill the depth budget entirely. In this study, thecomputational camera separation is computed according todepth budget, as shown in Fig. 1. We want to determine:How do the computational camera separations differ fromsubjectively preferred camera separations?

In addition to the depth budget, there are other geometricissues in S3D pipelines. In particular, the roundness factoris considered to be important and it can controlled with

Paper 11098SSP received May 1, 2011; revised manuscript received Sep. 9,2011; accepted for publication Sep. 26, 2011; published online Feb. 27,2012.

0091-3286/2012/$25.00 © 2012 SPIE and IS&T

Journal of Electronic Imaging 011011-1 Jan–Mar 2012/Vol. 21(1)

Journal of Electronic Imaging 21(1), 011011 (Jan–Mar 2012)

Downloaded from SPIE Digital Library on 09 Aug 2012 to 130.233.121.10. Terms of Use: http://spiedl.org/terms

Page 3: Effect of camera separation on the viewing experience of ...jussihakala.fi/wp-content/papercite-data/pdf/kyto2012a.pdf · Kytö et al.: Effect of camera separation on the viewing

camera separation.5 The roundness factor is defined as theratio of depth magnitude and size magnitude of an object.The depth transformation is the curve that maps scenedepth Zs to perceived depth Zp (Fig. 2). The roundness factorcan be computed with a procedure adapted from research byYamanoue et al.6 The derivative of depth transformationcurve k [Eq. (14) in Ref. 7] is computed as follows [Eq. (1)]:

k ¼ ΔZp

ΔZs. (1)

Magnification (mr) on retina is computed with angle of view-ing (αview) and camera field of angle (αcam) according toEq. (2):

mr ¼tan

�αview2

tan�αcam

2

� . (2)

Magnification of the image (m) can be presented withEq. (3):

m ¼ Zp

Zsmr. (3)

Finally, the roundness factor r is computed as follows[Eq. (4)]:

r ¼ km. (4)

The smaller the display, the higher the non-linearity betweenscene depth and perceived depth. If the size magnitude isbigger than the depth magnitude (roundness factor is lessthan 1), the object is perceived as flattened compared tothe real-world object (illustrated in Fig. 2). In stereoscopic

cinema, the limits for roundness factor are given byMendiburu.5 For example, roundness factors from 0.7 to0.5 are considered “sensible flattening of objects” and factorsfrom 0.2 to 0 to be very “cardboarded.” As mentioned,the roundness factor limits are given for stereoscopic cinema,but the limits can be different with smaller display sizes andviewing distances. To be able to generate S3D images asnatural as possible, an optimal roundness factor rangeshould be defined. Thus, we will also focus on: How doesthe roundness factor affect subjective evaluations ofnaturalness of depth?

However, it is well-known that the viewing experience ofstereoscopic media is a complex issue, dependent on morethan just geometric factors. This makes its definition andresearch challenging. Evaluation of image quality is anongoing research issue and there exist numerous image qual-ity models.8 The stereoscopic viewing experience is evenmore difficult to define. The attributes to quantify viewingexperience with S3D images are more subjective and abstractthan regular 2-D images. To understand how the viewingexperience behaves with changes in camera separation,Seuntiens’s9 stereoscopic visual experience model is used.The model is interesting because it raises the “naturalness”attribute above “image quality.” In 2-D image quality, natur-alness is considered to be one attribute of image quality,10 notthe other way around. The emphasis on the naturalness attri-bute can be expected to be important in S3D images, becausein S3D images unnatural phenomena (like cardboard andpuppet-theater effects) that do not appear in 2-D imagescan occur. The naturalness may have links to the “life-like”experience, which is commonly mentioned when describingS3D images.11

2 Mobile Imaging ScenesMuch of the research on S3D imaging has focused onimaging geometry and its effects on stereoscopic viewingexperience. However, in many studies,6,12–16 the images havebeen compositions of objects instead of typical imagingscenes. In Refs. 12–14, the effect of Joint PhotographicExperts Group (JPEG) coding on viewing experience wasstudied with compositions of objects. IJsseljsteijn et al.15

conducted an extensive study by varying imaging geometryand display durations, but the contents were atypical formobile use cases. Yamanoue et al.6 varied imaging geometrywhile investigating the puppet-theater effect and cardboardeffect with arguably unnatural image contents.

In this study, the photographic content is based on three ofthe International Imaging Industry Association’s (I3A) photoclusters. In I3A’s Camera Phone Image Quality Initiative, thetypical camera phone imaging conditions are presented as sixclusters in the photospace.17 The photospace is a statisticalfrequency distribution of picture-taking as a function of twovariables: illumination and subject-camera distance. Thesesix clusters have been estimated to cover 70% of typical2-D imaging conditions and thus represent the photospacewell. In this study, the photospace defined for 2-D imageswas used, even though the possible differences between2-D and S3D photospaces would be worth studying.

In stereoscopic imaging, the interesting dimension isthe subject-camera distance as it affects the geometry ofthe stereoscopic pipeline. Roughly, the photospace can besplit in two subject-camera distance ranges: distances

Fig. 1 The depth budget of a screen according to the 1 deg rule.Image not in scale.

Fig. 2 The effect of non-linear relation between scene depth andperceived depth on cardboard effect. The sphere at distance Zp1is flattened so the roundness factor is less than 1.

Kytö et al.: Effect of camera separation on the viewing experience of stereoscopic photographs

Journal of Electronic Imaging 011011-2 Jan–Mar 2012/Vol. 21(1)

Downloaded from SPIE Digital Library on 09 Aug 2012 to 130.233.121.10. Terms of Use: http://spiedl.org/terms

Page 4: Effect of camera separation on the viewing experience of ...jussihakala.fi/wp-content/papercite-data/pdf/kyto2012a.pdf · Kytö et al.: Effect of camera separation on the viewing

between 0.5 m and 5 m and distances over 5 m. The formerdistance range was selected for this study (the depth rangesof the scenes are presented in Table 1) and the selectedclusters are within this distance range.

The subject-camera distances for outdoor clusters can bemuch higher (over 5 m) so they might require larger cameraseparations than was possible to achieve with the equipmentused. However, there exists one study18 where longer cameraseparations (10 to 50 cm) were used for outdoor scenes. Theresults showed that the shortest camera separation (10 cm)was preferred. This result indicates that camera separations

below 10 cm are valid for further investigations and thesecamera separations are also more usable when consideringthe size of mobile devices.

The chosen subject-camera distance range includesimages shot in an indoor environment and they are presentedas Clusters 1 to 3 in Fig. 3. Cluster 1 is representative of oneperson sitting in front of a dark background, Cluster 2 of oneperson in a living room drinking wine, and Cluster 3 consistsof several persons playing a game in a living room. Thefourth image in the experiments is a composition of objects.The composition was modified from a typical 2-D camera

Table 1 The imaging and scene parameters.

Cluster 1 (Bar) Cluster 2 (Wine) Cluster 3 (Game) Composition

Varied camera separation (cm) 2, 4, 6, 8, 10 0, 2, 4, 6, 8, 10 2, 4, 6, 8, 10 2, 4, 6, 8, 10

Convergence distance using shift (cm) 120 150 200 200

Focal length (mm) 50

Sensor size, width × height (mm) 36 × 24

Aperture (f -number) 4 1,8 4 5,6

Scene depth range (cm) 70–150 100–300 180–500 200–300

Object distance according to I3A (cm) ≈100 ≈100 >400

Computational camera separation (cm) 4.7 5.4 10 23

Fig. 3 Clusters 1–3 in the following order: “Bar,” “Wine,” and “Game.” The rightmost is the composition. The first row illustrates cluster images takenwith mobile phones corresponding to the I3A clusters. On the second row are the S3D scenes used in this study. The bottom row shows thedisparity maps computed according to Ref. 20. The disparity maps were improved by hand as the computed disparity maps included distincterrors. The white value corresponds to the nearest depth and the darkest value to the farthest value. The disparity maps are computed withineach scene thus they should not be compared between scenes.

Kytö et al.: Effect of camera separation on the viewing experience of stereoscopic photographs

Journal of Electronic Imaging 011011-3 Jan–Mar 2012/Vol. 21(1)

Downloaded from SPIE Digital Library on 09 Aug 2012 to 130.233.121.10. Terms of Use: http://spiedl.org/terms

Page 5: Effect of camera separation on the viewing experience of ...jussihakala.fi/wp-content/papercite-data/pdf/kyto2012a.pdf · Kytö et al.: Effect of camera separation on the viewing

benchmarking composition to include more geometricobjects: a bicycle tire, volleyball, and a test target with planesat different depth levels. The test target was developed inRef. 19. The rationale for geometric objects was to helpthe user evaluate depth sensation and detect differences inthe roundness factor.

The disparity distribution (Fig. 3, bottom row) in Clusters2 and 3 is nearer the disparity distribution of natural scenes,3

whereas Cluster 1 and composition represent differentdisparity distributions. The last research question is: Arethere differences in the subjective evaluations between thescenes?

3 Experimental Setup

3.1 Shooting and Scene ConditionsImages of the scenes (Fig. 3) were taken with a stereo cameraconsisting of two Canon Mark II digital cameras with 50 mmlenses. A beam splitter configuration was used to achievesmall camera separations. Camera separations from 2 to10 cm were used to cover the camera separation rangedefined in Sec. 2. The cameras were carefully aligned andvertical misalignments removed. There are two different con-figurations to capture the depth with stereo cameras: paralleland toed-in configuration.11 The parallel configuration usesbuilt-in sensor shift or images are shifted afterwards. In thetoed-in method, the cameras are physically rotated towardseach other. The parallel configuration is preferred becausethe vertical disparity resulting from keystone distortionand depth plane curvature can be avoided. The parallelconfiguration was used in this study.

Table 1 summarizes the imaging and scene parametersfor the selected scenes (Fig. 2). The lighting conditionsdid not correspond to the I3A specifications, because theimages were taken without a flash. Also, because of lightingconditions, the depth of field (DOF) of the images was quitelimited, especially in Cluster 2 (“Wine”). The imaging field-of-view in vertical direction was 27 deg.

Because the scene, imaging, and viewing parameters areknown, the camera separation can be computed accordingto Ref. 4 to limit the depth budget. In this study, the depthbudget was selected according to the 1 deg rule (Fig. 1).

3.2 Viewing ConditionsThe display was an autostereoscopic Tridelity SL2400 with aresolution of 1920 × 1200 pixels. Image height was 33 cmand the viewing distance was 80 cm. This corresponds tovertical field-of-view of 23.3 deg. The participants used achin support to avoid artifacts from head movements. Thetests were done in normal office lighting (∼100 lx,∼3000 K). As it is probable that stereoscopic images areviewed and shared with either a stereoscopic television ora digital photo frame, optimizing the images for this contextis important. Consequently, the images were viewed froma desktop display. The larger display reveals the problemsthat emerge from a too-wide disparity range. The length ofcamera separation in mobile hand-held devices should bedetermined to offer good viewing comfort, which also shouldbe guaranteed in desktop display sizes.

3.3 ParticipantsTwelve participants ranging in age from 22 to 34 (nine male,three female) attended the tests. The participants had normalor corrected-to-normal vision. Participants’ stereo acuitywas tested with the Netherlands Organization for AppliedScientific Research (TNO) stereo vision test. Their experi-ence with S3D images was gauged with the question“How many times you have seen S3D images?” Responseoptions were: (1) never, (2) once or twice, (3) couple oftimes, (4) many times, and (5) often. The participants’ atti-tude towards S3D content was gauged with the question“What is your attitude towards stereoscopic media?” Thereply options were: (1) very negative, (2) negative, (3) neu-tral, (4) positive, and (5) very positive. After the experiment,participants were asked “How has this test affected your atti-tude towards stereoscopic images?” The reply options were:(1) towards very negative, (2) towards negative, (3) neutral,(4) towards positive, and (5) towards very positive.

3.4 Attribute SelectionThe attribute selection for the subjective test was based onthe hierarchical model of 3-D “Visual experience”.9 The“low” level attribute was “strength of depth sensation.” Itwas selected because there was a need to follow how theviewer experiences the changes in depth range. Strengthof depth sensation is expected to increase as a function ofcamera separation, thus it would work as a control attributeto follow the magnitude of perceived depth sensation.

The “naturalness of depth sensation” attribute wasselected to be consistent with the naturalness attribute inthe model. It is a higher level attribute than “Strength ofdepth sensation” as it measures the naturalness of the depthsensation, not just its amount. The highest level attributeselected for the subjective tests was “Viewing experience,”defined as the overall sensation of the S3D image relatedto the ease and comfort of the viewing experience.

3.5 Test ProcedureParticipants began with a training session using eightimages. The images of the scenes taken with 2 cm and10 cm camera separations were shown to the participantto help her/him adapt to the depth range. After the trainingsession, each image was shown to the participant two timesin random order. One 2-D image was added to Cluster 2as a “hidden reference.” The total number of uniqueimages was 21ð¼ 4contents×5camera separationsþ1hidden referenceÞ.

The participants evaluated the strength and naturalness ofdepth sensation and overall viewing experience with a scaleof 1 to 7, where 7 represented the highest score. The parti-cipants were asked to draw an ellipse on top of the image toindicate the area from which they evaluated the naturalnessof depth sensation. If the effect of the marked area was posi-tive, the area was marked with a green transparent ellipse andif the effect was negative, the area was marked with a redtransparent ellipse. This Recall Attention Map (RAM)approach is described in more detail in Ref. 21 and has beenuseful for giving more information on spatially dependentphenomena in S3D images.

The participants also had the option to identify which partof the image they perceived as natural and to explain why.

Kytö et al.: Effect of camera separation on the viewing experience of stereoscopic photographs

Journal of Electronic Imaging 011011-4 Jan–Mar 2012/Vol. 21(1)

Downloaded from SPIE Digital Library on 09 Aug 2012 to 130.233.121.10. Terms of Use: http://spiedl.org/terms

Page 6: Effect of camera separation on the viewing experience of ...jussihakala.fi/wp-content/papercite-data/pdf/kyto2012a.pdf · Kytö et al.: Effect of camera separation on the viewing

The user interface was shown below the autostereoscopicdisplay and the participants gave their opinion scores andqualitative evaluations using a computer mouse and key-board. The tests lasted for 25 to 70 min per participant,

with the mean being 46.5 min. There were no limitationsfor viewing time and participants were free to make changesto previous evaluations.

4 Results

4.1 Mean Opinion ScoresThe mean opinion scores (MOS) of the experiments areshown in Fig. 4. Camera separation had a statistically signif-icant (tested with two-way ANOVA) effect on the strength ofdepth sensation [Fð5; 3Þ ¼ 22.5, p < 0.001], naturalness ofdepth sensation [Fð5; 3Þ ¼ 7.62, p < 0.001], and viewingexperience [Fð5; 3Þ ¼ 4.98, p < 0.001]. The strength ofdepth sensation increased according to camera separationas expected. Tukey’s post hoc test revealed that the strengthof depth sensation differed statistically (p < 0.05) when thedifference in camera separation was 4 cm or more. However,the perception of depth magnitude was probably diminishedslightly. The images were evaluated with the single-stimulusmethod, thus the viewer could not compare the amount ofdepth sensation between stimuli in a parallel manner. Theresult is in line with another study14 where the participantswere able to sense the increased depth, but the differences incamera separations were higher. It has been shown thathumans adapt to different depth scales quite easily if a refer-ence depth scale is available.22

The naturalness of depth sensation and viewing experi-ence decreased at camera separations above 6 cm. Tukey’s

Fig. 4 TheMOS values as a function of camera separation. Error barsare for 95% confidence level.

Fig. 5 The MOS-values as a function of camera separation and disparity ranges. The disparity range (scale on right y -axis) was calculated accord-ing to Ref. 7 and shown as gray area. TheMOS-values are presented without error bars for clarity; the standard deviations are close to one for everyattribute.

Kytö et al.: Effect of camera separation on the viewing experience of stereoscopic photographs

Journal of Electronic Imaging 011011-5 Jan–Mar 2012/Vol. 21(1)

Downloaded from SPIE Digital Library on 09 Aug 2012 to 130.233.121.10. Terms of Use: http://spiedl.org/terms

Page 7: Effect of camera separation on the viewing experience of ...jussihakala.fi/wp-content/papercite-data/pdf/kyto2012a.pdf · Kytö et al.: Effect of camera separation on the viewing

post hoc test revealed that naturalness of depth sensation andviewing experience with camera separation of 10 cm differedstatistically (p < 0.05) from other values. The same trend,that the naturalness of depth decreases as a function of cam-era separation, has been found in other studies as well.15,23

Overall, there were strong correlations between the attri-butes. There was a correlation between viewing experienceand naturalness of depth sensation (r ¼ 0.87), which alsohas been found in other studies.21 Viewing experience andnaturalness of depth sensations had negative correlation withthe strength of depth sensation (r ¼ −0.60 and r ¼ −0.68,respectively). All of the correlations were statistically signif-icant (p < 0.01).

4.2 The Evaluations Between and Within ScenesThe results within scenes are shown in Fig. 5. The resultsshowed that there were no clear differences between thescenes. The two-way ANOVA showed that there were nostatistically significant differences between the scenes in thestrength of depth sensation [Fð4; 3Þ ¼ 1.702, p ¼ 0.166],the naturalness of depth sensation [Fð4; 3Þ ¼ 1.57,p ¼ 0.196], or the viewing experience [Fð4; 3Þ, p ¼ 0.97].The interaction effect of camera separation and scene alsowas not statistically significant in strength of depth sensation[Fð4; 3Þ ¼ 0.774, p ¼ 0.677], naturalness of depth sensation[Fð4; 3Þ ¼ 1.12, p ¼ 0.339], or viewing experience[Fð4; 3Þ ¼ 0.755, p ¼ 0.697].

Interestingly, the strength of depth sensation seemed to bestrong even though the S3D image was shown only behindthe screen and the disparity range was short. This phenom-enon was seen by comparing Cluster 1 and Composition. Thestrength of depth sensation in Composition (only positive dis-parity from 0.2 deg to 1 deg) was evaluated as equally highwith Cluster 1, even though its disparity range was shorterthan in Cluster 1 (both positive and negative disparities).

In Cluster 2 the hidden reference, the 2-D image, wasdetected and perceived with the lightest depth sensation.This result indicates that the depth scale was used in thesame way for the contents even though the disparity rangesof the contents are distinct.

However, within scenes, there were statistically signifi-cant differences according to camera separation. Two-wayANOVA was done within every cluster and Tukey’s posthoc test was used to analyze which values differ statistically(shown in Table 2).

In the scene with narrow depth variation in scene space ofCluster 1, the naturalness of depth sensation decreases withan increase in the length of disparity range. In Cluster 3, thereis more depth variation in the scene space and the highestnaturalness was achieved with a longer camera separationthan in Cluster 1.

The viewing experience also changes according to cameraseparation, but the impact of camera separation on viewingexperience is smaller than with other attributes. For example,in Cluster 3 the viewing experience is quite constant. Theviewing experience behaves the same way as the naturalnessof depth sensation. The limits for comfortable viewing limitsare exceeded most in Cluster 1 and the near disparity isalmost 3 deg with the longest camera separation. This effectalso can be seen from RAMs (Fig. 6).

4.3 Comparison of Computational CameraSeparations with Subjective Evaluations

The computational camera separations in Cluster 1 (4.7 cm)and Cluster 3 (10 cm) were longer than their subjectivelypreferred (highest viewing experience) values (2 cm,6 cm), but there are no statistically significant differencesin viewing experience between the computational valuesand subjectively preferred values in those contents. Compu-tational camera separation (5.4 cm) was quite close to

Table 2 The statistical significance (tested with two-way Anova) of the data within every scene. The numbers in parentheses show whichMOS at different camera separations differed or did not differ statistically from each other based on Tukey. The statistically significantdifferences should be bolded in the Table 2. This is true when p < 0.05.

Cluster 1 Cluster 2 Cluster 3 Composition

Strength (2 to 10 cm) p < 0.01 (2 to 10 cm) p ¼ 0.11 (2 to 10 cm) p < 0.05 (2 to all) p < 0.01

Naturalness (2 to 10 cm) p < 0.01 (2 to 10 cm) p ¼ 0.07 (6 to 10 cm) p < 0.05 (2 to 10 cm) p ¼ 0.11

Experience (2 to 10 cm) p < 0.01 (6 to 10 cm) p ¼ 0.14 (2 to 10 cm) p ¼ 0.97 (2 to 8 cm) p ¼ 0.25

Fig. 6 The RAMs according to camera separation are (a) 2 cm, (b) 6 cm, and (c) 10 cm. The RAMs clearly showed that with longer cameraseparations, the negative evaluations increase in the front part of the image where disparity has exceeded the 1 deg disparity limit. Greenareas indicate positive evaluation, red areas indicate negative evaluation, and in yellow areas the evaluations are mixed. The RAMs are combinedfrom all of the participants' evaluations and they can be seen properly in colors.

Kytö et al.: Effect of camera separation on the viewing experience of stereoscopic photographs

Journal of Electronic Imaging 011011-6 Jan–Mar 2012/Vol. 21(1)

Downloaded from SPIE Digital Library on 09 Aug 2012 to 130.233.121.10. Terms of Use: http://spiedl.org/terms

Page 8: Effect of camera separation on the viewing experience of ...jussihakala.fi/wp-content/papercite-data/pdf/kyto2012a.pdf · Kytö et al.: Effect of camera separation on the viewing

subjectively preferred value (highest viewing experience)in Cluster 2 (6 cm). In Cluster 3, the naturalness of depthsensation at the computational camera separation (10 cm)is statistically significantly worse than at the subjectivelypreferred camera separation (6 cm). In the Composition con-tent, the computational camera separation (23 cm) is muchlonger than the subjectively preferred camera separation(2 cm). In Composition content, the trend for viewing experi-ence is downward within the used range (2 to 10 cm) thus itcan be expected that viewing experience at computationalcamera separation would be significantly decreased.

However, the convergence distances were not adjustedaccording to computations from Ref. 4, which have affectedthe location of the disparity range. In Cluster 1, the conver-gence distance is farther than the computational value and inCluster 3 and Composition, the convergence is nearer thanthe computational value.

4.4 The Effect of the Roundness FactorThe effect of roundness factor on the naturalness of depthsensation is computed with the method shown in Fig. 7.The selected roundness factors are computed by takinginto account the disparity maps (Fig. 3, bottom row) andRAMs of the scenes (see example in Fig. 6). The depthsfor selected roundness factors are computed with weightedaverage of selected regions’ depths. Finally, these selectedroundness factors are compared to the naturalness ofdepth sensation.

Figure 8 shows the naturalness of depth sensation withdifferent camera separations as a function of selected round-ness factors. In this case, the longer the camera separation,the closer the roundness factor is to the value of 1.

It is worth observing that there was no positive correlationbetween naturalness of depth and roundness factor with theseimaging and viewing parameters. Contrary to expectations,the correlation is clearly negative (r ¼ −0.67, p < 0.01).This result indicates that the roundness factor effect on nat-uralness of depth sensation is not the critical factor with theseviewing conditions. In Composition, the depth was perceivednaturally even with low roundness factors (0.1 to 0.3). Whenthe roundness factor was between 0.3 and 1.1, the natural-ness of depth sensation decreased as a function of roundnessfactor in every content.

4.5 Effect of Participants’ ExperienceSmall depth magnitudes were preferred in the experiment.However, the participants’ experience had an effect onaverage evaluations (Fig. 9). Naturalness has been shownto be a very important attribute for S3D images9 and itseems that the more experience with stereoscopic imagesa participant has, the more natural the depth sensationwill be perceived. There is a linear correlation betweenexperience with stereoscopic images and the naturalnessof depth impression (r ¼ 0.58, p ¼ 0.05) as well as viewingexperience (r ¼ 0.57, p ¼ 0.05).

The strength of depth sensation also increased accordingto experience with stereoscopic images, but the effect wasnot as strong as with the other attributes (r ¼ 0.45,p ¼ 0.14). The number of participants was low in somegroups (shown as bars in Fig. 9) thus the results can beconsidered indicative.

4.6 Comments From the ParticipantsThe participants had the option to comment on which partsof the images they perceived as natural or unnatural. InCluster 1, the most comments were about the face, thehands, and the glasses. Mostly positive comments weremade about the naturalness of the person and negative com-ments focused on the foreground glass and the table at longercamera separations. In Cluster 2, most comments consideredthe limited depth-of-field, which was perceived as distract-ing. The out-of-focus background was commonly described

Fig. 7 The method to compare the naturalness of depth sensationand selected roundness factor.

Fig. 8 The naturalness of depth sensation according to roundnessfactor. The smaller the roundness factor, the more flattened theS3D image appears.

Fig. 9 The effect of experience with stereoscopic images on subjec-tive evaluations. The lines represent MOS-scores (scale on the left)and bars represent number of participants (scale on the right).

Kytö et al.: Effect of camera separation on the viewing experience of stereoscopic photographs

Journal of Electronic Imaging 011011-7 Jan–Mar 2012/Vol. 21(1)

Downloaded from SPIE Digital Library on 09 Aug 2012 to 130.233.121.10. Terms of Use: http://spiedl.org/terms

Page 9: Effect of camera separation on the viewing experience of ...jussihakala.fi/wp-content/papercite-data/pdf/kyto2012a.pdf · Kytö et al.: Effect of camera separation on the viewing

as annoying. The positive comments were mostly about thenatural appearance of the person and the wine glass. InCluster 3 with lower camera separations (2 to 4 cm), thenegative comments focused on the cardboard effect on theperson at the right-hand side. The person on the right isimaged straight from side , which seem to have increasedthe amount of cardboard effect. The other persons werenot reported to be flat, even though the roundness factoron the left-hand person is the same. The positive commentsin Cluster 3 were mostly about natural proportions betweenobjects. In the Composition, the roses were described oftenas appearing very natural. The added geometric objects werenot reported to be flat, even though the roundness factor isbelow 0.6 with every camera separation (see Fig. 9). Theself-made test target revealed crosstalk as it includededges with high contrast.

5 DiscussionThe results are promising for stereoscopic content produc-tion in mobile conditions. The participants were able toperceive the change of camera separation, and the positiveviewing experience and naturalness of depth were gainedeven at short camera separations (2 to 6 cm). With these cam-era separations, visual discomfort from mismatch betweenaccommodation and convergence is unlikely to be a problem.It seems that in order to be perceived as natural, the depthsensation must originate from the scene depth variation itself,not from long camera separation. However, the crosstalkmight have affected the results. Crosstalk is reported to dis-tort the image more at higher camera separations24 whichmight favor shorter camera separations. Measuring andreporting crosstalk with subjective evaluations would bean important task in the future. However, only three outof 12 participants reported crosstalk during and after theexperiment. The overall impression of the test was good,because it changed participants’ attitudes towards S3Dimages slightly to more positive (mean ¼ 3.6, sd ¼ 0.5).In the beginning of the test, the participants’ attitude towardsS3D content was positive (mean ¼ 3.8, sd ¼ 0.2).

Another important issue in the future would be to deter-mine the effect of on-line changes in camera separation tounderstand how participants adapt to a new depth scalesduring the test.22 In this study, the participants adapted todifferent contents, thus the depth magnitude was evaluatedwithin content, not between contents. If an absolute measure,for example a yardstick, were included in each content, theresult might have been different. In addition, the memoryeffect between stimuli might have had an effect on subjectiveresults. A 2-D image between stimuli should be included infuture studies to give participants a resting phase.

The results showed that the roundness factor does not pre-dict the naturalness of depth sensation. The roundness factorlimits for stereoscopic cinema production5 do not apply forsmaller display sizes. Yamanoue et al.6 found a relationshipbetween the cardboard effect and roundness factor whenparticipants were explicitly asked to evaluate the thicknessof a particular object in S3D image. However, the role ofthe roundness factor in the naturalness of depth sensationstill remains open. Further studies are needed to find howmuch roundness factor affects the evaluations of naturalnessof depth sensation and how the different display sizes andviewing distances affect the emergence of cardboard effect.

For example, in this study, desktop-sized displays wereused, but with smaller displays the cardboard effect is morelikely to occur because of the higher non-linearity betweenperceived and scene depth (Fig. 2).

In this study, the photospace defined for 2-D images wasused, even though the possible differences between 2-D andS3D photospaces still needs to be established. As the S3Ddigital cameras enable range metering, it would be possibleto investigate which subject-to-camera distances typicallyoccur. If the S3D photospace is different from the 2-D photo-space, different requirements may arise for camera separa-tion than found in this study. The S3D photospace alsocan be influenced by available stereo cameras. It is possiblethat consumers will start taking S3D images according tocamera separation. For example, this can indicate that thesubject-camera distances will become longer than in S2Dphotospace if the camera separations are over 6 cm.

6 ConclusionsThe aim of this study was to examine the effect of cameraseparation on subjective evaluations. The typical imagingscenes were imaged with camera separations from 2 to10 cm. The influences of geometric factors that dependon camera separation and on subjective evaluations wereexplored.

The results from the subjective tests show that the strengthof depth sensation increased as a function of cameraseparation, which was expected based on the geometry ofstereoscopic imaging. The participants were able to perceivethe change of depth scale even though the images wereshown in random order without a reference depth scale.The depth was perceived equally strongly between contentseven though the length and position of the disparity rangesvaried. The most natural depth perception and best viewingexperience was achieved with camera separations of 2 cm to6 cm. With these camera separations, the disparity range wasbelow 1 deg. However, the experience of the participants hadan effect on evaluations. The suggestive results showed thatthe more experience with stereoscopic images a participanthas, the more natural the depth sensation in S3D images isperceived.

The cardboard effect emerging from short camera separa-tions is unlikely to be a problem with the desktop-sized dis-play used in this study. The cardboard effect was estimatedwith roundness factor, which was computed using geometryof stereoscopic pipeline, disparity maps, and user selectionsfor important areas. Interestingly, the naturalness of depthsensation was evaluated as high even though the roundnessfactor, previously thought to influence naturalness evalua-tion, was low.

References

1. M. Lambooij et al., “Visual discomfort and visual fatigue of stereo-scopic displays: a review,” J. Imag. Sci. Technol. 53(3), 030201 (2009).

2. K. Masaoka et al., “Spatial distortion prediction system for stereoscopicimages,” J. Elect. Imag. 15(1), 013002 (2006).

3. Y. Liu, A. C. Bovik, and L. K. Cormack, “Disparity statistics in naturalscenes,” J. Vis. 8(11), (2008).

4. G. Jones et al., “Controlling perceived depth in stereoscopic images,”Proc. SPIE 4297, 42–53 (2001).

5. B. Mendiburu, 3D Movie Making: Stereoscopic Digital Cinema fromScript to Screen, Focal Press, Burlington, VT (2009).

6. H. Yamanoue, M. Okui, and F. Okano, “Geometrical analysis of pup-pet-theater and cardboard effects in stereoscopic HDTV images,” IEEETrans. Circuits Syst. Video Technol. 16(6), 744–752 (2006).

Kytö et al.: Effect of camera separation on the viewing experience of stereoscopic photographs

Journal of Electronic Imaging 011011-8 Jan–Mar 2012/Vol. 21(1)

Downloaded from SPIE Digital Library on 09 Aug 2012 to 130.233.121.10. Terms of Use: http://spiedl.org/terms

Page 10: Effect of camera separation on the viewing experience of ...jussihakala.fi/wp-content/papercite-data/pdf/kyto2012a.pdf · Kytö et al.: Effect of camera separation on the viewing

7. A. J. Woods, T. Dohery, and R. Koch, “Image distortions in stereo-scopic video systems,” Proc. SPIE 1915, 36–48 (1993).

8. P. Engeldrum, “Image quality modeling: where are we?,” in IS&T’sPICS Conf., IS&T, pp. 251–255 (1999).

9. P. J. H. Seuntiens, “Visual experience of 3D TV,” Ph.D. Thesis,Technische Universiteit Eindhoven (2006).

10. H. de Ridder and S. Endrikhovski, “33.1: Invited paper: image quality isFUN: reflections on fidelity, usefulness and naturalness,” in SID Sym-posium Digest of Technical Papers, SID, 33(1), pp. 986–989 (2002).

11. J. Häkkinen et al., “Measuring stereoscopic image quality experiencewith interpretation based quality methodology,” Proc. SPIE 6808,68081B (2008).

12. W. A. Ijsselsteijn, “Perceptual factors in stereoscopic displays: theeffect of stereoscopic filming parameters on perceived quality andreported eyestrain,” Proc. SPIE 3299, 282–291 (1998).

13. P. Seuntiens, L. Meesters, and W. Ijsselsteijn, “Perceptual evaluation ofJPEG coded stereoscopic images,” Proc. SPIE 5006, 215–226 (2003).

14. P. Seuntiens, L. Meesters, and W. Ijsselsteijn, “Perceived quality ofcompressed stereoscopic images: effects of symmetric and asymmetricJPEG coding and camera separation,” ACM Trans. Appl. Percept. 3(2),95–109 (2006).

15. W. A. IJsselsteijn, H. De Ridder, and J. Vliegen, “Subjective evaluationof stereoscopic images: effects of camera parameters and display dura-tion,” IEEE Trans. Circuits Syst. Video Technol. 10(2), 225–233(2002).

16. L. Pockett and M. Salmimaa, “Methods for improving the quality ofuser created stereoscopic content,” Proc. SPIE 6803, 680306 (2008).

17. International Imaging Industry Association, “CPIQ Initiative WhitePaper,” 2007.

18. L. Goldmann, F. De Simone, and T. Ebrahimi, “Impact of acquisitiondistortions on the quality of stereoscopic images,” in Internatl.Workshop on Video Processing and Quality Metrics for ConsumerElectronics (VPQM) (2010) .

19. M. Kytö, M. Nuutinen, and P. Oittinen, “Method for measuring stereocamera depth accuracy based on stereoscopic vision,” Proc. SPIE 7864,78640I (2011).

20. S. Lankton, “3D Stereo Disparity - File Exchange - MATLAB Central,”29-May-2011. [Online]. Available: http://www.mathworks.com/matlabcentral/fileexchange/19406-3d-stereo-disparity. [Accessed: 29-Apr-2011].

21. J. Hakala, M. Nuutinen, and P. Oittinen, “Interestingness of stereo-scopic images,” Proc. SPIE 7863, 78631S (2011).

22. P. Milgram and M. Krueger, “Adaptation effects in stereo due to on-linechanges in camera configuration,” Proc. SPIE 1669, 122–134 (1992).

23. J. Häkkinen et al., “Effect of image scaling on stereoscopic movieexperience,” Proc. SPIE 7863, 78630R (2011).

24. P. J. H. Seuntiëns, L. M. J. Meesters, and W. A. IJsselsteijn, “Perceptualattributes of crosstalk in 3D images,” Displays 26(4–5), 177–183(2005).

Mikko Kytö received his MSc from HelsinkiUniversity of Technology, Finland, in 2009.He is currently pursuing his PhD degree atAalto University School of Science inthe Department of Media Technology. Hisresearch interests include viewing experi-ence of S3D contents, possibilities of binocu-lar perception, and perceptual issues inaugmented reality.

Jussi Hakala received his MSc degree inmedia technology from Aalto University,School of Science, in 2010. He continuesto pursue his PhD degree at the Departmentof Media Technology at Aalto UniversitySchool of Science. His current research inter-ests include stereoscopic image quality,binocular perception, and the added valueof stereoscopy.

Pirkko Oittinen, Dr Sci (Eng) is a full profes-sor at Helsinki University of Technology in theDepartment of Media Technology, part ofAalto University School of Science. HerVisual Media research group has the missionof advancing visual technologies and raisingthe quality of visual information to createenhanced user experiences in differentusage contexts. The research approach isconstructive and exploratory; the activitieslook at cross-disciplinary boundaries. Current

research topics include still image, video and 3-D image quality, con-tent repurposing for mobile platforms, and media experience arisingfrom human-media interaction.

Jukka Häkkinen received his PhD in experi-mental psychology from the Department ofPsychology, University of Helsinki, Finland.He also has worked as a principal scientistat Nokia Research Center. Currently, he isan adjunct professor at the Department ofMedia Technology, Aalto University Schoolof Science. His research has handled basicprocesses of human stereoscopic vision,but recently he has focused on the ergo-nomics and experience aspects of emerging

display technologies, such as head-mounted and stereoscopic dis-plays. He is leading projects related to stereoscopic video compres-sion, mobile stereoscopic user interfaces, and repurposing ofstereoscopic materials. His main project is defining and measuringimage quality experience in stereoscopic photographs and movies.

Kytö et al.: Effect of camera separation on the viewing experience of stereoscopic photographs

Journal of Electronic Imaging 011011-9 Jan–Mar 2012/Vol. 21(1)

Downloaded from SPIE Digital Library on 09 Aug 2012 to 130.233.121.10. Terms of Use: http://spiedl.org/terms