IS&T / SPIE Electronic Imagingspie.org/documents/conferencesexhibitions/ei11-abstracts.pdfby novel view synthesis techniques. They usually involve three steps: computing the stereo

electronicimaging.org • TEL:+17036429090 • [email protected] 1

IS&T /

ReturntoContents

IS&T / SPIE

ElectronicImagingelectronicimaging.orgTechnical Summaries

Conferences/Courses:23-27January2011

HyattRegencyHotelSanFranciscoAirport,California,USA

Contents7863: Stereoscopic Displays and Applications XXII 2

7864A: 3D Imaging Metrology 19

7864B: 3D Image Processing (3DIP) and Applications II 24

7864C: The Engineering Reality of Virtual Reality 2011 27

7865: Human Vision and Electronic Imaging XVI 29

7866: Color Imaging XVI: Displaying, Processing, Hardcopy, and Applications 36

7867: Image Quality and System Performance VIII 48

7868: Visualization and Data Analysis 2011 56

7869: Computer Vision and Image Analysis of Art II 62

7870: Image Processing: Algorithms and Systems IX 68

7871: Real-Time Image and Video Processing 2011 77

7872: Parallel Processing for Imaging Applications 83

7873: Computational Imaging IX 90

7874: Document Recognition and Retrieval XVIII 97

7875: Sensors, Cameras, and Systems for Industrial, Scientific, and Consumer Applications XII 105

7876: Digital Photography VII 111

7877: Image Processing: Machine Vision Applications IV 117

7878: Intelligent Robots and Computer Vision XXVIII: Algorithms and Techniques 123

7879: Imaging and Printing in a Web 2 0 World II 131

7880: Media Watermarking, Security, and Forensics XIII 136

7881A: Multimedia on Mobile Devices 2011 141

7881B: Multimedia Content Access: Algorithms and Systems V 147

7882: Visual Information Processing and Communication II 150

2 electronicimaging.org • TEL:+17036429090 • [email protected]

IS&T /

ReturntoContents

Conference 7863: Stereoscopic Displays and Applications XXIIMonday-Thursday24-27January2011PartofProceedingsofSPIEVol.7863StereoscopicDisplaysandApplicationsXXII

7863-01, Session 1

Adapting stereoscopic movies to the viewing conditions using depth-preserving and artifact-free novel view synthesisF.Devernay,S.Duchêne,A.Ramos-Peon,INRIARhône-Alpes(France)

The3Dshapeperceivedfromviewingastereoscopicmoviedependsontheviewingconditions,mostnotablyonthescreensizeanddistance,anddepthandsizedistortionsappearbecauseofthedifferencesbetweentheshootingandviewinggeometries.Whentheshootinggeometryisconstrained,orwhenthesamestereoscopicmoviemustbedisplayedwithdifferentviewinggeometries(e.g.inamovietheaterandona3DTV),thesedepthdistortionsmaybereducedbynovelviewsynthesistechniques.Theyusuallyinvolvethreesteps:computingthestereodisparity,computingadisparity-dependent2Dmappingfromtheoriginalstereopairtothesynthesizedviews,andfinallycomposingthesynthesizedviews.

Inthispaper,wefocusonthesecondandthirdsteps.Weexaminethreedifferentmappingfunctions:baselinemodification,whichpreservesimagecontentbutdistortsdepthandcreatedivergence,viewpointmodification,whichpreservesdepthbutmodifiesheavilyimagecontent,andthenewly-introducedhybriddisparityremapping,whichpreservesbothdepthandimagecontent.

Forthefinalcompositionstep,weproposeanasymmetricviewsynthesismethod,whereartifactsaredetectedandblurredinthesynthesizedview,whiletheotherviewisthesameastheoriginal,thuspreservingtheoverallperceivedqualityofthestereoscopicmovie.

7863-02, Session 1

Visual fatigue monitoring system based on eye-movement and eye-blink detectionD.Kim,S.Choi,J.Choi,K.Sohn,YonseiUniv.(Korea,Republicof)

Inthispaper,weproposedavisualfatiguemonitoringsystembasedoneye-movementandeye-blinkdetection.Itanalyzestheeye-movementandnumberofblinksbasedontheassumptionthatsaccademovementoftheeyedecreasesandthenumberofeyeblinkincreaseswhenvisualfatigueofviewerisaccumulated.Theproposedsystemhasaninfraredsinglecameraandaninfraredlightsource.Then,thepupiloftheeyecanbedetectedbyapplyingbinarythresholdtoPurkinjeimage.Thethresholdisautomaticallyselectedbytwoconstraintswhicharetheangleofeclipsefittingandthesizeofthepupil.Finally,patternmatchingisperformedtoselecttheaccuratepositionofthepupilamongthecandidatesandthesystemestimatesthetotalamountofeyemovementandthenumberofeyeblinks.Theresultswereobtainedwhilewatchingstereoscopicvideosafterpersonalcalibrationprocedure.Accordingtosubjectiveevaluationanddescriptiveself-report,theresultsshowthatsaccademovementoftheeyedecreasesasthevisualfatigueoftheviewerisaccumulated.However,thenumberofeyeblinksshowslargevariancealongthetimeaxiswhichimpliesitisnotproperforvisualfatiguemonitoringsystem.

7863-03, Session 1

Factors impacting quality of experience in stereoscopic imagesL.Xing,J.You,NorwegianUniv.ofScienceandTechnology(Norway);T.Ebrahimi,EcolePolytechniqueFédéraledeLausanne(Switzerland);A.Perkis,NorwegianUniv.ofScienceandTechnology(Norway)

Thestereoscopic3DindustryhasfallenshortofachievingacceptableQualityofExperience(QoE)becauseofvarioustechnicallimitations,suchasexcessivedisparity,accommodation-convergencemismatch.ThisstudyinvestigatestheeffectonQoEofstereoscopicscenecontent,camerabaseline,screensizeandviewinglocationinaholisticapproach.Wefirstdesigned240typicaltestconfigurations,inwhichthedisparityconstructedfromtheshootingcondition(scenecontentandcamerabaseline)islocatedindifferentrangesofmaximaldisparitysupportedbyviewingenvironment(screensizeandviewinglocation)inordertocoverdifferentconditions.Second,extensivesubjectivetestswereconductedusingasinglestimulusmethodology,inwhich15samplesforeachviewinglocationwereobtained.Finally,astatisticalanalysiswasperformedandtheresultsrevealedthatcontent,baseline,aswellastheinteractionsbetweensize,contentandbaseline,haveasignificantimpactonQoEinstereoscopicimages,whileothercombinations,especiallyviewinglocationinvolved,havenosignificantimpact.TheresultedMeanOpinionScores(MOS)andstatisticalresultscanbefurtherusedtocompareanddesignnewstereoscopicqualitymetrics.

7863-04, Session 1

Visual discomfort induced by fast salient object motion in stereoscopic videoS.Lee,Y.J.Jung,H.Sohn,Y.M.Ro,H.W.Park,KoreaAdvancedInstituteofScienceandTechnology(Korea,Republicof)

Instereoscopicdisplays,instinctconflictsbetweenaccommodationandvergenceiswellknownasoneofmajorfactorsthatmayincurvisualdiscomfort.Inordertoavoidexcessiveaccommodation-vergenceconflicts,binoculardisparityof1degreehasbeensuggestedasaguidelineforcomfortableviewingzone.However,thisvalueisnotanabsolutethresholdsinceitoverlookseffectsofspace-andtime-varyingdisparities.Also,evenwithinzoneofcomfortableviewing,therearemanyfactorsthatmaybeabletoinducevisualdiscomfort,suchasindividualdifferencesinvisionabilityandcharacteristicsofstereoscopiccontents.Togenerateasafetyofstereoscopicvideocontents,itisveryessentialtoinvestigatefactorsandconditionsthatinducevisualdiscomfort.Inthispaper,wefocusourscopeontheinvestigationoflocalmotioncharacteristicsthatmayleadtovisualdiscomfortwithinthezoneofcomfortableviewing.Althoughseveralresearchershaveexaminedthevisualdiscomfortinducedbytheeffectofbothdisparityandmotion,itrequiresmorein-depthstudiestounderstandtheeffectsofmotionswithspace-andtime-varyingdisparities.Thecontributionofthispaperistoinvestigatetherelationbetweenvisualdiscomfortandlocalmotionsthatvarywithdifferentdirection,velocity,rotationalvelocity,andmovementartifactswithinzoneofcomfortableviewing.


IS&T /

ReturntoContents

7863-05, Session 1

3D video disparity adjustment for preference and prevention of discomfortH.Pan,C.Yuan,S.J.Daly,SharpLabs.ofAmerica,Inc.(UnitedStates)

Withthehugesuccessof3Dmoviesintheaters,3DTVsandother3Dproductsarepenetratingintothehomewitheverincreasingspeed.Oneofthekeyissuesassociatedwith3DTVsisthetradeoffbetweencomfortand3Dvisualimpact.Bigdisparityisoftenpreferredforstrongvisualimpactbutoftenleadtoviewerdiscomfortbasedondisplaysizeandviewingdistances.Thegoaloftheproposedalgorithmistoprovideviewersatoolforthemtoadjustdisparityaccordingtotheenvironment,contentsandtheirpreferenceinordertohavemorecomfortableandhigherquality3Dexperiences.

Morespecifically,givenaplanarstereoscopicdisplay,thealgorithmtakesinastereoscopicimagepairthatcausesviewingdiscomfort/fatigue,andoutputsamodifiedstereoscopicpairthatcauseslessornoviewingdiscomfort/fatigue.Thealgorithmfulfillsthefunctionsofdisparityestimation,occlusiondetection,disparityadjustmentandviewsynthesis.Anovelpixelweightingmechanisminregularized-block-matchingbaseddisparityestimationhelpsimprovetherobustness,accuracyandspeedofmatching.Occlusiondetectionusesmultiplecuesinadditiontomatchingerrorstoimprovetheaccuracy.Anaccommodation/vergencemismatchvisualmodelisusedindisparityadjustmenttopredictdiscomfort/fatiguefromthedisparityinformation,theviewingconditionsanddisplaycharacteristics.Theholefillinginviewsynthesisisinthedisparitymapofthenewviewinsteadofthenewviewitselftoreducetheblurriness.Thepreliminaryresultsarepromising.

7863-06, Session 2

Can the depth perception of stereoscopic images be influenced by 3D sound?A.Turner,N.S.Holliman,DurhamUniv.(UnitedKingdom)

Thecreationofbinocularimagesforstereoscopicdisplayhasbenefitedfromsignificantresearchandcommercialdevelopmentinrecentyears.However,perhapssurprisingly,theeffectofaddingauditorydepthinformationtostereoscopicimageshasrarelybeenstudied.Havingfoundfewsimilarstudiesintheliteratureweaddresstwopreliminaryquestions.Firstwhatisthesmallestdifferenceinauditorydepththatcanbereliablydetectedusingsoundalone?Secondisitpossiblethattheadditionofauditorydepthinformationcanenhancethevisualperceptionofdepthinastereoscopicimage?

7863-07, Session 2

Evaluating motion and binocular parallax as depth cues for autostereoscopic displaysM.Braun,FachochschulefürTechnikundWirtschaftBerlin(Germany);U.Leiner,D.Ruschin,Fraunhofer-InstitutfürNachrichtentechnikHeinrich-Hertz-Institut(Germany)

Theperceptionofspaceintherealworldisbasedonmultifaceteddepthcues,mostofthemmonocular,somebinocular.

Developing3D-displaysraisesthequestion,whichofthesedepthcuesarepredominantandshouldbesimulatedbycomputationalmeansinsuchapanel.Beyondthecuesbasedonimagecontent,suchasshadowsorpatterns,Stereopsisanddepthfrommotionparallaxarethemostsignificantmechanismssupportingobserverswithdepthinformation.Wesetupacarefullydesignedtestsituation,widelyexcludingundesiredotherdistancehints.Thereafterweconductedausertesttofindout,whichofthesetwodepthcuesismorerelevantandwhetheracombinationofbothwouldincreaseaccuracyinadepthestimationtask.Thetrialswereconductingutilizingourautostereoscopic“Free2C”-displays,whicharecapabletodetect

theusereyepositionandsteertheimagelobesdynamicallyintothatdirection.Atthesametime,eyepositionwasusedtoupdatethevirtualcamera’slocationandtherebyofferingmotionparallaxtotheobserver.Asfarasweknow,thiswasthefirsttimethatsuchatesthasbeenconductedusinganautosteresocopicdisplaywithoutanyassistivetechnologies.Ourresultsshowed,inaccordancewithpriorexperiments,thatbothcuesareeffective,howeverStereopsisisbyorderofmagnitudemorerelevant.Combiningbothcuesimprovedtheprecisionofdistanceestimationbyanother30-40%.

7863-09, Session 4

A multi-resolution multi-size windows disparity estimation approachJ.MartinezBauza,QualcommInc.(UnitedStates);M.P.Shiralkar,ClemsonUniv.(UnitedStates)

Thispaperdescribesanalgorithmforestimatingthedisparitybetween2imagesofastereopair.Thedisparityisrelatedtothedepthoftheobjectsinthescene.Beingabletoobtainthedepthoftheobjectsinthesceneisusefulinmanyapplicationssuchasvirtualreality,3Duserinterfaces,background-foregroundsegmentation,ordepth-image-basedsynthesis.Thislastapplicationhasmotivatedtheproposedalgorithmaspartofasystemthatestimatesdisparitiesfromastereopairandsynthesizesnewviews.Synthesizingvirtualviewsenablesthepost-processingof3Dcontenttoadapttouserpreferencesorviewingconditions,aswellasenablingtheinterfacewithmulti-viewauto-stereoscopicdisplays.

Theproposedalgorithmhasbeendesignedtofulfillthefollowingconstraints:(a)lowmemoryrequirements,(b)localandparallelizableprocessing,and(c)adaptabilitytoasuddenreductioninprocessingresources.Oursolutionusesamulti-resolutionmulti-size-windowsapproach,implementedasaline-independentprocess,well-suitedforGPUimplementation.Themulti-resolutionapproachprovidesadaptabilitytosuddenreductioninprocessingcapabilities,besidescomputationaladvantages;thewindows-basedimageprocessingalgorithmguaranteeslow-memoryrequirementsandlocalprocessing.

7863-10, Session 4

Warping error analysis and reduction for depth image based rendering in 3DTVL.Do,S.Zinger,TechnischeUniv.Eindhoven(Netherlands);P.H.N.deWith,CycloMediaTechnologyB.V.(Netherlands)

Interactivefree-viewpointselectionappliedtoa3Dmultiviewvideosignalisanattractivefeatureoftherapidlydeveloping3DTVmedia.Inrecentyears,significantresearchhasbeendoneonfree-viewpointrenderingalgorithmswhichmostlyhavesimilarbuildingblocks.In[1],wehaveanalyzedtheprincipalbuildingblocksofmostrecentrenderingalgorithmsandtheircontributiontotheoverallrenderingquality.Wehavediscoveredthatthefirststep,Warpingdeterminesthebasicqualitylevelofthecompleterenderingchain.Inthispaper,wehaveanalyzedthewarpingstepinmoredetailsinceitleadstowaysforimprovement.Wehaveobservedthatwarpingerrorsconsistofmainlythreetypesoferrorswhichareroundingerrorswhenperformingpixel-basedwarping,quantizationerrorsofdepthmapsandroundingerrorsatthevirtualimage.Foreacherrorfactor,wehaveproposedatechniquethatcanreducetheerrorsandthusincreasethewarpingquality.Thenewtechniquesareevaluatedwithtwoseriesofexperimentsusingreal-lifeandsyntheticdata.Fromthefirstexplorationexperiments,weobservethattheproposedtechniquesreducethewarpingerrorsandhelptoincreasetheoverallrenderingquality.

Conference 7863: Stereoscopic Displays and Applications XXII


IS&T /

ReturntoContents

7863-11, Session 4

Novel view synthesis for dynamic scene using moving multi-camera arrayT.Yokoi,NagoyaUniv.(Japan);N.Fukushima,NagoyaInstituteofTechnology(Japan);T.Yendo,M.PanahpourTehrani,NagoyaUniv.(Japan);T.Fujii,TokyoInstituteofTechnology(Japan);M.Tanimoto,NagoyaUniv.(Japan)

WearedevelopingtechnologiesforFree-viewpointTV(FTV)inwhichtheviewercanfreelychangetheviewpoint.FTVallowsustochangetheviewpointfreelyina3Dworld.Generally,givenmulti-viewimages,itisnecessarytorepresentthe3Dspacetogenerateanarbitraryviewpointimage.

Rayspacemethodthatbelongstoimagebasedrendering,canbeusedforrepresentationof3-Dspaceandfree-viewpointimagegeneration.Therayspacemethoddoesnotneedany3-Dinformationofthesceneforfree-viewpointimagegeneration.Adenseray-spaceisrequiredtobeabletofreelychangetheviewpoint.Inordertomakeadenseray-spaceweneedtosynthesizeviewsbetweenactualimagesintheray-space.Wefocusonfree-viewpointimageusingmulti-viewimages,usingdepthmaps.

Thefree-viewpointimagecanbegeneratedbyusingimagescapturedbyastaticmulti-camerasystem.However,itishardtorenderanobjectthatmoveswidelyinthescene.Ifanobjectmoveswidely,thenumberofthecamerashouldbeincreased.Thus,thesetupandcomputationalcostsofsuchasystemareincrease.Alternatively,wecanreducethenumberofcameratoreducethecostsbyincreasingthecamerainterval.However,inthiscase,onlysparseinformationcanbeacquired,sothatthefree-viewpointimagequalitysignificantlydecreases,andcannotbeusedforphoto-realisticapplications.

Inthispaper,weaddressthisproblembyproposingmovingcameraarray.Furthermore,formovingcameraarray,weproposeanovelmethodforsynthesizingfree-viewpointimagesusingbothspatialandtemporalinformation.

Byusingmovingcameraarray,notonlywecanfollowanobjectofinterest,butalsowecanobtaindenserayinformationofstaticareaofthescene.Tomakethebestuseofthedenseacquisitionoftherayinthestaticareas,thefree-viewpointimagecanbegeneratednotonlybytheimagesinspatialdomain,butalsotheimagesintemporaldomain.Therefore,intheproposedsynthesismethod,weusefourreferenceimages,whicharetwoviewsinspatial,anothertwoviewsintemporaldirection.Viewsintemporaldirectionbelongtodifferenttimes,beforeandafter,anddifferentviews.

Inthisprocesswearetryingtouseclosertwoviewstothelocationofvirtualviewpointfromdifferentframes.

Experimentsusingasequencecapturedbysimulatedmovingmulti-camerasystemsdemonstrateobjectiveandsubjectiveimprovementofviewsynthesisqualityincomparisonwithconventionalviewsynthesisscheme.

7863-12, Session 4

Depth-based representations: which coding format for 3D video broadcast applications?P.Kerbiriou,G.Boisson,K.Sidibe,TechnicolorR&DFrance(France);Q.Huynh-Thu,TechnicolorS.A.(France)

3DVideo(3DV)deliverystandardizationiscurrentlyongoinginMPEG.Nowtimeistochoosetherepresentationformat.Ourconcernisthefinalqualityforend-users,i.e.synthesizedviewsvisualquality.Wefocusontwomajorrivaldepth-basedformats,namelyMultiviewVideoplusDepth(MVD)andLayeredDepthVideo(LDV).MVDcanbeconsideredasthebasicdepth-basedformat,generatedbydisparityestimationfrommulti-viewsequences.LDVismoresophisticated,withthecompactionofmultiviewdataintocolour-anddepth-occlusionslayers.WecomparefinalviewsqualityusingMVD2andLDV(bothcontainingtwocolorchannelsplustwodepthcomponents)codedwithMVCatvariouscompressionratios.Dependingontheformat,theappropriatesynthesisisperformedtogeneratefinalstereocopic

pairs.ComparisonsareprovidedintermsofSSIMandPSNRwithrespecttooriginalviewsandtosynthesizedreferences(obtainedwithoutcompression).Eventually,LDVoutperformssignificantlyMVDwhenusingstate-of-the-artreferencesynthesisalgorithms.Detectingocclusionsbeforeencodingrevealsbeneficialincomparisonwithhandlingrendundantinformationsatdecoderside.Besides,weobservethatdepthquantizationdoesnotinducemuchlossonthefinalviewqualityuntilasignificantdegradationlevel.Improvementsindisparityestimationandviewsynthesisarethereforestillexpectedfortheremainingstandardizationsteps.

7863-13, Session 5

Multiview image compression based on a new basis representationT.Yamada,T.Fujii,TokyoInstituteofTechnology(Japan)

Inordertocompressorinterpolatethemultiviewimageefficiently,weproposeanewbasisrepresentationbyusingdirectionalsampling.Whilethemultiviewimagedatahasthestaticcorrelationrelatedtothecameraposition,directionalsamplingcaneliminatethecorrelationefficiently.Weapplydirectionalthreedimensionaldiscretecosinetransform(directional3D-DCT)anddirectionalthreedimensionaldiscretewavelettransform(directional3D-DWT)tothreedimensionaldatathathasvertical,horizontalandviewdirection.Intheexperimantalresults,theproposedmethodshowedbetterqualitythanpreviousmethod,notonlyintheobjectiveevaluation,butalsointhesubjectiveevaluation.

7863-14, Session 5

Design of tuneable anti-aliasing filters for multiview displaysA.R.Boev,R.Bregovic,A.P.Gotchev,TampereUniv.ofTechnology(Finland)

Inthispaper,weaddresstheproblemofanti-aliasingfilteringofimagestobedisplayedonauto-stereoscopicdisplays.Multiviewdisplaysemployanopticallayer,whichdistributesthelightofanunderlyingTFT-LCDpanelindifferentdirections.Certainpropertiesofthelayercreatespecificartifacts,suchasghostimages,moirépatternsandmasking.Theseartefactsareespeciallyvisibleandannoyingwhen2Dimagery,suchasgraphicsandtext,istobedisplayedonauto-stereoscopicdisplays.Wemodelthelayerasanimageprocessingchannelandidentifydisplayparametersthatareimportantforthedesignsofanartifactmitigationfilter.Themodelexplainscommon3Ddisplayartifacts,suchasmoirépatternscausedbyaliasing,andghostimagescausedbycrosstalk.

Itturnsoutthatknowledgeoftheinterleavingpatternandtheangularvisibilityprofileofeachsub-pixelisnotsufficientforpredictingthevisualoutput.Duetotheopticallayer,thevisiblepartsofsub-pixelshaveanon-rectangularshapeandthegapsbetweenthemaredirectionallyoriented.Theslantofthelayercreatesapattern,whichinterfereswithtextureswithsimilarorientationinthevisualizedimageandcreatesmaskingartifactssimilartothe‘imaging’artefactscausedbyupsamplingintheabsenceofapost-filter.

Usually,imagingistackledbyananti-imagingpost-filter.Astheimagingiscreatedbythephysicalstructureofthedisplay,itisimpossibletoimposeapost-filter.However,theeffectcanbepartiallymitigatedbyapre-filter.Inordertodeterminethepropertiesoftherequired2Dfilter,andconsequentlytohavethebestpossiblerepresentationofimagesonthedisplay(minimizingaliasing,imagingandghosting),itisnecessarytodeterminetheperformanceofthedisplayinthefrequencydomain;thatis,wehavetoknowwhichfrequencycomponentsintheimagewecankeep(onesthatwillbeproperlyrepresentedonthescreen),andwhichoneswehavetoattenuate(remove)aspotentialcausesofdistortions.Applyingsuch2Dfilterwillremovemoiréartifactsandmakemaskingartifactslessvisible.Weshowthatthepassbandofthisfilterisoptimal,andnofurtherbandlimitationisnecessary.



IS&T /

ReturntoContents

However,prolongedobservationofamultiviewdisplaymightmakethemaskingartifactslessnoticeable,asthehumanvisualsystemgraduallyadaptstotheconstantpatternimposedbytheopticallayer.Someobserversfindthe‘optimal’2Dfiltertocreateover-smoothedimage,andprefersharperimageattheexpenseofsomevisiblemaskingartifacts.Weproposeatunablefilter,withapassbandthatcanbegraduallymorphedbetween‘optimal’and‘all-pass’shape.Thiscanbepresentedtotheuserasasingle“sharpness”control.

Inthispaper,wedescribe:1)measurementmethodologyforderivingthefrequencyperformanceofanarbitrarymultiviewdisplay;2)methodologyfordesigninganartifactmitigatingfilterforthatdisplay;3)algorithmforrecalculatingthefilterthatallowsgradualchange(morphing)ofthepassbandcontrolledbyasingleparameter.

7863-15, Session 5

Multiview image compression based on LDV schemeB.Battin,C.Niquin,P.Vautrot,Univ.deReimsChampagne-Ardenne(France);D.G.Debons,3DTVSolutions(France);L.Lucas,Univ.deReimsChampagne-Ardenne(France)

Inrecentyears,wehaveseenseveraldifferentapproachesemergetodealwithmultiviewcompression.First,wecanfindtheH264/MVCextensionwhichgeneratesquiteheavybitstreamswhenusedonn-viewsautostereoscopicmediasanddoesnotallowinter-viewreconstruction.AnothersolutionreliesintheMVD(MultiView+Depth)schemewhichkeepspviews(n>p>1)andtheirassociateddepth-maps.Thismethodisnotsuitableformultiviewcompressionsinceitdoesnotexploittheredundancybetweenthepviews,moreoverocclusionareascannotbeaccuratelyfilled.Inthispaper,wepresentourmethodbasedontheLDV(LayeredDepthVideo)approachwhichkeepsonereferenceviewwithitsassociateddepth-mapandthen−1residualonesrequiredtofilloccludedareas.Wefirstperformaglobalper-pixelmatchingstep(providingagoodconsistencybetweeneachview)inordertogenerateoneunified-colorRGBtexture(whereauniquecolorisdevotedtoallpixelscorrespondingtothesame3D-point,thusavoidingilluminationartifacts)andasignedintegerdisparitytexture.

Next,weextractthenon-redundantinformationintotwotextures(aunified-coloroneandadisparityone)containingthereferenceandthen−1residualviews.TheRGBtextureiscompressedwithaconventionalDCTorDWT-basedalgorithmandthedisparitytexturewithalosslessdictionaryalgorithm.Then,wewilldiscussaboutthesignaldeformationsgeneratedbyourapproach.

7863-16, Session 5

Upsampling range camera depth maps using high-resolution vision camera and pixel-level confidence classificationC.Tian,V.A.Vaishampayan,AT&TLabs.Research(UnitedStates);Y.Zhang,TexasA&MUniv.(UnitedStates)

Weconsidertheproblemofupsamplingalow-resolutiondepthmapgeneratedbyarangecamera,byusinginformationfromoneormoreadditionalhigh-resolutionvisioncameras.Thegoalistoprovideanaccuratehighresolutiondepthmapfromtheviewpointofoneofthevisioncameras.Weproposeanalgorithmthatfirstwarpsandconvertsthelowresolutiondepthmapintoadepth/disparitymapinthecoordinateframeofandatthesameresolutionasonevisioncamera,thenclassifiesthepixelsintoregionsaccordingtowhethertherangecameradepthmapisreliable,andfinallyperformsagraphcutoptimizationontheunreliableregions.Toreducethemisalignmentcausedbyusingonlyasinglehomographicwarping,weemployadepth-dependenthomographicmappingwhichhasseveralcandidates,whichresultsinmoreaccuratealignmentbetweenthecameraviews.Experimentalresultsshowthattheproposedmethodisabletoupsamplethedepthmapbyafactorof10-by-10withveryaccuratedepthdetails.Theimprovementsarevisuallyperceptibleona3Dauto-stereoscopicdisplay.

7863-17, Session 6

Attack of the s mutans! A stereoscopic-3D multi-player direct-manipulation behavior-modification serious game for improving oral health in pre-teensA.Hollander,FirsthandTechnologyInc.(UnitedStates)

AttackoftheS.Mutans!Isamulti-playergamedesignedtoharnesstheimmersionandappealpossiblewithstereoscopic3Dtocombatthetoothdecayepidemic.Toothdecayisoneoftheleadingcausesofschoolabsences.Earlyproblemswithteethcanhaveprogressive,systemichealthrepercussions.Thetraditionalmethodsofeducatingthepublicabouthowtocarefortheirteetharefailing.In2008theauthorsreceivedagrantfromtheNationalInstitutesofHealthtobuilda2000sqftmuseumexhibitthatincludedasuiteofseriousgamesinvolvingthebehaviorsandbacteriathatcausecavities.OneoftheseisanadventurewherefivesimultaneousplayersusemodifiedWiicontrollerstobattlebiofilmsandbacteriawhileimmersedinenvironmentsgeneratedwithina12-footstereoscopicWUXGAdisplay.TheauthorsdescribethesystemandinterfaceusedinthisprototypeapplicationandsomeofthewaystheyattemptedtousethepowerofimmersionandtheappealofS3Drevolutiontochangebehaviorsandlives.

7863-18, Session 6

Stereoscopic multi-perspective capture and display in the performing artV.Kuchelmeister,TheUniv.ofNewSouthWales(Australia)

Multi-perspectiveimagecaptureconstitutesanovelmethodofdocumentingtheentirenessofanevent,byrecordingfrommultiplepointsofview.Incombinationwithstereoscopicimaging,itcapturesthetwomodalitiesofthree-dimensionalrepresentation,perspectiveanddepth.Thesemodalitiesarethenappliedfordisplayinahexagonalstereoscopicmulti-screenplatform.Inthispapertheauthordescribes,informofacasestudy,theimplementationoftwoinstallationprojectswithintheperformingartcontext.

7863-19, Session 6

Machine vision and vitrectomy: three-dimensional high definition video for surgical visualization in vitreoretinal surgeryC.D.Riemann,CincinnatiEyeInstituteandMedNetTechnologies,Inc.(UnitedStates)

Machinevisionprovidedby3dimensionalhighdefinitionvideosystemswassuccessfullyusedforsurgicalvisualizationin8vitreoretinalsurgeries.Clinicalresultswereexcellentandsurgeonintraoperativecomfortwassuperb.Withcontinuedadvancements,thisnewtechnologymayevolvetoprofoundlychangesurgicalvisualizationforvitreoretinalsurgeryandophthalmicsurgeryasawhole.

7863-20, Session 7

High image quality 3D displays with polarizer glasses based on active retarder technologyS.Jung,Y.Lee,H.Park,J.Park,D.Lee,W.Jeong,J.Kim,I.Chung,LGDisplay(Korea,Republicof)

Inthispaper,wedescribedthebasicconceptsofactiveretarder3Ddisplaywhichcangivefullresolutionandhighluminancein3Dimageswithsimplepolarizerglasses.Byoptimizingthe3Dperformanceof



IS&T /

ReturntoContents

AR3Dwithrespecttovariousdesignparameters,themajorproblemsofAR3Dsuchasthehigh3Dcrosstalkandthe3Dimagedifferencewereclearlysolved.Fromtheexperimentalresults,B/W3Dcrosstalkwasobtainedas1.0%inbothleftandrighteyeswhicharesimilartothatoftheconventionalPR3D.Theluminanceisobtainedto75nitthroughpolarizerglassesundertheconditionofthesurfaceluminancetobe300nitwithoutpolarizerglasses,leadingto25%lightefficiencyin3Dmodes.Theluminancedifferencewasalsoreducedtobearound0.5%,whichcanberecognizedassamelevelofleftandrightluminance.Fromtheoptimizedresults,itwasclarifiedthatthattheAR3Ddisplaytechnologycangivesuperior3Dimagequalityamongthevariousglassestype3Dtechnologieswithhighresolutionandluminance.

7863-21, Session 7

High-brightness film projection system for stereoscopic moviesL.Lipton,Oculus3D(UnitedStates)

Aplano-stereoscopicprojectionsystemconsistingofafilmformatandauniquelensforusewiththe35mmmotionpictureinfrastructurehasbeendesignedandimplemented.Thesystemovercomespriorartlimitationswithregardtoambiguitiesinthreadingandassemblyofprintswhichhasleadtopseudostereoscopicimages.Thenewsystemisatleastthreetimesbrighterthanmostpriorsingleprojectorfilmefforts.Itisalsofarbrighterthanthemajorityofdigitalstereoscopicprojectionproducts.Theimagesareextremelysteadyandsharpandcapableofbeingprojectedontoonthelargesttheatricalscreens.Thesystem,byOculus3D™,costslessthanafifthofthatofdigitalproductsandfillsaneedbecauseofthegreatnumberofstereoscopicfeaturefilmsbeingreleasedgiventheshortageofavailablescreens.

7863-22, Session 7

New generation of universal active glassesB.Mendiburu,Volfoni(UnitedStates);B.Caillaud,G.Jovene,T.Henkinet,Volfoni(France)

Wepresentanewgenerationofactiveglassesthatusesnewliquidcrystalshuttersandanelectroniccapableofrecognizingandsynchronizingwithmostpartofexisting3Ddisplays.Inthispaperwefocusontheelectro-opticalcharacterizationofourliquidcrystalsshutterstechnology,namedECB,andtheperformanceofouruniversalelectronicdemonstratingtheadvantageofthesenewglasses.

Ourliquidcrystal,confinedinaverythincellgap(1,5µm),hashomogeneousperformanceformostimportantparametersin3D.Indeed,ECBmaterialhasfastswitchingtimes,especiallyafastrelaxationtimepreventingcolorbanding.Secondly,lighttransmittanceandchromaticdependencehavebeenoptimizedtoimproveopticalquality.Inparallelwehavedevelopedanelectronicpartthatcanadaptstoanydisplayinthemarket,asitiseasilyupdatedbysoftwarewhennewdisplaysappears.

Inconclusion,ourtechnologyprovides3Dglasseswithincreasedimagebrightness,blockingstates,viewingangleandlowcolordependence.Combinedwithaprogrammableelectronictheyaregoodcandidateforthehighqualityeyewearneededbythenewgenerationof3-Ddisplays.

7863-23, Session 7

Continuously adjustable Pulfrich spectaclesK.M.Jacobs,BinghamtonUniv.(UnitedStates);R.S.Karpf,ADDISInc.(UnitedStates)

WhilemanyPulfrich3-Dmovieshavebeenproduced,thestandardimplementationhasinherentdrawbacks.ThefilmindustryhascorrectlyconcludedthatthestandardPulfirch3-Dimplementationisnotausefulcommercial3-Dtechnique.

ContinuouslyAdjustablePulfrichSpectacles(CAPS)isanew

implementationofthePulfrichEffectthatallowsanystandard2-Dmovietobeoptionallyviewedin3-Dusinginexpensiveviewingspecs.Itworksonanyviewingdevice.Withouttheglasses,themoviewillappearasanormal2-Dimage.

Recentscientificresultsinthefieldsofhumanperception,optoelectronics,videocompressionandvideoformatconversionaretranslatedintoanewimplementationofPulfrich3-D.CAPSusestheseresultstocontinuouslyadjusttothemoviesothattheviewingspectaclesalwaysconformtotheopticaldensitythatoptimizesthePulfrichstereoscopicillusion.Thisinstantlyprovides3-Dimmersiontoanymovingsceneinany2-Dmovie.

7863-24, Session 8

Visual discomfort with stereo displays: effects of viewing distance and direction of vergence-accommodation conflictT.Shibata,Univ.ofCalifornia,Berkeley(UnitedStates)andWasedaUniv.(Japan);J.Kim,D.M.Hoffman,M.S.Banks,Univ.ofCalifornia,Berkeley(UnitedStates)

Prolongeduseofconventionalstereodisplayscausesviewerdiscomfortandfatiguebecauseofthevergence-accommodationconflict.Weusedanovelvolumetricdisplaytoexaminehowviewingdistanceandthesignofthevergence-accommodationconflictaffectsdiscomfortandfatigue.Inthefirstexperiment,wepresentedafixedconflictatshort,medium,andlongviewingdistances.Wecomparedsubjects’symptomsinthatconditionandoneinwhichtherewasnoconflict.Weobservedmorediscomfortandfatiguewithagivenvergence-accommodationconflictatthelongerdistances.Inthesecondexperiment,comparedsymptomswhentheconflicthadonesigncomparedtowhenithadtheoppositesign,andwedidsoashort,medium,andlongdistances.Weobservedgreatersymptomswithuncrosseddisparitiesatlongdistancesandwithcrosseddisparitiesatshortdistances.Thesefindingshelpdefinecomfortableviewingconditionsforstereodisplays.

7863-66, Session 8

Effects of 3D display on accommodative and vergent responses and subsequent visual discomfort and motion sicknessS.Yang,PacificUniv.(UnitedStates)

Recentproliferationof3-Ddisplayhasbeenshowntoaugmentviewingexperiences.However,3-Dviewingalsoinducessignificantvisualdiscomfortandmotionsickness.Thisstudyinvestigatedwhetherthesesymptomsresultfromtheconflictbetweenaccommodativedemand,determinedbydisplaydistance,andthevergentdemanddeterminedbybinocularimagedisparity.Adultsubjectswereaskedtotrackthemotionofa3-Dtargetdisplayedinstepwiseorcontinuousmotion.Theyalsowatchedmoviesin2-Dand3-Dandreportedtheirvisualdiscomfortandmotionsicknesssymptomsbeforeandafterviewing.Subject’saccommodationandbinoculareyepositionwerecontinuouslymeasured,aswellastheirphysicalsymptoms.Resultsfoundthatsubjectsmadegreateraccommodativeandvergentresponseinviewing3-Dstimuliincontinuousmotion,butonlywhenthestimuluswasapproachingthem.Increaseinvisualsymptoms(blurredvision,doublevision,andoutoffocus)andmotionsickness(nausea,dizziness,anddisorientation)inviewing3-Dmoviewascorrelatedtotheincreaseinaccommodativeandvergentresponsesin3-Danimationforindividualsubjects.Thesefindingssuggestthatreductioninperceivednearmotioncouldhelpmaintain3-Dperceptionwhilekeepingthevisualdiscomfortandmotionsicknessatanacceptablelevel.



IS&T /

ReturntoContents

7863-25, Session 9

Effect of image scaling on stereoscopic movie experienceJ.P.Häkkinen,J.Hakala,AaltoUniv.SchoolofScienceandTechnology(Finland);M.Hannuksela,NokiaResearchCtr.(Finland);P.Oittinen,AaltoUniv.SchoolofScienceandTechnology(Finland)

Asstereoscopicmovieshavebecomemorepopular,theutilizationofmultipledistributionchannelsbecomesmoreimportant.EspeciallyBlu-Rayandmobiledeviceswillbesignificantchannelsfordistributingconsumercontent.However,changesintheimagesizeandviewingdistanceaffectthebinocularparallax,whichmightincreasethevisualloadaswellaschangetheviewingexperience.Inourstudyweexaminetheeffectofdifferentviewingcontextsontheexperiencedstereoscopicquality.Specifically,weexaminetheviewingexperienceofS3Dcontentsonsmalldisplays,hometheatersizeddisplaysandonacinemascreen.IntheexperimentswechangethecamerabasedistanceineachconditionandmeasuretheeffectofchangeonexperiencedS3Dquality.Thesameanimationcontentsandbasedistancesareutilizedinallthreeviewingconditionssothatwecancomparethesubjectiveresultswitheachother.Theresultswillofferusknowledgeofthethresholdswheretheviewingcontextandscalingbegintobevisibleandannoying.Thisknowledgecanbeutilizedtoformqualityguidelinesforstereoscopicrepurposing,inwhichthedepthofthecontentsischangedtoachievemoresatisfactoryviewerexperience.

7863-26, Session 9

Relationship between perception of image resolution and peripheral visual field in stereoscopic imagesM.Ogawa,K.Shidoji,KyushuUniv.(Japan)

High-resolutionstereoscopicimagesareeffectiveforvirtualrealityandteleoperationsystems.However,thehighertheimageresolution,thehigheristhecostofcomputerprocessingandcommunication.Toreducethecost,numerousearlierstudieshavesuggestedtheuseofmulti-resolutionimages,whichhavehighresolutionintheregionofinterests(ROI)andlowresolutioninotherareas.However,observerscanperceiveunpleasantsensationsandincorrectdepthbecausetheycouldseethelow-resolutionareaintheirfieldofvision.Inthisstudy,weconductedanexperimenttoresearchtherelationshipbetweentheviewingfieldandtheperceptionofimageresolution,anddeterminedtherespectivethresholdsofimage-resolutionperceptionforthepositionsoftheviewingfield.Intheresults,participantscouldnotdiscriminatethehigh-resolutionstimulusfromthedecreasedstimulus,63[ppi],atpositionsmorethan8[deg]outsidethegazepoint.Moreover,withpositionsshiftedfurther14[deg]fromthegazepoint,participantscouldnotdistinguishbetweenthehigh-resolutionstimulusandthedecreasedstimuliwhoseresolutiondensitieswere42and25[ppi].Theresultsshowthatwewillproposethecompositionofmulti-resolutionimagesinwhichobserversdonotperceivetheunpleasantsensationsandincorrectdepthwithdatareduction(compression).

7863-49, Poster Session

Human perception considerations for 3D content creationA.Green,AlmontGreenStudios(UnitedStates)

3Dcontentviewedwithstereopsisactivatesregionsofthebrainthatdealwithperceivingreality.Whereinconsistenciesexist,the3Dillusioncreatesperceptionconflictsthatmanifestwithanegativereaction.

ObservationsofpeopleviewingautostereoscopiclenticularphotographsshowthatluminancedynamicsmatchingofimagerytothatofhumaneyesusingHDRphotographictechniquesimprovesthe

subjectiveresponse.Matchingthesizeofthephotographtowhatisperceivedinrealityalsoevokespositiveresponse.Smallersizescreatea“KenandBarbieeffect”whichreferstothedolllikeappearanceofpeopledepictedinasmallsizephoto.

Usingaspecial12cameraarraytocapturemultipleperspectivessimultaneouslywithextendeddynamics,coupledtolongfocallengthlensesmakesitpossibletocreateimagerythatcloselymatchesreallifeperceptions.Lensdistortionsandinterocularspacingproblemsaremitigated.

TheresultingimageryisprintedatlifesizewithaninkjetsystemwithHDRinksonpaperdesignedforbacklighting.Bycreatingsampleswithandwithoutextendedluminancedynamicsandotherdynamicallyadjustibleparameters,itispossibletoevaluatehowaspectsofcontentcreationeffectperceptionquality.

UsingHDRtechniquesandprecisionmatchingofimagesizeandperspectivecanbeclearlyshowntoimprovethesubjectivequalityof3Dimagery.Binocularrivalryandbinoculardisparityissueshavealsobeenevaluatedandobservedandthenegativeaspectsof2Dto3Dconversionsdemonstrated.


System crosstalk issues on autostereoscopic displaysP.Wang,S.Hwang,H.Huang,C.Chuang,NationalTsingHuaUniv.(Taiwan)

Inthisstudy,thereweretwopartsofexperiments.Thefirstpartofexperimentwasconductedonamulti-viewautostereoscopicdisplayofmirror-typedisplays.Thefourexperimentalpicturesrandomlyappearedinthemirror-typedisplay.Thisresearchaimedatinvestigatingtherelationshipbetweensystemcrosstalkandperspectiveimagequality.Theindependentvariablewassystemcrosstalkoftheexperimentalpictures.Thedependentvariablesweresubjectiveevaluation,physiologicalandsubjectivemeasurestowardasthenopiaintheexperiments.Thesecondpartofexperimentinvestedthefittingvalueamong10levelsofsystemcrosstalkonmirror-typedisplaycorrespondingtothepictureshownonthebarrier-typedisplay.

Inthefirstexperiment,everylevelofsystemcrosstalktotheviewercrosstalkwassignificantlydifferent.Inaddition,theobjectiveandsubjectiveasthenopiaintheexperimentwerenotstatisticallysignificant.Inthesecondexperiment,viewers’evaluationsofthesystemcrosstalkonthefourpicturesweresignificantlydifferent.Thereexistedinteractioneffectbetweenthetypeofpictureandtheperspectivedistortion.Buttheperspectivedistortiondidn’taffectdistinctly.Thetypeofthepicturewouldaffectthetoleranceofviewer’scrosstalk.

Theoutcomeoftheresearchcouldprovidethedisplaymanufacturersaguidelinefordesigningautostereoscopicdisplaysofhighquality.


Automatic 3D video format detectionT.Zhang,Z.Wang,J.Zhai,Technicolor(UnitedStates);D.Doyen,TechnicolorS.A.(France)

Many3Dformatsexistandwillco-existforalongtimesincethereisno3Dstandardthatdefinesagenerallyaccepted3Dformat.Thesupportformultiple3Dformatswillbeimportantforbringing3Dintohome.Inthispaper,weproposeanovelandeffectivemethodtodetectwhetheravideoisa3Dvideoornot,andtofurtheridentifytheexact3Dformat.First,wepresenthowtodetectthose3Dformatsthatencodeapairofstereoimagesintoasingleimage.Theproposedmethoddetectsfeaturesandestablishescorrespondencesbetweenfeaturesintheleftandrightviewimages,andappliesthestatisticsfromthedistributionofthepositionaldifferencesbetweencorrespondingfeaturestodetecttheexistenceofa3Dformatandtoidentifytheformat.Second,wepresenthowtodetecttheframesequential3Dformat.Intheframesequentialformat,thefeaturepointsareoscillatingfromframetoframe.Similarly,theproposedmethodtracksfeaturepointsoverconsecutiveframes,



IS&T /

ReturntoContents

computesthepositionaldifferencesbetweenfeatures,andmakesadetectiondecisionbasedonwhetherthefeaturesareoscillating.Experimentsshowtheeffectivenessofourmethod.


Low-complexity 2D to 3D video conversionY.Chen,R.Zhang,M.Karczewicz,QualcommInc.(UnitedStates)

3Dfilmand3DTVarebecomingreality.Morefacilitiesanddevicesarenow3Dcapable.Comparedtocapture3Dvideocontentdirectly,2Dto3Dvideoconversionisalow-cost,backwardcompatiblealternate.Therealsoexistsatremendousamountofmonoscopic2Dvideocontentthatareofhighinteresttobedisplayedon3Ddeviceswithnoticeableimmersiveness.2Dto3Dvideoconversion,therefore,hasdrawnlotsofattentionrecently.Inthispaper,alowcomplexity2Dto3Dconversionalgorithmispresented.Theconversiongeneratesstereovideopairsby3Dwarpingbasedonestimatedper-pixeldepthmaps.Thedepthmapsareestimatedjointlybymotionandcolorcues.Subjectivetestsshowthattheproposedalgorithmachieves3Dperceptionwithacceptableartifact.


Development of a modular stereoscopic pre-visualisation and display frameworkV.Kuchelmeister,TheUniv.ofNewSouthWales(Australia)

Theincreasingpopularityforstereoscopiccontentintheentertainmentindustryandcomputergraphicsapplicationsandtheavailabilityofaffordablecaptureanddisplaysystemsisincontrasttotheactualknowledgeofunderlyingstereoscopicdesignprinciplesandfundamentalconcepts.Contentcreatorsandeducatorsunexperiencedinstereoscopyrequireintegrated,easytouseandflexibletoolswhichcanassistintheprocessofcreatingthethree-dimensional“look”theyareafterwithinthelimitsofacomfortableviewingexperience.

Theproposedframeworkinthispaper,acustomstereoscopicexportpluginforthepopular3DmodellingapplicationGoogleSketchupandaflexiblestereoscopicdisplayengine,allowsforstereoscopicpre-visualisationinnearreal-timeinaformatoftheirchoice.Theuserinterfacecanrecommendstereoscopicsettingsaccordingtothescene,cameraanddisplayproperties,calculatescorrespondingvaluesaccordingtomanualentriesbutalsoleavesunrestrictedcontroloverallparameters.Thedisplayengineallowsfordifferentstereoscopicformatstobeshownandsavestheresultinformofimageswithmetadataforreference.Particularattentionisputonusability,accessibilityandtightintegration.


Color appearance in stereoscopyD.Gadia,A.Rizzi,C.Bonanomi,D.Marini,Univ.degliStudidiMilano(Italy);A.Galmonte,Univ.degliStudidiVerona(Italy);T.Agostini,Univ.degliStudidiTrieste(Italy)

Therelationshipbetweencolorandligthnessappearanceandtheperceptionofdepthhasbeenstudiedsinceawhileinthefieldofperceptualpsychologyandpsycho-physiology.Ithasbeenfoundthatdepthperceptionaffectsthefinalobjectcolorandlightnessappearance.Inthestereoscopyfield,manystudieshavebeenproposedonhumanphysiologicaleffects,butfewhasconsideredcolorinformation.

GoalofthepaperisrealizingsomeexperimentsinVirtualRealityinordertodeterminetheeffectsofdepthperceptiononobjectcolorappearance.Weconsiderbordereffects,luminancegradients,spatialgradients,differentparallaxvalues,andweinvestigatehowdifferentchoicesofthesefeaturesaffectthefinalperception.

Wecreatedavirtual3Dtestscenewithasimpleconfigurationof

geometricfiguresoverafloatingbackground.Wegenerateddifferentstereoscopicrenderingsofthisscene,changingparametersforcolorandpositionoftheobjects.

WecollecttheperceptualresponsesofseveralusersaftertheobservationofthetestsceneinanimmersiveVirtualRealityroom(theVirtualTheateroftheUniversityofMilan).

Usersareaskedtojudgetherelativeappearanceunderdifferentversionsofthescenevaryingtherelativeobjectdepths.

Wepresentananalysisanddiscussionoftheseexperiments.


Coarse integral volumetric imaging with flat screen and wide viewing angleS.Sawada,A.Nakao,H.Kodaira,H.Kakeya,Univ.ofTsukuba(Japan)

Thispaperproposesaflat-screen3Ddisplaysystemwithwideviewingangleandlittledistortionbasedoncoarseintegralvolumetricimaging　(CIVI).CIVIcombinesmultiviewandvolumetricdisplaysolutionsandpresentsundistortedfloating3Dimagebycorrectingdistortionofvolumetricimageforeachview.

IntheconventionalCIVIwithlimitedviewingangle,distortionsofimageplainscanbeapproximatedparabolicinthedirectionofdepth,whilethoseinhorizontalandverticaldirectionscanbeignored.Whentheviewinganglebecomeswider,however,thisapproximationcannotrealizeundistortedimage.

Tocopewiththesedistortions,eachelementalimageisapproximatedwithindividualsecondorderequationsinthemethodwepropose.Alsodistortionsinhorizontalandverticaldirectionsarecorrectedbyusingtexturemapping.Toattainprecisecorrectioninvertical,horizontalanddepthdirections,opticalpathsoflightraysbetweenthedisplaypanelandeachviewpointarecalculatedwithanopticalsimulator.ColoraberrationiscorrectedbymappingRGBtexturesseparatelybasedontheresultofopticalsimulation.

CIVIprototypesystemwithflatscreenandwideviewingangleisproducedbasedontheabovedistortioncorrectionmethod.Itisconfirmedthattheproposedsystemworksasexpectedtocorrectopticaldistortions.


Coarse integral imaging without pseudo imageT.Kurokawa,H.Kakeya,Univ.ofTsukuba(Japan)

Coarseintegralimaging(CII),whereeachcomponentlensislargeenoughtocoverpixelsfarmorethanthenumberofviews,canshowclearfloating3Dimagewhendistortioniscorrected.

ThemajorproblemforCIIisemergenceofpseudoimagesthatappeararoundtherightimagetobepresented.Inthispaperweproposetwomethodstosuppresspseudoimages.

Torealizesuppressionofpseudoimage,wefirstproposeuseoflargeaperturelenswithsmallFnumberinfrontoftheelementallenses.WhenalargeaperturelensofsmallFnumberissetinfrontoftheelementallensessothatthedistancebetweenthemmaybethesameasthefocaldistanceoflargeaperturelens,onlypseudoimagescanbeerasedbyusingtotalinternalreflectionontheoutskirtofthelargeaperturelens.

Thesecondmethodweproposeisuseoflensarraybehindthedisplaypanelpairedwithsegmentedbacklight.Whenthelensispairedwithpropersizeofbacklight,leakofrayouttoadjacentelementallensesandlossofrayintoproperelementallensarebothavoided.Sincethebacklightareaisreduced,thismethodconsumeslesselectricpower.



IS&T /

ReturntoContents


Free-viewpoint image generation from a video captured by a handheld cameraK.Takeuchi,NagoyaUniv.(Japan);N.Fukushima,NagoyaInstituteofTechnology(Japan);T.Yendo,M.PanahpourTehrani,NagoyaUniv.(Japan);T.Fujii,TokyoInstituteofTechnology(Japan);M.Tanimoto,NagoyaUniv.(Japan)

Wepresentanovelsystemthatgeneratesfree-viewpointimagesusingafreelymovinghandheldcamerainstaticscene.

Togeneratefree-viewpointimagesusingacapturedvideobyahandheldcamera,viewframes’pose/positionareneeded.

Previously,acheckerboardpatternhastobecapturedineveryframetocalculatecamerapose/position.Thisapproachobtainsframes’pose/positioneasily,howeverweneedtohaveacheckerboardpatternwithknowndimensionsthatlimitstheapplication.

Inanothermethod,correspondingfeaturepointsinallframeimagesareusedtoestimatetheviewframes’pose/positions,assumingapseudoperspectiveprojectionwithoutcheckerboardpattern.Howeverduetotheassumption,thehandheldcameracannotchangeanglesincapturingthescene.

Toaddressaboveproblems,weproposeamethodthatusescorrespondingfeaturepointstocalculatecamerapose/positionusingthestateof-art,“StructurefromMotion”(SfM).Usingthismethod,wecanmoveahandheldcamerafreelywithanyangle.

Moreover,weproposeamethodthatgeneratesadepthmapbythenearestviewpointsatthelocationofthefreeviewpointimage.Proposeddepthestimationschemeusesgraph-cutsalgorithmforoptimization,whilereconstructedfeaturepointsobtainedinSfMareadditionallyusedtoenhancetheperformance.


New stereoscopic video shooting rule based on stereoscopic distortion parameters and comfortable viewing zoneW.Chen,J.Fournier,FranceTelecom(France);M.Barkowsky,P.LeCallet,Univ.deNantes(France)

Inthispaper,weproposedanewstereoscopicvideoshootingruleconsideringtwomostimportantissuesin3DTV:stereoscopicdistortionandcomfortableviewingzone.Theresultsofthisstudywillprovideanewmethodtoproposecameraparametersbasedonmanagementofnewcriteria(depthandshapedistortionanddepthoffocus)inordertoproduceoptimizedstereoscopicimagesandvideos.


Reduced-view super multi-view displayJ.Nakamura,K.Tanaka,TokyoUniv.ofAgricultureandTechnology(Japan);C.Tsai,IndustrialTechnologyResearchInstitute(Taiwan);Y.Takaki,TokyoUniv.ofAgricultureandTechnology(Japan)

InordertoreducethenumberofviewsoftheSMVdisplay,twoormoreviewsaregeneratedonlyaroundviewer’sleftandrighteyes,withanintervalsmallerthanthepupildiameter.Thepositionsoftheviewsaremovedaccordingtotheviewer’seyepositionstoincreaseviewingfreedom.Thereduced-viewSMVdisplayisimplementedusingalenticular3Ddisplay.Acylindricallensconstitutingalenticularlensprojectsagroupofpixelstogenerateagroupofviews.Thepixelgroupgeneratingtheleftviewgroupandthatgeneratingtherightviewgroupthroughanidenticalcylindricallensarespatiallyseparatedtoseparatetheviewgroups.Theleftpixelgroupsandtherightpixelgroupsfordifferentcylindricallensesareinterlacedhorizontallyontheflatpaneldisplay.Aprototypereduced-viewSMVdisplaywasconstructed.Eachviewgroupconsistedofeightviews.Theintervaloftheviewswas2.6

mm.AnLCDpanelwithaslantedsubpixelarrangementwasused.Thescreensizewas2.57inchesandthe3Dresolutionwas256×192.AUSBcamerawasattachedtothedisplaytodetectviewerposition.Theframerateofthefacedetectionandtheimageupdatewas30Hz.


Psycho-physiological effects of visual artifacts by stereoscopic display systemsS.Kim,J.Yoshitake,H.Morikawa,T.Kawai,O.Yamada,A.Iguchi,WasedaUniv.(Japan)

Themethodsforstereoscopic(3D)displayswithglassescanbeclassifiedastime-multiplexingandspatial-multiplexing.Eachmethodhasitsintrinsicvisualartifacts.Withthetime-multiplexingmethod,anobserverperceivesthreeartifacts:flicker,theMach-Dvorakeffect,andphantomarray.Theyonlyoccurunderacertaincondition:anycondition,duringsmoothpursuit(SPM),andduringsaccadiceyemovements,respectively.Withthespatial-multiplexing,temporal-parallax(duetointerlacedvideosignal),binocularrivalryandlow-resolutionwouldbeinduced.Theseartifactsareconsideredoneofthemajorproblemstosafetyandcomfortofobserverswhileviewing3Ddisplay.Inthisstudy,inordertoevaluatetheimplicationsofthevisualartifactstothesafetyandcomfort,physiologicalchangeswereexaminedthroughsubjectivesymptomsofvisualfatigueanddepthsensation.Also,tounderstandthecharacteristicsofeachartifactandthecombiningeffectsoftheartifacts,fourexperimentconditionsweredesigned.Theresultsshowedthattheperceptionofthevisualartifactsdiffersfromvisualenvironmentsanddisplaymethods.Furthermorevisualfatigueanddepthsensationwasinfluencedbyindividualcharacteristicsofvisualartifacts.


2D viewing experience with fixed 3D displaysM.Salmimaa,T.Jarvenpaa,M.Polonen,NokiaResearchCtr.(Finland)

Themaingoalofthepaperistopresentresultsfromsubjectivestudieswhereparticipantsevaluate2Dcontentcreatedbyusingdifferentrepresentationmethodsandrenderedonasmall-sizeautostereoscopic3Ddisplay.Thedisplayinquestionhasalenticularlensasastereostructure,andthestereostructureassuchcannotbeswitchedbetween2Dand3Dmodes.Subjectiveopinionsonthedifferentrepresentationsofthe2Dcontentonastereoscopic3Ddisplayhavebeenstudiedandtheseviewingexperiencescompared.


Interestingness of stereoscopic imagesJ.Hakala,M.Nuutinen,P.Oittinen,AaltoUniv.SchoolofScienceandTechnology(Finland)

Theaddedvalueofstereoscopyisanimportantfactorforstereoscopicproductdevelopmentandcontentproduction.Previousstudieshaveshownthat‘imagequality’doesnotencompasstheaddedvalueofstereoscopy,andthustheattributesnaturalnessandviewingexperiencehavebeenusedtoevaluatestereoscopiccontent.Theobjectiveofthisstudywastoexplorewhattheaddedvalueofstereoscopymayconsistofandwhatarethecontentpropertiesthatcontributetothemagnitudeoftheaddedvalue.Thehypothesiswasthatinterestingnessisasignificantcomponentoftheaddedvalue.Asubjectivestudywasconductedwheretheparticipantsevaluatedthreeattributesofthestimuliinconsumerphotographydomain:viewingexperience,naturalnessofdepthandinterestingness.Inadditiontotheno-referencedirectscalingmethodanovelmethod,therecalledattentionmap,wasused.Weconcludefromtheresultsthatinterestingnessisafactorofequalimportanceasnaturalnessinthe



IS&T /

ReturntoContents

addedvalueofstereoscopyinstillimages.FromthequalitativeresultsandRAMswefoundthatlocaldifferencesindistancesdrawpositiveattentioninstereoscopicimagesandweproposethata‘localdisparitycontrast’metricneedstobedeveloped.


Subjective evaluation of HDTV stereoscopic videos in IPTV scenarios using absolute category ratingK.Wang,AcreoAB(Sweden);M.Barkowsky,R.Cousseau,Univ.deNantes(France);K.E.Brunnström,AcreoAB(Sweden);R.Olsson,MidSwedenUniv.(Sweden);P.LeCallet,Univ.deNantes(France);M.Sjöström,MidSwedenUniv.(Sweden)

InthisworkasetofprocessedvideossequencesaredesignedforcomparingthecodingperformanceandtransmissionefficiencyofHDS3DstereoscopicsequencesinanumberofIPTVscenarios.ThesescenariosincludeH.264/AVCsimulcastcodingaswellasMultiviewVideoCoding(MVC).Inaddition,spatialandtemporalsubsamplingaswellascomparisonto2Dpresentationisconsidered.Asubjectiveexperimentisconductedtoinvestigatetheusers’experienceofstereoscopicvideoqualitybyusingtheAbsoluteCategoryRatingmethod(ACR)methodwithatrainingsessionthatusesDoubleStimulusContinuousQualityRating(DSCQR).

Apre-testofsubjectiveexperimentresultshowsthespatialandtemporaldownsamplingmaybeconsideredasanalternativetoincreasingthecodingquantizationparameterforimprovingcompressionefficiency.Theinfluenceofthecontenttypeontheoptimalchoiceofspatialandtemporaldownsampling,theappropriateencoder,anditsparameterswillbeanalyzedinthefinalpaper.


Improved depth map estimation in stereo visionH.Fradi,J.E.Dugelay,EURECOM(France)

Researchershavebeengivingespecialattentiontostereovisionsystemscapableofperceivingaccuratedepthinformation.Inthisarticle,weproposeanewstereomatchingalgorithmbasedoncorrelationandshowingprogressinhandlingproblemofmismatchedpoints.Itisathreestepsframework;first,anappropriatecostmatchingisusedtoavoidthedrawbacksofpossibleambiguousmatchescausedbytheviolationoftheresemblanceconstraint.Then,abidirectionalmatchingisappliedtodetectandtorejectmismatches.Amatchingisvalidonlyifafterareturn(e.g.right-left-right)thefinalpositionisthesameastheinitialone.Third,thecreatedholeswillbefilledinbyincorporatingedgesdetectiontoavoidthatawindowcontainsmorethanoneobject.Itisanefficientmethodforselectingavariablewindowsizewithadaptiveshapeinordertogetaccurateresultsatdepthdiscontinuitiesandinhomogeneousareaswhilekeepingalowcomplexityofthewholesystem.Theresultingdisparitymapcanbeconvertedtodepthmapusingsimpletriangulation.ExperimentalresultsusingtheMiddleburydatasetsdemonstratethevalidityofourpresentedapproach.Themaindomainofapplicationsforthisstudyisthedesignofnewfunctionalitieswithinthecontextofmobiledevices.


Is visual fatigue changing the perceived depth accuracy on an autostereoscopic display?M.Barkowsky,R.Cousseau,P.LeCallet,Univ.deNantes(France)

Recently,3DTVserviceshavebeenintroducedtothepublictogether

withavarietyofstereoscopicandauto-stereoscopicdisplays.However,visualfatigueisastillaseriousthreadtothewideadoptionofbinocularpresentationofvideocontents.Inthispaper,asubjectivestudyispresentedwhichaimstomeasuretheminimumperceivabledepthdifferenceonanautostereoscopicdisplay.Thedevelopedexperimentalsetupwasusedtocomparethesubject’sperformancebeforeandafter3Dexcitationonanautostereoscopicdisplay.Bycomparingtheresultstoaverificationsessionwith2Dexcitation,theeffectof3Dvisualfatiguecanbeexamined.


Interlaced MVD format for free viewpoint videoS.Lee,S.Lee,J.Lee,H.Wey,D.Park,C.Kim,SamsungAdvancedInstituteofTechnology(Korea,Republicof)

Anew3Dvideoformatwhichconsistsofonefullresolutionmonovideoandhalfresolutionleft/rightvideosisproposed.Theproposed3Dvideoformatcangeneratehighqualityvirtualviewsfromsmallamountofinputdatawhilepreservingthecompatibilityforlegacymonoandframecompatiblestereovideosystems.Thecenterviewvideoisthesamewithnormalmonovideodata,butleft/rightviewsareframecompatiblestereovideodata.Thisformatwastestedintermsofcompressionefficiency,renderingcapability,andbackwardcompatibility.Especiallywecomparedviewsynthesisqualitywhenvirtualviewsaremadefromfullresolutiontwoviewsoroneoriginalviewandtheotherhalfresolutionview.Forframecompatiblestereoformat,experimentswereperformedontwosamplingcases,interlacedandquincunxmethod.Foreachcase,theproposedformatgivesBDbit-rategainsupto15　.


Visual discomfort prediction for stereo contentsS.He,T.Zhang,Technicolor(UnitedStates);D.Doyen,TechnicolorS.A.(France)

Thecurrentrenaissanceof3Dmovieshasdrawnmoreandmoreattentionfromtheaudience.Whilepeopleareenjoyingthe3Dmoviesinthetheater,theyalsohopetobringthe3Dexperiencetohome.Three-dimensionaltelevision(3DTV)hasbeenexpectedtobethenextadvanceintelevision.In3DTVscenario,muchsmallerscreensizesandviewingdistancesinhometheatersetupputmorerestrictionsonthe3Dcontentfedintothe3DTV,e.g.asmallerdepthrange.Ontheotherhand,differentpeoplehavedifferentcomfortrangeofdepthina3Dcontent.Theversionofthe3Dcontentsenttohomewillnotstratifyallthepeopleinonefamily.Inthispaper,wetrytosolvethisproblembyprovidingapredictionofviewingdiscomfortofcertaininputcontentbycertainviewer.OurmethodisbasedontheDisparityDiscomfortProfile(DDP)builtthroughsubjectivetestforeachviewer.Theinputcontentisanalyzedbystudyingitsdisparitydistribution.Thepredictionofdiscomfortisperformedbymatchingthedisparitydistributionwiththeviewer’sDDP.Thenamechanismtoallowtheviewerstoadjustthedepthrangeaccordingtotheirvisualcomfortprofileorviewingpreferenceisusedtominimizethediscomfort.Experimentsshowpromisingresultsoftheproposedmethod.


Three-dimensional holographic display using active shutter for head mounted display applicationH.Kim,J.Park,ChungbukNationalUniv.(Korea,Republicof)

Three-dimensionalholographicsystemusingactiveshuttersforheadmounteddisplayisproposed.Conventionalthree-dimensional



IS&T /

ReturntoContents

headmounteddisplaysuffersfromeye-fatiguesinceitonlyprovidesbinoculardisparity,notmonoculardepthcueslikeaccommodation.Theproposedmethodpresentstwohologramsofa3Dscenetocorrespondingeyesusingactiveshutters.Sinceaholographydeliveredtoeacheyehasfullthree-dimensionalinformation,notonlythebinoculardepthcuesbutalsomonoculardepthcuesarepresented,eliminatingeye-fatigue.Theapplicationtotheheadmounteddisplayalsogreatlyrelaxestheviewinganglerequirementthatisoneofthemainissuesoftheconventionalholographicdisplays.Inpresentation,theproposedopticalsystemwillbeexplainedindetailwithexperimentalresults.


Pixel-offset position detection using lens array for integral three-dimensional displayH.Sasaki,NHKScience&TechnicalResearchLabs.(Japan);M.Kawakita,NationalInstituteofInformationandCommunicationsTechnology(Japan)andNHKScience&TechnicalResearchLabs.(Japan);K.Masaoka,J.Arai,M.Okui,F.Okano,NHKScience&TechnicalResearchLabs.(Japan);Y.Haino,M.Yoshimura,M.Sato,JVCKENWOODHoldings,Inc.(Japan)

Intheintegral3DTVsystem,high-densityelementalimagesarenecessarytoenhancereconstructed3Dimagequality(resolution,viewingzoneandangle,anddepthdirectionalrepresentability).Thedual-greenpixel-offsetmethod,whichusestwogreenimages(G1andG2),iswellknownasameansofachievingultrahigh-resolutionimagery.

Weproposeapreciseandeasymethodfordetectingthepixel-offsetdistancewhenthelensarrayismountedinfrontofthedisplaysurface.Inthismethod,patternluminancedistributionsbasedonsinewavesaredisplayedoneachG1andG2panel.Thedifferencebetweenphases(amountofphasevariation)ofthesepatternsisconservedwhenthepatternsaresampledandaliasedtothelowerfrequencybythelensarray.Thisallowsthepixel-offsetdistanceofthedisplaypaneltobemeasuredinastateofmagnification.

Inthiscase,relationbetweenthecontrastandtheamountofphasevariationofthepatterniscontradictedinrelationtofrequency.Wethereforedevisedawaytofindtheoptimalspatialfrequencyofthepatternbyregardingtheproductofcontrastandamountofphasevariationofthepatternsasanindicatorofaccuracy.

Wemeasuredandadjustedthepixel-offsetdetectionmethoddescribedabovewiththedevelopeddisplaysystem.ResultsdemonstratethattheMTFofelementalimageswererefined.Weexpectthattheresolutioncharacteristicsofthedepthdirectionwillbeimprovedbyit,thusensuringhigherqualityreconstructed3Dimages.


3D imaging for glasses free multi-view 3D displaysS.Gurbuz,M.Kawakita,NationalInstituteofInformationandCommunicationsTechnology(Japan);S.Yano,NHKScience&TechnicalResearchLabs.(Japan);S.Iwasawa,H.Ando,NationalInstituteofInformationandCommunicationsTechnology(Japan)

Inthispaper,amulti-viewbased3Dimagingtechniqueisdescribedforlifelike3Dvisualizationonmulti-viewauto-stereoscopicdisplays.Thelifelikemulti-view3Dvisualizationrequiresregenerationofthelightfieldofasceneforeveryview.Thecompletelightfieldofascenecanbereconstructedfromtheimagesofasceneideallytakenfrominfiniteviewpoints.However,capturingtheimagesofascenefrominfiniteviewpointsisnotfeasibleforpracticalapplications.Therefore,inthiswork,weprovidethedetailsofthemulti-cameraimagealignmentprocedureandvirtualcameraviewimagegenerationthatarenecessarytoachievegoodvisualizationquality.Forthetask,weutilizeanarrayofhardware-synchronized30camerastocapturemultipleperspectivereal-views+29virtualviewsofthescenewithproperhorizontalparallaxandocclusionrelationships.Thus,ateverytimeinstance,59

independentperspective(30real+29virtual)viewsat1080×1920resolutionaredisplayedonthemulti-viewauto-stereoscopicdisplay.Themajorcontributionsarecomputationalalignmenttechniqueformulti-cameraimagesandnovelviewvideorenderingwhereanewalgorithmefficientlyrendersanovelviewfromtwo(leftandrightcamera)videostreams.


Reduction of image blurring in an autostereoscopic multilayer liquid crystal displayH.Gotoda,NationalInstituteofInformatics(Japan)

Amultilayerliquidcrystaldisplay(LCD)isadisplaydeviceconstructedbystackingmultipleliquidcrystallayersontopofalightsource.Inapreviousstudy,wehavealreadyshownthatamultilayerLCDcandelivervaryingimagesdependingontheviewers’eyepositions,andcanbeusedforauto-stereoscopic3Dviewing.However,undesirableblurringissometimesobservedintheimagesthataviewerreceivesfromthedisplay.Suchblurringisnotableespeciallyaroundobjectsinthescenethatarefarawayfromtheviewer.

Toaddressthisproblem,weproposetoplaceanopticallensinfrontoftheliquidcrystallayers.Thelensrefractsthebeamsoflight,thusbringingtheeffectsofmovingthefarobjectstonearerpositions.Throughasimulation-basedstudy,weshowthatanoptimalchoiceexistsforthefocallengthofthelens,whichreducesthelocalimageblurringwhilenotcompromisingtheoverallimagequality.

Aprototypedisplaywith4layershasbeenimplementedtodemonstratethatautostereoscopic3DviewingisreallypossiblewithamultilayerLCD.Theimplementationalsoindicatesthattheproposedmethodiscomparabletootherautostereoscopicmethodssuchastheintegralimagingusinganarrayofmicrolenses.


A new volmetric 3D display using multi-varifocal lens and high-speed 2D displayT.Sonoda,H.Yamamoto,S.Suyama,Univ.ofTokushima(Japan)

Wehavedevelopedanewvolumetric3-Ddisplayusingthemulti-varifocallensandhigh-speed2-Ddisplay.Floatingclear3-Dimageinspacecanbesuccessfullyobtained.Ourvolumetric3-Dimageiscomposedofmany2-Dlayeredimagesbyusingmulti-varifocallens.Many2-Dimagescanbelayeredbychangingtheirdepthpositionusingthediscretefocallengthchangeofmulti-varifocallens.Thehigh-speedmulti-varifocallensiscomposedofseveralsetsofabirefringentlensandapolarizationswitchingdevice.Thetotallenspoweristhesumofthelenspowersofthesesets.Thenumberoflenssets,N,canyield2Nvariationsoftotalfocallengths.Inordertore-positionmany2-Dimageswithinafterimagetime,high-speed2-Ddisplayisnewlyconstructedbymulti-projectorsusingLEDlight-sources.Multi-projecterimagesareprojectedtothesamepositionofonescreen.Byswitchingmulti-projectorsquickly,2-Dimagesonthescreencanbedisplayedathigh-speed.Thishigh-speed2-Ddisplaycansuccessfullyprovidebrightandclear2-DlayeredimagesbyusingpointlightsourcesofLED.


A novel super-multi-view display containing 7 680 perspective viewsA.Grasnick,SunnyOceanStudiosPte.Ltd.(Singapore)

Withtheincreasingavailabilityof3Dmovies,againinginterestinautostereoscopic(AS)3Ddisplaysbecomesobvious.

Oneofthemostcommonapplicationsforglassesfreesystemsistheuseinpublicpresentations(out-of-homeadvertising).Butalmostevery



IS&T /

ReturntoContents

AS-3Ddisplayhasalimitedviewingarea.Theresulting“sweetspot”preventsthesedisplaysfromauniversalusage.

Theraiseofthenumberofperspectiveviewscouldextendtheviewingarea.

Aprojectionofmorethanoneperspectiveimageinthepupiloftheobserver’seyewillreducethevergence-accommodation-conflictandallowfocusingonthevirtualimages.ButanadaptionoftheSuper-Multi-View(SMV)conditiontoanactualdisplaywillreducetheresolutiondramatically.

The7.680viewdisplayusesaconceptofsuperpositionandmultiplexingofdifferentperspectiveviewsanddisplaypixels.

InthisconceptthemainadvantageisanenlargementoftheviewingareawithoutthesamelevelofresolutionreductionasinaconventionalAS-3Ddisplay.

SMVusingsuperpositionandmultiplexingcanbeusedwithdifferentAS-3Dtechnologies,i.e.lenticulars,zoneplatesorparallaxbarrier.

Thepurposeofthelaboratorysetupinthisresearchwastoshowafunctionalmodelusingupto7.680differentviews,whereaseveryviewhasaverylowresolution.Withthesamenumberofcolumnsandviews,eachdisplayedperspectiveviewhasaneffectivehorizontaldimensionofonly1subpixel(1/3pixel).

Thescreenimageforoneframeofthe7.680viewdisplaywasmadefromastackofperspectiveimages.Basedontheresultsofthetests,itisdiscussedifandhowtheSMV-AS-3Dtechnologywithmanythousand(ormore)viewscanbecomeatrendin3Dtelevision.


Use of camera drive in stereoscopic display of learning contents of introductory physicsS.Matsuura,TokyoGakugeiUniv.(Japan)

Stereogramdisplayofphysicssimulationsforintroductoryphysicslearnerswerecreated.Sincethedesignsofsimulationsaresimple,andtheirimageshavelesspictorialclues,the3Dmodelsprojectedontothe2Dplanearenoteasytounderstandthecomposition.Then,thestereoscopicrepresentationstronglyimproveseasinessforunderstanding.Also,thecamera-drivingmeansseemedtothelearnerfeelexploratoryonthesimulations.


Producing content for 3D home theaterJ.J.Karns,XDImages(UnitedStates)

3-dimensionaldisplaysforhomeusecanbedividedinto3broadclasses:VirtualRealitydisplays,Stereoscopicdisplaysrequiringglasses,andAutostereographiclenticulardisplays.Virtuallyall3Dcontentproducedforthehomeentertainmentmarkettodaycanbeviewedusinganyofthe3classesof3Ddisplayswithroughlythesamelevelof3Dquality.Fullappreciationofautostereographiclenticulardisplaysrequiresmultiple-perspective3Dcontentcreatedusinganadjustable-parallax,multi-viewsystem.Mostnotably,autostereographiclenticulardisplaysprovidemotionparallaxandchangesinocclusionwithinasingletemporalframethatarenotrealizedincontentderivedfrom2-viewstereocontent.

Wehavebeenexperimentingwithanewlyinvented,adjustable-parallax,multi-perspectivecamerasystemtoidentifyconsumerpreferencesinmulti-perspective,3D,lenticularimages(parallaxpanoramagrams).Wehavefoundthatmuchoftheconventionalwisdomandresearchrelatedto2-viewstereogramsmustbesignificantlymodifiedasitrelatestoparallaxpanoramagrams.Ourobjectiveistoidentifyandcodifyfactorsandmethodsthatmaximizeconsumeracceptanceandenjoymentofmulti-perspectiveparallaxpanoramagrams.


DWT-based stereoscopic image watermarkingM.P.Mitrea,A.Chammem,F.J.Prêteux,TELECOM&ManagementSudParis(France)

Thepresentpaperdealswithstereoscopicimageprotectionbymeansofwatermarkingtechniques.First,astudyontheoptimalinsertiondomainiscarriedout.Secondly,inordertoreachthetrade-offbetweentransparencyandrobustness,principlesfromspreadspectrumandsideinformationarecombined.Finally,theexperimentswereperformedonthestereoscopicimagedatabaseorganisedatEcolePolytechniqueFédéraledeLausanneparT.EbrahimiandL.Goldman(http://mmspl.epfl.ch/page38841.html)andonastereoscopicmedicalimagedatabasebuiltattheARTEMISDepartment(Dr.CatalinFetita,www.it-sudparis.eu/artemis).

Themarkisrepresentedby64bitstobeinsertedinthe(9,7)DWT(DiscreteWaveletTransport)correspondingtotherightviewofthestereoscopicimage.Inordertobenefitfrombothhighrobustnessandlargedatapayload,theinsertiontechniquecombinesspread-spectrumandinformed-embeddingprinciples(accordingtoabasicideapresentedintoapreviouspatenttheauthorsfilled-in).Themarkedstereoscopicimageisobtainedbycombiningthewatermarkingrightviewandthedisparityinformationcomputedbetweenthewatermarkedrightviewandtheoriginalleftview.

Theexperimentsexhibitedstrongrobustnessagainstlinear(Gaussian)andnon-linear(sharpening,median)filteringandgeometricattacks.Thetransparencywasestablishedbybothobjectiveandsubjectivecriteria.


Development of a new HD multi-view camera and processing systemC.Park,J.Lee,J.Kang,KoreanBroadcastingSystem(Korea,Republicof);K.Lee,KoreanBroadcastingSystem(UnitedStates)

Wehavedevelopedanewmulti-viewcamerasystemwhichconsist9piecesofHDcamcorders,amechanicalapparatusandacontrolPC.Byusingthissystem,sometestmulti-viewvideosweremadeandsuccessfullydisplayedinthelenticulartypemulti-viewdisplay.

The9piecesofHDcamcordersarelocatedonthelineartypestandwhichissupportedbythreecameratripods.Eachviewof9camerasisrecordedintoeachcamcorder,atthesametime,twoviewvideosaretransportedtothecontrolPCduringpicture-takingformonitoring.WealsodevelopedothercapturingmethodwhichcapturesallviewssimultaneouslyintoPCviaIEEE-1394.

Thedistancebetweeneachcameracanbecalibratedmanuallyfromtheminimumcontactedpositionandthemaximum25cm,andtheopticalaxisofeachcameracanberotatedbyservo-motor.Thereforewecanconfigurenotonlythetoed-incamerasystembutalsotheparallelaxiscamerasystem.

Sometestvideoswerecapturedbythissystemandconvertedtomulti-viewformatvideoofthemulti-viewdisplay.Then,wesuccessfullywatchedthemulti-viewvideoviathelenticulardisplaywithoutwearingglasses.

Inthefuture,wewillresearchonthebestcapturingconditionofthismulti-viewcamerasystem.


Multi-view video codec based on KTA techniquesJ.Seo,K.Sohn,YonseiUniv.(Korea,Republicof)

Multi-viewvideocoding(MVC)isvideocodingstandardformulti-



IS&T /

ReturntoContents

viewvideodatabyISO/IECandITU-T.ItshowedaveragePSNRgainof1.5dBcomparedwithview-independentcodingbyH.264/AVC.However,becauseresolutionofmulti-viewandstereoscopicvideoisgettinghigherfor3Deffectandreality,highperformancevideocodecisrequired.

TheMVCadoptedhierarchicalB-picturestructureandinter-viewpredictionforcodingefficiency.HierarchicalB-picturestructureremovesredundancyintimeaxisregardlessofthecharacteristicsofmulti-viewvideo,andinter-viewpredictionreducesinter-viewredundancybypredictionfromreconstructedneighborviews.InstandardizationprocessoftheMVC,Othertechniqueswereproposed,suchasilluminationcompensationbetweenviews,motioninformationskipmodeandviewsynthesismode.However,theywerenotadoptedfortheMVC,becausetheydidnotshowsufficientcodinggain.Thus,weproposeenhancedvideocodecformulti-viewvideobyKeyTechnologyArea(KTA)techniques.TheKTAisanewvideocodecbyVideoCodingExpertGroup(VCEG),anditisbeingcarriedoutforcodingefficiencyandlowercomputationalcomplexity.TheKTAsoftwareshowedbettercodinggainthanH.264/AVCbyusingadditionalcodingtechniques.Thetechniqueswereproposedfor2Dvideo,butweappliedthemformulti-viewvideo.


On-screen-display (OSD) menu detection for proper stereo content reproduction for 3D TVE.V.Tolstaya,V.V.Bucha,M.Rychagov,SamsungElectronicsCo.,Ltd.(RussianFederation)

Modernconsumer3DTVsetsareequippedwithspecialdevicesallowingswitchingbetweenmonoandstereomodes.Besides,someoutsidedevicessuchasDVDorBlue-rayplayers,areabletodisplaysomeadditionalinformationbyimposinganoverlaypictureonvideocontent,anOn-Screen-Display(OSD)menu.InthiscaseTVsetmustrecognizethetypeofOSD,whetheritismonoorstereo,andvisualizeitcorrectlybyeitherswitchingoffstereomode,orcontinuedemonstrationofstereocontent.

WeproposeanewstablemethodfordetectionofmonoOSDonstereocontent.MonoOSDissettobearectangularareawithlettersandpictograms.OSDmenucanbeofdifferenttransparencylevelsandcolors.ThemainproblemindetectingOSDistodistinguishwhetherthecolordifferenceisduetoOSDpresence,orduetostereoparallax.WeappliedspecialtechniquestofindreliableimagedifferenceandadditionallyusedacuethatusuallyOSDhasveryimplicitgeometricalfeatures:straightparallellines.Thedevelopedalgorithmwastestedonourvideosequencesdatabase,withseveraltypesofOSDwithdifferentcolorsandtransparencylevelsoverlaiduponvideocontent.Detectionqualityexceeded99%oftrueanswers.


Real-time ray-space transfer of cylindrical objective spaceT.Yendo,NagoyaUniv.(Japan);T.Fujii,TokyoInstituteofTechnology(Japan);M.PanahpourTehrani,M.Tanimoto,NagoyaUniv.(Japan)

Weproposeareal-timeray-spacetransfersystemthatcapturesraysaroundacylindricalobjectivespacefrom360degreehorizontallyandreconstructsitas3Dimagewhichcanbeseenfrom360degree.Thesystemconsistsofaraycapturingunit,adisplayunit,andadataconverter.

Theraycapturingunitisthesameaswehavealreadyproposed;itconsistsofascanningopticssystemandahigh-speedcamera.Thescanningopticssystemiscomposedofadouble-parabolicmirrorshellandarotatingflatmirrortiltedat45degreestothehorizontalplane.Themirrorshellproducesarealimageofanobjectthatisplacedatthebottomoftheshellandtherotatingmirrorreflecttheimagetothe

camera,thatenablescapturingimagesfromdifferentanglesasifthecameraismoving.

Accordingtothepropertyofthescanningopticssystem,capturedimageincludesdistortionandisrotated.Thedataconverterprocessescompensationofthedistortionandtherotationinreal-time.Duetoextremelyhighframerate,wedevelopedaspecialhardwareusingFPGAtoachievereal-timeimageprocessing.Thedataconverteralsoconvertsimagedataintosuitableformatfortransmittingtothedisplay.

ThedisplayunitisalsoknownastheSeelinder,nowwehavedevelopednewversionofthedisplay.Itcanreceivesimagedataofover300viewsanddisplayitinreal-timewith24bitcolordepth.


Analysis of scene distortions in stereoscopic images due to the variation of the ideal viewing conditionsA.Viale,D.Villa,D.Marini,Univ.degliStudidiMilano(Italy)

Recentlystereoscopyhasincreasedalotitspopularityandvarioustechnologiesarespreadingintheatersandhomesallowingobservationofstereoscopicimagesandmovies,becomingaffordableevenforhomeusers.Howevertherearesome“goldenrules”thatusersshouldfollowtoensureabetterenjoymentofstereoscopicimages,firstofalltheviewingconditionshouldnotbetoodifferentfromtheidealones,whichwereassumedduringtheproductionprocess.

Toperceivestereodepthinsteadofaflatimage,twodifferentviewsofthesamesceneareshowntothesubject,oneisseenjustthroughhislefteyeandtheotherjustthroughtherightone;thevisionprocessismakingtheworkofmergingthetwoimagesinavirtualthree-dimensionalscene,givingtotheusertheperceptionofdepth.Theproblemisthatviewingthesametwoimagesfromadifferentpositionwillresultin“merging”adifferentvirtualthree-dimensionalscene.

Withthesetofinstrumentswehavedeveloped,wecananalyzedifferentviewingconditionsofthestereoscopicsceneinordertoconfigureaviewingenvironment,eitheramovieorahometheater,toallowacorrectvisionasindependentaspossiblefromtheidealviewingconditions.


Analysis of resolution limitation of glasses-free 3D tabletop displayD.Moldovan,S.Yoshida,M.Kawakita,H.Ando,NationalInstituteofInformationandCommunicationsTechnology(Japan)

Inthisworkweproposeamethodtocomputethemaximumdisplayableresolutionofatabletopdisplayusingconic-shapedopticaldevice.OurapproachemployscomputationoftheNyquistfrequencyofthe3-Dimageinbothverticalandhorizontalplanes.Bybeingabletoextractaformulathatcombinesprojector’spitchandopticalfeaturesofthescreenforcomputingthemaximumresolution,futureapplicationswillbenefitbyknowinghowtodesignthetabletopdisplayinordertoobtaina3-Dimageofacertainresolution.


Image quality of up-converted 2D video from frame-compatible 3D videoF.Speranza,W.J.Tam,C.A.Vázquez,A.Vincent,R.Renaud,R.Klepko,CommunicationsResearchCtr.Canada(Canada)

Howtoprovideboth2Dand3Dvideowithhighpicturequalityisakeyconcernforbroadcastersandcontentproviders.Videoformatplaysarole.AlthoughService-compatibleformatsprovide3Dvideocapabilitieswiththeabilitytodeliverregular2Dvideoservices,theyrequiremorebandwidthfortransmission.WithFrame-compatible



IS&T /

ReturntoContents

formats,theseparateleftandrightviewsarereducedinresolutionandpackedtofitwithinthesamevideoframeasaconventional2Dsignal.Thus,theydonotrequiremorebandwidththancurrent2Dsystems.However,withFrame-compatibleformatsthequalityofthe2Dvideoisanissuebecausea3Dserviceshouldalsobeabletodeliveraconventional2Dsignal.Suchas,whentheviewerelectstowatchaprogramin2Dorwhena2Dsignalneedstobeintermixedwith3Dprogramminginadvertising.Wereportintwoexperimentsthelossinvideoqualityof2Dvideomaterialup-convertedfromside-by-sideframecompatible3Dvideoatdifferentbit-ratesandunderboth3Dand2Dviewingconditions.Theresultsconfirmedthelossofvideoqualityofthe2Dvideoup-convertedmaterial.However,thelosswasrathersmall,wasnotaffectedbybitrateandvariedmarginallyforthe2Dand3Dpresentationmodes.


System crosstalk measurement of a time-sequential 3D display using ideal shutter glassesF.H.Chen,K.Huang,L.D.Lin,C.Wu,K.Lee,IndustrialTechnologyResearchInstitute(Taiwan)

themarketofstereoscopic3DTVgrowsupfastrecently;however,for3DTVreallytakingoff,theinteroperabilityofSGtoviewdifferentTVsetsmustbesolved,sodevelopingameasurementmethodtoseparatetime-sequentialstereoscopicdisplaysandshutterglasses(SG)isnecessary.Formeasuringthe3Dperformanceoftime-sequentialstereoscopic3Ddisplays,theopticalcharacterizationoftime-sequentialdisplaysandSGshouldbeseparated.Theadvantagesarethatthesourcesofopticalcharacterizationaredistinguished,andtheinteroperabilityofSGisbroadened.Hence,thispaperproposedan“idealshutterglasses”(ISG),whosenon-idealpropertiesareeliminated,asaplatformtoevaluatetheopticalcharacterizationpurelyfromthedisplay.IntheISGmethod,theilluminanceofthedisplaywasmeasuredintimedomaintoanalyzetheopticalcharacterizationofthedisplay.Inthisexperiment,theISGmethodwasusedtomeasuresystemcrosstalk(SCT)withahigh-speed-responseilluminancemeter.Fromthetime-resolvedilluminancesignals,theslowtimeresponseofliquidcrystalleadingtoSCTisvisualizedclearly.Furthermore,anintriguingphenomenonthatSCTmeasuredthroughSGincreaseswithshorteningviewingdistancewasobserved,anditmayarisefromLCleakageofthedisplayandshutterleakageatlargeviewingangle.Thus,wemeasuredhowLCandshutterleakagedependingonviewingangleandverifiedourargument.Therefore,theISGmethodoffersaplatformtoevaluateSCTfromtime-sequential3Ddisplays,andtheresourcesofSCTaredistinguished.


Guidance for horizontal image translation (HIT) on high definition stereoscopic video productionD.K.Broberg,CableTelevisionLabs.,Inc.(UnitedStates)

Horizontalimagetranslation(HIT)isanelectronicprocessforshiftingtheleft-eyeandright-eyeimageshorizontallyasawaytoalterthestereoscopiccharacteristicsandalignmentof3Dcontentaftersignalshavebeencapturedbythestereoscopiccameras.HITcanbeavaluabletoolinthepostproductionprocessasameanstomodifystereoscopiccontentformorecomfortableviewingontelevisionscreens.Mostcommonlyitcanbeusedtoalterthezeroparallaxsetting(ZPS),tocompensateforstereowindowviolationsortocompensateforexcessivepositiveornegativeparallaxinthesourcematerial.

HITmustbeusedcautiouslyandwithfullawarenessoftheimpactonotherinterrelatedaspectsofthestereography.Asmoreandmorecinematic3Dcontentmigratestotelevisiondistributionchannelstheuseofthistoolwilllikelyexpand.Withappropriateconsideration,HITcanbeusedeffectivelytoadjustsuchcontentformorecomfortable

viewingontelevision.However,disregardofcertainguidelinescanactuallyharmthe3dviewingexperienceofsuchcontentbytelevisionaudiences.Thispaperprovidesguidanceonitsmosteffectiveuseanddescribessomeoftheinterrelationshipsandtrade-offs.Thepaperrecommendstheadoptionofthecinematic2Kvideoformatasa3Dsourcemasterformatforhighdefinitiontelevisiondistributionofstereoscopic3Dvideoprogramming.

7863-27, Session 10

Implementation of autostereoscopic HD projection display with dense horizontal parallaxS.Iwasawa,M.Kawakita,NationalInstituteofInformationandCommunicationsTechnology(Japan);S.Yano,NHKScience&TechnicalResearchLabs.(Japan);H.Ando,NationalInstituteofInformationandCommunicationsTechnology(Japan)

Ourfinalgoalistodevelopadvancedautostereoscopyincaseofnotcompellingviewerstowear3Dglasseson,ornottobeshownunderinsufficientresolutionevenifitfrees3Dglasses.Webelievethatlargerscreensize,higherimagequality,naturalimageappearancesuchmotionparallaxandmultipleviewercapableareprioritytargetsforprofessional3Ddisplayapplications.Bycombiningtheproprietaryscreenandthedevelopedprojectorarray,we’vedesignedandimplementedakindofautostereoscopicprojectiondisplay.Enoughnumberofpixelstorendertruehighdefinitionisassignedforeveryviewpoint.Theinitialimplementationhasmorethan100millionoverallpixels.Anactualobservedhorizontalmotionparallaxisquitesmoothandreducedflipping.Throughoutthisfeasibilitystudy,we’velearnedespeciallyfollowingtwopractices;astrongrequirementof“arrayfriendlyfeature”readyprojector,andexistenceofsomeimageryglitches.Appearancesofmoirésandghostimagesaremostsignificantvisualfatigueontheimplementation.Someoftheseproblemsweretackledandalreadysuppressed.Projectorsforthearrayshouldbepreparedwithamanagementofcolorspaceandbrightness,geometricimagecompensation,andaccurateframesynchronization,andsoforth.Toextractandexaminepracticalproblemsalongwithautostereoscopicprojectiondisplaywasafirststepasafeasibilitystudy.

7863-28, Session 10

Full-parallax 360 degrees horizontal viewing integral imaging using anamorphic opticsM.Erdenebat,G.Baasantseren,J.Park,N.Kim,K.Kwon,ChungbukNationalUniv.(Korea,Republicof)

Weproposedfull-parallaxintegralimagingdisplaywith360degreeshorizontalviewingangle.Theelementalimagesareprojectedbythehigh-speedDMDprojectorandintegratedintothree-dimensionalimagebythelensarray.Thecylindricallenssystemtailorsthehorizontalandverticalviewingangleoftheintegrated3Dimagesinordertoobtainhighangularraydensityinhorizontaldirectionandlargeviewingangleinverticaldirection.Finally,themirrorscreenthatrotatesinsynchronizationwiththeDMDprojectorpresentstheintegratedthree-dimensionalimagestodesireddirectionaccordingly.Bythismethod,full-parallax360degreehorizontalviewinganglethree-dimensionalimageswithbothofmonocularandbinoculardepthcuescanbeachieved.

7863-29, Session 10

Optical characterization of autostereoscopic 3D displaysM.J.Sykora,3MCo.(UnitedStates)

Recently,therehavebeenmanyexcitingannouncementsfor



IS&T /

ReturntoContents

autostereoscopic3D(AS3D)displays;particularlyformobileapplications.ThispaperreviewsthemeasurementofAS3Ddisplaysusingdifferentopticaltechniques.ThequalityandusabilityofanAS3Ddisplayishighlydependantonitsopticalproperties.Someofthepropertiesmeasuredaretheviewingdistance,viewingoffset,biasangle,andcrosstalk.Theanalysistechniquesusedtogleanthecriticalinformationabouttheseopticalpropertieswillbedescribed.AcomparisonismadebetweenvarioustypesofAS3Ddisplays,includingatimesequential3Ddisplay.Themeasurementresultsfromthemetrologymethodsarecomparedwhereeverpossibleandexampleswillbeshown.

7863-30, Session 11

Depth cube display using depth mapB.Song,S.Min,J.Jung,KyungHeeUniv.(Korea,Republicof)

Weproposeadepthcubedisplay(DCD)methodusingdepthmap.Thestructureoftheproposedmethodconsistsoftwoparts:Aprojectionpartcomposedofprojectorforgeneratingimageandatwistednematicliquidcrystaldisplay(TN-LCD)aspolarizationmodulatingdeviceforadjustingproperdepthandadisplaypartcomposedofanair-spacedstackofselectivescatteringpolarizerswhichmaketheincidentlighttoscatterselectivelyasthepolarizationofthelightrays.AnimagefromprojectorwhosedepthisdeterminedaspassingthroughtheTN-LCDdisplayingdepthmapprogressesintothestackofselectivescatteringpolarizersandthenthree-dimensionalimageisgenerated.Atthattime,thepolarizationofeachpolarizerisset0°,45°and90°sequentially,andthentheincidentlightraysarescatteredbydifferentpolarizerasthepolarizationoftheserays.Ifthelightrayhasthepolarizationbetweenthoseofpolarizers,thislightrayisscatteredbymultipolarizersandtheimageofthisrayisgeneratedonairbetweenpolarizers.TheproposedmethodismoresimplestructureandimplementedeasilythanpreviousDCDmethod.Weexplainandverifytheproposedmethod.

7863-31, Session 11

Surface representation of 3D objects for aerial 3D displayH.Ishikawa,H.Watanabe,S.Aoki,H.Saito,KeioUniv.(Japan);S.Shimada,M.Kakehata,Y.Tsukada,NationalInstituteofAdvancedIndustrialScienceandTechnology(Japan);H.Kimura,AerialSystemsInc.(Japan)andBurtonInc.(Japan)

Anewtype3Ddisplay,whichissmallandhigh-speeddesktopaerial3Ddisplay(desktopsystem),hasbeendevelopedbyBurtonInc.andAIST.Evenifthedisplayareaissmall,thedesktopsystemcancreateadotoflightat50kHz.

Inthispresentation,weproposeanovelmethodfordrawingthecomplexsurfaceof3Dobjectsbyvectorscanning,whichissuitabletothedesktopsystem.Theproposedmethodrepresentsthesurfacewithcrosssectionsofanobjectagainsttheverticaldirection.Thismeansthattheobjectisrepresentedbyasetofcontoursonthecrosssections.Thedrawingrouteineachsectionisdeterminedbytheconnectionofpolygonalpatchestakingintoaccountaburdenof3Dscanner.Astheresult,pointsequencedataiscreatedfrompolygonalmodels.

Fromtheexperimentsofdrawing,3Dobjects,forexamplesuchasahand,cansuccessfullybedrawnbytheproposedmethodinthedesktopsystem.Basedontheseexperiments,weconfirmthatitisappropriatetorepresent3Dobjectsbysectioncontours,whenthedisplaysystemwhichgeneratesdotsoflightathighfrequencyandwhichscansathighspeedvectorscanningisused.

7863-33, Session 13

How are crosstalk and ghosting defined in the stereoscopic literature?A.J.Woods,CurtinUniv.ofTechnology(Australia)

Crosstalkisacriticalfactordeterminingtheimagequalityofstereoscopicdisplays.Alsoknownasghostingorleakage,highlevelsofcrosstalkcanmakestereoscopicimageshardtofuseandlackfidelity;henceitisimportanttoachievelowlevelsofcrosstalkinthedevelopmentofhigh-qualitystereoscopicdisplays.Inthewideracademicliterature,thetermscrosstalk,ghostingandleakageareoftenusedinterchangeablyandunfortunatelyveryfewpublicationsactuallyprovideadescriptiveormathematicaldefinitionoftheseterms.Unfortunatelywhendefinitionsareprovidedtheyaresometimescontradictory.

Thispaperreviewshowthetermscrosstalk,ghostingandassociatedterms(systemcrosstalk,viewercrosstalk,gray-to-graycrosstalk,leakage,extinctionandextinctionratio,and3Dcontrast)aredefinedandusedinthestereoscopicliterature.Bothdescriptivedefinitionsandmathematicaldefinitionsareconsidered.

Thepaperwillalsobrieflydiscussliteratureontheperceptionofcrosstalkinstereoscopicdisplays.

7863-34, Session 13

A simple method for measuring crosstalk in stereoscopic displaysM.A.Weissman,TrueVisionSystems(UnitedStates);A.J.Woods,CurtinUniv.ofTechnology(Australia)

Maintaininglowcrosstalkinastereoscopicdisplaysystem-thatis,reducingtheamountof“wrong”imageineacheye-iscriticallyimportantforcomfortableandhigh-quality3Dviewing.Amoderateamountcancauseeyestrain;alargeamountwillpreventfusingthe3Dscene.However,becauseof

-Thelackofmeasurements,

-Thecomplexityofmakingameasurement,or

-Thereluctanceofmanufacturerstoreleasemeasurementdata,

itisoftendifficultfortheusertoknowhowmuchcrosstalkisinanyparticulardisplay.

Wewillproposehereasimplemethodofmeasuringcrosstalk(alsoknownas“ghosting”),onethatreliesononlyviewingtestpatternsonthedisplay.Noelectronicoropticalinstrumentsareneeded.Ourhopeisthatthistoolcanbedistributedwidelyandwillleadtothecollectionofconsistentinformationabout3Ddisplays,andtherefore,totheproductionofthebeststereoscopicdisplayspossible.

Wewillalsopresenttheresultsofopticalmeasurementsthatconfirmthatthesimplemethodgivesgoodcrosstalkmeasurements

7863-35, Session 13

Ergonomic evaluation of crosstalk in stereoscopy through heart activity and forehead blood flowS.Toyosawa,H.Morikawa,K.Nakano,T.Kawai,WasedaUniv.(Japan);C.Chen,H.Chang,J.Yang,IndustrialTechnologyResearchInstitute(Taiwan)

Crosstalkisaphenomenainstereoscopywhereobjectsbecomeblurryduetoleakageoftheleftimageintotherighteyeandviceversa,andconsideredoneofthemostseriousproblemsinstereoscopy.Thecurrentstudyaimsatexaminingmentalactivityunderavariouslevelofcrosstalkthroughheartactivityandforeheadbloodflow.Intheexperimentthatpresentedthreestillimagesandonevideowithavariouscrosstalkratios,heartrateshowedtri-phasicpattern:decelerative-accelerative-decelerative-accelerativeforalltheimage



IS&T /

ReturntoContents

types.Thepatternsuggeststhechangeinmentalstateinaccordancetothecrosstalklevel:i.e.orientationresponseunderno-crosstalk,mentalelaborationuponnoticingthepresenceofcrosstalk,reducedlevelofelaborationascrosstalkprogressed,andstressedstatewhenthecrosstalkexceedstolerancelimit.However,thepatternsintheratioofthelowandhighfrequencycomponentoftheheartratevariability(LF/HF)andforeheadbloodflowshowedslightdeviationfromtheheartratepatterninsomestimulustypes.Thissuggeststhatthementalstatesundercrosstalkedimageviewingcouldbemorecomplexthansimplecombinationoforientationresponseandmentalelaboration.

7863-36, Session 13

Optical characterization of shutter glasses stereoscopic 3D displaysP.M.Boher,T.R.Leroux,V.Collomb-Patton,ELDIM(France)

Ashutterglassesstereoscopic3Ddisplayisacombinationofonedisplayworkingathighfrequencyandliquidcrystalshutterglasses.Bothcomponentshavetheirownimperfectionsthatmustbetakenintoaccountsimultaneouslytomeasuretheperformancesofsuchsystems.Intheproposedpapera3DreadySamsungSyncMaster2233RZ120HzLCDdisplaycoupledwithaNVIDIA3Dvisionsystemismeasured.Transmittanceandresponsetimeoftheshutterglassesaremeasuredusingastaticlightsource.GreytogreylevelresponsetimesandluminancetargetsoftheLCDarealsomeasured.Finally,thetemporalbehaviorofthecompletesystemismodeledandgreytogreyluminanceacrossshutterglassesarededuced.Visualimpactischeckedusinggreyleveltestpatternsandimagingcolorimeter.Greylevelvariationsduetothecrosstalkbetweenthetwoeyesandthetemporalsynchronizationarethemainsourceofimperfectionforthistypeofdisplay.

7863-37, Session 13

The effect of crosstalk on depth magnitude in thin structuresI.Tsirlin,R.S.Allison,L.M.Wilcox,YorkUniv.(Canada)

Stereoscopicdisplaysmustpresentseparateimagestotheviewer’sleftandrighteyes.Crosstalkistheunwantedcontaminationofoneeye’simagefromtheimageoftheothereye.Ithasbeenshowntocausedistortions,reduceimagequalityandvisualcomfortandincreaseperceivedworkloadwhenperformingvisualtasks.Crosstalkalsoaffectsone’sabilitytoperceivestereoscopicdepthalthoughlittleconsiderationhasbeengiventotheperceptionofdepthmagnitudeinthepresenceofcrosstalk.Inthispaperweextendapreviousstudy(Tsirlin,Allison&Wilcox,2010,submitted)ontheperceptionofdepthmagnitudeinstereoscopicoccludingandnon-occludingsurfacestothespecialcaseofcrosstalkinthinstructures.Crosstalkinthinstructuresdiffersqualitativelyfromthatinlargerobjectsduetotheseparationoftheghostandrealimagesandthustheoreticallycouldhavedistinctperceptualconsequences.Toaddressthisquestionweusedapsychophysicalparadigm,whereobserversestimatedtheperceiveddepthdifferencebetweentwothinverticalbarsusingameasurementscale.Ourdatashowthatcrosstalkdegradesperceiveddepth.Ascrosstalklevelsincreasedthemagnitudeofperceiveddepthdecreased,especiallyforstimuliwithlargerrelativedisparities.Incontrasttotheeffectofcrosstalkondepthmagnitudeinlargerobjects,inthinstructures,asignificantdetrimentaleffectwasfoundatalldisparities.Ourfindings,whenconsideredwiththeotherperceptualconsequencesofcrosstalk,suggestthatitspresenceinS3Dmediaeveninmodestamountswillreduceobservers’satisfaction.

7863-38, Session 14

Effects of stereoscopic presentation on visually induced motion sicknessH.Ujike,H.Watanabe,NationalInstituteofAdvancedIndustrial

ScienceandTechnology(Japan)

ThepresentstudyinvestigateswhetherVIMS,whichcanbeinducedin2Dimages,isaffectedbystereoscopicpresentation.Todothis,weconductedanexperimenttomeasuretheeffectspsychologicallyandphysiologically.Thirty-fiveadults,aged21-77years,participatedintheexperiment.Visualstimuluswascomputergraphicsthatsimulatestravelingalongstreetsfor10minutes.Thevisualrotations,(+/-30deg,0.167Hz),alongpitchandrollaxeswerealternativelyaddedatintervals.Thestimuluswerecreatedaseitherthestereoscopic,“3D”,or“2D”images,andtheywerepresentedon3DLCdisplays.Eachobserverwatchedboth2Dand3Dimageswithone-hourrestbetweenthem.WemeasuredSimulatorSicknessQuestionnaire(SSQ)beforeandaftereachtrial,andsubjectivecomfortleveleveryoneminuteduringeachtrial.Moreover,wemeasuredelectrocardiogram,plethysmograph,respirationasindicesofautonomicnervousactivity.TheresultsshowedthathigherSSQscoresandlowercomfortablelevelforthe3Dimagethanforthe2Dimage.Moreover,%RR50,whichistheindexofparasympatheticnerveactivity,clearlydecreasedmoreforthe3Dimagethanforthe2Dimage.WeconcludethatstereoscopicpresentationenhancesbiomedicaleffectsofVIMS.Wespeculatethatstereoscopicimagescanbeefficientreferenceofspatialorientation.

7863-39, Session 14

Vergence and accommodation to multiple-image-plane stereoscopic displays: ‘Real world’ responses with practical image-plane separations?K.J.MacKenzie,R.Dickson,S.J.Watt,BangorUniv.(UnitedKingdom)

Conventionalstereoscopicdisplayspresentimagesonasinglefocalplane.Theresulting‘conflict’betweenthestimulitotheeyes’focusingresponse(accommodation)andtoconvergencecausesfatigueandpoorstereoperformance.Onepromisingsolutionistodistributeimageintensityacrossanumberofrelativelywidelyspacedimageplanes-atechniquereferredtoasdepthfiltering.Previously,wefoundthiselicitsaccurate,continuousmonocularaccommodationresponseswithimage-planeseparationsupto~1.1Diopters(MacKenzieetal.,2010),suggestingthatarelativelysmall(i.e.practical)numberofimageplanesissufficienttoeliminatevergence-accommodationconflictsoveralargerangeofsimulateddistances.However,accommodationresponsesovershootsystematicallywhenthesamestimuliareviewedbinocularly,duetoconvergence-drivenaccommodation(MacKenzie&Watt,2010).Here,weexaminedtheminimumimage-planespacingrequiredforaccurateaccommodationtobinoculardepth-filteredimages.Wecomparedaccommodation(andvergence)responsestostepchangesindepth(0.3-1.2D)fordepth-filteredstimuli,usingimage-planeseparationsof0.6-1.2D,andequivalentrealstimuli.Accommodationresponseswereaccurateforimage-planeseparationsof~0.6-0.9D.Thus,depthfilteringcanbeusedtopreciselymatchaccommodationandvergencedemandinapracticalstereoscopicdisplay,usingarelativelysmallnumberofimageplanes.

7863-40, Session 14

Both efficiency measures and perceived workload sensitive for manipulations in binocular disparityM.vanBeurden,W.Ijsselsteijn,TechnischeUniv.Eindhoven(Netherlands)

Stereoscopicdisplaysareknowntoofferanumberofkeyadvantagesinvisualizingcomplex3Dstructuresordatasets.Thelargemajorityofstudiesthatfocusonevaluatingstereoscopicdisplaysforprofessionalapplicationsusecompletiontimeand/orpercentagecorrectanswerstomeasurepotentialperformanceadvantages.However,bothcompletiontimeandaccuracymightnotfullyreflectallthebenefitsofstereoscopic



IS&T /

ReturntoContents

displays.Inthispaper,wearguethatperceivedworkloadisanadditionalvaluableindicatorreflectingtheextenttowhichuserscanbenefitfromusingstereoscopicdisplays.Overall,theresultsshowedthattheperformance(completiontimeandaccuracy)wasoptimalaround25minofarc,andsignificantlydecreasedforadisparitylevelof50minarc.Furtherperceivedmentalworkloaddecreasedwithincreasingdisparity.Whenthedisparitylevelbecomes50minofarc,perceivedworkloadsignificantlyincreased.Perceiveddiscomfortgraduallyincreasedwithincreasingdisparitylevels.Theresultsfurthersuggestthatperceivedworkloadwasshowntobesensitiveforvariationsindisparityandthereforeintroducesapromisingtheoreticalconceptaswellasausefulmeasurementtooltoaidhumanfactorsresearchonstereoscopicdisplays.

7863-41, Session 14

Comparison of relative (mouse-like) and absolute (tablet-like) interaction with a large stereoscopic work-spaceM.Averkiou,N.A.Dodgson,Univ.ofCambridge(UnitedKingdom)

Wecomparetwodifferentmodesofinteractionwithalargestereoscopicdisplay.Inabsolutemode,thephysicalpointer’spositionexactlymapstopositioninthedisplayvolume,analogoustoa2Dgraphicstableand2Dscreen.Inrelativemode,theconnectionisbetweenthephysicalpointer’smotionandthemotionofthepointerinthevolumeisanalogoustothatobtainedwitha2Dmouseand2Dscreen.

Bothstatisticalanalysisandparticipants’feedbackindicatedastrongpreferenceforabsolutemodeoverrelativemode.Thisisincontrastto2Ddisplayswhererelativemode(mouse)isfarmoreprevalentthanabsolutemode(tablet).Wealsocomparedhead-trackingagainstnohead-tracking.Therewasnostatistically-significantadvantagetousinghead-tracking,howeveralmostallparticipantsstronglyfavouredhead-tracking.

7863-42, Session 15

Optimal design and critical analysis of a high resolution video plenoptic demonstratorV.Drazic,J.Sacré,J.Bertrand,A.Schubert,Technicolor(France)

Aplenopticcameraisanaturalmulti-viewacquisitiondevicealsocapableofmeasuringdistancesbycorrelatingasetofimagesacquiredunderdifferentparallaxes.Itssinglelensandsinglesensorarchitecturehavetwodownsides:limitedresolutionanddepthsensitivity.Inaveryfirststepandinordertocircumventthoseshortcomings,wehaveinvestigatedhowthebasicdesignparametersofaplenopticcameraoptimizeboththeresolutionofeachviewandalsoitsdepthmeasuringcapability.InasecondstepwehavebuiltaprototypebasedonaveryhighresolutionRedonemoviecamerawithanexternalplenopticadapterandarelaylens.Theprototypedelivers5videoviewsof820x410.Themainlimitationinourprototypeisviewcrosstalkduetoopticalaberrationswhichreducethedepthaccuracyperformance.Wehavesimulatedsomelimitingopticalaberrationsandpredictedtheimpactontheperformancesofthecamera.Wehavedevelopedadjustmentprotocolsbasedonasimplepatternandanalyzingprogramswhichinvestigatetheviewmappingandamountofparallaxcrosstalkonthesensoronapixelbasis.Theresultsofthesedevelopmentsenableustoadjustthelensletarraywithasubmicrometerprecisionandtomarkthepixelsofthesensorwheretheviewsdonotregisterproperly.

7863-43, Session 15

Geometric and subjective analysis of stereoscopic I3A cluster imagesM.Kytö,J.Hakala,P.Oittinen,AaltoUniv.SchoolofScienceandTechnology(Finland)

Itcanbeexpectedthatstereoscopicphotographywillbeincorporatedinmobilephonesinnearfuture.ThetypicalscenesinmobilephonephotosaredividedintoclustersinInternationalImagingIndustryAssociation’s(I3A)CameraPhoneImageQualityInitiative.ThispaperpresentsageometricandsubjectiveanalysisforstereoscopicversionsoffourI3Aclusters.

Thegeometryofthestereoscopicpipelinefromscenetoviewer’seyeisaveryrelevantissueinstereoscopicmedia.Oneimportantfactoristhecameraseparation,becauseitcanbeusedtocontroltheperceiveddepthofstereoscopicimages.Thecomputationalcameraseparationswerecomparedtosubjectivelypreferredcameraseparations.

Participantsevaluatedthestrengthandnaturalnessofdepthsensationandoverallviewingexperiencefromthestillimageswithsingle-stimulusmethod.Resultsshowedthatparticipantswereabletoperceivethechangeofdepthscaleeventhoughtheimageswereshowninrandomorderwithoutareferencedepthscale.

Themilddepthsensationwaspreferredoverstrongdepthsensation.Thecomputationalcameraseparationdifferedfromthesubjectivelypreferredcameraseparationwhenthedepthrangeofthescenewasnarrow.Thisresultindicatesthatsceneswithnarrowdepthshouldnotbeimagedwithalongcameraseparationjusttofillthedepthbudgetofthedisplay.

7863-44, Session 16

The Dynamic Floating Window: a new creative tool for 3D moviesB.R.Gardner,Independent3DConsultant(UnitedStates)

Unliketherealworld,stereoscopiccinemashaveaborderedframewhichcanunnaturallycutofftheimage,creatingconflictingvisualcues.Thiscandiminishthe3-Deffectandcausevisualfatigue.

OBJECTIVE:Asolutionissoughtwhichmeetsfourkeycriteria:

(1)remove“WindowViolations”(visualconflict)

(2)offercontrolstominimize“RetinalRivalryZones”

(3)“invisible”-it’suseisneitherapparentnordistractingtoaudiences

(4)controllable-shouldsupportfilmmaker’sstorytelling

METHOD:In1952,SpottiswoodeappliedastaticFloatWindowtechniquetotheshortfilm,“TheBlackSwan”.

Inthispaper,theconceptofthestaticFloatingWindowisgreatlyexpandedtomatchthedynamicnatureofmovies.

Bypositioningadynamicmaskatthebordersofthe3-Dmovie,thenanimatingtheLeftandRighteyestereoscopicbordermasksdifferentially,astereoscopicparallaxiscreated.Thusthescreenbordersappearto“float”offofthedevicescreeninthreedimensions.Byvaryingtheborderwidthsandangles,thisWindowcanevenbere-orientedtobenon-paralleltothedisplayscreen.ThisdecouplestheperceivedWindow(screenborder)fromthescreen,makingitacontinuouslycontrollableelementbythe3-Dfilmmaker.

TheDynamicFloatingWindowhasbeensuccessfullyusedtoachieveallfourofthestatedObjectives,andthey’vebeenappliedtooveradozen3-Dfeaturefilms.

7863-45, Session 16

Stereo video inpaintingF.Raimbault,A.Kokaram,TrinityCollegeDublin(Ireland)

Astheproductionofstereoscopiccontentincreases,sodoestheneed



IS&T /

ReturntoContents

forpost-productiontoolsforthatcontent.Videoinpaintinghasbecomeanimportanttoolforrigremovalbuttherehasbeenlittleconsiderationoftheprobleminstereo.Analgorithmforstereovideoinpaintingthatbuildsonexistingexemplarbasedworkandalsoconsiderstheissuesofviewconsistencywillbepresented.

Givenuserselectedpatchesinthesequencewhichmaybeinthesamelocationinseveralframesandinbothviews,theobjectiveistofillinthispatchusingalltheavailablepictureinformation.Existingalgorithmslacktemporalconsistency,causingflickeringandotherartefacts.Theuseoflongtermpictureinformationacrossmanyframesinordertoachievetemporalconsistencyatthesametimeasexploitinginter-viewdependencieswillbediscussed.

Thecoreoftheprocessisbuiltonfindingmatchingpatchesinsurroundingpictureinformationbyextendinganexemplar-basedframework.Asampleareaisconstructedfromframesintime,views(usinginterviewdisparityvectors)andinthecurrentframe.Matchingpatchesinthoseframesareusedtofillthemissingholerecursivelypixelbypixel.

7863-46, Session 16

A modified non-local mean inpainting technique for occlusion filling in depth-image based renderingL.Azzari,F.Battisti,Univ.degliStudidiRomaTre(Italy);A.P.Gotchev,TampereUniv.ofTechnology(Finland);M.Carli,Univ.degliStudidiRomaTre(Italy);K.Egiazarian,TampereUniv.ofTechnology(Finland)

Atechniqueforfillingdisocclusionholesarisingfromdepth-imagebasedrenderingisproposed.Itadaptsastate-of-the-artexemplar-basedinpaintingalgorithmtothespecificsofdepth-basedviewsynthesisin3Dvideodisplay.Modificationsaresuggestedintheso-calledprioritymapandalsointhewaypatchesaresearchedforsimilarityandtobeusedinformingnon-localmeanestimates.

Objectiveandsubjectivetestshavebeenconductedtoevaluatetheperformanceoftheproposedtechniqueagainststate-of-the-artocclusion-fillingapproachesandtheresultsshowanimprovedperformancealongwithanimprovedefficiencycomparedwiththeoriginalmethod.

7863-47, Session 16

A study on the stereoscopic codecs for non-real time 3DTV servicesB.Lee,ElectronicsandTelecommunicationsResearchInstitute(Korea,Republicof)

Thispaperpresentsastudyonthestereoscopic3Dcodecforthenon-realtime3DTVservices.FortheDTV(DigitalTelevision)systemwhereitsbandwidthislimitedtoaccommodatethefull3DHDqualityvideo,acomplementaryenableristhenon-realtimedeliveryschemewhich3Dvideocomponentisdownloadedinadvance.Fromthecodecperspective,ifstereovideosarecodedinindependently,thecoding/decodingcanbesimplymanagedcomparedtothecasewherestereovideosareinterviewpredictioncoded.InDTVenvironmentwhereitisbuiltonATSC(AdvancedTelevisionSystemCommittee)standard,makingachoiceofanoptimalcodecisregardedasoneofthekeyissuesfor3DTVservices.WhenweviewtheNRTservice,thechoiceofcodecisalsoaprimaryconcernbecausethecombinationofcodecandtheuseofinterviewpredictionhaveaneffectontheperformanceofNRTservices.SointhispaperthecombinationsofavailablecodecsareevaluatedandalsotheperformancecomparisonsareconductedtofindtheoptimalcodecforNRTservices.Amongstvariouscodeccombinations,weevaluatedMPEG2+MPEG2,MPEG2+H.264Simulcast,MPEG2+H/264InterviewPredictionandMVC.Basedonthisevaluation,thispaperalsoaddressestheoptimalconditionofNRTdelivery.ItcoverstheNRTscenarios,3Dobjectsize,objectsegmentationandaggregation,scheduling,anddeliveryscheme.

7863-48, Session 16

A modular cross-platform GPU-based approach for flexible 3D video playbackR.Olsson,H.Andersson,M.Sjöström,MidSwedenUniv.(Sweden)

Differentcompressionformatsforstereoandmultiviewbased3Dvideoisbeingstandardizedandsoftwareplayerscapableofdecodingandpresentingtheseformatsontodifferentdisplaytypesisavitalpartinthecommercializationandevolutionof3Dvideo.

ThispaperdescribesthedesignandimplementationofaGPU-basedreal-time3Dvideoplaybacksolution,builtontopofcross-platform,opensourcelibrariesforvideodecodingandhardwareacceleratedgraphics.Asoftwarearchitectureisproposedthatefficientlyprocessandpresentshighdefinition3Dvideoinreal-timeandinaflexiblemannersupportbothcurrent3Dvideoformatsandemergingstandards.Thepresentedsolutionisastand-alone3DvideoplayerapplicationbuiltontopofFFmpeg,usinglibavformatformediacontainerdemultiplexingandlibavcodecforvideodecoding.Toincreasemodularityandflexibilitythe3Dvideoplayerfunctionalityhasbeendividedintotwoseparatecomponents:avideoplayeranda3Dvideofilter.

Theproposedprototype3Dvideoplaybacksolutionshowsthatitispossibletobuilda3Dvideoplayerapplicationrelyingcompletelyuponopen-sourceandcross-platformlibraries.Compressedtiledandmulti-viewvideoformatsuptoresolutionsof1080phavesuccessfullybeenverifiedtobedisplayedattheirintendednativeframerate.



IS&T /

ReturntoContents

Conference 7864A: 3D Imaging MetrologyMonday-Tuesday24-25January2011PartofProceedingsofSPIEVol.7864A3DImagingMetrology

7864A-01, Session 1

Traceable hierarchical procedures for dimensional metrologyD.K.MacKinnon,J.A.Beraldin,L.Cournoyer,B.Carrier,NationalResearchCouncilCanada(Canada)

Wepresentaseriesofdimensionalmetrologyproceduresthathavebeeneitherdesignedormodifiedfromexistingprocedurestoensuretraceabilityofeachmetricfromthecertifiedreferencesurfacetothecertifyinglaboratory.Thesemetricsaredividedintosurfaceformprecision,surfacefittrueness,andsurfaceresponse.Theproceduresforgeneratingthesemetricswouldformthebasisofavolumetricanalysisofthecharacteristicprofileofa3Dimagingsystem.Weuseahierarchicalapproachinwhicheachmetricbuildsoneithercertifiedreferencevaluesorpreviously-generatedcharacteristicvalues.StartingfromsimpleplanarandsphericalsurfacesusingfittingproceduresrecommendedbyNIST,wedemonstratehowmetricsforplaneformspread(flatness),sphereformspread,sphereformerror,sphere-spacingerror,plane-spacingerror,planaruncertaintyresolution,intensityresolution,andspatialresolutionarebuiltuponeachother.Bothsimulatedandrealdataareusedtodemonstratehowtheseproceduresareusedaspartofaprocessforcharacterizingtheperformanceofa3Dimagingsystem.

7864A-02, Session 1

Harmonic distortion free distance estimation in ToF cameraB.Kang,S.Kim,K.Lee,J.D.K.Kim,C.Kim,SamsungAdvancedInstituteofTechnology(Korea,Republicof)

ATime-of-Flight(ToF)depthcameracapturesthedistancefromthecameratoanobjectusinganearinfrared(NIR).ThedistancecanbecalculatedfromthephaseshiftbetweentheemittedandreflectedNIR.ToFdepthcamerasusuallymodulatetheNIRwithasquarewaveratherthanasinusoidalwaveduetoitsdifficultyinhardwareimplementation.Previousmethodusesasimpletrigonometricfunctiontoestimatethephaseshiftusingthedifferenceofelectronsgeneratedbythereflectedsquarewavesothattheestimatedphaseshiftcanincludeaharmonicdistortion.Thisisbecausethephaseshiftshouldbelinearlyproportionaltothedifferenceofelectronsalongthedistancetoanobject.Thetrigonometricfunction,however,nonlinearlyestimatesthephaseshift.Inthispaper,weproposeanewestimationmethodbasedonthesignofthedifferenceofelectronstoreducethedistortionofthephaseshift.Forquantitativeevaluation,thepreviousandproposedmethodsaretestedonourprototypeToFdepthcamera.Experimentalresultsshowthatthedistancecalculatedfromtheproposedmethodismoreaccuratethanthatfromthepreviousone.

7864A-03, Session 1

Separating true range measurements from multi-path and scattering interference in commercial range camerasA.A.Dorrington,J.P.Godbaz,M.J.Cree,A.D.Payne,L.V.Streeter,TheUniv.ofWaikato(NewZealand)

Time-of-flightrangecamerasacquireathree-dimensionalimageofascenesimultaneouslyforallpixelsfromasingleviewinglocation.Attemptstouserangecamerasformetrologyapplicationshavebeenhamperedbythemulti-pathproblem,whichcausesrangedistortionswhenstraylightinterfereswiththerangemeasurementinagivenpixel.Correctingmulti-pathdistortionsbypost-processingthethree-dimensionalmeasurementdatahasbeeninvestigated,butenjoys

limitedsuccessbecausetheinterferenceishighlyscenedependent.Analternativeapproachbasedonseparatingthestrongestandweakersourcesoflightreturnedtoeachpixel,priortorangedecoding,ismoresuccessful,buthasonlybeendemonstratedoncustombuiltrangecameras,andhasnotbeensuitableforgeneralmetrologyapplications.InthispaperwedemonstrateanalgorithmappliedtoboththeMesaImagingSR4000andCanestaInc.XZ422Demonstratorunmodifiedoff-the-shelfrangecameras.Additionalrawimagesareacquiredandprocessedusinganoptimizationapproach,ratherthanrelyingontheprocessingprovidedbythemanufacturer,todeterminetheindividualcomponentreturnsineachpixel.Substantialimprovementsinaccuracyareobserved,especiallyinthedarkerregionsofthescene.

7864A-04, Session 1

3D imaging studies of rigid-fiber sedimentationD.W.Vahey,T.Scott,U.S.ForestService(UnitedStates);E.J.Tozzi,Univ.ofCalifornia,Davis(UnitedStates);D.J.Klingenberg,Univ.ofWisconsin-Madison(UnitedStates)

Fibersareindustriallyimportantparticlesthatexperiencecouplingbetweenrotationalandtranslationalmotionduringsedimentation.Thisleadstohelicaltrajectoriesthathavebeenpoorlyunderstoodfromboththeoreticalandexperimentalperspectives.

Sedimentationexperimentsandhydrodynamicanalysiswereperformedonelevencopper“fibers”ofaveragelength10.5mmanddiameter0.20mm.Eachfibercontainedthreelinearbutnon-coplanarsegments.Thefiberswerecharacterizedbytheir2Dprojectionsonorthogonalplanes.

Thefibersweresequentiallyreleasedintosiliconeoilcontainedinatransparentcylinderofsquarecrosssection.Identical,synchronizedcamerasweremountedtoamoveableplatformandimagedthecylinderfromorthogonaldirections.Thecameraswerefixedinpositionduringthetimethatafiberremainedinthefieldofview.Subsequently,thecameraswerecontrollablymovedtothenext,loweredfieldofview.Thetrajectoryofdescendingfiberswasfollowedoverarangeof250mm.

Customsoftwarewaswrittentodecouplefiberorientationandtrajectoryfromthe3Dimages.Fiberswithsimilarterminalvelocitywerefoundtohavehighlyvariableangularvelocities.Bothterminalandangularvelocitieswerewell-predictedbytheory.Helicalradiuswashardtopredictwhenangularvelocitywassmall,probablyreflectingerrorsinmeasuringfibershape.

7864A-05, Session 1

Depth upsampling method using the confidence map for a fusion of a high resolution color sensor and low resolution time-of-flight depth sensorK.Bae,K.Kyung,T.Kim,SAMSUNGElectronicsCo.,Ltd.(Korea,Republicof)

Thispaperproposesadepthupsamplingmethodusingtheconfidencemapforafusionofahighresolutioncolorsensorandlowresolutiontime-of-flightdepthsensor.Theconfidencemaprepresentstheaccuracyofdepthdependingonthereflectanceofameasuredobjectandisestimatedwithamplitude,offset,andreconstructederrorofareceivedsignal.Theproposedmethodsuppressesthedepthartifactsthatarecausedbydifferencebetweenlowandhighreflectivematerialsonanobjectatadistance.Althoughthesurfaceofanobjectislocatedatthesamedistance,thereflectanceofsmallregionswithinthesurfacedependsonconstituentmaterials.Weightedfilter


IS&T /

ReturntoContents

Conference 7864A: 3D Imaging Metrology

generatedbyconfidencemapisaddedtomodifiednoise-awarefilterfordepthupsampling(NAFDU)thatisproposedbyDereketal.,andisadaptivelyselected.Theproposedmethodconsistsoffollowings;thenormalization,thereconstruction,theconfidencemapestimation,andthemodifiednoise-awarefiltering.Inthenormalization,amplitudesandoffsetsofreceivedsignalsarecalculatedandreceivedsignalsarenormalizedbythose.Thenormalizedsignalsaredenoised,andthenthephaseshiftsaremeasuredbetweentransmittedandreceivedsignals.Inthereconstruction,receivedsignalsarereconstructedusingonlythevaluesofphaseshiftsandthereconstructionerrorsarecalculated.Theconfidencemapisestimatedwithamplitudes,offsets,andreconstructionerrors.Thecoefficientsofamodifiednoise-awarefilterareadaptivelyselectedbyreferringtotheconfidencemap.Theproposedmethodshowstheenhancedresultsofremovingdepthartifactsintheexperiments.

7864A-06, Session 2

Instrument for 3D characterization of autostereoscopic displaysJ.Prevoteau,Univ.deReimsChampagne-Ardenne(France)and3DTVSolutions(France);S.Chalençon-Piotin,Univ.deReimsChampagne-Ardenne(France);D.G.Debons,3DTVSolutions(France);L.Lucas,Y.Remion,Univ.deReimsChampagne-Ardenne(France)and3DTVSolutions(France)

Wenowhavenumerousautostereoscopicdisplays,anditismandatorytocharacterizethembecauseitwillallowtooptimizetheirperformancesandtomakeefficientcomparisonbetweenthem.Thereforeweneedstandardssowehavetobeabletoquantifythequalityoftheviewer’sperception.Thepurposeofthepresentpaperistwofold;wefirstpresentanewinstrumentofcharacterizationofthe3Dperceptiononagivenautostereoscopicdisplays;thenweproposeanewwaytorealizeanexperimentalprotocolallowingtogetafullcharacterization.Thisinstrumentwillallowustocompareefficientlythedifferentautostereoscopicdisplaysbutitwillalsovalidatepracticallytheadequacybetweentheshootingandrenderinggeometries.Inthisaim,wearegoingtomatchaperceivedscenewiththevirtualscene.Itishardlypossibletodeterminatethesceneperceivedbyaviewerplacedinfrontofanautostereoscopicdisplay.Indeedifitmaybeexecutableonthepop-out,itisimpossibleonthedeptheffectbecausethedepthofthevirtualsceneissetbehindthescreen.Therefore,wewillhavetouseanopticalillusionbasedonthedeflectionoflightbyamirrortoknowthepositionwhichtheviewerperceivessomepointsofthevirtualsceneonanautostereoscopicdisplay.

7864A-07, Session 2

Accurate stereo matching based on multiband imagingM.Doi,A.Minami,OsakaElectro-CommunicationUniv.(Japan);S.Tominaga,ChibaUniv.(Japan)

Theaccuracyofstereomatchingdependsonprecisedetectionofcorrespondingpointsinapairofstereoimagesbytemplatematching.Amultibandimagingsystemcapturesmorethanthreechannelsinavisiblerange.Themultibandimagingtechniqueisusefulforimprovingtheaccuracyofthestereomatching.Inthispaper,weproposeanimagingsystemandanalgorithmforstereomatchingbasedonmultibandimages.Theimagingsystemiscomposedofaliquid-crystaltunablefilterandahighsensitivemonochromecamera.InourmodifiedSSDAalgorithm,thesimilarityiscalculatedforeachbandinthedescendingorderofthevarianceofbandimageintensityintemplate.Sincethetemporarysimilarityatnon-candidatepointsexceedsthresholdquicklyinbandwithlargevariance,theprocessingtimeisshortenedbyquickinterruptionofthecalculationatthepoint.ExperimentalresultsshowthatmultibandstereomatchingisaccuratecomparedwithRGBstereomatching.Asheetofcolortexturepatcheswithsmallcolordifferenceswasusedasameasurementtarget.Therateofdetectionofcorrectcorrespondingpointswas98.4%bymultibandstereomatching,whiletheratewas34.7%byRGBstereo

matching.Moreover,useofthemodifiedSSDAreduced17%oftheCPUtime.

7864A-24, Session 2

Flash trajectory imaging of target 3D motionX.Wang,Y.Zhou,S.Fan,J.He,Y.Liu,InstituteofSemiconductors(China)

Determinationof3Dmotionparametersandimagingofmovingtargetshavebeenresearchhotspotsinrecentyears.Inastronomy,remotesensing,trackingtargetsandestimatingtheirmotionparametersareveryimportant.Traditionaltrajectorypredictiontechniquebyimagesequencesandcorrespondingimagingprocessingarecomplicated,especiallyfortargetsincomplexbackground.Inordertosolvetheaboveproblems,wepresentaflashtrajectoryimaging(FTI)techniquefortarget3Dmotion.Thistechniqueusesapulsedlasertoilluminatetargetsandacamerawithamicrochannelplatetotakeimages.Themicrochannelplateactsasbothanamplifierandashutter.IntheFTI,timedelayintegrationandtimeslicingareused.Formovingtargets,themodeoftimedelayintegrationincreasesinformationofonesingleframeimagesothatonecandirectlygainthemovingtrajectory.Timeslicinggivestherangeoftargetsandrealizessilhouettedetectionwhichcandirectlyextracttargetsfromcomplexbackground.Therefore,thecomplexityofimageprocessingdecreases.Bytwosuccessiveframesincludingtargettrajectory,themotionparametersandflightattitudecanbegiven.Sincetheminimumgatingrateperframeisoneandthemaximumcanreachseveralmillions,theFTIcaneffectivelyimageloworhighspeedmovingtargetsandalsogivetheir3Dmotionparameters.Inaddition,themanneroftimeslicingmakestheFTIhaspropertiessuchashighsuppressionofbackscatterfromfogandotherobscurants,highsignal-to-noiseratio,andlongdetectionrange.FortheFTI,wehaveresearcheditinexperimentsandalsostudiedthealgorithmaboutit.OurresearchdemonstratesthattheFTIisaneffectiveapproachtodeterminethemotionparametersof3Dmotiontargetsandimagemovingtargets.

7864A-10, Session 3

The ASTM E57 file format for 3D imaging data exchangeD.Huber,CarnegieMellonUniv.(UnitedStates)

Thereiscurrentlynogeneral-purpose,openstandardforstoringdataproducedbythreedimensional(3D)imagingsystems,suchaslaserscanners.Asaresult,producersandconsumersofsuchdatarelyonproprietaryorad-hocformatstostoreandexchangedata.Thereisacriticalneedinthe3Dimagingindustryforopenstandardsthatpromotedatainteroperabilityamong3Dimaginghardwareandsoftwaresystems.Forthepasttwoyears,agroupofvolunteershasbeenworkingwithintheASTME57Committeeon3DImagingSystemstodevelopanopenstandardfor3Dimagingsystemdataexchangetomeetthisneed.TheE57FileFormatfor3DImagingDataExchange(E57formathereafter)iscapableofstoringpointclouddatafromlaserscannersandother3Dimagingsystems,aswellasassociated2Dimageryandcoremeta-data.Thispaperwilldescribethemotivation,requirements,design,andimplementationoftheE57format,andwillhighlightthetechnicalconceptsdevelopedforthestandard.Wewillalsocomparetheformatwithotherproprietaryorspecialpurpose3Dimagingformats,suchastheLASformat,andwewilldiscussandanalyzetheopensourcelibraryimplementationdesignedtoread,write,andvalidateE57files.


IS&T /

ReturntoContents

7864A-11, Session 3

The impact of different alignment strategies on the overall performance of a white light scanner according to the uncertainty especial according to sphere spacing error specified in VDI 2634E.Klaas,BreuckmannGmbH(Germany)

Thispaperisaboutaccuracyofopticalwhitelightorsocalled“topometric”scanners.Inalmostanyapplicationofsuchscannersitisnecessarytoputtogetherscansfromdifferentdirections:fromacoupleofscanstoacoupleofhundredscans.Accuracycanbeusuallywelldescribedforasinglescan,whereasaccuracyforthoseassembleddatasetsishardertoestimateandspecify:itdependsonmuchmoreparametersaswellasonthealignmentstrategybeingused.Thispaperwilldescribedifferentalignmentstrategiesincludingusingrobotsandtrackingsystems,targetsaswellasusingbestfittingmethods.Theimpactofthesemethodsontheresultingoverallaccuracyisdescribedanddemonstratedusingrealliveexamples.AlsodifferentmethodsofachievingtheseaccuracynumbersarebeingpresentedincludingguidelinessuchasVDI2634.Itwillbrieflytouchonthebasicprincipleofwhitelightscanningtounderstandthepotentialbutalsolimitationsofthesetechnique.

Thisshouldbeausefulguidelineforengineersorqualitymanagerswhowanttoestablishorlearnaboutnewscanningtechnologieswithspecialattentiontoaccuracyissues.

7864A-12, Session 3

Simulation-based determination of local probing uncertainty for fringe projection measurementsJ.Weickmann,A.A.Weckenmann,Friedrich-Alexander-Univ.Erlangen-Nürnberg(Germany)

Fringeprojectionsensorsgaininimportanceinmanufacturingqualitycontrolduetotheirmultipleadvantages.Inordertoadaptthemeasurementstrategytoaspecificinspectiontask,bothasuitablesensorandthenecessarymeasurementshavetobechosen,sothatthecompleteworkpieceshapeisrecordedwithatolerance-compatiblemeasurementuncertainty,accordingtoDINENISO14253-1.Thusareliableforecastofthemeasurementuncertaintyiscrucialforaneffectiveinspection-planningprocedure.Therearemultipleinfluences,whoseimpactsonthemeasurementresultvarydependentonthepositionofthemeasuredpoint.Sothelocalmeasurementuncertaintyateachmeasuredpoint-thatmeansthe‘probinguncertainty’-isindividual.Today,theprobinguncertaintycannotbeforecastedlocally.Thus,theexpecteduncertaintycannotbetakenintoaccount,whenaninspectionisplanned.Thispapershowsasimulation-basedapproachtoeliminatethisshortfall.Firstly,adefinitionforprobinguncertaintyisgiven.Thenthemodelforthesimulationofplannedfringeprojectionmeasurements-includingaGUM-conformantforecastforthelocalprobinguncertainty-isdescribed.Thissimulationisthenimplementedintoaprototypeofanassistancesystemthatsupportstheinspectionplannerwhensettingupthemeasurementstrategy.Finallyamethodfortheexperimentalverificationofthelocalprobinguncertaintyisintroducedandthesimulationresultsareverified.

7864A-13, Session 3

Low cost characterization of TOF range sensors resolutionG.Guidi,M.Russo,G.Magrassi,M.Bordegoni,PolitecnicodiMilano(Italy)

ThepurposeofthispaperistodefinealowcostapproachforestimatingtheresolutionofaTOFlaserscanner,portingtothe3D

environmentconceptsalreadyacceptedandimplementedin2DISOstandardsforelectronicimagingsystemssuchasISO12233and16067.

Themethodologypresentedisbasedontheproductionofimagessimilartothosegenerateby2DimagingofthecitedISOtargets,fromthelaserscanningofspecificallydeveloped3Dtargets,madebywindowsofvaryingsizeonthehorizontalplane(xy)andabruptjumpsonthelaserbeamdirection(z).EachrangemapistransformedinaB/Wimagebycodingzingreylevels.OneachimagethecitedISOmethodscanbeapplied.Thesecanbedividedintwocategories:onebasedontheestimationofresolutionfromthedirectevaluationoftheimagegeneratedbygeometricfeaturesprogressivelyclosereachother,andtheotherbasedonthefrequencydomainanalysisofslantededgestheevaluationoftheassociatedModulationTransferFunction(MTF).AfewdifferentTOFrangecamerasbasedonlightpulsesandonphasedetectionofcontinuousmodulatedlighthavebeenconsidered.Theexperimentalresultsarefinallypresentedanddiscussed.

7864A-14, Session 3

Introducing the depth transfer curve for 3D capture system characterizationK.Atanassov,V.Ramachandra,S.R.Goma,QualcommInc.(UnitedStates)

Evaluatingdepthcharacteristicssubjectivelyisaverydifficulttasksincetheintendedorsideeffectsfromimagefusion(depthinterpretation)bythebrainarenotimmediatelyperceivedbytheobserver.Objectiveevaluationof3Dcameradepthcharacteristicsisausefultoolthatcanbeusedfor“blackbox”characterizationof3Dcameras.Inthispaperweproposeamethodologytoevaluate3Dcamerasdepthcapturecapabilitiesusingaspecialtestchart-with3Dfeatures-thatispracticalandcontainsnecessarystructuretoextractimportant3Ddepthstatisticsandwepresentaprocessingalgorithmthatextractstherelevantdepthinformation.

7864A-16, Session 4

Assessment of the quality of as-is building information models generated from point clouds using deviation analysisE.B.Anil,P.Tang,B.Akinci,D.Huber,CarnegieMellonUniv.(UnitedStates)

Threedimensional(3D)imagingsensors,suchaslaserscanners,arebeingusedtocreateBuildingInformationModels(BIMs)oftheas-isconditionsoffacilities.Qualityassurance(QA)needstobeconductedtoensurethatthemodelsaccuratelydepicttheas-isconditions.WeproposeanewapproachforQAthatanalyzespatternsintheraw3Ddataandcomparesthe3Ddatawiththeas-isBIMgeometrytoidentifypotentialerrorsinthemodel.This“deviationanalysis”approachtoQAenablesuserstoanalyzetheregionswithsignificantdifferencesbetweenthe3Ddataandthereconstructedmodelorbetweenthe3Ddataofindividuallaserscans.Thismethodcanhelpidentifythesourceoftheerroranddoesnotrequireadditionalphysicalaccesstothefacility.Toshowtheapproach’spotentialeffectiveness,weconductedcasestudiesofseveralprofessionallyconductedas-isBIMprojects.Wecomparedthedeviationanalysismethodtoanalternativemethod-thephysicalmeasurementapproach-intermsofcoverageoftheenvironment,typesoferrorsdetected,andtimerequirements.Wealsoconductedasurveyandevaluationofcommercialsoftwarewithrelevantcapabilitiesandidentifiedtechnologygapsthatneedtobeaddressedtofullyexploitthedeviationanalysisapproach.



IS&T /

ReturntoContents

7864A-17, Session 4

Content-based depth estimation in focused plenoptic cameraS.R.Goma,K.Atanassov,QualcommInc.(UnitedStates);T.G.Georgiev,AdobeSystemsInc.(UnitedStates)

Theplenopticcamerahasbeenusedforgeneratingnovelviewsandrefocusing.Howevertheoriginalpurposeofthis,aswellastheimprovedfocusedplenopticcamera,hasbeenmeasuringdepth.Depthestimationinfocusedplenopticcameraisacriticalstepformostapplicationsofthistechnologyandposesinterestingchallenges,asthisestimationiscontentbased.Wepresentaniterativealgorithm,contentadaptive,thatexploitstheredundancyfoundinimagescapturedwithfocusedplenopticcamera.Ouralgorithmdeterminesforeachpointitsdepthalongwithameasureofreliabilityallowingsubsequentenhancementsofspatialresolutionofthedepthmap.Also,ourcaptureiscorrectedforthedistortionbetweenimagesofcentralandnon-centralmicrolenses.Weremarkthatthespatialresolutionoftherecovereddepthcorrespondstodiscretevaluesofdepthinthecapturedscenetowhichwereferasslices.More,eachslicehasadifferentdepthandwillallowextractionofdifferentspatialresolutionsofdepth,dependingonthescenecontentbeingpresentinthatslicealongwithoccludingareas.Interestingly,asfocusedplenopticcameraisnottheoreticallylimitedinspatialresolution,weshowthattherecoveredspatialresolutionisdepthrelated,andassuch,renderingofafocusedplenopticimageiscontentdependant.

7864A-18, Session 5

Measurement of micro gears: comparison of optical, tactile-optical, and CT-measurementsU.Neuschaefer-Rube,M.Bartscher,F.Härtig,M.Neukamm,Physikalisch-TechnischeBundesanstalt(Germany);J.Goebbels,BundesanstaltfürMaterialforschungund-prüfung(Germany)

Microgearsareappliedinhighquantityinalotofapplications.Therefore,precisemeasurementsareofgrowingimportancetoensuretheirquality.Thepresentationdescribesthemeasurementofgearsofamicroplanetarygearsetwithanopticalsensor,atactile-opticalprobeandcomputedtomography(CT).

Animagingsensorbasedonfocusvariationisusedfortheopticalmeasurements.Thetactile-opticalmeasurementswerecarriedoutwithasocalledfiberprobe.Thissensorappliesimageprocessingtodeterminethepositionofthetactileprobingelement.Forthemeasurementssinglepointprobingwasused.Duetolimitedaccessibilityatsomegearsnotallregionscanbemeasuredbytheopticalsensorandthetactile-opticalprobe.Incontrasttothis,withCTalwaysthewholepartcanbemeasuredwithhighpointdensity.

AlltheusedsensorsdelivermeasurementdatainCartesiancoordinates.Achallengeistotransferthedataincoordinatesinwhichgearparametersaredefined.Forthis,specialattentionmustbepaidtothecenterpointofthegearandtotheorientationoftheteeth.

Thecomparisonbetweendataofdifferentmeasurementswascarriedoutsuccessfully.Thedeviationsbetweentactile-opticalandtheCTdataareonlyafewmicrometers.

7864A-19, Session 5

Method for measuring stereo camera depth accuracy based on stereoscopic visionM.Kytö,M.Nuutinen,P.T.Oittinen,AaltoUniv.SchoolofScienceandTechnology(Finland)

Wepresentamethodtoevaluatestereocameradepthaccuracyinhumancenteredapplications.Itenablesthecomparisonbetweenstereocameradepthresolutionandhumandepthresolution.

Ourmethodusesamultileveltesttargetwhichcanbeeasilyassembledandusedinvariousstudies.Binoculardisparityenableshumanstoperceiverelativedepthsaccurately,makingamultileveltesttargetapplicableforevaluatingthestereocameradepthaccuracywhentheaccuracyrequirementscomefromstereoscopicvision.

ThemethodwasvalidatedwithastereocamerabuiltoftwoSLRs.ThedepthresolutionoftheSLRswasbetterthannormalstereoacuityatallmeasureddistancesrangingfrom0.7mto5.8m.Themethodwasusedtoevaluatetheaccuracyofalowerqualitystereocamera.Twoparameters,focallengthandbaseline,werevaried.Focallengthhadalargereffectonstereocamera’sdepthaccuracythanbaseline.Thetestsshowedthatnormalstereoacuitywasachievedonlyusingatelelens.

However,auser’sdepthresolutioninavideosee-throughsystemdiffersfromnakedeyeviewing.Thesametesttargetcanbeusedtoevaluatethisbymixingthelevelsrandomlyandaskinguserstosortthelevelsaccordingtotheirdepth.Thecomparisoncanbedonebycalculatingcorrelationsbetweenthemeasuredorder,perceivedorderandtheactualorderofthelevels.

7864A-20, Session 5

Best practices for the 3D documentation of the Grotta Dei Cervi of Porto Badisco, ItalyJ.A.Beraldin,M.Picard,NationalResearchCouncilCanada(Canada);V.Valzano,A.Bandiera,Univ.delSalento(Italy);F.Negro,CASPUR(Italy)

final/partialresults:

-Creationofa3Dpolygonalmodelofthecentral300-mcorridor.

-Texturemappingofmostofthecaveespeciallywheresomepictographsshowdeterioration.

-Creationoftransversalandlongitudinalcross-sectionsofthecavewitha1-mmresolution.

-Creationofhighresolutionrealisticrenderings(upto17.000x8.000)usingcolourandshapewitha1-mmresolution.

-Videoanimationform3Ddigitalpolygonalmodel(1080p).

-CreationofaCD-ROMforpublicconsumption.

-Creationofawebsite.

-Creationofa3Dpolygonalmodelforstereoscopicdisplays.

-Extractionofinformationfrommodelinordertoassistrestoration/preservationwork.

7864A-21, Session 6

NPL freeform artefact for verification of non-contact measuring systemsM.McCarthy,NationalPhysicalLab.(UnitedKingdom)

Newmachiningtechniquesmakeitpossibletomanufacturerangesofadvancedfreeformcomponentsandwithappropriatemetrology,higherprecisions(sub-micrometre)canpotentiallybeachieved.However,suchrapiddevelopmentsarelimitedduetoalackofmeasurementconfidenceandsuitablemeasurementtraceability.Tactileco-ordinatemeasuringmachines(CMMs)areabletoscancomplexsurfaces,butthisprocessisoftenverytimeconsuming.Incontrast,manyportableoptical-basednon-contactco-ordinatemeasuringsystemarenowcommerciallyavailablewhichcancapturevastquantitiesofpoint-clouddatainafractionofthetime.Verificationofthesenon-contactsystemsiscomplexandusefulguidessuchastheVDI/VDE2634seriesdemonstratecapabilitywhileemploystestartefactssuchasspheresandplanes.Theguidedoesnotextendtofullyaddressperformanceverificationwhenfreeformsurfacesaretobemeasured.

Toassistindustry,NPLhasdevelopedarangeoffreeform-basedreferenceartefacts,allowingtheperformanceofnon-contactsystems,suchasfringeprojector,laserscannersandothersimilaropticallybasedsystemstobeverifiedagainstasetofknownsurfaceconditions.



IS&T /

ReturntoContents

Thepurposeoftheartefactsistodemonstratethecapabilityofselectednon-contacttechnologiesandcertainsystems,tomeasurespecificformsandsurfaceconditions,ratherthantobeauniversalstandard.

The‘NPLFreeformartefacts’havebeendesigned,manufacturedandthencalibratedatNPLusinganultrahighaccuracytactilebasedCMM.Achosenartefact,havinganominallyfootprintof120mmx120mmhassubsequentlybeenusedtoverifytheperformanceofanumberofcommercialfreeformopticalmeasuringsystems.Thispaperwilldescribethegenericdesignofthe‘NPLFreeFormartefact’,thecalibrationprocedureadoptedanddiscussameasurementinter-comparisoninvolvinganumberofnon-contactindustrialmeasuringsystems.

7864A-22, Session 6

Proposed NRC-CNRC portable target case for short-range triangulation-based 3D imaging systems characterizationB.Carrier,D.K.MacKinnon,L.Cournoyer,J.A.Beraldin,NationalResearchCouncilCanada(Canada)

TheNationalResearchCouncilofCanada(NRCC)iscurrentlyevaluatinganddesigningartifactsandmethodstocompletelycharacterize3-Dimagingsystems.Wehavegatheredasetofartifactstoformalow-costportablecaseandprovideaclearly-definedsetofproceduresforgeneratingcharacteristicvaluesusingtheseartifacts.Initscurrentversion,thiscaseisspecificallydesignedforthecharacterizationofshort-range(standoffdistanceof1centimeterto3meters)triangulation-based3-Dimagingsystems.Thecaseisknownasthe“NRC-CNRCPortableTargetCaseforShort-RangeTriangulation-based3-DImagingSystems”(NRC-PTC).Theartifactsinthecasehavebeencarefullychosenfortheirgeometric,thermal,andopticalproperties.AsetofcharacterizationproceduresareprovidedwiththeseartifactsbasedonprocedureseitheralreadyinuseorarebasedonknowledgeacquiredfromvarioustestscarriedoutbytheNRCC.Geometricdimensioningandtolerancing(GD&T),awell-knowslanguageintheindustrialfield,wasusedtodefinethesetoftests.Thefollowingparametersofasystemarecharacterized:dimensionalproperties,formproperties,orientationproperties,localizationproperties,profileproperties,repeatability,intermediateprecision,andreproducibility.AnumberoftestswereperformedinaspecialdimensionalmetrologylaboratorytovalidatethecapabilityoftheNRC-PTC.TheNRC-PTCwillsoonbesubjectedtoreproducibilitytestingusinganintercomparisonevaluationtovalidateitsuseindifferentlaboratories.



IS&T /

ReturntoContents

Conference 7864B: 3D Image Processing (3DIP) and Applications IIWednesday-Thursday26-27January2011PartofProceedingsofSPIEVol.7864B3DImageProcessing(3DIP)andApplicationsII

7864B-25, Session 8

3D shape descriptors for face segmentation and fiducial points detection: an anatomical-based analysisA.E.SalazarJiménez,Univ.NacionaldeColombiaSedeMedellín(Colombia);A.Cerón,Univ.MilitarNuevaGranada(Colombia);F.A.PrietoOrtiz,Univ.NacionaldeColombia(Colombia)

Thebehaviorofnine3Dshapedescriptorswhichwerecomputedonthesurfaceof3Dfacemodels,isstudied.Thesetofdescriptorsincludessixcurvature-basedones,SPINimages,FoldedSPINImages,andFingerprints.Insteadofdefiningclustersofverticesbasedonthevalueofagivenprimitivesurfacefeature,afacetemplatecomposedby28anatomicalregions,isusedtosegmentthemodelsandtoextractthelocationofdifferentlandmarksandfiducialpoints.Verticesaregroupedby:region,regionboundaries,andsubsampledversionsofthem.Theaimofthisstudyistoanalyzethediscriminantcapacityofeachdescriptortocharacterizeregionsandtoidentifykeypointsonthefacialsurface.Theexperimentincludestestingwithdatafromneutralfacesandfacesshowingexpressions.Also,inordertoseetheusefulnessofthebending-invariantcanonicalform(BICF)tohandlevariationsduetofacialexpressions,thedescriptorsarecomputeddirectlyfromthesurfaceandalsofromitsBICF.Intheresults:thevalues,distributions,andrelevanceindexesofeachsetofvertices,wereanalyzed.

7864B-28, Session 9

Deformable shape retrieval using bag-of-feature techniquesH.Tabia,M.Daoudi,J.Vandeborre,TELECOMLille1(France);O.Colot,Univ.desSciencesetTechnologiesdeLille(France)

Wepresentanovelmethodfor3D-shapematchingusingBag-of-Featuretechniques(BoF).Themethodstartsbyselectingandthendescribingasetofpointsfromthe3D-object.Suchdescriptorshavetheadvantageofbeinginvarianttodierenttransformationsthatashapecanundergo.Basedonvectorquantization,weclusterthosedescriptorstoformashapevocabulary.Then,eachpointselectedintheobjectisassociatedtoacluster(word)inthatvocabulary.Finally,aBoFhistogramcountingtheoccurrencesofeverywordiscomputed.Theseresultsclearlydemonstratethatthemethodisrobusttonon-rigidanddeformableshapes,inwhichtheclassoftransformationsmaybeverywideduetothecapabilityofsuchshapestobendandassumedierentforms.

7864B-32, Session 10

Automatic generation of 3D building models from orthogonal building footprintK.Sugihara,GifuKeizaiUniv.(Japan);X.Zhou,NagoyaBunriUniv.(Japan);T.Murase,ChukyoGakuinUniv.(Japan)

Basedonbuildingpolygonsorbuildingfootprintsondigitalmaps,weproposeaGISandCGintegratedsystemthatautomaticallygenerates3-Dbuildingmodels.A3-Durbanmodelisanimportantinformationinfrastructurethatcanbeutilizedinseveralfields.However,enormoustimeandlaborhastobeconsumedtocreatethese3-Dmodels,using3DmodelingsoftwaresuchasSketchUp.Inordertoautomatethelaborioussteps,weproposedtheGISandCGintegratedsystemthatautomaticallygenerates3-Dbuildingmodelsfrombuildingpolygons

onadigitalmap.Mostbuildingpolygons’edgesmeetatrightangles(orthogonalpolygon).Acomplicatedorthogonalpolygoncanbepartitionedintoasetofrectangles.Theintegratedsystempartitionsorthogonalbuildingpolygonsintoasetofrectanglesandplacesrectangularroofsandbox-shapedbuildingbodiesontheserectangles.Inordertopartitionanorthogonalpolygon,weproposedausefulpolygonexpression(RLexpression)andapartitioningschemethatisusedindecidingfromwhichvertexadividingline(DL)isdrawn.Inthispaper,weproposeanewschemeforpartitioningbuildingpolygonsandforcreatingacomplicatedshapeofbuildingmodelsbasedonorthogonalbuildingpolygons.


Feature vertices for 3D synchronization using Euclidean minimum spanning treeN.Tournier,Lab.d’InformatiquedeRobotiqueetdeMicroelectroniquedeMontpellier(France)andStratégiesS.A.(France);W.Puech,G.Subsol,Lab.d’InformatiquedeRobotiqueetdeMicroelectroniquedeMontpellier(France);J.Pedeboy,StratégiesS.A.(France)

Synchronizationin3Ddatahidingisoneofthemainproblems.Weneedtoknowwherewecanembedinformation,andbeabletofindthisspaceaftertheinsertionofthemessage.Variousalgorithmsproposetheirsynchronizationtechniquesbytriangleorvertexpathina3Dmesh.

Inthispaper,weproposedanewsynchronizationtechniquebasedonEuclideanminimumspanningtreecomputing(EMST)andtheanalysisofthedisplacementoftheverticeswithoutmovingtheconnectionsinthetree.Basedontheanalysisofthevertices,weselectthemostmobileandsynchronizetheseareasbycomputinganewEMSTonthem,thatwecall“robustEMST”.

Then,weanalyzetherobustnessofthetechnique,i.e.thestabilityofthemostmobileverticesselection;anddemonstratetheconsistenceofthecriterionselectionwiththevertexdisplacement.


Probability distributions from Riemannian geometry, generalized hybrid Monte Carlo sampling, and path integralsE.Paquet,NationalResearchCouncilCanada(Canada);H.L.Viktor,Univ.ofOttawa(Canada)

Whenconsideringprobabilisticpatternrecognitionmethods,especiallymethodsbasedonBayesiananalysis,theprobabilisticdistributionisoftheutmostimportance.However,despitethefactthatthegeometryassociatedwiththeprobabilitydistributionconstitutesessentialbackgroundinformation,itisoftennotascertained.ThispaperdiscusseshowthestandardEuclidiangeometryshouldbegeneralizedtotheRiemanniangeometrywhenacurvatureisobservedinthedistribution.Tothisend,theprobabilitydistributionisdefinedforcurvedgeometry.Inordertocalculatetheprobabilitydistribution,aLagrangianandaHamiltonianconstructedfromcurvatureinvariantsareassociatedwiththeRiemanniangeometryandageneralizedhybridMonteCarlosamplingisintroduced.Finally,weconsiderthecalculationoftheprobabilitydistributionandtheexpectationinRiemannianspacewithpathintegrals,whichallowsadirectextensionoftheconceptofprobabilitytocurvedspace.


IS&T /

ReturntoContents


Compression method by using the motion estimation of residual image transformed from elemental image array in three-dimensional integral imagingC.H.Yoo,J.Lee,H.Kang,E.Kim,KwangwoonUniv.(Korea,Republicof)

Inthispaper,weproposedahighlyenhancedcompressionschemeofIntegralImaging(InIm)byuseofsub-images(SIs)removingtheMotionVector(MV)ofresidualimagearraytransformedfromSub-ImageArray(SIA).Inthepickupprocess,SIAisgeneratedfromEIAaftertheobjectthroughthevirtualpinholearrayisrecordedasElementalImageArray(EIA).ItprovidesenhancedcompressionefficiencybyimprovingthesimilarityamongSIs.Intheproposedmethod,asegmentedarea,whichisamacroblock,inthereferenceSIismatchedoncurrentSIsapplyingtoMSE.MVsoccurredamongSIsmightresultinanadditionalincreasefordatacompression.Accordingly,thecomputedmotionestimationfromtheblock-matchingissavedasMVandallobjectsineachcurrentSIareshiftedtotheobjectpositionofthereferenceSItocompensatetheirMVbasedonthemotionestimation.WecanenhancethesimilarityofSIsremovingMV,sothatanimprovementofcompressionefficiencyoftheSIAcouldbeobtained.Inaddition,thevideocompressionschemesuchasMPEG-4canbeappliedtodatareductionoftheconsecutiveframes.TheproposedalgorithmoutperformsthebaselineJPEGandtheconventionalEIAcompressionschemeappliedtoInImandiscomparedwithsimulationsproducedusingtheseschemes.


Detection of the aortic intimal tears by using 3D digital topologyC.Lohou,InstitutsUniversitairesdeTechnologie(France);B.Miguel,CHUClermont-Ferrand(France)

Severalworksaboutaortasegmentationhavealreadybeenperformed:mostofthemconcernaorticouterwallsandaremainlyproposedinthecaseofabdominalaorticaneurysmsandareusuallybasedondeformabletechniques.

AorticdissectionsisarealproblemofPublicHealth,andmayquicklyleadtodeath.Aorticdissectionsareduetopresenceoftearsinsidelumens(orholesintheintimaltissue).Thesetearsaredifficulttodetectbecausetheydonotcorrespondtoafilledorgantosegment;thesetearsareusuallyvisuallyretrievedbyradiologistsbyexamininggraylevelvariationonsuccessiveplanes,whichisaverydifficultanderror-pronetask.

Ourpurposeistodetecttheseintimaltearstohelpcardiacsurgeonstoestablishadiagnosis:thevisualizationofintimaltearscouldleadtochooseasizeofendoprothesis,andtheaddofthisdatacouldalsohelpsurgeonsduringtheintervention.

Atthisaim,weuseAktoufetal.’sholesfillingalgorithmsproposedinthefieldofdigitaltopology.Thisalgorithmpermitsthefillingofholesona3Dobjectbyusingtopologicalnotions—holesareintimaltearsinourimages.

Wealsothinkthatthisapproachwouldgainedtobeknowntospecialistsofotherdiseases.


Data processing path from multimodal 3D measurement to realistic virtual modelR.Sitnik,J.F.Krzeslowski,G.Maczkowski,WarsawUniv.ofTechnology(Poland)

Asetofcalculationmethodshasbeendevelopedandtestedtoprovidemeansofcreatingvirtualcopiesofthreedimensional(3D)historicalobjectswithminimaluserinput.Wepresentastepbystepdataprocessingpathalongwithalgorithmdescriptionrequiredtoreconstructarealistic3Dmodelofaculturallysignificantobject.Theimportantfeatureforarchivinghistoricalobjectsistheabilitytoincludebothinformationaboutitsshapeandtexture,allowingvisualizationusingarbitraryconditionsofillumination.Datasamplesusedasinputfortheprocessingmethodchainwerecollectedusinganintegrateddeviceconsistingofshape,multispectralcolorandsimplifiedBRDFmeasurements.Toconfirmtheusabilityofpresentedmethods,ithasbeentestedbyexampleofreallifeobject-statueofanancientGreekgoddessKybele.Additionalvisualizationmethodshavealsobeenexaminedtorenderarealisticvirtualrepresentationsatisfyingintrinsicsurfacepropertiesoftheinvestigatedspecimen.


Preliminary study of statistical pattern recognition-based coin counterfeit detection by means of high resolution 3D scannersM.Leich,S.Kiltz,C.Kraetzer,J.Dittmann,C.Vielhauer,Otto-von-Guericke-Univ.Magdeburg(Germany)

AccordingtotheEuropeanCommissionaround200,000counterfeitEurocoinsareremovedfromcirculationeveryyear.Whilethereexistapproachestoautomaticallydetectthesecoins,satisfyingerrorratesareusuallyonlyreachedforlowqualityforgeries,so-called“localclasses”.Mintedforgeriesofveryhighquality(“commonclasses”)poseaproblemforthesemethodsaswellasfortrainedhumans.

Thispaperpresentsafirstapproachforstatisticalanalysisofcoinsbasedonhighresolution3Ddataacquiredwithachromaticwhitelightsensor.Thegoalofthisanalysisistodeterminewhethertwocoinsareofcommonorigin.Thetestsetforthesefirstandnewinvestigationswillconsistofabout50coinsfromnotmorethanfivedifferentsourcestoassesstheoverallpotential.Theanalysisisbasedontheassumptionthat,apartfrommarkingscausedbywearsuchasscratchesandresidueconsistingofgreaseanddust,coinsfromequaloriginhaveamoresimilarheightfieldthancoinsfromdifferentmints.Theinfluenceofwearmarkingsisdiscussedandanapproachforeliminatingthisinfluenceisoutlined.


3D digitization of metallic specular surfaces using scanning from heating approachA.Bajard,O.Aubreton,Univ.deBourgogne(France)

Becauseofthedifficultyofdealingwithspecularityofseveralsurfaces,fewmethodshavebeenproposedtomeasurethree-dimensionalshapesofspecularmetallicobjects.Inthispaperwepresentanapplicationonthiskindofmaterialofanapproachcalled“ScanningFromHeating”.Thisapproachhasbeendeveloppedinitialyfor3Dreconstructionoftransparentobjects.ThisarticlepresentsanapplicationoftheworkingprincipleofSFHmethodonmaterialwithhighthermalconductivity.

Conference 7864B: 3D Image Processing (3DIP) and Applications II


IS&T /

ReturntoContents


3D image processing architecture for camera phonesK.Atanassov,V.Ramachandra,M.Aleksic,S.R.Goma,QualcommInc.(UnitedStates)

Recentdevelopmentsin3Ddisplaytechnologyhavecreatedademandforconsumergenerated3Dcontent.Cameraphonesareubiquitousandassuch,oneofthefirstchoicesforusergeneratedcontent.Wepresentasolutiontoafull3Ddatapathimplementationforcameraphones,providingpracticalargumentsto3Dtechnologyissuessuchascamerapositioning,disparitycontrolrationale,andscreengeometrydependency.Implementingsuccessfully3Dcapturefunctionalityonphonecamerasrequiresalgorithmsthatfitwithintheprocessingcapabilitiesofthedevice.Variousconstraintslikesensorpositiontolerances,moduletomodulevariation,post-processing,3Dvideoresolutionandframerate,shouldbecarefullyconsideredfortheirinfluenceon3Dexperience.Migratinguserfunctionalityfromthe2Dusagemodel,suchaszoomandpan(oncaptureanddisplay)requiresadditionalconsideration.Itisalsoimportantthattheuserinteractionwithboththecaptureandthedisplaydeviceisbothintuitiveandefficient.Finally,boththeprocessingpowerofthedeviceandthepracticalityoftheconceptneedtobetakenintoaccountinthecalibrationandprocessingmethodology.


Return detection for outdoor active triangulationD.M.Ilstrup,R.Manduchi,Univ.ofCalifornia,SantaCruz(UnitedStates)

Noabstractavailable.

Conference 7864B: 3D Image Processing (3DIP) and Applications II


IS&T /

ReturntoContents

Conference 7864C: The Engineering Reality of Virtual Reality 2011Tuesday25January2011PartofProceedingsofSPIEVol.7864CTheEngineeringRealityofVirtualReality2011

7864C-46, Session 13

Acquisition of stereo panoramas for display in VR environmentsR.A.Ainsworth,Ainsworth&Partners,Inc.(UnitedStates);D.J.Sandin,Univ.ofCalifornia,SanDiego(UnitedStates)andUniv.ofIllinoisatChicago(UnitedStates);J.P.Schulze,A.Prudhomme,T.A.DeFanti,Univ.ofCalifornia,SanDiego(UnitedStates)

Virtualrealitysystemsareanexcellentenvironmentforstereopanoramadisplays.Theacquisitionanddisplaymethodsdescribedherecombinehigh-resolutionphotographywithsurroundvisionandfullstereoviewinanimmersiveenvironment.Thiscombinationprovidesphotographicstereo-panoramasforavarietyofVRdisplays,includingtheStarCAVEandNextCAVE.

Thezeroparallaxpointusedinconventionalpanoramaphotographyisalsothecenterofhorizontalandverticalrotationwhencreatingphotographsforstereopanoramas.Thetwophotographicallycreatedimagesaredisplayedonacylinderorasphere.Theradiusfromtheviewertotheimageissetatapproximately20feet,orattheobjectofmajorinterest.

Afullstereoviewispresentedinalldirections.Thetwosphericalimagesaredisplacedhorizontallybytheinteroculardistance,asseenfromtheviewer’sperspective.Thispresentscorrectstereoseparationinwhateverdirectiontheviewerislooking,evenupanddown.Objectsatinfinitywillmovewiththeviewer,contributingtoanimmersiveexperience.

StereopanoramascreatedwiththisacquisitionanddisplaytechniquecanbeappliedwithoutmodificationtoalargearrayofVRdeviceshavingdifferentscreenarrangementsanddifferentVRlibraries.


Low cost heads-up virtual reality (HUVR) with optical tracking and haptic feedbackT.Margolis,T.A.DeFanti,G.Dawe,A.Prudhomme,J.P.Schulze,Univ.ofCalifornia,SanDiego(UnitedStates)

ResearchersattheUniversityofCalifornia,SanDiego,havecreatedanew,relativelylow-costaugmentedrealitysystemthatenablesuserstotouchthevisualenvironmenttheyareimmersedin.

TheHeads-UpVirtualRealitydevice(HUVR)couplesaconsumer3DHDflatscreenTVwithahalf-silveredmirrortoprojectanygraphicimageontotheuser’shandsandintothespacesurroundingthem.Withhisorherheadpositionopticallytrackedtogeneratethecorrectperspectiveview,theusermaneuversaforce-feedback(haptic)devicetointeractwiththegeneratedimage,literally‘touching’theobject’sanglesandcontoursasifitwasatangiblephysicalobject.

HUVRcanbeusedfortrainingandeducationinstructuralandmechanicalengineering,archaeologyandmedicineaswellasothertasksthatrequirehand-eyecoordination.OneofthemostuniquecharacteristicsofHUVRisthatausercanplacetheirhandsinsideofthevirtualenvironmentwithoutoccludingthe3Dimage.Builtusingopen-sourcesoftwareandconsumerlevelhardware,HUVRoffersusersatactileexperienceinanimmersiveenvironmentthatisbothfunctionalandaffordable.


An integrated pipeline to create and experience compelling scenarios in virtual realityC.Cruz-Neira,D.Reiners,J.Springer,Univ.ofLouisianaatLafayette(UnitedStates)

Wepresentinthispaperourresearchondesigningasoftwarepipelinethatenablesustocreatecompellingscenarioswithafairdegreeofvisualandinteractioncomplexityinasemi-automatedway.Specifically,wearetargeting“drivable”urbanscenarios,rangingfromlargecitiestosparselypopulatedruralvillagesthatincorporatebothstaticcomponents,suchashouses,trees,telephonepoles,etc.anddynamiccomponentssuchaspeople,vehicles,aswellasevents,suchasexplosions,suddennoises,etc.

Ourpipelinehasfourbasiccomponents:1)Anenvironmentdesigner,whereuserssketchtheoveralllayoutofthescenarioandanautomatedmethodconstructsthe3Denvironmentfromtheinformationinthesketch;2)Ascenarioeditorusedforauthoringthecompletescenario,incorporatethedynamicelementsandevents,tweaktheautomaticallygeneratedenvironment,definetheexecutionconditionsofthescenario(typeofdevice,interactions,datainputs,etc),andsetanydatagatheringthatmaybenecessaryduringtheexecutionofthescenario;3)Arun-timeenvironmentforthedifferentvirtualrealitysystemsprovidinguserswiththeinteractiveexperiencedesignedthroughthedesignerandtheeditor;4)Abi-directionalmonitoringsystemthatallowsustocaptureandmodifyinformationfromthevirtualenvironment.

Themainuseofthispipelineisfortherapiddevelopmentofscenariosforhumanfactorsstudies,however,itcanbeappliedinamuchmoregeneralcontext.


Whose point-of-view is it anyway?G.P.Garvey,QuinnipiacUniv.(UnitedStates)

SharedvirtualworldssuchasSecondLifeprivilegeasinglepoint-of-view,namelythatoftheuser.WhenloggedintoSecondLifeauserseesthevirtualworldfromadefaultviewpoint,whichisfromslightlyaboveandbehindtheuser’savatar(theuser’salterego‘in-world.’)Thispoint-of-viewisasiftheuserwereviewinghisorheravatarusingacamerafloatingafewfeetbehindit.Infactitispossibletosettheviewtoasifyouwereseeingtheworldthroughtheeyesofyouravataroryoucanevenmovethecameracompletelyindependentofyouravatar.

Achangeinpoint-of-viewmeansmorethanjustadifferentcamerapoint-of-view.Thepracticeofusingmultipleavatarsrequiresatransformationofidentityandpersonality.Whenauser‘enacts’theidentityofaparticularavatar,their‘real’personalityismaskedbytheassumedpersonality.

Thetechnologyofvirtualworldspermitsbothachangeofpoint-of-viewandalsofacilitatesachangeinidentity.Doesthiscauseanypsychologicaldistress?Oristheabilitytobesomeoneelseandseeaworld(agame,avirtualworld)throughadifferentsetofeyessomehowliberatingandevenbeneficial?


IS&T /

ReturntoContents


Biocybrid systems and the reengineering of lifeD.Domingues,Univ.deBrasília(Brazil)andLART-Lab.ResearchinArtandTechnoScience(Brazil);A.Rocha,C.Hamdan,L.Augusto,Univ.deBrasília(Brazil)

OurcollaborativeresearchesinArtandTechnoSciencearebasedonbiocybridsystems(bio+cybrid+hybrid)engineeringimmersivemultimodaltechnologiesforCaves,andwearabledevicestoexpandperceptualexperiencesandthesenseofpresenceinVirtualRealityandAugmentedReality,locative,mobileandtransparentinterfacesinurbanmixedlife.Weexploretheartists´creativityclosetotheinventivityofscientistsandmutualcapacityforthegenerationofbiocybridsystemswhichimpliesinthehumanexistence,beingco-locatedinthecontinuumzonebetweenofbodyandflesh-cyberspaceanddata-andthehybridpropertiesofphysicalworld.AnthropologicalissuesandthesenseofpresencebeingadequatedbythetechnologicalapparatusforhumanlifeinSoftwareArtpracticesrequiretheinterfacedesignforintertwinedrelationshipsbody/environment/netsreaffirmingtheubiquitousandmobileconditioninphysicalworldandcyberspace.Enactiveinterfacesandthechangesandchallengesofthereengineeringoflifearediscussedinthreeartworks.EcologicalBiocybridEmotionsinParacosmosoffertheimmersioninourCavetoreceivingsignalsfromremotelandscapesfromthedistantSouthMatoGrosso,Pantanalzone,Brazil,bymixingthelifeofbiologicalcommunityoffrogsandsnakesnaturalbehaviortobiologicalsignalsofatrackedbodyanditsphysiologytransductedintoasophiscatedsystemofbiofeedbackandmutationsinavirtualdatalandscape.Mutualbehaviorsandthesystematicstructuringoftherelationsbetweensensorypatternsultimatelyderivedfrommanufacturedsensortechnologiesandthosegivenbylocomotion,heat,heartbeatsandbreathing,andelectricalpotentialsofelectrooculogram,bytakingtheinternalparadigmsandtheparametersshouldresultinthosecontingenciesleadingtotheexternalizationofapercept.OpenedBodyConnection(performance)isabiocybridsysteminaugmentedrealityexpandedtomixedrealityinabodilycontinuumofflesh-cyberdataandphysicalspace.Duringaritual,inapublicevent,atattooartistinscribedonthebackoftheperformerthecomputationlanguageofthecodeawingshape.Thetattoobecameamixedrealitysystem,onlyreadincomputervisionandduringtwohours,otherprintedtagsonthebodyprovidedthree-dimensionalanimationstraveledthroughthebodytransmittedonline.Animatedwingsrespondtotheartist´dreamofflying.Expandedinteractionsandmixedlandscapegeneratedthe14Bis,biocybridsystem,byexploringmobileandlocativeinterfaceinmixedrealitywhichallowthecomputervisionintheskyoftheBraziliancapitalofthelittleplaneinventedbythepioneerSantosDumont.ThecreativepracticeinmixedrealityplacingthetagcodeinthegeolocatedGlobalPositioningSystemresultofthemodelledhistoricalplaneinrealscaleapparitioninthecitysky.Theabsence/presencestateishomologatedthoughthecomputervisionofamobilecellcam.Inapostbiologicalextrusionofhumanvision,theactofseeingissharedwiththesatelliteeyeintheskyandthehandledeyeinthemobiledevice,byexpandingtheneuropsychophysiologyofhumanperception.Syntheticsensesandthereengineeringoflifeofferadifferentscenarioforhumannarrativeinthetheateroflife.


Twisting the sense of space in immersive treesM.Song,SimonFraserUniv.(Canada);S.J.Barnes,Univ.ofBritishColumbia(Canada);D.Gromala,T.Fox,SimonFraserUniv.(Canada);D.Barnes,IndependentArtist(Canada)

Whatconsequencesdowepayforsittinginbedandmindlesslysurfingthewebtoreadaboutwhatpeopleateforbreakfast,buyingyetagainanotherpairofjeansorwatchingthelatestTVshows?Howareouronlinelivesaffectingourreal-worldenvironmentsandtheecologicalsystemsthatweliveinbutforgetaswearesoengrossedinourdigitallives?Weseektoexploretheserelationshipsandtheeffectsofouractionsthroughaninformation-rich,inhabitableexhibitionthatparadoxicallyblursmultipleaspectsofthevirtualandthephysical.

A16-foothighBanyantree-ametaphorforancientandcontemporaryconnectedness-isconstructedfrommultiple,inter-nestedlayersofaprojectionmaterialthatisalternatelytranslucentandopaque.ThroughexplorationandhabitationofapenetrableBanyantree,interactorsexperiencetheecologicalandenvironmentalcostsofinternetconsumptionthroughstereoscopicprojectionsthatvaryindepth-of-fieldandtwistaroundthecoreofthetree.Theinteractorsarethesourceofpleasureandtheconsumedenergythatmovesthroughtheinter-connectedbranchesandrootsoftheBanyantree-passingthroughxylem,phloemandpith.


Productive confusions: learning from simulations of pandemic virus outbreaks in Second LifeM.Cardenas,Univ.ofCalifornia,SanDiego(UnitedStates);L.S.Greci,Univ.ofCalifornia,SanDiego(UnitedStates)andVeteransAdministrationSanDiego(UnitedStates);S.Hurst,K.Garman,H.Hoffman,R.Huang,M.Gates,K.Kho,E.Mehrmand,T.Porteous,Univ.ofCalifornia,SanDiego(UnitedStates);A.Calvitti,Univ.ofCalifornia,SanDiego(UnitedStates)andVeteransAdministrationSanDiego(UnitedStates);E.Higginbotha,VeteransAdministrationSanDiego(UnitedStates);Z.Agha,Univ.ofCalifornia,SanDiego(UnitedStates)andVeteransAdministrationSanDiego(UnitedStates)

Usersofimmersivevirtualrealityenvironmentsoftenreportaside-effectoffeelingliketheirrealexperiencesafterimmersivesessionsresembleelementsofthevirtualworld.Yetperhapsthisside-effectcanbeturnedaroundtoexplorethepossibilitiesforimmersioninvirtualworldgrouptrainingsimulations.Thispaperwilldescribeobservationsfrommytimeworkingasanartist/researcherwiththeUCSDMedicalSchoolandVAHealthcareSystemtodeveloptrainingsfornurses,doctorsandHospitalIncidentCommandstaffwhichsimulatepandemicvirusoutbreaks.Byexaminingmomentsofslippagebetweenrealities,bothintoandoutofthevirtualenvironment,momentsoftheconfusionofboundaries,wecanbetterunderstandmethodsforcreatingimmersion.Iwillusethemixingofrealitiesasatransversallineofinquiry,borrowingfromsciencestudies,gamestudies,andperformancestudiestorevealthesocialimplicationsoftechnology.FocusingondrillsconductedinSecondLife,Iwillexaminebothmomentswithinthedrill,interviewsafterthedrillandthefeedbackwithactualhospitalprocedures.

Conference 7864C: The Engineering Reality of Virtual Reality 2011


IS&T /

ReturntoContents

Conference 7865: Human Vision and Electronic Imaging XVIMonday-Thursday24-27January2011PartofProceedingsofSPIEVol.7865HumanVisionandElectronicImagingXVI

7865-05, Session 2

What makes good image composition?R.Banner,Hewlett-PackardLabs.IsraelLtd.(Israel)

Compositionisoneofthemostimportantfeaturesforthevisualrepresentationsofideasandmessages.Inparticular,theabilitytoevaluatecompositionsandseparatebetween“good”and“bad”compositionsisofmajorimportanceforgraphicdesignapplicationsthatfacilitatethecreationofcompellingdigitaljobs.Givenacompositionalarrangementofobjectsinthepictorialspace,thisworkproposesameasurethatevaluateshowbalancedthegivencompositionis.Tothatend,wefirstreviewseveralimportantperceptualconceptsthatartistsintuitivelyobeytoproducebalancedcompositions.Basedontheseconceptswesuggestanovelcompositionmeasurethatmakesauseoftheelectrostaticmodel.Finally,wedemonstratethevalidityofthismeasureintheoryandthroughsimulations.

7865-06, Session 2

A comparison of perceived lighting characteristics in simulations versus real-life setupB.A.Salters,P.J.H.Seuntiens,PhilipsResearchNederlandB.V.(Netherlands)

TheadvanceofLEDtechnologyenablesawholenewrangeofluminairedesigns,whichpreviouslywerenotpossible.Theopticalperformanceofadesignintermsofe.g.brightnessdistributionscanbesimulatedquitewellbymeansofraytracingsoftware.Thesesimulationsarenotverysuitabletorateacertaindesignonperceptualaspectssuchascosinessordiffuseness,andvisibilityofartifactssuchasnon-uniformities.Forthiskindofperceptualresearch,itisstillrequiredtohaveanactualprototypeofacertaindesignwhichiscostlyandtimeconsuming.

Therefore,itisextremelyusefultohaveanunderstandingifperceptionquestionscouldbeansweredbymeansofphotorealisticrenderings.Forthiswehavebuiltaroomwhereseveralluminairescanbetested.ThesameroomhasbeensimulatedinLighttoolsgeneratingphotorealisticrenderings.Finally,severaldifferentperceptualquestionshavebeeninvestigatedstatistically,bothintheexperimentalroom,aswellasusingapictureshownonanormalLCDmonitor,andprintedonpaper.Wewilldiscusssimilaritiesanddifferencesbetweentheresultsofbothtypesoftests.Inthisway,relationscanbeestablishedbetweenallaspectsindesigningluminaires:designandsimulation,prototyping,andperceptionstudies.

7865-07, Session 2

Investigating two features of aesthetic perception in consumer photographic images: clutter and centerC.D.Cerosaletti,A.C.P.Loui,A.C.Gallagher,EastmanKodakCo.(UnitedStates)


7865-08, Session 3

Analyzing near-infrared images in utility assessmentN.Salamati,Z.Sadeghipoor,S.E.Süsstrunk,EcolePolytechniqueFédéraledeLausanne(Switzerland)

Visualcognitionisofsignicantimportanceinimagingapplications,suchassecurityandsurveillance,whereitiscrucialtodeterminethemaximumdistortionlevelthatcanbeappliedtotheimageswhilestillinsuringthatenoughinformationisconveyedtorecognizethescene.SincereectionintheNIRpartofthespectrumismaterialdependent,anobjectmadeofaspecicmaterialismoreprobabletohaveuniformresponseintheNIRimages.Consequently,edgesintheNIRimageslikelycorrespondtothephysicalboundariesoftheobjectsratherthanchangesincolorwithintheobject.

Inthispaper,weevaluatetheusefulnessofNIRimagesforcognitiontasks.WecomparethemaximumdistortionlevelofvisibleandNIRimageswherescenesarestillcorrectlyrecognized.Weperformedasubjectivestudyonsixscenes,eachonerepresentedbyitsvisibleandNIRimage.TheimageswerecompressedtodierentQfactorsusingJPEGcompression.Wefoundthatinthispreliminarytest,fouroutofthesiximagesweremoreeasilyrecognizedbasedontheNIRimages.

7865-09, Session 3

Appearance-based human gesture recognition using multimodal features for human computer interactionD.Luo,WasedaUniv.(Japan);H.Gao,H.K.Ekenel,KarlsruherInstitutfürTechnologie(Germany);J.Ohya,WasedaUniv.(Japan)

TheuseofgestureasanaturalinterfaceplaysanutmostimportantroleforachievingintelligentHumanComputerInteraction(HCI).Humangesturesincludedifferentcomponentsofvisualactionssuchasmotionofhands,face,andtorso,toconveymeaning.Sofar,inthefieldofgesturerecognition,mostpreviousworkshavefocusedonthemanualcomponentofgestures.Inthispaper,wepresentanappearance-basedmultimodalgesturerecognitionframework,whichcombinesthedifferentgroupsoffeaturessuchasheadmotion,facialexpressionandhandmotionswhichhavebeenextractedfromtheimageframescaptureddirectlybyawebcamera.Werefer12classesofhumangestureswithfacialexpressionincludingneutral(e.g.asign“feel”),negative(e.g.“angry”)andpositive(e.g.“excited”)meaningsfromAmericanSignLanguages.Wecombinethefeaturesintwolevelsbyemployingtwofusionstrategies.Atthefeaturelevel,anearlyfeaturecombinationcanbeperformedbyconcatenatingandweightingdifferentfeaturegroups,andLDAisusedtochoosethemostdiscriminateelementsbyprojectingthefeatureonadiscriminativeexpressionspace.Thesecondstrategyisappliedondecisionlevel.Weighteddecisionsfromsinglemodalityarefusedinalatestage.ACondensation-basedalgorithmisadoptedforclassification.Wecollectedadatasetwiththreerecordingsessionsandconductedexperimentswiththecombinationtechniques.Experimentalresultsshowedthatthecombinedrecognitionoutperformsthesinglemodalitysignificantly.


IS&T /

ReturntoContents

Conference 7865: Human Vision and Electronic Imaging XVI

7865-10, Session 3

Adaptive user interfaces for relating high-level concepts to low-level photographic parametersE.Scott,P.A.MadhawaSilva,B.Pardo,T.N.Pappas,NorthwesternUniv.(UnitedStates)

Commoncontrolsforphotographiceditingcanbedifficulttouseandhaveasignificantlearningcurve.Often,auserdoesnotknowadirectmappingfromahigh-levelconcept(suchas“soft”)totheavailableparametersorcontrols.Inaddition,manyconceptsaresubjectiveinnature,andtheappropriatemappingmayvaryfromusertouser.Toovercometheseproblems,weproposeasystemthatcanquicklylearnamappingfromahigh-levelsubjectiveconceptontolow-levelimagecontrolsusingmachinelearningtechniques.Tolearnsuchaconcept,thesystemshowstheuseraseriesoftrainingimagesthataregeneratedbymodifyingaseedimagealongdifferentdimensions(e.g.,color,sharpness),andcollectstheuserratingsofhowwelleachtrainingimagematchestheconcept.Sinceitisknownpreciselyhoweachmodifiedexampleisdifferentfromtheoriginal,thesystemcandeterminethecorrelationbetweentheuserratingsandtheimageparameterstogenerateacontrollertailoredtotheconceptforthegivenuser.Theendresult—apersonalizedimagecontroller—isapplicabletoavarietyofconcepts.Wehavedemonstratedtheutilityofthisapproachtorelatelow-levelparameters,suchascolorbalanceandsharpness,tosimpleconcepts,suchas“lightness”and“crispness,”aswellasmorecomplexandsubjectiveconcepts,suchas“pleasantness.”Wehavealsoappliedtheproposedapproachtorelatesubbandstatistics(variance)toperceivedroughnessofvisualtextures(fromtheCUReTdatabase).

7865-11, Session 3

Parametric quality assessment of synthesized texturesD.SiddalingaSwamy,D.M.Chandler,K.J.Butler,OklahomaStateUniv.(UnitedStates);S.S.Hemami,CornellUniv.(UnitedStates)


7865-12, Session 3

On the perception of band-limited phase distortion in natural scenesK.P.Vilankar,L.Vasu,D.M.Chandler,OklahomaStateUniv.(UnitedStates)

Thispaperpresentstheresultsofapsychophysicalexperimentperformedtofindthesensitivityofthehumanvisualsystemtobandlimitedphasedistortionsinspatialfrequencybandsrangingfromapproximately0.6-16cycles/degree.Acomplexwavelettransformwasusedtogeneratephase-distortedimagesbyaddingGaussiannoisetothephasecomponentofthecomplexwaveletcoefficientswithinindividualsubbands.Phase-distortionsensitivitywasmeasuredusingaspatialtwo-alternativeforced-choiceprocedure.Threenaturalscenesandfiveverticalsine-wavegratingswereusedintheexperiment.Theresultsrevealedthatforindividualsine-wavegratings,phase-distortionsensitivitywasfoundtoincreasewithdecreasingcontrast,whichsuggeststheeffectofcontrastmasking.Foracompoundsine-wavegrating(containingfourspatialfrequencies15.8,5.05,1.65and0.57cycles/degree),andfornaturalscenes,phase-distortionsensitivitydemonstratedanimage-specifictrend,suggestingtheoccurrenceofbothmaskingandcueingduringphase-distortiondetection.Basedontheseresults,analgorithmisdevelopedwhichattemptstopredictthevisualqualityofphase-distortedimages.Wedemonstratethatouralgorithmperformswellinqualityestimationofphase-distortedimagescomparedtoothermodernimagequalityassessmentalgorithms.

7865-13, Session 4

Complex bioinformatics data: insights from data visualization and perceptionT.Munzner,TheUniv.ofBritishColumbia(Canada)

Noabstractavailable

7865-15, Session 4

Perceptual issues in the recovery and visualisation of integrated systems biology dataT.P.Pridmore,TheUniv.ofNottingham(UnitedKingdom)

Thesystemsapproachtobiologicalresearchemphasisesunderstandingofcompletebiologicalsystems,ratherthanareductionistfocusontightlydefinedcomponentparts.Systemsbiologyisnaturallyinterdisciplinary;researchgroupsactiveinthisareatypicallycontainexperimentalandtheoreticalbiologists,mathematicians,statisticians,computerscientistsandengineers.Awiderangeoftoolsareusedtogenerateavarietyofdatatypeswhichmustbeintegrated,presentedtoandanalysedbyresearchersfromanyandallofthecontributingdisciplines.Thegoalhereistocreatepredictivemodelsofthesystemofinterest;themodelsproducedmustalsobeanalysed,andinthecontextofthedatafromwhichtheyweregenerated.Effective,integrateddataandmodelvisualisationmethodsarecrucialifscientifically-appropriatejudgmentsaretobemade.

TheNottinghamCentreforPlantIntegrativeBiology(CPIB)takesasystemsapproachtothestudyoftherootofthemodelplantArabidopsisThaliana.Arichmixtureofdatatypes,manyextractedviaautomaticanalysisofindividualandtime-orderedsequencesofstandardCCDandconfocallasermicroscopeimages,isusedtocreatemodelsofdifferentaspectsofthegrowthoftheArabidopsisroot.InthispaperwebrieflyreviewthedatasetsandmodellingformalismsemployedwithinCPIB,anddiscussissuesraisedbytheneedtointerpretimagesoftheArabidopsisrootandintegrateandpresenttheresultingdataandmodelstoaninterdisciplinaryaudience.

7865-16, Session 4

Using cellular network diagrams to interpret large-scale datasets: past progress and future challengesP.D.Karp,M.Latendresse,S.Paley,SRIInternational(UnitedStates)

Noabstractavailable

7865-17, Session 4

Visualizing large high-throughput datasets based on the cognitive representation of biological pathwaysA.Nagel,Max-Planck-InstitutfürMolekularePflanzenphysiologie(Germany);O.Thimm,BASFPlantScienceCompanyGmbH(Germany);M.Stitt,B.Usadel,Max-Planck-InstitutfürMolekularePflanzenphysiologie(Germany)

Noabstractavailable


IS&T /

ReturntoContents

7865-18, Session 4

Metadata Mapper: a user interface web service for mapping data between independent visual analysis components, guided by perceptual rulesB.E.Rogowitz,VisualPerspectivesConsulting(UnitedStates);N.Matasci,TheUniv.ofArizona(UnitedStates)

Noabstractavailable

7865-47, Session 4

Hypergraph visualization and enrichment statistics: how the EGAN paradigm facilitates organic discovery from Big DataJ.Paquette,LifeTechnologiesCorp.(UnitedStates)

Noabstractavailable

7865-20, Session 6

Examination of 3D visual attention in stereoscopic video contentQ.Huynh-Thu,L.Schiatti,Technicolor(France)

Studieshaveindicatedthatviewerstendtofocustheirattentiononspecificareasofinterestinavideoandtwomechanismsofvisualattentionhavebeenidentified:bottom-upandtop-downmechanisms.Bottom-upattentioncorrespondstoinvoluntaryandunconsciouseyemovements,whicharedrivenmostlybythesignalcontent.Ontheotherhand,top-downattentionismostlydrivenbytask,context,semanticsandexperience.Veryfewstudieshaveinvestigatedvisualattentionandeyemovementpatternson3Dstereoscopicmovingsequences.Inthispaper,weinvestigate3Dvisualattentionanddifferencesinvisualattentionbetween2Dand3Dcontent.Forthatpurpose,weconductedasubjectiveexperimentusinganeye-trackingapparatusanda3Dstereoscopicdisplay.Contentwasviewedusingpassivepolarizedglassestechnologyinafree-viewingscenario.Observerswereaskedtoviewthe2Dand3Dversionsofthesamevideocontent.Wediscussourobservationsintermsoftheattentivebehaviorandcomparethisaspectbetweenthe2Dand3Dscenarios.Finally,wealsodiscussissuesrelatedtotheset-upofa3Deyetrackingexperimentandthemeasurementofthehumangazeinthreedimensions.

7865-21, Session 6

Quantifying how the combination of blur and disparity affects the perceived depthJ.Wang,M.Barkowsky,V.Ricordel,P.LeCallet,Univ.deNantes(France)

Theinfluenceofamonoculardepthcue,blur,ontheapparentdepthofstereoscopicsceneswillbestudiedinthispaper.When3Dimagesareshownonaplanarstereoscopicdisplay,binoculardisparitybecomesapre-eminentdepthcue.Butitinducessimultaneouslytheconflictbetweenaccommodationandvergence,whichisoftenconsideredasamainreasonforvisualdiscomfort.

Weproposetodecreasethe(binocular)disparityof3Dpresentations,andtoreinforce(monocular)cuestocompensatethelossofperceiveddepthandkeepanunalteredapparentdepth.Thelimitationofdepth-of-fieldofhumaneyescausesblurintheretinalimagewhichisknownasanimportantmonoculardepthcue.

Weconductasubjectiveexperimentwithabackgroundplaneandasingleobjectintheforeground.ASiemensStarisusedasthe

foregroundobject,sinceitcontainsregionswithbothlowfrequencyandhighfrequency.Inthesubjectiveexperiment,twosourcesofperceiveddepthareused:disparityandblur.Theperceiveddepthfromdisparitystemsfromthedifferenceofdisparitybetweentheforegroundobjectandthebackground.TheperceiveddepthfromblurstemsfromtheamountofblurintroducedtothebackgroundbyconvolutionwithaGaussiankernel.Adetailedstatisticalanalysisisperformedrevealingtheinteractionofthetwodepthcuesatdifferentdepthlevelsona3Dflatscreendisplay.Theresultsareusedtodevelopacomputationalmodelofperceiveddepthdependingonbluranddisparity.


Depth perception enhancement based on chromostereopsisJ.Hong,H.Lee,D.Park,C.Kim,SamsungAdvancedInstituteofTechnology(Korea,Republicof)

Thegoalofthisstudyistoenhancethecubiceffectinimageswithreduceddepthorcubiceffect,usingchromostereopsisamongthecharacteristicsofhumanvisualperception.ChromostereopsisgenerallyreferstoaphenomenoninwhichRed,alongwavelength,seemstobemoreprojectedthanBlue,ashortwavelength.Withthischromostereopsis,inordinarytimes,colorswithalongwavelengthseemcloserthanthosewithashortwavelengthduetothesenseofdistance;butasthelightnessofthebackgroundlayerisclosertoWhite,areversaleffecttakesplacesothatthesenseofdepthmaydifferdependingonthelightnessofthebackgroundlayer.Withthis,theinformationonthelightnessofthebackgroundlayerbecomesanotherdepthcue.Basedonthisproperty,apsychophysicalexperimentwascarriedouttoexaminetheimpactofchromostereopsisandtheneighborcolorbetweenlayersonthesenseofdepthbasedonthelightnessofthebackgroundlayer.


An evaluation of perceived color break-up on field-sequential color displaysM.Kobayashi,A.Yoshida,Y.Yoshida,SharpCorp.(Japan)

Fieldsequentialcolor(FSC)displayshaveamajorproblem:colorbreak-up(CBU).Moreover,itisdifficulttoquantifytheCBUinsaccadiceyemovements,becausethephenomenonoccursasquicklyasaflashinsaccadiceyemovements,andthereareindividualvariabilitiesforperceivingtheCBU.SomepreviousstudyhavepresentedassessmentsofsaccadicCBU,butnotindicatedthedetectionandallowancethresholdsofthetargetsizeinhorizontalsaccadiceyemovements.Then,weconductedpsychophysicalexperimentsbasedonanFSCdisplaydrivingwithsub-framefrequencyof240Hz-1440Hz(eachframeconsistofred,green,andbluesub-frames).Weemployedasimplestimulusforourexperiment,astaticwhitebarwithvariablewidth.Wetaskedtensubjectsafixedsaccadelengthof58.4visualdegreesinhorizontaleyemovements,andafixedtargetluminanceof15.25cd/m2.WeexaminedPESTmethodtofinddetectionandallowancethresholdsofwhitebarwidthforsaccadicCBU.

Thispaperprovidescorrelationsbetweentargetsizesandsub-framefrequenciesofanFSCdisplaydevice,andproposesaneasilyevaluationmethodofperceivingsaccadicCBUonFSCdisplays.


Text detection: effect of size and eccentricityC.H.Kao,C.Chen,NationalTaiwanUniv.(Taiwan)

Westudiedthedetectabilityofdisplayedtextsasafunctiontextsizeandretinaleccentricity.Wemeasuredthecontrastdetectionthresholdforvarioustypesoftext-likestimuli(real-,non-,oraclebone-,andscrambled-characters)atdifferentsizeandpresentedatdifferenteccentricities.Thecontrastthresholdwasmeasuredwithaspatialtwo-



IS&T /

ReturntoContents

alternativeforced-choiceparadigmandPSIadaptivemethod.Whenthetextsizeissmall,thedetectionthresholdofacharacterdecreasedwiththeincreaseofitssizewithaslope-1/2onlog-logcoordinatesuptoacriticalsizeatalleccentricitiesandforallstimulustypes.Beyondthiscriticalsize,therewaslittle,ifany,improvementoftextdetectabilityastextsizefurtherincreased.Thecriticalsizedependedonthestimulustypeandeccentricity.Thesensitivityforalltypesofstimulidecreasedastheeccentricityincreased.Theestimatedreceptivefieldsizeofthetextdetectorsisgreaterthanthatofthelinedetectors.Inaddition,thelogarithmofreceptivefieldsizeincreaseswitheccentricitywithaslope0.22.Withthisinformation,wecanconstructamodeltoestimatetheabilityofahumanobservertodetectacharacterbythesizeandeccentricityofthatcharacter.


Image enhancement of high digital magnification for patient with central vision lossZ.Li,G.Luo,E.Peli,SchepensEyeResearchInstitute(UnitedStates)

Centralvisionlossisaleadingvisionimpairmentaffectingmillionsofpeopleworldwide.Wearedevelopingaheadmountedcameraanddisplaydeviceformobileuse,whichcanprovideawiderangeofmagnificationlevelsbymeansofdigitalzooming(scaling).Theexposurelevelmaynotbeappropriateforthesub-imagetobemagnified,andthereforeoftenresultsintoodark,toobright,orlowcontrastsub-images.Thisproblemisprevalentespeciallyathighmagnificationlevels.

Forsmallsub-images,conventionalhistogramstretchingenhancementoftencausesover-orunder-enhancement,andflickeringvideoimages.Accordingtohumanobservers’selection,ournewenhancementmethodsworkbetter,andthereforearesuitabletobeimplementedinthedigitalmagnificationdevice.


Quality versus intelligibility: studying human preferences for American Sign Language videoF.M.Ciaramello,S.S.Hemami,CornellUniv.(UnitedStates)

Real-timevideoconferencingusingcellulardevicesprovidesnaturalcommunicationtotheDeafcommunity.Forthisapplication,compressedAmericanSignLanguage(ASL)videomustbeevaluatedintermsoftheintelligibilityoftheconversationandnotintermsoftheoverallaestheticqualityofthevideo.ThisworkreportsanexperimenttodeterminethesubjectivepreferencesofASLusersintermsofthetrade-offbetweenintelligibilityandaestheticqualitywhenvaryingtheproportionofthebitrateallocatedexplicitlytotheregionsofthevideocontainingthesigner.Testvideosareencodedusingarate-distortionoptimizationtechniquethatjointlyoptimizesforquality(measuredusingPSNR)andintelligibility(measuredusingacomputationalmodelofintelligibility)accordingtoasingle,user-specifiedencodingparameter.Experimentalresultssuggestthatathighbitrates,usersprefervideoscodedwiththequalitycriteria,inwhichthenon-signerregionsinthevideoareencodedwithsomenominalrate.Asthetotalencodingbitratedecreases,usersprefervideocodedwiththeintelligibilitycriteria,inwhichagreaterproportionoftherateisallocatedtothesigner.

7865-22, Session 7

Preferences for the balance between true image detail and noiseS.G.Deshpande,S.J.Daly,SharpLabs.ofAmerica,Inc.(UnitedStates)


7865-23, Session 7

Measurement of compression-induced temporal artifacts in subjective and objective video quality assessmentC.Mantel,Gipsa-lab(France)andSTMicroelectronics(France);P.Ladret,Gipsa-lab(France);T.Kunlin,STMicroelectronics(France)

Temporalpoolingandtemporaldefectsarethetwodifferencesbetweenimageandvideoqualityassessment.Whereastemporalpoolinghasbeentheobjectoftworecentstudies,thispaperfocusesontherarelyaddressedtopicofcompression-inducedtemporalartifacts,suchasmosquitonoise.Tostudytemporalaspectsinsubjectivequalityassessment,wecomparedtheperceivedqualityoftwoversionsofamosquitonoisecorrector:onepurelyspatialandtheotherspatio-temporal.Wesetupapaired-comparisonexperimentandchoosevideoswhosecompressionmainlycreatestemporalartifacts.Resultsprovedtheexistenceofapurelytemporalaspectinvideoqualityperception.Weinvestigatethecorrelationbetweensubjectiveresultsfromtheexperimentandthreevideometrics(VQM,MOVIE,VQEM),aswellastwotemporally-pooledimagemetrics(SSIMandPSNR).SSIMandPSNRmetricsfindthecorrectedsequencesofbetterqualitythanthecompressedonesbutdonotdistinguishspatialandspatio-temporalprocessings.TheconfrontationofthoseresultswiththeVQMandMovieobjectivemetricsshowthattheydonotaccountforthistypeofdefects.Adetailedstudyhighlightsthateithertheydonotdetectthemortheresponseoftheirtemporalcomponentismaskedbytheoneoftheirspatialcomponents.

7865-24, Session 7

Perceived contrast of electronically magnified videoA.M.Haun,R.L.Woods,E.Peli,SchepensEyeResearchInstitute(UnitedStates)

Itisfrequentlyobservedthatelectronicmagnificationofimageryresultsinadecreaseintheapparentcontrastofthemagnifiedimagerelativetotheoriginal.Magnificationistypicallyusedinoneoftwocontexts:eithertheentireimageisenlargedtofillalargerdisplayarea,oraportionofanimageisenlargedtofillthesamedisplayarea.Thedecreaseinperceivedcontrastmightbeduetoacombinationofimageblurandofsub-samplingthelargerrangeofcontrastsintheoriginalimage.Inaseriesofexperiments,wemeasuredtheeffectonapparentcontrastofmagnificationinbothcontexts;bothasafunctionofmagnificationpowerandofviewingdistance(visibilityofblurinducedbymagnification).Wefoundasignificantdifferenceintheapparentcontrastofmagnifiedversusunmagnifiedvideosequences.Theeffectonapparentcontrastwasfoundtoincreasewithincreasingmagnification,andtodecreasewithincreasingviewingdistance(orwithdecreasingapparentsize).Therewassignificantvariationbetweenobserversinthemagnitudeoftheperceiveddifference,particularlybetweenexperiencedandnaiveobservers,butacrossobserversandconditionsthereductioninperceivedcontrastwasreliablyontheorderof0.1to0.2logunits(80%to63%).Theseeffectsareconsistentwithexpectationsbasedonboththecontraststatisticsofnaturalimagesandthecontrastsensitivityofthehumanvisualsystem.Itisdemonstratedthat1)localareaswithinlargerimagesorvideoswillusuallyhavelowerphysicalcontrastthanthewhole;and2)visibilityof



IS&T /

ReturntoContents


‘missingcontent’(e.g.blur)inanimageisgenerallyinterpretedasadecreaseincontrast,andthisvisibilitydeclineswithviewingdistance.Theperceptualmagnitudesofbotheffectsaretherebypredictablebyamodelofhumancontrastsensitivityandperceivedcontrast.

SupportedbyNIHgrantsEY05957andEY19100.

7865-25, Session 7

Estimating the impact of single and multiple freeze occurrences on video qualityT.Xiao,K.E.Brunnström,AcreoAB(Sweden)

Hybridmodelsneedsmodulesforanalyzingthebitstreamaswellasanalyzingthedecodedvideo.Forthedecodedvideointerestingtoestimatetheimpactofpacketlossafterconcealment.Onecommonconcealmentstrategyistosimplyfreezethevideotothelastgoodframe.

ThisworkisbasedonanalgorithmdevelopedbyWolf(2009)thatcandetectdroppedorrepeatedframes,whichmaybeperceivedasfreezeorpausebytheviewer.Amappingfromthedetectedfreezingtimeandthenumberoffreezingoccurrencestovideoqualitywillpresented.AsastartingpointweconsidertherelationshipthatPastrana-Vidaletal.(2004)suggestedbetweenprobabilityofdetectionandthedurationofthedroppedframes.Theyalsofoundthatitisimportanttoconsidernotonlythedurationofthefreezebutalsothenumberoffreezeoccurrences.Twodifferentrelationshipsbetweenthetotaldurationoffreezeandthenumberofoccurrenceshasbeenderived,basedonextensionofmodelsforasinglefreezeoccurrence.OnewasbasedonPastrana-Vidaletal.(2004)andtheotherwasbasedonVanKesteretal(2011).Asubjectivetestwasdesignedtoevaluatetheperformanceofthemodels.Goodperformancewasfoundonthisdatai.emorethan0.9correlation,whichwassimilarforbothmodels.TheRMSEwasbetterforoneofthemodels.

7865-26, Session 7

The effects of scene characteristics, resolution, and compression on the ability to recognize objects in videoJ.Dumke,C.G.Ford,I.W.Stange,InstituteforTelecommunicationSciences(UnitedStates)

Thequalityofvideousedinpublicsafetyapplicationsmustbeevaluatedintermsofitsusabilityforspecifictasksperformedbytheenduser,andthistaskisoftenobjectrecognition.Asvideoapplicationsinpublicsafetybecomemorewidespread,guidancetoendusersregardinghowtoidentifythelevelofvideoqualitynecessaryfortheirapplicationbecomesnecessary.

ThePublicSafetyCommunicationResearch(PSCR)projectperformedasubjectivetestasoneofthefirstinaseriestoexplorevisualintelligibilityinvideo-theabilityoftheusertorecognizeanobjectinavideostreamgivenvariousconditions.Thetestsoughttomeasuretheeffectsonvisualintelligibilityofthreesceneparameters(targetsize,scenecomplexity,scenelighting),severalcompressionratesandtworesolutions(VGA(640x480)andCIF(352x288)).Sevensimilarlysizedobjectswereusedastargetsinninesetsofnear-identicalsourcescenes,whereeachsetwascreatedusingadifferentpermutationoftheparametersunderstudy.Viewerswereaskedtoidentifytheobjectsviamultiplechoicequestions.Objectivemeasurementswereperformedoneachofthescenes,andthepredictiveabilityofthemeasurementonvisualintelligibilityispresented.

7865-27, Session 7

Supplemental subjective testing to evaluate the performance of image and video quality estimatorsA.R.Reibman,AT&TLabs.Research(UnitedStates);F.M.Ciaramello,CornellUniv.(UnitedStates)

Thesubjectivetestsusedtoevaluateimageandvideoqualityestimators(QEs)areexpensiveandtimeconsuming.Moreproblematic,themajorityofsubjectivetestingisnotdesignedtofindsystematicweaknessesintheevaluatedQEs.

Asaresult,amotivatedattackercantakeadvantageofthesesystematicweaknessestogainunfairmonetaryadvantage.Inthispaper,wedrawonsomelessonsofsoftwaretestingtoproposeadditionaltestingproceduresthattargetaspecificQEundertest.Theseproceduressupplement,butdonotreplace,thetraditionalsubjectivetestingproceduresthatarecurrentlyused.ThegoalistomotivatethedesignofobjectiveQEswhicharebetterabletoaccuratelycharacterizehumanqualityassessment.

7865-28, Session 7

On evaluation of video quality metricsM.Cadik,T.O.Aydin,K.Myszkowski,H.P.Seidel,Max-Planck-InstitutfürInformatik(Germany)

Weproposeandmakepubliclyavailableanewdatasetforevaluationofimage/videoqualitymetricswithemphasisonapplicationsincomputergraphics.Severalaspectswereinfluentialwhiledesigningthedataset:(i)inadditiontotheassessmetofthequalityofLDRvideos,theassessmentofhigh-dynamicrange(HDR)videos,aswellascomparingHDRvideoswithLDRvideosandviceversa,and(ii)theoutcomeofthesubjectiveexperimentintheformofdistortionmapsthatshowqualitypredictionasafunctionofspatialpositionwhichisespeciallyimportantforapplicationsincomputergraphics.Furthermore,weshowanexampleevaluationofrecentimageandvideoqualitymetricsthatweresuccessfullyappliedinthefieldofcomputergraphics.Thegoalofthisevaluationwastoexaminethecorrelationbetweentheobjectivequalitypredictionscomputedbythevideoqualitymetrics,andthesubjectiveresponsesobtainedbytheexperimentalprocedure.Tothatendtheproposeddatasetandthesubjectivestudyhavethefollowinguniquefeaturesoverpreviousstudiesonvideoqualityassessment:1)ThetestsetincludesLDR-LDR,HDR-HDR,andHDR-LDRreference-testvideopairswithvarioustypesofdistortions.2)ABrightSideDR37-PHDRdisplay(max.luminance~3000$cd/m^2)wasusedfordisplayingthevideos.3)Thesubjectswerenotaskedtoassessonlyanoverallqualityofthevideo,buttomarktheregionswheretheysawdifferencesbetweentestandreferencevideos,resultingindistortionmapssimilartothemetricoutcome.

7865-29, Session 8

Interactions of visual attention and quality perceptionJ.A.Redi,EURECOM(France);H.Liu,TechnischeUniv.Delft(Netherlands);R.Zunino,Univ.degliStudidiGenova(Italy);I.Heynderickx,PhilipsResearchNederlandB.V.(Netherlands)

Severalattemptshavebeenmadeinliteraturetointegratequalitymetricscomputationwithvisualsaliencyinformation,withcontrastingresults.Acarefuldesignoftheintegrationstrategyshouldreflectthemechanismsunderlyingtheinteractionbetweenimagequalityassessmentandvisualattention.Asubjectivestudy,basedontrackingeye-movementsduringqualityassessmentwasperformedtobetterunderstandthem.TheaimoftheexperimentwastoanalyzetheeffectofqualityevaluationontheattentiondeviationfromNaturalSceneSaliency(NSS),and,inparticular,whether,andifsohow,thisdeviationdependedonthedistortionkindand/oramount.Deviatedsaliency


IS&T /

ReturntoContents


mapswerederivedfromthescoringofdistortedimages,andthencomparedtothecorrespondingNSS,derivedfromthefree-lookingofhighqualityimages.ThestudyrevealedsomedifferencesbetweentheDeviatedandtheNaturalScenesaliencymaps,relatedtothequalityleveloftheimages.Thehigherthequality,themorethedeviatedattentionwasspreadinthebackgroundregions.Noevidentroleforthekindofdistortioninthesaliencychangeswasfoundinstead.Rather,thequalityassessmenttaskseemedtoprevailonthenaturalattention,forcingittodeviateinordertobetterevaluatetheimpactofartifacts.

7865-30, Session 8

Task dependence of visual attention on compressed videos: point of gaze statistics and analysisA.Mittal,A.K.Moorthy,W.S.Geisler,A.C.Bovik,TheUniv.ofTexasatAustin(UnitedStates)

Previously,wepresentedsomepreliminaryresultsonhowtaskinfluencesvisualattentionwhenasubjectviewscompressedvideosatvaryinglevelsofcompression.Ouranalysissuggestedthatthetaskdoesindeedinfluencefixationsandthatthepointsofgazeweredefinitelyinfluencedbytheamountofdistortionconditionedonthetask.However,thisanalysisdidnotprovidefurtherinsightintolowlevelstatisticalfeaturesthatattractvisualattention.Here,weanalyzetheeyemovementdatabycomputingstatisticsatpointofgazeforeachofthetwotasks-qualityassessmentandsummarization.Wecomputeluminance,root-mean-squared(RMS)contrast,motionandqualityatpointsofgazeandanalyzehowthesestatisticsbehaveacrosstasksaswellaswithintasksacrossvaryinglevelsofcompression/distortion.

7865-31, Session 8

Measuring contour degradation in natural image utility assessment: methods and analysisG.O.Pinto,D.M.Rouse,S.S.Hemami,CornellUniv.(UnitedStates)

Utilityestimationalgorithmspredicttheusefulnessofanimageforaparticulartasktobeperformedbyahuman.Theydifferfromqualityassessmentalgorithmsinthattheyshouldprovideaccurateestimatesevenwhenimagesareextremelyvisiblydistortedrelativetotheoriginal,yetarestillsufficientforthetask.OurgrouphaspreviouslyproposedtheNaturalImageContourEvaluation(NICE)algorithmforutilityestimation.ThisworkinvestigatesnovelwaystomeasurethedegradationofimagecontoursandappliestheresultstoimprovetheperformanceofNICE.NICEestimatesperceivedutilitybycomparingtheintensityedgemapsofboththereferenceanddistortedimagesusingaHammingdistance.Thisworkconsidersbothgradientorientationsandqualityassessmentalgorithmsspatiallylimitedtoedgesinsteadofcomparingedgemaps,andreplacestheHammingdistancewithdistancetransforms.TheperformanceofthesealternativeimplementationswasevaluatedontheCU-Nantesdatabase,whichprovidesperceivedutilityscoresforacollectionofdistortedimages.ByapplyingVIFsolelyonthecontourregionsweobtainedastatisticallysignificantimprovementwithrespecttothePearsoncorrelationmetric.Thisresultsuggeststhatwecanrecasttheimageutilityassessmentproblemastheproblemofimagequalityassessmentontheimagecontours.TheotherapproachesyieldedstatisticallyequivalentperformancetothestandardimplementationofNICE,suggestingthatutilityestimationisrobusttotheexactimplementationoftheedgedegradationmeasureandthedistancemeasure.

7865-32, Session 9

Evolution of attention mechanisms for early visual processingT.Müller,A.Knoll,TechnischeUniv.München(Germany)

Beinginspiredbythehierarchicalarchitectureofthehumanvisualprocessingapparatus,anproposedapproachintroducesanearlyprocessingsystembasedonsaliencyfeaturesforattentionalfiltering.Areascontainingahighsaliencyvaluesarethencomposedintoregionsofinterestforfurtherobjectrecognitionandtracking.Theareasaredeterminedfrombothbottom-upcomputation,e.g.indynamicenvironmentsconnectedblobsoftexturesmovinginauniformwaywithinthevisualfieldareconsideredsalient;andtop-downcomputation,wheresaliencyeffectsareeitherunconsciousthroughinherentmechanismslikeinhibition-of-return,orvolitionalthroughcognitivefeedback,e.g.whenanobjectmovesconsistentlyinthevisualfieldbutunexpectedlydisappears,attentionisdirectedtothisregioninordertoinvestigatethisunexpectedbehavior.

Inthispaperfurthermoreamulti-threadedextensiontothissaliencymechanismapplyingevolutionaryprocessesisproposed.Here,multiplesaliencyunitsareusedtoproducetheregionsofattention.Alloftheseunitshavedifferentparameter-sets.Now,apopulationofsaliencyunitscreatesregionsofattentionfirst,thentheresultsareevaluatedbycognitive/top-downfeedbackandfinallythegeneticmechanismisappliedtothepopulationunits:mutationandcloningofthebestperformersandextinctionoftheworstperformers.Thefitnessfunctionisdefinedbymeasurementoftheprobabilityfortheregionsbeingtask-relevant,i.e.containingrelevantobjectsforthehuman-robot-interactiontask.

7865-33, Session 9

Learned saliency transformations for gaze guidanceE.Vig,Univ.zuLübeck(Germany);M.Dorr,SchepensEyeResearchInstitute(UnitedStates)andUniv.zuLübeck(Germany);E.Barth,Univ.zuLübeck(Germany)

Thesaliencyofanimageorvideoregionindicateshowlikelyitisthatthevieweroftheimageorvideofixatesthatregionduetoitsconspicuity.Anintriguingquestionishowwecanchangethevideoregiontomakeitmoreorlesssalient.Here,weaddressthisproblembyusingamachinelearningframeworktolearnfromalargesetofeyemovementscollectedonreal-worlddynamicsceneshowtoalterthesaliencylevelofthevideolocally.Wederivesaliencytransformationrulesbyperformingspatio-temporalcontrastmanipulations(onaspatio-temporalLaplacianpyramid)ontheparticularvideoregion.Ourgoalistoimprovevisualcommunicationbydesigninggaze-contingentinteractivedisplaysthatchange,inrealtime,thesaliencydistributionofthescene.

7865-34, Session 10

On the relationship between selective visual attention and visual consciousnessN.Tsuchiya,C.Koch,CaliforniaInstituteofTechnology(UnitedStates)

Therelationshipbetweenattentionandconsciousnessisacloseone,leadingmanyscholarstoconflatethetwo.Idistinguishbetweenexogenous,saliency-driven,task-independentattentionandtop-down,endogenousandvoluntaryattention.InthefirstpartIwillsummarizepowerfulcomputationalmodelsofsaliency-drivenattentionthatcapturealargefractionofeyemovementsinnormalsubjectsinspectingnaturalscenes.Inthesecondpart,Iwillsummarizepsychophysicalevidencearguingthattop-downattentionandconsciousnessaredistinctphenomenathatneednotoccurtogetherandthatcanbemanipulatedusingdistinctparadigms.Subjectscanbecomeconsciousofanisolatedobject,orthegistofthescenein


IS&T /

ReturntoContents

thenearabsenceoftop-downattention.Conversely,subjectscanattendtoperceptuallyinvisibleobjects.Inparticular,Iwilldescribeafullfactorialstudyoftheinfluencesofattentionandconsciousnessonafterimageformation.Thesedataprovideclearevidencefordistinctiveinfluenceofattentionandconsciousnessonperception.

7865-35, Session 10

A gaze-contingent display to study contrast sensitivity under natural viewing conditionsM.Dorr,P.J.Bex,SchepensEyeResearchInstitute(UnitedStates)

Contrastsensitivityhasbeenextensivelystudiedoverthelastdecadesandtherearewell-establishedmodelsofearlyvisionthatwerederivedbypresentingthevisualsystemwithsyntheticstimulisuchassine-wavegratingsnearthresholdlevels.Naturalscenes,however,containamuchwiderdistributionoforientations,spatialfrequencycontent,andbothluminanceandcontrastvalues.Furthermore,humanstypicallymovetheireyestwotothreetimespersecondundernaturalviewingconditions,butmostlaboratoryexperimentsrequiresubjectstomaintaincentralfixation.

Weheredescribeagaze-contingentdisplaycapableofperformingreal-timecontrastmodulationsofvideoinretinalcoordinates,thusallowingustostudycontrastsensitivitywhendynamicallyviewingdynamicscenes.

OursystemisbasedonaLaplacianpyramidforeachframethatefficientlyrepresentsindividualfrequencybands.Eachoutputpixelisthencomputedasalocallyweightedsumofpyramidlevelstointroducelocalcontrastchangesasafunctionofgaze.

OurGPUimplementationachievesreal-timeperformancewithmorethan100fpsonhigh-resolutionvideo(1920by1080pixels)andasynthesislatencyofonly1.5ms.

Psychophysicaldatashowthatcontrastsensitivityisgreatlydecreasedinnaturalvideosandunderdynamicviewingconditions.Syntheticstimulithereforeonlypoorlyrepresentnaturalvision.

7865-36, Session 10

Analyzing complex gaze behavior in the natural worldJ.B.Pelz,T.Kinsman,K.M.Evans,RochesterInstituteofTechnology(UnitedStates)

Thehistoryofeye-movementresearchextendsbackatleastto1794,whenErasmusDarwin(Charles’grandfather)publishedZoonomia,includingdescriptionsofeyemovementsduetoselfmotion.Butresearchoneyemovementswasrestrictedtothelaboratoryfor200years,untilLandbuiltthefirstwearableeyetrackerattheUniversityofSussexandpublishedtheseminalpaper“Wherewelookwhenwesteer”[Land&Lee1994].Intheinterveningcenturies,welearnedatremendousamountaboutthemechanicsoftheoculomotorsystemandhowitrespondstoisolatedstimuli,butvirtuallynothingabouthowweactuallyuseoureyestoexplore,gatherinformation,navigate,andcommunicateintherealworld.

InspiredbyLand’swork,wehavebeenworkingtoextendknowledgeintheseareasbydevelopinghardware,algorithms,andsoftwarethathaveallowedresearcherstoaskquestionsabouthowweactuallyusevisionintherealworld.Centraltothateffortarenewmethodsforanalyzingthevolumesofdatathatcomefromtheexperimentsmadepossiblebythenewsystems.

WewilldescribeanumberofrecentexperimentsanddemonstrateSemantiCode,anewprogramthatsupportsassistedcodingofeye-movementdatacollectedinunrestrictedenvironments.

7865-37, Session 10

What your visual system sees where you are not looking: implications for imaging applicationsR.E.Rosenholtz,MassachusettsInstituteofTechnology(UnitedStates)

Whatistherepresentationinearlyvision?Considerableresearchhasdemonstratedthattherepresentationisnotequallyfaithfulthroughoutthevisualfield;representationappearstobecoarserinperipheralandunattendedvision,perhapsasastrategyfordealingwithaninformationbottleneckinvisualprocessing.Inthelastfewyears,aconvergenceofevidencehassuggestedthatinperipheralandunattendedregions,theinformationavailableconsistsoflocalsummarystatistics.Givenarichsetofthesestatistics,manyattributesofapatternmaybeperceived,yetpreciselocationandconfigurationinformationislostinfavorofthestatisticalsummary.Thisrepresentationimpactsawiderangeofvisualtasks,includingvisualsearchaswellasvisualcognitionofcomplexdisplays.Thistalkdiscussestheimplicationsforbothperception,andforimagingapplicationssuchasinformationvisualization.

7865-38, Session 10

Attention as a Bayesian inference processS.Chikkerur,MassachussetsInstituteofTechnology(UnitedStates)

DavidMarrfamouslydefinedvisionas“knowingwhatiswherebyseeing’’.Intheframeworkdescribedhere,attentionistheinferenceprocessthatsolvesthevisualrecognitionproblemofwhatiswhere.Thetheoryproposesacomputationalroleforattentionandleadstoamodelthatperformswellinrecognitiontasksandthatpredictssomeofthemainpropertiesofattentionatthelevelofpsychophysicsandphysiology.Weproposeanalgorithmicimplementation-aBayesiannetworkthatcanbemappedintothebasicfunctionalanatomyofattentioninvolvingtheventralstreamandthedorsalstream.Thisdescriptionintegratesbottom-up,feature-basedaswellasspatial(contextbased)attentionalmechanisms.Attentionalphenomenasuchsuchaspop-out,multiplicativemodulationandchangeincontrastresponse,whichhavebeendescribedintherecentliteratureasfundamentallydifferentandinsomecasesasconflictingfindings,arealldirectlypredictedbythesamemodel.WealsoshowthattheBayesianmodelpredictswellhumaneyefixations(consideredasaproxyforshiftsofattention)innaturalscenes,andcanimproveaccuracyinobjectrecognitiontasksinvolvingclutteredrealworldimages.Inbothcases,wefoundthattheproposedmodelcanpredicthumanperformancebetterthanexistingbottom-upandtop-downcomputationalmodels.

7865-39, Session 10

Statistical modeling of surprise with applications to predicting attention and gazeL.Itti,TheUniv.ofSouthernCalifornia(UnitedStates)

Noabstractavailable



IS&T /

ReturntoContents

Conference 7866: Color Imaging XVI: Displaying, Processing, Hardcopy, and ApplicationsMonday-Thursday24-27January2011PartofProceedingsofSPIEVol.7866ColorImagingXVI:Displaying,Processing,Hardcopy,andApplications

7866-01, Session 1

Image reconstruction on multi-primary displaysC.BrownEliott,Nouvoyance,Inc.(UnitedStates)


7866-02, Session 1

Adaptive color visualization for dichromats using a customized hierarchical paletteC.E.RodriguezPardo,G.Sharma,Univ.ofRochester(UnitedStates)

Colordisplaysystemshavebeengrowingrapidly,andtheemergenceofnewtechnologies,materialsandapplications,haveenableddisplaydesignerstoimprovetheobserverperceptionaswellasphysicalconstraintsinthedisplayperformance.Inparticular,multi-primarycolortechniqueshasbeenrecentlyexplored,showingadvantagesinexpandingthecolorgamut,wideningtheviewingangleandpowersaving,whencomparingwiththeconventionaldisplaysystemswithonlythreeprimaries.Theseworksassumeastandardtrichromatobserverforwhichthedisplayprimariesandotherparametersareoptimized.However,itisknownthatnearly10%ofthemalepopulationhascolordeficiencies.Althoughsomeworkhasbeenmadeatthestagesofimageprocessingandcomputervisioninordertoimprovetheperceptionoftheseobservers,fewresearchhasbeendevelopedinthedisplaypart.Theaimofthisworkittoimprovetheperceptionofcolorfordichromatobservers,byusingamulti-primarysystem,thatallowstochoosethebestcombinationoflightsourcesinordertomaintainproportionalthedifferencesincolorperceptionbetweennormalobserversandcolordeficiencyindividuals.

7866-03, Session 1

R/G/B color crosstalk characterization and calibration for LCD displaysR.Safaee-Rad,QualcommInc.(Canada);M.Aleksic,QualcommInc.(UnitedStates)

LCDdisplaysexhibitsignificantcolorcrosstalksbetweentheirred,greenandbluechannels(moreorlessdependingonthetypeofLCDtechnology).Thisproblem,ifitisnotaddressedproperly,leadsto(a)asignificantcolorerrorsintherenderedimagesonLCDdisplays,(b)significantshiftsinred,greenandblurprimariesand(c)asignificantgraytrackingproblem.Thetraditionalmethodforaddressingthisproblemhasbeenusinga3x3colorcorrectionmatrixinthedisplayprocessingpipeline.Experimentaldataclearlyshowsthatthislinearmodelforcolorcorrectionisnotsufficienttoaddresstheaboveproblems.Herein,itisproposedtousehigherorderpolynomialsforcolorcorrectioninthedisplayprocessingpipeline.Thispaperpresentsdetailedexperimentalresultsandcomparativeanalysisonusingpolynomialmodelswithdifferentordersforcolorcorrection.

7866-04, Session 1

Color gamut boundary optimization of wide gamut display devicesF.Lebowsky,STMicroelectronics(France)

High-endmonitorsbasedonLCDtechnologyincreasinglyaddresswidecolorgamutimplementationsfeaturingprecisecolorcalibrationwithinavarietyofdifferentcolorspacessuchasextendedsRGBorAdobeRGB.CombiningaLook-Up-TablemethodwithlinearinterpolationinRGBcomponentspaceusing3x3matrixmultiplicationprovidesoptimizedmeansoftonecurveadjustmentsaswellasindependentadjustmentofdeviceprimaries.Theproposedcalibrationmethodcompleteswithinseveralsecondscomparedtotraditionalcolorcalibrationprocedureseasilytakingseveralminutes.Inaddition,theusercanbegivensubjectivecontrolovercolorgamutboundarysettingsbasedondynamicgamutboundaryvisualization.Theproposedcomponentarchitecturenotonlyprovidesindependentcontrolover8colorverticesbutalsoenablesadjustmentinquantitiesof10-4offullamplituderange.Userdefinedcolorpatchescanbeadjustedmanuallywhilesimultaneouslytrackingcolorgamutboundariesandvisualizinggamutboundaryviolationinrealtime.Allthisprovidesaconvenientapproachtofinetuningtonecurvesandmatchingparticularconstraintswithregardtouserpreferences,forexamplespecificambientlightingconditions,acrossdifferentdevicessuchamonitorsandprinters.

7866-05, Session 2

Color correction for projected image on colored-screen based on a cameraD.Kim,T.Lee,KyungpookNationalUniv.(Korea,Republicof);M.Choi,DaeguPolytechnicCollegeUniv.(Korea,Republicof);Y.Ha,KyungpookNationalUniv.(Korea,Republicof)

Thispaperproposesacolorcorrectionmethodforimagesprojectedoncoloredsurfaces.Toachieveourobjectwithoutcharacterizationprocess,wereplacemeasurementdeviceforcharacterizationwithadigitalcamera.Weestimateacolorcorrectionmatrixbylinearregressionusinginputdigitalvalueswhichproducethesamecoloronbothwhiteandcoloredscreen.Differentlyfrompreviousmethods,theuseofgeneralstillcameraallowstomeasureregardlessofplaces.Inaddition,twocapturedimagesonwhiteandcoloredscreenwithramppatchesinformthecolorshiftfor9stepsofeachchannel,enablingaccurateconstructionofthetransformmatrix.Nonlinearityofcameracharacteristicsisalsoconsideredbyusingregressionmethodtoconstructatransformmatrix.Intheexperimentalresults,correctedimageusingtheproposedmethodoncoloredscreenhaveshownbetterperformancethanpreviousmethodsinbothobjectiveandsubjectiveevaluations.

7866-06, Session 2

Modeling LCD displays with local backlight dimming for image quality assessmentJ.Korhonen,N.Burini,S.Forchhammer,TechnicalUniv.ofDenmark(Denmark);J.M.Pedersen,Bang&OlufsenA/S(Denmark)

Traditionally,objectiveimageandvideoqualityassessmentmethodsoperatewiththenumericalpresentationofthesignal,andtheydonottakethecharacteristicsoftheactualoutputdeviceintoaccount.Thisisareasonableapproach,whenqualityassessmentisneededforevaluatingthesignalqualitydistortionrelateddirectlytodigitalsignalprocessing,suchascompression.However,thephysicalcharacteristicsofthedisplaydevicealsoposeasignificantimpactontheoverallperception.Inordertofacilitateimagequalityassessmentonmodernliquidcrystaldisplays(LCD)usinglightemittingdiode(LCD)backlightwithlocaldimming,wepresenttheessentialconsiderationsandguidelinesformodelingthecharacteristicsofdisplayswithhigh


IS&T /

ReturntoContents

dynamicrange(HDR)andlocallyadjustablebacklightsegments.Therepresentationoftheimagegeneratedbythemodelcanbeassessedusingthetraditionalobjectivemetrics,andthereforetheproposedapproachisusefulforassessingtheperformanceofdifferentbacklightdimmingalgorithmsintermsofresultingqualityandpowerconsumptioninasimulatedenvironment.WehaveimplementedtheproposedmodelinMatlabenvironmentandcomparedthevisualresultsproducedbythemodelagainstrespectiveimagesdisplayedonarealdisplaywithlocallycontrolledbacklightunits.

7866-07, Session 2

Content dependent selection of image enhancement parameters for mobile displaysY.Lee,C.Kim,Y.Kang,G.Kim,H.Kim,InhaUniv.(Korea,Republicof)

Mobiledevicessuchascellularphonesandportablemultimediaplayerwithcapabilityofplayingterrestrialdigitalmultimediabroadcasting(T-DMB)contentshavebeenintroducedintoconsumermarkets.Assizeandresolutionofmobiledisplaysincrease,marketdemandsforimprovedimagequalityaregraduallyincreasing.Variousimageenhancementtechniquesareappliedtomeettheneedsforimprovedimagequalityonmobiledisplays.Theyincludenoisereduction,contrastandsharpnessenhancementandcolorcorrection,etc.UnlikeTVapplication,therearestrictrestrictionsonmemoryresourcesandcomputationalcomplexityformobileapplications.Inthispaper,parametersforimageenhancementtechniquesareadaptivelydeterminedbasedonimagecontents.Minimizationofflickerduetosuddenscenechangeisalsoproposed.Experimentalresultsindicatethatdynamicselectionofimagecontrolparametersexhibitsbetterperformancethantraditionalimagingchainwithfixedparameters.

7866-08, Session 2

Saliency-driven black point compensationA.J.Lindner,EcolePolytechniqueFédéraledeLausanne(Switzerland);N.Bonnier,OcéPrintLogicTechnologies(France);S.E.Süsstrunk,EcolePolytechniqueFédéraledeLausanne(Switzerland)

Wepresentanovelframeworkforautomaticallydeterminingwhetherornottoapplyblackpointcompensation(BPC)inimagereproduction.Visuallysalientobjectshavealargerinfluenceondeterminingimagequalitythanthenumberofdarkpixelsinanimage,andthusshoulddrivetheuseofBPC.WeproposeasimpleandefficientalgorithmicimplementationtodeterminewhentoapplyBPCbasedonlow-levelsaliencyestimation.WeevaluateouralgorithmwithapsychophysicalexperimentonanimagedatasetprintedwithorwithoutBPConaCanonprinter.Wefindthatouralgorithmiscorrectlyabletopredicttheobservers’preferencesinallcaseswherethesaliencymapsareunambiguousandaccurate.

7866-09, Session 2

DIN 6164 for gamut mapping?U.Caluori,D.Küpper,K.Simon,EMPA(Switzerland)

Colorasaperceptualphenomenonis,todate,anonlyapproximatelyunderstoodresearchtopic.Ingeneral,colorspacesaddressspecificapplicationsorsituations,suchthattheymaynotbeadequateinothercontexts.Inthiswork,weinvestigatefourdevice-independentcolorspaces(colorordersystems)withregardtotheirsuitabilityforaspecificgamutmappingconcept.

WeimplementedthetransformationofcolorstoandfromCIELABandIPTbysymboliccomputation,andforMunsellandDIN6164bybuildinganefficientthree-dimensionalinterpolationstructure.WeencounteredseveraldifficultieswithDIN6164andMunsell,

likedefiningagooddistanceformula,artifactsthatoccurwhencompressingcolors,discontinuity,boundaryhandling,extrapolationandsampleintersection.Wedescribehowwetackledtheseproblemsandfinallyverifiedandevaluatedourimplementationbyconductingapsycho-visualexperimentonimagesprocessedwithoneofourlatestgamutmappingalgorithms.

PreliminaryresultsshowthatourmodifiedDIN6164spacesurprisinglyperformsbestintermsofimagequality,whichmotivatesfurtherinvestigationofDIN6164asaseriousalternativeinthiscontext.

7866-10, Session 3

Estimation of low dynamic range images from single Bayer image using exposure look-up table for high dynamic range imageT.Lee,Y.Ha,KyungpookNationalUniv.(Korea,Republicof);C.Lee,AndongNationalUniv.(Korea,Republicof)

HDR(highdynamicrange)imagingtechniquesguaranteeswiderdynamicrangeforimagesthanthosecapturedfromgeneralstillcamera.Usually,thesetechniquesemergeseveralLDR(lowdynamicrange)imageswithdifferentexposure.ItmeansthattheyneedadditionalprocessforcapturingLDRimages,asfollowstheycauseghosteffectformovingobjectsinHDRimages.Accordingly,thispapersuggestsamethodtoestimatearbitraryLDRimagesfromasingleBayerimageusingexposureLUT(look-uptable),consideringchanneldependency.Bayerimageprovidessufficientluminanceinformationwith14bitdataforeachchannelthanLDRimageswith8bit.WefirstconstructthreeLUTsforeachRGBchannelusingBayerimage.Ithastherelationshipbetweeninputsceneandoutputaverageluminance.Then,frominputimagecapturedbyautomode,correspondinginputdigitalvalueisestimatedbyusingcurrentexposureandLUTs.Next,targetexposureswhicharecorrespondedtotargetaverageluminancefromuserchoiceareestimatedusingestimateddigitalvalueandLUTs.Afterthat,finalLDRimageswithtargetexposuresareestimatedusinginputimageandLUT.Toimprovetheaccuracyofestimation,saturatedareaisestimatedbyconsideringchanneldependency.Inexperiments,highPSNRvaluesareobtainedbetweenestimatedandcapturedimages.

7866-11, Session 3

Flicker reduction in tone mapped HDR videosB.Guthier,S.Kopf,M.Eble,W.Effelsberg,Univ.Mannheim(Germany)

InordertodisplayaHDRvideoonaregularlowdynamicrange(LDR)screen,itneedstobetonemapped(TM).Agreatnumberoftonemappingoperatorsexist-alldesignedtotonemaponeimageatatime.UsingthemoneachframeofaHDRvideoindividuallyleadstoflickeringinthefullimage.

Inourwork,weanalyzethreetonemappingoperatorswithrespecttoflickering.Weproposeacriterionfortheautomaticdetectionofimageflickerbyanalyzingtheaveragepixelbrightnessofthetonemappedframe.FlickerisdetectedifthedifferencebetweentheaveragesoftwoconsecutiveframesislargerthanathresholdderivedfromStevens’powerlaw.Fine-tuningofthethresholdisdoneinasubjectivestudy.

Additionally,weproposeagenericmethodtoreduceflickering.Itisapplicabletoallparameterdriventonemappingoperators.Webeginbytonemappingaframewiththeparameterssettodefaultvalues.Iftheflickerdetectionreportsavisiblevariationintheframe’sbrightness,theparametersareadjustedandtheframeistonemappedagain.Asaresult,thebrightnessvariationissmoothedoutoverseveralframes,becominglessdisturbing.

Conference 7866: Color Imaging XVI: Displaying, Processing, Hardcopy, and Applications


IS&T /

ReturntoContents


7866-12, Session 4

Applying AR technology with a projector-camera system in a history museumK.Miyata,NationalMuseumofJapaneseHistory(Japan);R.Shiroishi,OchanomizuUniv.(Japan);Y.Inoue,BunkyoUniv.(Japan)

Inthisresearch,anAR(augmentedreality)technologywithprojector-camerasystemisappliedinahistorymuseumtoprovideuser-friendlyinterfaceandpseudohands-onexhibition.TheproposedsystemisadesktopsystemanddesignedforoldJapanesecoinstoenhancethevisitors’interestandmotivationtoinvestigatethem.Thesurfaceoftheoldcoinshasfinestructuresonbothsides,soitismeaningfultoshowthereversesidetothevisitorsforenhancingtheirinterestandmotivation.ThedetectionoftheARmarkersandrenderingoftheprocessedimageswereperformedusingARToolKit,andtheappearancesoftheoldcoinswerecalculatedbasedonthephotometricstereoalgorithm.TheuserscouldobservetheimagesoftheoldcoinswithchangingappearancefollowedbythemovementsoftheARmarkers.

TheproposedsystemcontributestodevelopanexhibitionmethodbasedonthecombinationsoftherealartifactsandtheARtechnology,anddemonstratedtheflexibilityandcapabilitytoofferbackgroundinformationrelatingtotheoldJapanesecoins.However,theaccuracyofthedetectionandtrackingofthemarkersandmoredetailedvisitorevaluationsurveyarerequiredtoimprovetheeffectivenessofthesystem.

7866-13, Session 4

Memory preservation made prestigious but easyR.Fageth,C.Debus,CeWeColorAG&Co.OHG(Germany);P.Sandhaus,CarlvonOssietzkyUniv.Oldenburg(Germany)

Preservingmemoriescombinedwithstorytellingusingeitherphotobooksformultipleimagesinordertotellstoriesandusinghighqualityproductssuchasimagesprintedoncanvasorimagesbehindacrylinordertouseitasprestigiouswalldecorationaresubstitutingmoreandmoreclassical4*6printsandclassicalsilverhalideposters.Digitalprintingviaelectrophotographyandinkjetissubstitutingmoreandmoreclassicalsilverhalidetechnologyasdominantproductiontechnologyforthesekindsofproducts.

Imagesusedinordertogeneratetangibleoutputarestoredinseverallocations(desktopand/oronlineingalleriesorsocialnetworks)ororiginatedatleastfromtwoandmoreimagetakingdevices.Thismakesthegenerationofacompellingoutputmorecomplicatedaswellastheconfirmationprocessbeforegeneratingtangibleoutputofthepeopleinvolvedinthedesignprocessisbecomingachallenge.

Thispaperdescribesauniqueapproachofcombiningdesktopbasedsoftwaretoinitiateacompellingprojectbutadditionallyuseonlinecapabilitiesinordertofinalizeandoptimizethatprojectonlineinacommunityprocess.Acomparisonoftheconsumerbehaviorbetweenonlineanddesktopbasedsolutionsforgeneratingphotobookswillbepresented.

Thepaperalsoanalyzestheuserbehaviorofgeneratingpages;howmanypagesarecontainingtextorclipartsandpre-definedstylesversusowndesignedproducts.Howmanyimagesaregeotaggedor/andoriginatedbymobilephones,pointandshootcamerasorDSLRs.

Additionallywedescribeanowndevelopedprocesstokeepthequalitystandardthesameoverdifferentfactoriesanddigitalprinters(electrophotographyandinkjet)suppliedbydifferentmanufacturers.Customersatisfactionwhileexpectationssimilarqualityofanimagetobeviewedonawallorseeninaphotobookisthedriverfortheverifiedparametersduringourproductionprocessesandthetestsetofimagesusedtoderivetheseparameters.

7866-14, Session 4

A method to estimate the UV content of illumination sourcesP.Green,Y.Chang,

TheneedtostandardisetheUVcontentofincidentilluminationhasledtobothISO3664andISO13655adoptingpreciserequirementsforthespectralpowerofilluminationsourcesinbothvisibleandUVregionsofadaylightsimulator.

ByvaryingthepowerofUVanddaylight-simulatingsources,andmeasuringtheincidentirradianceandthereflectedradiancefromaselectionoffluorescentandnon-fluorescentsubstrates,itwasfoundthatforagivenpaperthefluorescentradianceincreaseslinearlywithincidentUVirradiance.ItwaspossibletomodelthisrelationshipsothattheUVenergy(andhencetheUVcontentofthesource)couldbepredictedwithreasonableaccuracyfrommeasurementsofreflectancefactororreflectedradiance.

Twoparticularapplicationswherethisisofinterestare:

a)estimatingtheUVcontentofanilluminationsourceusedinviewingcoloursamples,fromareflectedradiancemeasurementofasample

b)estimatingtheUVcontentofanilluminationsourceusedinaspectrophotometerorothercolourmeasurementinstrumentwithaninternallamp.

WealsodescribehowtoestimateaUVMetamerismIndex(MI-UV)valueforcommonlamptypes,usingsomeassumptionsaboutthetypicalspectralpowerdistributionofsuchlampsintheUVregion,andcomparetheseestimatedMI-UVvaluesagainstthosecomputeddirectlyfromthelampspectralirradiance.

7866-15, Session 5

Knowledge exchange in the CREATE project: Colour Research for European Advanced Technology EmploymentC.E.Parraman,Univ.oftheWestofEngland(UnitedKingdom);A.Rizzi,Univ.degliStudidiMilano(Italy)

Thepresentationwillreviewa4yearEuropeanfundedprojectCREATE(ColourResearchforEuropeanAdvancedTechnologyEmployment),whichwasestablishedin2006.ThegroupcametogethertopromoteandexchangeresearchandknowledgethroughaseriesofconferencesandtrainingcoursestoresearchersworkinginEuropewhowereintheearlystagesoftheircareer.Thelong-termobjectivewastoaddressabroadrangeofthemesincolourandtodevelopwithartists,designers,technologistsandscientistsacrossdisciplinaryapproachtoimprovingcolourcommunicationandeducationandtoprovideaforumfordialoguebetweendifferentfields.Nowattheendofitsfunding,thispaperwillhighlightsomeofthekeymilestonesoftheproject.Moreover,havingcompletedasupplementaryworkshopeventinOctober2010,researchersconsiderednewthemesforafutureRe-CREATE.

7866-16, Session 5

Is it turquoise + fuchsia = purple or is it turquoise + fuchsia = blue?G.B.Beretta,N.Moroney,Hewlett-PackardLabs.(UnitedStates)

Thefirststepincommunicatingcoloristonameit.Thesecondstepiscolorsemiotics.Thethirdstepisintroducingstructureinthesetofcolors.Incoloreducationatalllevels,thisstructureoftentakestheformofformulas,likered+green=yellow,orturquoise+red=black.Inrecenttimes,JohannesItten’scolortheoryanditsassociatedcolorwheelhasbeenveryinfluential,mostlythroughitsimpactonBauhaus,althoughanumberofcolorordersystemsandcircleshavebeenintroducedoverthecenturies.


IS&T /

ReturntoContents

Studentsgetconfusedwhentheyaretryingtoformulatethecolornamearithmeticusingthestructureofcolorordersystemsandconceptslikecomplementarycolorsandopponentcolors.Suddenlyturquoise+fuchsia=purpleinsteadofblue;purpleandvioletbecomeblurred,andfinallythestudent’sheadexplodesunderepistemologicalpressuresofItten,Albers,Goethe,Newton,daVinci,andalltheothermonstersofcolorstructure.

Inthiscontributionweproposeasystematicpresentationofstructureincolor,fromcolortheoriestocolornaming.WestartfromtheconceptofcolorperceptionintroducedbydaVinciandworkourselvesthroughcolormeasurement,colorformation,andcolornaming,todevelopthebasisforarobustsystembasedontablelookupandinterpolation.

Onecauseofconfusionisthatcolornaminghasbeenquitelooseincolortheory,whereforexampleredcanbeusedinterchangeablywithfuchsiaandbluewithturquoise.Furthermore,commoncolortermsareintermingledwithtechnicalcolorantterms,forexamplecyanandturquoiseorfuchsiaandmagenta.Wepresenttheevolutionofafewcolorterms,someofwhichhaveexperiencedaradicaltransitionoverthecenturies.

7866-17, Session 5

Human vision based color edge detectionA.Kim,H.S.Kim,S.Park,DaejinUniv.(Korea,Republicof)

Edgedetectioncanbeofgreatimportancetosharpnessenhancementinvariousdigitalimagingapplicationssuchasdigitaltelevisionandcamera.Therefore,extractingmoreaccurateedgepropertiesissignificantlydemandedforachievingabetterimagequality.Invectorgradientedgedetection,absolutedifferenceofRGBvaluesbetweenacenterpixelvalueanditsneighborhoodvaluesareusuallyused,althoughsuchadevice-dependentcolorspacedoesnotaccountforhumanvisualcharacteristicswell.Thegoalofthisstudyistotestavarietyofcolordifferenceequationsandproposethemosteffectivemodelthatcanbeusedforthepurposeofcoloredgedetection.Asetof7colordifferenceequationsisselectedandimplementedinthisstudy.ThoseincludedeltaRGB,CIELABdeltaE,CMCdeltaE,Leedscolordifference,CIEDE94,CIEDE2000(dE00)andCIECAM02-UCSdeltaE(dECAM-UCS).Consequently,therewerenotsignificantperformancevariationsobservedbetweenthose7colordifferenceequationsforthepurposeofedgedetection.However,dE00anddECAM-UCSshowedslightlyhigherzscoresthantheothers.Observeraccuracywaslessthan20%valueofCVsotheagreementbetweentheallhumanobserversparticipatedinthisexperimentcanbethoughtofasreasonable.

7866-18, Session 5

Color universal design: analysis of color category dependency on color vision type (2)N.Kojima,M.G.Kamachi,Y.G.Ichihara,KogakuinUniv.(Japan);K.Ito,TheUniv.ofTokyo(Japan)

Thepresentstudyinvestigatesthetendencyofindividualstocategorizecolors.Humansrecognizecolorsbycategorizingthemusingspecificcolornames,suchasred,blue,andyellow.Whenanindividualwithacertaintypeofcolorvisionobservesanobject,theycategorizeitscolorusingaparticularcolornameandassumethatotherpeoplewillperceivethecolorinanidenticalmanner.However,therearemanyvariationsinhumancolorvisionasaresultofdifferencesinphotoreceptorsintheeye,includingredandgreenconfusion.Thus,anotherpersonwithadifferenttypeofcolorvisionmaycategorizeacolorusingacompletelydifferentname.Toaddressthisissue,weattemptedtodeterminethedifferencesintherangesofcolorthatpeoplewithdifferenttypesofcolorvisionobserve.ThisisanimportantsteptowardsachievingColorUniversalDesign,avisualcommunicationmethodthatisviewer-friendlyirrespectiveofcolorvisiontype.Herein,wereportonasystematiccomparisonamongindividualswithtrichromacy(C-type),protan(P-type)anddeuteran(D-type)colorvision.Thispaperisafollow-uptoSPIE-IS&T/Vol.7528752805-1.

7866-19, Session 6

Object classification by color normalization or calibration?W.Hans,D.W.Paulus,Univ.Koblenz-Landau(Germany)

Model-basedapproachestoobjectrecognitionrelyonshapeandcontourswhileappearance-basedapproachesuseinformationprovidedbytheobjectintensityorcolor.ColorhistogramsasanobjectcharacteristicsarecommonlyusedtosolvethistaskanddescribedindetailbySwainandBallard.RGBcolorvaluesformedbyacameradependheavilyontheimageformationprocess-especiallytheilluminationinvolved.Mainlyforthisreasoncolornormalizationalgorithmsareappliedtoestimatetheimpactofpositionandcoloroftheilluminationandeliminateoratleastminimizetheirinfluencetotheimageappearance.

TheKOPIDdatasetusedinourresearchconsistofseveralitemsoncartonboxeswithmanycolors.SincetheimagesinthissetcontainsaGretagMacbethColorCheckerc,anothercolornormalizationisapplicable:colorcalibration.WecompareseveralcolornormalizationprocedurestoacalibrationmethodproposedbyRaymondL.Lee,Jr.Byestimatingthespectralreflectancesofobjectsurfacesoneobtainacolorimetricallycorrectimagerepresentation.

Ourexperimentsperformseveralhistogramdistancemeasuresusedforhistogrambasedobjectclassification.Additionallywevarythenumberofbinsused,theorderofsomeprocessingsteps,andthedimensionalityofcolorhistogramstodetermineamostsuitableparametersettingforobjectclassification.

7866-20, Session 6

Contrast preserving color fusionJ.Kamenicky,B.Zitova,InstituteofInformationTheoryandAutomation(CzechRepublic)

Wewilladdresstheproblemofedgepreservingimagefusionforvisualizationandforprintingoftwointensityimagesfromdifferentmodalities.Themostimportantpointisthatinsteadofdegradingthecontainedinformationbyintensityoutputonly,wewillcomputeacolorimageinthewaythatgivesusbetterpossibilitytocontroledgepreservation.

Themostcommonapproachinthesesituationsistheuseofanalpha-blending,leadinginsomecasestoworsevisibilityofedgespresentinoneoftheinputimages.Theproposedmethodismeanttosolvethisissuebypreservingintensitychangesintheinputimagesequally,independentlyoftheothermodality.

Themainideaofthemethodisbasedontheperceptualcolordifference.A2Drectangularcolormappingschemeiscreatedinsuchaway,thatcolordifferencesasperceivedbythehumaneyeinallpointsarenearlythesame.Then,thismappingschemeisappliedtogeneratetheoutputcolor.

Theproposedmethodcanhelptodistinguishevenslightdifferencesinanyinputmodalitywithoutriskoflosingdetails.Modificationsoftheproposedmethodcanbederivedforspecialcaseswherecertainintensityintervalsaremoreimportantthanothers.

7866-21, Session 6

Evaluating the smoothness of color transformationsA.Aristova,Z.Wang,J.Y.Hardeberg,GjøvikUniv.College(Norway)

Colorimagequalityisanimportantfactorinvariousmediasuchasdigitalcameras,displaysandprintingsystems.Theemploymentofdifferentcolorimagingmedialeadstoaconstantproblemthateachdeviceproducescolordifferently.Itmakesvariousmanufacturersfocusonthetechnologytoachievesuccessfulcross-mediacolorreproduction.Inthiscasetheimagereproductionqualitydepends



IS&T /

ReturntoContents

onprocessesofdevicecharacterization.Thedevicecharacterizationandprofilingarecentralprocesseswhichallowpredictingaresultofdevicereproductionaccordingtotheknowninput,andprovidecommunicationofdevices.Colorlook-uptables(LUTs)arethemostcommonempiricalapproachfordevicecharacterization,andarethebasisforICCprofiles.Correct3DLUT-basedcolorconversionindevicecharacterizationisanimportantfactorforachievinghighqualityofthereproducedcolorimage.SuchfactorsasLUTssize,interpolationmethodsandunavoidablenoiseincolormeasurementprocessandunstableprintingprocessinfluenceonsmoothnessof3DLUT-basedcolortransforms,andmayresultintheappearanceofartifactsinthefinalreproducedimages.Itisquitecommontoevaluatethequalityofcolortransformsintermsofcolorimetricaccuracy,butsmoothnessisoftenneglected,eventhroughitsimportanceisnowgenerallyagreedon(Olson,1999).

Soanimportantprobleminthiscaseistofindawayofquantifyingtowhichextentdifferenttransformsproducesmoothornotsmoothoutputimages.TheevaluationofsmoothnessofLUTs-basedcolortransformswillallowavoidingundesirableresultsinimagecolorreproductionsystemssuchasartifactsanddistortionsofimagecontentandimprovingdevicecharacterizationprocessforachievingsmoothcolortransforms.Therearesomescientificstudiesdedicatedtoevaluatingsmoothnessofcolortransformsbuttheproposedalgorithmswereappliedandtestedonlyonwell-designed3D-LUTs,devicecharacterizationprocessandexperimentaldata.Sothesemetricsstillrequiretestingandevaluationusingcompleximagesandonprofilesobtainedduringdifferentmeasurementsandindifferentenvironments.

Anewmethodofevaluatingsmoothnessofcolortransformationswasproposedbasedonextensionofsecondderivativemethodsuggestedearlier(Green,2008).Thealgorithmisbasedonconsidering3DLUTsofICCprofiles(AToB#table)assetofcolorplanesintheprintercolorspaceandcorrespondingtothemvaluesinCIELABcolorspace.EachcolorplaneinCIELABcolorspacepresentssetofcolortransitionsinhorizontalandverticaldirections.ThesecondderivativeofdeltaL*,deltaa*anddeltab*betweenadjustedpointsofeachverticalandhorizontaltransitionsitwasfound.Statisticalestimationswereusedforderivinggeneralresultamongcolorplanesforprofiles.

Inthisresearchwehavealsoproposedapproachesusingimagedifferencemetricsforevaluatingsmoothnessofcolortransformations.Forthesegoals45ICCprofilesweregeneratedfrommeasurementswithdifferentrepeatabilityandfrommeasurementsofconsecutiveprintedchartsonsubstratesofthesamepapertypeforproviding3DLUTswithvariousnoisecharacteristics.Theprocessofprofilingwasdesignedtobeclosetoarealpracticalcase.Fourtestimagescontainingsmoothtransitionsofcolorswereconvertedusingtheprofilesforobtainingimageswithvaryingsmoothness.Apsychophysicalexperimentinvolving20observerswasconductedforevaluatingperceiveddifferenceinsmoothnessbetweensoftcopyoforiginalimageandsoftcopiesofimages’reproductionsusingacategoryscalefromperfectmatchtoworstmatchinsmoothness.

Theproposedmethodhaveshownbetterperformanceinpredictingsmoothnessofcolortransformationbyparticularprofiles-comparedwithpreviousmetricsbyGreen(2008),andKimetal.(2010).Full-referenceimagequalitymetricsSSIM,GSSIM,pixelwiseCIELABDeltaE,sCIELAB,Adaptivebilateralfilter,Edgesimilarity,andStructuralcontent-werecomparedforevaluatingdifferenceinsmoothnessbetweenoriginalandreproducedimages.GSSIMandStructuralcontenthaveshownhigherPearsoncorrelationwithvisualjudgmentsandrepresentationthantheothermetrics.

7866-22, Session 6

High capacity image barcodes using color separabilityO.Bulan,G.Sharma,Univ.ofRochester(UnitedStates);B.Oztan,RensselaerPolytechnicInstitute(UnitedStates)

Two-dimensionalbarcodesarewidelyusedforencodingdatainprinteddocuments.Inanumberofapplications,thevisualappearanceofthebarcodeconstitutesafundamentalrestriction.Inthispaper,weproposehighcapacitycolorimagebarcodesthatencodedatainan

imagewhilepreservingitsbasicappearance.Ourmethodaimsathighembeddingratesandsacrificesimagefidelityinfavorofembeddingrobustnessinregionswherethesetwogoalsconflictwitheachother.Themethodoperatesbyutilizingcyan,magenta,andyellowprintingchannelswithelongateddotswhoseorientationsaremodulatedinordertoencodethedata.Atthereceiver,byusingthecomplementarysensorchannelstoestimatethecolorantchannels,dataisextractedineachindividualcolorantchannel.Inordertorecovererrorsintroducedinthechannel,errorcorrectioncodingisemployed.Ourresultsindicatethattheproposedmethodcanachieveembeddingratesoftraditionaltwo-dimensionalbarcodeswhilepreservingtheappearanceofthebaseimage.

7866-23, Session 7

The color side of darkR.Bala,XeroxCorp.(UnitedStates)

Noabstractavailable

7866-24, Session 7

What a bad signal from this strange deviceA.Rizzi,Univ.degliStudidiMilano(Italy)

Noabstractavailable

7866-25, Session 7

HDR imaging and color constancy: Two sides of the same coin?J.J.McCann,McCannImaging(UnitedStates)

DarkSideofColor

Atfirst,wethinkthatHighDynamicRange(HDR)imagingisatechniqueforimprovedrecordingsofsceneradiances.Manyofusthinkthathumancolorconstancyisavariationofacamera’sautomaticwhitebalancealgorithm.However,oncloserinspection,glarelimitstherangeoflightwecandetectincamerasandretinas.Allsceneregionsbelowmiddlegrayareinfluenced,moreorless,bytheglarefromthebrightscenesegments.Insteadofaccurateradiancereproduction,HDRimagingworkswellbecauseitpreservesthedetailsinthescene’sspatialcontrast.Similarly,humancolorconstancy,alsooncloserinspection,dependsonspatialcomparisonsthatsynthesizeappearancesfromallthescenesegments.CanspatialimageprocessingplaysimilarprinciplerolesinbothHDRimagingandcolorconstancy?

7866-26, Session 7

ICC profiles: are we better off without them?G.B.Beretta,G.J.Dispoto,E.Hoarau,I.Lin,J.Zeng,Hewlett-PackardLabs.(UnitedStates)

BeforeICCprofiles,adevice-independentpagedescriptiondocumentwouldencodeallcolorinadeviceindependentCIEspacelikeCIELAB.Whenthedocumentwastobeprinted,thepresspersonwouldmeasureatargetandcreateacolortransformationfromtheCIEcoordinatestodevicecoordinates.Forofficeandconsumercolorprinters,thecolortransformationforasstandardpaperwouldbehardwiredintheprinterdriverortheprinterfirmware.

Thisprocedurehadtwodisadvantages:thecolortransformationsrequireddeepexpertisetoproduceandwherehardtomanage(thelattermakingthemhardtoshare),andtheimagedatawastransformedtwice(frominputdevicetocolorimetricandthentooutputdevicecoordinates)introducingdiscretizationerrorstwice.ThefirstproblemwassolvedwiththeICCprofilestandard,andthelastproblemwas



IS&T /

ReturntoContents

solvedbystoringoriginalthedevicedependentcoordinatesinthedocument,togetherwithaninputICCprofile,sothecolormanagementsystemcouldfirstcollapsethetwoprofilesandthenperformasinglecolortransformation.

Unfortunately,thereisawidevarietyinthequalityofICCprofiles(seefigureathttp://www.mostlycolor.ch/2010/04/color-errors-in-printing.html).Evenworse,therealnightmareisthatquitefrequentlytheincorrectICCprofilesareembeddedinpagedescriptiondocumentsorthecolormanagementsystemsapplythewrongprofiles(seepaperdescribedinhttp://www.mostlycolor.ch/2010/04/vanity-publishing.html).

Forconsumerandofficeprinters,thesolutionistoforgoICCprofilesandreduceeverythingtothesinglesRGBcolorspace,soonlyoneprofileisrequired.However,thesRGBqualityisinsufficientforprintsolutionproviders.HowcanamodernprintworkflowsolvetheICCprofilenightmare?

7866-27, Session 7

Green halftoning: Can less be more?J.P.Allebach,PurdueUniv.(UnitedStates)

Noabstractavailable

7866-28, Session 7

Can displays go wild?G.G.Marcu,AppleInc.(UnitedStates)

Noabstractavailable


Spectral reflection and transmission prediction model of halftone image on fluorescent supportsG.Shi,Y.Zhang,J.Chen,W.Ni,JiangnanUniv.(China)

Anewmodelwhichpredictsthereflectanceofhalftoneimageonfluorescentsupportsisestablishedinthisarticle.Thereflectedlightfromfluorescentsupportsisdividedintotwoparts:theprimarystreamswhichconsistoforiginallyincidentlightandthefluorescentstreamswhicharecreatedbyabsorptionoftheUVlights.Byanalyzingthedifferenttransmissionpathsoftheprimarylightsflowandthefluorescenceflowinink-layerandpaper,thetotalreflectanceformulatopredictthecolorsprintedonfluorescentsupportsisderived.Besidestheattenuationoftheprimarysteamsandthefluorescentstreamsininklayer/paperbulkareconsidered,theink-spreadingmodelwithwhichwecanderivetheeffectivedotcoverageisaccountedfor.Itcanimprovetheaccuracyofthepredictionmodelafterusingtheeffectivedotcoverageinthetotalreflectanceformula.Withsuchacalibratedpredictionmodel,wecanpredictthetotalreflectanceofanyhalftoneimageonfluorescentsupports.ThenweusetheMatlabsoftwaretomakedatasimulationtoverifytheaccuracyofthemodelwederived.Atlast,tworeflectancecurvesweregeneratedandit’sprovedthatthenewmodelsignificantlyimprovetheaccuracyofprediction,comparedwiththepreviousmodel.


The transmission of light affect the color reproduction of plastic printJ.Chen,Y.Zhang,G.Shi,Z.Xu,JiangnanUniv.(China)

Weanalyzethatthediffuselighttransmissionoftheplasticprintingaffectsthecolorreproductionoftheproduct.Thepaperuseskubelka-munkmodelandradiativetransfertheory,andconsiderstheimpact

oftheplastic-printedfilmandtheink-colorfactorstostudythecolorreproductionofplasticprint.Inthispaper,wedonotconsiderthewhiteinkthatwasprintedonthebottomoftheplasticsubstratefirstly(thentheplasticsubstratewascoatedwithcolorinks),andinsideprintingoftheplastics.Weonlyconsiderthetransmissionoflightintheprintingprocesswasusedinthehierarchicalthinking,andconsiderthedifferentopticalpropertiesofheinklayerandtheplasticsubstrate.Thepaperhasimportantsignificanceonthesecurityofprintedplastic.


Reflectance model of plastic substrate halftone image based on Markov chainW.Ni,Y.Zhang,G.Shi,J.Huang,JiangnanUniv.(China)

Markovchainexpressestheconditionprobabilityofarandomsequence.Itisrelatetothelateststate,isnotrelatetothepreviousstate.TheuseofMarkovchainsprovidestwomainadvantagescomparedwiththeclassicalmethodofexpressingmultiplereflection-transmissionprocessesbygeometricseries.Thedistanceofthelightscatteringlandscapeorientationinlowerbifaceismuchlargerthanthesizeofdotwhilescreeninghighfrequency.Sowejustconsideredthesituationthatmultilayerspecimenscomposedoflayershavingdistinctrefractiveindicesandexpandedthemathematicmodel.Thetransfermatrixofhalftonehomochromousimageistheadditionofthetransfermatrixofeverycolorandblanksubstrate.Weobtainthereflectivitymodelofhalftoneplasticcolorpresswork.Thismathematicmodelisimportanttothedigitalworkflowandautocontroldetectionofpresswork.


Color image segmentation on region growing and multiscale clusteringW.Wang,HenanPolytechnicUniv.(China)

Thispaperpresentsacolorimagesegmentationmethodbycombiningregiongrowingandcolorclusteringalgorithms.Thismethodconsidersthebothcolorandlocationinformationinatransformedcolorspace.Aftermulti-scaleclustering(MSC),itdoesaspatialprocessingregiongrowing.MSCcanperformbetterinconqueringtheover-segmentproblemthanequaldistanceclustering.ComparedwiththepreviousmethodsonlydependedMSC,theregiongrowingcanenhancetheabilityofnoisesuppression.Thismethodinheritstheideathatoperatesclusteringfirstandthencarriesoutspatialprocessing.Bothclusteringalgorithmandspatialprocessingmethodareimproved,sothismethodcangetmoresatisfiedresults.

Thispaperpresentsacolorimagesegmentationmethodbycombiningregiongrowingandcolorclusteringalgorithms.Thismethodconsidersthebothofcolorinformationandlocationinformationinatransformedcolorspace.AftertheMSC,itdoesaspatialprocessingregiongrowing.MSCcandobetterinconqueringtheover-segmentproblemthanequaldistanceclustering.ComparedwiththepreviousmethodonlydependedMSC,theregiongrowingcanenhancetheabilityofnoisesuppression.Thewholealgorithmdoesnotneedprioridataandcanfindappropriatenumberofclustersadaptively.

Ofcourse,therearemanydeficienciesinthisalgorithmneedtobeimproved.Forexample:(1)whenfixonthecorepixel,thesizeofregionDisnotonlyassociatedwiththesizeoftheimage,butalsoinvolvedwithattentiontodetailneglect.Ifweneedtoneglectmoredetails,thesizeofregionDwillbebigger;and2)intheMSC,oneshouldjoinsomespecialprocessingbeforeandaftertheclusteringtoreducetheamountofcalculation.


Advanced spectral reflectance prediction model for color printsD.Tian,Q.Wang,Y.Zhang,JiangnanUniv.(China)



IS&T /

ReturntoContents

Wepresentedamodelthatenablethecharacterisationofapressinaenvironmentwithfrequentlychanginginks.Inthestudy,papersubstrateisassumedtobeaperfectdiffuser,i.e.,theintensityscatteredbythepaperfollowsLambertlaw,thecolorantlayerislowscattering.Thespectralreflectancepredictionmodelisbasedonadescriptionofthemultiplereflectionsoflightbetweenthepapersubstrateandtheprint-airinterfaces.Inourexperiment,wetestedourmodelonanoffsetprintingmachineandobtainedanaccuracyintermsofrootmeansquareerrorof0.58%andof1.21dE.Goodagreementwasfoundbetweenthesimulationsandboththeanalyticalandmeasuredreflectancesfortheprintedpatches.Themodelonlyneedsasinglepintofatargetandaspectralmeasurementofthenewinksetforthefutureprintjob.Theperformanceofthesystemliesatabout1-2dEfromthemeasurementintherootmeansquareerror.Themodelcouldthereforebeusedinprintingsystemstoprintmachinecharacterize.Itcouldalsobeusedtopredictthecolorofimagesasafunctionofilluminationandviewingangles.


Color reproduction performance of halftone color image printed on paper substrateW.Ni,Y.Zhang,G.Shi,J.Huang,JiangnanUniv.(China)

Wecanobtainthelightintensityandreflectivitywhichsendupfromscreendotsorthepaperamongscreendotsafteranalyzinginklayerprobabilitytransfermatrixwhenthesourceanddetectorareonthetop,inklayerprobabilitytransferinversematrixwhenthesourceisinthebottomanddetectorisonthetop,paperprobabilitytransfermatrixwhenthesourceanddetectorareonthetopandpaperprobabilitytransferinversematrixwhenthesourceisinthebottomanddetectorisonthetopsynthetically.BytherelationshipbetweenPi(a)andPp(a),wecangetthereflectivityofpresswork.

Inthenewmodel,theeffortofinkspreading(physicsdotgain),inkpenetrationandopticsdotgainareconsidered.Fromournewmodel,thereflectivityofhomochromyhalftonescreendotcanbeenobtained.Whenmultilayerinkcombined,threekindsofinkcanformscreendotsof8kindsofcolorscalledNeugebauerPrimaries.Ifoverprintscreendotareasatisfiesstatisticallyindependent,wecangainspectrumreflectivityofcolarhalftonepresswork.Thismathematicmodelisimportanttothedigitalworkflowandautocontroldetectionofpresswork.


Color image segmentation on region growing and multiscale clusteringJ.Sun,W.Wang,Z.Jia,HenanPolytechnicUniv.(China)

Thispaperpresentsacolorimagesegmentationmethodbycombiningregiongrowingandcolorclusteringalgorithms.Thismethodconsidersthebothcolorandlocationinformationinatransformedcolorspace.Aftermulti-scaleclustering(MSC),itdoesaspatialprocessingregiongrowing.MSCcanperformbetterinconqueringtheover-segmentproblemthanequaldistanceclustering.ComparedwiththepreviousmethodsonlydependedMSC,theregiongrowingcanenhancetheabilityofnoisesuppression.Thismethodinheritstheideathatoperatesclusteringfirstandthencarriesoutspatialprocessing.Bothclusteringalgorithmandspatialprocessingmethodareimproved,sothismethodcangetmoresatisfiedresults.

Thispaperpresentsacolorimagesegmentationmethodbycombiningregiongrowingandcolorclusteringalgorithms.Thismethodconsidersthebothofcolorinformationandlocationinformationinatransformedcolorspace.AftertheMSC,itdoesaspatialprocessingregiongrowing.MSCcandobetterinconqueringtheover-segmentproblemthanequaldistanceclustering.ComparedwiththepreviousmethodonlydependedMSC,theregiongrowingcanenhancetheabilityofnoisesuppression.Thewholealgorithmdoesnotneedprioridataandcanfindappropriatenumberofclustersadaptively.

Ofcourse,therearemanydeficienciesinthisalgorithmneedtobe

improved.Forexample:(1)whenfixonthecorepixel,thesizeofregionDisnotonlyassociatedwiththesizeoftheimage,butalsoinvolvedwithattentiontodetailneglect.Ifweneedtoneglectmoredetails,thesizeofregionDwillbebigger;and2)intheMSC,oneshouldjoinsomespecialprocessingbeforeandaftertheclusteringtoreducetheamountofcalculation.


Regression based characterization of color measurement instruments in printing applicationsP.Nussbaum,J.Y.Hardeberg,GjøvikUniv.College(Norway)

Inthecontextofprintqualityandprocesscontrolcolorimetricparametersandtolerancevaluesareclearlydefined.Although,calibrationproceduresarewelldefinedforcolormeasurementinstruments,inprintingworkflowusingmorethanonecolormeasurementinstrumentmeasuringthesamecolorwedgecanproduceobviouslydifferentresults.Incertainsituationswhereoneinstrumentgivesvalueswhicharejustinsidethegiventolerancesandthesecondmeasurementinstrumentsproducesvalueswhichexceedsthepredefinedtoleranceparametersthequestionariseswhethertheprintorproofisapprovedornotaccordingtostandardparameters.

Theaimofthispaperwastofindanappropriatemodeltocharacterizecolormeasurementinstrumentsforprintingapplicationstoreducecolordifferencesduetoinstrumentuncertainties.Themethodusedisderivedfromcolormeasurementinstrumentcharacterizationmethodswhichhavebeenappliedbyperformingthepolynomialregressionwithleastsquaretechnique.Sixcolormeasurementinstrumentswereusedmeasuringcolorpatchesofacontrolcolorwedgeonthreedifferenttypesofsubstrates.Thecharacterizationfunctionswerederivedusingpolynomialregressionandthenonlinearoptimizationroutine,basedonthetrainingsetof14colorreferencepatchesandthecorrespondingcolorimetricmeasurementsobtainedbythemeasurementinstruments.Thederivedfunctionsarethenusedtopredictthecolorimetricvaluesof46colorwedgepatchesindependentofthetrainingset.Theestimatedcolorimetricvaluesfromoneinstrumentwerethencomparedtotheestimatedcolorimetricvaluesfromadifferentmeasurementinstrument.Comparingthecolordifferencesoftherawmeasurementdataobtainedbytwoinstruments,theappliedcharacterizationmodelwasreducingthecolordifferencessignificantdependentontheinstrumentcombination(productfamilies).


Printing anaglyph maps optimized for displayH.Zeng,Hewlett-PackardCo.(UnitedStates);R.Zeng,LimingUniv.(China)

Althoughanaglyphshaveabigadvantagethattheycanbepresentedusingtraditionalsinglechannelmediasuchasprint,film,display,etc.,amediatypemustbedeterminedasapairofviewsiscombinedintoasingleimagetominimizeretinalrivalryandstereocrosstalk.Mostofanaglyphmapsandmaptoolsareoptimizedfordisplayandassumedusingred-cyanfilteredglassesforviewing.Duetothelargedifferencebetweenadisplaygamutandaprintergamut,redandcyancolorsthatareusedtoseparatetheleftviewandtherightviewarechangedconsiderablyastheyaremappedfromadisplaycolorspacetoaprintercolorspaceforprintingandresultsinseriousretinalrivalry.Asolutionusingaspecialgamutmappingmethodtopreservetherelativerelationshipofcyanishandreddishcolorswasdevelopedtogamutmapcolorsfromdisplaytoprinter.Andthecolorcharacterizationtobalanceneutralcolorsforspecificred/cyanglassesisappliedtofurtherimprovethecolorappearance.



IS&T /

ReturntoContents


A restoration method for book scan imagesH.Ohk,S.H.Kim,D.Choi,SamsungElectronicsCo.,Ltd.(Korea,Republicof)

Whenabounddocumentsuchasabookisscannedorcopiedwithaflat-bedscanner,therearetwokindsofdefectsinthescannedimage;thegeometricandphotometricdistortion.Therootcauseofthesetwodefectsistheimperfectcontactbetweenthebooktobescannedandthescannerglassplate.Thelonggapbetweenthebookcenterandtheglassplatecausestheopticalpathfromthesurfaceofthebookandtheimagingunit(CCD/CIS)tobedifferentfromtheoptimalcondition.Themaindistortionofphotometriconeis“ShadowDistortion”.Nearthespine,reflectanceoflightisreducedbecauseofthedistancebetweenbooksurfaceanddocumentglassandthecurvatureofbooksurface.Anditisthemainreasonof“shadowdistortion”.Thisshadowdistortioncanmakespineregionentirelyblackincopyprocess.AndGeometricdistortionoccursbecauseofstructureoflensandCCD.Inscanner,thereislensatthemiddleofCCDorCIS.Ifascanobjectislaidondocumentglass,itslengthatCCDismeasuredcorrectly.Butifascanobjectisnotcontactedwithdocumentglass,itslengthisshorterthanoriginalone.

Wesuggestamethodforrestoringbounddocumentscanimageswithoutanyadditionalinformationorsensor.Wecorrectthebounddocumentimagesbasedontheestimationoftheboundaryfeatureandbackgroundprofile.BoundaryFeatureisobtainedaftercalculatingandanalyzingtheMinimumBoundaryRectanglewhichenclosesthewholeforegroundcontentswithminimumsize.Fromtheboundaryinformationwecanestimatesomeinformation-foldingpointposition,leftandrightpageinformation,skewangleofeachpageandsoonandtheextractedfeatureisusedforcorrectinggeometricdistortionde-skew,warping,andpageseparation.Backgroundprofileisestimatedfromthegradientmapanditisutilizedtocorrectphotometricdistortion;exposureproblem.Experimentalresultsshoweffectivenessofourproposedmethod.

Inthispaper,weproposeasolutionforcopyingorscanningthick,bounddocument.Thissolutioncontains“Pageseparating”,“ShadowDistortionCorrecting”,“SkewCorrection”and“PerspectiveDistortionCorrection”.Pageseparationcutseachpageofbookautomaticallyand“ShadowDistortionCorrection”correctsshadowedareanearthespinregionand“PerspectiveDistortionCorrection”warpsinputimagewitheffect.

7866-29, Session 8

Soft proofing of printed colours on substrates with optical brightening agentsN.S.Parab,P.J.Green,LondonCollegeofCommunication(UnitedKingdom)

TheappearanceofcoloursprintedonsubstrateswithopticalbrighteningagentshasbeenstudiedwithhelpofacolourmatchingexperimentwheretheobserversmatchedacolourpatchdisplayedonaLCDmonitor,byadjustingitsL*a*b*values,toanothercolorpatchprintedoutonpaperviewedundervaryingamountofUVcontentinlightingconditionintheviewingbooth.Acustomisedviewingboothwasbuiltforthispurposeandsubstrateswithvaryingamountofopticalbrightnerswereconsideredforthestudy.

AmodelbasedonCIECAM02andascalingtechniquehasbeendevelopedtopredicttheperceivedcolourmatchonaLCDdisplay,ofcoloursprintedonsubstrateswithopticalbrightnersandviewedundertheviewingboothwithvaryingamountofUVcontentintheviewingillumination.Accordingtotheobtainedresults,theappearanceofthecoloursprintedonsubstratescontainingopticalbrightnersvariedwithvariationintheamountofUVcontentintheviewingillumination.ThedevelopedmodelgavegoodpredictionoftheXYZtristimulusvaluesfortheperceivedmatchontheLCDdisplayfromtheXYZtristimulusvaluesoftheprintedcoloursonthesubstratewithacceptable∆Eab.ThisshowsthatCIECAM02canbeeffectivelyusedforsoftproofing.

7866-30, Session 8

Ghostscript color managementM.J.Vrhel,R.P.Johnston,ArtifexSoftwareInc.(UnitedStates)

Ghostscriptisawellknownopensourcedocumentconversionengineintroducedin1986byL.PeterDeutsch.Initially,GhostscriptwasdesignedtorenderPostScriptdocumentsbuthasgrowntohandleconversionsamongstseveralhighlevelvectorlanguagesincludingPostScript(PS),PDF,XPS,SVGandPCL.Today,GhostscriptisfoundoneveryLinuxsystemandisstillfreelyavailableforusersandcommercialdevelopmentunderGPLlicensing.

TheinitialdesignofGhostscriptwaswellbeforethedefinitionoftheICCformatandlikelyevenbeforetherewasmuchthoughtaboutdigitalcolormanagement.WhenPScolormanagement(PCM)wasintroducedbyAdobe,itwasincorporatedintoGhostscriptandbecametheprimarymethodtoachievemanagedcolorwithinGhostscriptthroughtheuseofcolorrenderingdictionaries(CRDs)andcolorspacearrays.ICCsupporttoGhostscriptwasaddedwiththesupportofPDF1.3butthesolutionwasrestrictedinaperformancesenseduetothearchitecturerelyinguponPScolormanagement.

ThispaperreviewsasignificantupdatetothecolorarchitecturewithinGhostscript.TheICC-baseddesignoperatesefficientlyinGhostscript’smulti-threadedrenderingenvironment,allowseasyinterfacingofexternalCMMsandincludesconversionofPSandPDFcolorobjectstoICCobjects.

7866-31, Session 8

Color control of a lighting system using RGBW LEDsM.Tanaka,T.Horiuchi,S.Tominaga,ChibaUniv.(Japan)

Alightingsystemisproposedtorenderobjectsunderavarietyofcoloredilluminations.ThesystemisconstructedwithaLEDunit,whitediffusionfilters,dimmers,andapersonalcomputerasacontroller.TheLEDunitiscomposedoffourkindsofcolorLEDlampswhichare12red(R),14green(G),12blue(B)and10white(W)colors.WedeterminethedigitalcontrolsignalsoftheRGBWlightsforgeneratingthecoloredlightwiththetargetXYZtristimulusvalues.Thenwehaveamappingproblemfromthe3Dcolorspaceofthetristimulusvaluestothe4DcontrolspaceoftheRGBWdigitalvalues.WedevelopaneffectivealgorithmbydecomposingthetargetXYZvaluesinto(XYZ)rgbforRGBlightsand(XYZ)wvaluesforWlights.WhenacoloredlightwiththetargettristimulusvaluescanbegeneratedwithoutthewhiteLEDlights,theconventionalXYZ-RGBcolorcoordinateconversionisusedforobtaininginputRGBvaluesonly.ForahighluminancerangeintheXYZspace,theXYZ-RGBWcoordinateconversionisusedfordetermininginputRGBWdigitalvalues.TheaccuracyofthecoloredlightisexaminedwithregardtotheCIEcolordifference.

7866-32, Session 9

Brightness contrast under high surround luminance levels: psychophysical data vs CIECAM02Y.S.Baek,H.S.Kim,S.Park,DaejinUniv.(Korea,Republicof)

ThisstudyaimstoevaluatebrightnesscontrastcalculatedbyCIECAM02andcomparetheperformancewiththepsychophysical.Thesurroundconditionswerechanged7levelsfromDarktoOverBright(2087cd/m2).Forthisstudy,sixneutralcolorsusedastestimagesuniformlyfilledtheentiredisplayscreen.Psychophysicalexperimentswerecarriedouttoinvestigatetherelationbetweenperceivedbrightnessandluminanceusingmagnitudeestimationmethod.ThereferencebrightnesswassetwhitetestimageunderAverage3(200cd/m2).Toinvestigateperceivedimagecontrast,theMichelsoncontrastwascalculatedusingperceivedbrightnessofwhiteandblacktestimage.Asaresult,wefoundoutthattheperceivedimagecontrastishighestatAverage3surroundcondition.Consequently,brightness



IS&T /

ReturntoContents

contrastincreasesassurroundluminanceincreasesbutitdecreasesfromaveragesurroundconditiontooverbrightsurroundcondition.Furthermore,MichelsoncontrastwascomputedusingbrightnessofCIECAM02.BecauseCIECAM02considersonlythreesurroundconditions,ithassamebrightnesscontrastregardlessofthevariationinthesurroundluminance.Consequently,thisshowsdifferentresultwithourresult.Highersurroundluminancelevelsshouldbetakenintoaccount.

7866-33, Session 9

LabRGB: optimization of bit allocationF.Nakaya,FujiXeroxCo.,Ltd.(Japan)

Spectraldistributioncanbewrittenasalinearcombinationofeigenvectorsandtheeigenvectorsmethodgivestheleastestimationerror,buteigenvectorsdependonasampleselectionofpopulationandencodingvalueshavenophysicalmeaning.RecentlyreportedLabPQRistoconveyphysicalvalues,butstillisdependentonasampleselectionofpopulation.Thus,LabRGB,wasproposedin2007.LabRGBistoprovide“sampleselectionofpopulation”freespectralencoding/decodingmethods,whichconsistsofsixuniquetrigonometricbasefunctionsandphysicallymeaningfulencodingvalues.LabRGBwasappliedtotherealmultispectralimagesandshowedagoodperformanceinspectralaswellascolorimetricestimation.Inthispaper,theallocationofabitdepthtotheweightingfactorsisexaminedintermsofspectralandcolorimetricdistanceofnearestneighbors.Theoptimumwaytominimizetheunusablecombinationofweightingfactorsisobtainedbyusingthecorrelationoftheweightingfactors.Theoptimumwaytominimizethespectralandcolorimetricdistanceofnearestneighborsisalsoobtainedbyusingthenonlinearmappingmethod.Thetwomethodsthusobtainedgiveagoodclueforexplicitlydefiningthebitdepthsofrespectivescoresforthefutureapplicationsandstandardization.

7866-34, Session 9

Spatio-temporal colour correction of strongly degraded moviesA.B.M.T.Islam,I.Farup,GjøvikUniv.College(Norway)

Wepresentamethodfordigitalcolourrestorationofstronglydegradedmoviematerial.ThemethodisbasedupontheexistingSTRESSalgorithm.Inordertocopewiththeproblemofhighlycorrelatedcolourchannels,weimplementedapreprocessingstepinwhichsaturationenhancementisperformedinaPCAspace.Spatialcolouralgorithmstendstoemphasisealldetailsintheimages,includingdustandscratches.Surprisingly,wefoundthatthepresenceofthesedefectsdoesnotaffectthebehaviourofthecolourcorrectionalgorithm.AlthoughtheSTRESSalgorithmisalreadyinitselfmoreefficientthantraditionalspatialcolouralgorithms,itisstillcomputationallyexpensive.Tospeeditupfurther,wewentbeyondthespatialdomainoftheframesandextendedthealgorithmtothetemporaldomain.Thisway,wewereabletoachievean80percentreductionofthecomputationaltimecomparedtoprocessingeverysingleframeindividually.Weperformedauserexperiment,andfoundthatourmethodproducessignificantlybetterresultsthantheexistingmethods.Thus,ourmethodoutperformstheexistingonesintermsofbothvisualqualityandcomputationalefficiency.

7866-35, Session 9

Color correction optimization with hue regularizationH.Zhang,H.Liu,OregonStateUniv.(UnitedStates);S.Quan,BroadcomCorp.(UnitedStates)

Previousworkhassuggestedthatobserversarecapableofjudgingthequalityofanimagewithoutanyknowledgeoftheoriginalscene.

Whenreferenceisnotavailable,observersextracttheapparentobjectsinanimageandcomparethemwithtypicalcolorsofsimilarobjectsrecalledfromtheirmemories.Somegenerallyagreedresearchresultsindicatethatalthoughperfectcolorimetricrenderingisnotconspicuousandcolorerrorscanbewelltolerated,observersdoperceivesomememorycolorssuchasskin,grass,andskyfairlyconsistentlyandrememberthemwithslightlydifferenthuesandhighersaturationthantheiroriginals,withspecificpreferences.Appropriaterenditionofthesecolorsisnecessaryandcontributesheavilytotheoverallperceivedimagequality.

Acolorcorrectionmatrixisthetransformationconvertingtheimagedatafromadevicedependentcolorspacetoatargetcolorspace.Colorcorrectionmatrixcanbeobtainedthroughlinearregressionbetweenthetwocolorspaces,minimizingthemeansquareEuclideanerrorincertaincolorimetriccoordinates.Unfortunately,thismethodcouldresultinobjectionabledistortionsifthecolorerrorsbiasedmemorycolorsundesirably.

Weproposeacolorcorrectionoptimizationmethodwithpreferredcolorreproductioninmindthroughhueregularization.Preferredcolorreproductionwillbereviewed,andanoptimizationmethodwillbeproposedusingrecursivelinearregressionwithadditionalconstraintsforhueregularization.

7866-36, Session 10

Spectral model of an electro-photographic printing systemM.A.Kriss,MAKConsulting(UnitedStates)

AtEI2007inSanJose,Californiaadetailedphysicalmodelsformonochromeandcolorelectro-photographicprinterswaspresented.Thesemodelswerebasedoncomputersimulationsoftoner-dotformationforavarietyofhalftonestructures.Theopticalinteractionsbetweenthetoner-dotsandthepapersubstratewereincorporatedbymeansofanopticalscatteringfunction,whichallowedforthecalculationofopticaldot-gain(andphysicaldot-gain)asfunctionofthehalftonestructure.Thecolormodelusedsimplered-green-bluechannelstomeasuretheeffectoftheabsorptionandscatteringpropertiesofthecyan,magenta,yellowandblacktonersonthefinalhalf-toneimage.ThenewspectralmodelusesthefullabsorptionandscatteringspectrumoftheimagetonersincalculatingthefinalcolorimageintermsofCIEXYZvaluesforwell-definedcolorandgraypatches.Thenewspectralmodelwillbeusedtoshowtheimpactofhalftonestructureandtoner-layer-orderonconventionaldot-on-dotandrotateddotcolorhalftonesystemsandhowtominimizetheimpactofimagetonerscattering.ThemodelhasbeenexpandedtousetheNeugebauerequationstoapproximatetheamountofcyan,magenta,andyellowtonersrequiredtogivea“good”neutralintherotateddothalftoneandfinetuningisachievedbyadjustingthedevelopmentthresholdlevelforeachlayertoholdagoodneutraloverthefulltonalrange.Oncea“good”neutralisobtainedtheimpactondotgain,colorreproductionandoptimumlayerordercanstudiedwithanemphasisonhowthefullspectralmodeldiffersfromthesimplerthree-channelmodel.Themodelisusedtoexplorethedifferentapproachesrequiredindot-on-dotandrotateddotscreenstoachievegoodresults.Inthefuturethemodelwillbeappliedtostochasticscreens.

7866-37, Session 10

Optimized selection of image tiles for ink spreading calibrationT.Bugnon,R.D.Hersch,EcolePolytechniqueFédéraledeLausanne(Switzerland)

TheYule-NielsenmodifiedspectralNeugebauermodel(YNSN)enablespredictingreflectancespectrafromsurfacecoverages.Inordertoprovideanimprovedpredictionaccuracy,thismodelisenhancedwithaninkspreadingmodelaccountingforinkspreadinginallsuperpositionconditions(IS-YNSN).Asanyspectralreflectionpredictionmodel,theIS-YNSNmodelisdesignedtopredictthereflectionspectraofuniformpatches.Insteadofuniformpatches,



IS&T /

ReturntoContents

weinvestigateiftileslocatedwithincolorimagescanbeaccuratelypredictedandhowtheycanbeusedtofacilitatethecalibrationoftheinkspreadingmodel.Inthepresentcontribution,wefirstdetailanalgorithmtoautomaticallyselectimagetilesbaseduniquelyontheCMYorCMYKpixelvaluesofthesecolorimagesandshowthatsuchimagetilescanbeaccuratelypredictedbytheIS-YNSNmodelprovidedthattheyareuniformenough.Thisselectionalgorithmincorporatesadditionalconstraintsandisverifiedon6differentcolorimages.Wefinallydemonstratethattheinkspreadingmodelcanbecalibratedwithasfewas5to10imagetilesprovidedthattheimagetilesarechosenbyapplyingtheproposedadditionalconstraints.

7866-38, Session 10

A preferred skin color enhancement method for photographic color reproductionH.Zeng,Hewlett-PackardCo.(UnitedStates);M.R.Luo,Univ.ofLeeds(UnitedKingdom)

Skintonesarethemostimportantcolorsamongthememorycolorcategory.Reproducingskincolorspleasinglyisanimportantfactorinphotographiccolorreproduction.Movingskincolorstowardtheirpreferredskincolorcenterimprovesthecolorpreferenceofskincolorreproduction.Severalmethodstomorphskincolorstoasmallerpreferredskincolorregionhasbeenreportedinthepast.Inthispaper,anewapproachisproposedtofurtherimprovetheresultofskincolorenhancement.Anellipsoidskincolormodelisappliedtocomputeskincolorprobabilitiesforskincolordetectionandtodetermineaweightforskincoloradjustment.Preferredskincolorcentersdeterminedthroughpsychophysicalexperimentswereusedforcoloradjustment.Preferredskincolorcentersfordark,medium,andlightskincolorcategoriesareappliedtoadjustskincolorsdifferently.Skincolorsaremorphedtowardtheirpreferredcolorcenterinbothchromaandhueangle.Aspecialprocessingforhighlightskincolorsisappliedtoavoidcontrastlossinhighlight.A3-Dinterpolationmethodisappliedtofixapotentialcontouringproblemandtoincreasecolorprocessingefficiency.Aninitialpsychophysicalexperimentvalidatesthatthemethodofpreferredskincolorenhancementeffectivelyidentifiesskincolors,improvestheskincolorpreference,anddoesnotobjectionablyaffectpreferredskincolorsinoriginalimages.

7866-39, Session 10

Kubelka-Munk theory for efficient spectral printer modelingM.A.Abebe,J.Gerhardt,J.Y.Hardeberg,GjøvikUniv.College(Norway)

Inspectralcolorreproduction,wereproduceacolorbasedonitsspectralreflectanceratherthanonitscolorimetricvalues.Thishasthepotentialofincreasingthecolourfidelityqualityofthereproductionunderdifferentilluminationconditionsandgiveshighergeberalcoloraccuracy.

Inourworkweparticularlyfocusonthequalityofspectralcolorimagereproductionbymulti-channelinkjetprinting;akeyelementofthisprocessistoaccuratelymodelthecolorimetricandspectralbehavioroftheprinter.ForinstancetheYule-NielsenmodifiedspectralNeugebauer(YNSN)modelisaverymuchusedspectralprintermodel.Inthismodelthespectralreflectanceofthedifferentcolorantcombinationsisestimatedasaconvexcombinationofthereflectancesoftheso-calledNeugebauerPrimaries(NP),whicharetheprimaries,secondaries,tertiaries,etc.Inordertosetupsuchamodelforaprintingsystemwithahighnumberofcolorantchannels,manycolorpatchesmustbemeasured;infactthenecessarynumberofpatchesincreasesexponentiallywiththenumberofchannels,thisisacostlyandtime-consumingtask.AnotherproblemwiththisapproachisthatforNPsofmorethan3colorants,limitationsofpapertotalinkcoveragestarttoplayanimportantrole.

WeproposetouseKubelka-MunktheorytoestimatethespectralreflectancesoftheNeugebauerPrimariesinsteadofprintingand

measuringthem,andsubsequentlytousetheseestimatedNPsasthebasisofourYNSNprintermodeling.WehaveevaluatedthisapproachexperimentallyonseveraldifferentpapertypesandontheHPDeskjet1220CCMYKinkjetprinterandtheXeroxPhaser7760CMYKlaserprinter,usingboththeconventionalspectralNeugebauer(SN)modelandtheYNSNmodel.

Usingthisapproachwefindthatweachievenotonlylesstimeconsumingmodelestablishment,butalso,somewhatunexpectedly,improvedmodelprecisionoverthemodelsusingtherealmeasurementsoftheNPs.WehavealsoinvestigatedahybridmodelwithmixedNPs,halfmeasuredandhalfestimated.

OurresultsshowusthereasonabilityoftheKubelka-Munktheoryforspectralprintermodeling.Theresultsdifferfromprintertoprinterandfrompapertopaper.ThespectralestimationsofbothSNandYNSNmodelsinlaserprintersperformmuchbetterthanforinkjetprinters.Wealsoseethatusingsimpleandverycheapcopypaperwillgiveusevenbetterperformancesthanusingsomeexpensivephotopapers.Thepaperpropertywhichseemstobethemostimportantfactorforthequalityofthemodelistheopacity,thehighertheopacitythehighertheperformanceoftheKMtheorywillbe.

7866-40, Session 11

A simple color prediction model based on multiple dot gain curvesY.Y.Qu,S.Gooran,LinköpingUniv.(Sweden)

Summary:Mostofthecolorpredictionmodelsuseasingledotgaincurveforeachink,fewmentiondotgainchangesindifferentinksuperpositionsituations,butstilltheyfigureonlyonedotgaincurveforeachpossiblecombinationsituation.

Inthispaperwepresentasimplecolorpredictionmodel.InthismodelweutilizethreedifferentdotgaincurvesforeachprimaryinkobtainedbyCIEX,YandZ,whichapproximatelystandforthreedifferentwavelengthbands.

Inaddition,wenoticedthatthedotgaincurvesforsingleinkprintedonpaperarenotrepresentingthedotgainforthesameinkwhenprintedonanotherink.Therefore,dotgaincurvesfordifferentinkoverlappingsituationsareoptimizedbymatchingcalculationofspecialtrainingpatchestothecorrespondingmeasuredtristimulusvalues.Regardingcertaininkcombination,foreachinkwefinallyfigureoutthreedotgaincurves,eachofwhichisacombinationofdifferentdotgaincurvesthatareweightedbasedontheirprobability.

Ourworkpresentsafeasiblecolorpredictionmodelconsideringbothinksuperpositionandlightwavelengthinfluence.

7866-41, Session 11

Subsampled optimal noise management method for a robust separation based calibration of color printing systemsM.Qiao,J.M.Sanchez,Y.Chen,I.Case,G.Lin,XeroxCorp.(UnitedStates)

Formanycolorprintingsystems,printercalibrationisoftenutilizedtoreturntheprintertoaknownstatetoensureconsistentcoloroutput.Inparticular,thekeyvisualresponseofcolorbalanceisoftencontrolledbythecalibrationstatereturn.Inputcolorsignalnoise,generatedfromtheprintingsystemnaturalvariationwhenprintingthecalibrationtarget,affectstheaccuracyandrobustnessofthecalibrationoutcome.Noisemanagementtechniquesformanaginginputcolorsignalnoisepriortosystemcalibrationareoftenabsentorrelyonadhocanalysisandareusuallynotbasedonthereturnofawelldevelopedprinterresponsethathasbeenextractedfrommeasuredsignalusingadvancednoisemanagementmethods.ThispaperdescribesPartIIofanoverallmethodfordevelopingarobustnoisemanagementsystemforprintercalibration.InPartI,an8-bitfullresolutioncalibrationtargetisdescribedandaniterativefilteringnoisemanagementmetricandmethodaredefinedanddeveloped.InthisPartII,thespecific



IS&T /

ReturntoContents

developmentofalowresolutioncalibrationtargetandcorrespondingnoisefreerepresentationoftheprintersystemstate,asdefinedbyquantitativemetricsrelativetotheprinterresponsederivedfromhighresolutionsignalinPartIisdefinedanddeveloped.Thissubsampledcalibrationtargetusingtheproposednoisemanagementmethodcanincreasetheproductivityandreduceoperatorerrorinprintshopworkflowwithminimallossofaccuracy.

7866-42, Session 11

Investigating the wavelength dependency of dot gain in color printM.Namedanian,S.Gooran,D.Nyström,LinköpingUniv.(Sweden)

Fullycharacterizingphysicalandopticaldotgainisusefulforsystemcalibrationandqualitycontrolofthecolorreproduction.Themainpurposeofthispaperistostudytheopticaldotgainbehaviorofcolorprints.Wepresentanapproachtoseparateopticalandphysicaldotgainbyusingmicroscopicimages.Ahighresolutioncamera(2µm/pixel)equippedwithasetofsevenbroadbandinterferencefiltersilluminatinglightin400nmto700nmwavelengthband,isused.Thecamerasystemcancapturebothreflectedandtransmittedimages.Thisphenomenonhasbeenusedtoseparatetheopticalandphysicaldotgain.Byusingtheseveninterferencefiltersthewavelengthdependencyofopticaldotgainhasbeenstudied.Asblackinkhastheabsorbingwavelengthbandof400nmto700nm,itsopticaldotgainindifferentwavelengthshasbeeninvestigatedandtheresultsshowthatlightscatteringofthepaperiswavelengthindependent.ThismeansthatPointSpreadFunction(PSF)isindependentoftheink.ByusingthePSFandthephysicaldotshapesofacolorhalftonedimage,itispossibletopredicttheresultingcolorincludingtheeffectofopticaldotgain.

7866-43, Session 11

Fast approach for toner savingI.V.Safonov,I.V.Kurilin,M.N.Rychagov,SamsungElectronicsCo.,Ltd.(RussianFederation);H.K.Lee,S.H.Kim,D.Choi,SamsungElectronicsCo.,Ltd.(Korea,Republicof)

Savingoftonerconsumptionisanimportanttaskinmodernprintingdevicesandhasasignificantecologicalimpact.Existingtonersavingapproacheshavetwomaindrawbacks:appearanceofhardcopyintonersavingmodeisworseincomparisonwithnormalmode;processingofwholerenderedpagebitmaprequiressignificantcomputationalcosts.

Weproposetoaddsmallholesofvariousshapesandsizestorandomplacesinsideacharacterbitmapduringfontrenderingbeforestoringcharacterbitmaptofontcache.ThisschemeisbasedonprocessingpipelineinRIPofstandardprintinglanguagesPostscriptandPCL.Processingoftextcharactersonly,andmoreover,processingofeachcharacterforgivenfontandsizealone,isanextremelyfastprocedure.Theapproachdoesnotdeterioratehalftonedbitmapandbusinessgraphicsandprovidetonersavingfortypicalofficedocumentsupto15-20%.Rateoftonersavingisadjustable.

Appearanceofcharactersisalmostindistinguishableincomparisonwithsolidblacktextduetorandomplacementofsmallholesinsidethecharacterregions.Thesuggestedmethodautomaticallyproducesnotonersavingonsmallfonts,sincepreservesqualityofsmallfonts.Readabilityoftextprocessedbyproposedmethodisfine.OCRprogramsprocessthatscannedhardcopysuccessfullytoo.

7866-44, Session 11

A virtual printer and reference printing conditionsP.Green,LondonCollegeofCommunication(UnitedKingdom);

C.Revie,FFEIUK(UnitedKingdom);D.McDowell,EastmanKodakCo.(UnitedStates)

Inalatebindingworkflow,dataiscommonlypreparedinanoutput-referredstatebasedonareferenceintermediateRGBcolourencoding.Suchencodingsmayhavealargergamutthanthetargetprintingcondition,andsothereissomeambiguityoverhowtopreviewthedatabeforeithasbeenconvertedtothetargetprintingcondition.

Hereweproposeanadditionalintermediateencoding,referredtoasa‘virtualprinter’whichbridgesthegapbetweenthree-componentreferenceRGBorPCSencodings,andreferenceCMYKprintingconditions.

Thevirtualprinterhasalargecolourgamutwhichrepresentsasupersetofmostavailableprintgamuts.Itisdefinedhereintermsofthereflectanceandcolorimetriccoordinatesofthevirtualcolorants,andassociatedcolourmixingmodel.

Whenusedinacolourreproductionworkflow,documentscanbeinitiallyrenderedtotheprinter-likegamutofthevirtualprinter,andchannelpreferences(suchasblackgeneration)canbedefined.Re-renderingtoareferenceprintingconditionandassociatedcolourgamutisdeferred,thussupportingre-purposingofthedocument.

BytransformingacolourdocumenttovirtualprinterCMYorCMYK,itispossibletoperformediting,previewandchannelspecificationoperationspriortore-renderingtoareferenceprintingcondition.Inconjunctionwiththereferenceprinter,whosecolourgamutislimitedtoaspecificprintingprocess,thevirtualprinterprovidesrobustsupportforlatebindingworkflowsinthegraphicarts.

7866-45, Session 12

Cost function analysis for stochastic clustered-dot halftoning based on direct binary searchP.Goyal,M.Gupta,PurdueUniv.(UnitedStates);C.Staelin,M.Fischer,Hewlett-PackardLabs.IsraelLtd.(Israel);O.Shacham,Hewlett-PackardIndigoLtd.(Israel);J.P.Allebach,PurdueUniv.(UnitedStates)

Noabstractavailable

7866-46, Session 12

Stochastic clustered-dot screen design for improved smoothnessM.Gupta,P.Goyal,PurdueUniv.(UnitedStates);M.Fischer,C.Staelin,Hewlett-PackardLabs.IsraelLtd.(Israel);T.Kashti,O.Shacham,Hewlett-PackardIndigoLtd.(Israel);J.P.Allebach,PurdueUniv.(UnitedStates)

Noabstractavailable

7866-47, Session 12

Design of color screen tile vector setsJ.Kim,Y.Chen,PurdueUniv.(UnitedStates);M.Fischer,Hewlett-PackardLabs.IsraelLtd.(Israel);O.Shacham,Hewlett-PackardIndigoLtd.(Israel);C.Staelin,Hewlett-PackardLabs.IsraelLtd.(Israel);K.Bengtson,Hewlett-PackardCo.(UnitedStates);J.P.Allebach,PurdueUniv.(UnitedStates)

Noabstractavailable



IS&T /

ReturntoContents

7866-48, Session 12

UV Fluorescent Encoded Image Using Two Successive Filling Halftone AlgorithmsY.Zhao,S.Wang,XeroxCorp.(UnitedStates)

Methodsareprovidedforcreatingafluorescentwatermarkwithinanimageonasubstrate,suchaspaper.Themethodinvolvescreatingahalftoneimageusingtwodifferenthalftonestrategies.Thehalftonemethodiscombinedwithabinarywatermarkmasktoformtwocolorpatterns(e.g.,oneinabackgroundregionoftheimageandoneinawatermarkregionoftheimage)andtwosuccessive-fillinghalftonealgorithms,suchthattheinkdropletsdepositedbyonecolorpatterncovermoreofthesubstratethantheinkdropletsdepositedbytheothercolorpattern,withthetwocolorpatternshavingapproximatelythesamereflectanceundernormallight.However,underUVillumination,avisibledifference(e.g.,thewatermark)isseeninthetwopatterns.

7866-49, Session 12

Moire-free color halftoning using hexagonal geometryR.P.Loce,S.Wang,XeroxCorp.(UnitedStates)

Noabstractavailable

7866-50, Session 13

A hybrid adaptive thresholding method for text with halftone pattern in scanned document imagesS.Yu,W.Ming,KonicaMinoltaSystemsLabs.,Inc.(UnitedStates)

Inthispaper,ahybridadaptivethresholdingmethodforscanneddocumentimagescontainingtextwithhalftonepatternispresented.Themethodisbasedonthetopologicalfeatureandgraylevelstatisticsofhalftonetext.Histogramonlybasedthresholdingmethodsoftenmisssomehalftonetextafterbinarization,especiallywithclosetobackgroundgraylevelhalftonetext.Theproposedmethoddividesthedocumentimageintononoverlapwindowsandextractstextcharactersasconnectcomponentineachwindow.TheEulernumberofeachtextcharacteristhencalculatedandusedastopologicalfeaturestoidentifyhalftonetext.Aftermostthehalftonetextareidentifiedineachwindow,thepixelvaluestatisticsofthehalftonetextareestimated.Halftonetextarefirstsegmentedoutbyusingthethresholddeterminedbytheirpixelvaluestatistics.Thenaglobalthresholdiscalculatedfortheremainingpixelsintheimagetosegmentoutdarktext.Thefinalbinarizationresultofthedocumentimageisobtainedbycombiningthebinarizationresultsofthehalftonetextanddarktext.Comparingtomethodsbasedonlyonhistogram,satisfiedbinarizationresultsareobtainedwhentestingtheproposedmethodonscanneddocumentimagescontaininghalftonetext.

7866-51, Session 13

Window-based spectral analysis of periodic color halftone screensA.H.Eid,B.E.Cooper,E.E.Rippetoe,LexmarkInternational,Inc.(UnitedStates)

Improperdesignofcolorhalftonescreensmaycreatevisuallyobjectionablemoirépatternsinthefinalprintsduetotheinteractionbetweenthehalftonescreensofthecolorprimaries.Thepredictionofsuchinteractionsfromthescreens’bitmapshelpstoidentifyandavoidproblematicpatterns,reducingthetimerequiredtodesigneffective

colorhalftonescreens.

Inthispaper,wedetectthemoirépatternsbyexaminingthespatialfrequencyspectrumofthemixedscreens.WestudydifferentwindowingtechniquesincludingHamming,Hanning,andBlackman,tobetterestimatethemoiréstrength,frequencyandorientation.Thewindow-basedspectralestimationhastheadvantageofreducingtheeffectofspectralleakageassociatedwiththenon-windoweddiscretesignals.

Twomethodsareusedtoverifythedetectedmoiréfromthebitmaps.First,weanalyzescansoftheprintedhalftones,usingthesametechniquethatweappliedtothebitmaps.Second,weindependentlyinspecttheprintedhalftonesvisually.Ourexperimentsshowpromisingresultsbydetectingthemoirépatternsfromboththebitmapimagesaswellasthescansoftheactualprintsverifiedbyvisualinspection.

7866-52, Session 13

Descreening of color halftone images in the frequency domainC.J.Stanger,T.Tran,E.H.BarneySmith,BoiseStateUniv.(UnitedStates)

Colorprintersusethreecolors,cyan,magenta,andyellow,andsometimesblackwhenprintingcolorimages.Awiderangeofcolorsiscreatedbyvaryingthesizeofthedotsandtherelativecontributionofthecolorprimaries.Theperceivedcolordependsonthesizeofthedotsandthedifferentcombinationsofthethreecolorsatdifferentangles.Thistypeofprintingiscalledhalftoning.Scanningahalftoneimageintroduceshalftoneartifacts,knownasMoirépatterns,whichsignificantlydegradetheimagequality.Printersthatuseamplitudemodulation(AM)screeningforhalftoneprintingpositiondotsinaperiodicpattern.Therefore,frequenciesrelatingtohalftoningareeasilyidentifiableinthefrequencydomain.Thispaperproposesamethodfordescreeningscannedcolorhalftoneimagesusingacustombandrejectfilterdesignedtoisolateandremoveonlyfrequenciesrelatedtohalftoningwhileleavingimageedgessharpwithoutimagesegmentationoredgedetection.Toenablehardwareacceleration,theimageisprocessedinsmalloverlappedwindows.Thewindowsarefilteredindividuallyinthefrequencydomain,thenpiecedbacktogethertoshowtheentirefilteredimagewithoutnoticeableblockingartifacts.

7866-53, Session 13

Analog image backup with steganographic halftonesR.A.Ulichney,I.Tastl,E.Hoarau,Hewlett-PackardLabs.(UnitedStates)

Hardcopy(analog)backupofphotographsisanimportantalternativetodigitalstorage.Itoffersameanstovisuallyenjoythe“storageformat”decoupledfromadigitalstoragemediawhichcanhaveashorterarchivallifethanhardcopy,alongwithshorterlifetimeofhardwaresupport.Thepaperdescribesameanstoeliminatetheneedtoincludeunsightlytextthatispartofearliersolutionsbyembeddingallrequiredmetadatainasmallsteganographichalftonewiththeprint.Theonlyhardwarerequirementisanimagescanner,whichwecansafelyassumewillbeavailablefarintothefuturewhenreadersoftoday’sdigitalstoragemediawillbelonggone.Examplesoftheresultingarchivalcompositionsandmetadata-embeddedhalftoneswillbeincluded.



IS&T /

ReturntoContents

Conference 7867: Image Quality and System Performance VIIIMonday-Wednesday24-26January2011PartofProceedingsofSPIEVol.7867ImageQualityandSystemPerformanceVIII

7867-01, Session 1

Image quality metrics for the evaluation of print qualityM.Pedersen,J.Y.Hardeberg,GjøvikUniv.College(Norway);N.Bonnier,OcéPrintLogicTechnologies(France);F.Albregtsen,Univ.ofOslo(Norway)

Imagequalitymetricshavebecomemoreandmorepopularintheimageprocessingcommunity.Manydifferentimagequalitymetricshavebeenproposed,oftenwiththegoalofbeingabletopredictperceivedimagequality.

However,sofar,noonehasbeenabletodefineanimagequalitymetricwellcorrelatedwiththeperceptforoverallimagequality.

Inourresearchwehavepresentedasetofqualityattributesbuiltonexistingattributesfromtheliterature.Thesixproposedqualityattributesare:sharpness,color,lightness,artifacts,contrast,andphysical.Anexperimentvalidatesthequalityattributesassuitablefortheevaluationofimagequality.

Wehavethenproposedtouseimagequalitymetricsforeachqualityattributesinordertopredictperceivedimagequality.Aselectionofsuitableofimagequalitymetricsforthedifferentqualityattributeshasbeencarriedout.

Eachofthequalityattributeshasbeeninvestigated,andanexperimentalanalysishasbeencarriedouttofindthemostsuitableimagequalitymetricsforeachofthegivenqualityattributes

Theprocessofapplyingimagequalitymetricstoprintedimagesisnotstraightforwardasimagequalitymetricsrequireadigitalinput.Theprintedimagesneedtobetransformedfromphysicalcopiestodigitalcopiesinordertoapplymetrics.Aframeworkhasbeendevelopedforthisprocess,whichincludesthetransformationtoadigitalformat,imageregistration,andtheapplicationofimagequalitymetrics.

Theresultsindicatethatimagequalitymetricscancorrelatewiththeperceptforcertainqualityattributes,buttheyarenotcorrelatedwithoverallperceivedimagequality.Thereforetheuseofqualityattributestogetherwithimagequalitymetricsisinteresting,andverypromising.

7867-02, Session 1

Hyper error map based document stitchingL.C.Cui,LexmarkInternational,Inc.(UnitedStates)

Documentstitchingcangenerateunpleasantstitchingartifacts.Hereweexamineonetypeofstitchingartifacts,themisalignedfeaturesandedges,andproposeamethodtominimizethatbasedononeperformancecharacteristicsofthehumanvision,thehyperacuity.

7867-03, Session 1

Quantification of perceived macro-uniformityK.Lee,Y.Bang,H.Choh,SamsungElectronicsCo.,Ltd.(Korea,Republicof)

INCITSW1.1teamdefinedmacro-uniformity,categorizeditintofivetypesofsub-attributes,andattemptedtoevaluateitbasedonqualityrulermethod.Qualityrulermethodiseasilyusedfortheendusertojudgethelevelofprintdefect.However,theprecisequantificationmethodwithoutrulerimagesismorehelpfultodevelopertoanalyzeprintingsystemcomponentsaffectingprintuniformityandtocommunicatebetweendevelopers.

Inthispaper,weproposeamethodtoquantifyperceivedmacro-

uniformityforagiventestprint.Wesupposethatmacro-uniformityisperceivedbyblendingfourkindsofsub-attributes:banding,streaking,2Dnoise,andgradient.Eachattributeisindependentlymeasuredbythedevelopedmethod.Themeasuredvaluesareconvertedtothesamevisualscaleusingbythesubjectiveresult.Thescoreofmacro-uniformityisdeterminedbytheweightedsumofeachmeasuredvalue.Thoughthesubjectivetest,wemakesuretheperformanceofthespecificmethodformeasuringsub-attributesofmacro-uniformity.Correlationsbetweenthespecificmethods(banding/streaking,2Dnoise,andGradient)andsubjectivescoreare0.92,0.97,and0.86,respectively.Weobtainthecorrelationbetweentheproposedmethodtoquantifyoveralluniformityandsubjectivescoreis0.94.

7867-04, Session 1

Current practices in art image reproduction: image quality experimentationS.P.Farnand,RochesterInstituteofTechnology(UnitedStates)

Aproject,supportedbytheAndrewW.MellonFoundation,iscurrentlyunderwaytoevaluatecurrentpracticesinfineartimagereproduction,determinetheimagequalitygenerallyachievable,andestablishasuggestedframeworkforartimageinterchange.Todeterminetheimagequalitycurrentlybeingachieved,experimentationhasbeenconductedinwhichasetofobjectivetargetsandpiecesofartworkinvariousmediawereimagedbyparticipatingmuseumsandotherculturalheritageinstitutions.Printfilesandguideprints,iftheseareusedintheinstitution’stypicalworkflow,weredeliveredtotheRochesterInstituteofTechnologywhereprintsweremadeonthesameHeidelbergSpeedmasterpressrunbythesameoperatorandusingthesameinksandpaperthroughout.Theresultingprintswereusedasstimuliinpsychometrictestingtogeneratescalesofimagequality.Inthistesting,twenty-fourobserverswereaskedtoranktheprintsrelativetotheoriginalartwork.Theyareaskedspecificallytodeterminewhichprintswerethebestreproductionsoftheoriginalartwork,asopposedtothemostpersonallypleasingimages.Theresultsindicatedthatcertainoftheworkflowsprovidedmoreconsistentlyaccuratereproductionswhilecertainotherworkflowsconsistentlyfellshort.Theexperimentalresultswillbeamongtheinputsusedtoconstructaconceptualframeworkofthevarioustypesofimagingtakingplaceinculturalinstitutionsatpresent.Basedonthisframework,animageprocessingtoolthatincorporatesappearancemodelsthatareadequateforthevariousworkingenvironmentsinculturalheritageinstitutionswillbedeveloped.

7867-05, Session 1

Using metrics to assess the ICC perceptual rendering intentK.R.Falkenstern,N.Bonnier,OcéPrintLogicTechnologies(France);H.Brettel,TelecomParisTech(France);F.Viénot,Muséumnationald’Histoirenaturelle(France)

Increasedinterestincolormanagementhasresultedinmoreoptionsfortheusertochoosebetweenfortheircolormanagementneeds.WeproposeanevaluationprocessthatusesmetricstoassessthequalityofICCprofiles,specificallyfortheperceptualrenderingintent.Theprimaryobjectiveoftheperceptualrenderingintent,unlikethemedia-relativeintent,isapreferredreproductionratherthananexactmatch.ProfilevendorscommonlyquoteaCIEDE*abcolordifferencetodefinethequalityofaprofile.

Withtheperceptualrenderingintent,thismayormaynotcorrelatetothepreferredreproduction.

Forthisworkwecompiledacomprehensivelistofqualityaspects,


IS&T /

ReturntoContents

usedtoevaluatetheperceptualrenderingintentofanICCprinterprofile.Theaspectsareusedastoolstoindividuallyjudgethedifferentqualitiesthatdefinetheoverallstrengthofprofiles.Theproposedworkflowusesmetricstoassesseachaspectanddeliversarelativecomparisonbetweendifferentprinterprofileoptions.Theaimoftheresearchistoimprovethecurrentmethodsusedtoevaluateaprinterprofile,whilereducingtheamountoftimerequired.

7867-06, Session 2

Development of perceptually calibrated objective metrics of noiseE.W.Jin,B.W.Keelan,S.F.Prokushkin,AptinaImagingCorp.(UnitedStates)

Thisstudyaimsatdevelopinganoisemetricwithatransformtojustnoticeabledifferences(JNDs)ofqualityinpictorialscenes.Suchaperceptuallycalibratednoisemetricisparticularlyvaluableforcomparingtheimpactofnoisewiththatofotherattributesandforcomputingoverallimagequality.Asystemsimulationmodelwasusedtocreatescene-dependentnoisemasksthatreflecttheperformanceoftoday’smobilecapturedevices.Sampleswithdifferentoverallmagnitudesofnoiseandwithvaryingmixturesofred,green,blue,luminance,andchrominancenoiseswereincludedinthestudy.Eleventreatmentsineachoftenpictorialsceneswereevaluatedbytwentyobserversusingasoftcopyrulermethod.Themostgeneralandbest-performingmetrictestedinvolvedintegratingthesystemnoisepowerspectraoveravisualfrequencyweightingfunction,andcombiningthecovariancesobtainedwithempiricalcoefficients.InCIELABspace,inclusionofanormallynegativeL*a*covarianceinadditiontoL*anda*variancesimprovedthepredictivenesssignificantly(b*variancewasfoundtocontributelittle).TesttargetsinlinearsRGBandrenderedL*a*b*spacesforeachtreatmentwillbemadeavailabletoenableotherresearcherstotestmetricsoftheirowndesignandcalibratethemtoJNDsofqualitywithoutperformingadditionalperceptualexperiments.

7867-07, Session 2

Perceptually relevant evaluation of noise power spectra in adaptive pictorial systemsB.W.Keelan,R.B.Jenkin,AptinaImagingCorp.(UnitedStates)

NoisePowerSpectra(NPS)aretraditionallymeasuredusinguniformareasoftone.Adaptivealgorithms,suchasnoisereduction,demosaicing,andsharpening,canmodifytheirbehaviorbasedonunderlyingimagestructure.Inparticular,noisereductionalgorithmsmaysuppressnoisemorestronglyinperfectlyuniformareasthantheywouldinareaswithmodestvariations,asfoundinactualpictorialimages,andsoyieldunrepresentativeNPS.Thisphenomenonwouldbesimilarinnaturetothesusceptibilityofhigh-contrast-edgestoadaptivesharpeningandthesubsequentover-estimationofeffectivepictorialmodulationtransferfunction.ExperimentationisdescribedthatexaminestheeffectofmodernadaptivenoisereductionalgorithmsontheNPSofimagescontainingvaryinggradients.Gradientsarechosenbasedonasurveyofconsumerimagesfromareaswherenoiseistypicallynoticeable,suchasbluesky,wallsandfaces.Althoughslightlossinperformanceofadaptivenoisereductionisobservedasgradientsincrease,theeffectisperceptuallysmallatgradientsofrelevanceinpictorialimaging.Thesignificantadditionalcomplexityofmeasuringgradient-basedNPSatanumberofmeanlevelsdoesnotappeartobejustified;measuringNPSfromuniformareasoftoneshouldsufficeformostperceptualwork.

7867-08, Session 2

A novel perceptual image quality measure for block based image compressionT.Shoham,D.Gill,S.Carmel,ICVTLtd.(Israel)

Reliable,lowcomplexity,automaticperceptualevaluationofimagequalitystillremainsanopenchallenge.Specifically,evaluationofimagequalitythatundergoesrecompressionusingablock-basedscheme,suchasJPEG,isanimportantenablerforautomatic,perceptuallylossless,imagerecompression.

Weproposeanovelimagequalitymeasurethatanswersthisneed.Theproposedqualitymeasurecombines3metricstoobtainascoreintherange0-1,where1correspondstoidenticalimages.Thefirstcomponentevaluatestheaverage,per-pixel,distortion.Thesecondcomponentmeasurestheextentofblockinessaddedbythecompressionprocess.Thethirdcomponentmeasuresthetexturedistortionineach4x4pixelblock.Thesemetricsarethenpooledintoasinglescoreusingaweightedgeometricaverage.Theimageisdividedintotiles,whosesizedependsonimageresolution.Theproposedscoreiscalculatedforeachimagetile,andmaybeusedforevaluationoflocalquality.Thetilesscorescanalsobepooledintoasingleimagequalityscore,byaveragingthelowesttilescorewiththeaveragescoreoveralltileswith,thusmimickinghumanperception.

Theproposedqualityscorehasbeensuccessfullyintegratedintoareal-time,automatic,perceptuallylossless,JPEGrecompressionsystem.

7867-09, Session 2

A metric for predicting preferred coring level to reduce toner scatter in electrophotographic printingH.J.Park,SamsungElectronicsCo.,Ltd.(Korea,Republicof);J.P.Allebach,PurdueUniv.(UnitedStates)

Asmoderntechnologydevelopsveryrapidly,weareswampedwithmanynewcutting-edgeproductsincludingimagingapplications.Tosurviveinafiercelycompetitiveenvironment,imagequalityplaysanimportantroleinimagingproducts.Sharpnessthatdescribestheclarityofdetailsandedgetransitionofanimageisoneofconcernsintheimagingproductsinceitaffectsoverallperceptionontheimage.Inspiteofconvenienceofphysicalassessment,itshouldbenoticedthattheperceiveddatafromahumanviewercannotbereplacedwiththephysicalmeasureddata,intermsofacceptabilitytotheend-useroftheimagingproduct.Therefore,themeasureddatabyphysicalassessmentshouldbecorrelatedwiththeperceiveddatabypsychophysicalassessment.

Inthispaper,weexploitthepreviouslypublishedpsychophysicaldatabasedonperceivedtonerscatter.Forphysicaldata,weapplytheEdgeTransitionWidth(ETW)betweenT90andT10boundariesasalinemetrictomeasuresharpnessandtheWeightedDifferentialTonerScatter(WDTS)withintheregionbetweenT60andT05boundariestomeasurethespreadofdifferentialtonerscatter.Utilizingbothperceivedandmeasureddata,wepredictpreferredcoringleveltoreducetonerscatterinelectrophotographic(EP)printing.

7867-11, Session 3

A universal and reference-free blurriness measureC.Chen,W.Chen,J.A.Bloom,DialogicMediaLabs(UnitedStates)

Blurrinessisamongtheartifactsthatcanbeintroducedintostillimagesandvideosequencesbyprocessing.Measurementsofblurrinesscanbeincludedaspartofanassessmentofperceptualquality.Inthispaper,anoveluniversalandreference-freeblurrinessmeasurementapproachispresented.ThegradientimagegeneratedfromthegivenimageismodeledasaMarkovchain,specifiedusingaone-steptransitionprobabilitymatrix.Thetransitionprobabilitiesforselectedpairsofgradientvaluesarecomputedandcombinedtoformulatetheblurrinessmeasureforagivenimage.Thisisthefirsttimethattransitionprobabilitiesareappliedtoperceptualqualityassessment.Transitionprobabilitiescanexploittherelationshipbetweenadjacentelementsinthegradientimageandthusgiveverypromisingblurriness

Conference 7867: Image Quality and System Performance VIII


IS&T /

ReturntoContents


measure.Experimentalstudiesareconductedtocomparetheproposedmethodtothestate-of-the-artreference-freeblurrinessmeasurementalgorithms.Theresultsshowthattheproposedmethodoutperformsthecommonlyusedmeasures.Inthispaperwealsodiscussthecomputationalcomplexityoftheproposedapproachandconcludethat,foranumberofapplications,itcanbeusedforreal-timevideoanalysis.

7867-12, Session 3

Issues in the design of a no-reference metric for perceived blurH.Liu,TechnischeUniv.Delft(Netherlands);I.Heynderickx,PhilipsResearchNederlandB.V.(Netherlands)

Designingano-reference(NR)blurmetricthatcanreliablypredictwhathumansperceiveremainsanacademicchallenge.Inthispaper,weaddresssomesignificantissuesrelevanttothedevelopmentofaNRblurmetric.Basedonstate-of-the-artmetricsandthedataofpsychovisualexperimentsavailableintheliterature,weexplaincurrentconcernsanddifficultiesinthemetricdesign:(1)theclassificationofblurmetricsdependingontheirtargetedapplications,(2)theeffectofedgedetectionmethodontheperformanceofametric,(3)thesensitivityinperformanceofametricintermsofcontentindependency,(4)theaddedvalueofincludingvisualattentioninthedesignofablurmetric.Theseissuesarediscussedineitherquantitativeorqualitativeterms,whichisbeneficialforthefutureresearchindesigningamorereliableNRblurmetric.

7867-13, Session 3

Evaluating super resolution algorithmsY.Kim,J.H.Park,G.Shin,H.Lee,D.Kim,S.H.Park,J.Kim,SamsungElectronicsCo.,Ltd.(Korea,Republicof)

Thisstudyintendstoestablishasoundtestingandevaluationmethodologybaseduponthehumanvisualcharacteristicsforappreciatingtheimagerestorationaccuracy;inadditiontoproposingacolordifferenceequation(CDE)basedobjectiveevaluationmethod.Intotal,5differentsuperresolution(SR)algorithms-suchasiterativeback-projection(IBP),maximumaposteriori(MAP),robustSR,projectionsontoconvexsets(POCS),andanon-uniforminterpolation-wereselected.TheperformancecomparisonbetweentheSRalgorithmsintermsoftheirrestorationaccuracywascarriedoutthroughbothsubjectivelyandobjectively.TheformermethodologyreliesuponthetripletcomparisonmethodrecommendedbyISO20462-2.Forthelatter,thetwomostwidelyusedCIEstandardCDEs,i.e.CIELABandCIEDE2000,wereadoptedforevaluatingtherestorationaccuracyofthoseSRalgorithms.Consequently,POCSandanon-uniforminterpolationoutperformedtheothersforanidealsituation,whilenohugealgorithmdependencycouldbeobservedinarealworldcasewhereanypriorinformationabouttheblurkernelisremainedunknown.However,IBPandMAPresultedinhighersharpnessifablurkernelcanbeaccuratelyestimated.Objectivedataanalysiswithalargernumberofteststimulicanverifyourresultsandwillbediscussedinthefinalmanuscript.

7867-14, Session 3

Image quality assessment based on distortion identificationA.Chetouani,A.Beghdadi,Univ.Paris-Nord(France)

ANewGlobalFull-ReferenceImageQualitySystembasedonclassificationandfusionschemeisproposed.Itconsistsofmanysteps.ThefirststepisdevotedtotheidentificationofthetypeofdegradationcontainedinagivenimagebasedaLinearDiscriminantAnalysis(LDA)classifierusingsomecommonImageQualityMetric(IQM)asfeatureinputs.AnIQMperdegradation(IQM-D)isthenused

toestimatethequalityoftheimage.Foragivendegradationtype,theappropriateIQM-DisderivedbycombiningthetopthreebestIQMsusinganArtificialNeuralNetworkmodel.Theperformanceoftheproposedschemeisevaluatedfirstintermsofgooddegradationidentification.Then,foreachdistortiontypetheimagequalityestimationisevaluatedintermsofgoodcorrelationwiththesubjectivejudgmentsusingtheTID2008imagedatabase.

7867-15, Session 4

Image quality evaluation of light field photographyQ.Fu,Xi’anInstituteofOpticsandPrecisionMechanics(China);Z.Zhou,Univ.ofScienceandTechnologyofChina(China);Y.Yuan,BeiHangUniv.(China);B.Xiangli,TheAcademyofOpto-Electronics(China)

Lightfieldphotographycapturesboth2Dspatialand2Dangularinformationofascene.Digitalrefocusinganddigitalcorrectionofaberrationscouldbedoneafterthephotographistaken.However,capturing4Dlightfieldiscostlyandtradeoffsbetweendifferentimagequalitymetricsshouldbemadeandevaluated.

Thispaperexplorestheeffectsoflightfieldphotographyonimagequalitybyquantitativelyevaluatingsomebasiccriteriaforanimagingsystem.Asimulationapproachwasfirstdevelopedbyray-tracingthelightraysofadesignedopticalsystem.AstandardtestingchartfollowedbyISO12233wasprovidedastheinputsceneandanimagewasacquiredbylightfieldrenderingmethods.Asacomparison,thesamemeasuresweretakenforthesamemainlenssystemastheresultsofconventionalphotography.Imagequalitymetricswerecalculatedatseveraldifferentdepths.Anexperimentallightfieldsystemwasbuiltupanditsperformancewastested.

Thisworkhelpsbetterunderstandingtheprosandconsoflightfieldphotographyincontrastwithconventionalimagingmethodsandperceivingthewaytooptimizethedigital-opticaldesignofthesystem.

7867-16, Session 4

Feature-based automatic color calibration for networked camera systemS.Yamamoto,TokyoMetropolitanCollegeofIndustrialTechnology(Japan);K.Taki,N.Tsumura,T.Nakaguchi,Y.Miyake,ChibaUniv.(Japan)

Inthispaper,weproposeanautomaticcolorcalibrationtechniqueamongthenetworkedcameras.Eachcameraisassumedtohavetheoverlappingareaintheircapturedarea,andtodetectoneormorecommonobjectsatleast.Ouralgorismfirstlysearchesthesameobjectineachscene.MSERmethodisappliedtodetecttheareaofappropriateobject.Afterthat,thefeature-basedSIFTdetectioncancalculatetheshapeinformationonthecandidateareaas128vectors.Next,similaritycomparisonisperformedamongthecandidateareabyusing128vectorsofshapefeature.Ifthesimilarityoftheclosestpairisremarkablecomparedwithotherpairs,itsRGBoutputswhichareaveragedpixelvalueinthisareaareusedaselementsofcolorcalibration.Finally,thecomparisonofRGBoutputisperformedbetweenmostsimilarobjects,andRGBoutputsareusedtomakethecolorcalibrationmatrixwhenjudgedthesameobjectfrominformationofshapeandthecolor.ExperimentallywefoundthatRGBoutputbetweencamerasisgraduallycorrespondingbycontinuousdetection.

7867-17, Session 4

Analysis of estimation error in image quality measurementsP.D.Burns,CarestreamHealth,Inc.(UnitedStates)

Errorpropagationanalysisisoftenusedtopredictthetransformation


IS&T /

ReturntoContents

ofvariationorerrorwhenasignalundergoesatransformation.Forexampleinimagecapture,whenred,green,bluecolor-signalsaretransformedbyacolor-matrixoracolorimetrictransformation.Less-oftenconsidered,however,isthemeasurementerrorinherentinthecalculationofseveralderivedmeasurements,suchasmodulationtransferfunction(MTF),contrast-to-noiseratioornoise-powerspectrum.Eachofthesemeasurementsisactuallyanestimatebasedonobservedimagecharacteristics,suchasmeandigitalsignallevel,samplestandarddeviation.

Considerthesignalvariationthatresultfromexposureanddetectorcharacteristics.Whensignalsarecombined,eitherbymatrixorspatialoperations,soarethevariations,andthiscanbemodeledasformingafunctionofrandomvariables.Afterdescribingthetechnicalbasisforanalysisoftheestimationerrorintermsofcomponentparametervariations,weanalyzetheoriginandmagnitudeofmeasurementvariationforparticularmeasurementsofinterest:thespatialfrequencyresponse(SFR),asdescribedintheISOstandards,summarymeasuresofimagesharpness,suchasCMT(visually-weighted)acutance.Basedonthisapproach,ispossibletomodelthepropagationofthefirst-andsecond-ordererrorstatisticsintermsofexpectedbiasandvariationerrorwhenapplyingsuchmeasurementsinspecificperformancemonitoringorproductiontasks.

7867-18, Session 5

LCD displays performance comparison by MTF measurement using the white noise stimulus methodC.Mitjà,J.Escofet,Univ.PolitècnicadeCatalunya(Spain)

Theamountofimagesproducedtobeviewedassoftcopiesonoutputdisplaysaresignificantlyincreasing.Thisgrowingoccursattheexpenseoftheimagestargetedtohardcopyversionsonpaperoranyotherphysicalsupport.Eveninthecaseofhighqualityhardcopyproduction,peopleworkinginprofessionalimagingusesdifferentdisplaysinselecting,editing,processingandshowingimages,fromlaptopscreentospecializedhighenddisplays.Then,thequalityperformanceofthesedevicesiscrucialinthechainofdecisionstobetakeninimageproduction.Metricsofthisqualityperformancecanhelpintheequipmentacquisition.DifferentmetricsandmethodshavebeendescribedtodeterminethequalityperformanceofCRTandLCDcomputerdisplaysinclinicalarea.Oneofmostimportantmetricsinthisfieldisthedevicespatialfrequencyresponseobtainedmeasuringthemodulationtransferfunction(MTF).ThisworkpresentsacomparisonbetweentheMTFofseveralLCDdisplaysmeasuredbythewhitenoisestimulusmethod,oververticalandhorizontaldirections.Additionally,differentdisplaysshowparticularpixelsstructurepattern.Inordertoidentifythispixelstructure,asetofhighmagnificationimagesistakenfromeachdisplaytoberelatedwiththerespectiveverticalandhorizontalMTF.

7867-19, Session 5

Improving the quality of H 264 by using a new rate control modelM.Hrarti,A.Saadane,M.Larabi,XLIM-SIC(France)

ToimprovethequalityresultsofH.264,weproposeinthispaper,anewRate-Quantization(R-Q)modelresultingfromextensiveexperiments.Thislatterisdividedintotwoparts.ThefirstpartisanIntraR-QmodelusedtodetermineanoptimalinitialQPforI-Framesandderivedfromextensiveexperiments.TheQPdeterminationisbasedonbothtargetbit-rateandI-Framecomplexity.TheI-frametargetbit-rateisobtainedfromtheglobaltargetbit-ratebyusinganewnon-linearmodel.ThesecondpartisanInterR-QmodelusedtocalculateQP.Fromthis,wedemonstratealogarithmicrelationshipbetweentheQPandthetargetbitbudgetusedtoencodeagivenFrameorMacroblock.TheInterR-Qmodelcoefficientsareupdatedusingthestatisticsofthepreviouscodedunits.TheinterR-Qmodeldoesnotneedanycomplexitymeasure(suchaMAD)andreplacesbothlinearandquadraticmodelsusedinH.264ratecontroller.

7867-20, Session 5

A noble method on no-reference video quality assessment using block modes and quantization parameters of H 264/AVCI.Park,T.Na,M.Kim,KoreaAdvancedInstituteofScienceandTechnology(Korea,Republicof)

Videoqualityassessmentisanimportanttoolofguaranteeingvideoservicesinarequiredlevelofquality.AlthoughsubjectivequalityassessmentismorereliableduetothereflectionofHumanVisualSystem(HVS)thanobjectivequalityassessment,itisatime-consumingandveryexpensiveapproach,andisnotappropriateforreal-timeapplications.Therefore,muchresearchhasbeenmadeforobjectivevideoqualityassessmentinsteadofsubjectivevideoqualityassessment.

Amongthreekindsofobjectiveassessmentapproacheswhicharefull-reference,reduced-referenceandno-referencemethods,no-referencemethodhasdrawnmuchattentionbecauseitdoesnotrequireanyreference.Theencodingparametersaregoodfeaturestouseforno-referencemodelbecausetheencodedbitstreamscarryplentyofinformationaboutthevideocontentsanditiseasytoextractsomecodingparameterstoassessvisualquality.

Inthispaper,weproposeano-referencequalitymetricusingtwokindsofcodingparametersinH.264/AVC:quantizationandblockmodeparameters.TheseparametersareextractedandcomputedfromH.264/AVCbitstreams,withoutrelyingonpixeldomainprocessing.Wedesignalinearqualitymetriccomposedofthesetwoparameters.TheweightvaluesoftheparametersareestimatedusinglinearregressionwiththeresultsofsubjectivequalityassessmentwhichareobtainedbasedontheDSIS(DoubleStimulusImpairmentScale)methodofITU-RBT.500-11.

7867-21, Session 5

Prioritization of AL-FEC information for improving IP television services QoSE.Mammi,Univ.degliStudidiRomaTre(Italy);G.Russo,P.Talone,FondazioneUgoBordoni(Italy)

Inthedigitaltelevisionworld,animportanttransformationisrepresentedbythetelevisionoverIPservice.OneofthekeyfactorsenablingthespreadingoftelevisionoverIPisrepresentedbythequality.Furthermore,packetlossisprobablythemainservicedegradationsourceforthatservices.

TheproposedapproachcombinestheuseofAL-FECwiththeset-upofatransportqualitymechanismbasedonFECpacketsprioritization.ToFECpacketsisassignedatransferpriorityhigherthanthatofthemediapacketstransferredunderthebesteffortparadigm,thusreducingincongestedrouterstheamountofFECpacketlosses.Inthiswaytheerrorcorrectioncapabilityisimproved.Furthermore,astheFECstreamisusuallyapercentageofthemediaone,thechoiceofapplyingtheprioritizationtotheFECstreamandnottothewholemediaallowsreducingtheimpactofprioritizationoftelevisionservicetrafficonothertypesoftraffic,concurrentonthesamelink.ThetestshavebeenperformedonasimulatednetworkandonarealIPtest-bed.Theresultsshowtheeffectivenessoftheproposedapproachwithrespecttheun-prioritizedone,allowingtoobtainhighervideoqualityatthesamepacketlossrate.

7867-22, Session 6

Reference image method for measuring quality of photographs produced by digital camerasM.Nuutinen,AaltoUniv.SchoolofScienceandTechnology(Finland);O.Orenius,T.S.Säämänen,Univ.ofHelsinki(Finland);P.T.Oittinen,AaltoUniv.SchoolofScienceandTechnology



IS&T /

ReturntoContents

(Finland)

Computationalimagequalitymetricscanbedividedthreegroups:full-reference(FR),reduced-reference(RR)andno-reference(NR).FRmetricscannotbeappliedtocomputeimagequalityproducedbydifferentdigitalcamerasbecausepixel-wisereferenceimagesaremissing.NRmetricsareapplicableonlywhenthedistortiontypeisknownandthedistortionspaceislow-dimensional.RRmetricsprovideatradeoffbetweenNRandFRmetrics.ARRmetricdoesnotrequirepixel-wisereferenceimage,itonlyneedsasetofextractedfeatures.WiththeaidofRRfeaturesitispossibletotrytoavoidtheproblemsrelatedtotheNRmetrics.InthisstudyweusedRRmetricsformeasuringimagequalityofnaturalimagesproducedbydigitalcameras.Weproposeamethodwherereferenceimagesareproducedusingareferencecamera.Thereferenceimagesareexpectedtobenaturalreproductionsoftheviewsunderstudy.WetestedourmethodusingthreeRRmetricsproposedintheliterature.Theresultssuggestthattheproposedmethodispromisingwhentheproblemistomeasurethequalityofnaturalimagesproducedbydigitalcamerasforthepurposeofcamerabenchmarking.

7867-23, Session 6

RAW camera DPCM compression performance analysisK.Bouman,V.Ramachandra,K.Atanassov,M.Aleksic,S.R.Goma,QualcommInc.(UnitedStates)

TheMIPIstandardhasadoptedDPCMcompressionforRAWdataimagesstreamedfrommobilecameras.ThisDPCMislinebasedanduseseitherasimple1or2pixelpredictor.Inthispaper,weanalyzetheDPCMcompressionperformanceasMTFdegradation.Totestthisscheme’sperformance,wegeneratedSiemensstarimagesandbinarizedthemto2-levelimages.ThesetwointensityvalueswherechosensuchthattheirintensitydifferencecorrespondstothosepixeldifferenceswhichresultinlargestrelativeerrorsintheDPCMcompressor.(E.g.apixeltransitionfrom0to4095correspondstoanerrorof6betweentheDPCMcompressedvalueandtheoriginalpixelvalue).TheDPCMschemeintroducesdifferentamountsoferrorbasedonthepixeldifference.WepassedthesemodifiedSiemensstarchartimagestothiscompressorandcomparedthecompressedimageswiththeoriginalimagesusingIT3MTFresponseplotsforslantededges.Further,wediscussthePSFinfluenceonDPCMerroranditspropagationthroughtheimageprocessingpipe.

7867-24, Session 7

Brightness, lightness, and specifying color in high-dynamic-range scenes and imagesM.D.Fairchild,P.Chen,RochesterInstituteofTechnology(UnitedStates)

Traditionalcolorspaceshavebeenwidelyusedinavarietyofapplicationsincludingdigitalcolorimaging,colorimagequality,andcolormanagement.Thesespaces,however,weredesignedforthedomainofcolorstimulitypicallyencounteredwithreflectingobjectsandimagedisplaysofsuchobjects.Thismeansthedomainofstimuliwithluminancelevelsfromslightlyabovezerotothatofaperfectdiffusewhite(ordisplaywhitepoint).ThislimitstheapplicabilityofbothofthesespacestocolorproblemsinHDRimaging.Thisiscausedbytheirhardinterceptsatzeroluminance/lightnessandbytheiruncertainapplicabilityforcolorsbrighterthandiffusewhite.ToaddressHDRapplications,twonewcolorspaceswererecentlyproposed,hdr-CIELABandhdr-IPT.Theyarebasedonreplacingthepower-functionnonlinearitiesinCIELABandIPTwithamorephysiologicallyplausiblehyperbolicfunctionoptimizedtomostcloselysimulatetheoriginalcolorspacesinthediffusereflectingcolordomain.Thispaperwillpresenttheformulationofthenewmodels,evaluationsusingMunselldataincomparisonwithCIELAB,IPT,andCIECAM02,twosetsoflightness-scalingdataabovediffusewhiteandvariousformulationsofhdr-CIELABandhdr-IPTtopredictthevisualresults.

7867-25, Session 7

Evaluating HDR photos using Web 2 0 technologyG.Qiu,Y.Mei,TheUniv.ofNottingham(UnitedKingdom)

Inthiswork,weexploitWeb2.0technologytoevaluateHDRphotographs.Wehaveconstructedawebsiteforthispurpose.TheURLofthewebsiteishttp://www.hdri.cs.nott.ac.uk/whichiscurrentlyliveandhasbeencollectingviewerinputs.

Atanyonetime,twoversionsofthesameHDRphotosarerenderedbytwodifferenttonemappingoperatorsareshowntotheviewers.Theviewerisaskedtoclickontheonethatshe/heprefersorclickon“cannottellthedifference”buttoniftheusercannotdecidewhichoneisbetter.SuchpaircomparisonresultsarethenputintoanSQLdatabaseserver.ThesepaircomparisonresultsarethenprocessedbyarankingalgorithmcalledtheCondorcetmethod[6].ThismethodwillrankallversionsofthesameHDRphotoaccordingtotheperceivedvisualqualitiesbythewebsite’svisitors.Inthewebsite,thisisdisplayasthe“top10”withthefollowingURLhttp://www.hdri.cs.nott.ac.uk/top10.php

Thewebsitealsoenablesuserstouploadtheirowntonemappingoperators’resultsforcomparisonwithresultsofotheroperators.Ourvisionsisthatasweaccumulatemoreandmoreoperators(especiallywithuserscontributingnewoperatorsresults),thewebsitewilleventuallyenablenewtonemappingoperatorstocomparewithexistingonesthusmakingitmucheasiertocomparetherelativemeritsofanewtonemappingoperator.


Potential of face area data for predicting sharpness of natural imagesM.Nuutinen,AaltoUniv.SchoolofScienceandTechnology(Finland);O.Orenius,T.S.Säämänen,Univ.ofHelsinki(Finland);P.T.Oittinen,AaltoUniv.SchoolofScienceandTechnology(Finland)

Facedetectiontechniqueshavebeenusedformanydifferentapplications.Forexamplefacedetectionisabasiccomponentinmanyconsumerstillandvideocameras.Inthisstudywecomparetheperformanceoffaceareadataandfreelyselectedlocalareadataforpredictingthesubjectivesharpnessofphotographs.Thelocalvalueswerecollectedinasystematicwayfromimageandfortheanalysesweselectedonlytheoneswiththehighestperformance.Theobjectivesharpnessmetricwasbasedonthestatisticofthewaveletcoefficientsoftheselectedarea.Weusedthreeimagecontentswhosesubjectivesharpnessvalueshadbeenmeasured.Theimagecontentswerecapturedby13camerasandtheimageswereevaluatedby25subjects.Thequalityofthecameraswasbetweenlow-endmobilephonecamerastolow-endcompactcameras.Theimagecontentssimulatedtypicalphotosthatconsumerstakewiththeirmobilephones.Imagesizewasscaledto2Mpixinallcases.Thefaceareasontheimageswereapproximately7,20and74kpix.Basedontheresultsthefaceareadataisvaluableformeasuringthesharpnessofphotographsiffacesizeislargeenough.Whenthefaceareasizewas20kpixor74kpixtheperformanceofthemeasuredsharpnessvalueequalsorisbetterthanthesharpnessvaluemeasuredfromthebestlocalareas.Ifthefaceareawastoosmall(7kpix)theperformancewaslowcomparedtothebestlocalareas.


A video quality assessment model based on the MPEG-7M.Sato,Y.Horita,Univ.ofToyama(Japan)

InthispaperweproposeanewvideoqualityassessmentmodelbasedontheMPEG-7Descriptor.Previously,wehaveexaminedthe



IS&T /

ReturntoContents

estimatedaccuracyinstillimagebyusingtheMPEG-7descriptor.Weusedescriptor(“ColorLayout”,“ScalableColor”,“ColorStructure”,“EdgeHistogram”and“HomogeneousTexture”)fromtheMPEG-7Descriptor.Asaresult,wewasabletopresumethestillimagebyhighaccuracy.

Inthispaper,weusethepresumptionaccuracybasedonresultofpreviousmodel(estimateofframequality)whenweproposevideoqualityassessmentmodel.

WeestimatedVideoQualitybasedonFrameQualityandweencodedbyusing10-kindsofvideomaterial(bit-rate:448/1024/4096kbps,frame-rate:30/10/5fps,codingschemes:WindowsMediaVideo9andh.264).

TheresultsoftheestimationaccuracyforWMV9/h.264are:correlation0.98/0.98,averageerror0.13/0.15andmaximumerror0.48/0.46.Theresultsoftheestimationaccuracyforbothencodingmethods(WMVandh.264)are:correlation0.93,averageerror0.22andmaximumerror0.72.


Image quality: a tool for no-reference assessment methodsS.Corchs,F.Gasparini,F.Marini,R.Schettini,Univ.degliStudidiMilano-Bicocca(Italy)

Inthisworkweproposeanimagequalityassessmenttool.ThetooliscomposedofdifferentmodulesthatimplementseveralNoReference(NR)metrics(i.e.wheretheoriginaloridealimageisnotavailable).DifferenttypesofimagequalityattributescanbetakenintoaccountbytheNRmethods,likeblurriness,graininess,blockiness,lackofcontrastandlackofsaturationorcolorfulnessamongothers.Anextramodulepermitstheusertogiveasubjectiveratingoftheimagequality.Ourtoolaimstogiveastructuredviewofacollectionofobjectivemetricsthatareavailableforthedifferentartifacts/attributeswithinanintegratedframeworkthatalsooffersthepossibilityofasubjectiveevaluation.Aseachmetriccorrespondstoasinglemodule,ourtoolcanbeeasilyextendedtoincludenewmetricsortosubstitutesomeofthem.Thesoftwarepermitstoapplythemetricsnotonlygloballybutalsolocallytodifferentregionsofinterestoftheimage.Inthisway,ifnecessary,theusercaninteractwiththetoolandeventuallychooseanotherNRmethodorhecandecidetoapplyacertainmetriclocally(theregionofinterestcanbemanuallyselected)becausetheglobaloneisnotincorrespondencewithhissubjectivejudgment.Thiscomputer-aidprocesscanbeiterativelyappliedforeachoftheimages.


Extending video quality metrics to the temporal dimension with 2D-PCRC.Keimel,M.Rothbucher,K.Diepold,TechnischeUniv.München(Germany)

Theaimofanyvideoqualitymetricistodeliveraqualitypredictionsimilartothevideoqualityperceivedbyhumanobservers.Onewaytodesignsuchamodelofhumanperceptionisbydataanalysis.Inthiscontributionweintendtoextendthisapproachtothetemporaldimension.Eventhoughvideoobviouslyconsistsofspatialandtemporaldimensions,thetemporalaspectisoftennotconsideredwellenough.Insteadofincludingthisthirddimensioninthemodelitself,themetricsareusuallyonlyappliedonaframe-by-framebasisandthentemporallypooled,commonlybyaveraging.Thereforeweproposetoskipthetemporalpoolingstepandusetheadditionaltemporaldimensioninthemodelbuildingstepofthevideoqualitymetric.WeproposetousethetwodimensionalextensionofthePCR,the2D-PCR,inordertoobtainanimprovedmodel.WeconductedextensivesubjectivetestswithdifferentHDTVvideosequencesat1920x1080and25framesperseconds.Forverification,weperformedacrossvalidationtogetameasureforthereal-lifeperformanceoftheacquiredmodel.Wewillshowthatthedirectinclusionofthetemporaldimensionofvideointothemodelbuildingimprovestheoverallpredictionaccuracyofthevisualqualitysignificantly


Image quality metric benchmarking on compressed image databasesM.Nauge,M.Larabi,Univ.dePoitiers(France)

Inrecentliteraturehundredsofpaperhasproposedobjectivequalitymetricsdedicatedtoseveralimageandvideoapplications.

Ouraimistosimplifythechoiceofmetricsaccordingtotheapplication.Wecanalsoverifythebenefitofyournewimagealgorithmsintermofthereductionofvisualdegradation.

TosuccessfullycompletethistaskwelargelyusedstandardmethodologyfortheevaluationofobjectivemodelperformanceinrespectofthereportfromtheVideoQualityExpertsGroup.Thisbenchmarkingofmetricsusesfourimagedatabasesfromdifferentcountries,comprisingofalargerangeofimagecontents,imagedistortiontypes,subjectivescoresfromvariousobserversandvaryingequipmentanddisplayconfigurations.

Inthisexperimentwetesttwenty-sevenmetrics,fromsimplemathematicalmeasures(likePSNR)tocomplexmodelisationofHumanVisualSystem(likeHDR_VDP).


Optimal front light design for reflective displays in different ambient illuminationS.Wang,T.Chang,C.Li,Y.Bai,K.Hu,IndustrialTechnologyResearchInstitute(Taiwan)

Inthisstudy,aluminanceandcolor-tunablefrontlightdevicefore-paperdisplaywasbuilt.Thepsychophysicalexperimentresultsplayedanimportantroleinvisualperceptionsinceitcouldlinkthehumanpsychologicalresponsestothephysicalstimuli.Acolorcalibrationprocedurewasappliedonthedevicetopresentcorrect256levelluminanceand13differentcolortemperaturesatfixedilluminationof200cd/m2.Whenthehumanvisualsystemwasfirstexposedtoanewstimulus,itmustbeadaptedto13differentcolortemperatures.Humansubjectivetestingexperimentswerecompletedtorevealthehumanpreferencefortheluminanceandcolortemperatureofthefrontlightdeviceindifferentambientillumination.

Afteranalyzingtheexperimentalresults,itcouldbeconcludedthatwhentheambientilluminationwasdimmer,thehumanobserverspreferredbrighterfrontlight.However,indarksurrounding,thehighestluminanceoffrontlightdevicewas208cd/m2,andthemostpreferredcolortemperatureliedbetween11000Kand13000K.Also,thisstudyrevealedthathumanobserverspreferhigherluminanceandcolortemperaturewhilereadingtext.Intheotherwords,carefullycontrolledsubjectivepsychophysicalexperimentwillberequiredtoobtainmorehumanvisualperceptiondata,andthedatacanbeusedtodesignthefrontlightforreflectivedisplays.


Comparison of HDTV formats in a consumer environmentC.Keimel,A.Redl,K.Diepold,TechnischeUniv.München(Germany)

Highdefinitiontelevision(HDTV)hasbecomequitecommoninmanyhomes.Still,therearetwodifferentformatsusedcurrentlyincommercialbroadcasting:oneinterlacedformat,1080i50/60,andoneprogressiveformat,720p50/60.Therehavealreadybeenquiteafewcontributionscomparingthevisualqualityoftheseformatssubjectivelyundercommonstandardconditions.Theseconditions,however,don’tnecessarilyrepresenttheviewingconditionsinthereal-lifeconsumerenvironment.Inthiscontributionwethereforedecidedtodoacomparisonunderconditionsmorerepresentativeoftheconsumerenvironmentwithrespecttodisplayandviewingconditions.Furthermorewedecidedtoselectnotspeciallypreparedtest



IS&T /

ReturntoContents

sequences,butreal-lifecontentandcodingconditions.Aswewerenotinterestedintheinfluenceofthetransmissionerrors,wecapturedthesequencesdirectlyintheplay-outcentreofacablenetworkproviderinboth1080i50and720p50.AlsowecapturedforcomparisonthesamecontentindigitalPAL-SDTV.Weconductedextensivesubjectivetestswithoverall25testsubjectsandamodifiedSSISmethod.TheresultsshowthatbothHDTVformatsoutperformSDTVsignificantly.Although720p50isperceivedtohaveabetterqualitythan1080i50,thisdifferenceisnotsignificantinastatisticalsense.Thissupportsthevalidityofpreviouscontribution’sresults,gainedinstandardconditions,alsoforthereal-lifeconsumerenvironment.

7867-26, Session 8

Just noticeable difference vs visual difference: hypotheses and how to check whether they are true or notS.N.Bezryadin,P.Burov,KWEInternational,Inc.(UnitedStates)

Theproblemofaccuratecolorreproductionisahottopic,whichiscloselylinkedtotheproblemofaccuratemeasurementofhumanvisualthreshold,or“JustNoticeableDifference”(JND).

TheformulasMacAdamusedforhisColorMatchingEllipsoidswerecreatedwithanassumptionthathumanabilitytodistinguishclosestimulimaybedescribedwiththeGaussiandistribution,astandarddistributionforstochasticprocesses(wenameithypothesis#1).However,thisassumptionhasn’tyetbeenproved.

SincemostimagingscientistsbelievethatJNDexperimentsaretoocomplicatedandcostly,VisualDifference(dV)experimentshavegainedhighpopularity.Inconnectiontothistherearealsoseveralcommonbeliefs,suchas:

-VisualDifferenceexperimentalresultscanbeextrapolatedonpairswithColordifferencelessthan1JND(letusnameithypothesis#2)

-1JNDisequalto1dV(hypothesis#3)

-JNDisproportionaltodV(hypothesis#4).

However,therehavebeennoexperimentsdevotedtodeterminethecorrelationbetweenJNDanddV.

Thispaperpresentstwoexperimentaltechniquesthatcandeterminestimulipairswhichp%ofobserversperceiveasdifferent,whiletheother(100-p)%cannotdistinguish.Italsodiscussestheexperiment’srequirementssuchasaccuracyandstabilityofequipment.

Thepresentedtechniquewillhelptoresolvethefollowingproblems:

1.Determinewithhighaccuracystimulipairsof1JNDdifferenceinvariousGamutareas.

2.CheckwhethertheassumptionthathumanabilitytodistinguishsimilarstimulimaybedescribedwithGaussiandistributioniscorrect.

3.CheckifJNDanddVarereallyproportionalandwhetherVisualDifferenceexperimentalresultsmaybeextrapolatedonpairswithColordifferencelessthan1JND.

7867-27, Session 8

Device dependent scene dependent quality predictions using effective pictorial information capacityK.H.Oh,S.Triantaphillidou,R.E.Jacobson,Univ.ofWestminster(UnitedKingdom)

Thisstudyaimstointroduceimprovementsinthepredictionsofdevice-dependentimagequalitymetrics(IQMs).Avalidationexperimentwasfirstcarriedouttotestthesuccessofsuchametric,theEffectivePictorialInformationCapacity(EPIC),usingresultsfromsubjectivetestsinvolving32testscenesreplicatedwithvariousdegreesofsharpnessandnoisiness.Themetricwasfoundtobeagoodpredictorwhentestedagainstaverageratingsbut,asexpectedbydevice-dependentmetrics,itpredictedlesssuccessfullytheperceivedqualityofindividual,non-standardsceneswithatypicalspatialandstructural

content.

ImprovementinpredictionswasattemptedbyusingamodularimagequalityframeworkanditsimplementationwiththeEPICmetric.Itinvolvesmodellingacomplicatedsetofconditions,includingclassifyingscenesintoasmallnumberofgroups.Thesceneclassificationemployedforthepurposeusesobjectivescenedescriptorswhichcorrelatewithsubjectivecriteriaonscenesusceptibilitytosharpnessandnoisiness.Theimplementationthusallowsautomaticgroupingofscenesandcalculationofthemetricvalues.Resultsindicatethatmetricpredictionswereimproved.Mostimportantly,theywereshowntocorrelateequallywellwithsubjectivequalityscalesofstandardandnon-standardscenes.Thefindingsindicatethatadevice-dependent,scene-dependentIQMcanbeachieved.

7867-28, Session 9

Social image qualityG.Qiu,A.Kheiri,TheUniv.ofNottingham(UnitedKingdom)

Noabstractavailable

7867-29, Session 10

Utility studies for security encoded office documents: experimental design challengesC.A.Deller,G.J.Woolfe,CanonInformationSystemsResearchAustraliaPty.Ltd.(Australia)

Wehavedevelopedmethodologiestostudytheusabilityofdocuments.Thismethodologyhasbeenappliedtothestudyoftheimpactofvisiblesecuritypatternsintheusabilityoftypicalofficedocuments.Twospecificinformationretrievaltaskshavebeenexamined:theretrievaloftext-basedinformationfromawrittenreportandtheretrievalofnumericalinformationfromtablesandgraphs.Themethodologywehavedevelopedaimstominimizesourcesofuncontrolledvariabilityinthemeasurementswhilesimultaneouslyavoidingasystematicbiasfromlearningeffectsandmaintainingtaskequivalenceacrossalldocuments.Webelievethemethodologiesdevelopedinthisworkmayproveusefulinfuturestudiesofdocumentusability.

7867-30, Session 10

Printed fingerprints: a framework and first results toward detection of artificially printed latent fingerprints for forensicsS.Kiltz,M.Hildebrandt,J.Dittmann,C.Vielhauer,C.Krätzer,Otto-von-Guericke-Univ.Magdeburg(Germany)

InthepublicationofLotharSchwarzanaminoacidmodelforprintinglatentfingerprintstoporoussurfacesisintroduced,motivatedbytheneedforreproducibilitytestsofdifferentdevelopmenttechniquesforforensicinvestigations.Thiscanbeusedlegitimatelyforqualityassurance.However,thistechniquealsoenablesthefabricationofartificialtracesconstitutingapossiblesecuritythreat,motivatinganeedforresearchofappropriatefabricationdetectiontechniques.

Itisimportanttodetectanddistinguishbetweenreal,naturallatentfingerprintpatternsfromhumansandartificiallyprintedlatentfingerprintsundertheconsiderationoftheSchwarzaminoacidmodelandink-jetprintingtechniques.Thediscriminationshouldworkbeforeandaftertraditionaldevelopment/enhancingtechnologies(e.g.carbon-

black)areapplied.Themaincontributionofourworkisafirstproposalforanextensibleframeworkfortheexaminationoffingerprintsasaprocess-chaincomposedoffingerprintprinting,processingofthephysicalforensictracesample,digitalacquisition,andasubjectiveassessmentofthedigitalsample.Weevaluatefirstresultsandtendenciesoftheinfluenceofpropertiesontherecognitionofartificialtraces.Thoseincludepropertiesofthefingerprint,thesamplematerial



IS&T /

ReturntoContents

andage,theprinter,theink,thecontact-lessacquisitionsensorandthepropertiesrelatedthedactyloscopicexpert.

7867-31, Session 10

Monitoring image quality for security applicationsM.Larabi,D.Nicholson,Univ.dePoitiers(France)

Nowadays,securityapplicationsareofabiginterestforpersonalandpublicsecurity.Governmentsandbigcitiesareputtingalotofeffortandmoneyonit.Videosurveillanceisthemostcommonwaytomonitorandenhancethesecurityofstrategicplaces.Thereisalargediversityofsystems,videocodecs,sensors,andsoftwareinthisfield,whichmakesdifficulttheassessementofeventhecertificationofvideo-surveillancesystemsandsub-systems.Asinstalledsystemshavetooperateforawhile,interoperabilityandextensibilityareveryimportantforthedurabilityofavideosurveillancesystem.

7867-32, Session 10

Video quality and interpretability study using SAMVIQ and Video-NIIRSD.L.Young,RaytheonIntelligence&InformationSystems(UnitedStates);J.Ruszczyk,GeneralDynamicsAdvancedInformationSystems(UnitedStates);T.Bakir,Harris(UnitedStates)

Theresultsofavideoqualityandinterpretabilitystudyaredescribed.Thestudyvariedscenecontent,compressionratio,andencoderimplementationspecifics.LossofvideoqualityasmeasuredtheSubjectiveAssessmentMethodologyVideoImageQuality(SAMVIQ)iscomparedtolossofinterpretabilityasmeasuredbytheVideoNationalIntelligenceInterpretabilityRatingScale(Video-NIIRS).QualityratingresultsarecomparedtopredictiveindicatorsofqualitysuchastheVisualInformationFidelity(VIF).InterpretabilitysubjectiveratingresultsarecomparedtopredictiveobjectiveindicatorsofinterpretabilityusingtheMotionImageryQualityEquation(MIQE).

ThismaterialisbaseduponworksupportedbyaDoDcontract.Anyopinions,findingsandconclusionsorrecommendationsexpressedinthismaterialarethoseoftheauthor(s)anddonotnecessarilyreflecttheviewsoftheGovernment.

7867-33, Session 11

Weighted-MSE based on saliency map for assessing video quality of H 264 broadcasted video streamsH.Boujut,BordeauxUniv.(France);O.Hadar,Ben-GurionUniv.oftheNegev(Israel);J.Benois-Pineau,T.Ahmed,BordeauxUniv.(France);P.Bonnet,AudematWorldCastSystemsGroup(France)

ThepapercontributestoobjectivevideoqualityassessmentofbroadcastedvideooverDVBandIPnetworks.Weintroduceanobjectivevideoqualitymetricbasedonsaliencymaptoassesspacketandsignallossinfluenceonbroadcastedvideostreams.Thisnewmetric,wecalleditWeighted-MSE(WMSE),requiresthefull-referencevideolikeMSEandSSIMmetrics.UnlikeMSEwhichdoesnotconsidertheHumanVisualSystem(HVS),WMSEusesspatio-temporalsaliencymapstoincreasethecontributionofsalientregions.Wenotethattherearesimilarideasintheliterature,buttheyworkonspatialsaliencymapandusemagnitude-errorweightingscheme.DespitethefactthatSSIMmetricandtheapproachproposedintheliteraturealreadytakeintoaccounttheHVS,theWMSEalsoconsidersthetemporalsideofvisualperception.Weuseadequatefusionstrategiestocombinebothspatialandtemporalsaliencyinasinglemap.TheWMSEmetricsweproposeisdesignedasabasisofcomparisonforqualityofexperiencetests.

Furthermore,inthispaperwecontributeaswelltoafastersaliencymapextractionusingH.264compressedstreaminformation.

7867-34, Session 12

Metrics for regression testing and optimization of visual attention (saliency) modelsR.J.Moore,B.Stankiewicz,3MCo.(UnitedStates)

Whendevelopingapredictivevisualtoolclearmetricsarerequiredtoevaluatethemodel’sperformance.OneareaofrichresearchisintheareaofVisualAttentionModeling.Inthisfieldofresearchonetypicallycompareseyetrackingdatacollectedfromhumanobserverstothepredictionsmadebythemodel.Toevaluatetheperformanceofthesepredictionsresearchinvisualattentionmodelingtypicallyusessignaldetection(ReceiverOperatingCharacteristic(ROC))tomeasurethepredictivepowerofthesystem.Researcherstypicallycomparethemodel’sSaliencymapoutputtoeyetrackingdatatogenerateROCcurvesandvaluesforeachsaliencymap.Theaverageoverasetoftestimagesprovidesafinalmeasureofthesystem’sperformance.Inreleasingacommercialvisualattentionsystem,wehavespentconsiderableeffortindevelopingmetricsandtestingmethodsthatallowforregressiontesting,andareusefulforoptimizingthevisualattentionmodel.ItwasdeterminedthatROCalonewasnotasatisfactorymeasureofsystemperformance.Inthispaperwepresentthemethodsusedtotestandmeasuretheperformanceofourvisualattentionmodel,andhowtheyallowustobuildregressionteststhatcanbeusedtooptimizethemodel’sparameters.

7867-35, Session 12

Naturalness and interestingness of test images for visual quality evaluationR.Halonen,S.Westman,P.T.Oittinen,AaltoUniv.SchoolofScienceandTechnology(Finland)

Balancedandrepresentativetestimagesareneededtostudyperceivedvisualqualityinvariousapplicationdomains.Thisstudyinvestigatesnaturalnessandinterestingnessasqualityattributesinthecontextoftestimages.Takingatop-downapproachweaimtofindthedimensionswhichconstitutenaturalnessandinterestingnessintestimagesandtherelationshipofthesehigh-levelqualityattributes.Wecompareexistingcollectionsoftestimages(e.g.SonysRGBimages,ISO12640images,Kodakimages,Nokiaimagesandtestimagesdevelopedwithinourgroup)inanexperimentinvolvingqualitysortingandstructuredinterviews.Basedonthedatagatheredweanalyzetheviewer-suppliedcriteriafornaturalnessandinterestingnessacrossimagetypes,qualitylevelsandjudges.Thisstudyadvancesourunderstandingofthesubjectivecriteriausedwhenjudgingimagequalityaswellasenablesthevalidationofcurrenttestimagesandfurtherstheirdevelopment.



IS&T /

ReturntoContents

Conference 7868: Visualization and Data Analysis 2011Monday-Tuesday24-25January2011PartofProceedingsofSPIEVol.7868VisualizationandDataAnalysis2011

7868-01, Session 1

Data repository mapping for influenza protein sequence analysisD.A.Pellegrino,Jr.,C.Chen,DrexelUniv.(UnitedStates)

Thispaperintroducesanewmethodforcreatinganinteractivesequencesimilaritymapofallknowninfluenzavirusproteinsequencesandintegratingthemapwithexistinggeneralpurposeanalyticaltools.TheNCBIdatamodelwasdesignedtoprovideahighdegreeofinterconnectednessamongstdataobjects.Substantialandcontinuousincreaseindatavolumehasledtoalargeandhighlyconnectedinformationspace.Researchersseekingtoexplorethisspacearechallengedtoidentifyastartingpoint.Theyoftenchoosedatathatispopularintheliterature.Referencesintheliteraturefollowapowerlawdistributionandpopulardatapointsmaybiasexplorerstowardpathsthatleadonlytodead-endsofwhatisalreadyknown.Tohelpdiscovertheunexpectedwedevelopedaninteractivevisualanalyticssystemtomaptheinformationspaceofinfluenzaproteinsequencedata.ThedesignismotivatedbytheneedsofeScienceresearchers.

7868-02, Session 1

GPU-accelerated visualization of protein dynamics in ribbon modeM.Wahle,S.Birmanns,TheUniv.ofTexasHealthScienceCtr.atHouston(UnitedStates)

Proteinsarebiomoleculespresentinlivingorganismsandessentialforcarryingoutvitalfunctions.Inherenttotheirfunctioningisfoldingintodifferentspatialconformations,andtounderstandtheseprocesses,itiscrucialtovisuallyexplorethestructuralchanges.Inrecentyears,significantadvancementsinexperimentaltechniquesandnovelalgorithmsforpost-processingofproteindatahaveroutinelyrevealedstaticanddynamicstructuresofincreasingsizes.Inturn,interactivevisualizationofthesystemsandtheirtransitionsbecamemorechallenging.

Therefore,muchresearchfortheefficientdisplayofproteindynamicshasbeendone,withthefocusbeingspacefillingmodels,butfortheimportantclassofabstractribbonorcartoonrepresentations,thereexistonlyfewmethodsforanefficientrendering.Yet,thesemodelsareofhighinteresttoscientists,astheyprovideacompactandconcisedescriptionofthestructureelementsalongtheproteinmainchain.

Inthiswork,amethodwasdevelopedtospeedupribbonandcartoonvisualizations.SeparatingtwophasesinthecalculationofgeometryallowstooffloadcomputationalworkfromtheCPUtotheGPU.Thefirstphaseconsistsofcomputingasmoothcurvealongtheprotein’smainchainontheCPU.Inthesecondphase,conductedindependentlybytheGPU,verticesalongthatcurvearemovedtosetupthefinalgeometricalrepresentationofthemolecule.

7868-03, Session 1

OpenOrd: an open-source toolbox for large graph layoutS.Martin,W.M.Brown,SandiaNationalLabs.(UnitedStates);R.Klavans,K.Boyack,SciTechStrategies,Inc.(UnitedStates)

Wedocumentanopen-sourcetoolboxfordrawinglarge-scaleundirectedgraphs.Thistoolboxisbasedonapreviouslyimplementedclosed-sourcealgorithmknownasVxOrd.Ourtoolbox,whichwecallOpenOrd,extendsthecapabilitiesofVxOrdtolargegraphlayoutbyincorporatingedge-cutting,amulti-levelapproach,average-linkclustering,andaparallelimplementation.Ateachlevel,verticesaregroupedusingforce-directedlayoutandaverage-linkclustering.The

clusteredverticesarethenre-drawnandtheprocessisrepeated.Whenasuitabledrawingofthecoarsenedgraphisobtained,thealgorithmisreversedtoobtainadrawingoftheoriginalgraph.Thisapproachresultsinlayoutsoflargegraphswhichincorporatebothlocalandglobalstructure.Adetaileddescriptionofthealgorithmisprovidedinthispaper.Examplesusingdatasetswithover600Knodesaregiven.Codeisavailableatwww.cs.sandia.gov/~smartin.

7868-04, Session 1

A pseudo-haptic knot diagram interfaceH.Zhang,IndianaUniv.-PurdueUniv.Indianapolis(UnitedStates);J.Weng,ZhejiangUniv.(China);A.Hanson,IndianaUniv.(UnitedStates)

Tomakeprogressinunderstandingknottheory,wewillneedtointeractwiththeprojectedrepresentationsofmathematicalknotswhichareofcoursecontinuousin3Dbutsignificantlyinterruptedintheprojectiveimages.Onewaytoachievesuchagoalwouldbetodesignaninteractivesystemthatallowsustosketch2Dknotdiagramsbytakingadvantageofacollision-sensingcontrollerandexploretheirunderlyingsmoothstructuresthroughacontinuousmotion.Recentadvancesofinteractiontechniqueshavebeenmadethatallowprogresstobemadeinthisdirection.Pseudo-hapticsthatsimulateshapticeffectsusingpurevisualfeedbackcanbeusedtodevelopsuchaninteractivesystem.Thispaperoutlinesonesuchpseudo-hapticknotdiagraminterface.Ourinterfacederivesfromthefamiliarpencil-and-paperprocessofdrawing2Dknotdiagramsandprovideshaptic-likesensationstofacilitatethecreationandexplorationofknotdiagrams.Acenterpieceoftheinteractionmodelsimulatesa``physically’’reactivemousecursor,whichisexploitedtoresolvetheapparentconflictbetweenthecontinuousstructureoftheactualsmoothknotandthevisualdiscontinuitiesintheknotdiagramrepresentation.Anothervalueinexploitingpseudo-hapticsisthatanacceleration(ordeceleration)ofthemousecursor(orsurfacelocator)canbeusedtoindicatetheslopeofthecurve(orsurface)ofwhomtheprojectiveimageisbeingexplored.Byexploitingthisadditionalvisualcues,weproceedtoafull-featuredextensiontoapseudo-haptic4Dvisualizationsystemthatsimulatesthecontinuousnavigationon4Dobjectsandallowsustosensethebumpsandholesinthefourthdimension.Preliminarytestsofthesoftwareshowthatmainfeaturesoftheinterfaceovercomesomeexpectedperceptuallimitationsinourinteractionwith2Dknotdiagramsof3Dknotsand3Dprojectiveimagesof4Dmathematicalobjects.

7868-05, Session 2

Interactive isosurfaces with quadratic C1 splines on truncated octahedral partitionsT.Kalbe,A.Marinc,TechnischeUniv.Darmstadt(Germany);M.Rhein,Univ.Mannheim(Germany);M.Goesele,TechnischeUniv.Darmstadt(Germany)

Thereconstructionofacontinuousfunctionfromdiscretedataisabasictaskinmanyapplicationssuchasthevisualizationof3Dvolumetricdatasets.

Here,weusealocalapproximationmethodforquadraticC1-splinesonuniformtetrahedralpartitionstoachieveagloballysmoothfunction.Thesplineisbasedonatruncatedoctahedralpartitionofthevolumetricdomain,whereeachtruncatedoctahedronisfurthersplitintoafixednumberofdisjuncttetrahedra.TheBernstein-Béziercoefficientsofthepiecewisepolynomialsaretherebydirectlydeterminedbyappropriatecombinationsofthedatavaluesinalocalneighborhood.Aspreviouslyshown,thesplinesandtheirderivativesprovideanapproximationordertwoforsmoothfunctionsaswellastheirderivatives.Wepresentthefirstvisualizationsusingthesesplinesandshowthattheyarewell-suitedforGPU-based,interactivehigh-qualityvisualizationofdiscretedata.


IS&T /

ReturntoContents

7868-06, Session 2

Indirect multi-touch interaction for brushing in parallel coordinatesR.Kosara,TheUniv.ofNorthCarolinaatCharlotte(UnitedStates)

Interactioninvisualizationisoftencomplicatedandtedious.Brushingdatainavisualizationsuchasparallelcoordinatesisacentralpartofthedataanalysisprocess,andsetsvisualizationapartfromstaticcharts.Modifyingabrush,orcombiningitwithanotherone,usuallyrequiresalotofeffortandmodeswitches,though,slowingdowninteractionandevendiscouragingmorecomplexquestions.

Weproposetheuseofmulti-touchinteractiontoprovidefastandconvenientinteractionwithparallelcoordinates.Byusingamulti-touchtrackpadratherthanthescreendirectly,theuser’shandsdonotobscurethevisualizationduringinteraction.Usingone,two,three,orfourfingers,theusercaneasilyandquicklyperformcomplexselections.Beingabletochangetheselectionsrapidly,theusercanexplorethedatasetmoreeasilyandeffectively,andcanfocusonthedataratherthantheinteraction.

7868-07, Session 3

The science of visual analysis at extreme scaleL.T.Nowell,U.S.Dept.ofEnergy(UnitedStates)

Noabstractavailable

7868-08, Session 4

A randomized framework for discovery of heterogeneous mixturesM.A.Livingston,A.M.Palepu,J.Decker,M.Dermer,U.S.NavalResearchLab.(UnitedStates)

“Mixturemodels”isthetermgiventomodelsthatconsistofacombinationofindependentfunctionscreatingthedistributionofpointswithinaset.Wepresentaframeworkforautomaticallydiscoveringandevaluatingcandidatemodelsforunstructureddata.Ourabstractionofmodelsenablesustoseamlesslyconsiderdifferenttypesoffunctionsindifferentnumbersofdimensionsasequallypossiblecandidates.Ourframeworkdoesnotrequireanestimateofthenumberofunderlyingmodelsortrainingonsampledata,allowspointstobeprobabilisticallyclassifiedintomultiplemodelsoridentifiedasoutliers,andincludesafewparametersthatananalyst(nottypicallyanexpertinstatisticalmethods)mayusetoadjusttheoutputofthealgorithm.Wegiveresultsfromourframeworkwithsyntheticdataandclassicdata.

7868-09, Session 4

Exploring height fields: interactive visualization and applicationsM.Allili,A.Villares,Bishop’sUniv.(Canada);D.Corriveau,Univ.deSherbrooke(UnitedStates)

Heightfieldsareanimportantmodelingandvisualizationtoolinmanyapplicationsandtheirexplorationrequirestheirdisplayatinteractiveframerates.Thisishardtoachieveevenwithhighperformancegraphicscomputersduetotheirinherentgeometriccomplexity.Typicalsolutionsconsistofusingpolygonalapproximationsoftheheightfieldtoreducethenumberofgeometricprimitivesthatneedtoberendered.Startingfromaroughapproximation,arefinementprocessisoperateduntiladesiredlevelofdetailisreached.Inthiswork,wepresentanovelefficientalgorithmthatstartswithanapproximationthatcarriesenoughinformationabouttheheightfieldsothatonlyfewrefinementstepsare

neededtoachieveanydesiredlevelofdetail.Ourinitialapproximationisasimpletriangulationwhosenodesarethecriticalpointsoftheheightfield,thatisthepeaks,pits,andpassesofthesurfacewhichgiveitsoverallshape.

Theextractionofcriticalpointsofthesurface,whichisadiscretestructure,isdoneusinganewlydesignedalgorithmbasedondiscreteMorsetheory.

7868-10, Session 4

An Evaluation of Methods for Encoding Multiple, 2D Spatial DataM.A.Livingston,J.Decker,Z.Ai,U.S.NavalResearchLab.(UnitedStates)

Datasetsoveraspatialdomainarecommoninanumberoffields,oftenwithmultiplelayers(orvariables)withindatathatmustbeunderstoodtogetherviaspatiallocality.Thusoneareaoflong-standinginterestisincreasingthenumberofvariablesencodedbypropertiesofthevisualization.Anumberofpropertieshavebeendemonstratedand/orprovensuccessfulwithspecifictasksordata,buttherehasbeenrelativelylittleworkcomparingtheutilityofdiversetechniquesformulti-layervisualization.Aspartofoureffortstoevaluatetheapplicabilityofsuchvisualizations,weimplementedfivetechniqueswhichrepresentabroadrangeofexistingresearch(ColorBlending,OrientedSlivers,Data-DrivenSpots,BrushStrokes,andStickFigures).Thenweconductedauserstudywhereinsubjectswerepresentedwithcompositesofthree,four,andfivelayers(variables)usingoneofthesemethodsandaskedtoperformataskcommontoourintendedendusers(GISanalysts).WefoundthattheOrientedSliversandData-DrivenSpotsperformedthebest,withStickFiguresyieldingthelowestaccuracy.Throughanalyzingourdata,wehopetogaininsightintowhichtechniquesmeritfurtherexplorationandofferpromiseforvisualizationofdatasetswithever-increasingsize.

7868-11, Session 5

Multivariate visualization of chromatographic systemsT.Urness,T.Marrinan,A.R.Johnson,M.F.Vitha,DrakeUniv.(UnitedStates)

Chemistsareoftenfacedwiththechallengeofseparatingandquantifyingthemoleculesinacomplexmixture.Pharmaceuticalcompanies,forexample,musttestformulationsforimpuritiesandtomakecertainthedrugcontainsthecorrectamountsofactiveingredients.Therearehundredsofchromatographicsystemstochoosefromwhendevelopinganalyticalmethods.Therefore,toavoidwastingtimeandmoney,itisvitalthatmethoddevelopmentbeguidedbychemicalprinciplesratherthanbytrialanderror.

Thismanuscriptisbrokenintofourmainsections.Inthefirst,wedescribechromatographyandthepracticalproblemwearetryingtosolve.Inthesecond,wediscussseveralmethodsforvisualizingmultivariatedata,includingglyphs,triangleplots,andparallelcoordinates.Wethendescribea3Dvisualizationtoolthatwehavecreatedinordertoanalyzelargesetsofchromatographicdata.Asdetailedinthatsection,ourapproachcombinesscatterplots,parallelcoordinates,andspecializedglyphstoassistintheanalysisofthedata.Inthefinalsection,wedemonstratetheutilityofthevisualizationtoolbyapplyingittotwochromatographicdatasets.

7868-12, Session 5

Visualization of dynamic adaptive resolution scientific dataA.Foulks,R.D.Bergeron,S.H.Vohr,TheUniv.ofNewHampshire(UnitedStates)

Interactivevisualizationofverylargedatasetsremainsachallenging

Conference 7868: Visualization and Data Analysis 2011


IS&T /

ReturntoContents

problemtothevisualizationcommunity.Onepromisingsolutioninvolvesusingadaptiveresolutionrepresentationsofthedata.Inthismodel,importantregionsofdataareidentifiedusingreconstructiveerroranalysisandareshowninhigherdetail.Duringthevisualization,regionswithhighererrorarerenderedwithhighresolutiondata,whileareasoflowerrorarerenderedatalowerresolution.Wehavedevelopedanewdynamicadaptiveresolutionrenderingalgorithmalongwithsoftwaresupportlibraries.TheselibrariesaredesignedtoextendtheVisITvisualizationenvironmentbyaddingsupportforadaptiveresolutiondata.VisITsupportsdomaindecompositionofdata,whichweusetodefineourARrepresentation.Weshowthatwiththismodel,weachieveperformancegainswhilemaintainingerrortolerancesspecifiedbythescientist.

7868-13, Session 5

A flexible low-complexity device adaptation approach for data presentationR.U.Rosenbaum,Sr.,A.Gimenez,Univ.ofCalifornia,Davis(UnitedStates);H.Schumann,Univ.Rostock(Germany);B.Hamann,Univ.ofCalifornia,Davis(UnitedStates)

Visualdatapresentationsrequireadaptationforappropriatedisplayonaviewingdevicethatislimitedinresourcessuchascomputingpower,screenestate,and/orbandwidth.Duetothecomplexityofsuitableadaptation,thefewproposedsolutionsavailableareeithertooresource-intensiveorinflexibletobeappliedbroadly.Effectiveuseandacceptanceofdatavisualizationonconstrainedviewingdevicesrequireadaptationapproachesthataretailoredtotherequirementsoftheuserandthecapabilitiesoftheviewingdevice.

Weproposeadynamicdeviceadaptationapproachthattakesadvantageofprogressivedatarefinement.Theapproachreliesonhierarchicaldatastructuresthatarecreatedonceandusedmultipletimes.Byincrementallyreconstructingthevisualpresentationontheclientwithincreasinglevelsofdetailandresourceutilization,wecandeterminewhentotruncatetherefinementofdetailsoastousetheresourcesofthedevicetotheirfullcapacities.Todeterminewhentofinishtherefinementforaparticulardevice,weintroduceaprofile-basedstrategywhichalsoconsidersuserpreferences.Wediscussthewholeadaptationprocessfromthestorageofthedataintoascalablestructuretothepresentationontherespectiveviewingdevice.Thisparticularimplementationisshownfortwocommondatavisualizationmethods,andempiricalresultsweobtainedfromourexperimentsarepresentedanddiscussed.

7868-14, Session 6

EdgeMaps: visualizing explicit and implicit relationsM.Doerk,S.Carpendale,C.Williamson,Univ.ofCalgary(Canada)

WiththisworkweintroduceEdgeMaps:anewmethodforintegratingthevisualizationofimplicitandexplicitdatarelations.Explicitrelationsarespecificconnectionsbetweenentitiesalreadypresentinagivendataset,whileimplicitrelationsarederivedfrommultidimensionaldatabasedonsharedpropertiesandsimilaritymeasures.Manydatasetsincludebothexplicitandimplicitrelationsthatareoftennotaccountedfortogetherininformationvisualizations.Node-linkdiagramstypicallyfocusonexplicitdataconnections,whilenotincorporatingimplicitsimilaritiesbetweenentities.Multi-dimensionalscalingconsiderssimilaritiesbetweenitems,however,explicitlinksbetweennodesarenotdisplayed.Incontrast,EdgeMapsvisualizebothimplicitandexplicitrelationsbycombiningandcomplementingspatializationandgraphdrawingtechniques.Asacasestudyforthisapproachwechoseadatasetofphilosophers,theirinterests,influences,andbirthdates.Byintroducingthelimitationofactivatingonlyonenodeatatime,interestingvisualpatternsemergethatresembletheaestheticsoffireworksandwaves.Wearguethattheinteractiveexplorationofthesepatternsenablestheviewertograspthestructureofagraphbetterthancomplexgraphvisualizations.

7868-15, Session 6

Visualizing node attribute uncertainty in graphsN.Cesario,A.Pang,Univ.ofCalifornia,SantaCruz(UnitedStates);L.Singh,NorthwesternUniv.(UnitedStates)

Visualizationisfrequentlymisrepresentativeofactualdataasitisanabsoluterepresentationwhenuncertaintyoftenexistsinthedata.Whilevarioustechniquesandtoolsexistforvisualizinguncertaintyinscientificvisualizations,thesedonotexistforvisualizinginformationsuchasgraph/networkdata.Specifically,toourknowledge,notoolexiststhatallowsausertoviewagraphornetworkwithuncertaintyinherentwithintheattributesofnodesandedges.Withtherecentprevalenceindatawhichcanberepresentedasagraph(e.g.socialnetworks),graphsarenolongersimple,bi-modaldatasetswithonlynodesandedges.Ourtaskisoftentoworkwithmulti-modalgraphsthatcontainmultipletypesofnodesandedgeswhereeachnode/edgecanhavemany-perhapshundreds-ofattributes,andtheseattributesroutinelyhavesomeuncertaintyattachedtothem.Moreover,itisoftenusefultocomparemultiplegraphsofthistypeaswellastheegonetworksofnodesinthesegraphs.Inthispaperwepresentvarioustechniquesandaprototypetoolthatcanbeusedtovisualizemulti-modalgraphdatawithuncertaintyattachedtoeachattributeandcomparemultiplesuchgraphswithoneanother.

7868-16, Session 6

Interactive visualization of scattered moment tensor dataH.Obermaier,Univ.ofKaiserslautern(Germany);M.I.Billen,Univ.ofCalifornia,Davis(UnitedStates);H.Hagen,Univ.ofKaiserslautern(Germany);M.Hering-Bertram,Fraunhofer-InstitutfürTechno-undWirtschaftsmathematik(Germany)

Inthispaperwepresentanumberofnovelextractionandvisualizationtechniquesforinteractiveanalysisofscatteredmomenttensorfields.

Symmetricsecond-ordermomenttensorsderivedfromseismicmeasurementsduringearthquakesarerelatedtostresstensorsandcontainimportantgeologicalinformationaboutsurfacedisplacementintheearth’smantle.Forabetterunderstandingofearthquakesources,typesandproperties,analysisofthistypeofdatasetsiscrucial.Themethodsintroducedinthisworkfacilitateinteractivevisualizationofscatteredmomenttensordatatosupportearthquake,source,anddisplacementanalysis.Tothisgoal,wecombinevisualizationsofthree-dimensionalspatiallocationandorientationinformationderivedfrommomenttensordecompositionsandpresentinteractiontechniquestoprovidesemanticlinksbetweenbothviewpoints.Wedevelopnewtensorglyphshighlightingtheindefinitecharacterofmomenttensors,whileconveyingimportantgeologicalinformationsuchaswavepropagationandfaultorientations,showingsignificantimprovementsoverclassicbeachball-glyphbasedvisualizationtechniques.Additionally,weproposenoveltensorclusteringandaveragingtechniquesbasedonaselectionofmomenttensorsimilaritymeasuresalongwithaccompanyingvisualizationmethodstoovercomevisualclutter,removeredundantinformationandhelpgainaninsightintoscatteredmomenttensordatasetsforearthquakeanalysis.

7868-17, Session 7

Visualizing frequent patterns in large multivariate time seriesM.C.Hao,M.Marwah,Hewlett-PackardLabs.(UnitedStates);H.Janetzko,Univ.Konstanz(Germany);R.K.Sharma,Hewlett-PackardLabs.(UnitedStates);D.A.Keim,Univ.Konstanz(Germany);U.Dayal,Hewlett-PackardLabs.(UnitedStates);D.Patnaik,N.Ramakrishnan,VirginiaPolytechnicInstituteandStateUniv.(UnitedStates)



IS&T /

ReturntoContents

Thedetectionofpreviouslyunknown,frequentlyoccurringpatternsintimeseries,oftencalledmotifs,hasbeenrecognizedasanimportanttask.However,itisdifficulttodiscoverandvisualizethesemotifsastheirnumbersincrease,especiallyinlargemultivariatetimeseries.Tofindfrequentmotifs,weuseseveraltemporaldataminingandeventencodingtechniquestoclusterandconvertamultivariatetimeseriestoasequenceofevents.Thenwequantifytheefficiencyofthediscoveredmotifsbylinkingthemwithaperformancemetric.Tovisualizefrequentpatternsinalargetimeserieswithpotentiallyhundredsofnestedmotifsonasingledisplay,weintroducethreenovelvisualanalyticsmethods:(1)motiflayout,usingcoloredrectanglesforvisualizingtheoccurrencesandhierarchicalrelationshipsofmotifsinamultivariatetimeseries,(2)motifdistortion,forenlargingorshrinkingmotifsasappropriateforeasyanalysisand(3)motifmerging,tocombineanumberofidenticaladjacentmotifinstanceswithoutclutteringthedisplay.Analystscaninteractivelyoptimizethedegreeofdistortionandmergingtogetthebestpossibleview.Aspecificmotif(e.g.,themostefficientorleastefficientmotif)canbequicklydetectedfromalargetimeseriesforfurtherinvestigation.Wehaveappliedthesemethodstotworeal-worlddatasets:datacentercoolingandoilwellproduction.Theresultsprovideimportantnewinsightsintotherecurringpatterns.

7868-18, Session 7

Visual pattern discovery in timed event dataM.Schaefer,F.Wanner,F.Mansmann,C.Scheible,V.Stennett,A.T.Hasselrot,D.A.Keim,Univ.Konstanz(Germany)

Businessprocesseshavetremendouslychangedthewaylargecompaniesconducttheirbusiness:Theintegrationofinformationsystemsintotheworkflowsoftheiremployeesensuresahighservicelevelandthushighcustomersatisfaction.Onecoreaspectofbusinessprocessengineeringareeventsthatsteertheworkflowsandtriggerinternalprocesses.Strictrequirementsoninterval-scaledtemporalpatterns,whicharecommonintimeseries,aretherebyreleasedthroughtheordinalcharacterofsuchevents.Itisthisadditionaldegreeoffreedomthatopensunexploredpossibilitiesforvisualizingeventdata.

Inthispaper,wepresentaflexibleandnovelsystemtofindsignificantevents,eventclustersandeventpatterns.Eacheventisrepresentedasasmallrectangle,whichiscoloredaccordingtocategorical,ordinalorinterval-scaledmetadata.Dependingontheanalysistask,differentlayoutfunctionsareusedtohighlighteithertheordinalcharacterofthedataortemporalcorrelations.Thesystemhasbuilt-infeaturesfororderingcustomersoreventgroupsaccordingtothesimilarityoftheireventsequences,temporalgapalignmentandstackingofco-occurringevents.Twocharacteristicallydifferentcasestudiesdealingwithbusinessprocesseventsandnewsarticlesdemonstratethecapabilitiesofoursystemtoexploreeventdata.

7868-19, Session 7

Enhancing visualization with real-time frequency-based transfer functionsE.Vucini,TechnischeUniv.Wien(Austria);D.Patel,ChristianMichelsenResearchAS(Norway);E.Groeller,TechnischeUniv.Wien(Austria)

Transferfunctionshaveacrucialroleintheunderstandingandvisualizationof3Ddata.Whileexhaustiveresearchhasscrutinizedthepossibleusesofoneandmulti-dimensionaltransferfunctionsinthespatialdomain,toourknowledge,noattempthasbeendonetoexploretransferfunctionsinthefrequencydomain.Inthisworkweproposetransferfunctionsforthepurposeoffrequencyanalysisandvisualizationof3Ddata.Frequency-basedtransferfunctionsofferthepossibilitytodiscriminatesignalscomposedfromdifferentfrequencies,toanalyzeproblemsrelatedtosignalprocessing,andtohelpunderstandingthelinkbetweenthemodulationofspecificfrequenciesandtheirimpactonthespatialdomain.

Wedemonstratethestrengthofthefrequency-basedtransferfunctionbyapplyingittomedicalCT,ultrasoundandMRIdata,physicsdata

aswellassyntheticseismicdata.Theinteractivityoftheproposedframeworkinbuildingcomplexfiltersandtheusageforstructureorfeatureenhancementcanbeausefuladditiontoconventionalclassificationtechniques.

7868-34, Session 8

Scientific visualization for data analysisD.L.Kao,NASAAmesResearchCtr.(UnitedStates)

Noabstractavailable

7868-20, Session 9

The role of visualization and interaction in maritime anomaly detectionM.Riveiro,G.Falkman,Univ.ofSkövde(Sweden)

Thesurveillanceoflargesea,airorlandareasnormallyinvolvestheanalysisoflargevolumesofheterogeneousdatafrommultiplesources.Timelydetectionandidentificationofanomalousbehaviororanythreatactivityisanimportantobjectiveforenablinghomelandsecurity.Whileitisworthacknowledgingthatmanyexistingminingapplicationssupportidentificationofanomalousbehavior,autonomousanomalydetectionsystemsforareasurveillancearerarelyusedintherealworld.Wearguethatsuchcapabilitiesandapplicationspresenttwocriticalchallenges:(1)theyneedtoprovideadequateusersupportand(2)theyneedtoinvolvetheuserintheunderlyingdetectionprocess.

Inordertoencouragetheuseofanomalydetectioncapabilitiesinsurveillancesystems,thispaperanalyzesthechallengesthatexistinganomalydetectionandbehavioralanalysisapproachespresentregardingtheiruseandmaintenancebyusers.Weanalyzeinputparameters,detectionprocess,modelrepresentationandoutcomes.Wediscusstheroleofvisualizationandinteractionintheanomalydetectionprocess.Practicalexamplesfromourcurrentresearchwithinthemaritimedomainillustratekeyaspectspresented.

7868-21, Session 9

Multiscale visual quality assessment for cluster analysis with self-organizing mapsJ.Bernard,T.vonLandesberger,S.Bremm,T.Schreck,TechnischeUniv.Darmstadt(Germany)

Clusteranalysisisanimportantdataminingtechniqueforanalyzinglargeamountsofdata,byreductiontoalimitednumberofclusters.Clustervisualizationtechniquesaimatsupportingtheuserinbetterunderstandingthecharacteristicsandrelationshipsamongthefoundclusters.Whilepromisingapproachestovisualclusteranalysisalreadyexist,theseusuallyfallshortofincorporatingthequalityoftheobtainedclusteringresults.However,duetothenatureoftheclusteringprocess,qualityplaysanimportantaspect,asformostpracticaldatasets,typicallymanydifferentclusteringsarepossible.Beingawareofclusteringqualityisimportanttojudgetheexpressivenessofagivenclustervisualization,ortoadjusttheclusteringprocesswithrefinedparameters,amongothers.

Inthiswork,wepresentanencompassingsuiteofvisualtoolsforqualityassessmentofanimportantvisualclusteralgorithm,namely,theSelf-OrganizingMap(SOM)technique.Wedefine,measure,andvisualizethenotionofSOMclusterqualityalongahierarchyofclusterabstractions.Thequalityabstractionsrangefromsimplescalar-valuedqualityscoresuptothestructuralcomparisonofagivenSOMclusteringwithoutputofadditionalsupportiveclusteringmethods.ThesuiteofmethodsallowstheusertoassesstheSOMqualityontheappropriateabstractionlevel,andarriveatimprovedclusteringresults.Weimplementourtoolsinanintegratedsystem,applyitonexperimentaldatasets,andshowitsapplicability.



IS&T /

ReturntoContents

7868-22, Session 10

Privacy-preserving data visualization using parallel coordinatesA.Dasgupta,R.Kosara,TheUniv.ofNorthCarolinaatCharlotte(UnitedStates)

Theproliferationofdatainthepastdecadehascreateddemandforinnovativetoolsindifferentareasofexploratorydataanalysis,likedataminingandinformationvisualization.However,theproblemwithreal-worlddatasetsisthatmanyoftheirattributescanidentifyindividuals,orthedataareproprietaryandvaluable.Thedataminingfieldhasdevelopedavarietyofwaysfordealingwithsuchdata,andhasestablishedanentiresubfieldforprivacy-preservingdatamining.Visualization,ontheotherhand,hasseenlittle,ifany,workonhandlingsensitivedata.Withthegrowingapplicabilityofdatavisualizationinreal-worldscenarios,thehandlingofsensitivedatahasbecomeanon-trivialissueweneedtoaddressindevelopingvisualizationtools.

Withthisgoalinmind,inthispaper,weanalyzetheissueofprivacyfromavisualizationperspectiveandproposeaprivacy-preservingdatavisualizationtechniquebasedonclusteringinparallelcoordinates.Wealsooutlinethekeydifferencesinapproachfromtheprivacy-preservingdataminingfieldandcomparetheadvantagesanddrawbacksofourapproach.

7868-23, Session 10

A tri-linear visualization for network anomaly detectionR.F.Erbacher,NorthwestSecurityInstitute(UnitedStates);R.B.Whitaker,UtahStateUniv.(UnitedStates)

Thisresearchdiscussesanovelapplicationofternaryplotstothevisualizationofnetworktrafficdata.Theseplotsprovetobeenormouslyeffectiveatidentifyinganomalousnetworkactivityandcanbevaluableinmonitoringnetworkactivitymuchmoreefficientlythancanbedonewithexistingtechniques.Thevisualizationwasimplementedinourexistingvisualizationinfrastructuretoreducedevelopmenttime.Testingwasperformedonactualnetworktrafficdatacollectedfromalocalnetwork.Multipleanomalieswereeasilyidentifiablewithinthedatasetwithoutanypriorknowledgeastothecontentsofthetestfile.Thispaperdiscussestheternaryplotanditsapplicationtonetworktrafficdata,theformulasneededtocalculateanddisplayternarycoordinates,andthebasicarchitectureforthevisualizationimplementation.

7868-24, Session 11

EmailTime: visual analytics and statistics for temporal emailM.ErfaniJoorabchi,J.Yim,C.D.Shaw,SimonFraserUniv.(Canada)

Althoughthediscoveryandanalysisofcommunicationpatternsinlargeandcomplexemaildatasetsaredifficulttasks,theycanbeavaluablesourceofinformation.WepresentEmailTime,avisualanalysistoolofemailcorrespondencepatternsoverthecourseoftimethatinteractivelyportrayspersonalandinterpersonalnetworksusingthecorrespondenceintheemaildataset.Ourapproachistoputtimeasaprimaryvariableofinterest,andplotemailsalongatimeline.EmailTimehelpsemaildatasetexplorersinterpretarchivedmessagesbyprovidingzooming,panning,filteringandhighlightingetc.Tosupportanalysis,italsomeasuresandvisualizeshistograms,graphcentralityandfrequencyonthecommunicationgraphthatcanbeinducedfromtheemailcollection.ThispaperdescribesEmailTime’scapabilities,alongwithalargecasestudywithEnronemaildatasettoexplorethebehaviorsofemailuserswithindifferentorganizationalpositionsbetweenJanuary2000andDecember2001.Wedefinedemailbehaviorastheemailactivitylevelofpeopleregardingaseriesofmeasuredmetricse.g.sentandreceivedemails,numbersofemail

addresses,etc.ThesemetricswerecalculatedthroughEmailTime.Resultsshowedspecificpatternsintheuseemailwithindifferentorganizationalpositions.Wesuggestthatintegratingbothstatisticsandvisualizationsinordertodisplayinformationabouttheemaildatasetsmaysimplifyitsevaluation.

7868-25, Session 11

A web-enabled visualization toolkit for geovisual analyticsQ.V.Ho,P.Lundblad,T.Åström,M.Jern,LinköpingUniv.(Sweden)

Weintroduceaframeworkandclasslibrary(GAVFlash)implementedinAdobe’sActionScript,designedwiththeintentiontosignificantlyshortenthetimeandeffortneededtodevelopcustomizedweb-enabledapplicationsforvisualanalyticsorgeovisualanalyticstasks.Throughanatomiclayeredcomponentarchitecture,GAVFlashprovidesacollectionofcommongeo-andinformationvisualizationrepresentationsextendedwithmotionbehaviorincludingscattermatrix,extendedparallelcoordinates,tablelens,choroplethmapandtreemap,integratedinamultiple,time-linkedlayout.Versatileinteractionmethodsaredrawnfrommanydatavisualizationresearchareasandoptimizedfordynamicwebvisualizationofspatio-temporalandmultivariatedata.BasedonanatomiclayeredcomponentthinkingandtheuseofprogramminginterfacemechanismtheGAVFlasharchitectureisopenandfacilitatesthecreationofneworimprovedversionsofexistingcomponentssothatideascanbetriedoutoroptimizedrapidlyinafullyfunctionalenvironment.FollowingtheVisualAnalyticsmantra,amechanism“snapshot”forsavingtheexplorativeresultsofareasoningprocessisdevelopedthataidscollaborationandpublicationofgainedinsightandknowledgeembeddedasdynamicvisualizationsinblogsorwebpageswithassociativemetadataor“storytelling”.


A 3D particle visualization system for temperature managementB.Lange,N.Rodriguez,W.Puech,Lab.d’InformatiquedeRobotiqueetdeMicroelectroniquedeMontpellier(France);H.Rey,X.Vasques,IBM(France)

Thispaperdealswitha3Dvisualizationtechniqueproposedtoanalyzeandmanageenergyefficiencyfromadatacenter.DataareextractedfromsensorslocatedintheIBMgreendatacenterinMontpellier.Thesesensorsmeasuredifferentinformationsuchashygrometry,pressureandtemperature.Wewanttovisualizeinrealtimethelargeamongofdataproducebythesesensors.Avisualizationenginehasbeendesigned,basedonparticlessystemandaclientserverparadigm.InordertosolveperformanceproblemsaLevelOfDetailsolutionhavebeendeveloped.ThesemethodsarebasedontheearlierworkintroducedbyJamesClarkin1976.Inthispaperweintroducetheparticlemethodusedforthisworkandsubsequentlyweexplaindifferentsimplificationmethodsthatwehaveappliedtoimproveoursolution


A digital topology-based method for the topological filtering of a reconstructed surfaceM.Allili,Bishop’sUniv.(Canada);D.Li,Univ.deSherbrooke(Canada);M.Allili,Univ.duQuébecenOttaouais(Canada)

Inthispaper,weuseconceptsfromdigitaltopologyforthetopologicalfilteringofreconstructedsurfaces.GivenafinitesetSofsamplepointsin3Dspace,weusethevoronoi-basedalgorithmofAmentaandBern



IS&T /

ReturntoContents

toreconstructapiecewise-linearapproximationsurfaceintheformofatriangularmeshwithvertexsetequaltoS.Atypicalsurfaceobtainedbymeansofthisalgorithmoftencontainssmallholesthatcanbeconsideredasnoise.Weproposeamethodtoremovetheunwantedholesthatworksasfollows.Wefirstembedthetriangulatedsurfaceinavolumetricrepresentation.Then,weusea3D-holeclosingalgorithmoftofiltertheholesbytheirsizeandclosethesmallholesthatareingeneralirrelevanttothesurfacewhilethelargerholesoftenrepresenttopologicalfeaturesofthesurface.Wepresentsomeexperimentalresultsthatshowthatthismethodallowsautomaticallyandeffectivelysearchingandsuppressingunwantedholesina3Dsurface.


A meta-notation for data visualizationS.Y.Lee,U.Neumann,TheUniv.ofSouthernCalifornia(UnitedStates)

Weproposeanotationdevisedtoexpressmajorstructuralcharacteristicsinwidely-useddatavisualizations.

Thenotationconsistsofunaryandbinaryoperatorsandtheycanbecombinedtogethertodescribeavisualization.

Bycapturingsignificantstructuralfeaturesofavisualization,itcanbeappliedinmatchingorcomparingtwovisualizationsinaconceptuallevel.

Inthispaper,wediscussthedesignofoperatorsinournotationbypresentingtheirconceptsandusages.

Thenweshowhowexpressiveournotationisbyexploitingsomeofcommonly-useddatavisualizationsandstudyrulesandrelationshipsderivedfromthem.

Theadvantageoftheproposedapproachisthatthebehaviorsofoperatorsprovideabasicsetofrequiredcapabilitieswithwhichanimplementationcanbeorganized.

Thus,itcanbeusedtodesignorsimulateasystem,whichinterconnectsandcommunicateswithvarioustypesofdatavisualizationtoolsbysendingandreceivingvisualizationrequestsbetweenthem.


Enhancing online timeline visualizations with events and imagesA.Pandya,A.Mulye,S.T.Teoh,SanJoséStateUniv.(UnitedStates)

Theuseoftimelinetovisualizetime-seriesdataisoneofthemostintuitiveandcommonlyusedmethods,andisusedforwidely-usedapplicationssuchasstockmarketdatavisualization,andtrackingofpolldataofelectioncandidatesovertime.Whileuseful,thesetimelinevisualizationsarelackingincontextualinformationofeventswhicharerelatedorcausechangesinthedata.Wehavedevelopedasystemthatenhancestimelinevisualizationwithdisplayofrelevantnewseventsandtheircorrespondingimages,sothatuserscannotonlyseethechangesinthedata,butalsounderstandthereasonsbehindthechanges.Wehavealsoconductedauserstudytotesttheeffectivenessofourideas.


Multivariate data visualization via outdoor scenesB.A.Hillery,R.P.Burton,BrighamYoungUniv.(UnitedStates)

AbstractVisualizationofmultivariatedatapresentsachallengeduetothesheerdimensionalityanddensityofinformation.Whenpresentingthedatasymbolically,thishighinformationdimensionalityanddensitymakesitdifficulttodevelopasymbologycapableofdisplayingitinasinglepresentation.Oneapproachtomultivariatevisualizationinvolvescreatingsymbolswithhigherdimensionality.Higherdimensionalsymbolscanbeproblematic,sincetheytypicallyrequiresignificanthumanattentiveprocessingtointerpret,offsettingtheirgreaterinformationalcapacity.Althoughattemptshavebeenmadetodevelophigher-dimensionalsymbolsthatareprocessedinapreattentivefashion,successhasprovenelusive.Recentcognitiveresearchindicatesthatoutdoorscenesareprocessedinapreattentivemanner.Weevaluateoutdoorscenesasacandidatefordevelopinganeffectivehigher-dimensionalsymbologybyimplementingthemandcomparingthemtoothermethods,bothstandardandpreattentive.



IS&T /

ReturntoContents

Conference 7869: Computer Vision and Image Analysis of Art IIWednesday26January2011PartofProceedingsofSPIEVol.7869ComputerVisionandImageAnalysisofArtII


Time and order estimation of paintings based on expert priors: applications in art history and curatorial treatmentR.S.Cabral,J.P.Costeira,Univ.TécnicadeLisboa(Portugal);F.deLaTorre,CarnegieMellonUniv.(UnitedStates);A.Bernardino,G.Carneiro,Univ.TécnicadeLisboa(Portugal)

Inthispaper,wepresentaframeworkforestimatingtheorderinganddateinformationofpaintingsanddrawings.Weformulatethisproblemastheembeddingintoaonedimensionmanifold,whichaimstoplacepaintingsfarorclosetoeachotheraccordingtoameasureofsimilarity.Ourformulationcanbeseenasamanifoldlearningalgorithm,albeitproperlyadaptedtodealwithexistingquestionsintheartcommunity.

Tosolvethisproblem,weproposeadynamicprogrammingapproachandaconvexoptimizationformulation.Bothmethodsareabletoincorporateartexpertiseaspriorstotheestimation,intheformofconstraints.Typesofinformationincludeexactorapproximatedatingandpartialorderings.Weexploretheuseofsoftpenaltytermstoallowforconstraintviolationtoaccountforthefactthatpriorknowledgemaycontainsmallerrors.

Sincetheproposedapproachliesontheexistenceofstatisticscorrelatingimagedatawithtimevariation,weprovideapreliminarystudyofthefeaturesavailableintheimageprocessingappliedtotheartsliterature.

Wedescribepossibleapplicationswheretimeinformation(andhence,thismethod)couldbeofuseinarthistory,fakedetectionorcuratorialtreatment.


Machine learning of multi-feature visual texture classifiers for the authentication of Jackson Pollock’s drip paintingsM.Al-Ayyoub,StonyBrookUniv.(UnitedStates);D.G.Stork,RicohInnovations,Inc.(UnitedStates);M.T.Irfan,StonyBrookUniv.(UnitedStates)

JacksonPollock’sactionpaintingsaresomeofthemostimportantworksinAmericanAbstractExpressionism.Therearemanyworksofdoubtfulattributionandoutrightfakes.Materialstudiesofpaint,support,priming,signaturesandprovenancearenotalwaysdefinitiveinattributionstudies,andsoTaylorandhiscolleaguesintroducedabox-countingalgorithmtoestimatefractalandscale-spacesignaturesofPollock’sworks.TheyreportedthatsuchsignaturesgenerallydifferedfromfakePollocksthattheirmethodcouldbeusedaspartofauthenticationprotocol.Ourcurrentprojectbuildsuponpriorworkbytrainingonmoreimagedata,andofhigherresolution,ofbothgenuinePollocksandfakes,andaimsatemployingfeatureextraction,featureselectionandclassifierselectiontechniquescommonlyusedinpatternrecognitionresearch.Herewepresenttheresultsofseveralsupervisedclassificationframeworks,suchasSupportVectorMachines(SVM),decisiontrees(DT),andAdaBoost.Weextractfeaturesfromthefractality,multifractality,pinknoisepatterns,topologicalgenus,andcurvaturepropertiesoftheimagesofcandidatepaintings,andaddresslearningissuesthathavearisenduetothesmallnumberofexamples.Inourexperiments,wehavefoundthattheunmodifiedclassifierslikeSupportVectorMachinesorDecisionTreealonedonotgivegoodresults.Inparticular,DecisionTreealonegivesanaccuracyofroughly60%.WeusedaDecisionTreeasaweaklearnerinAdaBoost,weobtainedaccuraciesofroughly80%.Thus,althoughoursetofobservationsisverysmall,boostingmethodscansignificantlyimproveclassificationaccuracyforPollockauthentication.


Improved curvature-based inpainting applied to fine art: recovering van Gogh’s partially hidden brush strokesD.G.Stork,RicohInnovations,Inc.(UnitedStates);Y.Kuang,F.Kahl,LundUniv.(Sweden)

Underpaintingsandpentimenti(revealedthroughx-rayimagingandinfraredreflectography)compriseimportantevidencerevealingtheintermediatestatesofaworkandthustheworkingmethodsofmanyartists.Basedondigitalimageprocessingandstatisticalanalysis,Shahram,StorkandDonohointroducedtheDe-pictalgorithm,whichrecoverslayersofbrushstrokesinpaintingswithopenbrushworkwhereseverallayersarepartiallyvisible,suchasvanGoghrqs“Selfportraitwithagreyfelthat.”Whilethatpreliminaryworkservedasaproofofconceptthatcomputerimageanalyticmethodscouldrecoversomeoccludedimages,theworkneededfurtherrefinementbeforeitcouldbeatoolforartscholars.Ourcurrentworkrectifiesthisomission.Weextendedthatearliermethodbyrefiningtheinpaintingstepthroughtheinclusionofcurvature-basedconstraints,inwhichamathematicalcurvaturepenaltytermforextractedchromaticlevellinesbiasesthereconstructiontowardmatchingtheartist’shandmotion.Werefineourmethodsusing“groundtruth”imagedata:passagesoffourlayersofbrushstrokesinwhichtheintermediatelayerswererecordedphotographically.Ateachsuccessivetoplayer(currentlyidentifiedbytheuser),weused$k$-meansclusteringcombinedwithgraphcutstoobtainchromaticallyandspatiallycoherentsegmentationofbrushstrokes.Wethenreconstructedstrokesatthedeeperlayerwithourcurvature-basedinpaintingalgorithmbasedonchromaticlevellines.OurmethodsareclearlysuperiortopreviousversionsoftheDe-pictalgorithmonvanGogh’sworks,andcouldbeappliedtotheclassicdrippaintingsofJacksonPollock,wherethedripworkismoreopenandthephysicsofsplashingpaintensuresthatthecurvaturemoreuniformthaninthehandcreatedbrushstrokesofvanGogh.


Did Caravaggio employ optical projections? An image analysis of the parity in the artist’s paintingsD.G.Stork,RicohInnovations,Inc.(UnitedStates)

WeexamineoneclassofevidenceputforthinsupportoftherecentclaimthattheItalianBaroquemasterCaravaggiosecretlyemployedopticalprojectorsasadirectdrawingaid.Specifically,wetesttheclaimsthatthereisan“abnormalnumber”ofleft-handedfiguresinhisworksand,morespecifically,that“DuringtheDelMonteperiodhehadtoomanyleft-handedmodels.”Wealsotestwhethertherewasareversalinthehandednessofagivenindividualmodelindifferentpaintings.SuchevidencewouldbeconsistentwiththeclaimthatCaravaggioswitchedbetweenusingaconvexlensprojectortousingaconcavemirrorprojectorandwouldsupport,butnotprove,theclaimthatCaravaggiousedopticalprojections.Weestimatetheparity(+or-)ofeachofCaravaggio’s76appropriateoilpaintingsbasedonthehandednessoffigures,theorientationofasymmetricobjects,placementofscabbards,depictedtext,andsoon,andsearchforstatisticallysignificantchangesinhandednessinfigures.Wealsotrackthedirectionoftheilluminationovertimeintheartist’soeuvre.Wediscusssomehistoricalevidenceasitrelatestothequestionofhispossibleuseofoptics.Wefindtheproportionofleft-handedfigureslowerthanthatinthegeneralpopulation(nothigher),andnosignificantchangeinestimatedhandednessevenofindividualmodels.Opticalproponentshavearguedthat“Bacchus”(1597)portraysaleft-handedfigure,butwegivevisualandculturalevidenceandconclude


IS&T /

ReturntoContents

thatthisfigureisinsteadright-handed,therebyrebuttingthisclaimthatthepaintingwasexecutedusingopticalprojections.Moreover,scholarsrecentlyre-discoveredtheimageoftheartistwitheaselandcanvasreflectedinthecarafeofwineatthefrontleftinthetableauin“Bacchus,”showingthatthispaintingwasalmostsurelyexecutedusingtraditional(non-optical)easelmethods.Weconcludethatthereis1)nostatisticallysignificantabnormallyhighnumberofleft-handedfiguresinCaravaggio’soeuvre,includingduringanylimitedworkingperiod,2)nostatisticallysignificantchangeinhandednessamongallfiguresorevenindividualfiguresthatmightbeconsistentwithachangeinopticalprojector,and3)thevisualandculturalevidencein“Bacchus”showsthefigurewasright-handedandthattheartistexecutedthisworkbytraditional(non-optical)easelmethods.WeconcludethatthegeneralparityandhandednessevidencedoesnotsupporttheclaimthatCaravaggioemployedopticalprojections.


A computer graphics reconstruction and optical analysis of scale anomalies in Caravaggio’s “Supper at Emmaus”D.G.Stork,RicohInnovations,Inc.(UnitedStates);Y.Furuichi,Consultant(Japan)

DavidHockneyhasarguedthattherighthandofthedisciple,thrusttotherearin“SupperatEmmaus”(1606),isanomalouslylargeasaresultofCaravaggiorefocusingaconcavemirrorprojector.Weshowrigorouslythattoachievesuchananomalouslylargeimage,Caravaggiowouldhaveneededtomakeextremelylarge,conspicuousandimplausiblealterationstohisstudiosetup,movinghispurportedmirrornearlyonemeterforwardbetweenprojectingthedisciple’slefthandandthenhisrighthand.Moreover,the192-cm-widecanvaswouldhavebeenofftothesideofanyaperture,sothatthelightfromthesubjectcouldstrikethemirrorandcanvas.Butsuchaplacementwouldmeanthatthelightfromfiguresattheextremesofthetableauwouldhavestruckthemirroratalargeanglewithrespecttotheopticalaxis,leadingtoblurry,uselessimages.Toavoidsuchseveredegradationintheprojectedimages,Caravaggiowouldhavelikelyhadtoswitchhiscanvasfromonesideoftheaperturetotheotherinordertocapturethefiguresattheextremesofthetableau.Allthesemajordisruptionstohisstudiowouldhaveimpeded---notaided---Caravaggioinhiswork.WeusecomputergraphicsreconstructionofCaravaggio’sstudiotoexploreanddemonstratetheseproblems.WearguethatCaravaggiomostlikelysetthesizesofthesehands“byeye”forartisticreasons.Inthiswayweargueagainsttheopticalprojectionclaimforthispainting.


Image analysis of the underdrawings in Lorenzo Lotto’s “Husband and wife”D.G.Stork,RicohInnovations,Inc.(UnitedStates);A.J.Kossolapov,StateHermitageMuseum(RussianFederation)

Underdrawingsandpentimentirevealintermediatestatesofapaintingandthustheworkingmethodsofsomeartists.IthasbeenclaimedthatLorenzoLottousedopticalprojectionsduringtheexecutionof“Husbandandwife”(1543)andthatunderdrawingsmightrevealevidenceoftracingofopticalprojections.Weanalyzex-rayandinfra-redimagesoftheunderdrawingsinthispainting---capturedundercareful,museum-studioconditionsandenhancedthroughdigitalimageprocessing---withspecialattentiontothepossibilityofevidenceoftracingsofopticalprojectionsinthekeyholeportionofthedepictedcarpet.Wealsostudytheworkinsituandinhigh-resolutionmacroopticalimagesofthecentralportionofthecarpetpattern.Thesephotographsrevealthatthetopportionofthekeyholepatternisnot“blurry,likeanout-of-focusimage,”butinsteadwasmerelyexecutedinasomewhatbroaderbrushthanneighboringpassages.Thesephotographsalsoshowthatthewhiteportionswereexecutedatopabroadlayerofdarkred,andthusthatnorecordofanopticalprojection

wouldhavebeenpresentwhenLottoexecutedthevisibleportion---theportionthatledtotheopticalclaim.Thereisnoevidenceoftracingmarks---inpencilorinanymedium---inthetop,visibleportionofthispassageeither.Assuch,thisvisual,infra-redandx-rayevidencedoesnotsupporttheclaimthatthispaintingwasexecutedunderopticalprojections.Wealsoreviewcontemporarytextualevidenceinearly16th-centuryVenicethathasbeenusedtosupporttheopticalprojectionclaimforLottoandconcludethatitalsofailstosupporttheprojectionclaimforthispainting.


Automated classification of quilt photographs into crazy and non-crazyA.Gokhale,IndianInstituteOfTechnology,Kharagpur(India);P.Bajcsy,Univ.ofIllinoisatUrbana-Champaign(UnitedStates)

Thisworkaddressestheproblemofautomaticclassificationandlabelingof19th-and20th-centuryquiltsfromphotographs,whichareclassifiedaccordingtothequiltpatternsintocrazyandnon-crazycategories.Themotivationofourworkisinautomatedannotationofalargecollectionofquiltimagesforresearchpurposesofhumanists.Theresearchvalueofannotationsforhumanistsisinunderstandingthedistinctcharacteristicsofanindividualquilt-makerorrelevantquilt-makinggroupsintermsoftheirchoicesofpatternselection,colorchoices,layout,andoriginaldeviationsfromtraditionalpatterns.Thecurrentannotationmethodismanualandtheassignmentisachievedbyvisualinspection.Accordingtoourknowledge,theredoesnotexistcurrentlyacleardefinitionofthelevelofcrazy-ness,noranautomatedmethodforclassifyingpatternsascrazyandnon-crazy.

Weapproachtheproblembymodelingthelevelofcrazy-nessbythedistributionofclustersofcolor-homogeneousconnectedimagesegmentsofsimilarshapes.Themodelisturnedintoasetofimagefeaturesthatareextractedandrepresentourmodelofcrazy-ness.Thefeaturesareinputintoasupervisedclassificationmethod,suchastheSupportVectorMachine(SVM)withtheradialbasisfunctionkernelandoptimizedusing10-foldcrossvalidation,usedinourwork.

Theclassificationmethodologyconsistsoffoursteps.Inthefirststep,acolor-homogeneousRegion/TexelisdetectedbycolorbasedK-meansclusteringfollowedbyconnectivityanalysis.Thisstepleadstoaclusterofcolor-homogeneousregionsthatrepresentquiltpatcheswithsimilarcolors.Thesecondstepusesadivide-and-conquerapproachtoidentifysub-clustersthatsharesimilarshapepropertiessuchasareaandperimeter.Foreachsub-cluster,statisticsofnearestneighbordistancesarecomputedforexample,themeanandvarianceofdistances.Inthethirdstep,aquiltsignatureperimageisformedfromparametersincludingthenumberofclusterscontainingasinglecolor-homogeneousRegion/Texel,numberofclusterscontainingmultiplecolor-homogeneousRegion/Texel,themaximumandaveragenumberofcolor-homogeneousregionsperclusterofregionsandtheminimumvariancepresentinanycluster.Ourselectionoftheseparametersisbasedontheobservationthatcrazypatternshaveasmallnumberofcolor-homogeneousandshape-similarregionsinaclusterandalargenumberofclusterscontainingonlyasingleregion.Theyalsohavenosymmetryandhencelargevarianceininter-Regionnearestneighbordistance.Incontrary,non-crazypatternswillhaveasmallnumberofclustersandalargenumberofcolor-homogeneousandshape-similarregionsinacluster.Finally,aSupportVectorMachine(SVM)modelistrainedusinglabeledquiltimagesand10-foldcrossvalidationisused.

WeimplementedtheclassificationmethodologyusingacombinationofJavaandMatlabcode.Thealgorithmwasappliedto40quiltimagesfromtheMATRIXdatabaseattheMichiganStateUniversity.Wereportalmost90percentclassificationaccuracyover40imagesusingSVManditsradialbasisfunction.Inthefuture,weplanonextendingthecategoricalmodelofcrazy-nesstoacontinuousfunctionreflectingthelevelofcraziness.

Conference 7869: Computer Vision and Image Analysis of Art II


IS&T /

ReturntoContents


Polarized light scanning for cultural heritage investigationJ.A.O.Toque,Y.Murayama,A.Ide-Ektessabi,KyotoUniv.(Japan)

Numerousculturalheritageartworkshaveshinysurfacesresultingformgold,silver,andothermetallicpigments.Inadditionvarnishoverlayeronoilpaintingsmakesitchallengingtoretrievetruecolorinformation.Thisisduetothegreateffectoflightingconditionwhenimagesareacquiredandviewed.Thereflectionoflightfromsuchsurfacesisacombinationofthesurface’sspecularanddiffusedlightreflections.Inthispaperthespecificproblemsencounteredwhendigitizingculturalheritagewerediscussed.Experimentalresultsusingtheimagesacquiredwithahigh-resolutionlargeflatbedscanner,togetherwithamathematicalmethodforimagecapturewerepresentedanddiscussedindetail.Focuswasgiveninseparatingthediffusedandspecularcomponentsofthereflectedlightforthepurposeofanalyticalimaging.Themathematicalalgorithmdevelopedinthisstudyenablesimagingofculturalheritageefficientlywithinapracticaltimelimit.


After digital cleaning: visualization of dirt layerM.N.Soriano,C.M.Palomero,Univ.ofthePhilippines(Philippines)

Wehavepreviouslyshownthataneuralnetworkcanbetrainedtoperformdigitalcleaningonanoilpaintingbylearningthetransformationfromdirtytocleanpixels.SuchanetworkwasusedtovirtuallycleananimageofFilipinoNationalArtist,FernandoAmorsolo’s1948oilpainting,MalacanangbytheRiver.Whenthepaintingwasremovedfromitsoriginalframe,itwasobservedthatthepartsthatwerepreviouslycoveredbytheframeweregenerallycleanerthantheexposedparts.Usingthecleanerunexposedpartsasourbasisforwhatthepaintingmighthavelookedlikehaditnotundergonedirtying,wetrainedaneuralnetworktolearnthetransformationfromdirtytocleansegmentsofapainting.Atotalof1,350pairsofinputandoutputpixelsweremanuallyselectedfromallaroundtheedgesofthepaintingimage.Specialcarewastakentopreservetextureinformationthebytakingtheinputandoutputpairfromthesametexturecomponent(brushstroke,shadowofbrushstroke,bumpsanddipsofcanvasweave).

Acomparisonofthepaintingimagebeforeandafterdigitalcleaningshowsmorevividcolorsandacleanerlookforthedigitally-cleanedpainting.Themask-likeboundaryofdirtbetweenthepartsthatwereexposedandunexposedduetotheframeisalsolessvisible.

Exploitingthetrainedneuralnetwork’sabilitytosolvethedesiredoutputforcompletelynewinputsallowedustoperformwholepainting-cleaninginatotallynon-invasivemanner.

Inthiswork,wedemonstratetwomethodstovisualizethedirtthatthedigital-cleaningprocedureremoved.Firstisthevectordifferencemethod,whereinwecomputethecolorchangebetweeneachoriginalandcleanedpixelasavectordifferenceindifferentcolorspaces.Thecolordifferencevectorissuperimposedontoaneutralcolorandrenderedforthewholeimage.Inthesecondmethodwemodelthedirtasatransparencyfilmthatissuperimposedontothecleanpainting.Atransparencyfunctiondependentonthecolordifferencebetweenoriginalandcleanedpixeldeterminestheopacityofeachpixelinthevirtual“dirt”film.Spectralmeasurementsonknownwhiteportionsofthepaintingprovideanestimateofthetransmittanceasafunctionofwavelength.Bothmethodsprovidegoodvisualizationofthedirtremovedandcouldofferinsightsonapainting’sdirtyingordiscolorationprocess.

7869-01, Session 1

Documenting Van Eyck’s Ghent Altarpiece: field work experiences from the cryptR.Spronk,Queen’sUniv.(Canada)andRadboudUniv.Nijmegen(Netherlands)

JanandHubertvanEyck’sfamousGhentAltarpiece(1432)inSt.BavoCathedralinGhent,Belgium,isthesinglemostimportantworkofEarlyNetherlandishpaintinginexistence,andisgenerallyacceptedtobeamongthemostimportantsurvivingartworksintheworld.In2010,thepolyptychwassubjectedtoanurgentconservationtreatmentandtotechnicaldocumentations,toestablishwhetherafullrestorationisnecessaryinthenearfuture.Theindividualpanelsweredocumentedwithinfraredreflectography,underultravioletlight,andwithhigh-resolutiondigitalmacro-photographyinvisiblelightandintheinfrared.Some20detailsweredocumentedwithX-radiographyandcomparedwithsuchdocumentsfromthe1980s,toestablishwhetherrecentdeteriorationscanbeobservedatthecraquelure-level.Thesupportsofthefourcentralpanelswereanalysedwithdendrochronology,tocomplementthealreadyavailablefindingsforthepanelsintheleftandrightzones.Thecentralpanelsweredocumentedwithmultispectralinfraredscanningandwithnon-destructiveinstrumentalanalysessuchasXRFandXRD,amongothers.Inthispresentation,RonSpronkwilldescribetheprojectandsomeofitsinitialresults.

7869-02, Session 1

Computer analysis of lighting style in fine art: an inter-artist studyD.G.Stork,RicohInnovations,Inc.(UnitedStates)

Stylometry---themathematicaldescriptionofartists’styles---hasbeenbasedonanumberofpropertiesofvisualart,suchascolor,brushstrokeshape,visualtexture,andcurvaturemeasuresofcontours.Weintroducelightingcoherence(theagreementamonglightingdirectionsestimatedthroughoutapainting)asapropertyofstyle.SurrealistssuchasGiorgiodeChiricoworkedfromimaginationratherthanmodelsanddeliberatelyintroducedincommensurateandcontradictorylightingclues;artistsofthehighRenaissancewhoworkedfromnature,suchasLeonardo,andphotorealistswhoworkedfromphotographs,suchasRichardEstes,striveforgreatcoherenceandagreementamonglightingclues.Perceptualstudiesshowthatobserversarepoorjudgesoflightingconsistencyinphotographsandthus,byextension,paintings,whilecomputermethodssuchasrigorouscast-shadowanalysis,occludingcontouranalysisandsphericalharmonicbasedestimationoflightfieldscanbequiteaccurate.Forthisreasons,computerlightinganalysismethodsmayprovideanewtoolforarthistoricalstudies.Wedefineascalarmeasureoflightingcoherencebasedonthedistributionoflightingdirectionsestimatedinapainting.Weusethismeasuretodescribethelightinginpaintingspreviouslyanalyzed(e.g.,Vermeer’s“Girlwithapearlearring,”delaTour’s“Christinthecarpenter’sstudio,”Caravaggio’s“Magdalenwiththesmokingflame”and“CallingofSt.Matthew”)andextendourcorpustoworkswherelightingcoherenceisofinteresttoarthistorians,suchasCaravaggio’s“AdorationoftheShepherds”(1609)fortheCapuchinchurchofSantaMariadegliAngeli.Ourmeasureoflightingcoherencemayhelprevealtheworkingmethodsofsomeartists,andindiachronicstudiesrevealchangesintheworkingmethodsofagivenartist.Wespeculateonartistsandarthistoricalquestionsthatmayultimatelyprofitfromthisnewcomputationaltool.

7869-03, Session 2

The automatic annotation and retrieval of digital images of prints and tile panels using network link analysis algorithmsG.Carneiro,J.P.Costeira,Univ.TécnicadeLisboa(Portugal)

Thestudyofthevisualartofprintmakingisfundamentalforarthistory.



IS&T /

ReturntoContents

Printmakingmethodshavebeenusedforcenturiestoreplicatevisualartworks,andtheseworkshaveinfluencedartistsforcenturies.

Particularlyinthiswork,weareinterestedintheinfluenceoftheseprintsonartistictilepanelpainters,whohaveproducedanimpressivebodyofworkinPortugal.Thestudyofsuchpanelshavegainedinterestfromarthistorians,whoessentiallytrytofindlinksbetweenprintsandtilepanelsinordertocomprehendtheevolutionofvisualarts.Severaldatabasesofdigitizedartimageshavebeenusedforsuchend,buttheuseofthesedatabasesreliesonmanualimageannotationsbyarthistoriansandaneffectiveinternalorganization.Weproposeanautomationofthesetasksusingstatisticalpatternrecognitiontechniques.Specifically,weintroduceanewmethodfortheautomaticanalysisofdatabasescontainingdigitalimagesofprintsandtilepanels.Themaincontributionofourpaperisanovelstatisticalpatternrecognitionmethodbasedonlinkanalysis.Thesuccessfulimplementationofthissystemshallenableamoreefficientstudyandresearchofartdatabasesbymakingtheanalysisprocessfasterandlessdependentonexpertusers.

7869-04, Session 2

Explaining scene composition using kinematic chains of humans: application to Portuguese tiles historyN.P.daSilva,M.Marques,G.Carneiro,J.P.Costeira,Univ.TécnicadeLisboa(Portugal)

Paintedtilepanels(Azulejo)areoneofthemostspecificPortugueseformsofart.Mostofthesepanelsareinspiredon,andsometimesareliteralcopiesof,paintingsandprints.InordertostudytheinfluencesinAzulejo(tile),ArtHistoriansneedtotracetheseroots.Todothattheymanuallysearchdatabasesofprintssearchingforsimilaritiesbetweenthetilepanelandthepaintingsandprintsdatabase.Thisisanoverwhelmingtaskthatshouldbeautomatedasmuchaspossible.Amongseveralcues,theposeofhumansandthegeneralcompositionofpeopleinasceneisquitediscriminative.

Thispaperdescribesahumanposematchingframeworkforcompositionanalysisin16th-18thcenturyprintsandengravings.Theposeannotationsintheprintsarerepresentedbyawireframewiththekinematicchainofthearticulatedbodyofahuman.Modelingtheannotatedposesassubspaces(suchasusedin[1,2])wecancreateadictionaryofposes.Givenanimagefromapaneloftiles,weperformsubspacecomparisontofindtheprintsthatinspiredthepanel.Inordertohandlethedeformationsduetothedifferentscalingofbodypartsandtotheartisticinterpretationoftheoriginaldrawing,weexplaintheobservedpose(subspace)asasparseconvexcombinationofthesubspacedictionary[3,4].Thisconvexcombinationisinterpretedasanonparametric(empirical)probabilitydistribution.Thesystemretrievestheprintscorrespondingtothepositivecoefficients,togetherwiththeassociatedprobabilityofhavinginspiredthetiles’work.

[1]C.Tomasi,T.Kanade,“Shapeandmotionfromimagestreamsunderorthography:afactorizationmethod”,IJCV9(2):137-154,1992.

[2]C.Bregler,A.Hertzmann,H.Biermann,“RecoveringNon-Rigid3DShapefromImageStreams”,inProc.IEEECVPR’00,vol.2,pp.2690,2000.

[3]R.Tibshirani,“Regressionshrinkageandselectionviathelasso”,JRSSB58(1):267-288,1996.

[4]C.Chennubhotla,A.Jepson,“SparsePCAExtractingMulti-ScaleStructurefromData”,inProc.IEEEICCV’01,vol.1,pp.641,2001.

7869-05, Session 2

Top-down analysis of low-level object relatedness leading to semantic understanding of medieval image collectionsP.Yarlagadda,J.A.Monroy,B.Carque,B.Ommer,UniversitätsKlinikumHeidelberg(Germany)

Imageunderstanding,anactiveresearchareaincomputervision,dealswiththeanalysisoflow-levelrelationshipsamongvariousobjectinstancesfoundinacollectionofimages.Suchobjectrelationshipsprovetobeusefulinputfortaskssuchasunderstandingthedifferentprinciplesofartisticdesignandforidentifyingthecharacteristicsofdifferentteamsofartiststhathavedrawnacollectionofimages.Inthiscontribution,weidentifyasuitablefeaturerepresentationfortheobjectinstancesandutilizeatop-downapproachtoobtainlow-levelrelationshipsamongthembasedonthefeatures.Finally,weanalyzeobjectrelationshipstoidentifydifferentartisticstylessuchastheconciseandaccuratestyleofHagenanworkshopofDieboldLauberorthedelicateandsketchystyleoftheswabianworkshopofLudwigHenfflin.Ourworkisbasedonadatabaseconsistingof27latemedievalpapermanuscriptsfromUpperGermanoriginarchivedbyHeidelbergUniversitylibrary.Thesecodicesareillustratedwithmorethan2,000halforfull-pagetinteddrawings.

7869-06, Session 2

A framework for analysis of large database of old art paintingsJ.DaRugna,G.Chareyron,PôleUniv.LéonarddeVinci(France)

Formanyyears,alotofmuseumsandcountriesorganizethehighdefinitiondigitalizationoftheirowncollections.Inconsequence,theygeneratemassivedataforeachobject.Inthispaper,weonlyfocusonartpaintingcollections.Nevertheless,wefacedaverylargedatabasewithheterogeneousdata.Indeed,imagecollectionincludesveryoldandrecentscansofnegativephotos,digitalphotos,multiandhyperspectralacquisitions,X-rayacquisition,andalsofront,backandlateralphotos.Moreover,wehavenotedthatartpaintingssufferfrommuchdegradation:craquelure,softening,artifact,humandamagesand,overtimecorruption.Consideringthat,itappearsnecessarytodevelopspecificapproachesandmethodsdedicatedtodigitalartpaintinganalysis.Consequently,thispaperpresentsacompleteframeworktoevaluate,compareandbenchmarkdevotedimageprocessingalgorithms.

Inthefirstpartwepresentimages,diversityofacquisitionmethodsandallunderlyingdifficultieslinkedtoourproblematic.Inthesecondpartwediscusstheoverallschemaofourframework.Finally,weintroduceanewapproachtocraqueluredetectiondesignedtooldpainting.

Toconclude,ourframeworkisdesignedtoanalyzeandbenchmarkimageprocessingalgorithminthespecificcontextofoldartpainting.Itishighlyupgradeableandcustomizable.

7869-07, Session 3

Image fusion for art analysisB.Zitova,M.Benes,J.Blazek,InstituteofInformationTheoryandAutomation(CzechRepublic)

Ourpaperaddressesproblemofmultimodaldataacquisitionandfollowingdatavisualizationforanartanalysisandinterpretation.Varioustypesofmodalitiesforacquisitionofdigitalimagesareusedforartanalysis.Thedatawecanobtainusingvariousmodalitiesdifferintwoways.Thegroupofdifferencesweareinterestedinaredetailsorcharacteristicsofanartwork,whichareapparentjustinthecertainmodality.Nexttothistheacquiredimagesdifferbytheirmutualgeometryandbytheirradiometricquality.Thesedifferenceclassesrepresenttwocategoriesofimageprocessingmethods.Thefirstonedealswitheffectivewayshowtocombinetheacquiredinformationintooneimage-imagefusion.Thesecondcategoryofmethodscoversdataenhancementandrestorationalgorithms.Intheproposedpaperwewillpresentthemethodologyforidentificationofobjectsdifferentinindividualmodalitychannels,thecomparisonofapplicabilityofimagefusionalgorithmsforartanalysis,andthecontrastpreservingimagefusionmethod.Fromthesecondcategorywewillpresenttheimagequalityenhancementforblurredandnoisydatawithspecialattentiontothemultimodalcase.



IS&T /

ReturntoContents

7869-08, Session 3

Recovery of handwritten text from the diaries and papers of David LivingstoneK.T.Knox,AirForceResearchLab.(UnitedStates);R.L.Easton,Jr.,RochesterInstituteofTechnology(UnitedStates);W.A.Christens-Barry,EquipoiseImaging,LLC(UnitedStates);K.Boydston,MegavisionInc.(UnitedStates)

DuringhisexplorationsofAfrica,DavidLivingstonekeptadiaryandwrotelettersabouthisexperiences.Neartheendofhistravelheranoutofpaperandinkandbeganrecordinghisthoughtsonleftovernewspaperwithinkmadefromseeds.Thesewritingssufferfromfading,frominterferencewiththeprintedtextandfrombleedthroughofthehandwritingontheothersideofthepaper,makingthemhardtoread.NewimageprocessingtechniqueshavebeendevelopedtodealwiththesepaperstomakeLivingstone’shandwritingavailabletothescholarstoread.

AscanoftheDavidLivingstone’spaperswasmadeusingatwelve-wavelength,multispectralimagingsystem.Thewavelengthsrangedfromtheultraviolettothenearinfrared.Inthesewavelengths,thethreedifferenttypeofwritingbehavedifferently,makingthemdistinguishablefromeachother.Becauseoneofthewritingshasbledthroughfromtheothersideofthepaper,thescansofthereversesidecanbeusedtoenhancetheeffect.Theresultisapseudocolorimagethatshowsthedesiredtextinahigh-contrastcolor,whilethetwotextstobesuppressedappearinlow-contrastcolorsmakingthedesiredtextlegibletothescholars.

7869-09, Session 3

Automation of digital historical map analysesT.Shaw,P.Bajcsy,Univ.ofIllinoisatUrbana-Champaign(UnitedStates)

Thispaperaddressestheproblemofautomatinganalysesofhistoricalmaps.Theproblemismotivatedbythelackofaccuracyandconsistencyinthecurrentcomparisonprocessofgeographicalobjectsfoundinhistoricalmapsbyvisualinspections.TheobjectiveofourworkistocompareshapecharacteristicsoftheGreatLakesregioncreatedinthe17ththroughthe19thcenturiesinadatasetof40FrenchandBritishmaps.Ourapproachdecomposesthevisualinspectionintostepssuchasobjectsegmentation,spatialscalecalibration,extractionofcalibratedobjectdescriptorsandcomparisonofdescriptorsovertimeandmultiplecartographerhouses.Theautomationofobjectsegmentationisachievedbytemplateshape-basedsegmentationusingHumomentsasshapedescriptorsandball-basedregiongrowing.Theautomationofspatialcalibrationisaccomplishedbyclassificationoflinesalongmapbordersandbymappingstripedboundariesintersectedbylatitudeandlongitudelinesintodegreesofarclength.Thus,shapecharacteristicsofsegmentationresultsinpixelscanbeconvertedtogeographicalunits,forexample,anareaofalakeinsquaremiles.Wereportourexperimentalevaluationsofautomationaccuracybasedonthe40FrenchandBritishmaps,aswellastheknowledgeobtainedfromtheareacomparisons.

7869-16, Session 3

Automatic multispectral ultraviolet, visible and near-infrared capturing system for the study of artworkJ.A.Herrera-Ramirez,M.Villaseca,J.Pujol,Univ.PolitècnicadeCatalunya(Spain)

Thespectralimagingtechnologyhasproveditsusefulnessinavarietyofsensingapplicationsrangingfromremotesensing,suchassatelliteorradarimaging,toartworkconservationwereithasrecentlygainedimportance.Inthecaseofthenear-infraredregion

(NIR)thespectralimaginghelpsintheanalysisofpaintingsidentifyingpigmentsthroughtheanalysisofthespectraandtheirdistributionovertheartwork,besidesitprovidesagoodtoolintheexplorationandstudyoftheunderdrawingsoftheartpieces.Inthevisibleregionofthespectraseveralcolorrelatedstudies,likecolorimagingandarchiving,canalsobecarriedout.Besides,inthecaseoftheUVrange,itsproperuseservesforthedetectionoforganicmaterialsthathavefluorescenceproperties.Inthisworkamultispectralcapturingprototypesystemintendedfortheanalysisofartworkintheultraviolet,visibleandnear-infraredrangeofthespectrumhasbeendeveloped.Severalaspectsrelatedtoitsconstructionaswellassimulatedandfirstexperimentalresultsarepresentedinthisstudy.Specifically,thesystemcapturestheinformationintherangeof350nmto1650nm.Itisbasedintwoimagingsensors,oneCCDandanInGaAscamera,andasetofLEDsactingasamultiplexedlightsourcethatallowextractingtheinformationofseveralspectralbandsinsidethismentionedrange.Thedatacanbeanalyzedasacubeofdata,astackofmonochromaticimages,orasetofreflectancespectrasamples,onepereachpixeloverthefieldofview.Togetthislastinformationtheautomaticsystemiscomplementedbyasetofcomputationalroutinesfortheacquisitionofimagesandthestudyofthemthroughseveralmathematicalalgorithms(Moore-PenrosepseudoinverseandMatrixRmethod),whichallowthereflectancespectraofthesamplesimagedtobereconstructedfromthemultispectralimagesacquiredifthesystemispreviouslytrainedwithasetofknownandcalibratedsamples.Adistortionalgorithm,basedonthepreliminaryacquisitionofapatternconsistingofagridofcircularspots,tocorrectthepossibleartifactsintroducedinthecaptureprocessisutilized.Inadditiontothisdistortioncorrection,aflatfieldprocedurewhichusesagainandanoffsetmatricesiscarriedoutovertheimagestoovercomethepossibleinhomogeneitiesintheilluminationofthesampleandjointlyhelpintheperformanceofasubsequentstitchingalgorithmormosaicing,basedontheexistingcorrelationbetweenthedigitallevelsofthemultispectralimages,forhighspatialresolutionimagesinlargeformatpaintings.Thiswholesystemunderscoresthepotentialoffurtherdevelopmentsinthisfield.

7869-11, Session 4

Automatic registration of multi-band reflectance and luminescence imagesD.Conover,TheGeorgeWashingtonUniv.(UnitedStates);J.Delaney,NationalGalleryofArt(UnitedStates);M.Loew,TheGeorgeWashingtonUniv.(UnitedStates)

Ashigh-resolutionimagesofpaintings,acquiredusingvariousimagingmodalities,becomemoreavailable,itisincreasinglyimportanttoachieveaccurateregistrationbetweentheimagesinordertoobtainabetterunderstandingofhowthepaintingwasconstructed.Thegoalsofthisprojectaretofirstaccuratelyregisterreflectanceandluminescenceimages,aswellastruecolor,scannedx-rays,andinfraredreflectograms.Then,usingtheregisteredsetofimages,automaticallyidentifyandemphasizeinformationnotvisibleatthesurfaceofthepainting.Theregistrationalgorithmwillidentifylargesetsofcandidatefiducialpointsinafirstimage,paireachfiducialpointwithapointinasecondimage,selectthebestsetoffiducialpointpairs,andthentransformthesecondimageusingthebestsetofpairstobringthesecondimageintoalignmentwiththefirst.

7869-12, Session 4

Art documentation quality in function of 3D scanning resolution and precisionE.Bunsch,MuseumPalaceatWilanow(Poland);R.Sitnik,WarsawUniv.ofTechnology(Poland)

Currently,alotofdifferent3Dscanningdevicesareusedfor3Dacquisitionofartartifactsurfaceshapeandcolor.Eachofthemhasdifferenttechnicalparametersstartingfrommeasurementprinciple(structuredlight,lasertriangulation,interferometry,holography)andendingonparameterslikemeasurementvolumesize,spatial



IS&T /

ReturntoContents

resolutionandprecisionofoutputdataandcolorinformation.Someofthe3Dscannerscangrabadditionalinformationlikesurfacenormalvectors,BRDFdistribution,multispectralcolor.Inthispaper,theproblemofestablishingofthresholdfortechnicalparametersof3Dscanningprocessinfunctionofrequiredinformationabouttheobjectisdiscussed.Onlytwomaintechnicalparametersareunderconsideration,duetocoverasmanydifferent3Dscanningdevicesaspossible,measurementsamplingdensity(MSD-representedbynumberofpointspersquaremillimeter)andmeasurementuncertainty(MU-directlyinfluencingfinaldataaccuracy).AlsodifferentmaterialsandfinishingtechniquesrequiresdifferentthresholdsofMSDandMUparameterstocollectsimilardocumentation(forexampledocumentationofobjectstateforartconservationdepartment)ofdifferentobjects.InthispaperweconsiderexemplarypaintingandstonesamplestovisualizewhatobjectfeaturescanbeobservedwithindifferentvaluesofMSDandMUparameters.

7869-13, Session 4

Investigation of the degradation mechanism and discoloration of traditional Japanese pigments by multispectral imagingJ.A.O.Toque,A.Ide-Ektessabi,KyotoUniv.(Japan)

Pigmentdegradationhasbeenasubjectofinterestamongresearchersinthefieldofculturalheritagestudies.Knowinghowpigmentsbehavewhensubjectedtodifferentelementssuchashightemperature,humidity,electromagneticradiationandmanymoreothersisofprimeimportance.Inthisstudy,theeffectsofsubjectingJapanesepigmentstohightemperaturewereinvestigated.Focuswasgivenontheeffectsintermsofpigmentdiscolorationandthemicromechanismofdegradation.Multispectralimageswereusedtotrackthechangesincolorandspectralreflectancebyreconstructingcolorimetricandspectralinformationfromtheimages.Themultispectralimagesweretakenusingahigh-resolutionflat-bedscannerequippedwithaline-CMOScamera.Inaddition,thepigmentswerecharacterizedusingcommerciallyavailablespectrometers,X-raydiffractionandX-rayfluorescencespectroscopytoascertaintheinfluenceofhightemperatureexposureofthepigments.Thehighresolutionmultispectralscansgavethemostvaluableinsightsintothediscolorationandmicromechanismofpigmentdegradationsincetheyprovidebothanalyticalandvisualinformation.

7869-14, Session 4

Improved methods for dewarping images in convex mirrors in fine art: applications to van Eyck and ParmigianinoY.Usami,WasedaUniv.(Japan);D.G.Stork,RicohInnovations,Inc.(UnitedStates);J.Fujiki,NationalInstituteofAdvancedIndustrialScienceandTechnology(Japan);H.Hino,WasedaUniv.(Japan);S.Akaho,NationalInstituteofAdvancedIndustrialScienceandTechnology(Japan);N.Murata,WasedaUniv.(Japan)

Wederiveanddemonstratenewmethodsfordewarpingimagesdepictedinconvexmirrorsinartworkandforestimatingthethree-dimensionalshapesofthemirrorsthemselves.Previousmethodswerebasedontheassumptionthatmirrorsweresphericalorparaboloidal,anassumptionunlikelytoholdforhand-blownglassspheresusedinearlyRenaissanceart.Weassumemerelythatthemirrorisradiallysymmetricandrequiremerelythattherebestraightsourcelinesintheactualscene.Weexpressthemirror’sshapelocallyasamathematicalseriesandposetheimagedewarpingtaskasthatofestimatingthecoefficientsintheseriesexpansion.Centraltoourmethodisthe“plumblineprinciple”:thattheoptimalcoefficientsarethosethatdewarpthemirrorimagesoastostraightenlinescorrespondingtostraightlinesinthesourcescene.Wesolveforthesecoefficientsalgebraicallythroughprincipalcomponentanalysis,PCA.Wefindthatitisimportanttoselectanappropriatesetofbasisfunctions,particularlywhentherearebutfewstraightlines,soastoavoidoverfitting.Ourmethodreliesonaglobalfigureofmerittobalancewarpingerrorsthroughouttheimageandtherebyreducesarelianceonthesomewhatsubjectivecriterionusedinearliermethods.Oncewehavefoundtheoptimalimagedewarping,wecomputethemirrorshapebysolvingadifferentialequationbasedontheestimateddewarpingfunction.WedemonstrateourmethodsontheArnolfinimirrorandrevealadewarpedimagesuperiortothosefoundinpriorwork---animagenoticeablymorerectilinearthroughoutandhavingamorecoherentgeometricalperspectiveandvanishingpoints.Moreover,wefindthemirrordeviatedfromsphericalandparaboloidalshape;thisimpliesthatitwouldhavebeenuselessasaconcaveprojectionmirror,ashasbeenclaimed.



IS&T /

ReturntoContents

Conference 7870: Image Processing: Algorithms and Systems IXMonday-Tuesday24-25January2011PartofProceedingsofSPIEVol.7870ImageProcessing:AlgorithmsandSystemsIX

7870-01, Session 1

An adaptive optimization of the polynomial wavelet thresholdD.Akopian,S.G.Sathyanarayana,S.S.Agaian,TheUniv.ofTexasatSanAntonio(UnitedStates)

Thispaperpresentsanewclassofpolynomialthresholdoperatorsfordenoisingsignalsusingwavelettransforms.Theoperatorsareparameterizedtoincludeclassicalsoft-andhard-thresholdingoperatorsandhavemanydegreesoffreedomtooptimallysuppressundesirednoiseandpreservesignaldetails.Toavoidthecomplicatedprocessofsignalmodelidentificationforspecifictypeofsignals,anadaptiveleastmeansquares(LMS)optimizationmethodisproposedforthepolynomialcoefficients.Itisshownthattheoptimalchoiceofthepolynomialcoefficientscanbeformulatedasaleastsquares(LS)problemifthetrainingsequencesareavailable.ThenanadaptiveLMSapproachfortheoptimizationofwaveletcoefficientsisproposedandstudiedasanapproachtoreducecomputationalcosts.Thisapproachallowsoptimizingcoefficientswithoutmatrixinversionsandifneededoptimallyadaptthethresholdpolynomialsfordifferentwavelettransformbandswithoutcomputationaloverheads.

7870-02, Session 1

Video denoising using separable 4D nonlocal spatiotemporal transformsM.T.Maggioni,TampereUniv.ofTechnology(Finland);G.Boracchi,PolitecnicodiMilano(Italy);A.Foi,K.O.Egiazarian,TampereUniv.ofTechnology(Finland)

Weproposeapowerfulvideofilteringalgorithmthatexploitstemporalandspatialredundancycharacterizingnaturalvideosequences.Thealgorithmimplementstheparadigmofnonlocalgroupingandcollaborativefiltering,whereahigher-dimensionaltransform-domainrepresentationisleveragedtoenforcesparsityandthusregularizethedata.Theproposedalgorithmexploitsthemutualsimilaritybetween3-Dspatiotemporalvolumesconstructedbytrackingblocksalongtrajectoriesdefinedbythemotionvectors.Mutuallysimilarvolumesaregroupedtogetherbystackingthemalonganadditionalfourthdimension,thusproducinga4-Dstructure,termedgroup,wheredifferenttypesofdatacorrelationexistalongthedifferentdimensions:localcorrelationalongthetwodimensionsoftheblocks,temporalcorrelationalongthemotiontrajectories,andnonlocalspatialcorrelation(i.e.self-similarity)alongthefourthdimension.Collaborativefilteringisrealizedbytransformingeachgroupthroughadecorrelating4-Dseparabletransformandthenbyshrinkageandinversetransformation.

Inthisway,collaborativefilteringprovidesestimatesforeachvolumestackedinthegroup,whicharethenreturnedandadaptivelyaggregatedtotheiroriginalpositioninthevideo.

Theproposedfilteringprocedureisparticularlyeffectiveatnoiseremoval,forwhichitoutperformsthestateoftheart.

7870-03, Session 1

Intelligent edge enhancement using multilayer neural network based on multi-valued neuronsI.N.Aizenberg,S.Alexander,J.T.Jackson,T.Neal,J.Wilson,K.Kendrick,TexasA&MUniv.-Texarkana(UnitedStates)

Inthispaper,wesuggesttosolvetheedgeenhancementproblemusinganintelligentapproach.Weusehereamultilayerneuralnetworkbasedonmulti-valuedneurons(MLMVN)asanintelligentedgeenhancer.Aproblemofneuraledgeenhancementusingaclassicalmultilayerperceptron(MLP)wasalreadyconsideredbysomeauthors.SinceMLMVNsignificantlyoutperformsMLPintermsoflearningspeed,complexityandclassification/predictionrate,whensolvingbenchmarkanddifferentreal-worldproblems,itisveryattractivetoapplyitforsolvingtheedgeenhancementproblem.

Themainresult,whichispresentedinthepaper,isaprovenabilityofMLMVNtoenhanceedgescorrespondingtovariousedgedetectionoperators.Moreover,itispossibletoenhanceedgesonthenoisyimagesignoringanoisytexture.ItisshownthattolearnanyedgedetectionoperatorusingMLMVN,itisenoughtouseasingleimageforthelearningpurposes.

Themostimportantconclusionisthataneuralnetworkcanlearndifferentedgedetectionoperatorsfromasingleexampleandthenitcanprocessthoseimagesthatdidnotparticipateinthelearningprocessdetectingedgesdefinitelyaccordingtothelearnedoperatorwithahighaccuracy.

7870-04, Session 1

New loss functions for ordered hypothesis machinesR.B.Porter,LosAlamosNationalLab.(UnitedStates)

Justaslinearmodelsgeneralizethesamplemeanandweightedaverage,weightedorderstatisticmodelsgeneralizethesamplemedianandweightedmedian.Thisanalogycanbecontinuedinformallytogeneralizedadditivemodelsinthecaseofthemean,andStackFiltersinthecaseofthemedian.Bothofthesemodelclasseshavebeenextensivelystudiedforsignalandimageprocessing,butitissurprisingtofindthatforpatternclassification,theirtreatmenthasbeensignificantlyonesided.Generalizedadditivemodelsarenowamajortoolinpatternclassificationandmanydifferentlearningalgorithmshavebeendevelopedtofitmodelparameterswithfinitetrainingdata.HoweverStackFiltersremainlargelyconfinedtosignalandimageprocessingandapplicationtoclassificationisrarelyseen.

InpreviousworkwefoundthatapplyingStackFiltermodelclassestoclassificationproblemsisinterestingfrombothatheoreticalandapracticalperspective.Specifically,wefoundStackFilterdesignbyclassificationlossfunctionsexhibitedmanydesirablepropertiesintermsofcontrollingbiasandvariance,aswellasefficientoptimizationproceduresbasedonLinearProgramming.Wecallthisnewmodelclass,anditsassociatedoptimizationmethod,OrderedHypothesisMachines.Inthispaperwewillsummarizethisrecentworkandalsopresentnewresultsonquadraticandexponentiallossfunctionsusingbothsyntheticandreal-worlddata.

7870-05, Session 1

Signal filtering of daily cloud types trends as derived from satellites imagesJ.R.Dim,H.Murakami,JapanAerospaceExplorationAgency(Japan)

Therelationshipbetweentheintensityfunctionofneighboringpixelsofdailycloudsatellitethermalimagesisusedtoextractthehorizontalgradient.TheimagesusedarederivedfromtheNationalOceanicandAtmosphericAdministration/AdvancedVery-High-ResolutionRadiometer(NOAA-AVHRR)satellite.Thehorizontalgradientislocallyobtainedforeach3*3-pixelarea.Connectionsbetweencloudfacesasexpressedbythemagnitudeofthisgradientallowforthe


IS&T /

ReturntoContents

distinctionofvarioustexturalfeatureswhoseinterpretationleadstothediscriminationofcloudtypes.Thelong-termvariationofthesecloudtypesamountsisthenevaluatedforconsistencyandreliability.Thecloudtypeamountsignalisderivedfrommultiplesatelliteserieswhichareknowntohaveexperiencedorbitaldrift.Theeffectofthisdriftonthecloudamountsignalisremovedthroughafilteringprocessusingtheempiricalmodedecomposition(EMD).TheEMDcomponent,associatedwiththedriftisfiltered.Theresultsobtainedshowanimprovementofthesignalbyasubstantialpercentageaccordingtothegeographicalarea.

7870-06, Session 2

Analysing wear in carpets by detecting varying local binary patternsS.A.OrjuelaVargas,E.Vansteenkiste,F.Rooms,S.DeMeulemeester,R.DeKeyser,W.R.Philips,Univ.Gent(Belgium)

InthisapproachweproposeanovelmethodforgroupingLBPpatternsbasedindetectingthosethatchangealongthetransitionaldegreesofwear.Themethodconsistsinevaluatingforeachpatternthelinearrankcorrelationrelatedtothewearlabels.Forthis,thesamebinscorrespondingtotransitionalwearlabelsaresequentiallyplacedandtheSpearmanrankcorrelationforeachbiniscomputed.

Afterwards,aminimalamountofLBPvaryingpatternsisestablishedandacorrespondingSpearmanrankcorrelationthresholdisautomaticallycomputed.LBPpatternsthanarelessthanthethresholdreferredasnonvaryingpatternsaregroupedtogether.Thus,whencomputingthesymmetricKullback-Leibleritisassuredtoretainthemaximuminformationcorrespondingtochangesinappearanceforagiventhreshold.InthisapproachweproposeanovelmethodforgroupingLBPpatternsbasedindetectingthosethatchangealongthetransitionaldegreesofwear.Themethodconsistsinevaluatingforeachpatternthelinearrankcorrelationrelatedtothewearlabels.Forthis,thesamebinscorrespondingtothe8wearlabelsaresequentiallyplacedandtheSpearmanrankcorrelationforeachbiniscomputed.

7870-07, Session 2

Line and streak detection on polished and textured surfaces using line integralsM.S.Erkilinc,M.Jaber,E.Saber,RochesterInstituteofTechnology(UnitedStates)

Inthispaper,aframeworkfordetectinglinesinagivenpolishedortexturedsubstrateisproposed.Modulesforimagecapture,rectification,enhancement,andlinedetectionareincluded.Ifthesurfacebeingexaminedisspecular(mirror-like),theimagecapturewillberestricted,thatis,thecamerahastobefixedoff-axisinthezenithdirection.Amoduleforimagerectificationandprojectionisincludedtoovercomethislimitationinordertoyieldanorthographicimage.Moreover,amoduleforimageenhancementthatincludeshigh-boostandbilateralfiltershasbeenemployedtoimprovetheedgesharpnessanddecreasethespatialnoiseintheimage.Finally,aline-integraltechniquehasbeenappliedtofindtheconfidencevectorsthatrepresentthespatialpositionsofthelinesofinterest.TheFull-WidthHalf-Maxapproximationisappliedtodeterminethecorrespondinglinesinatargetimage.Experimentalresultsshowthatourtechniquehasaneffectiveperformanceonsyntheticandrealimages.Assessmentofprintqualityisthemainapplicationoftheproposedalgorithm;however,itcanbeusedtodetectlines/streakinprints,onsubstrateoranytypeofmediawherelinesarevisible.

7870-08, Session 2

Detecting photographic and computer-generated compositesV.Conotter,M.Broilo,L.Cordin,Univ.degliStudidiTrento(Italy)

Nowadays,sophisticatedcomputergraphicseditorsleadtoasignificantincreaseinthephotorealismofimages.Thus,computergenerated(CG)imagesresulttobeconvincingandhardtobedistinguishedfromrealonesatafirstglance.Here,weproposeanimageforensicstechniqueabletoautomaticallydetectlocalforgeries,i.e.,objectsgeneratedviacomputergraphicssoftwareinsertedinnaturalimages,andviceversa.Wedevelopanovelhybridclassifierbasedonwaveletbasedfeaturesandsophisticatedpatternnoisestatistics.Experimentalresultsshowtheeffectivenessoftheproposedapproach.

7870-09, Session 2

Spatio-temporal analysis and forward modelling of solar polar plumes in white lightA.Llebaria,O.Morillot,ObservatoireAstronomiquedeMarseille-Provence(France)

TheanalysisofthedataprovidedbyLASCO-C2coronagraphonboardtheSOHOspatialobservatoryrevealedthefractalcharacteristicsofmanyoutstandingstructuresofthesolarcorona,whichisthetinybutextendedenvelopeofplasmawrappingtheSun.Amultiscaleanalysisofrecentimagesequenceshasbroughtaclearerviewoftheevolutionandthelocalstructureofthesefeatureswhichresultsfromatwostepsprojectionprocessofthe2DelectronicdistributionovertheSunpolarcaps.Togetaninsightinthevolumedensitydistributionoverthesecapsandtheirevolutionwithintime,weusedtheforwardmodellingapproachbasedonthepresentknowledgeabouttheplasmadistribution,thephysicalprocessofdiffusionandtheprojectiongeometryonthefieldofview.Theanalysisprovidesuswiththemultifractalcharacterizationoftheobservedphenomena.Intheforwardmodellingprocessthegoalistoreconstructthetimesequenceof2DelectronicdistributionsslowlyevolvingovertheSunpolarcaps.Weuseddifferentmethodologies:theinverseFouriertransformof2D+1D(surfaceandtime)frequencymodelling,theevolvingmultiscalesynthesiswithGaussianwaveletsandtheconcealedMarkovapproach.LatelyaprocedurederivateoftheVossgenerationschemaoffBmfractalshasbeensuccessfullydeveloped.Thesedifferentmethodsarecomparedandtheirrelativeadvantagesanddrawbacksdiscussedaswellasthetoolsusedtocomparesyntheticimagestoobservedones.

7870-10, Session 2

Imaging using synchrotron radiation for forensic scienceF.M.Cervelli,S.Carrato,Univ.degliStudidiTrieste(Italy);A.Mattei,Rep.InvestigazioniScientifiche(Italy);M.Jerian,AmpedSRL-P.I.(Italy);L.Benevoli,L.Mancini,F.Zanini,L.Vaccari,A.Perucchi,G.Aquilanti,SincrotroneTriesteS.C.p.A.(Italy)

ForensicSciencehasexperiencedanincreasinginterestinallresearchactivities:allpathsthatallowtheinvestigatorstoobtainmoreinformationaboutthecrimescenedynamicsandabouttheculpritaretrackedtosolveseriouscrimeslikehomicidesandmajorcrimesrelatedtonationalsecurityliketerrorists’attack.

Theaimofthisresearchistoadoptamulti-techniqueapproach,basedonconventionalandSynchrotronRadiation(SR)techniques,tostudylatentfingerprints(i.e.fingerprintsnotvisibletothehumaneye)fromthemorphologicalandchemicalpointofview,offeringtoforensicscienceacomprehensivetooltobeexploitedforparticularlycomplexcriminalcases.

Hereweaddressfingerprintanalysis,performingastudyonlatentfingerprintvisualizationwithaSRsource.

Severalhumanfingerprintsweredepositedontwosubstratesofdifferentnature,ie.undopedsiliconwafersandpoly-ethylene-terephthalate(PET).Weadoptedtwodifferentdepositionmodalities,asetofcleanfingerprintsandasetoffingerprintscontaminatedbyadifferentmixtureofgunshotresidue,inordertotesttheanalysisand

Conference 7870: Image Processing: Algorithms and Systems IX


IS&T /

ReturntoContents

visualizationsystemintwocommonscenarios.

WecharacterizedthemorphologyaswellasthechemicalcompositionofthefingerprintsandoftheircontaminantsbyFouriertransforminfrared(FTIR)spectroscopy,X-rayfluorescenceandtomography,X-raydiffractionandscattering,andX-rayabsorptionfinestructureanalysis.Thesetechniquesweretestedasalternativeimagingtechniquestobeusedinordertobothpreservethecollecteditemsandtoallowfingerprintanalysisinthosecaseswerealltheotherclassicaltechniquesfail.

7870-11, Session 3

PSO-based methods for medical image registration and change assessment of pigmented skinS.T.Kacenjar,LockheedMartinCorp.(UnitedStates);M.Zook,FoxChaseCancerCtr.(UnitedStates);M.Balint,LockheedMartinCorp.(UnitedStates)

Inthefieldofdermatology,theclassificationofskinlesionsisessentialtothedetectionofmelanoma,amalignanttumorofmelanocytes.Theobservationofchangesinthecolor,shape,andsizeofmolesovertimeisespeciallyimportantandmayleadtotheearlydiscoveryofmalignantformsofskincancer.However,thisendeavoriscomplicatedbythefactthattheaveragepatienthasdozensofmoles,allofwhichrequireathoroughexaminationovertime.Inthispaperweproposeaprocessfortrackingchangesinskinlesionsbyconductingperiodic,regional,bodyscansofapatientandleveraginganalgorithmtoeliminatenoiseinducedbydifferingcameraangles,lighting,andgeneralchangesinskintone.Thisalgorithmconsistsof(1)acoarsealignmentoftime-sequencedimagery,(2)refinedalignmentoflocalskintopographiesthroughtheutilizationofaParticleSwarmOptimizer(PSO),and(3)theassessmentoflocalchangesinlesionpigmentationbetweentime-sequencedimagery.Onceoptimized,thedifferencesinimagerywillequipdermatologistswithabettertoolfordetectingpotentialcasesofmelanoma.

7870-12, Session 3

Image-based segmentation for characterization and quantitative analysis of the spinal cord injuries by using diffusion patternsM.Hannula,A.Olubamiji,TampereUniv.ofTechnology(Finland);I.Kunttu,NokiaResearchCtr.(Finland);P.Dastidar,TampereUniv.Hospital(Finland);J.Hyttinen,TampereUniv.ofTechnology(Finland)

Centralnervoussysteminjuriessuchasbraintraumasandspinalcordinjuriesaswellasneurodegenerativediseasesareamongthemostcommoncausesofdeathorseriousdisabilityinindustrializedcountries.Imagebaseddiagnosisandanalysismethodsforthisfieldareescalatinganddeveloping.Modernclinicalimaginggivesdatatodevelopmethodsfor3-Dmodellinganddesigningthecelltransplantationtherapyforhumans.Variousnewmagneticresonance(MR)imagingsequences,suchasdiffusionimaging,provideusinformationofthedamagedbrainstructureandtheneuronalconnections.Thesecanbeanalyzedtoform3Dmodelsofthegeometryandfurtherincludingfunctionalinformationoftheneuronsofthespecificbrainareatodevelopfunctionalmodels.ModelingoffersatoolwhichcanbeusedforthemodelingofbraintraumafromMR-imagesofthepatientsandthusinformationtotailorthepropertiesofthetransplantedcells.

ItisknownthatthewatermoleculesofthewhitematterofthebrainandspinalcordbehavesinanisotropicnatureforminganarrangedpatternwhichcanbeanalyzedbyusingMagneticresonancediffusiontensorimaging(DTI).TheDTIisanoveltechnologythatcanbeusedtostudythediffusionpatternofthewhitematterfibersofthespinalcord.Inthispaper,wepresentaframeworkfortheimagebasedanalysisof

thediffusionofwatermoleculesinsofttissues.Weareconcentratingonthevisualizationofthewatermoleculesinaperceptivemeaningfulmannerbyfibertracking,characterizationandquantitativeanalysisofthediffusionpattern.Weanalyzehowtherateofdiffusionchangesinthecaseswhenthereisaspinalcordinjurycomparedtothecasesinwhichthespinalcordisnormal.Afterthis,weuseasemi-automaticsegmentationmethodtodetecttheactuallocationoftheinjury.Thesegmentationmethodisbasedonthechangesindiffusionpatterns.

7870-13, Session 3

Descreening using segmentation-based adaptive filteringM.N.Ahmed,A.H.Eid,LexmarkInternational,Inc.(UnitedStates)

Inthispaper,wepresentanewsegmentation-baseddescreeningtechnique.Scannedimagesaresegmentedintotext,imagesandhalftoneclassesusingamultiresolutionclassificationofedgefeatures.Thesegmentationresultsguideanonlinear,adaptivefiltertofavorsharpeningorblurringofimagepixelsbelongingtodifferentclasses.

Ourexperimentalresultsshowtheabilityofthenon-linear,segmentationdrivenfilterofsuccessfullydescreeninghalftoneareaswhilesharpeningsmallsizetextcontents.

7870-14, Session 3

Novel parametric priors for the distribution of multivariate linear prediction errorsI.Qazi,O.Alata,Univ.dePoitiers(France);J.Burie,Univ.deLaRochelle(France);A.Moussa,AbdelmalekEssaadiUniv.(Morocco);C.Fernandez-Maloigne,Univ.dePoitiers(France)

Inthispaperwepresentnovelaprioriparametricmodelstoapproximatethedistributionofthetwodimensionalmultichannellinearpredictionerrors.Theseparametricapproximationsandsubsequentlythediscussedmodels,areusedtoimprovetheperformanceofthecolortexturesegmentationalgorithms.Twodimensionalcausalreal(inRGBcolorspace)andcomplex(inIHLSandL*a*b*colorspaces)multichannellinearpredictionmodelsareusedtocharacterizethespatialstructuresincolorimages.Classically,thedistributionofthemultivariatelinearpredictionerrorsofthesetexturemodelsareapproximatedwithamultivariateGaussianprobabilitydistribution.WeuseWishartdistributionandmultivariateGaussianmixturemodelstoapproximatethedistributionoftheseerrors.AnovelcolortexturesegmentationframeworkbasedonthesemodelsandPottsmodelforthespatialregularizationofinitialclasslabelfieldsispresented.Theframeworkalsotakesintoaccounttheregionsizecharacteristicsofthelabeledregionsduringthespatialregularizationprocess.Thispaperalsodiscussestheperformanceofthisframeworkintheusedcolorspacesi.e.RGB,IHLSandL*a*b*.Experimentalresultsshowabetterperformancebytheproposedmethodintermsofpercentagesegmentationerrorfortheusedcolortextures,ascomparedtotheclassicalapproach,inallthreecolorspaces.TheL*a*b*colorspaceshowmorestableresultsthantheRGBandIHLScolorspaces.

7870-16, Session 4

Secure annotation for medical images based on reversible watermarking in the Integer Fibonacci Haar transformF.Battisti,M.Carli,A.Neri,Univ.degliStudidiRomaTre(Italy)

Inthiscontribution,apossiblesolutiontosecuremedicalimageannotationispresented.Theproposedframeworkisbasedonthejointuseofakey-dependentwavelettransform,ofasecurecryptographicscheme,andofareversiblewatermarkingscheme.Thesystemallows:i)theinsertionofthepatientdataintotheencryptedimagewithout



IS&T /

ReturntoContents

requiringtheknowledgeoftheoriginalimage,ii)theencryptionofannotatedimageswithoutcausinglossintheembeddedinformation,andiii)duetothecompletereversibilityoftheprocess,itallowsrecoveringtheexactoriginalimageoncethemarkisremoved.

7870-17, Session 4

Multi-seam carving via seamletsD.D.Conger,MichiganStateUniv.(UnitedStates);M.Kumar,EastmanKodakCo.(UnitedStates);H.Radha,MichiganStateUniv.(UnitedStates)

Seamcarving[AvidanandShamir2007]isapowerfulretargetingalgorithmformappingimagestoarbitrarysizeswitharbitraryaspectratios.Meanwhile,theseamlettransform[Congeretal.2010]hasbeenrecentlyintroducedasanefficientrepresentationforseam-carving-basedretargetingoverheterogeneousmultimediadeviceswithabroadrangeofdisplaysizes.TheoriginalseamlettransformwasdevelopedusingHaarfilters,andhence,itenabledtraditionalseamcarvingbyremovingasingleseamatatimeinarecursivemanneruntilthedesiredimagesizeisreached.Inthispaper,wedevelopamoreefficientapproachforseamcarvingbyenablingmulti-seamcarving,whereateachstepoftheretargetingalgorithmmultipleseamsarecarvedsimultaneously.Weachievemulti-seamcarvingby(a)extendingtheseamlettransformusingmoregeneralwaveletsthantheHaarwavelets,and(b)employinglocalcircularconvolutioninthevicinityoftheselectedseams.Weshowthatpopularfilterbanks,suchastheonesthatarebasedonDaubechieswavelets,canachieveefficientmulti-seamcarvingwithequivalentvisualqualitywhencomparedtosingle-seamcarvingusingtheHaartransform.Furthermore,withmulti-seamcarving,thenumberofiterationsneededtoachieveagiventargetsizecanbereducedsignificantly.

7870-18, Session 4

A new DCT-based algorithm for numerical reconstruction of electronically recorded hologramsL.Bilevich,L.Yaroslavsky,TelAvivUniv.(Israel)

Anewuniversallowcomputationalcomplexityalgorithmfornumericalreconstructionofhologramsrecordedinneardiffractionzoneispresented.ThealgorithmimplementsdigitalconvolutioninDCTdomain,whichmakesitvirtuallyinsensitivetoboundaryeffects.Itcanbeusedforreconstructionofhologramsforarbitraryratiosofhologramsizetotheobject-to-hologramdistanceandwavelengthtocamerapitchandallowsimagereconstructioninarbitraryscale.

7870-20, Session 5

User discrimination in automotive systemsA.Makrushin,J.Dittmann,Otto-von-Guericke-Univ.Magdeburg(Germany);C.Vielhauer,FachhochschuleBrandenburg(Germany)andOtto-von-Guericke-Univ.Magdeburg(Germany);M.Leich,Otto-von-Guericke-Univ.Magdeburg(Germany)

Therecentlydevelopeddual-viewtouchscreens,whichareannouncedtobeinstalledincarsinanearfuture,giverisetocompletelynewchallengesinhuman-machineinteraction.Theautomotivesystemshouldbeabletoidentifyifthedriverorthepassengeriscurrentlyinteractingwiththetouchscreentoprovideacorrectresponsetothetouch.Theopticaldevices,duetoavailability,acceptancebytheusersandmultifunctionalusage,approvedtobethemostappropriatesensingtechnologyfordriver/passengerdiscrimination.Inthisworktheprototypicopticaluserdiscriminationsystemisimplementedinthecarsimulatorandevaluatedinthelaboratoryenvironmentwithentirelycontrolledillumination.Threetestsweredoneforthisresearch.Oneofthemexaminedifthenear-infraredilluminationshouldbeswitchedonaroundtheclock,thesecondoneifthereisadifferencein

discriminationperformancebetweenday,twilightandnightconditions,andthethirdoneexaminedhowtheintensivedirectionallightinginfluencestheperformanceoftheimplementeduserdiscriminationalgorithm.Despitethehigherrorrates,theevaluationresultsshowthatverysimplecomputervisionalgorithmsareabletosolvecomplicateduserdiscriminationtask.Theaverageerrorrateof10.42%(daytimewithnear-infraredillumination)isaverypromisingresultforopticalsystems.

7870-21, Session 5

Study of radar system imaging with distributed architectureL.Lei,J.Jiang,Ctr.forSpaceScienceandAppliedResearch(China)

Theconceptofdistributedradarhasbeenproposedforseveralyearsforitssignificantadvantageswhileitalsohassomeproblemssinceitsgeometrystructure.Tillnow,researcheshavedevelopedsomealgorithms,butthereisstillmuchworktodo.

Thepaperismainlyonthetheoryandmethodshowtoresolvetheproblems.Withsomenewviewsandthoughts,itproposesnewimagingtheoryandmethodsfordistributedradarsandcanproducehighresolutionimageswiththree-dimension,whicharebeingdevelopedwiththeideathatSpatio-temporalinformationcanbecombinedinprocessing.Thebasesoftheideaareelectromagneticwavepropagationtheoryandradarresolutiontheory.Thepaperprovidesuniformechomodelandambiguityfunctionandthengivesuniformimagingprocess,whichcanapplytogeneraldistributedsystemsandtargets.Oneimportantpointintheprocessistherepresentationofdifferentviewangleinphaseandtheninfrequencydifference.Thesecondimportantpointistherearefewapproximationsinthecalculation,andthethirdistheuseofinterferometrictheoryindistinguishingtheambiguitytargets.

Atlast,simulationworkispresentedfordistributedradarimaging.Andresultsaregiventotestthetheoryandmethodofthepaper.

7870-22, Session 6

Wiener crosses borders: interpolation based on second order modelsA.Guevara,R.Mester,JohannWolfgangGoethe-Univ.FrankfurtamMain(Germany)

Interpolationofsignals(arbitrarydimension,here:2Dimages)withmissingdatapointsisaddressedfromastatisticalpointofview.

WepresentageneralframeworkforwhichaWiener-styleMMSEestimatorcanbeseamlesslyadaptedtodealwithproblemssuchasimageinterpolation(inpainting),reconstructionfromsparsesamples,andimageextrapolation.

Theproposedmethodgivesapreciseanswerona)howarbitrarycanlinearfilterscanbeappliedtoinitiallyincompletesignalsandb)showsthedefinitewaytoextendimagesbeyondtheirsborderssuchthatnosizereductionoccursifalinearfilter/operatoristobeappliedtotheimage.

7870-23, Session 6

Image interpolation based on a multi-resolution directional mapE.VanReeth,STMicroelectronics(France);P.Bertolino,Gipsa-lab(France);M.M.Nicolas,STMicroelectronics(France)

Thispaperdescribesaninterpolationmethodthattakesintoaccounttheedgeorientationinordertoavoidtypicalinterpolationartifacts(jagging,staircaseeffects...).Itisfirstbasedonanedgeorientationestimation,performedinthewaveletdomain.Theestimationusesthe



IS&T /

ReturntoContents

multi-resolutionfeaturesofwaveletstogiveanaccurateandnon-biaseddescriptionofthefrequencylocationoftheedges,aswellastheirorientation.Theinterpolationisthenperformed,usingthelocalinformationgivenbythisdirectionalmap,toimproveareferenceinterpolation(cubic-splineforinstance).Theimprovementiscarriedoutbyfilteringtheedgeswithagaussiankernelalongtheirdirectioninordertosmooththecontourinthedirectionparalleltotheedge,whichavoidsdisturbingvariationsacrossthem(jaggingandstaircaseeffects).Thistechniquealsokeepsthesharpnessofthetransitioninthedirectionperpendiculartothecontourtoavoidblur.

Resultswillbepresentedonbothsyntheticandrealimages,showingthevisualimpactofthepresentedmethodonthequalityofinterpolatedimages.Comparisonswillbemadewiththeusualcubic-splineinterpolation,andwithotheredgedirectedinterpolationtechniquestodiscussthecompromisesthathavebemadeinourmethodcomparedtoothers.Fullreferenceswillalsobegivenonstateoftheartorientationestimationanddirectedinterpolationmethods.

7870-24, Session 6

Images reconstruction using modified exemplar based methodV.V.Voronin,South-RussianStateUniv.ofEconomicsandService(RussianFederation)andTampereUniv.ofTechnology(Finland);V.I.Marchuk,South-RussianStateUniv.ofEconomicsandService(RussianFederation);K.O.Egiazarian,TampereUniv.ofTechnology(Finland)

Thispaperdescribesanewimagereconstructionmethod.Theproposedapproachusesmodifiedexemplarbasedtechnique.Proposedmodificationallowstochoosesub-optimallyimage-adaptiveformandsizeoftheblockinordertofindsimilarpatches,numberofwhichisfurtherincreasedbyrotationoftheseblocks.Weshowthattheefficiencyofimagereconstructiondependsonthechoiceofblocksizefortheexemplarbasedmethod.Proposedadaptivityallowstoobtainasmallerreconstructionerrorthanthatofthetraditionalmethodaswellasotherstate-of-theartimageinpaintingmethods.Wedemonstratetheperformanceofanewapproachviaseveralexamples,showingtheeffectivenessofouralgorithminremovalofsmallandlargeobjectsonthetestimages.

7870-25, Session 7

A graph non-tree representation of the topology of a gray scale imageP.Saveliev,MarshallUniv.(UnitedStates)

Thepaperprovidesamethodofgraphrepresentationofgrayscaleimages.Forbinaryimages,itisgenerallyrecognizedthatnotonlyconnectedcomponentsmustbecaptured,butalsotheholes.Forgrayscaleimages,therearetwokindsof“connectedcomponents”-darkregionssurroundedbylighterareasandlightregionssurroundedbydarkerareas.Theseregionsarethelowerandupperlevelsetsofthegraylevelfunction,respectively.Theproposedmethodrepresentsthehierarchyofthesesets,andthetopologyoftheimage,bymeansofagraph.Thisgraphcontainsthewell-knowninclusiontrees,butitisnotatreeingeneral.Twostandardtopologicaltoolsareused.Thefirsttooliscelldecomposition:theimageisrepresentedasacombinationofpixelsaswellasedgesandvertices.Thesecondtooliscycles:boththeconnectedcomponentsandtheholesarecapturedbycircularsequencesofedges.

7870-26, Session 7

Colour processing in Runge spaceA.Restrepo,Univ.deLosAndes(Colombia)

Wedocolourimageprocessinginaspacethatisasintuitiveasthemostcommonspacesofthetypehue-saturation-luminanceyetit

avoidsthecaveatsresultingformanormalizationofthesaturationbytheluminance.Wepresentapplicationsinimagecorrectionincasesofhighdynamicrangeimagesandfadedphotographs;also,wepresentapplicationsthatenhancetheappealofnaturalimages.ThespaceiscalledRungespace;itissphericalandthecolourattributestherearehue,coloufulnessandlightness;theyarereadilyderivedfromRGBcomponents.

7870-27, Session 7

Robust image registration for multiple exposure high dynamic range image synthesisS.Yao,InstituteforInfocommResearch(Singapore)

Imageregistrationisanimportantpreprocessingtechniqueinhighdynamicrange(HDR)imagesynthesis.Thispaperproposedarobustimageregistrationmethodforaligningagroupoflowdynamicrangeimages(LDR)thatarecapturedwithdifferentexposuretimes.Illuminationchangeandphotometricdistortionbetweentwoimageswouldresultininaccurateregistration.WeproposetotransformintensityimagedataintophasecongruencytoeliminatetheeffectofthechangesinimagebrightnessandusephasecrosscorrelationintheFouriertransformdomaintoperformimageregistration.Consideringthepresenceofnon-overlappedregionsduetophotometricdistortion,evolutionaryprogrammingisappliedtosearchfortheaccuratetranslationparameterssothattheaccuracyofregistrationisabletobeachievedatahundredthofapixellevel.Theproposedalgorithmworkswellforunderandover-exposedimageregistration.IthasbeenappliedtoalignLDRimagesforsynthesizinghighqualityHDRimages.


Efficiency analysis of DCT-based filters for color image databaseV.V.Lukin,D.V.Fevralev,S.K.Abramov,N.N.Ponomarenko,NationalAerospaceUniv.(Ukraine);J.T.Astola,K.O.Egiazarian,TampereUniv.ofTechnology(Finland)

EfficiencyofDCTbasedimagefilteringisanalyzedusingacolorimagedatabaseTID2008thatcontainsimagescorruptedbyi.i.d.andspatiallycorrelatednoisewithfourvaluesofnoisevariance.ItisshownthatimprovementofPSNRduetofilteringisverycloseforR,G,andBcomponentsofcolorimagesandthisimprovementdependsuponimagecontent.ImprovementofPSNRreaches7dBforquitesimpleimages(thatcontainlargehomogeneousregions)anditisonly1.5dBforhighlytexturalimagesifinitialPSNR=30dB.ThevisualqualitymetricPSNR-HVS-Misstudiedaswell.Forspatiallycorrelatednoise,resultsofanalysisclearlyshowthatthresholdingshouldbefrequencydependent.ThisallowsincreasingPSNRandPSNR-HVS-Mbyabout2...3dBcomparedtothecaseoffixed(frequencyindependent)threshold.


Color image lossy compression based on blind evaluation and prediction of noise characteristicsV.V.Lukin,N.N.Ponomarenko,NationalAerospaceUniv.(Ukraine);K.O.Egiazarian,TampereUniv.ofTechnology(Finland);L.Lepisto,NokiaResearchCtr.(Finland)

Mostofimagesformedbydigitalcamerasarecompressedbeforestorageand/ortransferringwhereJPEGlossycompressionisastandardtoolapplied.Usuallytherearethreeorstandardsmodesoflossycompression.Weproposetwoapproachestoautomaticselectionoflossycompressionparametersthatareabletotakeintoaccountblindestimatesorpredictionofnoisecharacteristicsand



IS&T /

ReturntoContents

blurparametersaswellascontentofagivenimageandtocarryoutadaptiveJPEGcompressionwithoutintroducingvisuallynoticeabledistortions.Thedesignedapproachesallowincreasingcompressionratioby,ontheaverage,from2to2.6times.


Unsupervised automated panorama creation for realistic surveillance scenes through weighted mutual information registrationT.P.Keane,E.Saber,H.E.Rhody,A.E.Savakis,RochesterInstituteofTechnology(UnitedStates);J.Raj,LenelSystemsInternationalInc.(UnitedStates)

Automatedpanoramacreationusuallyrequirescameracalibrationorextensiveknowledgeofcameralocationsandrelationstoeachother.Registrationproblemsareoftensolvedbythesesamecameraparametersortheresultofcomplexpointmatchingschemes.Thispaperpresentsanovelautomatedpanoramacreationalgorithmbyusinganaffinetransformationsearchbasedonmaximizedmutualinformation(MMI).MMItechniquesareoftenlimitedtoairborneandsatelliteimageryormedicalimages,butwecanshowthatasimpleMMIalgorithmverywellapproximatesrealisticscenesofvaryingdepthdistortion.Thisstudywasperformedonstationarycolorsurveillancevideocamerasandprovesextremelyworthwhileinanysystemwithlimitedornoaprioricamera-to-cameraparameters.Thisalgorithmisquiterobustonaverylargerangeofstrict-tonearly-affinerelatedscenes,andprovidesagreatapproximationfortheoverlapregionsinscenesrelatedbyaprojectivehomography.Surprisinglysignificantpracticalconsiderationsultimatelyoutweighedtheoreticalderivationsinthedevelopmentofthisrobustandversatilealgorithm.


Ellipse detection using an improved randomized Hough transformationZ.Teng,J.Kim,D.Kang,PusanNationalUniv.(Korea,Republicof)

Thispaperproposesanellipsedetectionalgorithmbasedontheanalyticalsolutiontotheparametersofellipseinimages.Inthefirstinstance,edgedetectionisprocessed,fromwhichlinesegmentsareextracted.ThenthemethodoffindingthecentercoordinatesoftheellipseisdescribedbasedonthepropertyofellipsebyusingthreepointsvotingatasenseofRandomizedHoughTransformation(RHT).Finally,ananalyticalsolutionoftheotherthreeparametersoftheellipse(semi-majoraxislength,semi-minoraxislengthandtheanglebetweentheX-axisandthemajoraxisoftheellipse)aregivenviacoordinatetransformation.Basedonthissolution,weproposetheseparatedparametervotingschemeforellipsecenterandtheotherthreeparametersinsteadof5parametersvotingschemeofRHT.Theexperimentsshowthattheproposedalgorithmperformswellinvariousimages.


Detection of motion blur direction based on maxima locations for blind deconvolutionR.M.Chong,T.Tanaka,TokyoUniv.ofAgricultureandTechnology(Japan)

Theblursinimagescloselyresembleanidealpointspreadfunction(PSF)model.ThissimilaritycanbeexploitedinthedeconvolutionprocessbylearningamodelthatbestfitstheestimatedPSF.Inordertoachievethis,amodelisselectedfromtheprovidedtrainingsetandthenintegratedintothereconstructioncostfunction.Inthispaper,weproposetoeliminatetheneedforatrainingsetandinsteaduseareferencePSF(RPSF)initsplace.Thiseliminatestheneedfor

specifyingatrainingsetaswellasthedependenceonestimatedquantities.Furthermore,itisonlydependentonthegivendegradedimageassumingthatitisuniformlyblurred.Wetestedourmethodwithmotionblursindifferentdirectionssinceitisoneofthemostcommonlyencounteredproblemswhenusingconsumercameras.Usingtheblursupportasaprioriknowledge,theresultsshowthatthemethodiscapableofdeterminingthemotiondirectioneveninthepresenceofnoise.ThereconstructionoftheimageisachievedbyusingamodifiedcostfunctionthatalsoaccountsforthecontouroftheestimatedPSF.ResultsshowthathigherimagequalityandlowerPSFestimationerrorcanbeobtained.


EM algorithm-based hyperparameters estimator for Bayesian image denoising using BKF priorL.Boubchir,B.Durning,E.Petit,Univ.Paris12-ValdeMarne(France)

ThispaperisdevotedtoanovelhyperparameterestimatorforbayesiandenoisingofimagesusingtheBesselKFormpriorwhichwerecentlydeveloped[1,2].Moreprecisely,thisapproachisbasedontheEMalgorithm.Thesimulationresultsshowthatthisestimatoroffersgoodperformancesandisslightlybettercomparedtothecumulant-basedestimatorsuggestedin[1,2].AcomparativestudyiscarriedtoshowtheeffectivenessofourbayesiandenoiserbasedonEMalgorithmcomparedtootherdenoisersdevelopedinbothclassicalandbayesiancontexts.Ourstudyhasbeeneffectedonnaturalandmedicalimagesforgaussianandpoissonnoiseremoval.

REFERENCES

[1]J.M.FadiliandL.Boubchir,“Analyticalformforabaesianwaveletestimatorofimagesusingthebesselkformdensities”,IEEETransactionsonImageProcessing,vol.14,no.2,pp.231-240,2005.

[2]L.BoubchirandJ.M.Fadili,“BayesiandenoisingbasedontheMAPestimationinwavelet-domainusingbesselkformprior”,IEEEInternationalConferenceonImageProcessing,vol.I,pp.113-116,2005.


Semantic analysis of facial gestures from video using a Bayesian frameworkG.Vashi,R.L.Canosa,RochesterInstituteofTechnology(UnitedStates)

Thecontinuousgrowthofvideotechnologyhasresultedinincreasedresearchintothesemanticanalysisofvideo.Themultimodalitypropertyofthevideohasmadethistaskverycomplex.Theobjectiveofthisresearchistoresearch,implementandexaminetheunderlyingmethodsandconceptsofsemanticanalysisofvideosandalsotoshowhowtoimproveuponstateoftheartingesturerecognitionbyusingsemanticknowledge.Themaindomainofanalysisisfacialgesturerecognitionfromvideo,includingbothvisualandvocalaspectsoffacialgestures.ABayesiannetworkclassificationalgorithmhasbeenusedtoidentifyandunderstandfacialexpressionsinvideo.TheBayesiannetworkisanattractivechoicebecauseitprovidesaprobabilisticenvironmentandgivesinformationaboutuncertaintyfromknowledgeaboutthedomain.Thegoalofthisresearchistodetermineifanexpressiononaperson’sfaceishappy,sad,angry,fearfulordisgusted.Thisinformationwillenhancethesemanticunderstandingandinterpretationofvideodata.Currently,ithasnotbeenestablishedthattwomodalitiesarenecessaryforaccurateinterpretationoffacialexpressionsinvideo.Therefore,thisresearchisacontributiontothecurrentknowledgebytestingthehypothesisthatcombiningthetwomodalitiesofvisionandspeechyieldsbetterclassificationresultsthaneitherusedalone.



IS&T /

ReturntoContents


Color image enhancement algorithm based on logarithmic transform coefficient histogram shiftingJ.Xia,K.A.Panetta,TuftsUniv.(UnitedStates);S.S.Agaian,TheUniv.ofTexasatSanAntonio(UnitedStates)

Thegoalofimageenhancementtechniquesistoimprovethecharacteristicorvisualqualityofanimageforspecificcriteria.Theycanbeclassifiedasspatialdomainenhancementandtransformdomainenhancement.Spatialdomaintechniquesdealwiththerawimagedata,alteringtheintensityvaluesbasedonaspecificalgorithmforasetofcriteria.TransformdomainenhancementtechniquesinvolvetransformingtheimageintensitydataintoaspecificdomainbyusingsuchmethodsastheDiscreteCosine,Fourier,andWavelettransforms.Thesetransformsareusedtoalterthefrequencycontentofanimagetoimprovedesiredtraits,suchashighfrequencycontent.Combiningspatialandtransformtechniquescanproducepowerfulresultswhichcancompensateforweaknessesinindividualalgorithms.Acolorimageenhancementalgorithmbasedonlogarithmictransformhistogramshiftingisproposedinthispaper.Experimentalresultsshowthattheproposedalgorithmprovidesbettercontrastimagewithnearlynohaloartifactsandgoodcolorconsistency.Alsocomparedwiththeretinex,whichisaconventionalcolorimageenhancementalgorithm,itissimplebutmoreeffective.ThisalgorithmseparatesthecolordataintochromaticityandbrightnessandappliesLogenhancementonbrightnessimageonly.ThenalltheRed,GreenandBluecomponentsareseparatelyenhancedbylogarithmictransformhistogramshiftingmethod,whichisbasedonalteringthetransformcoefficienthistogramsthroughshiftingandmapping.Optimalparameterselectionbasedonimagequalitymeasurementisalsodiscussedinthisalgorithm.


Neighbourhood-consensus message passing and its potentials in image processing applicationsT.Ru zic,A.Pi zurica,W.R.Philips,Univ.Gent(Belgium)

Inthispaper,anovelalgorithmforinferenceinMarkovRandomFields(MRFs)ispresented.Itsgoalistofindapproximatemaximumaposterioriestimatesinasimplemannerbycombiningneighbourhoodinfluenceofiteratedconditionalmodes(ICM)andmessagepassingofloopybeliefpropagation(LBP).Wecalltheproposedmethodneighbourhood-consensusmessagepassingbecauseasinglejointmessageissentfromthespecifiedneighbourhoodtothecentralnode.Themessage,asafunctionofbeliefs,representstheagreementofallnodeswithintheneighbourhoodregardingthelabelsofthecentralnode.Thiswayweareabletoovercomethedisadvantagesofreferencealgorithms,ICMandLBP.Ononehand,moreinformationispropagatedincomparisonwithICM,whileontheotherhand,thehugeamountofpairwiseinteractionsisavoidedincomparisonwithLBPbyworkingwithneighbourhoods.Theideaisrelatedtothepreviouslydevelopediteratedconditionalexpectationsalgorithm.Herewerevisititandredefineitinamessagepassingframeworkinamoregeneralform.Theresultsonthreedifferentbenchmarksdemonstratethattheproposedtechniquecanperformwellbothforbinaryandmulti-labelMRFswithoutanylimitationsonthemodeldefinition.Furthermore,itmanifestsimprovedperformanceoverrelatedtechniqueseitherintermsofqualityand/orspeed.


Alternative method for Hamilton-Jacobi PDEs in image processingC.Vachier-Mammar,A.Lagoutte,H.Salat,EcoleNormaleSupérieuredeCachan(France)

Multiscalesignalanalysishasbeenusedsincetheearly1990sasapowerfultoolforimageprocessing,notablyinthelinearcase.However,nonlinearPDEsandassociatednonlinearoperatorshaveadvantagesoverlinearoperators,notablypreservingimportantfeaturessuchasedgesinimages.Inthispaper,wefocusonnonlinearHamilton-JacobiPDEsdefinedwithadaptivespeedsor,alternatively,onadaptivemorphologicalfitersalsocalledsemi-flatmorphologicaloperators.Semi-flatmorphologywereinstroducedbyH.Heijmansandstudiedonlyinthecasewherethespeed(orequivalentlythefilteringparameter)isadecreasingfunctionoftheluminance.ItisproposedtoextendthedefinitionsuggestedbyH.Heijmansinthecaseofnondecreasingspeeds.Wealsoprovethatacentralpropertyfordefiningmorphologicalfilters,thatistheadjunctionproperty,ispreservedwhiledealingwithourextendeddefinitions.Finallyexperimentalapplicationsarepresentedonactualimages,includingconnectionofthinlinesbysemi-flatdilationsandimagefilteringbysemi-flatopenings.


A novel dimming algorithm using local boosting algorithm for LED backlight system in LCD TVsJ.Lee,LED-ITFusionTechnologyResearchCtr.(Korea,Republicof)

Inthispaper,anoveldimmingalgorithmusingthelocalboostingalgorithmforLEDbacklightsysteminLCDTVsisproposed.Theproposeddimmingalgorithmconsistsoftwonewalgorithms:imageclassificationandthelocalboostmethod.Theproposedalgorithmhashighercontrastratioandlowerpowerconsumptionthantheconventionalmethods


Spatially adaptive alpha-rooting in BM3D sharpeningM.Mäkitalo,A.Foi,TampereUniv.ofTechnology(Finland)

Theblock-matchingand3-Dfiltering(BM3D)algorithmiscurrentlyoneofthemostpowerfulandeffectiveimagedenoisingprocedures.Itexploitsaspecificnonlocalimagemodelingthroughgroupingandcollaborativefiltering.Groupingfindsmutuallysimilar2-Dimageblocksandstacksthemtogetherin3-Darrays.Collaborativefilteringproducesindividualestimatesofallgroupedblocksbyfilteringthemjointly,throughtransform-domainshrinkageofthe3-Darrays(groups).

BM3Dcanbecombinedwithtransform-domainalpha-rootinginordertosimultaneouslysharpenanddenoisetheimage.Specifically,thethresholded3-Dtransform-domaincoefficientsaremodifiedbytakingthealpha-rootoftheirmagnitudeforsomealpha>1,thusamplifyingthedifferencesbothwithinandbetweenthegroupedblocks.Whileonecanuseaconstant(global)alphathroughouttheentireimage,furtherperformancemaybepotentiallyachievedbyallowingdifferentdegreesofsharpeningindifferentpartsoftheimage,basedonsomecontent-dependentinformation.

Weproposetoadjustthevalueofalphausedforsharpeningagroupthroughweightedestimatesoftheedgeandtexturestrengthsoftheaverageblockinthegroup.Thisisshowntobeaviableapproachforimagesharpening,andinparticularitprovidesanimprovementoveritsglobalnon-adaptivealpha-rootingcounterpart.



IS&T /

ReturntoContents


Extracting global salient open curves from cluttered backgrounds via Markov random fieldsN.Durak,O.Nasraoui,Univ.ofLouisville(UnitedStates)

Longportionsofanellipsecanbemoreeasilydistinguishedbythehumaneyethanbyacomputerinclutteredimages.Weproposeamethodthatautomatestheprocessofextractingprincipalloopshapesfromclutteredimages.First,curvesegmentsarealgorithmicallytracedandpropertiesofcurvesegmentsarecomputed.Labelsareassignedtoeachcurvesegment.Startingfromthemostsalientcurvesegments,theirneighborhoodwassearchedforpossiblepairwisegrouping.Candidategroupsarecheckedwithrespecttoellipsefiterror,smoothness,cornerpoints,andanglesimilarity.Weupdatethelabelsofthecurvesegmentsaftereachiterationtillweobtainoptimumresults.Wetestedoursystemonsyntheticandrealimagestoshowtheeffectiveness.


Joint distributed source-channel coding for 3D videosV.Palma,M.Cancellaro,A.Neri,Univ.degliStudidiRomaTre(Italy)

Thispaperpresentsadistributedjointsource-channel3Dvideocodingsystem.

Ouraimisthedesignofanefficientcodingschemeforstereoscopicvideocommunicationovernoisychannelsthatpreservestheperceivedvisualqualitywhileguaranteeingalowcomputationalcomplexity.

Thedrawbackinusingstereosequencesistheincreasedamountofdatatobetransmitted.Severalmethodsarebeingusedintheliteratureforencodingstereoscopicvideo.AsignificantlydifferentapproachrespecttotraditionalvideocodinghasbeenrepresentedbyDistributedVideoCoding(DVC),whichintroducesaflexiblearchitecturewiththedesignoflowcomplexvideoencoders.

DVCstatesthatitistheoreticallypossibletoseparatelyencodeandjointdecodetwoormorestatisticallydependentsourcesatthesamerateobtainedwhenthesamesourcesarejointencodedanddecoded.Thisapproachconsiderablyreducestheoverallamountoftransmissionnecessaryfromthecamerastothecentraldecoderandsimplifiesthecomplexityofthevideoencoderbyshiftingallthecomplexvideoprocessingtaskstothedecoder.Power/processinglimitedsystemssuchaswirelesscamerasensorsthathavetocompressandsendvideotoafixedbasestationinapower-efficientwaycantakeadvantageofthisproperty.

Theoreticallysourceandchannelencodingisbasedontandemoftwoseparateencodingsystems.Inthiscontribution,wepresentthedesignofDVC-basedJointSource-Channel3DvideoCodingschemefornoisychannel.Weadoptasinglesource-channelencoderforbothcompressionandprotectionresultinginadistributed3Dvideocodingscheme.

Inthiscontribution,themathematicalframeworkwillbefullydetailedandtradeoffamongredundancyandperceivedqualityandqualityofexperiencewillbeanalyzedwiththeaidofnumericalexperiments


Simulating images captured by superposition lens camerasA.S.Thangarajan,R.Kakarala,NanyangTechnologicalUniv.(Singapore)

Asthedemandforreductioninthicknessofthelensesincamerarises,theneedtolookforbettersolutionsbecomesanecessity.Onesuchradicalapproachtowarddevelopingathinlenswasobtained

fromnature’ssuperpositionprincipleusedintheeyesofmanyinsects.Butgenerallytheimagesobtainedfromtheselensesarefuzzy,andrequirereconstructionalgorithmstocompletetheimagingprocess.Theexistingliteraturedoesnotproviderealistictestimagesforsuchalgorithms,andcommercialray-tracingsoftwarerequiredtoproducesuchimagesiscostly.Asolutionforthisproblemispresentedinthispaper.HereaGaborSuperLens(GSL)whichisbasedonsuperpositionprincipleisselectedandthecompletelensstructureissimulatedandistestedwithatestimageusingthepublic-domainray-tracingsoftwarePOV-ray.TheimageobtainedisasviewedthroughanactualGSL,andcanbeusedtotestalgorithmstoreconstructthoseblurryimages.Thelargecomputationaltimeinrenderingsuchimagesrequiresfurtheroptimization,andmethodstodosoarediscussed.


Features extraction based on Fisher’s informationL.Costantini,P.Sità,M.Carli,A.Neri,Univ.degliStudidiRomaTre(Italy)

InthispaperwepresentanoveltechniquefordetectingtheflatbackgroundonimagesbasedonFisher’sinformation.Ourgoalistoimprovetheperformancesofacontentbasedimageretrieval(CBIR)system.ManyCBIRsystemsarebasedonthelowlevelfeatureextracted,suchastexture,colour,andedges,onthewholeimages.TheperformancesoftheCBIRsystemscanbeimprovedifthelowlevelfeaturesareextractedonimageareascontainingrelevantinformation.Forselectingthoseregions,thelocalFisher’sinformationiscomputedandtheregionscharacterizedbyalowinformationlevelareremovedfromtheimage.IntheproposedCBIRschemewefirstevaluatethelocalFisher’sinformationandthenwecharacterizetheimagebyusingthelowlevelfeatures.TheevaluationoftheFisher’sinformationisbasedontheZernikepolynomials.Toselectonlytheconnectedperipheralareasoftheimagearegiongrowingprocedureisperformed.

Experimentalresultsshowthattheproposedalgorithmimprovesboththeretrievalrateandtheperformanceoftheimageclusteringalgorithm.


An improved RANSAC algorithm using within-class scatter matrix for fast image stitchingL.Zhang,Z.Liu,J.Jiao,GraduateUniv.oftheChineseAcademyofSciences(China)

Inthispaper,weproposedanimprovedRANSACalgorithmusingwithin-classscattermatrixforfastimagestitching.ThealgorithmlocalizesanddescribesthefeatureswithSIFT(shortforscale-invariantfeaturetransform)firstly,ensuringtheaccuracy.ThenthefeaturesarematchedusingMin-costK-flowalgorithm.NextweapplytheimprovedRANSAC(shortforrandomsampleconsensus)algorithmwiththewithin-classscattermatrixtoregistertheneighboringimages.Thewithin-classscattermatrixisusedtodescribethescatteringofthesampletomeasuretherandomsamplegeneratedbyRANSACalgorithm.TheimprovedalgorithmweproposedcanaccelerateRANSACalgorithmeffectivelywhileprovingtheaccuracyandrobustness.Finally,imageblendingcanbedonebymulti-bandblending.

WecomparedimagestitchingalgorithmusingimprovedRANSACalgorithmandusingoriginalRANSACalgorithmon20smallimagepairs(320*240)and8biggerimagepairs(800*600)onaccuracyandspeed.TheimagesareselectedrandomlyfromICCV2005ComputerVisionContestandourimagetests.AftermatchingthefeatureswithMin-costK-flowalgorithm,boththetwoalgorithmarerepeated40timesforeachpairsofimagestogetstatisticallyrepresentativeresults.Experimentresultsdemonstratethatouralgorithmismoreeffective.ItcangettheresultofthesamequalityasoriginalRANSACalgorithmwhileacceleratingspeedbyabout20%.



IS&T /

ReturntoContents


Edge-directed image zooming based on radial basis function interpolationY.J.Lee,KAIST(Korea,Republicof);J.Yoon,EwhaWomansUniv.(Korea,Republicof)

Imageinterpolationisaprimetechniqueinmanyapplication.

Inthisstudy,weproposeanedge-directednon-linearinterpolationalgorithmforimagezoomingbasedonmovingleastsquaresmethod.

Thebasicideaisfirsttouseaninitialestimateofpixelinformationattheresamplingposition.

Specifically,thisinitialestimateinvolvesmeasuringtheorientationofthelocalgradientsintheimage.

Next,thecovarianceestimatesisusedtocorrecttheorientationoftheedgedirection.Finally,thisorientationinformationisthenusedtoadaptivelysteerthelocalkernelfunction,notaccessingedge,whichresultsinimprovingqualityoftheinterpolatedimagesoverconventionallinearinterpolation.


Enhanced bleed through removal for scanned document imagesA.Sharma,Hewlett-PackardLabs.India(India);S.Mahaldar,ShellInc.(India);S.Banerjee,Hewlett-PackardLabs.India(India)

Back-to-frontinterferenceisacommonproblemindocuments,printedontranslucentpageswithinsufficientopacityandisreferredtoasbleedthrough.Thepresentstate-of-artalgorithmsaddressbleedthroughbasedonentropy,entropiccorrelationanddiscriminatoranalysis.However,acommondrawbackofsuchalgorithmsistheirinefficientprocessingofdocumentsthatareeithersparseintermsofcontentorhaveaverydarkbackground.Ourproposedalgorithm,basedonOtsu’sbinarizationmethodandpixellevelclassificationaddressestheseproblems.Experimentsindicatethatouralgorithmperformscomparabletostate-of-the-artformostoftheimagesandbetterthanstate-of-the-artforthelowcontrastimages.


Classification of texture features in pathological prostate imagesA.Almuntashri,S.S.Agaian,TheUniv.ofTexasatSanAntonio(UnitedStates)

Inthispaper,weproposeaclassificationmethodforprostatepathologicalimagesbasedontexturefeatures.TheclassificationisbasedonGleasonmethodforhistologicalgradingofmalignancyofcanceroustissues.Theproposedalgorithmhasasuperiorperformanceinvisualizingandclassifyingfinetissuesdetailsforanautomaticdetectionandclassificationofcancerinprostatebiopsyimages.


Image segmentation refinement by modeling in turning function spaceC.F.S.Volotao,InstitutoNacionaldePesquisasEspaciais(Brazil)andInstitutoMilitardeEngenharia(Brazil)andDiretoriadeServiçoGeográficodoExército(Brazil);R.D.C.Santos,G.J.Erthal,L.V.Dutra,InstitutoNacionaldePesquisasEspaciais(Brazil)

Thisworkproposesanewapproachfortheuseofturningfunctionspacetochangeshapesinaccordancewithshapedescriptionsandconsistentwithspectralinformation.Themainstepsare:(1)

segmentation;(2)contourextraction;(3)turningfunctionspacetransform;(4)classification;(5)shapeanalysis;and(6)blobenhancementonimagespace.Intheanalysisofshapetheboundaryismodifiedbasedonbothimageandmodelandconstraintsareimposedtoportionsoftheturningfunction.Shapemodelingcanbedonebydefiningcriteriasuchaslinearity,anglesandsizes.Resultsonsyntheticexamplesarepresented.


Integrating empirical mode decomposition and nonlinear diffusion method for noise reduction in underwater sonar imagesS.Bakhtiari,S.S.Agaian,M.Jamshidi,TheUniv.ofTexasatSanAntonio(UnitedStates)

Sonarimagesaresusceptibletobeaffectedbynonlinearnoisewhichmakesthedetectionorrecognitionprocessmorecomplicated.Thetraditionalnoiseremovaltechniqueswhichinvolvelinearstationarynoisearenotsuitableforsuchimages.EmpiricalModeDecomposition(EMD)hasprovedtobeapowerfultechniqueforanalyzingnon-linearandnon-stationarysignals.ThismethodisfullydatadriventhatdecomposesthesignalintosomeoscillatorycomponentscalledIntrinsicModeFunctions(IMFs)bysiftingprocess.Inthispaper,anewEMDbasedapproachisproposedtoreducethenoiseofunderwaterSideScanSonar(SSS)images.CombinationofEMDandNonlinearDiffusiontechniquehasshowntobeconsiderablyeffectiveforeliminatingthistypeofnoise.Theimagesarede-noisedbyfilteringeachIMFcomponentsbyNonlinearDiffusionmethodandrecombiningtheprocessedIMFs.


Extending JPEG-LS for low-complexity scalable video codingA.Ukhanova,TechnicalUniv.ofDenmark(Denmark);A.Sergeev,St.PetersburgStateUniv.ofAerospaceInstrumentation(RussianFederation);S.Forchhammer,TechnicalUniv.ofDenmark(Denmark)

JPEG-LS,thewell-knowninternationalstandardforlosslessandnear-losslessimagecompression,wasoriginallydesignedfornon-wirelessapplications.InthispaperweproposescalablemodificationofJPEG-LSandcompareitwiththeleadingvideocodingstandardsJPEG2000andH.264/SVCforapplicationtohigh-rateandlowcomplexitywirelessvideocodingandtransmission.



IS&T /

ReturntoContents

Conference 7871: Real-Time Image and Video Processing 2011Monday-Tuesday24-25January2011PartofProceedingsofSPIEVol.7871Real-TimeImageandVideoProcessing2011

7871-01, Session 1

Towards real-time image quality assessmentB.Geary,C.Grecos,Univ.oftheWestofScotland(UnitedKingdom)

Weintroduceareal-timeimplementationandevaluationofanew,fast,accurate,structurallybasedimagequalitymetric.Structuralapproachestoimagequalitymeasurementarepredicatedonthenotionthathumansperceiveimagequalityasafunctionoftheintegrityoflocalstructureinanimageafterithasbeensubjectedtodegradation.

InthispaperweoutlinethesalientfeaturesofthederivationoftheRotatedGaussianDiscriminationMetric(RGDM)andshowhowanalysesoflocalstatisticsofdistortiontypenecessitatevariationindiscriminationfunctionwidth.ResultsobtainedontheLIVEimagedatabaseshowtightbandingofRGDMmetricvaluewhenplottedagainstmeanopinionscoreindicatingtheusefulnessofthismetric.

Weexploreanumberofstrategiesforalgorithmicspeed-upofRGDMincludingtheapplicationofIntegralImagesforpatchbasedcomputationoptimisation,costreductionfortheevaluationofthediscriminationfunctionandgeneralloopunrolling.WealsoemployfastSIMDintrinsicsandexploremulti-coredataparalleldecompositiononamulti-coreIntelProcessor.Ourresultsshowinexcessofanorderofmagnitudespeed-upovertheun-optimisedRGDMalgorithm(dependingonthenumberofcoresemployed,thedatasetevaluatedandtheextentofpre-processing)measuredintermsofnumberofprocessorclockcyclesobtainedusingtheIntelV-Tuneprofilingtool.ItisanticipatedthatthisfastImageQualityAssessment(IQA)techniquewillbeemployedinbiterrorrateoptimisationexperimentsinthenearfuture.

7871-02, Session 1

2000 fps real-time target tracking vision system based on color histogramI.Ishii,T.Tatebe,Q.Gu,T.Takaki,HiroshimaUniv.(Japan)

Inthisstudy,wedevelopahigh-speedcolor-histogram-basedtargettrackingsystemthatcanbeappliedto512x512pixelimagesat2000fpsusingthehardwareimplementationofanimprovedCAM-SHIFTmethodonahigh-speedvisionplatform.IntheimprovedCAM-SHIFTmethod,thesize,position,andorientationofanobjecttobetrackedcanbeextractedusingonlythehardwareimplementationofhueconversionandthemomentfeaturecalculationof16binaryimagesquantizedbycolorbinsaccordingtothehuehistogrambasedontheadditivityinmomentfeaturecalculation.Byinstallingourtargettrackingsystemonatwo-axisactivevisionplatform,wepresentseveralmechanicaltargettrackingresultsforhigh-speedmovingobjects:(1)acolorpatternrotatingat15rpsand(2)ahumanhandmovingrapidlyat4Hzinaroom.Intheexperiments,thepanandtiltmotorsontheactivevisionplatformarecontrolledthroughfeedbackat2000fpstocorrespondtothecalculatedimagecentroidbyusingtheextractedmomentfeaturesforthecenterofthecameraview.Theseresultsindicatethatourcolor-histogram-basedtargettrackingsystemcanrobustlytrackhigh-speedmovingobjectsevenunderactualcomplexscenesforavisionsystemhavingaframerateofupto2000fps.

7871-03, Session 1

Real-time iris tracking with a smart cameraM.Mehrübeoglu,H.T.Bui,TexasA&MUniv.CorpusChristi(UnitedStates);L.McLauchlan,TexasA&MUniv.-Kingsville(UnitedStates)

Thispaperpresentsareal-timeirisdetectionprocedureforgrayintensityimages.Typicalapplicationsforirisdetectionutilizetemplateandfeaturebasedmethods.Thesemethodsaregenerallytimeandmemoryintensiveandnotapplicableforallpracticalreal-timeembeddedrealizationswithlimitedsystemresourcesthatrequirehighspeedinspectionrates.Inthisarticle,weproposeamethodthatutilizesasimplealgorithmthatistime-efficientwithhighdetectionandlowerrorrates.

Thereal-timeimageacquisitionsystemforthisresearchinvolvesa17xxseriesSmartCamera(NI)withLabVIEWReal-TimeModuleusedforautomatedapplications.First,theimagesareanalyzedtodeterminetheregionofinterest(face)beforedetectingtheeye.Utilizingaconvolution-basedalgorithmontheedgeimageandusingHoughTransform,theirisoftheeyeisthendetermined.Thisedgebasedmethodisefficient,sincethealgorithmislesscomplexandlesscomputationallyexpensivethanifthefullimagewastobeanalyzed.Inthisapproach,thefirstimageisusedtocomputethelocationinformationoftheiris.Theinitialcomputationinthefirstframeisthemosttimeconsumingaspectoftheprocedure.Theacquiredirislocationinformationisthenstoredinthecamera’simagebuffer,andusedtomodelonespecificeyepattern.Thelocationoftheiristhusdeterminedisthenusedasareferencetoreducethesearchregionusedtodetecttheirisinthesubsequentimageframeswithhighaccuracyandfast,appropriateforreal-timeimplementations.

Theirisdetectionalgorithmhasbeenappliedatdifferentframerates.Theresultsdemonstratethespeedofthisalgorithmallowsthetrackingoftheiriswhentheeyesorthesubjectismovinginfrontofthecameraatreasonablespeedsandwithlimitedocclusions.Theresultsofthisprojecthasapplicationsingazetrackinginautomotive(driverwarning),medical(patientmonitoring;instrumentcontrol),computer(consumerbehaviortesting),orgaming(gamecontrol)industriesandforsurveillance.

7871-05, Session 1

Optimization of image processing algorithms on mobile platformsM.V.Shirvaikar,P.Poudel,TheUniv.ofTexasatTyler(UnitedStates)

ThispaperpresentsatechniquetooptimizepopularimageprocessingalgorithmsonmobileplatformssuchascellphonesandPDAs.ThetargetplatformchosenforthedevelopmentwastheOMAP3530processorwhichiswidelyusedinembeddedmediasystems.Thebasicimagecorrelationalgorithmischosenasitfindswidespreadapplicationsforvarioustemplatematchingtaskssuchasface-recognitionandcontext-awarecomputing.Asthecorrelationalgorithmiscomputationallycomplex,itisnecessarytooptimizeperformancetomeetreal-timedeadlines,especiallyundermobilescenarios.ThebasicalgorithmprototypesconformtoOpenCV,apopularcomputervisionlibrarydevelopedbyIntelCorporation.Amethodologytotakeadvantageoftheasymmetricdual-coreprocessor,whichincludesanARMandaDSPcoresupportedbysharedmemory,ispresentedwithimplementationdetails.DSPLib,ahighlyoptimizedlibraryprovidedbyTIisusedtoperformbasicdigitalsignalprocessingtasks.TheCodec-EngineframeworkprovidedbyTexasInstruments(TI)isusedforInterProcessorCommunication(IPC)andRemoteProcedureCall(RPC)functionality.Theperformanceresultspresentedmeasurethealgorithmspeedupobtainedduetodual-coreimplementation.Theproceduresestablishedcanbeappliedtootheralgorithmsthatarepartofanygeneralpurposeimaginglibrary.


IS&T /

ReturntoContents

7871-06, Session 2

Scalable software architecture for on-line multicamera video processingM.Camplani,L.Salgado,Univ.PolitécnicadeMadrid(Spain)

Multi-camerasystemsdevelopmentisaveryactiveresearchareaduetotheincreasingdemandofefficientsystemsforseveralapplicationdomains.Systemsbasedonsmartcamerasdevicesarewellsuitedforrealtimeimageprocessingthankstodedicatedhardware.Moreover,theyarewidelyusedinlargecamerasystems.However,theypresentsomedrawbacks:dedicatedhardwarepresentslackofflexibilityandthedesignofcooperativetasksisnotstraightforward.

Theproposedarchitectureguaranteesagoodtrade-offbetweencomputationalpowerandflexibility.ThesystemiscomposedbyanetworkofProcessingUnits(PUs).EachPUmanagesseveralcameras.DataAcquisitionandprocessingtasksarecompletelydecoupled.Theincomingimagesarecopiedinasharedmemory.Eachprocessingtaskisimplementedinapipelinefashionwhereeachstageisexecutedbyadifferentthreadinordertotakeadvantageofthemulti-corearchitectureofthePU.

WepresentasystemcomposedbyonePUconnectedwiththreecameras.Thecamerasaresynchronizedwithanexternaltriggeringsystem.Inthecontextofreal-timetrackingwehaveimplementedabackgroundestimationalgorithm.Systemperformancehasbeenevaluatedunderdifferentloadconditionssuchasnumberofcameras,imagesizeandframerate.

7871-07, Session 2

Real-time implementation of logo detection on open source BeagleBoardM.K.George,N.Kehtarnavaz,TheUniv.ofTexasatDallas(UnitedStates);L.W.Estevez,TexasInstrumentsInc.(UnitedStates)

Thispaperpresentsafollow-uptoourpreviousworkonlogodetectionandtrackingalgorithmwhichtargetsmobilephoneuserstoobtaininformationordiscountsassociatedwithlogos.ThealgorithminvolvesahybridapproachbyusingacombinationofSIFT,onlinecolorcalibrationandmomentinvariantsinavideostreamoperationmode.AfterSIFTlogodetection,onlinecolorcalibrationusingk-meansclusteringintheCr-Cbcolorspaceisperformedtoextracttheprominentlogocolor.Thisinformationisusedtotrackthelogoinsubsequentframesofthevideostream.Momentinvariantsofallregionsarecalculatedandtheregionwhichhastheclosestmatchisselected.Thispaperaddressesthereal-timeportingorimplementationoftheabovestepsonBeagleBoard.TheBeagleBoardisanopensource,lowcost,fan-lessOMAPdeviceavailablefromTexasInstruments.TheOMAPprocessorhasanARMCortexGPP,aC64x+TIDSPandaSGXGPU.Theobjectivehereistoleveragetheseenginestowardachievingareal-timethroughputofthealgorithm.ThemainfocusistooffloadcertainoperationsandnativeOpenCVfunctionsontotheDSPinordertoreachareal-timesolution.

7871-08, Session 2

Image orientation detection for real-time implementation on embedded devicesV.V.Appia,GeorgiaInstituteofTechnology(UnitedStates);R.Narasimha,TexasInstrumentsInc.(UnitedStates)

Inthispaperwedescribealowcomplexityimageorientationdetectionalgorithmwhichcanbeimplementedinreal-timeonembeddeddevicessuchaslow-costdigitalcameras,mobilephonecamerasandvideosurveillancecameras.Providingorientationinformationtotamperdetectionalgorithminsurveillancecameras,colorenhancementalgorithmandvarioussceneclassifierscanhelpimprovetheirperformance.Variousimageorientationdetectionalgorithmshavebeen

developedinthelastfewyearsforimagemanagementsystems,asapostprocessingtool.But,thesetechniquesusecertainhigh-levelfeaturesandobjectclassificationtodetecttheorientation,thustheyarenotsuitableforimplementationonacapturingdeviceinreal-time.Ouralgorithmuseslow-levelfeaturessuchastexture,linesandsourceofilluminationtodetectorientation.Weimplementedthealgorithmonamobilephonecameradevicewitha180Mhz,ARM926processor.Theorientationdetectiontakes~10msforeachframewhichmakesitsuitabletouseinimagecaptureaswellasvideomode.Itcanbeusedefficientlyinparallelwiththeotherprocessesintheimagingpipelineofthedevice.Onhardwarethealgorithmachievedanaccuracyof~88%withafalsedetectionrateof4%onoutdoorimages.

7871-09, Session 2

Real-time topological image smoothing on shared memory parallel machinesR.Mahmoudi,M.Akil,EcoleSupérieured’IngénieursenElectroniqueetElectrotechnique(France)

Smoothingfilteristhemethodofchoiceforimagepreprocessingandpatternrecognition.Wepresentanewconcurrentmethodforsmoothing2Dobjectinbinarycase.Proposedmethodprovidesaparallelcomputationwhilepreservingthetopologybyusinghomotopictransformations.Weintroduceanadaptedparallelizationstrategycalledsplit,distributeandmerge(SDM)strategywhichallowsefficientparallelizationofalargeclassoftopologicaloperatorsincluding,mainly,smoothing,skeletonization,andwatershedalgorithms.Toachieveagoodspeedup,wecaredabouttaskscheduling.Distributedworkduringsmoothingprocessisdonebyavariablenumberofthreads.Testson2Dbinaryimage(512*512),usingsharedmemoryparallelmachine(SMPM)with8CPUcores(2×XeonE5405runningatfrequencyof2GHz),showedanenhancementof5.2.

7871-10, Session 2

Multithreaded real-time 3D image processing software architecture and implementationV.Ramachandra,K.Atanassov,M.Aleksic,S.R.Goma,QualcommInc.(UnitedStates)

Arealtime3DplayerwasimplementedontheGPUusingCUDAandOpenGL.Theplayerprovidesuserinteractive3Dvideoplayback.Stereoimagesarefirstreadbytheplayerfromafastdriveandthenrectified.Furtherprocessingoftheimagesdeterminestheoptimalconvergencepointinthe3Dscenetoreduceeyestrain.Therationaleforthisconvergencepointselectiontakesintoaccountscenedepthanddisplaygeometry.Thefirststepinthisprocessingchainisidentifyingkeypointsbydetectingverticaledgeswithintheleftimage.Regionssurroundingreliablekeypointsarethenlocatedontherightimagethroughtheuseofblockmatching.Thedifferenceinpositionbetweencorrespondingregionsarethenusedtocalculatedisparity.Theextremaofadisparityhistogramgivesthescenedisparityrange.Theleftandrightimagesareshiftedbaseduponthecalculatedrange.

AlltheabovecomputationswereperformedononeCPUthreadwhichcallsCUDAfunctions.Imageupsamplingandshiftingisperformedinresponsetouserzoomandpan.

TheplayeralsoconsistsofaCPUdisplaythread,whichusesOpenGLrendering(quadbuffers).Thisalsogathersuserinputfordigitalzoomandpanandsendsthemtotheprocessingthread.

Conference 7871: Real-Time Image and Video Processing 2011


IS&T /

ReturntoContents

7871-11, Session 3

Real-time video streaming using H 264 scalable video coding (SVC) in multihomed mobile networks: a testbed approachJ.M.Nightingale,Q.Wang,C.Grecos,Univ.oftheWestofScotland(UnitedKingdom)

Usersofthenextgenerationwirelessparadigmknownasmultihomedmobilenetworksexpectsatisfactoryqualityofservice(QoS)whenaccessingstreamedmultimediacontent.TherecentH.264ScalableVideoCoding(SVC)extensiontotheAdvancedVideoCodingstandard(AVC),offersthefacilitytoadaptreal-timevideostreamsinresponsetothedynamicconditionsofmultiplenetworkpathsencounteredinmultihomedwirelessmobilenetworks.Nevertheless,pre-existingstreamingalgorithmsweremainlyproposedforAVCdeliveryovermultipathwirednetworksandwereevaluatedbysoftwaresimulation.

Thispaperintroducesapracticalhardware-basedtestbedwherebyweimplementandevaluatereal-timeH.264SVCstreamingalgorithmsinmultihomedwirelessmobilenetworks.Weproposeanoptimisedstreamingalgorithmwithmulti-foldtechnicalcontributions.Firstly,weextendedtheAVCpacketprioritisationschemestoreflectthegreatergranularityofSVC.Secondly,wedesignedamechanismfortheevaluationoftheeffectsofdifferentstreamer‘readaheadwindow’sizesonreal-timeperformance.Thirdly,wetookaccountofthepreviouslyunconsideredpathswitchingandmobilenetworkstunnellingoverheadsencounteredinreal-worlddeployments.Finally,weimplementedapathconditionmonitoringandreportingschemetofacilitatetheintelligentpathswitching.TheproposedsystemhasbeenexperimentallyshowntoofferasignificantimprovementinPSNRofthereceivedstreamcomparedwithrepresentativeexistingalgorithms.

7871-12, Session 3

A new bitstream structure for parallel CAVLC decodingY.Lee,K.Cho,SamsungElectronicsCo.,Ltd.(Korea,Republicof)

ACAVLCdecodercannotknowtheexactstartingpositionofthek-thsyntaxelementinabit-streamuntilitfinishesdecodingofthe(k-1)-thsyntaxelement.Itmakesaparalleldecodingdifficultinhardwareimplementation.Itsignificantlyincreasehardwarecosttopredictthestartingpositionofasyntaxelementpriortodecodingofitspreviousone.Inthispaper,weproposeanewbitstreamstructuretoconcurrentlyaccessmultiplesyntaxelementsforparallelCAVLCdecoding.Themethoddividesabit-streamintoNkindsofsegmentswhosesizeisMbitsandputssyntaxelementsintothesegments,basedonaproposedrule.Then,aCAVLCdecodercansimultaneouslyaccessNsegmentstoreadNsyntaxelementsfromasinglebitstreamanddecodetheminparallel.ThistechniqueincreasesthespeedofCAVLCdecodingbyuptoNtimes.Sincethemethodjustrearrangesgeneratedbit-stream,itdoesnotaffectcodingefficiency.Simulationresultsshowthatspeed-upis80%withN=2.

7871-13, Session 3

3D video sequence reconstruction algorithms implemented on DSPV.I.Ponomaryov,E.Ramos-Diaz,InstitutoPolitécnicoNacional(Mexico)

Depthmapusuallyservesasimportantinformationinseveralfields:videofiltering,robotnavigation,videoediting,etc.inimagesandvideosequences.Depthmapcomputationhasbeenstudiedwidely;however,toobtaindensedepthmapinformationfromvideosequencesisadifficultproblem.Alotofalgorithmshavebeenproposedtoaddresssomeoftheaforementionedissuesinstereovision,howeveritisstillrelativelyanopenproblem.Promisingmethodtovisualize3D

informationusestheanaglyph.

Inthiswork,weproposedanalgorithmtocomputethedepthmapinformationemployingtherealvideosequences.Obtaineddepthmapinformationisthenappliedintheconstructionof3Dvideosequencebymeansofanaglyphemployment.Inordertoimprovetheanaglyphs,thedepthmapmanipulationviaP-thlawcompressionisrealized,then,thevideoconstructionshouldbedone.WepresentacomparisonbetweendepthmapresultsusingdifferentWaveletsandothertechniques(Differential,StereoMatching,Warping,etc.).ThequantityofBadDisparitiesasquantitativecriterioninordertoselectthebetterdepthmapisapplied.VisualreconstructionresultsarealsocomparedwithclassicalPhotoshopalgorithminordertoprovetheefficiencyoftheproposedframework.Additionally,DigitalSignalProcessorTMS320DM642TM,Matlab2009aTMinanIntelCore2QuadProcessorTMareusedtoimplementtheproposedmethod.

7871-14, Session 3

Real-time patch sweeping for high-quality depth estimation in 3D videoconferencing applicationsW.Waizenegger,I.Feldmann,O.Schreer,Fraunhofer-InstitutfürNachrichtentechnikHeinrich-Hertz-Institut(Germany)

Infuture3Dvideoconferencingsystems,depthestimationisrequiredtosupportautostereoscopicdisplaysandevenmoreimportant,toprovideeyecontact.Real-time3Dvideoprocessingiscurrentlypossible,butwithinsomelimits.Sincesub-pixeldisparityestimationiscomputationallyexpensive,thedepthresolutionoffaststereoapproachesisdirectlylinkedtopixelquantizationandtheselectedstereobaseline.Thecomputationalloadrequires4x4sub-sampleddisparityestimationandthereforealossoffinedetails.

Planesweepingoffersthecapabilitytoincreasethedepthresolution,butafronto-parallelsurfaceassumptionismade.Patchbasedapproachesuseorientedspatialpatchesforpiecewiselinearapproximationoftherealobjectsurface,butcurrentalgorithmsarecomputationallyexpensiveandhardtoparallelize.Hence,anovelreal-timecapablealgorithmispresented,theso-calledpatchsweeping,whichcombinesplanesweepingwithpatchbased3DreconstructionbyexploitingtheprocessingcapabilitiesofstandardGPUs.

Fortunately,the3Dvideoconferencingscenarioallowsforsignificantsimplifications.Physiognomicconstraintsandcoarsedepthestimationresultsinduceavalidsearchrangefordepthrefinement.ThecurrentimplementationonasingleGPUperformsthreepairwisedepthestimationsofatrifocalcameraona256x256blockinreal-timeinhighqualitydepthresolution.

7871-15, Session 4

Real-time scene change detection assisted with camera 3A: auto exposure, auto white balance, and auto focusL.Liang,B.Hung,Y.Noyes,R.Velarde,QUALCOMMMEMSTechnologies,Inc.(UnitedStates)

Manyscenechangedetectiontechniqueshavebeendevelopedforscenecuts,fadeinandfadeoutbyanalyzingvideoencoderinputsignals.Forrealtimescenechangedetection,sensorinputsignalsprovidethefirst-handinformationwhichcanbeusedforscenechangedetection.Inthispaper,byanalyzingcamcorderfrontendsensorinputsignalswithourproposedalgorithmsbasedoncamera3A(autoexposure,autowhitebalanceandautofocus),anovelscenechangedetectiontechniquehasbeendeveloped.Withthefeatureofthefastresponsestothescene,camera3Abasedscenechangedetectionalgorithmcandetectscenechangesinatimelymannerandthereforefitswellforrealtimescenechangedetectionapplication.Experimentalresultsshowthatthisalgorithmcandetectscenechangeswithasatisfyingaccuracy.Asutilizingtheembedded3Afeatures,theproposedalgorithmiscomputationallyefficientandeasytobeimplemented.



IS&T /

ReturntoContents

7871-16, Session 4

Fast approximate 4D:3D discrete radon transform, from light field to focal stack with O(N^4) sumsJ.G.Marichal-Hernandez,J.P.Lüke,F.L.Rosa,J.M.Rodriguez-Ramos,Univ.deLaLaguna(Spain)

Inthisworkwedevelopanewalgorithm,thatextendsthebidimensionalFastDigitalRadontransformfromGötzandDruckmüller(1996),todigitallysimulatetherefocusingofa4Dlightfieldintoa3Dvolumeofphotographicplanes,aspreviouslydonebyRenNgetal.(2005),butwiththeminimumnumberofoperations.

Thisnewalgorithmdoesnotrequiremultiplications,justsums,anditscomputationalcomplexityisO(N^4)toachieveavolumeconsistingof2Nphotographicplanesfocusedatdifferentdepths,fromaN^4plenopticimage.

Thisreducedcomplexityallowsfortheacquisitionandprocessingofaplenopticsequencewiththepurposeofestimating3Dshapeatvideorate.ExamplesaregivenofimplementationsonGPUandCPUplatforms.

Finally,amodifiedversionofthealgorithmtodealwithdomainsofsizesdifferentthanpoweroftwo,isproposed.

7871-17, Session 4

A cross-based filter for fast edge-preserving smoothingK.Zhang,IMEC(Belgium)andKatholiekeUniv.Leuven(Belgium);J.Lu,AdvancedDigitalSciencesCtr.(Singapore);G.Lafruit,R.Lauwereins,IMEC(Belgium);L.J.VanGool,KatholiekeUniv.Leuven(Belgium)


7871-18, Session 4

Human action recognition in a wide and complex environmentS.Kumar,IndianInstituteofTechnologyRoorkee(India);S.KumarMalik,Univ.degliStudidiUdine(Italy);B.Raman,N.Sukavanam,IndianInstituteofTechnologyRoorkee(India)

Inthispaper,adirectfractionallineardiscriminantanalysis(DF-LDA)basedclassifieremployedinatreestructureispresentedtorecognizethehumanactionsinawideandcomplexenvironment.Inparticular,theproposedclassifierisbasedonasupervisedlearningprocessandachievestherequiredclassificationinamulti-stepprocess.Thismulti-stepprocessisperformedsimplybyadoptingatreestructuredwhichisbuiltduringthetrainingphase.Hence,thereisnoneedofanyprioriinformationlikeinotherclassifierssuchasthenumberofhiddenneuronsorhiddenlayersinamultilayerneuralnetworkbasedclassifieroranexhaustivesearchasusedintrainingalgorithmsfordecisiontrees.Askeletonbasedstrategyisadoptedtoextractthefeaturesfromagivenvideosequencerepresentinganyhumanaction.Apan-tilt-zoom(PTZ)cameraisusedtomonitorthewideandcomplextestenvironment.Abackgroundmosaicimageisbuiltofflineandusedtocomputethebackgroundframeinrealtimeforanygivenpanandtiltsetting.Abackgroundsubtractionstrategyhasbeenadoptedfordetectingtheobjectinvariousframesandtoextracttheircorrespondingsilhouette.Askeletonbasedprocessisusedtoextractattributesofafeaturevectorcorrespondingtoahumanaction.Finally,theproposedframeworkistestedonvariousindoorandoutdoorscenariosandencouragingresultsareachievedintermsofclassificationaccuracyandcomputationaltime.


Swimming behavior detection for Nitocra Spinipes in water quality evaluationZ.Jia,W.Wang,HenanPolytechnicUniv.(China)

TheEnvironmentalProtectionAgencyhasovertheyearssuggestedanumberofbiologicaltestsforcharacterizationofindustrialwastewater,amongwhichatestwiththebrackishwatercrustaceanNitocraspinipescanbefound.Itisofsubstantialinteresttodesignalow-costearlywarningtestapparatusforbrackishwater.TheprincipleofatestistomonitortheswimmingbehaviourofNitocraspinipesbytheuseofdigitizedvideofilmsindaylightorindoorillumination.Inourstudy,grownanimalsareofasizebetween0.6to0.8mm,anditisempiricallyknownthattheirswimmingbehaviourisaffectedbytheamountoftoxicsubstancesinthewater.Thefirstprocessingstepistomanuallymarkthepositionofeachanimalonastartingimage,thenfindouttheimagedifferencebetweenstartingimageandtheimageinthesequence,finallyfindoutlocationsofanimalsonthenewimage.Theprocessingresultissatisfactory.Forealltheworkingsequence,aWindowsprogramwasdeveloped,thesoftwaresystemcanprocesstheimageseasily.


Human heart movement tracing on ultrasonic imagesX.Yang,FuzhouUniv.(China)

Currently,theaccuratehumanheartdiagnosingismoreandmoreimportantsincethiskindofsicknessincreasesyearandyear.Thetraditionalmethodcanonlyobtaintheheartmovementrateandacceleration,butwithoutdisplacementororientationinformation,inordertosupplementtheheartmovementinformation,thispaperpresentsanalgorithmforcollectinghumanheartinformationontwo-dimensionalultrasonicimages.Thealgorithmfirstlyenhancesimagesforeasilydetectingkeyheartpointsthathaveobviousinformationforheartmovements,then,thealgorithmfindsoutthekeypointsbydetectingobviousdisplacements,subsequentlyittracestheheartmovementsanddetectsthemovementdirections,andfinallyitevaluatesthemainorientationanddisplacementoftheheartmovements,andcalculatestheaveragespeedandaccelerationofthemovementsbasedonthesequenceultrasonicimages.Thealgorithmistestedbyusinganumberofhumanultrasonicimages,andtheexperimentsshowthedetectionresultsarecorrect.Whencomparedtoothertechniques-traditionalmethod,thisnoninvasiveapproachcanclearlyyieldmoreaccurateresults.Inthisway,theinformationoftheheartmovementscanbeobtainedaccurately,andtheinformationcanbeusedforhumanhealthevaluationbydoctors.Itprovidesquantitativefoundationforheartdiagnosing.


Efficient object tracking in WAAS data streamsT.R.Clarke,BallAerospace&TechnologiesCorp.(UnitedStates)andRochesterInstituteofTechnology(UnitedStates)

Wideareaairbornesurveillance(WAAS)systemsareanewclassofremotesensingimagerswhichhavemanymilitaryandcivilianapplications.Thesesystemsarecharacterizedbylongloitertimes(extendedimagingtimeoverfixedtargetareas)andlargefootprinttargetareas.Thesecharacteristicscomplicatemovingobjectdetectionandtrackingduetothelargeimagesizeandhighnumberofmovingobjects.ThispresentationevaluatesexistingobjectdetectionandtrackingalgorithmswithWAASdataandprovidesenhancementstotheprocessingchainwhichdecreaseprocessingtimeandmaintainorincreasetrackingaccuracy.Decreasesinprocessingtimeareneededtoperformreal-timeornearreal-timetrackingeitherontheWAASsensorplatformoringroundstationprocessingcenters.Increasedtrackingaccuracybenefitsreal-timeusersandforensic(off-line)users.



IS&T /

ReturntoContents


How fast can one numerically reconstruct digitally recorded holograms?L.Bilevich,L.Yaroslavsky,TelAvivUniv.(Israel)

Resultsofcomparativestudyofthecomputationalcomplexityofdifferentalgorithmsfornumericalreconstructionofelectronicallyrecordedhologramsarepresentedanddiscussed.Thefollowingalgorithmswerecompared:conventionalFourierandconvolutionalalgorithmswithandwithoutscalingandanewuniversalDCT-basedalgorithm,intermsofthenumberofoperationsandrequiredcomputertime.Basedonthecomparisonresults,thefeasibilityofreal-timeimplementationofnumericalreconstructionofhologramsisevaluated.


Tracking flow of leukocytes in blood for drug analysisA.Basharat,W.D.Turner,Kitware,Inc.(UnitedStates);G.Stephens,B.Badillo,R.Lumpkin,P.Andre,PortolaPharmaceuticalsInc.(UnitedStates);A.Perera,Kitware,Inc.(UnitedStates)

Modernmicroscopytechniquesallowtheimagingofbloodcomponents,includingleukocytes,underflowconditions.Theresultingvideosequencesprovideuniqueinsightsintothebehaviorofbloodundernormalandrestrictedflowsuchaswouldbefoundwithinvasculatureandtheyalsoallowfortestingvariousdrugtherapies;however,manualanalysisofthesevideosequencesisintractable,requiringhoursper6minutevideoclip.Inthispaper,wepresentanautomatedtechniquetoanalyzeleukocyteflowthroughthemicroscopestage.Ourtechniquesdetectandtrackthoseleukocyteswhichsloworwhichadheretothemicroscopeflowchamber.Weautomaticallycountthedetections,measurethevelocity,andidentifyleukocyteswhichstronglyadhere.Fromthis,wecalculateandgraphstatisticsofleukocytedetections,velocitydistributions,andadherence.


Phase correlation based adaptive mode decision for the H 264/AVCA.Abdelazim,S.Mein,M.R.Varley,Univ.ofCentralLancashire(UnitedKingdom);C.Grecos,Univ.oftheWestofScotland(UnitedKingdom);D.Ait-Boudaoud,Univ.ofPortsmouth(UnitedKingdom)

TheH.264videocodingstandardachieveshighperformancecompressionandimagequalityattheexpenseofincreasedencodingcomplexity,duetotheveryrefinedMotionEstimation(ME)andmodedecisionprocesses.Thispaperfocusesondecreasingthecomplexityofthemodeselectionprocessbyeffectivelyapplyinganovelfastmodedecisionalgorithm.

Firstlythephasecorrelationisanalysedbetweenamacroblockanditspredictionobtainedfromthepreviouslyencodedadjacentblock.Relationshipsareestablishedbetweenthecorrelationvalueandobjectsizeandalsobestfitmotionvector.

Fromthisanovelfastmodedecisionandmotionestimationtechniquehasbeendevelopedutilisingpre-processingfrequencydomainMEinordertoaccuratelypredictthebestmodeandthesearchrange.Wemeasurethecorrelationbetweenamacroblockandthecorrespondingprediction.Basedontheresultweselectthebestmode,orlimitthemodeselectionprocesstoasubsetofmodes.MoreoverthecorrelationresultisalsousedtoselectanappropriatesearchrangefortheMEstage.

ExperimentalresultsshowthattheproposedalgorithmsignificantlyreducesthemotionestimationtimewhilstmaintainingsimilarRateDistortionperformance,whencomparedtoboththeH.264/AVCJointModel(JM)referencesoftwareandrecentlyreportedwork.


Fast multilayered prediction algorithm for group of pictures in H 264/SVCA.Abdelazim,S.Mein,M.R.Varley,Univ.ofCentralLancashire(UnitedKingdom);C.Grecos,Univ.oftheWestofScotland(UnitedKingdom);D.Ait-Boudaoud,Univ.ofPortsmouth(UnitedKingdom)

Theobjectiveofscalablevideocodingistoenablethegenerationofauniquebitstreamthatcanadapttovariousbit-rates,transmissionchannelsanddisplaycapabilities.Thescalabilityiscategorisedintermsoftemporal,spatial,andquality.Toimproveencodingefficiency,theSVCschemeincorporatesinter-layerpredictionmechanismswhichincreasescomplexityofoverallencoding.

InthispaperseveralconditionalprobabilitiesareestablishedrelatingmotionestimationcharacteristicsandthemodedistributionatdifferentlayersoftheH264/SVC.Anevaluationoftheseprobabilitiesisusedtostructurealow-complexitypredictionalgorithmforGroupofPictures(GOP)inH.264/SVC,reducingcomputationalcomplexitywhilstmaintainingsimilarperformance.

WhencomparedtotheJSVMsoftware,thisalgorithmachievesasignificantreductionofencodingtime,withanegligibleaveragePSNRlossandbit-rateincreaseintemporal,spatialandSNRscalability.Experimentsareconductedtoprovideacomparisonbetweenourmethodandarecentlydevelopedfastmodeselectionalgorithm.Thesedemonstrateourmethodachievesappreciabletimesavingsforscalablespatialandscalablequalityvideocoding,whilemaintainingbetterPSNRandlowerbitrate.


X-Eye: a novel wearable vision systemY.Wang,C.Fan,S.Chen,H.Chen,Fu-JenCatholicUniv.(Taiwan)

Thispaperproposesasmartportabledevice,namedtheX-Eye,whichprovidesagestureinterfacewithasmallcomputingdeviceandalargedisplayfortheapplicationofphotocaptureandmanagement.Thesmallportabledevicecanachievethecaptureofphotosatanytimeandanywhereanddisplaycapturedphotosonlargescreenupto42inches.Thewearablevisionsystemisimplementedwithadual-coreembeddedsystemandcanachievereal-timeperformance.ThedisplaydeviceisapicoDLPprojectorwhichhasasmallvolumesizebutcanprojectlargescreensize.Fivesoftwaremodulesareintegratedintotheembeddedhardware.Coloridentificationandgesturerecognitionarethecoreofthesoftwaretechnologiesinthepaper.

ThedimensionsoftheX-Eyeareoptimizedtobe8.7(W)x8.5(H)x3.2(D)cm3,anditsweightis170g.TotalpowerconsumptionoftheX-Eyeisnomorethan9.5W.Thescreenresolutionis640x480pixels.Theprocessingspeedofthewholesystemincludingthegesturerecognitioniswiththeframerateof20FPS.Experimentalresultsgive85%recognitionrate.Itdemonstratesthatthissystemhaseffectivegestureinterfacewithreal-timeperformance,smallsize,butlargescreen.


Real-time vehicle matching for multi-camera tunnel surveillanceV.Jelaca,J.O.Nino-Castaneda,A.Frias-Velazquez,A.Pizurica,W.R.Philips,Univ.Gent(Belgium)

Trackingmultiplevehicleswithmultiplecamerasintunnelsisachallengingproblemofgreatimportancefortunnelsafety.Oneofthemainchallengesisaccuratevehiclematchingacrossthecameraswithnon-overlappingfieldsofview.Sincethecamerasusedinvideosurveillanceareusuallyofsubstantiallylowtomediumresolution,themotionblurandnoisearesignificantanditisdifficulttoextract



IS&T /

ReturntoContents

informativefeaturesfromtheacquiredvehicleimages.Additionally,computationalefficiencyisessentialbecausethesystemsdedicatedtotunnelsurveillancecancontainhundredsofcameraswhichobservedozensofvehicleseach.Inthispaper,weproposealowcomplexity,yethighlyaccuratemethodforvehiclematchingusingvehiclesignaturescomposedofRadontransformlikeprojectionsofthevehicleimage.Theproposedsignaturescanbecalculatedbyasimplescan-linealgorithm,bythecamerasoftwareitselfandtransmittedtothecentralserverortotheothercamerasinasmartcameraenvironment.Theamountofdataisdrasticallyreducedcomparedtothewholeimage,whichrelaxesthedatalinkcapacityrequirements.Experimentsonrealvehicleimages,extractedfromvideosequencesrecordedinatunnelbytwodistantsecuritycameras,validatetheproposedmethod.


Differential coding of intra modes for high efficiency video codingE.Maani,SonyElectronicsInc.(UnitedStates);W.Liu,HangzhouDianziUniv.(China)

Spatialdomaindirectionalintrapredictionhasbeenshowntobeveryeffectivetoremovethecorrelationbetweenthepixelsinthecurrentblockandreconstructedneighbors.InAVC,8directionalpredictionmodes(plustheDCpredictionmode)aredefined.Thepredictionmodenumberissignaledtothedecoderusingasimplepredictivecodingmethod.Thecurrentintrapredictionhastwomajordisadvantages:1)thesmallnumberofdirectionsdoesnotprovidesufficientprecisiontocoverarbitrarydirectionalpatterns;and2)themodenumberpredictionfromneighborsisnotaccurateenoughtoexploitthegeometricdependencybetweenblocks.Increasingthenumberofdirectionstypicallyresultsinalowerresidualenergy,however,thecostforsignalingthepredictionmodemayalsoincreasesignificantlysuchthatlittlegainisobserved.Thisisespeciallythecaseforsmallblocksizessuchas4x4or8x8.Toaddressthisproblem,inthissubmission,weproposeanewmethodtoaccuratelypredicttheintradirectionsfromreconstructedneighboringpixelsanddifferentiallyencodetheintradirections.Thisallowsamoreprecisedirectionalpredictionwithoutthesignificantincreaseinthecostfortransmittingthesideinformation.Simulationresultsshowsthatthenewintrapredictionmethodcanprovideasmuchas13%bitratereductioncomparedtoAVCintraprediction.



IS&T /

ReturntoContents

Conference 7872: Parallel Processing for Imaging ApplicationsMonday-Tuesday24-25January2011PartofProceedingsofSPIEVol.7872ParallelProcessingforImagingApplications

7872-01, Session 1

Using a commercial graphical processing unit (GPU) and the CUDA programming language to accelerate image processing applicationsR.P.Broussard,R.Ives,U.S.NavalAcademy(UnitedStates)

Theprocessingpoweravailableincurrentvideographicscardsisapproachingsupercomputerlevels.Inthepasttwoyearstheprocessingpowerinthesecardshasquadrupled.State-of-the-artgraphicalprocessingunits(GPU)boastofcomputationalperformanceintherangeof1.4trillionfloatingpointoperationspersecond(1.4Teraflops).Thisprocessingpowerisreadilyaccessibletothescientificcommunityatarelativelysmallcost.Highlevelprogramminglanguagesarenowavailablethatgiveaccesstotheinternalarchitectureofthegraphicscardallowinggreateralgorithmoptimization.Thisresearchtakescomputationallyexpensiveportionsofanimage-basedirisidentificationalgorithmandhostsitonaGPUusingtheC++compatibleCUDAlanguage.Theselectedsegmentationalgorithmusesbasicimageprocessingtechniquessuchasimageinversion,valuesquaring,thresholding,dilation,erosionandthecomputationallyintensivelocalkurtosiscalculation(fourthstandardizedmoment)andcircularHoughtransform.StrengthsandlimitationsoftheGPUSingleInstructionMultipleDataarchitecturearediscussed.Theprimarysourceofthegraphicalprocessingpower,themultipleprocessingelementsandlayeredmemorysystem,arediscussedindetail.Actualmemoryaccessandinstructionexecutiontimesareprovided.Impressiveaccelerationresultswereobtained.Theirissegmentationalgorithmwasacceleratedbyafactorof150overthehighlyoptimizedC++versionhostedonthecomputer’scentralprocessingunit.Somepartsofthealgorithmranatspeedsthatwereover400timesfasterthantheirC++counterpart.CUDAprogrammingdetailsandcodesamplesarepresentedaspartoftheaccelerationdiscussion.

7872-02, Session 1

Automatic distribution of vision-tasks on computing clustersT.Müller,A.Knoll,TechnischeUniv.München(Germany)

Distributionofcomputervisiontasksinparallelenvironmentsisessentialconsideringtheincreasingdemandforcomputationalresourcestoaccomplishadvancedvisualprocessingtasks.

Thus,aconsistentandefficientbutyetconvenientsystemforparallelcomputervision,andinfactalsorealtimeactuatorcontrolisproposed.Thesystemimplementsthemulti-agentparadigmandablackboardinformationstorage.This,incombinationwithagenericinterfaceforhardwareabstractionandintegrationofexternalsoftwarecomponents,issetuponbasisofthemessagepassinginterface,whichisthedefactostandardforHPCenvironmentsandhenceprovidessupportforalargevarietyofplatformsandhasahugeuserpool.

Thesystemfurthermoreallowsfordata-andtask-parallelprocessing,andsupportsbothsynchronouscommunication,asdataexchangecanbetriggeredbyevents,andasynchronouscommunication,asdatacanbepolled,strategies.Also,byduplicationofprocessingunits(agents)redundantprocessingispossibletoachievegreaterrobustness.

Asthesystemautomaticallydistributestheagentstoavailableresources,andamonitoringconceptallowsforcombinationoftasksandtheircompositiontocomplexprocesses,itisveryeasytodevelopvision/roboticsapplicationsquickly.

Thus,forevaluationmultiplevisionbasedapplicationshavealreadybeenimplemented,e.g.anevolutionaryapproachforlearningvisual

saliencyfeatures,oraparallelactiveperceptionsystemforroboticrecognitionandhandlingoflimpobjects.

7872-03, Session 1

Highly scalable digital front end architectures for digital publishingD.Staas,Hewlett-PackardCo.(UnitedStates)

HP’sdigitalprintingpressesconsumeatremendousamountofdata.ThearchitecturesoftheDigitalFrontEnds(DFEs)thatfeedtheselarge,veryfastpresseshaveevolvedfrombasic,single-RIP(RasterImageProcessor)systemstomulti-rack,distributedsystemsthatcantakeaPDFfileanddeliverdatainexcessof1.1Gigapixelspersecondtokeepthepressesprintingat2500+pagesperminute.ThispaperhighlightssomeofthemoreinterestingparallelismfeaturesofourDFEarchitecture.

Thehigh-performancearchitecturedevelopedoverthelast5+yearscanscaleuptoHP’slargestdigitalpress,outtomultiplemid-rangepresses,anddownintoaverylow-costsingleboxdeploymentforlow-enddevicesasappropriate.Principlesofparallelismpervadeeveryaspectofthearchitecture,fromthelowest-levelelementsofjobstoparallelimagingpipelinesthatfeedmultiplepresses.

Fromcorestothreadstoarraystonetworkteamstodistributedmachines,weuseasystematicapproachtomovebottlenecks.Theultimategoalsoftheseeffortsare:totakethebestadvantageoftheprevailinghardwareoptionsatourdisposal;toreducepowerconsumptionandcoolingrequirements;andtoultimatelyreducethecostofthesolutiontoourcustomers.

7872-04, Session 1

Parallel training and testing methods for complex image processing algorithms on distributed, heterogeneous, unreliable, and non-dedicated resourcesR.Usamentiaga,D.F.García,J.Molleda,I.Sainz,F.G.Bulnes,Univ.deOviedo(Spain)

Advancesintheimageprocessingfieldhavebroughtnewmethodswhichareabletoperformcomplextasksrobustly.However,inordertomeetconstraintsonfunctionalityandreliability,imagingapplicationdeveloperscommonlydesigncomplexalgorithmswithmanyparameterswhichneedtobefinelytunedforeachparticularenvironment.Thebestapproachtotunethesealgorithmsistouseanautomatictrainingmethod,butamajorissueariseswhendesigningthiskindoftrainingmethod:thecomputationalcost.Theexecutionofthetrainingmethodcanbecompletelyprohibitive,eveninpowerfulmachines.Thesameproblemshowsupwhendesigningtestingprocedures.Thisworkpresentsmethodstotrainandtestcompleximageprocessingalgorithmswithinparallelexecutionenvironments.Theapproachproposedinthisworkistouseexistingresources,inofficesorlaboratories,ratherthanexpensiveclusters.Theseresourcesaretypicallynon-dedicated,heterogeneousandunreliable.Theproposedmethodshavebeendesignedtodealwithalltheseissues.Twodifferentmethodsareproposed:intelligenttrainingbasedongeneticalgorithmsandPVM,andafullfactorialdesignbasedongridcomputingwhichcanbeusedfortrainingortesting.Thesemethodsarecapableofharnessingtheavailablecomputationalpowerresources,givingmoreworktomorepowerfulmachines,andalsotakingitsunreliablenatureintoaccount.Bothmethodshavebeentestedusingrealapplications.


IS&T /

ReturntoContents

7872-05, Session 1

Integrated parallel printing systems with hypermodular architectureD.K.Biegelsen,L.Crawford,C.Eldershaw,M.Fromherz,PaloAltoResearchCenter,Inc.(UnitedStates);G.Kott,B.Mandel,S.Moore,XeroxCorporation(UnitedStates);B.Preas,L.Swartz,PaloAltoResearchCenter,Inc.(UnitedStates)

Printingsystemscomposedofmultiple,interconnectedmarkingengines(MEs)providemanypotentialadvantagescomparedwithadhocsingleenginedesigns.Tightlyintegratedmulti-MEsystemsenable,forexample,nearoptimalsystemutilizationanddocumentthroughputthroughmulti-threadedjobproduction.Composableprintingsystemsallowreconfigurabilitytomatchuserneedsandredundancytosupporthighreliability.TheworkpresentedheredescribesasystemoffourMEslinkedbyapaperpaththathasadeeplevelofmodularity.Thepaperpathconsistsofaregulargridpopulatedbyasmallnumberofmoduletypes-nipmodulestoprovidebidirectionalsheetmotionandtwotypesofdirectorsforstaticanddynamicdefinitionofpathtopology.Eachmoduleiscapableofacting,sensing,computingandcommunicating.ModulesincludingMEs,arehot-swappable,andthesystemiscapableofauto-configuration.Real-timeplanningandcontrolsoftware,likethehardware,isdesignedtobemodular,distributed,reconfigurableandscalable.Thesystemcanhandleexceptions,suchassheetjams,whilemaintaining(reduced)throughput.

7872-06, Session 2

Parallel processing considerations for image recognition tasksS.J.Simske,Hewlett-PackardLabs.(UnitedStates)

Manyimagerecognitiontasksarewell-suitedtoparallelprocessing.Themostobviousexampleisthatmanyimagingtasksrequiretheanalysisofmultipleimages.Fromthisstandpoint,then,parallelprocessingneedbenomorecomplicatedthanassigningindividualimagestoindividualprocessors.However,therearethreelesstrivialcategoriesofparallelprocessingthatwillbeconsideredinthispaper:parallelprocessing(1)bytask;(2)byimageregion;and(3)bymeta-algorithm.

Parallelprocessingbytaskallowstheassignmentofmultipleworkflows-asdiverseasopticalcharacterrecognition[OCR],documentclassificationandbarcodereading-toparallelpipelines.Thiscansubstantiallydecreasetimetocompletionforthedocumenttasks.Forthisapproach,eachparallelpipelineisgenerallyperformingadifferenttask.

Parallelprocessingbyimageregionallowsalargerimagingtasktobesub-dividedintoasetofparallelpipelines,eachperformingthesametaskbutonadifferentdataset.Thistypeofimageanalysisisreadilyaddressedbyamap-reduceapproach.Examplesincludedocumentskewdetectionandmultiplefacedetectionandtracking.

Finally,parallelprocessingbymeta-algorithmallowsdifferentalgorithmstobedeployedonthesameimagesimultaneously.Thisapproachmayresultinimprovedaccuracy.

7872-07, Session 2

GPGPU real-time texture analysis frameworkM.A.Akhloufi,Ctr.ofRoboticsandVision(Canada)andLavalUniv.(Canada)

Inrecentyearsweassisttoanincreaseofinterestinusingtexturefeaturesinindustrialapplications.Forthistypeofapplications,realtimeprocessingofcapturedimagesisanimportantissue,particularlywiththecurrentincreaseinimageresolutionsusedinrealworldapplications.Differentapproachesareavailabletosolvethisproblem:DSPprocessing,FPGA,specializedhardware,parallelsystems,computerclusters.Allthesesolutionscomeatahighercost.

Morerecentlyaresearchcommunitybecameinterestedingraphicprocessingunitsavailableincommercialgraphiccards.ThisdomaincalledGPGPU(General-PurposecomputationonGraphicsProcessingUnits)aimtousingtheprocessingpoweroftheGPUinordertoaccelerategeneralprocessinglikemathematics,3Dvisualization,imageprocessing,etc.

InthisworkwepresenttheuseofGPGPUtechnologyforbuildingaframeworkforparallelrealtimetextureanalysisincomputervision.Thefollowingtechniquesweredeveloped:LocalBinaryPattern(LBP),LocalTernaryPattern,Lawstexturekernels,GaborfilterjetsandGrayLevelCo-OccurenceMatrix(GLCM).

ForthisworkwechosetouseCUDAtechnologyfordevelopingtheproposedalgorithms.CUDA(ComputeUnifiedDeviceArchitecture)isaparallelcomputingarchitecturedevelopedbyNVIDIA.ItenablesdramaticincreasesincomputingperformancebyharnessingthepoweroftheGPU(graphicsprocessingunit)andparallelarchitectureprogramming.GPUoptimizationsarecomparedtoCPUoptimizationsusingMMX-SSEtechnologiesandMulticoreparallelprogramming.TheexperimentalresultsshowanimportantincreaseintheperformanceoftheproposedalgorithmswhenGPGPUisusedparticularlyforlargeimagesizes.

7872-08, Session 2

A parallel implementation of 3D Zernike moment analysisD.Berjón,S.Arnaldo,F.Morán,Univ.PolitécnicadeMadrid(Spain)

Zernikepolynomialsareawellknownsetoffunctionsthatfindmanyapplicationsinimageorpatterncharacterizationbecausetheyallowtoconstructshapedescriptorsthatareinvariantagainsttranslations,rotationsorscalechanges.Theconceptsbehindthemcanbeextendedtohigherdimensionspaces,makingthemalsofittodescribevolumetricdata.Theyhavebeenlessusedthantheirpropertiesmightsuggestduetotheirhighcomputationalcost.

Wepresentaparallelimplementationof3DZernikemomentsanalysis,writteninCwithCUDAextensions,whichmakesitpracticaltoemployZernikedescriptorsininteractiveapplications,yieldingaperformanceofseveralframespersecondinvoxeldatasetsabout200^3insize.

Inourcontribution,wedescribethechallengesofimplementing3DZernikeanalysisinaGPGPU.Theseincludehowtodealwithnumericalinaccuracies,duetothehighprecisiondemandsofthealgorithm,orhowtodealwiththehighvolumeofinputdatasothatitdoesnotbecomeabottleneckforthesystem.

OurGPU-basedimplementationrunsabouttentimesfasterthanourpreviousCPU-basedone.

7872-09, Session 2

A novel parallel algorithm for airport runway segmentation in satellite images using priority-directional region growing strategy based on ensemble learningF.Duan,Y.Zhang,TsinghuaUniv.(China)

Thispaperaddressestheproblemofairportdetectionandrunwaysegmentationinsatelliteimageswithcomplexbackgroundclutter.Tothisends,weproposeanovelensemblelearningbasedparallelrunwaysegmentationalgorithm.Thecontributionsofourworkcanbesummarizedasfollows:(a)weproposetheconceptofprioritydirectionregiongrowing.(b)WeintroducetheBresenham’slinegenerationalgorithmintooursegmentationtask.(c)weadoptatwo-stagestrategytobettersegmenttheregionscorrespondingtotheairportrunwaybyapplyingtraditionalregiongrowingmethodandourprioritydirections(twoorthogonaldirectionsinourproblem)growingmethod.(d)Inourrunwaysegmentationalgorithm,ensemble-learningstrategyisusedtocombinethegrowingresultsofeachlinesegment.Inaddition,thosethinbranches,whichhavesignificantlydifferentwidth,areeliminated.

Conference 7872: Parallel Processing for Imaging Applications


IS&T /

ReturntoContents

Toevaluatetheeffectivenessofouralgorithm,extensivesimulationsarecarriedoutonthetestingimagesobtainedfromGoogleMap.Ourexperimentalresultsshowthatouralgorithmcaneffectivelyandefficientlysegmentedtheairportregion,andgenerateveryneatboundariesoftherunways,andhavegreatsuperiorityoverthestate-of-the-artmethods.

7872-10, Session 2

Visualization assisted by parallel processingB.Lange,Lab.d’InformatiquedeRobotiqueetdeMicroelectroniquedeMontpellier(France);X.Vasques,H.Rey,IBM(France);N.Rodriguez,W.Puech,Lab.d’InformatiquedeRobotiqueetdeMicroelectroniquedeMontpellier(France)

Thispaperdiscussestheexperimentalresultsofourvisualizationmodelfordataextractedfromsensors.Wehavetofindthefasterandmoreefficientmethodtoproducearealtimerenderingvisualizationforalargeamountofdata.Wedevelopvisualizationmethodstomonitortemperaturevarianceofadatacenter.Sensorsareplacedonthreelayersanddonotcoveralltheroom.Weuseparticleparadigmtointerpolatedatasensors,aspresentedincite{Latta2004}.Particlesmodelthe“space”intheroom,asstatedbytheIndustryFoundationClasses(IFC)model,astandardbuildingsemanticmodel.Inthisworkwemakeapartitionoftheparticlesetusingtwomathematicalmethods:DelaunaytriangulationandVorono”icells.AvisandBhattacharyapresentthesealgorithmsincite{Avis1983}.Particlescarryinformationoftheroomtemperature,anddisplaytheevolutionoftemperatureintime.Tolocateandupdateparticlesdatawehavedefinedacomputationalcostfunction.Tosolvethisfunctioninanefficientway,weuseaclientserverparadigmcite{Kapferer2008}.Servercomputesdata,clientdisplaythisdatainavirtualscreencomposedfromfourvideoprojectorsinstereoscopicview.Thispaperisorganizedasfollows.Thefirstpartpresentsrelatedsolutionsusedtovisualizelargeflowofdata.Thesecondpartpresentsourdifferentplatformsandmethodsused,whichhavebeenevaluatedinordertodeterminethebettersolutionforthetaskproposed.Thebenchmarkusethecomputationalfunctionofouralgorithm,itwasbasedonlocatedparticlescomparedtosensorsandonupdateofparticlesvalue.Figureref{fig-10}andref{fig-11}illustratetheinclusionmethodusingraytracing,eachparticleistestedonthenearesttetrahedron.ThebenchmarkwasdoneonapersonalcomputerusingCPU,multicoreprogramming.However,realtimerenderingishardtohave.ThisworkisperformedincollaborationwithIBMMontpellier,andwithinthiscollaboration,ourworkcanbeimprovedonHighPerformanceComputerandonahybridCPU,GPUserver.Thefirstmethodiscommonlyusedindatavisualization(astronomy,physic,etc.)andthesecondoneisgrowingincomputerscience.Withthisotherdifferentplatformwewanttohavearealtimerenderingofourlargedataflow.

7872-11, Session 3

A parallel impulse-noise detection algorithm based on ensemble learning for switching median filtersF.Duan,Y.Zhang,TsinghuaUniv.(China)

Inthispaper,ahighlyeffectiveandefficientensemblelearning-basedparallelalgorithmforimpulsenoisedetectionisproposed.Thecontributionofthispaperisthree-fold.First,anovelintensityhomogeneitymetric,whichhasverypowerfuldiscriminativeabilitythathasbeenproven,isproposed.Second,thisproposedalgorithmhashighparallelisminfeatureextractionstage,classifiertrainingandtestingstage.Finally,insteadofmanuallytuningthethresholdsforeachfeatureasmostoftheworksinthisresearchareado,RandomForests(RF)isusedtomakedecisionsinceithasbeendemonstratedtoownbettergeneralizationabilityandperformancecomparabletoSVMsinclassificationproblem.AnotherimportantreasonwhyRFisadoptedisthatithasnaturalparallelismstructureandverysignificantperformanceadvantage(e.g.theoverheadoftrainingandtestingthemodel)overotherpopularclassifierse.g.SVMs.Tothebestofourknowledge,this

isthefirsttimethattheensemblelearningstrategieshavebeenusedintheareaofswitchingmedianfiltering.Extensivesimulationsarecarriedoutoneightmostcommonstandardtestingimages.Theexperimentalresultsshowthatouralgorithmachieveszero-missdetectionresultwhilekeepingthefalsealarmrateataratherlowlevel,andhasgreatsuperiorityoverotherstate-of-the-artmethods.

7872-12, Session 4

GPU color space conversionG.L.Vondran,Jr.,P.Chase,Hewlett-PackardCo.(UnitedStates)

Tetrahedralinterpolationiscommonlyusedtoimplementcontinuouscolorspaceconversionsfromsparse3Dand4Dlookuptables.WeinvestigatetheimplementationandoptimizationoftetrahedralinterpolationalgorithmsforGPUs,andcomparetothebestknownCPUimplementations.Weshowthata$350NVIDIAGTX-470GPUis4xfasterthana$1000IntelCorei7980XCPUfor3Dinterpolation,and16xfasterfor4Dinterpolation.

Performance-relevantGPUattributesareexploredincludingthreadscheduling,localmemorycharacteristics,globalmemoryhierarchy,andcachebehaviors.WeconsiderexistingtetrahedralinterpolationalgorithmsandtunebasedonthestructureandbranchingcapabilitiesofcurrentGPUs.Globalmemoryperformanceisimprovedbyreorderingandexpandingthelookuptabletoensureoptimalaccessbehaviors.Permultiprocessorlocalmemoryisexploitedtoimplementoptimallycoalescedglobalmemoryaccesses,andlocalmemoryaddressingisoptimizedtominimizebankconflicts.Weexploretheimpactsoflookuptabledensityuponcomputationandmemoryaccesscosts.

WepresentaCPU-based3Dinterpolator,usingSSEvectoroperations,thatisfasterthananypreviouslypublishedsolution.

7872-13, Session 4

Acceleration of the Retinex algorithm for image restoration by GPGPU/CUDAY.Wang,W.Huang,Fu-JenCatholicUniv.(Taiwan)

Inthispaper,adataparallelalgorithmcalledGPURetinexisproposedtoparallelizetheRetinexalgorithmonGPGPU.ThecomputingoftheGaussianblurintheGPURetinexadoptsseparableconvolutionkernelstoreducethecomputationandtheinternaldatatransfer.ThedatadistributionofparallelGaussianblurconvolutionadoptsahorizontalstripemethod.EachthreadreadspixelsofahorizontalstripeoftheimagetoimplementtheGaussianblurconvolution.TheGaussianblurutilizestexturememoryandconstantmemorytoimproveefficiency.Thedatadistributioninthelog-domainsubtractionandnormalizationstepsusesasquaresubimagemethod.Eachthreadcomputesthetwooperationsforallpixelswithinitssquaresubimage.Aparallelreductionmethodisdevisedtofindthemaximumandminimumvaluesofthelog-domainsubtractionimage.Threadswithinagridcommunicatebyglobalmemoryandsharedmemory.

Inourexperiments,theGT200GPUandCUDA3.0aredeployed.TheexperimentalresultsshowthattheGPURetinexcangain23xspeedupcomparedwithCPU-basedimplementationontheimageswith2048x2048resolution.OurexperimentalresultsindicatethatusingCUDAcanmovetheRetinexalgorithmtotheGPUhardwareandachieveaccelerationtogainthereal-timeperformance.

7872-14, Session 4

Performance evaluation of Canny edge detection on a tiled multicore architectureA.Z.Brethorst,N.Desai,D.Enright,R.Scrofano,TheAerospaceCorp.(UnitedStates)

Becausetransistorsizehascontinuedtofallwhileprocessorclock



IS&T /

ReturntoContents

frequencyhasremainedfairlystatic,thetrendincomputerarchitecturehasbeentodevelopmulticoreprocessors.Totakefulladvantageofmulticoreprocessors,applicationsmustbeamenabletoparallelization.Manyimagingapplicationscanbenefitfrommulticoretechnologybecausetheyconsistoflocalizedoperations.Hence,thereareamultitudeofmulticoreplatformsandparallelizationstrategiesthatcanbeappliedtoimagingproblems.

Avarietyofmulticorearchitectureshavebeenintroduced,includingmoretraditionalshared-memoryparallelarchitecturesandmoreexotictiledmulticorearchitectures.Tiledmulticorearchitectureshavemanyprocessorcoresconnectedbyanon-chipinterconnectionnetwork.Withthesefeatures,tiledmulticorearchitecturescansupportavarietyofparallelprogrammingpatterns.

Inthispaper,weinvestigatetheperformanceandscalabilityofaparticularimagingapplication--Cannyedgedetection--ontheTileraTile64tiledmulticoreprocessor.Weapplyvariousparallelprogrammingpatterns,includingdivideandconquerandgeometricdecomposition,anddrawconclusionsabouttheirsuitabilitytotheapplicationandthetargetplatform.Aspartofourstudy,wewilldevelopimplementationsinwhichparallelismisexplicitlymanagedintheprogramandimplementationsinwhichparallelismismanagedbycompilerdirectivesandrun-timesystems.

7872-15, Session 5

Video transcoding using GPU accelerated decoderW.Hsu,AdvancedMicroDevices,Inc.(UnitedStates)

Combiningconsumerelectronics,digitalentertainment,andthepersonalcomputerplatformhasbecomeoneofthekeydrivingforcesofmodernPCdevelopment.Inthispaper,wepresentedtheimplementationofaUVD-accelerateddecoderMFTthroughMicrosoftDirectXVideoAcceleration(DXVA)interfaceforenhancingtheperformanceofdrag-and-droptranscodingofhigh-definitionvideosrunningonWindows7platforms.InwhichtheuncompressedvideoisdownsampledbyaGPU-basedresizetoconservememorybandwidthbetweentheGPUandCPU.ExperimentalresultsshowthisUVD-accelerateddecoderMFTisabletoperformreal-timeplaybackofhigh-definitionvideoonaPCwithextremelylowCPUusage.CombiningtheGPU-basedresolutionscalerandthedriverfeedback,theproposedtechnologycandoublethespeedofvideotranscodingof1080p(1920x1080)videocontentonaprocessorwithanintegratedgraphicsunitbyusinglessthanhalfCPUcapabilitycomparedtosoftwareVC1andH.264decoders.Althoughmodernhigh-speedmulti-coreCPUswithadvancedsingle-instructionmultiple-data(SIMD)architecturemayoutperformtheUVDunitinvideodecoding,UVDaccelerationwillalwaysbehelpfulinoff-loadingtheCPU’staskandincreasingthecomputationalcapacityofPCplatforms.

7872-16, Session 5

Real-time image deconvolution on the GPUsJ.T.Klosowski,S.Krishnan,AT&TLabs.Research(UnitedStates)

2Dimagedeconvolutionisanimportantandwell-studiedproblemwithapplicationstoimagedeblurringandrestoration.Mostofthebestdeconvolutionalgorithmsusenaturalimagestatisticsthatactaspriorstoregularizetheproblem.Recently,KrishnanandFergusprovideafastdeconvolutionalgorithmthatyieldresultscomparabletothecurrentstateoftheart.Theyuseahyper-Laplacianimagepriortoregularizetheproblem.Theresultingoptimizationproblemissolvedusingalternatingminimizationinconjunctionwithahalf-quadraticpenaltyfunction.Inthispaper,weprovideanefficientCUDAimplementationoftheiralgorithmontheGPU.

Ourimplementationleveragesmanywell-knownCUDAoptimizationtechniques,aswellasseveralothersthathaveasignificantimpactonthisparticularalgorithm.Wediscusseachofthese,aswellasmakeafewobservationsregardingtheCUFFTlibrary.OurexperimentswererunonannVidiaGeForceGTX260GPU.Forasinglechannelimageof

size710x470,weobtain40.6fps,whileonalargerimageofsize1900x1266,weget5.8fps(withoutcountingdiskI/O).Inadditiontolinearperformance,webelieveoursisthefirstimplementationtoperformdeconvolutionsatvideorates.

7872-17, Session 5

Stitching giga pixel images using parallel computingR.Kooper,P.Bajcsy,Univ.ofIllinoisatUrbana-Champaign(UnitedStates)

ThispaperaddressestheproblemofstitchingGigaPixelimagesfromairborneimagesacquiredovermultipleflightpathsofCostaRicain2005.Thesetofinputimagescontainsabout10,158images,eachofsizearound4072x4072pixels,withverycoarsegeoreferencinginformation(latitudeandlongitudeofeachimage).Giventhespatialcoverageandresolutionoftheinputimages,thefinalstitchedimageis294,847by269,195pixels(79.3GigaPixels).Ourapproachistoutilizethecoarsegeoreferencinginformationforinitialimagegroupingfollowedbyanintensity-basedstitchingofgroupsofimages.Thisgroup-basedstitchingishighlyparallelizable.Thestitchingprocessresultsinimagepatchesthatcanbecroppedtofitatileofanimagepyramidfrequentlyusedasadatastructureforfastimageaccessandretrieval.Wereportourpreliminaryexperimentalresultsobtainedwhenstitchingandpyramidtilinga4GigaPixelimagefromtheinputimagesatonefourthoftheiroriginalspatialresolutionusingasinglecoreonoureightcoreserver.Astheprocessingrequiresparallelcomputingapproachesinordertogeneratethefullresolutionimage,wearecollectingactualbenchmarksonparallelhardwareplatforms.

7872-18, Session 6

GPU-completeness: concept and implicationsI.Lin,Hewlett-PackardLabs.(UnitedStates)

Thispaperformalizesoneofamajorinsightintoaclassofalgorithmsthatrelatebetweenparallelismandperformance.Thepurposeofthispaperistodefineaclassofalgorithmsthattradesoffparallelismforqualityofresult(e.g.visualquality,compressionrate),andweproposeasimilarmethodforalgorithmicclassificationbasedonNP-Completenesstechniques,appliedtowardparallelacceleration.Wewilldefinethisclassofalgorithmas“GPU-Complete”andwillpostulatethenecessarypropertiesofthealgorithmsforadmissionintothisclass.Wewillalsoformallyrelatehisalgorithmicspaceandimagingalgorithmsspace.

WhileGPUsaremerelyonetypeofarchitectureforparallelization,weshowthattheirintroductionintothedesignspaceofprintingsystemsdemonstratethetrade-offsagainstcompetingmulti-core,FPGA,andASICarchitectures.Whileeacharchitecturehasitsownoptimalapplication,webelievethattheselectionofarchitecturecanbedefinedintermsofpropertiesofGPU-Completeness.

Forawell-definedsubsetofalgorithms,GPU-Completenessisintendedtoconnecttheparallelism,algorithmsandefficientarchitecturesintoaunifiedframeworktoshowthatmultiplelayersofparallelimplementationareguidedbythesameunderlyingtrade-off.

7872-20, Session 6

A parallel error diffusion implementation on a GPUY.Zhang,Univ.ofCalifornia,Davis(UnitedStates);J.Recker,R.A.Ulichney,G.B.Beretta,I.Tastl,I.Lin,Hewlett-PackardLabs.(UnitedStates);J.D.Owens,Univ.ofCalifornia,Davis(UnitedStates)

Withtheever-increasingprintingresolutionandspeed,digitalpresses



IS&T /

ReturntoContents

presenthighdemandsfortheprocessingpoweroftheRasterImagingProcess(RIP).Today’smassivelyparallelGPUscanpotentiallyprovideahigh-performanceandcost-effectivesolutionforRIPs.However,errordiffusion,asamajorstageintheprinterimagingpipeline,isinherentlyserialinitsoriginalform.Inthispaper,weinvestigatethesuitabilityoftheGPUforaparallelimplementationoftheerrordiffusionalgorithm.Wedemonstrateahigh-performanceGPUimplementationachievedbyefficientlymanagingthememoryusagetomeetthehardwareconstraints.OurGPUimplementationachievesa10-20xspeedupoverasequentialCPUerrordiffusionwithcomparableimagequality.Weconductvariousexperimentstostudytheperformanceandqualitytradeoffsfordifferencesinparallelism,randomization,blockandimagesizes.

7872-21, Session 6

Optimization of imaging algorithms on multiple core CPUsR.J.Moore,3MCo.(UnitedStates)

WiththereleaseofaneightcoreXeonprocessorbyIntelandatwelvecoreOpteronprocessorbyAMDinthespringof2010,theincreaseofmultiplecoresperchippackagecontinues.Multiplecoreprocessorsarecommonplaceinmostworkstationssoldtodayandareanattractiveoptionforincreasingimagingperformance.Mostimagingalgorithms,especiallylargedifferenceofGaussianfilters,segmentation,andregionfindingareverycomputeintensive.Inthispaperwepresentourworkinoptimizingtheperformanceofdifferentimagingalgorithmstorunonstandardmulti-coreWindowsworkstations.OurworkleveragestheOpenMPlibrariesinC++tocreateparallelforloops.Wewillpresentourexperienceingettingthebestperformanceforimagingalgorithmsfromtheselibraries.

7872-22, Session 7

Evaluation of CPU and GPU architectures for spectral image analysis algorithmsV.Fresse,Univ.JeanMonnetSaint-Etienne(France);D.Houzet,Gipsa-lab(France);C.Gravier,TelecomSaintEtienne(France)

GraphicalProcessingUnits(GPU)architecturesaremassivelyusedforresource-intensivecomputation.Initiallydedicatedtoimaging,visionandgraphics,thesearchitecturesservenowadaysawiderangeofmulti-purposeapplications.TheGPUstructure,however,doesnotsuitallapplications.Thiscanleadtoperformanceshortage.Amongseveralapplications,theaimofthisworkistoanalyzeGPUstructuresforimageanalysisapplicationsinmultispectraltoultraspectralimaging.Algorithmsusedfortheexperimentsaremultispectralandhyperspectralimagingdedicatedtoartauthentication.Suchalgorithmsuseahighnumberofspatialandspectraldata,alongwithbothahighnumberofmemoryaccessesandaneedforhighstoragecapacity.TimingperformancesarecomparedwithCPUarchitectureandaglobalanalysisismadeaccordingtothealgorithmsandGPUarchitecture.ThispapershowsthatGPUarchitecturesaresuitabletocompleximageanalysisalgorithminmultispectral.

7872-23, Session 7

Computational scalability of large size image disseminationR.Kooper,P.Bajcsy,Univ.ofIllinoisatUrbana-Champaign(UnitedStates)

Wehaveinvestigatedthecomputationalscalabilityofimagepyramidbuildingneededfordisseminationofverylargeimagedata.Thesourcesoflargeimagesincludehighresolutionmicroscopesandtelescopes,remotesensingandairborneimaging,andhighresolutionscanners.Theterm‘large’isunderstoodfromauserperspectivewhichmeanseitherlargerthanadisplaysizeorlargerthanamemory/

disktoholdtheimagedata.TheapplicationdriversforourworkaredigitizationprojectssuchastheLincolnPapersproject(eachimagescanisabout100-150MBorabout5000x8000pixelswiththetotalnumbertobearound200,000)andtheUIUClibraryscanningprojectforhistoricalmapsfrom17thand18thcentury(smallernumberbutlargerimages).Thegoalofourworkisunderstandcomputationalscalabilityoftheweb-baseddisseminationusingimagepyramidsfortheselargeimagescans,aswellasthepreservationaspectsofthedata.WereportourcomputationalscalabilitybenchmarksusingtheMicrosoftSeadragonlibraryforbuildingimagepyramids,hyper-threadingforcomputationexecutionandvariousharddriveconfigurationssuchasRAIDdrivesforinput/outputoperations.Thebenchmarksareobtainedwithamap(334.61MB,JPEGformat,17591x15014pixels).Thediscussioncombinesthespeedandpreservationobjectives.

7872-25, Session 8

Real-time 3D flash ladar imaging through GPU data processingC.M.Wong,C.Bracikowski,B.Baldauf,S.Havstad,NorthropGrummanAerospaceSystems(UnitedStates)

Wepresentreal-time3DimageprocessingofflashladardatausingourrecentlydevelopedGPUparallelprocessorkernels.Ourlaboratoryandairborneexperienceswithflashladarfocalplaneshaveshownthatperlaserflash,typicallyonlyasmallfractionofthepixelsontheFPAactuallyproduceameaningfulrangesignal.Therefore,tooptimizeoveralldataprocessingspeed,thislargequantityofuninformativedatashouldbefilteredoutandremovedfromthedatastreampriortothemathematicallyintensivedataprocessing.Thisfront-endpre-processing,whichlargelyconsistsofcontrolflowinstructions,isspecifictotheexacttypeofflashladarfocalplanearrayused.ThevalidsignalsalongwiththeircorrespondinginertialnavigationalmetadataarethentransferredtoaGPUdevicetoperformrange-correction,geo-location,andortho-rectificationoneach3Ddatapointsothatdatafrommultipleframescanbeproperlytiledtogethereithertocreateawide-areamaportoreconstructanobjectfrommultiplelookangles.GPUparallelprocessorkernelsweredevelopedusingtheOpenCLapplicationprogramminginterface.Post-processingtoperformfineregistrationbetweendataframesviacomplexiterativestepsalsobenefitsgreatlyfromthistypeofhigh-performancecomputing.TheperformanceimprovementsobtainedusingGPUprocessingtocreatecorrected3Dimagesandforframe-to-framefine-registrationarepresented.

7872-26, Session 8

Advanced MRI reconstruction toolbox with accelerating on GPUX.Wu,Y.Zhuo,J.Gai,F.Lam,M.Fu,J.P.Haldar,W.Hwu,Z.Liang,B.P.Sutton,Univ.ofIllinoisatUrbana-Champaign(UnitedStates)

Inthispaper,wepresentafastiterativeMRimagereconstructionalgorithmtakingadvantageoftheprevailingGPGPUprogrammingparadigm.Inclinicalenvironment,MRimagereconstructionisusuallyperformedviafastFouriertransform(FFT).However,imagingartifacts(signallossandsignaldistortions)resultingfromsusceptibility-inducedmagneticfieldinhomogeneitiesdegradethequalityofreconstructedimages.Theseartifactsmustbeaddressedusingaccuratemodelingofthephysicsofthesystemcoupledwithiterativereconstruction.Wehavedevelopedareconstructionalgorithmwithimprovedimagequalityattheexpenseofcomputationtime.Hence,animplementationonGPUsisproposed,achievingsignificantspeedup.TheproposedalgorithmimplementsaconjugategradientreconstructionusingexplicitFouriertransform(FT)inordertomodelthefieldinhomogeneityanditsgradients.Inaddition,asmoothingconstraintisincludedintheformofsparsematrixregularizationinordertoreducenoiseinreconstructedimages.Weapplythecompilationoptimizationsfromlevelsofalgorithm,programcodestructures,andspecificarchitecture



IS&T /

ReturntoContents

performancetuning,featuringbothourMRIreconstructionalgorithmandGPUhardwarespecifics.ThecurrentGPUimplementationproducesaccurateimageestimateswhileacceleratingthereconstructionbytwoordersofmagnitudes.Futuredirectionsincludefurtheroptimizationofcurrentandhigher-dimensionapproach.

7872-27, Session 8

Accelerating image recognition on mobile devices using GPGPUM.BordalloLopez,H.Nykänen,J.Hannuksela,O.J.Silvén,Univ.ofOulu(Finland);M.Vehviläinen,NokiaResearchCtr.(Finland)

Thefuturemultimodaluserinterfacesofbattery-poweredmobiledevicesareexpectedtorequireenergy-efficientcomputationallycostlyimageanalysistechniques.

GPUcomputingiswellsuitedforparallelprocessing.Theadditionofprogrammablestagesandhighprecisionarithmeticprovideforopportunitiestoimplementenergy-efficientcompletealgorithmsonGPU.Atthemomentthefirstmobilegraphicsacceleratorswithprogrammablepipelinesareavailable,enablingtheGPGPUimplementationofseveralimageprocessingalgorithms.

Inthiscontext,weconsiderafacetrackingapproachthatusesefficientgray-scaleinvarianttexturefeaturesandboosting.ThesolutionisbasedontheLocalBinaryPattern(LBP)featuresandmakesuseoftheGPUonthepre-processingandfeatureextractionphase.

WehaveimplementedaseriesofimageprocessingtechniquesintheshaderlanguageofOpenGLES2.0,compiledthemforamobilegraphicsprocessingunitandperformedtestsonamobileapplicationprocessorplatform(OMAP3530).

Inourcontribution,wedescribethechallengesofthedesignonamobileplatform,presenttheperformanceachievedandprovidemeasurementresultsfortheactualpowerconsumptionincomparisontousingtheCPU(ARMv7)onthesameplatform.

OurexperimentsshowhowaconsiderablespeedupofimageprocessingapplicationscanbeachievedbythesimultaneoususeoftheGPUandtheCPUonmobiledevices.

7872-28, Session 8

Multi-view stereo reconstruction via voxels clustering and parallel volumetric graph-cut optimizationY.Zhu,Y.Zhang,TsinghuaUniv.(China)

Traditionalmethodsofmulti-viewstereo(MVS)viavolumetricgraph-cutformulatethemulti-viewscenereconstructionproblemintoacomputationallytractableglobaloptimizationusinggraph-cut.Theoptimalsurfacecanbeobtainedasthemax-flow/min-cutsolutionoftheweightedgraph.Withtheresolutionofcubicalvoxelsbecomelarger,thereconstructionaccuracyisincreasedgradually,however,thephotoconsistencyestimationforvoxelsandthegraph-cutcomputationincreasingrapidly,ifmoreedgesbetweenneighborvoxelareaddedintograph,thedesktopcomputernearlycan’thandletheproblem.Therefore,thecontradictionbetweencomputationandaccuracyneedstobereducedurgently.Inourpaper,wearefocusingonimprovingtheperformanceofgraph-cutbasedMVSalgorithmwiththehelpofmulti-coreCPUs.Wehavedevelopedasystemtodemonstratetheclusteringanditcanutilizetheparalleloptimizationalgorithmforthisproblem.Akeytechnicalcontributionisthevoxelsclusteringalgorithm,itdividesthevoxelsintoseveraloverlappingclusters,afterthatgraph-cutalgorithmcanbeusedtoreconstructeachareaofthesurfaceinparallel.Finally,theoptimizationresultineachsubgraphiscollectedtogetherandthelabelsonoverlappedvoxelsareforcedtobeconsistentinteractively.Thistechniquedemonstratedfastergraphbasedmulti-viewvolumetricreconstructioncomputationswhenmultipleCPUcoresareavailable.Thereconstructionresultsarecomparablewiththestate-of-artMVSmethods.

7872-29, Session 8

A GPU accelerated PDF transparency engineJ.Recker,I.Lin,I.Tastl,Hewlett-PackardLabs.(UnitedStates)

Ascommercialprintingpressesbecomefaster,cheaperandmoreefficient,sotoomusttheRasterImageProcessors(RIP)thatprocessandfeedthemdatatoprint.DigitalpressRIPs,however,havebeenchallengedtobothmeettheeverincreasingprintperformanceofthelatestdigitalpresses,andthemorecommonuseofadvancedpixelprocessingsuchasICCcolorprofilespecifiedcolorspacesandtransparentimagelayers.Asaresult,thecostoftheRIPsdeployedatsomeofthemoredemandingPrintServiceProviders(PSP)canexceed$250,000.

ThispaperexploresthechallengesencounteredwhenimplementingaGPUaccelerateddriverfortheopensourceGhostscriptAdobePostScriptandPDFlanguageinterpretertargetedatacceleratingPDFtransparencyforhighspeed,largepagesizecommercialpresses.Itfurtherdescribesoursolution,includinganimagememorymanagerfortilinginputandoutputimages,acacheofdynamicallycompiledGPUprogramsforanacceleratedICCv4compatiblecolortransformationengine,andanAdobePDFcompatiblemultipleimagelayerblendingengine.Theresult,webelieve,isthefoundationforascalable,efficient,distributedRIPsystemthatcanmeetcurrentandfutureRIPrequirementsforawiderangeofcommercialdigitalpresses.


Infrared small target tracking based on SOPCT.Hu,ElectronicEngineeringInstituteofHefei(China);X.Fan,Univ.ofScienceandTechnologyofChina(China);Y.Zhang,TsinghuaUniv.(China);Z.Chen,B.Zhu,ElectronicEngineeringInstituteofHefei(China)

Thetrackingofinfraredsmalltargetshasbeenakeytechnologyinthefieldofsatelliteearlywarning,preciseguidance,surveillanceanddetection.ThepaperpresentsalowcostFPGAbasedsolutionforareal-timeinfraredsmalltargettrackingsystem.AspecializedarchitectureispresentedbasedonasoftRISCprocessorcapableofrunningkernelbasedmeanshifttrackingalgorithm.MeanshifttrackingalgorithmisrealizedinNiosIIsoft-coreWithSOPCtechnology.Thoughmeanshiftalgorithmiswidelyusedfortargettracking,theoriginalmeanshiftalgorithmcannotbedirectedusedforinfraredsmalltargettracking.Asinfraredsmalltargetonlyhasintensityinformation.Soanimprovedmeanshiftalgorithmispresentedinthispaper.Howtodescribetargetwilldetermentwhethertargetcanbetrackedbymeanshiftalgorithm.Becausecolortargetcanbetrackedwellbymeanshiftalgorithm,imitatingcolorimageexpression,spatialcomponentandtemporalcomponentareadvancedtodescribetarget,whichformspseudo-colorimage.Theexperimentalresultsshowthatinfraredsmalltargetistrackedstablyincomplicatedbackground.Inordertoimprovetheprocessingspeedparalleltechnologyandpipelinetechnologyistaken.TwoRAMaretakentostoredimagesseparatelybyping-pongtechnology.AFLASHisusedtostoremasstempdata.


A novel method for multi-view synthesis using relative affine structureZ.Huo,NanjingUniv.ofPostsandTelecommunications(China)

Nowadays,thereareincreasingresearchinterestsinFreeviewpointTelevision(FTV)thatoffersarbitraryviewsof3Dscene.Viewsynthesisisimportantforprediction-basedcompressionandnovelviewdisplayinFTVandothermultiviewimagingapplicationsuchas3DTV.

Viewsynthesisconsistsinrenderingimagesofasceneasiftheyweretakenfromavirtualviewpointdifferentfromalltheviewpointsoftherealviews.Thispaperproposesanovelapproachformultiviewsynthesisinwhichtheprocessofviewsynthesisisbasedonthe



IS&T /

ReturntoContents

relativeaffinestructure.Theadvantageisthatphotographsofrealscenescanbeusedasabasistocreateveryrealisticimages,andrenderingtimeisdecoupledfromthecomplexityofthescene.Thevirtualcamerapositionisspecifiedinanuncalibratedsetting-upviatheinterpolationofthemotioninformationamongthereferenceviews.ThismethodyieldsaparametricfamilyofcameraposesthatdescribesasmoothtrajectoryintheEuclideanspaceastheparametersvarycontinuously.Thenewsynthesizedimagescanberenderedfromvirtualcamerasmovingonthegivenparameterizedtrajectories.

Syntheticandrealexperimentalresultsillustratethatthemethodhasadvtangesoflowcomputationalcomplexitywithouttheerrorproducedfromdepthestimation.



IS&T /

ReturntoContents

Conference 7873: Computational Imaging IXMonday-Tuesday24-25January2011PartofProceedingsofSPIEVol.7873ComputationalImagingIX

7873-02, Session 2

Myopic reconstruction and its application to MRFM dataS.U.Park,Univ.ofMichigan(UnitedStates);N.Dobigeon,Univ.deToulouse(France);A.O.HeroIII,Univ.ofMichigan(UnitedStates)

Weproposeasolutiontotheimagedeconvolutionproblemwheretheconvolutionoperatororpointspreadfunction(PSF)isassumedtobeonlypartiallyknown.Smallperturbationsgeneratedfromthemodelareexploitedtoproduceafewprincipalcomponentsexplainingtheuncertaintyinahighdimensionalspace.Specifically,weassumetheimageissparsesincewefocusonrecoveringmagneticresonanceforcemicroscopy(MRFM)data.

Unlikeprevioustrials,ourapproachisstochastic,withintheBayesianMetropolis-within-Gibbssamplingprocedureforestimationoftheconvolutionkernelandtheimages.OuralgorithmiscomparedtoshowtheperformancewithpreviousBayesianapproachandalternatingminimization(AM)algorithmasblinddeconvolutionmethodonasparsesyntheticimage,anditisappliedtoMRFMdata.

7873-04, Session 2

Seismic imaging of transmission overhead line structure foundationsD.Vautrin,InstitutdeRechercheenCommunicationsetenCybernétiquedeNantes(France);M.Voorons,EcolePolytechniquedeMontréal(Canada);J.Idier,InstitutdeRechercheenCommunicationsetenCybernétiquedeNantes(France);Y.Goussard,EcolePolytechniquedeMontréal(Canada);S.Kerzalé,ApsideTechnologies(France);N.Paul,EDFRecherche&Developpement(France)

Theobjectiveofthepresentedworkistoretrievetheshapeoftransmissionlinestructurefoundationsusingaseismicimagingapproach.Thisnon-destructivetestingproblemisformulatedasaseismicinversescatteringproblemwheretwo-dimensionalmapsofthepressure-andshare-wavevelocitiesareestimatedfromthemeasureddataset.

Theinversionamountstoalarge-scale,nonlinearprogrammingproblem.Itisrenderedallthemoredifficultgiventhelargedimensionsofthescatteringobjectandthelargevelocitycontrasts.Inthiscontext,ourgoalistoproposeaninversionschemethatproducespreciseimageswithanacceptablecomputationaleffort.

Thisgoalismetbycombiningthefollowingelements:weminimizeapenalizedleast-squarecriterionunderthepositivityconstraintwiththeL-BFGS-Balgorithm,weworkinthefrequencydomaintointroducethemeasureddatainaprogressivewayandweintroducealogarithmicvariablesubstitution.

Ourmaincontributionistheintroductionofanewworkingvariable.Itcounterbalancesthelackofsensibilityofthecriterionandresultsinasignificantaccelerationoftheinversionprocess.Thisisconfirmedonsyntheticexamples:theconvergenceisreachedtwotoeighttimesfasterthanwiththeinitialworkingvariables.

7873-05, Session 2

Inverse problems for cryo electron microscopy of viruses: randomly oriented projection images of random 3D structures in noiseQ.Wang,P.C.Doerschuk,CornellUniv.(UnitedStates)

Instancesofbiologicalmacromolecularcomplexesthathaveidenticalchemicalconstituentsmaynothavethesamegeometrydueto,forexample,flexibility.Cryoelectronmicroscopyprovidesonenoisyprojectionimageofeachofmanyinstancesofacomplexwheretheprojectiondirectionsforthedifferentinstancesarerandom.Thenoiseissufficientsevere(SNR<<1)thattheprojectiondirectionforaparticularimagecannotbeeasilyestimatedfromtheindividualimage.Thegoalistodeterminethe3-Dgeometryofthecomplex(the3-Ddistributionofelectronscatteringintensity)whichrequiresfusinginformationfromthesemanyimagesofmanycomplexes.Inordertodescribethegeometricheterogeneityofthecomplexes,thecomplexisdescribedasaweightedsumofbasisfunctionswheretheweightsarerandom.Inordertogettractablealgorithms,theweightsaremodeledasGaussianrandomvariableswithunknownstatisticsandthenoiseismodeledasadditiveGaussianrandomvariableswithunknowncovariance.Thestatisticsoftheweightsandthestatisticsofthenoisearejointlyestimatedbymaximumlikelihoodbyageneralizedexpectationmaximizationalgorithm.AnexampleusingtheseideasonimagesofFlockHouseVirusisdescribed.

7873-06, Session 2

Inverse problems arising in different synthetic aperture radar imaging and a general Bayesian approach for themS.Zhu,A.Mohammad-Djafari,Lab.desSignauxetSystèmes(France)

SyntheticApertureRadar(SAR)imagingsystemsarenowadaysverycommontechnicsofimaginginremotesensingandenvironmentsurvey.Therearedifferentacquisitionmodes:Spotlight,Stripbands,Interferometric,Polarimetricanddifferentgeometries:mono-,bi-andmulti-static.

Inafirstapproximation,therelationbetweenthemeasureddataandthescenecanbemodelledbyalinearrelation.Inthispaper,first,usingthislinearforwardmodel,acommoninverseproblemframeworkforallofthemisgivenandthenageneralprobabilisticBayesianestimationmethodispresentedforimagereconstructionproblems.Inparticular,weconsidertwopriorswhichpermitparcimoniousmodelingofthescene,oneforpointsourcesdirectlyintheimagedomaineandthesecondbyrepresentingthesceneonadictionnarybasedelementaryfunctions.ForBayesiancomputationswewillconsiderandcomparetheMAPandtheposteriormeanestimates.Wewillshowtheperformancesoftheproposedmethodsonsomesimulatedandrealdata.

7873-07, Session 2

Medical image enhancement using resolution synthesisT.Wong,C.A.Bouman,PurdueUniv.(UnitedStates);J.Thibault,GEHealthcare(UnitedStates);K.D.Sauer,Univ.ofNotreDame(UnitedStates)

Noabstractavailable


IS&T /

ReturntoContents

7873-08, Session 3

An open level set framework for image segmentation and restoration using the Mumford and Shah modelR.Mohieddine,L.A.Vese,Univ.ofCalifornia,LosAngeles(UnitedStates)

Intwodimensions,theMumfordandShahfunctionalallowsforminimizersu(theimage)andK(theedgeset)suchthatthesetKcouldincludebothclosedandopencurves.Thecurrentlevelsetbasedsegmentationalgorithmscanonlydetectobjectswithclosededges,whichareboundariesthatareformedbyclosedloops.Weproposeanefficientlevelsetbasedalgorithmforsegmentingimageswithedgeswhicharemadeupofopencurvesorcrack-tips.ByadaptingP.Smereka’sopenlevelsetformulationtovariationalproblems,weareabletoextendthecurrentlevel-setbasedimagesegmentationmethods,suchasChan-Vese.Thealgorithmretainsmanyoftheadvantagesofusinglevelsets,suchasawell-definedboundariesandabilitytochangetopologies.WesolvetheresultingevolutionequationsusingSobolevgradients,avoidingtheneedforregularizationorre-initializationofthelevelsetsfunctionswhilealsoacceleratingconvergencetothereconstructedimage.AnothermodelderivedbyMumfordandShahsolelyforthedetectionoftheedgesetispresentedandisreformulated.Thissecondmodelisusefulwhenoneonlydesiresinformationonobjectboundariesratherthanareconstructedimage,suchaspathologydetectioninmedicalimaging,objectdetectionandlocationincomputervision,objecttrackinginmultipleimages,etc.Finally,wepresentthenumericalimplementationwithvariousexamplescomparingthismethodtobothclosedlevelsetandgeneraledgedetectionmethods.

7873-09, Session 3

Video indexing and retrieval using Fisher information nonlinear embeddingX.Chen,A.O.HeroIII,Univ.ofMichigan(UnitedStates)

Inthispaper,wepresentanovelinformationembeddingbasedapproachforvideoindexingandretrieval.Thehighdimensionalityandcomplexityofvideosequencesposesamajorchallengetovideoindexingandretrieval.DifferentfromthetraditionaldimensionalityreductiontechniquessuchasPrincipalComponentAnalysis(PCA),LinearDiscriminantAnalysis(LDA),weembedthevideodataintoalowdimensionalstatisticalmanifoldobtainedbyapplyingmanifoldlearningtechniquestotheinformationgeometryofvideofeatureprobabilitydistributions(PDF).WeestimatethefeaturePDFofthevideofeaturesusinghistogramestimationandGaussianmixturemodels(GMM),respectively.Bycalculatingthesimilaritiesbetweentheembeddedtrajectories,wedemonstratethattheproposedapproachoutperformstraditionalapproachestovideoindexingandretrievalwithrealworlddataintermsofprecisionandrecall.

7873-10, Session 3

Segmentation assisted food classification for dietary assessmentF.Zhu,M.Bosch,T.R.Schap,N.Khanna,D.S.Ebert,C.J.Boushey,E.J.DelpIII,PurdueUniv.(UnitedStates)

Accuratemethodsandtoolstoassessfoodandnutrientintakeareessentialforresearchontheassociationbetweendietandhealth.Preliminarystudieshaveindicatedthattheuseofamobiledevicewithabuilt-incameratoobtainimagesofthefoodconsumedmayprovidealessburdensomeandmoreaccuratemethodfordietaryassessment.Wehavedevelopedmethodstoidentifyfooditemsusingasingleimageacquiredfromthemobiledevice.Ourgoalistoautomaticallydeterminetheregionsinanimagewhereaparticularfoodislocated(segmentation)andcorrectlyidentifythefoodtype

basedonitsfeatures(classificationorfoodlabeling).ImagesoffoodsaresegmentedusingNormalizedCutsbasedonintensityandcolor.Localfeaturesareextractedfromimagepatchesaroundkey-pointsdetectedbyScale-invariantfeaturetransform(SIFT),wealsoextractglobalfeaturesofeachsegmentedfoodregion.Classificationdecisionforeachclassifier(orfeaturespace)ismadebychoosingthefooditemwhichisattheminimumdistanceinthatfeaturespace.Thefinaldecisionismadebyfusingthecandidateoutputsofseparateclassifiersbyamajorityvoterule.Segmentationofeachfoodregionisrefinedbasedonfeedbackfromtheoutputofclassifierstoprovidemoreaccurateestimationofthequantityoffoodconsumed.

7873-46, Session 3

Joint pose estimation and image segmentation for monocular articulated trackingL.M.Huffman,I.Pollak,PurdueUniv.(UnitedStates)

Theneedforautomatedtrackingofahuman’sposefromvideodatahasariseninapplicationsfromsurveillancetohuman-machineinteraction.Poseestimationisparticularlydifficultinmonocularapplicationswhicharerifewithtroublesomeself-occlusions.Weproposeanovelestimationmethodforarticulatedhumantrackingbasedonjointlymodelingtheobservedvideosequence,thearticulatedbodyandthefieldofbackgroundandforegroundlabels,whichaidintheresolutionofself-occlusions.Wedevelopastatisticalappearancemodelofthehumaninasceneandperformgraphicalinferenceusingnonparametricbeliefpropagationtoreliablytrackasinglehumaninvideosequencescapturedusingasinglestationarycamera.

7873-34, Session 4

Sparse Fisher linear discriminant analysisH.A.Siddiqui,H.Hwang,QualcommInc.(UnitedStates)


7873-13, Session 5

Characterization of moving dust particlesB.J.Bos,S.R.Antonille,N.Memarsadeghi,NASAGoddardSpaceFlightCtr.(UnitedStates)

Alargedepth-of-fieldParticleImageVelocimeter(PIV)hasbeendevelopedatNASAGSFCtocharacterizedynamicdustenvironmentsonplanetarysurfaces.Thisinstrumentdetectsandsenseslofteddustparticles.Tocharacterizeadynamicplanetarydustenvironment,theinstrumentwouldhavetooperateforatleastseveralminutesduringanobservationperiod,easilyproducingmorethanaterabyteofdataperobservation.Givencurrenttechnology,thisamountofdatawouldbeverydifficulttostoreonboardaspacecraftanddownlinktoEarth.WehavebeendevelopinganautonomousimageanalysisalgorithmarchitectureforthePIVinstrumenttogreatlyreducetheamountofdatathatithastostoreanddownlink.ThealgorithmanalyzesPIVimagesandreducestheimageinformationdowntoonlytheparticlemeasurementdataweareinterestedinreceivingontheground-typicallyreducingtheamountofdatatobehandledbymorethantwoordersofmagnitude.

WegiveageneraldescriptionofthePIValgorithmsanddescribeonlythealgorithmforestimatingthedirectionandvelocityofthetravelingparticlesinmoredetail,whichwasdonebytakingadvantageoftheopticalpropertiesofmovingdustparticlesandimageprocessingtechniques.Ourexperimentsonsimulatedparticlesimplyanaverageabsoluteerroroflessthan4degreesfordirectionestimationoftravelingparticleswhentheblurringfilterlengthwasgreaterthan3pixelslong.Ouralgorithmsperformedwellforsimulateddatainpresenceofsmallamountsofnoise.

Conference 7873: Computational Imaging IX


IS&T /

ReturntoContents

7873-14, Session 5

A super-resolution algorithm for enhancement of flash lidar dataA.Bulyshev,AnalyticalMechanicsAssociates,Inc.(UnitedStates);M.D.Vanek,F.Amzajerdian,NASALangleyResearchCtr.(UnitedStates);D.F.Pierrottet,CoherentApplications,Inc.(UnitedStates);G.D.Hines,R.A.Reisse,NASALangleyResearchCtr.(UnitedStates)

AnovelmethodoftheenhancementofthespatialresolutionofFlashLidarimagesinapplicationtothesurfacemapgenerationisproposed.TheabilityoftheFlashLIDARtogenerate3-dimensionalmapsofthelandingsiteareaduringthefinalstagesofthedescentphasefordetectionofhazardousterrainfeaturesisunderstudyintheframeofALHATproject.MajorgoalsofthisalgorithmaretocreateaDigitalElevationMap(DEM)coveringasufficientlylargeareaandwithacceptableaccuracyandprecisionandtoretrievetherelativetrajectory.ThealgorithmisutilizinganiterativeschemewhichupdatesDEMrelatedtothehighresolutiongridandfindsthenexttrajectorypointusingoneLidarframeatatime.AbackprojectionalgorithmisusedfortheDEMcreationand6-dgeneralizationofLucas-KanademethodisutilizedtoretrievethetrajectoryofthespacecraftandLidarattitude.Performanceofthesuper-resolutionalgorithmhasbeenanalyzedthroughaseriesofsimulationruns.Theresultsshowthatachievedlevelofaccuracyandprecisioningeneratinganelevationmapofthelandingsiteisadequatefordetectinghazardousterrainfeaturesandidentifyingsafeareas.

7873-17, Session 5

Image registration for stability testing of MEMSN.Memarsadeghi,J.LeMoigne,P.N.Blake,NASAGoddardSpaceFlightCtr.(UnitedStates);P.A.Morey,BallAerospace&TechnologiesCorp.(UnitedStates);W.B.Landsman,AdnetSystemsInc.(UnitedStates);V.J.Chambers,S.H.Moseley,NASAGoddardSpaceFlightCtr.(UnitedStates)

Imageregistration,oralignmentoftwoormoreimagescoveringthesamescenesorobjects,isofgreatinterestinmanydisciplinessuchasremotesensing,medicalimaging,astronomy,andcomputervision.Inthispaper,weintroduceanewapplicationofimageregistrationalgorithms.Wedemonstratehowthroughawaveletbasedimageregistrationalgorithm,engineerscanevaluatestabilityofMicro-Electro-MechanicalSystems(MEMS).Inparticular,weappliedimageregistrationalgorithmstoassessalignmentstabilityoftheMicroShuttersSubsystem(MSS)oftheNearInfraredSpectrograph(NIRSpec)instrumentoftheJamesWebbSpaceTelescope(JWST).ThisworkintroducesanewmethodologyforevaluatingstabilityofMEMSdevicestoengineersaswellasanewapplicationofimageregistrationalgorithmstocomputerscientists.

7873-35, Session 5

Shape-based segmentation of alloy micrographs using matching pursuitsL.M.Huffman,I.Pollak,PurdueUniv.(UnitedStates);J.P.Simmons,AirForceResearchLab.(UnitedStates);M.DeGraef,CarnegieMellonUniv.(UnitedStates)

Computerizedanalysisofalloymicrographsisexpectedtorevolutionizematerialsdevelopmentbyreplacinglaboriousphysicaltestingwithcomputersimulations.Thesesimulationsutilizesegmentationsofalloymicrographswhichindicatethearrangementofmaterialprecipitates.

Abundantpriorinformationistypicallyavailableregardingtheshapeoftheprecipitates,andanyviablesegmentationmethodmustproperlyaccountforsuchinformation.Weproposeanovelapplicationof

matchingpursuitstoconstructamicrographsegmentationconsistingofrectangleswhichmatchthevariablysizedandorientedrectangularmaterialprecipitates.

7873-18, Session 6

Capacitive touch sensing: signal and image processing algorithmsZ.I.Baharav,CorningInc.(UnitedStates);R.Kakarala,NanyangTechnologicalUniv.(Singapore)

Inthisworkwewillanalyzethemostcommonmethodofprojectedcapacitivesensing,thatofabsolutecapacitivesensing,togetherwiththemostcommonsensingpattern,thatofdiamond-shapedsensors.

Weformulatetheproblemasareconstructionfromprojections,andconsiderseveralaspects.Thefirstaspectisthatofusagescenarios,whichgiverisetoissueslikeworkingwithsmallandbigfingers,proximitydetection,stabilityofinterpolatedlocation,linearityoflinedrawing,andalike.Thesecondaspectrelatestomanufacturingvariations,andincludethingslikevariationinthethicknessofcoverglass(orPET),uniformityofITOlayer,minimumfeaturesizesonITO,backgroundcapacitance,etc.Thelastaspectweconsiderrelatestonoisesources,likeLCDnoise,electronicsamplingnoise,RFnoisecoupledin,finger-couplednoise,andmanymore.Thesefactors(usagemode,physicalvariations,andnoise)guidetheevaluationcriteriaforvariousalgorithmsused,inadditiontotheusualconstraintsofcomputationpowerandmemory.Algorithmsconsideredincludesimplelinearinterpolation(globalandlocal),curvefitting(bilinearinterpolation),filtering,generallook-up-table,andcombinationsthereof.Wediscussthemeritsofusingseparablealgorithms(asaremostcommonlyusedtoday),andcomparethemtooptimalones.

7873-19, Session 6

Denoising, deblurring, and super-resoluton in mobile phonesF.Sroubek,J.Kamenicky,J.Flusser,InstituteofInformationTheoryandAutomation(CzechRepublic)

Currentmobilephonesareequippedwithlow-budgetdigitalcamerasandverypooroptics.Consequently,imagesacquiredbythesecamerashaveeffectiveresolutionlowerthanthenumberofpixelsandcontainaconsiderableamountofnoiseand/orbluriflightconditionsarepoor.Weproposeanovelalgorithmwhichtakesasetofacquiredimagesfromsuchcamerasandperformssimultaneouslythreetasks:denoising,deblurringandresolutionenhancement.Theamountofeachdependsonthecharacteristicsoftheinputset.Thealgorithmimplementsasingleframework,whichweformulateasanenergyminimizationproblem,wheretheenergyfunctioncomprisesthreeterms.Thefirstoneisadatatermthatmodelstheacquisitionprocess.Thesecondandthirdareregularizationtermsthatactasimageandblurpriors,respectively.Sinceweworkwithmorethanoneimage,acriticalpreprocessingstepisaccurateimageregistration.Forthispurpose,weproposetouseamethodbasedonopticalflow,whichprovidessub-pixelaccuracy.Wedemonstratetheperformanceoftheproposedmethodonasystemwithacamerainamobilephone(orwebcamera)andaPC.

7873-20, Session 6

Arabic word recognizer for mobile applicationsN.Khanna,G.Abdollahian,B.Brame,M.Boutin,E.J.DelpIII,PurdueUniv.(UnitedStates)

WhentravelinginaregionwherethelocallanguageisnotwrittenusingtheRomanalphabet,translatingwrittentext(e.g.,documents,roadsigns,orplacards)isaparticularlydifficultproblemsincethetext



IS&T /

ReturntoContents

cannotbeeasilyenteredintoatranslationdeviceorsearchedusingadictionary.Toaddressthisproblem,wearedevelopingthe“RosettaPhone,”ahandhelddevice(e.g.,PDAormobiletelephone)capableofacquiringapictureofthetext,locatingtheregion(word)ofinterestwithintheimage,andproducingbothanaudibleandavisualEnglishinterpretationofthetext.ThepresentsystemistargetedforinterpretingwordswritteninArabiccharacterset.Wehavetestedourautonomous,segmentation-freephraserecognizeronadictionaryoffivethousandPashtowords.Presentsystemoffersclosetoperfectuniquenessandmorethan75%recognitionaccuracyondifferentnoisyversionsofthewordsinthisdictionary.GiventhelimitedimagequalityandresolutionforiPhoneimages,recognitionaccuracyisalsoafunctionoffontsizesasdemonstratedbyexperimentsondocumentsscannedatdifferentresolutions(150DPIto300DPI).

7873-21, Session 6

Volume estimation using food specific shape templates in mobile image-based dietary assessmentJ.Chae,I.Woo,S.Kim,R.Maciejewski,F.Zhu,E.J.DelpIII,C.J.Boushey,D.S.Ebert,PurdueUniv.(UnitedStates)

Giventhemountingconcernsofchildhoodandadultobesity,methodsarebeingdevelopedforpreventionandintervention.Keycomponentsinthesemethodsincludetherecording,catalogingandanalysisofdailydietaryrecordstomonitorenergyandnutrientintakes.Giventheubiquityofmobiledeviceswithbuilt-incameras,onepossiblemeansofimprovingdietaryassessmentisthroughphotographingfooditemsandinputtingtheseimagesintoasystemthatcandeterminethenutrientcontentofthefoodintheimages.Akeyproblemwithsuchimage-baseddietaryassessmenttoolsistheaccurateandconsistentestimationoffoodportionsize.Weproposeamethodtoautomaticallyestimatefoodvolumesthroughtheuseoffoodspecificshapetemplates.Weareabletoreconstructpropertiesofthe3Dscenefromasingleimage.Eachclassifiedfooditembyimagesegmentationmethodscorrespondstoaparticularfoodtemplateshape.Forvolumecomputation,weuseeitherafeaturepointextractionalgorithmortheactivecontourmethodologytosizeourshapetemplatesaccordingly.Byapplyingthistemplate-basedapproach,weareabletoautomaticallyestimatefoodportionsize.Thisleadstoreducedburdenonusershavingtoestimateportionsconsumedandprovidesaconsistentmethodforestimatingfoodvolume.

7873-37, Session 8

Spectral x-ray CT imaging using energy sensitive photon counting detectorsK.Taguchi,TheJohnsHopkinsOutpatientCtr.(UnitedStates)


7873-38, Session 8

Toward material characterization using dual energy x-ray CTJ.A.O’Sullivan,B.R.Whiting,D.G.Politte,WashingtonUniv.inSt.Louis(UnitedStates);J.F.Williamson,VirginiaCommonwealthUniv.(UnitedStates)

Noabstractavailable

7873-39, Session 8

A hybrid approach to imaging and anomaly characterization from dual energy CT dataE.L.Miller,O.Semerici,TuftsUniv.(UnitedStates)

Noabstractavailable

7873-40, Session 8

Robust multifrequency inversion in terahertz diffraction tomographyD.A.Castañón,K.A.Chen,BostonUniv.(UnitedStates)

Multi-frequencyterahertzimaginghasreceivedmuchattentioninrecentyearsduetoitsabilitytoobserveuniquespectralcharacteristicsofchemicals,whichcanbeusedinnumerousapplicationssuchasexplosivesdetection.Short-pulseTHzsourcescanprovidebroadbandexcitation,butcurrentapproachesforimageformationbasedondiffractiontomographyconstructimagesindependentlyforeachfrequency.Thisresultsinalackofresolutionatlowerfrequencies,andlowersignal-to-noisereconstructions.Inthispaper,weexploredifferenttechniquesforjointimageformationusingmultiplefrequenciesforenhanceddetection.Amongthesearetechniquesthatusepriorinformationonspectralcharacteristicsofmaterialsofinteresttocoherentlycombineinformationfrommultiplefrequencies,aswellasrobusttechniquesthatassumeincompleteorinaccuratepriorknowledgeofspectralsignatures.Weexploretherelativeperformanceofthesetechniquesonimagereconstructionandobjectrecognitiontasksusingnumericalsimulations.

7873-41, Session 8

Classification-aware dimensionality reduction methods for explosives detection using multi-energy x-ray computed tomographyW.C.Karl,P.Ishwar,L.Eger,BostonUniv.(UnitedStates)

Multi-EnergyX-rayComputedTomography(MECT)isanon-destructivescanningtechnologyinwhichmultipleenergy-selectivemeasurementsoftheX-rayattenuationcanbeobtained.Thisprovidesmoreinformationaboutthechemicalcompositionofthescannedmaterialsthansingle-energytechnologiesandpotentialformorereliabledetectionofexplosives.Westudytheproblemofdiscriminatingbetweenexplosivesandnon-explosivesusinglow-dimensionalfeaturesextractedfromthehigh-dimensionalattenuationversusenergycurvesofmaterials.Westudyvariousclassification-awaredimensionalityreductionmethodsanddemonstratethatthedetectionperformancecanbesignificantlyimprovedbyusingmorethantwofeaturesandwhenusingfeaturesdifferentthanthestandardphotoelectricandComptoncoefficients.Thissuggeststhepotentialforimproveddetectionperformancerelativetoconventionaldual-energyX-raysystems.

7873-42, Session 8

Robustness of spectral CT for explosives detectionS.Basu,MorphoDetectionInc.(UnitedStates)

Noabstractavailable



IS&T /

ReturntoContents

7873-03, Session 9

Bayesian estimation with Gauss-Markov-Potts priors in optical diffraction tomographyH.Ayasso,B.Duchêne,A.Mohammad-Djafari,Lab.desSignauxetSystèmes(France)

Inthispaper,weconsidertheOpticalDiffractionTomography(ODT)asaninversescatteringproblemandproposetousetheBayesianestimationframeworkfortheimagereconstructingproblem.

AGauss-Markov-Pottspriortranslatesappropriatelytheaprioriknowledgethattheobjectundertestiscomposedofcompacthomogeneousregionsmadeofafinitenumberofhomogeneousmaterials.WeproposetwosuchmodelsandusethemforproposingtwoimagereconstructionalgorithmsbasedontheMCMCsamplingschemes.Somepreliminaryresults,obtainedbyapplyingtheinversionalgorithmtoexperimentallaboratorycontrolleddata,willillustratetheperformancesoftheproposedmethod.

7873-43, Session 9

Constrain static target kinetic iterative image reconstruction for 4D cardiac CT imagingA.M.Alessio,Univ.ofWashingtonMedicalCtr.(UnitedStates);P.J.LaRivière,TheUniv.ofChicagoMedicalCtr.(UnitedStates)

IterativeimagereconstructionoffersimprovedsignaltonoisepropertiesforCTimaging.Theprimarychallengewithiterativemethodsisthesubstantialcomputationtime.Thiscomputationtimeisevenmoreprohibitivein4Dimagingapplications,suchascardiacgatedordynamicacquisitionsequences.Inthiswork,weproposeonlyupdatingthetime-varyingelementsofa4Dimagesequencewhileconstrainingthestaticelementstobefixedorslowlyvaryingintime.Wetestthemethodwithsimulationsof4Dacquisitionsbasedonmeasuredcardiacpatientdatafroma)aretrospectivecardiac-gatedCTacquisitionandb)adynamicperfusionCTacquisition.Wetargetthekineticelementswithoneoftwomethods:1)positionacircularROIontheheart,assumingareaoutsideROIisessentiallystaticthroughoutimagingtime;and2)selectvaryingelementsfromcoefficientofvariationimageformedfromfastanalyticreconstructionofalltimeframes.Targetedkineticelementsareupdatedwitheachiteration,whilestaticelementsremainfixedatinitialimagevaluesformedfromreconstructionofdatafromalltimeframes.Resultsconfirmthatthecomputationtimeisproportionaltothenumberoftargetedelements;oursimulationssuggestthat<25%ofelementsneedtobeupdatedineachframeleadingto>4xreductionsinreconstructiontime.Theimagesreconstructedwiththeproposedmethodhavematchedmeansquareerrorwithfull4Dreconstruction.Theproposedmethodisamenabletomostoptimizationalgorithmsandoffersthepotentialforsignificantcomputationimprovements,whichcouldbetradedoffformoresophisticatedsystemmodelsorpenaltyterms.

7873-44, Session 9

Model based motion artifact reduction for computed tomographyZ.Yu,J.Thibault,GEHealthcare(UnitedStates);J.Wang,K.D.Sauer,Univ.ofNotreDame(UnitedStates);C.A.Bouman,PurdueUniv.(UnitedStates)

Modelbasediterativereconstruction(MBIR)algorithmshaverecentlybeenappliedtocomputedtomographyanddemonstratedsuperiorimagequality.Typicalreconstructionalgorithmsassumethevoxelsareconstantovertime.Thisassumptionisnottruewhenpatientmotionispresent,which,inturn,resultsinmotionartifacts.Inthispaper,wepresentamethodthatmodelsthevoxelvaluesasafunctionoftimeintheMBIRframework.Ourresultsonphantomstudyandclinicaldata

showthattheproposedmethodcansignificantlyreducethemotionartifactsinthereconstruction.

7873-24, Session 10

An expectation maximization solution for fusing 2D and 3D ladar dataP.F.Dolce,S.C.Cain,AirForceInstituteofTechnology(UnitedStates)

FLASH3-DLADAR(LAserDetectionandRanging)systemsrepresentanimportantadvancementinimagingtechnologyinthattheyallowanentirescenetobecapturedsimultaneouslyasopposedtoscanningsystems.3-DFLASHsystemssufferfromspatialresolutionproblemsduetopixelpitchfabricationlimitations.A2-Dsystemcanproducehighspatialresolutionimageswithoutrangedata.Onemethodforobtainingbetterspatialandrangeresolutionfrom3-DLADARsystemsistointerpolatetheimagesthroughvarioustechniques.Interpolationmayintroduceerrorsduetoaliasingeffects.Thispaperproposesanexpectationmaximization(EM)solutionthatcorrectstheseproblemsthroughfusing2-Dand3-DLADARdata.TheEMsolutionisshowntoproduce3-Dimageswithimprovedresolutionoverthoseproducedwithstandardinterpolationtechniques.Thecombinationof2-Dhighspatialresolutionimagesand3-DFLASHLADARimagesproducesanewLADARsystemwithimprovedresolutionovercurrentrealizableFLASH3-Dsensors.Thisworkprovesimprovementusingasimplesimulatedtargetandassumesaknownpointspreadfunction(PSF)fortheopticsandatmosphere.Usingtheproposedalgorithmonreal3-DLADARdatawouldfurthertheevidencethatthealgorithmworkswithalldata.

7873-25, Session 10

Superresolution with the focused plenoptic cameraA.Lumsdaine,G.N.Chunev,IndianaUniv.(UnitedStates);T.G.Georgiev,AdobeSystemsInc.(UnitedStates)

Thisworkisbasedonthefocusedplenopticcamera,whichdiffersfromthetraditionalplenopticcamerabycapturinganarrayofmicroimagesfocusedontheobject.Ithasbeenshownthatthefocusedplenopticcameradatacanbeusedtogeneratefinalrenderedimagesofmuchhigherresolutionthantheplenopticcamera.Inthispaperweshowthatthisveryfactmakesitpossibletousethecameradatawithsuper-resolutiontechniques,whichenablesevenhigherspatialresolutiontobeobtainedfromthefocusedplenopticcamera.Wederivetheconditionsunderwhichthefocusedplenopticcameracancaptureradiancedatasuitableforsuper-resolutionanddevelopanalgorithmforsuperresolvingthoseimages.Experimentalresultsarepresentedthatshowa9xincreaseinspatialresolutioncomparedtothebasicfocusedplenopticrenderingapproach.

7873-26, Session 10

Content-preserving zoom-in view generation for surveillance videosK.Watanabe,N.Nitta,N.Babaguchi,OsakaUniv.(Japan)

Inordertogenerateazoom-inviewofcertainregionsofinterest(ROIs),mostexistingmethodssuchasfull-zoomandfisheyeviewdiscardordistorttheremainingregionsoftheinputframewithoutconsideringtheircontent.Inthispaper,weproposeamethodforgeneratingacontent-preservingzoom-inviewwhichprovidesmagnifiedROIsandatthesametimepreservesthecontentoftheremainingregions.Targetingonstationarysurveillancevideos,ourmethodfirstlyextractsmovingobjectsfromeveryinputframeasROIs.Then,theimportancescoreiscalculatedforeachpixelintheinputframebasedonitscontentinordertodeterminewherethedeformation,whichmaycausethedestructionofthecontent,shouldbeavoided.Finally,amapping



IS&T /

ReturntoContents

problemfromtheinputframetothezoom-inviewwithrespecttotheimportancescoresisformulatedtodeformlessimportantregionsmorethantheimportantones.Acontent-preservingzoom-inviewthatallowsviewerstoseeboththemagnifiedROIsandanoverviewofthewholeareaundermonitoringinasingleviewisgeneratedbysolvingamappingproblembyoptimization.Experimentsareconductedtostudytheeffectivenessofconsideringthecontentimportancebycomparingtheresultswhenchangingtheimportancescoresoftheremainingregions.

7873-45, Session 10

Accelerating sparse reconstruction for fast and precomputable system matrix inversesS.J.Reeves,AuburnUniv.(UnitedStates)

Signalreconstructionusinganl1-normpenaltyhasproventobevaluableinedge-preservingregularizationaswellasinsparsereconstructionproblems.Thedevelopingfieldofcompressedsensingtypicallyexploitsthisapproachtoyieldsparsesolutionsinthefaceofincoherentmeasurements.Unfortunately,sparsereconstructiongenerallyrequiressignificantlymorecomputationbecauseofthenonlinearnatureoftheproblemandbecausethemostcommonsolutionsdamageanystructurethatmayotherwiseexistinthesystemmatrix.Inthisworkweadoptamajorizingfunctionfortheabsolutevaluetermthatcanbeusedwithstructuredsystemmatricessothattheregularizationterminthematrixtobeinverteddoesnotdestroythestructureoftheoriginalmatrix.Asaresult,asysteminversecanbeprecomputedandappliedefficientlyateachiterationtospeedtheestimationprocess.Wedemonstratethatthismethodcanyieldsignificantcomputationaladvantageswhentheoriginalsystemmatrixcanberepresentedordecomposedintoanefficientlyappliedsingularvaluedecomposition.


Color image compression by gray-to-color mappingM.S.Drew,SimonFraserUniv.(Canada);G.D.Finlayson,Univ.ofEastAngliaNorwich(UnitedKingdom);A.Jindal,IndianInstituteofTechnologyKanpur(India)

Insteadofde-correlatingimageluminancefromchrominance,someusehasbeenmadeofthecorrelationbetweentheluminancecomponentofanimageanditschromaticcomponents,orthecorrelationofonecolorcomponent.

Inoneapproach,Greenwastakenasabase,andtheothercolorchannelsortheirDCTsubbandswereapproximatedaspolynomialfunctionsofthebaseinsideimagewindows.

Thispaperpointsoutthatwecandobetterifweintroduceanaddressingschemeintotheimagedescriptionsuchthatsimilarcolorsaregroupedtogetherspatially.WithaLuminancecomponentbase,wetestseveralcolorspacesandrearrangementschemes,includingsegmentation,andsettleonalog-geometric-meancolorspace.AlongwithPSNRversusbits-per-pixel,wefoundthatspatially-keyeds-CIELABcolorerrorbetteridentifiesproblemregions.Insteadofsegmentation,wefoundthatrearrangingonsortedchromaticcomponentshasalmostequalperformance.

Here,wesortoneachofthechromaticcomponentsandseparatelyencodewindowsofeach.

Theresultconsistsoftheoriginalgrayscaleplaneplusthepolynomialcoefficientsofwindowsofrearrangedchromaticvalues,whicharethenquantized.Thesimplicityofthemethodproducesafastandsimpleschemeforcolorimageandvideocompression,withexcellentresults.


Study of recognizing human motion observed from an arbitrary viewpoint based on decomposition of a tensor containing multiple view motionsT.Hori,J.Ohya,WasedaUniv.(Japan);J.Kurumisawa,ChibaUniv.ofCommerce(Japan)

ThispaperproposesamethodofhumanmotionrecognitionbasedonTensorDecompositionusingasingle-cameravideosequenceandamultipleviewpointimagedatabase.Thevideosequenceisofapersonperformingamotion.Theviewpointimagedatabaseconsistsofthreeviewpointsofahumanmodelimage:front,side,and45degree.Theaimofthispaperistoaccuratelyclassifytheperson’smotionfromthedatabaseofmultipleviewpointimages,usingacomputervisionbasedapproach.Thestudyofmotioninimagesequencesisacommonresearchtopicincomputervision.Motionisapowerfulfeatureofimagesequences,revealingsituationalinformationnotavailablewithstillpicturesbyrelatingspatialimagefeaturestotemporalchanges.Thetaskofmotionanalysis,inparticularhumanmotionrecognition,remainsachallengingandfundamentalproblemofcomputervision.Thevideosequence,viewpointsdatabase,andpossibleactionclassesthatthemotioncanbeclassifiedas,aremappedintotensorspace.Themulti-linearanalysisofthetensorspace(tensordecomposition)describedinthispapersubsumesasspecialcasesthesimple,linear(1-factor)analysisassociatedwithconventionalSVD(singularvaluedecomposition),aswellastheincrementallymoregeneralbilinear(2-factor)analysisthathasrecentlybeeninvestigatedinthecontextofcomputervision.Inthispaper,weexploretheeffectivenessoftheabove-mentionedrecognitionmethod.OurmethodwascomparedwiththeNearestNeighbormethod,whichisaverycommonrecognitionalgorithm,andachievedhigheraccuracy.


Visual real-time detection, recognition, and tracking of ground and airborne targetsL.Kovács,C.Benedek,ComputerandAutomationResearchInstitute(Hungary)

Thispaperpresentsmethodsandalgorithmsforreal-timetargetdetection,recognitionandtracking,bothinthecaseofground-basedobjects(surveyedfromamovingairborneimagingsensor)andflyingtargets(observedfromaground-basedorvehiclemountedsensor).ThemethodsarehighlyparallelizedandpartiallyimplementedonGPU,withthegoalofreal-timespeedseveninthecaseofmultipletargetobservations.Targetsegmentationsincluderobustforegroundandobjectextractionstepsinvolvingmulti-layerGaussianMixtureModelingandshape/texture-basedobjectsegmentation,viewregistrationbasedoninvariantfeaturepointdetectionandHiddenMarkovModelsforgroundobjectrecognitionandsegmentation.TrackingisimplementedonGPUforhighresolutionreal-timeprocessing.Recognitionoftheextractedtargetsisbasedonfusingshape,textureandmotioninformation,providingconstantestimationsfortheclassificationoftheobservedtargets.Recognitionevaluationispresented,byusingadatabaseofextractedandcategorizedobjectshapes,collectedfromrealvideosources.Real-timeapplicabilityisinfocus.Themethodsusesinglecameraobservations,providingapassiveandexpendablealternativeforexpensiveand/oractivesensors.Usecasesinvolveperimeterdefenseandsurveillancesituations,wherepassivedetectionandobservationisapriority(e.g.aerialsurveillanceofacompound,detectionofreconnaissancedrones,etc.).



IS&T /

ReturntoContents


Illuminant color estimation by hue categorization based on gray world assumptionH.Kawamura,NipponTelegraphandTelephoneCorp.(Japan);S.Yonemura,J.Ohya,WasedaUniv.(Japan);N.Matsuura,NipponTelegraphandTelephoneCorp.(Japan)

Thispaperproposesagrayworldassumptionbasedmethodforestimatinganilluminantcolorfromanimagebyhuecategorization.Thegrayworldassumptionhypothesizesthattheaveragecolorofalltheobjectsinasceneisgray.However,itisdifficulttoestimateanilluminantcolorcorrectlyifthecolorsoftheobjectsinascenearedominatedbycertaincolors.Tosolvethisproblem,ourmethodroughlycategorizesthecolorsintheimagebyhueandselectsthemonebyonetodecidewhethertheaverageoftheselectedcolorscanberegardedasanilluminantcolorornot.Weuseasurfacereflectancesetobtainedfromtheobjectcolorspectradatabaseandthreeilluminants,i.e.,CIEstandardilluminantsAandD65,andblackbodyradiationwith15000Kofcolortemperature,asanexperimentforestimatinganilluminantcolor.Experimentresultsshowthatestimatedilluminantsareclosertothecorrectonethanthatoftheconventionalone,andtheestimationerrorbyourmethodiswithinthejustnoticeabledifferenceinhumancolorperception.


Super-resolved refocusing with a plenoptic cameraZ.Zhou,Univ.ofScienceandTechnologyofChina(China);Y.Yuan,BeiHangUniv.(China);X.Bin,Univ.ofScienceandTechnologyofChina(China)andAcademyofOpto-electronics,ChineseAcademyofSciences(China);L.Qian,Univ.ofScienceandTechnologyofChina(China)

Thispaperpresentsanapproachtoenhancetheresolutionofrefocusedimagesbysuperresolutionmethods.Inplenopticimaging,wedemonstratethattherawsensorimagecanbedividedtoanumberoflow-resolutionangularimageswithsub-pixelshiftsbetweeneachother.Thesub-pixelshift,whichdefinesthesuperresolvingability,ismathematicallyderivedbyconsideringtheplenopticcameraasequivalentcameraarrays.Weimplementsimulationtodemonstratetheimagingprocessofaplenopticcamera,aswellasthatoftheequivalentcameraarrays.Ahigh-resolutionimageisthenreconstructedusingmaximumaposteriori(MAP)superresolutionalgorithms.Withoutotherdegradationeffectsinsimulation,thesuperresolvedimageachievesaresolutionashighaspredictedbytheproposedmodel.Wealsobuildanexperimentalsetuptoacquirelightfields.Withtraditionalrefocusingmethods,theimageisrenderedataratherlowresolution.Incontrast,weimplementthesuper-resolvedrefocusingmethodsandrecoveranimagewithmuchmorespatialdetails.Toevaluatetheperformanceoftheproposedmethod,wefinallycomparethereconstructedimagesusingimagequalitymetricslikepeaksignaltonoiseration(PSNR).


Plenoptic rendering with interactive performance using GPUsG.N.Chunev,A.Lumsdaine,IndianaUniv.(UnitedStates);T.G.Georgiev,AdobeSystemsInc.(UnitedStates)

Processingandrenderingofplenopticcameradatarequiressignificantcomputationalpowerandmemorybandwidth.Atthesametime,interactiverenderingperformanceishighlydesirablesothatuserscanexploretheinfinitevarietyofimagesthatcanberenderedfromasingleplenopticimage.InthispaperwedescribeaGPU-basedapproachforlightfieldprocessingandrendering,withwhichweareabletoachieveinteractiveperformanceforfocusedplenopticrenderingtaskssuch

asrefocusingandnovel-viewgeneration.WepresentaprogressionofrenderingapproachesforfocusedplenopticcameradataandanalyzetheirperformanceonpopularGPU-basedsystems.OuranalysesarevalidatedwithexperimentalresultsoncommerciallyavailableGPUhardware.Evenforcomplicatedrenderingalgorithms,weareabletorender39Mpixelplenopticdatato2Mpixelimageswithframeratesinexcessof500framespersecond.


Compressive through-focus wavefield imagingE.A.Marengo,O.Mangoubi,NortheasternUniv.(UnitedStates)

Opticalsensingandimagingapplicationsoftensufferfromacombinationoflowresolutionobjectreconstructionsandalargenumberofsensorswhich,dependingonfrequency,canbequiteexpensiveorbulky.Itisdesirabletominimizethenumberofsensors(whichreducescost)foragiventargetresolutionlevel(imagequality)andpermissibletotalsensorarraysize(compactness).Equivalently,foragivenimaginghardwareoneseekstomaximizeimagequality,whichinturnmeansfullyexploitingtheavailablesensorsaswellasallpriorsaboutthepropertiesofthesought-afterobjectssuchassparsityproperties,andother,whichcanbeincorporatedintoreconstructionschemes.Thispaperproposesacompressive-sensing-basedmethodtoprocessthrough-focusopticalfielddatacapturedatasensorarray.Theproposedapproachtreatsin-focusandout-of-focusdataasprojectivemeasurementsforcompressivesensing,andassumesthattheobjectsaresparseunderknowntransformationsappliedtothem.Theproposedcompressivethrough-focusimagingisillustratedforbothcoherentandincoherentlight.Theresultsillustratethecombineduseofthrough-focusimagingandcompressivesensingtechniques,andprovideinsightontheinformationinin-focusandout-of-focusdata.



IS&T /

ReturntoContents

Conference 7874: Document Recognition and Retrieval XVIIIWednesday-Thursday26-27January2011PartofProceedingsofSPIEVol.7874DocumentRecognitionandRetrievalXVIII


Improved document image segmentation algorithm using multiresolution morphologyS.S.Bukhari,TechnischeUniv.Kaiserslautern(Germany);F.Shafait,DFKIGmbH(Germany);T.M.Breuel,TechnischeUniv.Kaiserslautern(Germany)

Pagesegmentationintotextandnon-textcomponentsisanessentialpreprocessingstepbeforeOCRoperation.Ifthisisnotdoneproperly,anOCRclassificationengineproducesgarbagetextduetothepresenceofnon-textcomponents.Thispaperdescribesimprovementstothetext/imagesegmentationalgorithmdescribedbyBloomberg[1],whichisalsoavailableinhisopensourceLeptonicalibrary[2].ThemodificationsresultinsignificantimprovementsoverBloomberg’salgorithmonUW-III,UNLV,ICDAR2009pagesegmentationcompetitiontestimagesandcircuitdiagramdatasets.


A simple and effective figure caption detection system for old-style documentsZ.Liu,H.Zhou,Amazon.com,Inc.(UnitedStates)

Identifyingfigurecaptionshaswideapplicationsinproducinghighqualitye-bookssuchaskindlebooksoripadbooks.Inthispaper,wepresentarule-basedsystemtodetecthorizontalfigurecaptionsinold-styledocuments.Ouralgorithmconsistsofthreesteps:(i)segmentimagesintoregionsofdifferenttypessuchastextsandfigures,(ii)searchthebestcaptionregioncandidatebasedonheuristicrulessuchasregionalignmentsanddistances,and(iii)expandcaptionregionsidentifiedinstep(ii)withitsneighboringtext-regionsinordertocorrectover-segmentationerrors.

Wetestouralgorithmusing81imagescollectedfromold-stylebooks,witheachimagecontainingatleastonefigurearea.Weshowthattheapproachisabletocorrectlydetectfigurecaptionsfromimageswithdifferentlayouts,andwealsomeasureitsperformancesintermsofbothprecisionrateandrecallrate.


Reflowing-driven paragraph recognition for electronic books in PDFJ.Fang,Z.Tang,L.Gao,PekingUniv.(China)

Whenreadingelectronicbooksonhandhelddevices,contentsometimesshouldbereflowedandrecomposedtoadaptforsmall-screenmobiledevices.Accordingtopeople’sreadingpractice,itisreasonabletoreflowthetextcontentbasedonparagraphs.Hence,thispaperaddressestherequirementandproposesasetofnovelmethodsonparagraphrecognitionforelectronicbooksinPDF.Theproposedmethodsconsistofthreesteps,namely,physicalstructureanalysis,paragraphsegmentation,andreadingorderdetection.WemakeuseoflocallyorderedpropertyofPDFdocumentsandlayoutstyleofbookstoimprovetraditionalpagerecognitionresults.Inaddition,weemploytheoptimalmatchingofBipartiteGraphtechnologytodetectparagraphs’readingorder.Experimentsshowthatourmethodsachievehighaccuracy.Itisnoteworthythat,theresearchhasbeenappliedinacommercialsoftwarepackageforChineseE-bookproduction.


Ruling line detection and removalE.Kavallieratou,Univ.oftheAegean(Greece);D.P.Lopresti,J.Chen,LehighUniv.(UnitedStates)

Inthispaperwepresentaprocedureforremovingrulinglinesfromahandwrittendocumentimagethatdoesnotrequireanypreprocessingorpostprocessingtasksanditdoesnotbreakexistingcharacters.Wetakeadvantageofcommonrulinglinepropertiessuchasuniformwidth,predictablespacing,positionvs.text,etc.Theproposedprocedurecanalsodetecttheexistenceofrulinglinesinpage,soithasnoeffectondocumentimageswithoutrulinglines.Thesystemisevaluatedonsyntheticpageimagesinfivedifferentlanguagesandiscomparedtoapreviousmethodology.


Natural scene logo recognition by joint boosting feature selection in salient regionsW.Fan,J.Sun,S.Naoi,FujitsuResearchandDevelopmentCenterCo.,Ltd.(China);A.Minagawa,Y.Hotta,FujitsuLabs.,Ltd.(Japan)

Logosareconsideredvaluableintellectualpropertiesandakeycomponentofthegoodwillofabusiness.Inthispaper,weproposeanaturalscenelogorecognitionmethodwhichissegmentation-freeandcapableofprocessingimagesextremelyrapidlyandachievinghighrecognitionrates.Theclassifiersforeachlogoaretrainedjointly,ratherthanindependently.Inthisway,commonfeaturescanbesharedacrossmultipleclassesforbettergeneralization.Todealwithlargerangeofaspectratioofdifferentlogos,asetofsalientregionsofinterest(ROI)areextractedtodescribeeachclass.WeensuretheselectedROIstobebothindividuallyinformativeandtwo-by-twoweaklydependantbyaClassConditionalEntropyMaximizationcriteria.Experimentalresultsonalargelogodatabasedemonstratetheeffectivenessandefficiencyofourproposedmethod.


A framework to improve digital corpus uses: image-mode navigationL.Eynard,V.Malleron,H.Emptoz,Univ.ClaudeBernardLyon1(France)

Inthispaper,weproposeanewsystemtoenhancenavigationinsidedigitalcorpuses.

Thissystemisbasedonanautomaticindexationinimagemodeandprovidestheuserintuitivenavigationininteractivetime.

KeywordsandcontainersareextracteddirectlyfromthedocumentimagestocreateanImageModeIndex,whichshowsthekeywordsascut-outimagesoftheiractualappearances.

Ourapproachrecreatesasummaryofthestructureddocuments,followingindicationsgivenbythecreatorsofthedocumentthemselves.

Oursystemisdetailedinthegeneralcaseandsampleapplicationsona19thcenturyhandwrittencorpusanda18thcenturymachineprintedtextcorpusareprovided.

Thisapproach,developedfordocumentsinaccessibleotherwise,canbeappliedonanycorpuswherekeywordsandcontainerscanbeidentified.


IS&T /

ReturntoContents


Parameter calibration for synthesizing realistic-looking variability in offline handwritingW.Cheng,D.P.Lopresti,LehighUniv.(UnitedStates)

Beingmotivatedbythewidelyacceptedprinciplethatthemoretrainingdata,thebetterperformancetherecognitionsystemhas,weconductedexperimentsaskinghumansubjectstodotestonamixtureofrealEnglishhandwrittentextlinesandtextlinesalteredfromexistinghandwritingwithvariousdistortiondegrees.TheideaofgeneratingsynthetichandwritingisbasedonaperturbationmethodbyT.VargaandH.Bunkethatdistortsanentiretextline.Therearetwopurposesofourexperiments.First,wewanttocalibrateoptimaldistortionparametersettingsforVargaandBunke’sperturbationmodel.Second,weintendtocomparetheeffectsofparametersettingsondifferentwritingstyles,block,cursiveandmixed.Fromthepreliminaryexperimentalresults,wedeterminedappropriaterangesforparameteramplitude,andfoundthatparametersettingsshouldchangefordifferenthandwritingstyles.Oncetheproperparametersettingsarefound,wewillgeneratelargeamountoftrainingandtestingsetsforbuildingbetteroff-linehandwritingrecognitionsystems.


Automatic segmentation of subfigure image panels for multimodal biomedical document retrievalB.Cheng,MissouriUniv.ofScienceandTechnology(UnitedStates);S.K.Antani,NationalLibraryofMedicine(UnitedStates);R.J.Stanley,MissouriUniv.ofScienceandTechnology(UnitedStates);D.Demner-Fushman,G.R.Thoma,NationalLibraryofMedicine(UnitedStates)



A new method for perspective correction of document imagesJ.Rodríguez-Pinñeiro,P.Comesaña-Alfaro,F.Pérez-González,Univ.deVigo(Spain);A.Malvido-García,BitOceansResearch,S.L.(Spain)

Inthispaperweproposeamethodforperspectivedistortioncorrectionofrectangulardocuments.Thisschemeexploitstheorthogonalityofthedocumentedges,allowingtorecovertheaspectratiooftheoriginaldocument.Theresultsobtainedaftercorrectingtheperspectiveofseveraldocumentimagescapturedwithamobilephonearecomparedwiththoseachievedbydigitizingthesamedocumentswithseveralscannermodels.


Robust keyword retrieval method for OCRed textY.Fujii,H.Takebe,H.Tanaka,Y.Hotta,FujitsuLabs.,Ltd.(Japan)

Documentmanagementsystemshavebecomeimportantbecauseofthegrowingpopularityofelectronicfilingofdocumentsandscanningofbooks,magazines,manuals,etc.,throughascanneroradigitalcamera,forstorageorreadingonaPCoranelectronicbook.Textinformationacquiredbyopticalcharacterrecognition(OCR)isusuallyaddedtotheelectronicdocumentsfordocumentretrieval.SincetextsgeneratedbyOCRgenerallyincludecharacterrecognitionerrors,

robustretrievalmethodshavebeenintroducedtoovercomethisproblem.Inthispaper,weproposearetrievalmethodthatisrobustagainstbothcharactersegmentationandrecognitionerrors.Intheproposedmethod,theinsertionofnoisecharactersanddroppingofcharactersinthekeywordretrievalenablesrobustnessagainstcharactersegmentationerrors,andcharactersubstitutioninthekeywordoftherecognitioncandidateforeachcharacterinOCRoranyothercharacterenablesrobustnessagainstcharacterrecognitionerrors.Therecallrateoftheproposedmethodwas15%higherthanthatoftheconventionalmethod.However,theprecisionratewas64%lower.


Online medical symbol recognition using a tablet PCA.Kundu,Q.Hu,S.Boykin,R.Fish,C.Clark,S.Jones,S.Moore,MITRECorp.(UnitedStates)

InthispaperwedescribeaschemetoenhancetheusabilityofaTabletPC’shandwritingrecognitionsystembyincludingsymbolsthatarenotapartoftheTabletPCssymbollibrary.Thegoalofthisworkistomakehandwritingrecognitionmoreusefulformedicalprofessionalsaccustomedtousingmedicalsymbolsinmedicalrecords.Thefactthatmedicalabbreviationslooksimilartosymbolsmakesthisadifficulttask.Thepaperalsodescribesourefforttocreateacorpusofmedicalsymbolsandnon-symbolswhichcouldbepotentiallyidentifiedassymbols.Usingthedatafromthiscorpus,wedemonstratethatthisnewsymbolrecognitionmoduleisrobustandexpandableaswereportgoodresultsonbothamedicalsymbolsetandanexpandedsymboltestsetwhichincludesselectedmathematicalsymbols.Finally,wehaveshownthatusingamulti-classifierarchitectureprovidesrobustperformance.


Characterizing challenged 2008 Minnesota ballotsG.Nagy,RensselaerPolytechnicInstitute(UnitedStates);D.P.Lopresti,LehighUniv.(UnitedStates);E.H.BarneySmith,BoiseStateUniv.(UnitedStates);Z.Wu,RensselaerPolytechnicInstitute(UnitedStates)



Document image retrieval with morphology-based segmentation and features combinationT.Bockholt,Sr.,G.Darmiton,Sr.,C.Mello,Sr.,Univ.FederaldePernambuco(Brazil)

Digitallibrariesneedmorethanjustaretrievalbasedonkeywords,whichcanbeinecientforsomeapplications.Thus,adocumentretrievalbasedoncontentofthedigitizedimageversionofthedocumentcanbeamoreappropriatedapproach.Thispaperdiscussestheretrievalofdocumentimagesbymeansofidentifyingavarietyofelementspresentinthedocument’simagebody.Weproposeanewstrategytoidentifyandcombinefeaturesextractedfromadocumentimage.Wealsoconsiderthetaskofconstructinganoptimizedfeaturesettoimprovetheretrievalperformanceandtovalidateourexperimentsonanassorteddatabase.Experimentalresultsshowthattheproposedsegmentationtogetherwithawiselyfeaturecombinationincreasetheoverallretrievalperformance.Moreovertheretrievedimagesdemonstratethegeneralityandeectivenessofourapproachforanecientsegmentationandclassicationofdocumentimages.

Conference 7874: Document Recognition and Retrieval XVIII


IS&T /

ReturntoContents


Boosting based text and non-text region classificationB.Xie,G.Agam,IllinoisInstituteofTechnology(UnitedStates)

Layoutanalysisisacrucialprocessfordocumentimageunderstandingandinformationretrieval.Documentlayoutanalysisdependsonpagesegmentationandblockclassification.Thispaperdescribesanalgorithmsfortoextractingblocksfromdocumentimagesandaboostingbasedmethodtoclassifythoseblocksastextornot.Thefeaturevectorwhichisfeedintotheboostingclassifierconsistsofafourdirectionrun-lengthhistogram,andconnectedcomponentsfeatures,bothbackgroundandforeground.Usingacombinationoffeaturesthroughaboostingclassifier,weobtainaccuracyof99.5%onourtestcollection.


OMR of early plainchant manuscripts in square notation: a two-stage systemC.Ramirez,J.Ohya,WasedaUniv.(Japan)

WhileOpticalMusicRecognition(OMR)ofmodernprintedandhandwrittendocumentsisconsideredasolvedproblem,withmanycommercialsystemsavailabletoday,theOMRofancientmusicalmanuscriptsstillremainsanopenproblem.InthispaperwepresentasystemfortheOMRofdegradedwesternplainchantmanuscriptsinsquarenotationfromtheXIVtoXVIcenturies.Thesystemhastwomainblocks,thefirstonedealswithsymbolextractionandrecognition,whilethesecondoneactsasanerrordetectionstageforthefirstblockoutputs.Forsymbolextractionweusewidelyknownimage-processingtechniques,suchasSobelfilteringandHoughTransform,andSVMforclassification.TheerrordetectionstageisimplementedwithahiddenMarkovmodel(HMM),whichtakesadvantageofaprioriknowledgeforthisspecifickindofmusic.

7874-01, Session 1

Scientific challenges underlying production document processingE.Saund,PaloAltoResearchCenter,Inc.(UnitedStates)

ThefieldofDocumentRecognitionisbipolar.Ononeendliestheexcellentworkofacademicinstitutionsengaginginoriginalresearchonscientificallyinterestingtopics.Ontheotherendliesthedocumentrecognitionindustrywhichservicesneedsforhigh-volumedatacapturefortransactionandback-officeapplications.Theserealmsseemtoseldommeet,yettheneedisgreattoaddresstechnicalhurdlesforpracticalproblemsusingmodernapproachesfromtheDocumentRecognition,ComputerVision,andMachineLearningcommunities.

Thistalkwillreflectonthreecategoriesofproblemswehaveencounteredwhicharebothscientificallychallengingandofhighpracticalvalue.TheseareDoctypeClassification,FunctionalRoleLabeling,andDocumentSets.DoctypeClassificationasks,“WhatisthispageI’mlookingat?”FunctionalRoleLabelingasks,“Whatisthestatusoftextandgraphicalelementsinamodelofdocumentstructure?”DocumentSetsasks,“Howarepagesandtheircontentsrelatedtooneanother?”Eachofthesehasadhocengineeringapproachesthatprovide40-80%solutions,andeachofthembegsforadeeplygroundedformulationbothtoprovideunderstandingandtosupportcaptureoftheremaining20-60%ofpracticalvalue.ThepracticalneedisnotpurelytechnicalbutalsorevolvesaroundUserExperienceandtherefore,theartofDesign.

7874-02, Session 2

Automated identification of biomedical article type using support vector machinesI.Kim,NationalInstitutesofHealth(UnitedStates);D.X.Le,G.R.Thoma,NationalLibraryofMedicine(UnitedStates)

Authorsofshortpaperssuchaslettersoreditorialsoftenexpresscomplementaryopinions,andsometimescontradictoryones,onrelatedworkinpreviouslypublishedarticles.TheMEDLINE®citationsforsuchshortpapersarerequiredtolistbibliographicdataonthese“commentedon”articlesina“CON”field.ThechallengeistoautomaticallyidentifytheCONarticlesreferredtobytheauthoroftheshortpaper(called“Comment-in”orCINpaper).Ourapproachistousesupportvectormachines(SVM)tofirstclassifyapaperaseitheraCINoraregularfull-lengtharticle(whichisexemptfromthisrequirement),andthentoextractfromtheCINpaperthebibliographicdataoftheCONarticles.Asolutiontothefirstpartoftheproblem,identifyingCINarticles,isaddressedhere.WeimplementandcomparetheperformanceoftwotypesofSVM,onewithalinearkernelfunctionandtheotherwitharadialbasiskernelfunction(RBF).InputfeaturevectorsfortheSVMsarecreatedbycombiningfourtypesoffeaturesbasedonstatisticsofwordsinthearticletitle,wordsthatsuggestthearticletype(letter,correspondence,editorial),sizeofbodytext,andcuephrases.ExperimentsconductedonasetofonlinebiomedicalarticlesshowthattheSVMwithalinearkernelfunctionyieldsasignificantlylowerfalsenegativeerrorratethantheonewithanRBF.OurexperimentsalsoshowthattheSVMwithalinearkernelfunctionachievesasignificantlyhigherlevelofaccuracy,andlowerfalsepositiveandfalsenegativeerrorratesbyusinginputfeaturevectorscreatedbycombiningallfourtypesoffeaturesratherthananysingletype.

7874-03, Session 2

Introduction of statistical information in a syntactic analyzer for document image recognitionA.OliveiraMaroneze,B.Coüasnon,InstitutNationaldesSciencesAppliquéesdeRennes(France);A.Lemaitre,InstitutdeRechercheenInformatiqueetSystèmesAléatoires(France)

Thispaperpresentsanimprovementtoadocumentlayoutanalysissystem,offeringapossiblesolutiontoSayre’sparadox(“alettermustberecognizedbeforeitcanbesegmented;anditmustbesegmentedbeforeitcanberecognized”).Thisimprovement,basedonstochasticparsing,allowsintegrationofstatisticalinformation,obtainedfromrecognizers,duringsyntacticlayoutanalysis.Wepresenthowthisfusionofnumericandsymbolicinformationinafeedbackloopcanbeappliedtosyntacticmethodstosimplifydocumentdescription.Tolimitcombinatorialexplosionduringexplorationofsolutions,wedevisedanoperatorthatallowsoptionalactivationofthestochasticparsingmechanism.Ourevaluationon1250handwrittenbusinesslettersshowsthismethodallowstheimprovementofglobalrecognitionscores.

7874-04, Session 2

High recall document content extractionC.An,LehighUniv.(UnitedStates)

Wereportmethodologiesforcomputinghigh-recallmasksfordocumentimagecontentextraction,thatis,thelocationandsegmentationofregionscontaininghandwriting,machine-printedtext,photographs,blankspace,etc.Theresultingsegmentationispixel-accurate,whichaccommodatesarbitraryzoneshapes(notmerelyrectangles).Wedescribeexperimentsshowingthatiteratedclassierscanincreaserecallofallcontenttypes,withlittlelossofprecision.Wealsointroducetwomethodologicalenhancements:(1)amulti-stagevotingrule;and(2)ascoringpolicythatviewsblankpixelsasa“don’t



IS&T /

ReturntoContents

care”calsswithothercontentclasses.Theseenhancementsimprovebothrecallandprecision,achievingatleast89%recallandatleast87%precisionamongthreecontenttypes:machine-print,handwriting,andphoto.

7874-05, Session 2

Shape codebook based handwritten and machine printed text zone extractionJ.Kumar,Univ.ofMaryland,CollegePark(UnitedStates);R.Prasad,H.Cao,BBNTechnologies(UnitedStates);W.Abd-Almageed,D.S.Doermann,Univ.ofMaryland,CollegePark(UnitedStates);P.S.Natarajan,BBNTechnologies(UnitedStates)

Wepresentanovelmethodforextractinghandwrittenandprintedtextzonesfromnoisydocumentimageswithmixedcontent.WeuseTriple-Adjacent-Segment(TAS)basedfeatureswhichencodelocalshapecharacteristicsoftextinaconsistentmanner.Wefirstconstructtwodifferentcodebooksoftheshapefeaturesextractedfromasetofhandwrittenandprintedtextdocuments.Inthenextstep,wecomputethenormalizedhistogramofcodewordsforeachsegmentedzoneanduseittotrainSupportVectorMachine(SVM)classifier.Duetoacodebookbasedapproach,ourmethodisrobusttothebackgroundnoisepresentintheimage.TheTASfeaturesusedareinvarianttotranslation,scaleandrotationoftext.Inourexperimentalresults,weshowthatapixel-weightedzoneclassificationaccuracyof98%canbeachievedfornoisyArabicdocuments.

Further,wedemonstratetheeffectivenessofourmethodindocumentpageclassificationandshowthatahighprecisioncanbeachievedformachineprinteddocuments.

Theproposedmethodisrobusttothesizeofzones,whichmaycontaintextcontentatword,lineorparagraphlevel.

7874-06, Session 3

A MRF model with parameters optimization by CRF for on-line recognition of handwritten Japanese charactersB.Zhu,M.Nakagawa,TokyoUniv.ofAgricultureandTechnology(Japan)

ThispaperdescribesaMarkovrandomfield(MRF)modelwithweightingparametersoptimizedbyconditionalrandomfield(CRF)foron-linerecognitionofhandwrittenJapanesecharacters.Itextractsfeaturepointsalongthepen-tiptracefrompen-downtopen-up,andthensetseachfeaturepointfromaninputpatternasasiteandeachstatefromacharacterclassasalabel.Itemploysthecoordinatesoffeaturepointsasunaryfeaturesandthedifferencesofthecoordinatesbetweentheneighboringfeaturepointsasbinaryfeatures.TheweightingparametersareestimatedbyCRFortheminimumclassificationerror(MCE)method.InexperimentsusingtheTUATKuchibuedatabase,theproposedmethodachievesthecharacterrecognitionrateof92.77%,whichishigherthanthepreviousmodel,andthemethodestimatingtheweightingparametersbyCRFbringshigherrecognitionaccuracythanMCE.

7874-07, Session 3

Improving a HMM-based off-line handwriting recognition system using MME-PSO optimizationM.Hamdani,EcoleNationaled’IngénieursdeSfax(Tunisia);H.ElAbed,TechnischeUniv.Braunschweig(Germany);T.M.Hamdani,EcoleNationaled’IngénieursdeSfax(Tunisia);V.Märgner,TechnischeUniv.Braunschweig(Germany);A.M.Alimi,

EcoleNationaled’IngénieursdeSfax(Tunisia)

Oneofthetrivialstepsinthedevelopmentofaclassifieristhedesignofitsarchitecture.Thispaperpresentsanewalgorithm,MultiModelsEvolvement(MME)usingParticleSwarmOptimization(PSO).ThisalgorithmisamodifiedversionofthebasicPSO,whichisusedtotheunsuperviseddesignofHiddenMarkovModel(HMM)basedarchitectures.Forinstance,theproposedalgorithmisappliedtoanArabichandwritingrecognizerbasedondiscreteprobabilityHMMs.Aftertheoptimizationoftheirarchitectures,HMMsaretrainedwiththeBaum-Welchalgorithm.ThevalidationofthesystemisbasedontheIfN/ENITdatabase.Theperformanceofthedevelopedapproachiscomparedtotheparticipatingsystemsatthe2005competitionorganizedonArabichandwritingrecognitionontheInternationalConferenceonDocumentAnalysisandRecognition(ICDAR).Anabsoluteimprovementof6%ofwordrecognitionratewithabout81%ispresented.Theproposedsystemoutperformsalsomostoftheknownstate-of-the-artsystems.

7874-08, Session 3

SemiBoost-based Arabic character recognition methodB.Su,L.Peng,X.Ding,TsinghuaUniv.(China)

Traditionally,supervisedlearningmethodisadoptedtobuildcharacterrecognitionsystem.However,thedistributionoftraininglabeledsamplesandthedistributionofpracticalsamplesmaynotmatch,duetothevariationofcharacterimagequality.Tosolvethisproblem,itwouldbefeasibletoincorporateunlabeledpracticalsamplesinthetrainingstage.Inthispaper,wepresentaSemiboost-basedArabiccharacterrecognitionmethod.SVMisadoptedasthebaseclassifier.Ateachiteration,unlabeledexamplesareselectedandassignedlabels.TheselectedsamplesareusedalongwiththeoriginallabeledsamplestotrainanewSVMclassifier.Anempiricalstudyonseveralsimilarcharacterpairswithdifferentsimilaritiesshowsthattheproposedmethodimprovestheperformancewhenunlabeledsamplesrevealtheunderlyingstructureofdata.

7874-09, Session 3

First experiments on a new online handwritten flowchart databaseA.M.Awal,Univ.deNantes(France);G.Feng,NanjingUniv.(China);H.Mouchère,C.Viard-Gaudin,Univ.deNantes(France)

Weproposeinthispaperanewonlinehandwrittenflowchartdatabaseandperformsomefirstexperimentstohaveabaselinebenchmarkonthisdataset.Thecollecteddatabaseconsistsof78flowchartslabeledatthestrokeandsymbollevels.Inaddition,anisolateddatabaseofgraphicalandtextsymbolswasextractedfromthesecollectedflowcharts.Then,wetackletheproblemofonlinehandwrittenflowchartrecognitionfromtwodifferentpointsofview.Firstly,weconsiderthatflowchartsarecorrectlysegmented,andweproposedifferentclassifierstoperformtwotasks,text/non-textseparationandgraphicalsymbolrecognition.Testedwiththeextractedisolatedtestdatabase,weachieveupto99%and96%intext/non-textseparationandupto81.3%ingraphicalsymbolsrecognition.Secondly,weproposeaglobalapproachtoperformflowchartsegmentationandrecognition.Forthislatter,weadoptagloballearningschemaandarecognitionarchitecturethatconsidersasimultaneoussegmentationandrecognition.Globalarchitectureistrainedandtesteddirectlywithflowcharts.Resultsshowtheinterestofsuchglobalapproach,butregardingthecomplexityofflowchartsegmentationproblem,thereisstilllotofspacetoimprovethegloballearningandrecognitionmethods.



IS&T /

ReturntoContents

7874-10, Session 4

Segmenting texts from outdoor images taken by mobile phones using color featuresZ.Liu,H.Zhou,Amazon.com,Inc.(UnitedStates)

Recognizingtextsfromimagestakenbymobilephoneswithlowresolutionhaswideapplications.IthasshownthatagoodimagebinarizationcansubstantiallyhelptheaccuracyofdownstreamOCRs.

Inthispaper,wepresentaframeworktosegmenttextsfromoutdoorimagestakenbymobilephonesusingcolorfeatures.Theframeworkconsistsofthreesteps:(i)theinitialprocessincludingimageenhancement,binarizationandnoisefiltering,wherewebinarizetheinputimagesineachRGBchannel,andapplycomponentlevelnoisefiltering;(ii)groupingcomponentsintoblocksusingcolorfeatures,wherewecomputethecomponentsimilaritiesbydynamicallyadjustingtheweightsofRGBchannels,andmergegroupshierachically,and(iii)blocksselection,whereweusetherun-lengthfeaturesandchoosetheSupportVectorMachine~(SVM)astheclassifier.

Wetestedthealgorithmusing13outdoorimagestakenbyanold-styleLG-64693mobilephonewith640x480resolution.WecomparedthesegmentationresultswithTsar’salgorithm,astate-of-the-artcameratextdetectionalgorithm,andshowthatouralgorithmismorerobust,particularlyintermsofnoiseremoval.Inaddition,wealsoevaluatedtheimpactsofouralgorithmontheAbbyy’sFineReader,oneofthemostpopularcommercialOCRenginesinthemarket.

7874-11, Session 4

A perceptive method for handwritten text segmentationA.Lemaitre,InstitutdeRechercheenInformatiqueetSystèmesAléatoires(France);B.Coüasnon,InstitutNationaldesSciencesAppliquéesdeRennes(France)

Thispaperpresentsanewmethodtoaddresstheproblemofhandwrittentextsegmentationintotextlinesandwords.Thus,weproposeamethodbasedonthecooperationbetweenpointsofviewthatenablestolocalizethetextlinesinalowresolutionimage,andthentoassociatethepixelsatahigherlevelofresolution.Thankstothecombinationoflevelsofvision,wecandetectoverlappingcharactersandre-segmenttheconnectedcomponentsduringtheanalysis.Then,weproposeasegmentationoflinesintowordsbasedonthecooperationbetweendigitaldataandsymbolicknowledge.ThedigitaldataareobtainedfromdistancesinsideaDelaunaygraph,whichgivesaprecisedistancebetweenconnectedcomponents,atthepixellevel.Then,weintroducestructuralrulesinordertotakeintoaccountsomegenericknowledgeabouttheorganizationofatextpage.Thiscooperationbetweeninformationgivesabiggerpowerofexpression.WevalidatethisworkusingthemetricsandthedatabaseproposedforthesegmentationcontestofICDAR2009.Thus,weshowthatourmethodobtainsveryinterestingresults,comparedtotheothermethodsoftheliterature.Moreprecisely,weareabletodealwithslopeandcurvature,overlappingtextlinesandvariedkindsofwritings,whicharethemaindifficultiesmetbytheothermethods.

7874-39, Session 4

A masked-based enhancement method for historical documentsE.H.BarneySmith,BoiseStateUniv.(UnitedStates);J.Darbon,EcoleNormaleSupérieuredeCachan(France);L.Likforman-Sulem,TelecomParisTech(France)

Thispaperproposesanovelmethodfordocumentenhancement.Themethodisbasedonthecombinationoftwostate-of-the-artfiltersthroughtheconstructionofamask.ThemaskisappliedtoaTV(TotalVariation)-regularizedimagewherebackgroundnoisehasbeenreduced.ThemaskedimageisthenfilteredbyNLmeans(NonLocal

Means)whichreducesthenoiseinthetextareaslocatedbythemask.Thedocumentimagestobeenhancedarerealhistoricaldocumentsfromseveralperiodswhichincludeseveraldefectsintheirbackground.Thesedefectsresultfromscanning,paperagingandbleed-through.WeobservetheimprovementofthisenhancementmethodthroughOCRaccuracy.

7874-13, Session 5

Example-centric document design and developmentS.R.Klemmer,StanfordUniv.(UnitedStates)

Designersoftenleverageexampleswhencreatingnewwork.Usingsuccessfulelementsfrompriorideascanbemoreefficientthanreinventingthemfromscratch.Moreover,examplescanplayaninspirationalrole,helpingdesignersseethespaceofexistingsolutions,andillustratinghowdesirabledesigneffectsmaybeimplemented.Themorethanonetrillionpagesonthewebtodayprovideacorpusofdesignexamplesunparalleledinhumanhistory.Today,workingwithexistingWebdesignsrequiremanuallymanipulatingtheHTMLsource.Toenablenovicesandexpertsaliketomorecreativelyuseexamples,BricolageintroducesanautomaticmethodfortransferringlayoutandstylebetweenWebpages.Thistransferisguidedbymappingslearnedusingstructuredpredictionmethodsandothermachinelearningtechniques.Inparticular,ouralgorithmlearnstoidentifyvisuallyandsemanticallysimilarregionsbetweenpagesbytrainingonasetofhuman-generatedmappings.TheendresultisanautomaticsystemthatenablesdesignerstoviewtheircontentinthelayoutandstyleofanyHTMLpageontheWeb.

7874-14, Session 6

Feature relevance analysis for writer identificationI.Siddiqi,RenéDescartesUniv.(France)andNationalUniv.ofSciencesandTechnology(Pakistan);K.Khurshid,RenéDescartesUniv.(France)andInstituteofSpaceTechnology,Islamabad(Pakistan);N.Vincent,RenéDescartesUniv.(France)

Thisworkpresentsananalyticalstudyontherelevanceoffeaturesinanexistingframeworkforwriteridentificationfromofflinehandwrittendocumentimages.Theidentificationsystemcomprisesasetof15featurescombiningtheorientationandcurvatureinformationinawritingwiththewell-knowncodebookbasedapproach.Thisstudyaimstofindtheoptimalfeaturesubsetforthetaskofidentifyingtheauthorofaquestioneddocumentwhilemaintainingacceptableidentificationrates.Employingageneticalgorithmwithawrappermethodwecarryoutafeatureselectionmechanismandidentifythemostrelevantfeaturesincharacterizingthewriterofahandwrittentext.

7874-15, Session 6

Using perturbed handwriting to support writer identification in the presence of severe data constraintsJ.Chen,W.Cheng,D.P.Lopresti,LehighUniv.(UnitedStates)

Sincerealdataistime-consumingandexpensivetocollect,label,anduse,researchershaveproposedapproachesusingsyntheticvariationsforthetasksofsignatureverification,speakerauthentication,handwritingrecognition,keywordspotting,etc.However,thelimitationofrealdataisparticularlycriticalinthefiledofwriteridentifica-tioninthatinforensics,enemiesorcriminalsusuallyleavelittleamountofrealdata.Therefore,itisunrealistictoalwaysassumesufficientrealdataforwriteridentification.Inaddition,thisfielddiffersfrommanyothersinthatwestrivetopreserveasmuchinter-writervariations,butmodelperturbedhandwritingmightbreaksuchdiscriminabilityamongwriters.



IS&T /

ReturntoContents

Inthiswork,westartedbyconductinguserstudieswherehumansubjectswereinvolvedincalibratingrealistic-lookingtransformations.Next,wemeasuredtheeffectsofincorporatingperturbedhandwritingintotherealtrainingdataset.Experimentalresultsjustifiedourhypothesisthatwithlimitedrealdata,modelperturbedhandwritingimprovedtheperformanceofwriteridentification.Inaddition,wejustifiedbyexperimentsthatitwasbeneficialtosearchforbetterperformanceintheparametersubspaces.

7874-16, Session 6

Statistical characterization of handwriting characteristics using automated toolsG.R.Ball,S.N.Srihari,Univ.atBuffalo(UnitedStates)

Weprovideastatisticalbasisforreportingtheresultsofhandwritingexaminationbyquestioneddocument(QD)examiners.AsafacetofQuestionedDocument(QD)examination,theanalysisandreportingofhandwritingexaminationsuffersfromthelackofstatisticaldataconcerningthefrequencyofoccurrenceofcombinationsofparticularhandwritingcharacteristics.QDexaminerstendtoassignprobativevaluestospecifichandwritingcharacteristicsandtheircombinationsbasedentirelyontheexaminer’sexperienceandpowerofrecall.TheresearchusesdatabasesofhandwritingsamplesthatarerepresentativeoftheUSpopulation.FeaturelistsofcharacteristicsprovidedbyQDexaminers,areusedtodetermineastowhatfrequenciesneedtobeevaluated.Algorithmsareusedtoautomaticallyextractthosecharacteristics,e.g.,asoftwaretoolforextractingmostofthecharacteristicsfromthemostcommonletterpairth,isfunctional.Foreachlettercombinationthemarginalandconditionalfrequenciesoftheircharacteristicsareevaluated.Basedonstatisticaldependenciesofthecharacteristicstheprobabilityofanygivenletterformationiscomputed.TheresultingalgorithmsareincorporatedasystemforwriterverificationknownasCEDAR-FOX.

7874-17, Session 7

Keyword and image-based retrieval of mathematical expressionsR.Zanibbi,B.Yuan,RochesterInstituteofTechnology(UnitedStates)

Twonewmethodsforretrievingmathematicalexpressionsusingconventionalkeywordsearchandexpressionimagesarepresented.Anexpression-levelTF-IDF(termfrequency-inversedocumentfrequency)approachisusedforkeywordsearch,wherequeriesandindexedexpressionsarerepresentedbykeywordstakenfromLaTeXstrings.TF-IDFiscomputedatthelevelofindividualexpressions,ratherthandocumentstoincreasetheprecisionofmatching.ThesecondretrievaltechniqueisaformofContent-BasedImageRetrieval(CBIR),usingabagofvisualwords.ExpressionsaresegmentedintosubregionsusingtheXY-cuttingalgorithm,afterwhichvisualwordsaregeneratedforeachnodeoftheresultingXY-tree.Matchingofvisualwordsisbasedonexpressionshape(fromcontourfeatures),aspectratio,andthehistogramofnodedepthsforthesub-treeassociatedwithanodeinanXY-tree.PreliminaryresultsusingLaTeXdocumentsfromtheonlinearXivrepositorysuggestthatthetwomethodsareindividuallyeffective,andmaybeprofitablycombined.

7874-18, Session 7

Word spotting for handwritten documents using Chamfer distance and dynamic time warpingR.M.Saabni,J.A.El-Sana,Ben-GurionUniv.oftheNegev(Israel)

Alargeamountofhandwrittenhistoricaldocumentsarelocatedin

librariesaroundtheworld.Thedesiretoaccess,search,andexplorethesedocumentspavesthewayforanewageofknowledgesharingandpromotescollaborationandunderstandingbetweenhumansocieties.Currently,theindexesforthesedocumentsaregeneratedmanually,whichisverytediousandtimeconsuming.Resultsproducedbystateofthearttechniques,forconvertingcompleteimagesofhandwrittendocumentsintotextualrepresentations,arenotyetsufficient.Therefore,word-spottingmethodshavebeendevelopedtoarchiveandindeximagesofhandwrittendocumentsinordertoenableefficientsearchingwithindocuments.Inthispaper,wepresentanewmatchingalgorithmtobeusedinword-spottingtasksforhistoricalArabicdocuments.WepresentanovelalgorithmbasedontheChamferDistancetocomputethesimilaritybetweenshapesofword-parts.MatchingresultsareusedtoclusterimagesofArabicword-partsintodifferentclassesusingtheNearestNeighborrule.Tocomputethedistancebetweentwoword-partimages,thealgorithmsubdivideseachimageintoequal-sizedslices(windows).AmodifiedversionoftheChamferDistance,incorporatinggeometricgradientfeaturesanddistancetransformdata,isusedasasimilaritydistancebetweenthedifferentslices.Finally,theDynamicTimeWarping(DTW)algorithmisusedtomeasurethedistancebetweentwoimagesofword-parts.ByusingtheDTWweenabledoursystemtoclustersimilarword-parts,eventhoughtheyaretransformednon-linearlyduetothenatureofhandwriting.Wetestedourimplementationofthepresentedmethodsusingvariousdocumentsindifferentwritingstyles,takenfromJuma’aAlMajidCenter-Dubai,andobtainedencouragingresults.

7874-19, Session 7

Automatic identification of ROI in figure images toward improving hybrid (text and image) biomedical document retrievalD.You,Univ.atBuffalo(UnitedStates);S.K.Antani,D.Demner-Fushman,M.M.Rahman,NationalLibraryofMedicine(UnitedStates);V.Govindaraju,Univ.atBuffalo(UnitedStates);G.R.Thoma,NationalLibraryofMedicine(UnitedStates)

Biomedicalimagesareoftenreferencedforclinicaldecisionsupport(CDS),educationalpurposes,andresearch.Theyappearinspecializeddatabasesorinbiomedicalpublicationsandarenotmeaningfullyretrievableusingprimarilytext-basedretrievalsystems.Thetaskofautomaticallyfindingtheimagesinanarticlethataremostusefulforthepurposeofdeterminingrelevancetoaclinicalsituationisquitechallenging.ThistaskcanbedonebyautomaticallyannotatingimagesextractedfromscientificpublicationswithrespecttotheirusefulnessforCDS.Asanimportantsteptowardachievingthegoal,weproposedfigureimageanalysisforlocalizingpointers(arrows,symbols)toextractregionsofinterest(ROI)thatcanthenbeusedtoobtainmeaningfullocalimagecontent.Content-basedimageretrieval(CBIR)techniquescanthenassociatelocalimageROIswithidentifiedbiomedicalconceptsinfigurecaptionsforimprovedhybrid(textandimage)retrievalofbiomedicalarticles.

InthisworkwepresentmethodsthatmakerobustourpreviousMarkovrandomfield(MRF)-basedapproachforpointerrecognitionandROIextraction.TheseincludeuseofActiveShapeModels(ASM)toovercomeproblemsinrecognizingdistortedpointershapesandaregionsegmentationmethodforROIextraction.

Wemeasuretheperformanceofourmethodsontwocriteria:(i)effectivenessinrecognizingpointersinimages,and(ii)improveddocumentretrievalthroughuseofextractedROIs.Preliminarytestsonthreetestsetshaveshown87%accuracyinthefirstcriterion.Further,thequalityofdocumentretrievalusinglocalvisualfeaturesandtextisshowntobebetterthanusingvisualfeaturesalone.MoreintensivetestsareinprogresstoevaluateimpactofpointerlocalizationanduseofROIinimageannotationandretrieval.



IS&T /

ReturntoContents

7874-20, Session 7

Automatic extraction of numeric strings in unconstrained handwritten document imagesM.M.Haji,T.D.Bui,C.Y.Suen,ConcordiaUniv.(Canada)

Numericstringssuchasidentificationnumbersordatescarryvitalpiecesofinformationindocuments.Applicationsconcerningtheprocessingofnumericstringscanbecategorizedintotwotypesbasedonwhetherthelocationofthenumeralstringtoberecognizedisknownornot.Inmanyapplications,suchasreadingbackcheques,thelocationofthenumericstringstoberecognizedisfixedandknown,andthemainchallengeishowtorecognizethem.Consequently,plentyofmethodshavebeenproposedforrecognitionofnumericstrings.However,insomeapplications,wehavetospotthenumericstringsinthefirstplace,astheymayappearatarbitrarylocationsinsidethedocument.Thisisthecaseinadocumentretrievalapplicationwherewehavealargecollectionofdocumentsandwewishtheusertobeabletofetchtheonethatcontainsaspecificnumericstring(whichisforexampleareferenceoridentificationnumber).Thereareveryfewstudiesthathaveaddressedthisaspectoftheproblem.Inthispaper,wepresentanovelalgorithmforautomaticextractionofnumericstringsinunconstrainedhandwrittendocumentimages.Thealgorithmhastwomainphases:pruningandverification.Inthepruningphase,thealgorithmfirstperformsanewsegment-mergeprocedureoneachtextline,andthenusinganewregularitymeasure,itprunesallsequencesofcharactersthatareunlikelytobenumericstrings.Thesegment-mergeprocedureiscomposedoftwomodules:anewexplicitcharactersegmentationalgorithmwhichisbasedonanalysisofskeletalgraphsandamergingalgorithmwhichisbasedongraphpartitioning.Allthecandidatesequencesthatpassthepruningphasearesenttoarecognition-basedverificationphaseforthefinaldecision.Therecognitionisbasedonacoarse-to-fineapproachusingprobabilisticRBFnetworks.Wedevelopedouralgorithmfortheprocessingofreal-worlddocumentswhereletteranddigitsmaybeconnectedorbrokeninadocument.Wehaveaddressedallstepsofacompletesystemfrommarginremovalandlineextractiontoevaluation.Inordertoevaluatetheperformanceofthealgorithm,wehavecreatedacomprehensivedatabaseofhandwrittenandmachine-printeddocumentsincludingnumericstringsofdifferenttypes,lengthsandwritingstyles.Tothebestofourknowledge,thisisthefirstworktoreportaquantitativeevaluationofanumeralextractionalgorithmonareal-worlddataset.Theeffectivenessoftheproposedapproachisshownbyextensiveexperimentsdoneonthisdatabasewhichcontainoversixhundreddocumentswithdifferenttypesoflayoutsandlevelsofnoise.

7874-21, Session 8

Unsupervised method to generate page templatesH.Déjean,XeroxResearchCtr.EuropeGrenoble(France)

Inthispaper,weproposeamethodforautomaticallyinferringthedifferentpagetemplatesusedtolayoutthedocumentcontent.Aftertheidentificationoflabeledelementsthroughalogicalanalysis,geometricrelationsarecomputedbetweentheselabeledelements,andpagetemplatescandidatesaregeneratedusingfrequentrelatedelements.Afuzzymatchingoperationallowsforselectingthemostfrequentandrelevantpagetemplatesforagivendocument.Suchpagetemplatescanbeusedtocorrecterrorsproducedduringthedifferentpreviousstepsofthedocumentanalysis:zoning,OCR,andlogicalanalysis.EvaluationhasbeenperformedusingtheINEXbooktrackcollection.

7874-22, Session 8

Font group identification using reconstructed fontsM.P.Cutter,J.vanBeusekom,TechnischeUniv.Kaiserslautern(Germany);F.Shafait,DFKIGmbH(Germany);T.M.Breuel,TechnischeUniv.Kaiserslautern(Germany)

Ideally,digitalversionsofscannedoriginalsshouldberepresentedinaformatthatissearchable,compressed,highlyreadable,andfaithfultotheoriginal.ThesegoalscantheoreticallybeachievedthroughOCRandfontrecognition,re-typesettingthedocumenttextwithoriginalfonts.However,OCRandfontrecognitionremainhardproblems,andmanyhistoricaldocumentsusefontsthatarenotavailableindigitalforms.Itisdesirabletobeabletoreconstructfontswithvectorglyphsthatapproximatetheshapesofthelettersthatformafont.Inthiswork,weaddressthegroupingtokensinatoken-compresseddocumentintocandidatefonts.Thispermitsustoincorporatefontinformationintotoken-compressedimagesevenwhentheoriginalfontsareunknownorunavailableindigitalformat.Thispaperextendspreviousworkinfontreconstructionbyproposingandevaluatinganalgorithmtoassignafonttoeverycharacterwithinadocument.Thisisanecessarysteptorepresentascanneddocumentimagewithareconstructedfont.Throughourevaluationmethod,wehavemeasureda98.4%accuracyfortheassignmentofletterstocandidatefontsinmulti-fontdocuments.

7874-23, Session 8

How carefully designed open resource sharing can help and expand document analysis researchB.Lamiroy,Univ.Nancy2(France)andLehighUniv.(UnitedStates);D.P.Lopresti,J.Heflin,H.F.Korth,LehighUniv.(UnitedStates)

Makingdatasetsavailableforpeerreviewingofpublisheddocumentanalysismethodsordistributinglargecommonlyuseddocumentcorporaforbenchmarkingareextremelyusefulandsoundpracticesandinitiatives.Thispapershowsthattheycoveronlyaverytinysegmentoftheusessharedandcommonlyavailableresearchdatamayhave.Wedevelopacompletelynewparadigmforsharingandaccessingcommondatasets,benchmarksandothertoolsthatisbasedonaveryopenandfreecommunitybasedcontributionmodel.Themodelisoperationalandhasbeenimplementedsothatitcanbetestedonabroadscale.Thenewinteractionsthatwillarisefromitsusemaysparkinnovativewaysofconductingdocumentanalysisresearchontheonehand,butcreateverychallenginginteractionswithotherresearchdomainsaswell.

7874-24, Session 8

Multiple-agent adaptation in whole-book recognitionP.Xiu,H.S.Baird,LehighUniv.(UnitedStates)

Inordertoaccuratelyrecognizetextualimagesofabook,weoftenemployvariousmodelsincludingiconicmodel(forcharacterclassification),dictionary(forwordrecognition),charactersegmentationmodel,etc.,whicharederivedfrompriorknowledge.Imperfectionsinthesemodelsaffectrecognitionperformanceinevitably.Inthispaper,weproposeanunsupervisedlearningtechniquethatadaptsmultiplemodelson-the-flyonahomogeneousinputdatasettoachieveabetteroverallrecognitionaccuracyfullyautomatically.Themajorchallengeforthisunsupervisedlearningprocessis,howtomakemodelsimproveratherthandamageoneanother?Inourframework,modelsmeasuredisagreementsbetweentheirinputdataandoutputdata.Weproposeapolicybasedondisagreementstoadaptmultiplemodelssimultaneously(oralternately)safely.Wewillconstructabook



IS&T /

ReturntoContents

recognitionsystembasedonthisframework,anddemonstrateitsfeasibility.

7874-25, Session 9

Ancient documents bleed-through evaluation and its application for predicting OCR error ratesV.Rabeux,J.Nicholas,J.Domenger,Univ.Bordeaux1(France)

Thisarticlepresentsawaytoevaluatethebleed-throughdefectonveryolddocumentimages.Wedesignmeasurestoquantifyandevaluatetheversoinkbleedingthroughthepaperontotherectoside.Measuringthebleed-throughdefectalowsustoperformstatisticalanalysisthatareabletopredictthefeasibilityofdifferentpost-scantasks.InthisarticlewechoosetoillustrateourmeasuresbycreatingtwoOCRerrorratepredictingmodelsbasedbleed-throughevaluation.Twomodelsareproposed,oneforAbbyyFineReader　whichisaverypower-fullcommercialOCRandOCRopus　whichissponsoredbyGoogle.Bothpredictionmodelsappearstobeveryaccuratewhencalculatingvariousstatisticindicators.

7874-26, Session 9

Binarization of camera-captured document using A MAP approachX.Peng,S.Setlur,V.Govindaraju,Univ.atBuffalo(UnitedStates);R.Sitaram,Hewlett-PackardLabs.India(India)

Documentbinarizationisoneoftheinitialandcriticalstepsformanydocumentanalysissystems.Nowadays,withthesuccessandpopularityofhand-helddevices,largeeffortsaremotivatedtoconvertdocumentsintodigitalformatbyusinghand-heldcameras.Inthispaper,weproposeaBayesianbasedmaximumaposteriori(MAP)estimationalgorithmtobinarizethecamera-captureddocumentimages.AnoveladaptivesegmentationsurfaceestimationandnormalizationmethodisproposedasthepreprocessingstepinourworkandfollowedbyaMarkovRandomFieldbasedrefineproceduretoremovenoisesandsmoothbinarizedresult.Experimentalresultsshowthatourmethodhasbetterperformancethanotheralgorithmsonbadorunevenilluminationdocumentimages.

7874-27, Session 9

Statistical multiresolution schemes for historical document binarizationT.Obafemi-Ajayi,G.Agam,IllinoisInstituteofTechnology(UnitedStates)

Inpreviouswork,weproposedtheapplicationoftheExpectation-Maximization(EM)algorithminthebinarizationofhistoricaldocumentsbydefiningamulti-resolutionframework.

Inthiswork,weextendthemulti-resolutionframeworktotheOtsualgorithmforeffectivebinarizationofhistoricaldocuments.

WecomparetheeffectivenessoftheEMbasedbinarizationtechniquetotheOtsuthresholdingalgorithmonhistoricaldocuments.WedemonstratehowtheEMcanbeextendedtoperformaneffectivesegmentationofhistoricaldocumentsbytakingintoaccountmultiplefeaturesbeyondtheintensityofthedocumentimage.

Experimentalresults,analysisandcomparisonstoknowntechniquesarepresentedusingthedocumentimagecollectionfromtheDIBCO2009contestinadditiontodocumentimagesfromtheFriedercollection.



IS&T /

ReturntoContents

Conference 7875: Sensors, Cameras, and Systems for Industrial, Scientific, and Consumer Applications XIITuesday-Thursday25-27January2011PartofProceedingsofSPIEVol.7875Sensors,Cameras,andSystemsforIndustrial,Scientific,andConsumerApplicationsXII


Approach to quantitative detection of CD146 with the biosensor based on imaging ellipsometryY.Niu,L.Liu,InstituteofMechanics(China);X.Yan,InstituteofBiophysics(China);G.Jin,InstituteofMechanics(China)

CD146glycoprotein,amemberofcelladhesionmolecule(CAMs),isconsideredtobeanoveltargetonendothelialcellinvolvedintumorangiogenesis.Thebiosensorbasedonimagingellipsometry(BIE)wasusedforCD146detectionasatrialbythefollowingsteps.Firstly,CD146antibodyasligandwastemptedtoimmobilizeorientallyonmodifiedsiliconsubstratebyProteinG.Then,CD146detectionwasperformedanditscalibrationcurvewasestablishedfortheneedofquantitativedetection.Finally,18serumsamplesweretestedquantitatively,andtheirresultswerecomparedtoELISA’s.ThesensitivityforCD146detectionreachestheorderofng/mLandtherelationshipbetweenBIEsignaly(grayscalevalue)andCD146concentrationx(ng/mL)isy=3.3ln(x)+91.3.ItagreesmostlywithELISA’sresults,andthecorrelationcoefficientis0.714,whichindicatesthatresultsoftwoapproacheshavesignificantstatisticrelevance.Inaddition,biosensorwiththetotalinternalreflectionimagingellipsometry(TIRIE)wasappliedforreal-timemonitoringofthedetectionprocesses.Toconclude,BIEprovidesasimpleandeffectiveapproachforCD146detection,whichshowsapotentialforfurtherclinicalapplications.


Dynamic range extension of a CMOS active pixel sensor by in-pixel charge mixingS.Jo,M.Bae,J.Kong,J.Shin,KyungpookNationalUniv.(Korea,Republicof)

VariousapproacheshavebeenutilizedtoextendthedynamicrangeoftheCMOSimagesensor,whicharebasedonalinear-logarithmicCIS,overflowintegrationcapacitor,andmultiplesamplingorindividualpixelresetting.Theseapproaches,however,sufferfromnoise,nonlinearity,lowersensitivity,reducedoperatingspeed,andlowerresolution.Inordertoovercometheseproblems,wehavepreviouslyproposedadynamicrangeextensionmethodbycombiningoutputsignalsfromtwophotodiodeswithdifferentsensitivities,suchasahigh-sensitivityphotodiodeandalow-sensitivityphotodiode.

Theproposedactivepixelsensorhasbeenfabricatedbyusing2-poly4-metalstandardCMOSprocessanditscharacteristicshavebeenmeasured.Itisfoundthatchargesinthehigh-andlow-sensitivityphotodiodescouldbemixedeachotherandthelostimageinformationofthehigh-sensitivityphotodiodecouldberegeneratedusingthechargesinthelow-sensitivityphotodiode,asshownbysimulationresults.Also,dynamicrangeextensionoftheproposedactivepixelsensorhasbeenexperimentallyverified.Detailedexperimentalresultswillbepresentedinthepaper.


A novel 3D architecture for high dynamic range image sensor and on-chip data compressionG.M.Fadoua,Lab.d’ElectroniquedeTechnologiedel’Information(France);A.Dupret,EcoleSupérieured’IngénieursenElectroniqueetElectrotechnique(France);A.Peizerat,Lab.

d’ElectroniquedeTechnologiedel’Information(France);Y.Blanchard,EcoleSupérieured’IngénieursenElectroniqueetElectrotechnique(France)

Theintensityoflightofnaturalsceneshasadynamicrangethatcanbeover120dB.Classical3Tor4Tpixelarchitecturescoveronly60-70dB.CurrentworksonCMOSimageHighDynamicRange(HDR)sensorhaveledtodynamicrangeover120dBattheexpenseofmorecomplexarchitectures.Insomecases,thisleadstolowerFillFactororlargerpixelpitch.Theemergenceof3Dcircuitsmayhelptoovercomethoselimitations.MoreoverlargescaleimagesensormustfacetheincreaseinrequiredbandwidthandthisproblembecomesmoreacutewithHDRimages.Inthispaper,weproposeanoriginalarchitectureforextendingtheimagesensordynamicrangetogetherwithalocalcompressionofdatafora3Dcircuitimagesensor.Thetargetedcircuitiscomposedof2verticallystackedwaferswithapixelsizebelow5µmx5µm.TheproposedtechniqueforHDRisbased-onafloatingpointcoding.Afirstdatareductionisobtainedbyapplyingacommon4-bitexponenttoeachblockofpixels,referredtoasmacro-pixel.Foreachmacro-pixel,theoptimalexposureissetbyadynamicadaptationoftheintegrationtimeaccordingtothereceivedphotonquantity.Ittheoreticallyallowsreachingadynamicrangeequivalenttoabout20bits.Simulationresultsshowimageswithveryfewartefacts.Inordertofurtherreducetheamountofdata,anon-chipdatacompressionisperformedatthemacro-pixellevel.Indeed,acompactcompressionarchitectureimplementsacompressionalgorithmoneachblockofmacro-pixels.Onlythemantissaarrayiscompressedandthereducedexponentarraywithanexponentpermacro-pixelisstored.Thisnewconceptfeaturesagoodimagequality(PSNRofabout40dB)andahighdynamicrange(120dB)andshowsacompressionratioover75%,whilemaintainingacomplexitycompatiblewith3Dcircuits.Finally,furtherworksuchasA/Dconversionisdiscussed.


Improvement for sensitivity of biosensor with total internal reflection imaging ellipsometry (TIRIE)L.Liu,InstituteofMechanics(China);Y.Chen,SuzhouInstituteofNano-techandNano-bionics(China);Y.Meng,S.Chen,G.Jin,InstituteofMechanics(China)

Thebiosensorbasedonthetotalinternalreflectionimagingellipsometry(TIRIE)isrealizedasanautomaticanalysismethodforproteininteractionprocessesinreal-time,withhighthroughputandlabel-free.Anevanescentwaveisusedastheopticalprobetomonitorbio-molecularinteractionsonachipsurfacewithahighsensitivityduetoitsphasesensitiveproperty.Inthispaper,thetechniqueisoptimizedwithapolarizationsetting,aspectroscopiclightsourceandalownoiseCCDdetectortoimprovetheperformanceofthebiosensorinsensitivityanddetectionlimit,asevidencedbyaquantitativedetectionofhepatitisBvirussurfaceantigen(HbsAg)withconcentrationsof8,16,32,64,125and250ng/mL.Thesensitivityisincreasedbyoneorderofmagnitudeandthedetectionlimithasbeenextendedmorethan50timesforHbsAgdetection.

7875-01, Session 1

Single-chip color imaging for UHDTV camera with a 33M-pixel CMOS image sensorR.Funatsu,T.Yamashita,K.Mitani,Y.Nojiri,NHKScience&TechnicalResearchLabs.(Japan)


IS&T /

ReturntoContents

Wehavebeenresearchinganultrahigh-definitiontelevision(UHDTV)camerawitharesolution16timeshigherthanthatofHDTVresolution.TodevelopaUHDTVcamerathatiscompactandhashighmobility,weinvestigatedtheuseofa33M-pixelCMOSimagesensortoprovidesingle-chipcolorimaging.ThesensorhasaBayercolorfilterarray(CFA)anditsoutputsignalformatiscompatiblewiththeconventionalUHDTVcamerathatusesfour8M-pixelCMOSimagesensors.WefirstcalculatedthetheoreticalMTFcharacteristicsofthesingle-chipcameraandoftheconventionalfour-8M-pixelCMOScamera.WethenstudiedtheBayerCFAdemosaicingusedforthesingle-chipUHDTVcamera.Finally,wedevelopedanexperimentalpick-upsystemforsingle-chipimagingwitha33M-pixelcolorCMOSimagesensor.Theexperimentalresultsshowedthattheresolutionisequivalenttoorsurpassesthatoftheconventionalfour-8M-pixelCMOScamera.WeconfirmedthepossibilityofapracticalcompactUHDTVcamerathatmakesuseofsingle-chipcolorimaging.

7875-02, Session 1

On the design of multispectral color filter arraysJ.Y.Hardeberg,R.Khan,R.Shrestha,GjøvikUniv.College(Norway)

Inthepastfewyearstherehasbeenasignificantvolumeofresearchworkcarriedoutinthefieldofmultispectralimageacquisition.Thefocusofmostofthisresearchhasbeentofacilitateatypeofmultispectralimageacquisitionsystemsthatusuallyrequiresmultiplesubsequentshots(e.g.systemsbasedonfilterwheels,liquidcrystaltunablefilters,oractivelighting).

Recently,analternativeapproachforone-shotmultispectralimageacquisitionhasbeenproposed,basedonanextensionofthemuchusedcolorfilterarray(CFA)tousingmorethantheconventionalthreeRGBchannels,wecanthusintroducetheconceptofmultispectralcolorfilterarray(MCFA).Butthisfieldhasnotbeenmuchexplored,particularlylittlefocushasbeengiventodevelopingsystemswhichfocusesonthereconstructionofscenespectralreflectances.

BaoneandQi[1]madeaproposalofsuchanMCFAbasedimagingsystem,whosemainpurposewasclassification,wheretheyused4bandsinthemiddleandlowwaveinfraredregioninadditiontotheconventionalthreeinthevisiblespectrum.Theyalsoproposedanewdemosaickingalgorithmthattriedtobetterrestoretheimagebymaximizinga-posterioriprobability.AveryrecentworkbyLuetal.[2]alsofocusedonconstructingMCFAstocaptureaNIRbandalongwiththevisiblebands.ThepurposeofthisworkwasthesimultaneouscaptureofhighqualityvisibleandNIRimagepair.AnotherworkbyBrauersetal.[3]hasbeencarriedout,whichproposesanMCFAwithnarrowbandfiltersinthevisiblerange.Herealso,ademosaickingalgorithmhasbeenproposed,whichattemptstomakeuseoftheinter-bandcorrelationbylowpassfilteringofthechanneldifferences.

AMCFAbasedmultispectralcameraintroducesseveraldesignissuesthatneedtobehandled,evenwithoutworryingaboutpossibleandprobableissuesrelatedtotheeventualrealproductionofimagingsensorsandsystems.Notableonesincludethechoiceofthenumberoffiltersandtheirselectionfortheacquisitionsystem,thespatialarrangementofthefilters,andthedemosaickingalgorithm.Inthepresentworkwefocusontheissuewhichhasprobablyreceivedtheleastattentionsofar,namelythespatialarrangementofthefilterarray.

ForconventionalCFAsthewell-knownBayermatrix[4]hasenjoyedatremendoussuccess,althoughalternativeapproachesdoexist.ForMCFAs,recentlyMiaoetal.[5]proposedamethodforthespatialarrangementofthefiltersbasedontheprobabilityofappearanceofthecorrespondingbands.

Inthispaperwehaveusedthealgorithmproposedin[5]toconstructMCFAsofdifferentsizes.Wehavesimulatedacquisitionsofseveralspectralscenesusing6,8,and10-channelsystems,andcomparedtheresultswiththoseobtainedbytheconventionalregularMCFAarrangementproposedin[3],evaluatingtheprecisionofthereconstructedscenespectralreflectancesintermsofspectralRMSerror,goodness-of-fitcoefficient(GFC)andcolorimetricCIEDE2000colordifferences.Usingtheproposedapproachwesignificantlyimprovetheprecision,inparticularforaneight-channelMCFAwe

reducetheaverageCIEDE2000colordifferencebyupto50%.

Inconclusion,webelievethatMCFA-basedsystemscanbeaviablealternativeforaffordableacquisitionofmultispectralcolorimages,inparticularforapplicationswherespatialresolutioncanbetradedoffforspectralresolution.Wehaveshownthatthespatialarrangementofthearrayisanimportantdesignissue.

[1]GauravA.BaoneandHairongQi.“Demosaickingmethodsformultispectralcamerasusingmosaicfocalplanearraytechnology,”inSpectralImaging:EighthInternationalSymposiumonMultispectralColorScience,SPIEProceedings,volume6062,pages75-87,January2006.

[2]YueM.Lu,ClémentFredembach,MartinVetterli,andSabineSüsstrunk,“Designingcolorfilterarraysforthejointcaptureofvisibleandnear-infraredimages,”inProc.IEEEInternationalConferenceonImageProcessing(ICIP),November2009.

[3]JohannesBrauersandTilAach.Acolorfilterarraybasedmultispectralcamera.InProc.12.WorkshopFarbbildverarbeitung,GermanColorGroup,Ilmenau,October2006.

[4]BryceE.Bayer,“ColorImagingArray,”USPatent3,971,065,1976

[5]L.Miao,H.Qi,andW.Snyder,“Agenericmethodforgeneratingmulti-spectralfilterarrays,”IEEEInternationalConferenceonImageProcessing(ICIP),October2003.

7875-03, Session 1

Spectral-based calorimetric calibration of a 3CCD color camera for fast and accurate characterization and calibration of LCD displaysR.Safaee-Rad,QualcommInc.(Canada);M.Aleksic,QualcommInc.(UnitedStates)

LCDdisplaysexhibitsignificantamountofvariabilityintheirtone-responses,colorresponsesandbacklight-modulationresponses.Asaresult,afastandefficientsystemforafullLCDdisplay(notjustthepanelcenter)characterizationandcalibrationisrequired.Herein,asystembasedona3CCDcalorimetrically-calibratedcameraispresentedwhichcanbeusedforfullcharacterizationandcalibrationofLCDdisplays.Thecameracanprovidetri-stimulusmeasurementsoverthousandsoflocationsonaLCDdisplay(ashighascameratotalpixelcount)inrealtime--cameraframerate(33ms).Toachievehigh-degreeofaccuracy,colorimetriccalibrationofcameraiscarriedoutbasedonspectralmethod.

7875-04, Session 2

Optimizing quantum efficiency in a stacked CMOS sensorR.S.Hannebauer,LumiensePhotonics,Inc.(Canada);S.Yoo,HanVisionCo.Ltd.(Korea,Republicof);D.L.Gilblom,A.D.Gilblom,AlternativeVisionCorp.(UnitedStates)

Optimizingquantumefficiencyofimagesensors,whetherCCDorCMOS,hasusuallyrequiredbacksidethinningtobringthephotonreceivingsurfaceclosetothechargegenerationelements.AnewCMOSsensorarchitecturehasbeendevelopedthatpermitshigh-fill-factorphotodiodestobeplacedatthesiliconsurfacewithouttheneedforbacksidethinning.Thephotodiodeaccessprovidedbythisarchitecturepermitsultra-shallowfrontsideimplants,theapplicationofhighly-effectiveanti-reflectioncoatingsontheinputsurfaceandconstructionofamirrorinsidethesiliconbelowthephotodiodestoeffectivelydoublethethicknessofthesiliconchargegenerationvolume.SecondarybenefitsofthisarchitectureincludepreventionoflightfromreachingtheCMOScircuitryunderthephotodiodesandimprovementofoverallquantumefficiency.

Asensorwasconstructedwith4096x4096pixels4.8µmsquarewith95%fillfactorand100,000electronfull-wellcapacitybackedwithamirrortunedtothe400-700nmvisibleband.Amulti-layeranti-

Conference 7875: Sensors, Cameras, and Systems for Industrial, Scientific, and Consumer Applications XII


IS&T /

ReturntoContents

reflectancecoatingwasappliedtotheinputsurfacewithareflectivityinthevisibleoflessthan2%.Theresultwasmeasuredquantumefficiencyexceeding85%throughthevisible.Theblockingactionofthemirrorresultedinanextinctionratiofortheglobalshutterexceeding1,000,000:1.

7875-06, Session 2

Detailed characterisation of a new large area CCD manufactured on high resistivity siliconM.S.Robbins,P.Mistry,P.Jorden,e2vtechnologiesplc(UnitedKingdom)

e2vtechnologieshasdeveloped“Hi-Rho”devicesmanufacturedonveryhighresistivitysilicon.Specialdesignfeatureshavebeenincludedthatenableextremelyhighgatetosubstratepotentialstobeappliedwithoutsignificantcurrentleakagebetweenbackandfrontsubstrateconnections.Theapproachtakenallowstheusualdesignrulesforlownoiseoutputamplifiercircuitrytobefollowedthuslownoisedevicesverysensitivetoredandnearinfraredwavelengthscanbemanufactured.Thispaperreportsonthedetailedcharacterisationofthelargeformat“Hi-Rho”sensordesignedforastronomicalapplicationsandextendsthedatapreviouslyreportedtoincludedetailedassessmentoftheCTE,spatialresolution,darksignalandcosmeticquality.Theinfluenceofthebasematerialhasalsobeeninvestigatedwithdeviceshavingbeenmanufacturedonsiliconfromtwodifferentmanufacturers.New,detailedmeasurementsofthequantumefficiencyofdevicesutilisinganewlydevelopedantireflectioncoatingprocessarepresented.

7875-07, Session 2

Simulating enhanced photo carrier collection in the multifinger photogate active pixel sensorsP.V.R.Kalyanam,G.H.Chapman,A.M.Parameswaran,SimonFraserUniv.(Canada)

InourcurrentworkweuseanextensivesetofopticaltoolsprovidedbytheSentaurusdevicesimulatorsuitetosimulatethemultifingerphotogatedesignswithopticalillumination.Firsttheopticalgenerationprofileisextractedforallthelayersofthedevice.Thisprofileisimplementedwhensolvingfortheelectricalcharacteristicsofthedeviceindevicesimulations.Carriercollectionandaccumulationisstudiedbyintegratingthegeneratedphotocarriersovertime.Byshowingtheopticalgenerationandchargecollectionfordifferentwavelengthsoflight,weobservethebehaviourofthe7fingerdesigninthe0.5µmcase,9-fingerfor0.25µmand11-fingerfor0.18µm.Thesedesignswhichwereestimatedofhavinghighersensitivityratiosinourpreviousworksareobservedunderopticalilluminationtofindtheexactoptimummultifingerdesignwiththemaximumefficiency.

7875-08, Session 3

An introduction to the atmospheric imaging assembly (AIA) on the Solar Dynamics Observatory (SDO)A.M.Title,LockheedMartinSpaceSystemsCo.(UnitedStates)

Noabstractavailable

7875-10, Session 3

Correcting distortion and braiding of micro-images from multi-aperture imaging systemsA.Oberdörster,A.Brückner,F.C.Wippermann,A.Bräuer,Fraunhofer-InstitutfürAngewandteOptikundFeinmechanik(Germany)

Multi-apertureimagingsystemsinspiredbyinsectcompoundeyespromiseadvancesinbothminiaturizationandcostreductionofdigitalcamerasystems.Insteadofasinglelensstackwithsizeandsagintheorderofafewmillimeters,theopticalsystemconsistsofanarrayofmicrolenses.Atagivenfieldofviewofthecompletesystem,thefocallengthofthemicrolensesisafractionofthefocallengthofasingle-aperturesystem,reducingtracklengthandincreasingdepthoffieldsignificantly.Aseachmicroimageonlyspansasmallfieldofview,theopticalsystemscanbesimple.Becausethemicrolenseshaveadiameterofhundredsofmicronsandasagoftensofmicrons,theycanbemanufacturedcost-effectivelyonwaferscaleandwithhighprecision.However,reachingasufficientresolutionforapplicationssuchascameraphoneshasbeenachallengesofar.

Wedemonstrateamulti-aperturecolorcamerasystemwithapproximatelyVGAresolution(700x550pixels)andatracklengthof1.4mm.TheMTFofthecompletesystem(opticsandimageprocessing)iscomparabletocurrentcommercialminiaturizedVGAcameramodules.Thealgorithmforcorrectingopticaldistortionofthemicrolensesandcombiningthemicroimagesintoasingleimageisthefocusofthispresentation.

7875-11, Session 3

An analog logarithmic number system subtractor for edge detection in logarithmic CMOS image sensorsD.R.Desai,TheUniv.ofAkron(UnitedStates);F.Hassan,OhioNorthernUniv.(UnitedStates);R.Veillette,J.Carletta,TheUniv.ofAkron(UnitedStates)

Thispaperdescribesthedesignofanalogcircuitrytoimplementlogarithmicnumbersystem(LNS)subtraction.Suchcircuitry,ifincorporatedinthereadoutcircuitryofalogarithmicCMOSimagesensor,wouldallowfortheon-chipcalculationofspatialderivatives,whileoperatingdirectlyonlogarithmically-scaledpixels.Thecircuitwasimplementedfora1.2umCMOSprocess.ThemaximumrelativeerrorattheoutputoftheLNSsubtractorforpixelcurrentsthatcorrespondtoanilluminationrangeofmorethanfourdecadesis6.25%.

7875-12, Session 4

A CMOS image sensor with draining only modulation pixels for fluorescence lifetime imagingZ.Li,K.Yasutomi,T.Takasawa,S.Itoh,S.Kawahito,ShizuokaUniv.(Japan)

Fluorescencelifetimeimagingisbecomingapowerfultoolinbiology.Acharge-domainCMOSFLIMchipusingapinnedphotodiode(PPD)andthepinnedstoragediode(PSD)withdifferentdepthofpotentialwellshasbeendevelopedbytheauthors.However,atransfergatebetweenPPDandPSDcauseschargetransfernoiseduetotrapsatthechannelsurface.Thispaperpresentsatime-resolvedCMOSimagesensorwithdrainingonlymodulationpixelforfluorescencelifetimeimaging,whichremovesthetransfergatebetweenPPDandPSD.Thetimewindowingisdonebydrainingwithadraininggateonly,whichisattachedalongthecarrierpathfromPPDtoPSD.ThisallowsustorealizeatrappinglesschargetransferbetweenPPDandPSD,leadingtoaverylow-noisetime-resolvedsignaldetection.Avideo-rate



IS&T /

ReturntoContents

CMOSFLIMchiphasbeenfabricatedusing0.18µmstandardCMOSpinneddiodeimagesensorprocess.Thepixelarrayhas200(Row)×256(Column)pixelsandthepixelpitchis7.5µm.ThesignalintensityofthePSDasafunctionoftheTDgatevoltageismeasured.TheratioofthesignalfortheTDofftothesignalfortheTDonis212:1.

7875-14, Session 4

Development of biosensor based on imaging ellipsometry and its applicationsG.Jin,InstituteofMechanics(China)

Sofar,combinedwithamicrofluidicreactorarraysystem,aserviceableengineeringsystemofbiosensorbasedonimagingellipsometryisinstalledforbiomedicalapplications,suchasantibodyscreen,hepatitisBmarkersdetection,tumormarkersspectrumandvirusrecognition,etc.Furthermore,thebiosensorintotalinternalreflection(TIR)modehasbeimprovedbyaspectroscopiclight,optimizationsettingsofpolarizationandlownoiseCCDwhichbringsanobviousimprovementof10timeincreaseinthesensitivityandSNR,and50timeslowerconcentrationinthedetectionlimitwithathroughputof48independentchannelsandthetimeresolutionof0.08S.

7875-15, Session 4

Study on colony image acquisition and analysis systemZ.Jia,W.Wang,HenanPolytechnicUniv.(China)

Forcountingofbothcoloniesandplaques,thereisalargenumberofapplicationsincludingfood,dairy,beverages,hygiene,environmentalmonitoring,water,toxicology,sterilitytesting,AMEStesting,pharmaceuticals,paints,sterilefluidsandfungalcontamination.Recently,manyresearchersanddevelopershavemadeeffortsforthiskindofsystems.Byinvestigation,someexistingsystemshavesomeproblemssincetheybelongtoanewtechnologyproduct.Themainproblemsareimageacquisitionandimagesegmentation.Inordertoacquirecolonyimageswithgoodquality,anilluminationboxwasconstructedas:theboxincludesfrontlightningandbacklightning,whichcanbeselectedbyusersbasedonpropertiesofcolonydishes.Withtheilluminationbox,lightningcanbeuniform;colonydishcanbeputinthesameplaceeverytime,whichmakeimageprocessingeasy.AdigitalcamerainthetopoftheboxconnectedtoaPCcomputerwithaUSBcable,allthecamerafunctionsarecontrolledbythecomputer.Inthispaper,thedevelopedcolonyimagesegmentationalgorithmconsistsofthesub-algorithms:(1)imageclassification;(2)imageprocessing;and(3)colonydelineation.Thecolonydelineationalgorithmmaincontain:theproceduresbasedongreylevelsimilarity,onboundarytracing,onshapeinformationandcolonyexcluding.Inaddition,anumberofalgorithmsaredevelopedforcolony.

7875-16, Session 5

Aging effects on image sensors due to terrestrial cosmic radiationG.GangadharanNampoothiri,A.J.P.Theuwissen,TechnischeUniv.Delft(Netherlands);M.Horemans,Consultant(Belgium)

Weanalyzethe“ageing”effectsonimagesensorsintroducedbyneutronspresentinterrestrialcosmicenvironment.Defectsdevelopduringthelifetimeofimagersanddonotdisappear,limitingtheimagingperformance.Itishypothesizedthattheageingphenomenonisduetotheinfluenceofterrestrialcosmicrays,whicharetheresultofveryhighenergyparticlescreatedinspaceorbythesun.Inapreviousworkwehavecomparedpost-flightmeasurementsataviationaltitudestothatofsealevelandpresentedactivationenergyanalysisofthesensors.Forthefirsttime,hotpixeldevelopmentatsealevel(terrestrialcosmicradiationenvironment)iscorroboratedsuccessfullywithacceleratedneutronbeamtestsforvariousimagesensoroperatingconditions.

GroupofimagesensorswereirradiatedintheANITA(Atmospheric-likeNeutronsfromThickTarget)beamatTheSvedbergLaboratory(TSL),Swedentofurtherunderstandtheunderlyingmechanisms.Influenceofneutronflux(doserate)andbiasingonhotpixeldevelopmentisalsoreported.Theseexperimentsprovidefurthervalidationtothehypothesisthattheprominentcauseofhotpixelsisdisplacementdamageinthesiliconbulkduetoneutronradiation,introducedbysecondarycosmicrays.

7875-17, Session 5

Nonlinear time dependence of dark current in charge-coupled devicesR.Widenhorn,J.Dunlap,E.Bodegom,PortlandStateUniv.(UnitedStates)

Itisgenerallyassumedthatcharge-coupleddevice(CCD)imagersproducealinearresponseofdarkcurrentversusexposuretimeexceptnearsaturation.WefoundalargenumberofpixelswithnonlineardarkcurrentresponsetoexposuretimetobepresentintwoscientificCCDimagers.Thesepixelsarefoundtoexhibitdistinguishablebehaviorwithotheranalogouspixelsandthereforecanbecharacterizedingroupings.DatafromtwoKodakCCDsensorsarepresentedforexposuretimesfromafewsecondsuptotwohours.Linearbehavioristraditionallytakenforgrantedwhencarryingoutdarkcurrentcorrectionandasaresult,pixelswithnonlinearbehaviorwillbecorrectedinaccurately.

7875-18, Session 5

Tradeoffs in imager design parameters for sensor reliabilityG.H.Chapman,J.Leung,SimonFraserUniv.(Canada);Z.Koren,I.Koren,Univ.ofMassachusettsAmherst(UnitedStates)

Imagesensorsarecontinuouslysubjecttothedevelopmentofin-fieldpermanentdefectsintheformofhotpixels.Basedonlaboratorymeasurementsofdefectratesin23DSLRs,2midsizecamerasand11cellphonecameras,weshowinthispaperthattherateofthesedefectsdependsonthetechnology(APSorCCD)andondesignparametersthelikeofimagerarea,pixelsize,andgain(ISO).Increasingtheimagesensitivity(ISO)(from400uptoto25,600ISOrange)causesthedefectstobemorenoticeable,withsomegoingintosaturationandatthesametimeincreasesthedefectrate.Partiallystuckhotpixels,whichhaveanoffsetindependentofexposuretime,makeup>40%ofthedefectsandareparticularaffectedbyISOchanges.Comparingdifferentsensorsizeshasshownthatthedefectratedoesnotscaleentirelylinearly.Measuringimagerswithdifferentpixelsizes(from7.5to2.2microns)hasdemonstratedthatdefectratesgrowrapidlyaspixelareashrinks.Thesedefectratetrendsresultininterestingtradeoffsinimagerdesign,allowingthedesignertodeterminethespecificimagerparametersbasedontheimager’sdesignatedfunctionandreliabilityrequirements.

7875-19, Session 5

Dark noise in a CMOS imager pixel with negative bias on transfer gateH.Yamashita,ToshibaCorp.(Japan);M.Maeda,S.Furuya,T.Yagami,ToshibaMaterialsCo.,Ltd.(Japan)

Severalreportsonnegativebiasontransfergatein4-transistorCMOSimagerpixelhavealreadybeenpublished[1][2].Theadvantageofthenegativebiasontransfergateisadrasticdarkcurrentreductionduetoatransfergatechannelsurfacepinningbyaccumulatedholes.Butitwasalsoreportedthatfurtherloweringofnegativegatebiasbeyond-0.9Vinturnincreasesdarkcurrent[2].Theanalysisofthedarkcurrentincreasecausedbythenegativegatebiasonthetransfergatewasreported,byinvestigatinghotpixeldarkoutputdependency



IS&T /

ReturntoContents

bothonbiasandontemperaturewithtestpixelarray[3].TheproposedmechanismforthedarkcurrentincreaseinthereportisGate-Induced-Leak(GIL)Trap-Assisted-Tunneling(TAT).ThedarkcurrentobservedwhenlargenegativegatebiasisappliedfollowsI_dark~A*EXP(Vm/V0)2,whereAisaproportionalityfactor,V0isthresholdvoltage,andVmisdifferencebetweenfloatingdiffusion(FD)voltageandgatevoltage[3].TheexponentialdependenceonVmsuggeststhattheTATgeneratedbyhighelectricfieldinFDnearreadgateisthecauseofthedarkcurrent.ElectricfiledinaFDtendstobelargerwhenpixelsizeisreduced,becauseatransfergatescaledownisnecessaryinasmallsizepixel.ThusbothfullunderstandingofthegenerationmechanismofTATdarkcurrentinFDandaproposaltoreducethedarkcurrentisofvalueforfuturepixelsizereduction.Inthiscontribution,thedetailedanalysisbothwithexperimentalresultsandwithdevicesimulationshowsthedarkcurrentisdominatedbytheelectricfieldgeneratedatthetransfergateedgeoverlappedbyaFDn+.ItalsoisreportedthatthereductionoftheelectricfieldinFDbychangingthedopantprofileofFDdrasticallyreducesthedarkcurrent.

ThekeyparameterdominatingtheTATdarkcurrentiselectricfieldinFD.ThedevicesimulationresultsshowthatmaximumelectricfiledafterFDresetisinducedinsiliconsurfaceattheFDedgebelowtransfergate.Thegenerationofhighelectricfieldisattributedtothelargevoltagedifferencebetweenthehighconcentrationholelayerundertransfergateandhighconcentrationn+layerattheFDedgebelowtransfergate.ThereductionofdarkoutputlevelofhotpixelhasbeenobservedwhenthedopantconcentrationatthesurfaceofFDislowered.ThedevicesimulationshowsthatmaximumelectricfiledattheFDedgebelowtransfergateisreducedwhentheFDdopantconcentrationislowered.Italsoisshownthatthedarkoutputlevelofhotpixelcanbescaledwithlocalmaximumelectricfield.TheresultsprovethatTATisthedominantcauseofthedarkoutputincreasewhennegativegatebiasisappliedontransfergate.

[1]H.Han,etal,“EvaluationofaSmallNegativeTransferGateBiasonthePerformanceof4TCMOSImageSensorPixel”,InternationalImageSensorWorkshop,2007pp238

[2]B.Mheen,etal,“NegativeOffsetOperationforFour-TransistorCMOSImagePixelsforIncreasedWellCapacityandSuppressedDarkCurrent”ElectronDeviceLettersVol.29,No.4,April,2008,pp347

[3]H.Yamashitaetal.,“AnalysisofDarkCurrentin4-TransistorCMOSImagerPixelwithNegativeTransfer-GateBiasOperation”,InternationalImageSensorWorkshop2009,Session04-1

7875-20, Session 5

Image sensor noise: you love it or you hate it!A.J.P.Theuwissen,HarvestImaging(Belgium)

Asoftwaretoolisdevelopedthatallowstosimulatesolid-stateimagesensorsbasedontheirspecification.TheoutputofthesimulatorisasetofIMAGES.Bymeansofthetoolthecameraengineergetsadirectviewoftheimagesthesensorisabletocreate.Agreatadvantageofthissimulationtoolistochecktheinfluenceofalldifferentfixed-patternnoisesources,temporalnoisesourcesandperformanceparameters,aswellastheirimpactontheimagequality.

Nexttothesimulationtool,asecondtoolisdevelopedthatusesIMAGESastheinputandextractsthesensorparameters.Examplesofextracteddata:fixed-patternnoisecomponents,temporalnoisecomponent,conversiongain,qunatumefficiency,darkcurrent,etc.Theinputimagescanbethesimulatedonesgeneratedbythesimulatortool,orcanberealimagesfromexistingsensorsorcameras.

BothsoftwaretoolscanhandleCCDandCMOSdevices,coloraswellasmonochromedevices.

7875-21, Session 6

The early history of CCDsM.M.Blouke,PortlandStateUniv.(UnitedStates)

Noabstractavailable

7875-22, Session 6

3D ranging with a single-photon imaging arrayS.Bellisai,F.Guerrieri,PolitecnicodiMilano(Italy);S.Tisa,MicroPhotonDevicesS.r.l.(Italy);F.Zappa,PolitecnicodiMilano(Italy)andMicroPhotonDevicesS.r.l.(Italy)

Severalapplicationsrequiresystemsfor3Drangingacquisition,wherebothhighframe-rateandhighsensitivity(foreitherverydarkenvironmentsoropaqueobjects)areamust.Weexploitedamonolithicchipwith32x32Single-PhotonAvalancheDiodesmart-pixelsfor3DrangingapplicationsbasedonanIndirectTime-of-Flight(iTOF)technique.ThesceneisilluminatedbyasinusoidalmodulatedLEDandthereflectedlightisacquiredbytheimagerindifferenttime-slots,formeasuringthephasedelayofoutgoingvs.incomingsignal,hencecomputingthedistancebetweenchipandobjectsinthescene.

All1024arraypixelsaresynchronouslyenabledbyaglobalgatesignal,whichallowsphotoncountinginwell-definedtime-slotswithineachframe.TheframedurationissetinaccordancetothedesiredSNR.Wereportonmeasurementsperformedonchipsfabricatedinastandardhigh-voltage0.35µmCMOStechnology,whichfeature40%photondetectionefficiencyat450nmand20%at650nm.Thesingle-photonsensitivityallowedtheuseofjustonesingleLEDat650nmand20MHzforacquiringascenewithamaximumdistanceof7.5m,withbetterthan10cmdistanceresolutionandhigherthan50frames/sframe-rate.

7875-23, Session 6

Linear arrays of single-photon detectors for photon counting and timingF.Guerrieri,PolitecnicodiMilano(Italy);S.Tisa,MicroPhotonDevicesS.r.l.(Italy);A.Tosi,S.Bellisai,B.Markovic,PolitecnicodiMilano(Italy);F.Zappa,PolitecnicodiMilano(Italy)andMicroPhotonDevicesS.r.l.(Italy)

Scientificexperimentsoftendemandthedetectionofveryweaklightsignalsathigh-speedortopreciselymeasurethetimeofarrivalofsinglephotons.ArraysofSingle-PhotonAvalancheDiodes(SPAD)areidealcandidateswhenhighsensitivityisrequiredtogetherwithhighframe-rateorprecisephoton-timingresolution.Wedesignedalinear32x1SPADarrayusingahigh-voltageCMOStechnologyabletoprovidebothgoodSPADperformanceandfastelectronics.

Duringframeacquisitionallpixelsworkinparallel,eachofthembeingequippedwithanythingnecessaryforphotoncounting.Thearrayarchitectureiscapableoffully-paralleloperationofallpixelsallowingfree-runningacquisitionathighframe-rate.Withalow-speed10MHzclockfrequency,onepixelisreadoutin100nswhilethewholearrayisreadoutin320ns,correspondingtoaframe-rateof312.5kframe/s.Theframe-ratecantopto4Mframe/swithaclockof128MHz.

Thephotontimingmodalityemploysthephotontime-of-arrivalinformationprovidedbyeachofthe32outputs.All32“timing”outputsfeedexternalTime-CorrelatedPhotonCountingboards.TheFull-WidthatHalf-Maximumusingveryshortlaserpulsesis55pswithfewkcpscountingrate.



IS&T /

ReturntoContents

7875-24, Session 6

A single photon sensitive fast ebCMOS camera system for multitarget tracking of single fluorophores: applications to nano-biophotonicsT.Cajgfinger,R.Barbier,A.Dominjon,E.Chabanat,D.QuangTuyen,C.Guérin,J.Houles,InstitutdePhysiqueNucléairedeLyon(France)

OurdevelopmentofacamerasystembasedonelectronbombardedCMOS(ebCMOS)deviceisconsistentwiththedemandsofapplicationssuchasfastreal-timemultitargettrackingoffluorescentdyesusedinfluorescencemicroscopyandnano-photonics.ThedesignandfabricationofaBackSideIlluminated(BSI)CMOS(160kPixels-10micronpitch)isoptimizedforsinglephotoelectrondetection.TheebCMOSshowsahighresolutionwithsinglephotonsensitivityata500Hzframerateandisusedasaproofofconceptofrealtimetrackingatsinglephotonsensitivityoffluorescentnano-particles.TheperformancesoftheebCMOSitselfintermsofspatialresolution,darkcountrate,singlephotonsensitivityandtruecountingcapabilityarepresented.Thefullcamerasystemisdescribed.Thenwepresentthemeasurementbasedonsinglephotondetectioncapabilityforspatio-temporalidentificationoffluorescenttargets(QuantumDots).TheaccuracyoflocalizationfordifferentnoiseandsignalconditionsismeasuredonQuantumDotsandonadedicatedopticaltestbench.Weconcludebygivingthemeasurementofthespeedlimitofatargetthatcanbefollowedbythecamerasystemasafunctionofphotonsignalandphotonnoise.

7875-25, Session 6

Monolithic single-photon detectors and time-to-digital converters for picoseconds time-of-flight rangingB.Markovic,PolitecnicodiMilano(Italy);S.Tisa,MicroPhotonDevicesS.r.l.(Italy);A.Tosi,F.Zappa,PolitecnicodiMilano(Italy)

Wepresentanovel“smart-pixel”abletomeasureandrecordin-pixelthetimedelay(photontiming)betweenaSTART(e.g.givenbythelaserexcitation,thecellstimulus,orthelidarflash)andaSTOP(e.g.thearrivalofthefirstreturningphotonfromthefluorescencedecaysignalorbackreflectionfromanobject).Suchsmart-pixelreliesofaSPADdetectorandaTime-to-DigitalConvertermonolithicallydesignedandmanufacturedinthesamechip.Manypixelscanbelaidoutinarowsbycolumnsarchitecture,togivebirthtoexpandable2Dimagingarraysforpicoseconds-levelsingle-photontimingapplications.Distancemeasurements,bymeansofthedirectTOFdetection(thesameusedinlidarsystems)providedbyeachsmart-pixel,canopenthewaytothefabricationofsingle-chip3Drangingarraysforscenereconstructionandintelligentobjectrecognition.

Wereportonthedesignandcharacterizationofprototypecircuits,fabricatedina0.35µmstandardCMOStechnologycontainingcompleteconversionchannels,smart-pixelsandancillaryelectronicswith20µmactiveareadiameterSPADdetectorsandrelatedquenchingcircuitry.Witha100MHzreferenceclock,theTDCprovidesatime-resolutionof10ps,adynamicrangeof160nsandveryhighconversionlinearity.

7875-30, Session 6

Human-technology interaction for IED detectionA.Zhang,Y.Zou,L.Wu,EYZtek,Inc.(UnitedStates);J.E.Fulton,NavalSurfaceWarfareCtr.CraneDiv.(UnitedStates)

Noabstractavailable



IS&T /

ReturntoContents

Conference 7876: Digital Photography VIIMonday-Tuesday24-25January2011PartofProceedingsofSPIEVol.7876DigitalPhotographyVII

7876-01, Session 1

High dynamic range image sensor architecturesB.Fowler,FairchildImaging(UnitedStates)

Digitalphotographerscontinuouslydemandmoreperformancefromtheirequipment.Digitalcameraperformanceisdefinedbyasetofparametersincludingdynamicrange,noise,framerate,resolution,andcolor.Amongsttheseparametersdynamicrangeisbecomingincreasinglymoreimportant.Thisistruebecausethehumaneyetypicallyhasawiderdynamicrangethanadigitalcamera.Inthispaperwedefinedynamicrangeastheratioofthemaximumtotheminimumsignalthatcanbedetected.AttheheartofalldigitalcamerasiseitheraCCDoraCMOSimagesensor(CIS).Thedynamicrangeofthesensortypicallylimitsthedynamicrangeofthecamera.

InthispaperwereviewfiveCISarchitecturesthataredesignedtoimproveddynamicrange.WestartbyreviewingstandardCCDandCISarchitecturesandthenpresentasimplesensormodel.Usingthismodelweshowhowsignaltonoiseratio(SNR)canbeusedtoevaluatedifferentwidedynamicrange(WDR)sensorarchitectures.Thenwesequentiallyreviewfivedifferentwidedynamicrangetechniques.ThefirstWDRtechniqueismultiplegains,andthesecondtechniqueisnon-linearpixelresponse.Thethirdtechniqueisvariableexposure,andtheforthtechniqueiswellcapacityrecycling.Thefifthandfinaltechniqueistimetosaturation.Foreachofthesetechniqueswepresentthepixellevelcircuitryanditsadvantagesanddisadvantages.Furthermore,allofthesetechniquesarecomparedbasedonSNRandimplementationcomplex.Wediscusshowimplementationcomplexityaffectssignalprocessinginadigitalcamera,andotherparametersinthesensorsuchasquantumefficiencyandreadnoise.Weconcludewithafewsummarycomments.

7876-02, Session 2

Bayer and panchromatic color filter array demosaicing by sparse recoveryM.Aghagolzadeh,A.AbdolhosseiniMoghadam,H.Radha,MichiganStateUniv.(UnitedStates);M.Kumar,EastmanKodakCo.(UnitedStates)

TheutilityofCompressedSensing(CS)fordemosaicingofdigitalimageshavebeenexploredbyseveralrecentefforts[1,2and3].Mostrecently,aCompressiveDemosaicing(CD)[4]framework,basedonemployingarandompanchromaticColorFilterArray(CFA)atthesensingstage,hasprovidedcompellingCS-baseddemosaicingresultsbyvisuallyoutperformingotherleadingtechniques.Meanwhile,itiswellknownthattheBayerpatternisarguablythemostpopularCFAusedinlow-costconsumerdigitalcameras.Inthispaper,weexploreandcomparetheBayerandrandompanchromaticCFAstructuresusingagenericapproachfordemosaicingofimagesbasedonrecentadvancesinthefieldofCS.Inparticular,akeyobjectiveofthisworkistoprovideacomparativeanalysisbetweenthesetwoCFApatterns(Bayerandrandompanchromatic)underthegeneralumbrellaofsparserecovery,whichrepresentsthecornerstoneofCS-baseddecoding.WedemonstratetheviabilityoftheBayerpatternundercertainCSconditions.Meanwhile,weshowthatarandompanchromaticCFA,whichmeetscertainincoherenceconstraints,canvisuallyoutperformaBayerbasedsparserecovery.Asillustratedinoursimulationresults,apanchromaticCFAismoreconsistentintermsofprovidingbettervisualqualitywhentestedonawiderangeofcolorimages.

REFERENCES

[1]J.Mairal,M.Elad,andG.Sapiro,“Sparserepresentationforcolorimagerestoration,”IEEETransactionsonImageProcessing,Vol.17,2008.

[2]J.Mairal,F.Bach,J.Ponce,G.SapiroandA.Zisserman,“Non-LocalSparseModelsforImageRestoration,”InternationalConference

onComputerVision,Tokyo,Japan,2009.

[3]P.NageshandB.Li,“CompressiveImagingofColorImages”.IEEEICASSP,Taipei,Taiwan,09.

[4]A.A.Moghadam,M.Aghagolzadeh,M.KumarandH.Radha,“Compressivedemosaicing,”acceptedinIEEEInternationalWorkshoponMultimediaSignalProcessing,Saint-Malo,France,2010.

7876-07, Session 3

Implementation of a multispectral color imaging device without color filter arrayG.Langfelder,A.F.Longoni,F.Zaraga,PolitecnicodiMilano(Italy)

Multispectralacquisitionofdigitalimagesisinterestingforseveralapplicationsasawaytoimprovetheaccuracyincolorreproduction.Withrespecttocolorimetric-basedimaging,wheretypicallyasetofthreecolorfilterarraysisused,multichannelacquisitionallowsestimatingthescenespectralreflectance.

InthisworkwefirstreviewtheworkingprincipleoftheTransverseFieldDetector(TFD),aproposedsensorforcoloracquisitionwithoutCFA.Initssimplestgeometricalandbiasingconfiguration,aTFDpixelimplementsasetofthreespectralresponses.Thankstoitsbasicworkingprinciple,thespectralresponsesimplementedonthedevicedependonthebiasingconfigurationandcanbetunedbychangingthevoltagesappliedtothecollectingelectrodes.ExperimentalresultsonaTFD,ingoodagreementwithelectrondevicesimulations,areshown.

Wethenpresentdetailedsimulationresultsonanimprovedstructure,wheretemporalorspatialpixeltunabilitycanbeusedtogeneratedifferentspectralresponses.WethenshowaTFDpixeltunedinanon-symmetricconfiguration,aconceptthatallowsincreasingthenumberofdifferentspectralresponsesofthedevice,avoidingtheneedforaspatiallyortemporallyseparatedacquisition.A5to6micronwideTFDpixelcanbeusedtoimplement5to7differentspectralresponsesinasingleacquisitionatfullresolution.

7876-08, Session 3

One shot multispectral color imaging with a stereo cameraR.Shrestha,J.Y.Hardeberg,GjøvikUniv.College(Norway);A.Mansouri,Univ.deBourgogne(France)

Multispectralcolorimagingisapromisingtechnology,whichcansolvemanyoftheproblemsoftraditionalRGBcolorimaging.However,itstilllackswidespreadandgeneralusebecauseofitsownlimitations.Stateoftheartmultispectralimagingsystemsneedmultipleshotsmakingthemnotonlyslowbutalsoincapableofcapturingscenesinmotion.Moreover,thesystemsaremostlycostlyandcomplextooperate.Thispurposeoftheworkdescribedinthispaperistoconceiveafastandpracticalsix-channelmultispectralcolorimageacquisitionsystemusingastereocameraandapairofopticalfilters.Thebestpairoffiltersisselectedfromamongreadilyavailablefilterssuchthattheymodifythesensitivitiesofthetwocamerasinsuchawaythattheygetspreadreasonablywellspacedthroughoutthevisiblespectrum.Asthecamerasareinastereoscopicconfiguration,thesystemiscapableofacquiring3Dimagesaswell,andstereomatchingalgorithmsprovideasolutiontotheimagealignmentproblem.Thusthesystemcanbeusedasa“two-in-one”multispectral-stereosystem.BothsimulationsandexperimentshaveshownthattheproposedsystemperformsbetterthantheRGBsysteminscenespectralreflectancereconstructionaswellasinscenecolorreproduction.


IS&T /

ReturntoContents

7876-09, Session 3

Multispectral image invariant to illumination colour, strength, and shadingM.S.Drew,A.Yazdani,SimonFraserUniv.(Canada)

Wepresenthereamethodthatmakesuseofmultispectralimagedataandgeneratesanovel“photometric-invariantmultispectralimage”forthistypeofdata.ForRGB,an“invariantimage”hasbeenconstructedindependentofthecolourandintensityoftheilluminantandtoshading[ECCV04].Togeneratethisimageeitherasetofcalibrationimagesisrequired,orentropyinformationtakenfromasingleimagecanbeusedtodeveloptheparametersnecessarytoproducetheinvariant[IJCV09].Nonetheless,generatinganinvariantimageremainsacomplexanderror-pronetaskforRGBimagedata.Formultispectralimages,weshowthatphotometric-invariantimageformationisinessencegreatlysimplified:oneoftherequirementsforforminganinvariantisthenecessityofnarrowband-sensorsensors.Herethisisthecase,andweshowthatwiththesimpleknowledgeofpeaksensorwavelengthswecangenerateahigh-Dmultispectralinvariant:thePSNRisshowntobehighbetweentherespectiveinvariantmultispectralfeaturesformultispectralimagestakenunderdifferentilluminationconditions,showinglightinginvarianceforaper-pixelmeasure;andthes-CIELABerrormeasureshowsthatthecolourerrorbetweenthe3-Dcolourimagesusedtovisualizetheoutputinvarianthigh-Ddataisalsosmall.

7876-10, Session 3

Methods for spectral characterization of multispectral camerasJ.Klein,J.Brauers,T.Aach,RWTHAachen(Germany)

Highfidelitycolorimageacquisitionrequiresanaccuratecharacterizationofthecamera’sspectralsensitivitycurvestoperformcolorcalibrationorspectralestimation.Severalmethodshavebeenproposedtoperformthistask;theseincludecharacterizationsviatestcharts,narrowbandfiltersandmethodsutilizingamonochromator.Inmostpublications,RGBcamerasarecharacterized.Inthispaper,wedescribethecharacterizationofthespectralsensitivitycurvesofamultispectralcamerafeaturingsevenopticalbandpassfilters.Weshowtwodifferentmethodsforthecalibrationusingamonochromator-eitherbymeasuringthegrayscalesensorofthecameraandthefiltersseparatelyorbycharacterizingthemultispectralcameraasacompletesystem.Acomparisonofbothmethodsvalidatesthemeasurementresults.

Wefurthermoredevelopdifferentreconstructionmethods(maximumvaluemethod,principaleigenvectormethod,linearorWienerestimation).Weperformalsosimulationsofthecharacterizationprocesstoevaluatethemethodsandshowtheimpactofthebandwidthofthemonochromatorstimulionthereconstruction.

7876-11, Session 4

Evaluation of a hyper-spectral image database for color filter array design and demosaicking algorithmsM.Larabi,Univ.dePoitiers(France);S.E.Süsstrunk,EcolePolytechniqueFédéraledeLausanne(Switzerland)

Thedesignofcolorfilterarrays(CFA)andassociateddemosaickingalgorithmsisstillahottopictodayindigitalphotography,astheperfectspatialarrangementofthefiltersandtheirspectralcharacteristicshavealargeinfluenceonimagequality.Inthiswork,weproposetostudytheapplicabilityofthehyperspectralimagedatabaseproposedbyFosteretal.[1]forCFAanddemosaickingdesigntesting.Theevaluationofthedemosaicingalgorithmsisstudiedbyusingdifferentwell-knownmetricssuchasCPSNR,s-CIELABbutalsosubjectivelybyrunningapsychovisualexperimentwhereobserversareaskedtojudgethedemosaickedimagesandgivetheirvisualpreference.

7876-12, Session 4

Automatic annotation of outdoor photographsC.Cusano,R.Schettini,Univ.degliStudidiMilano-Bicocca(Italy)

Weproposehereastrategyfortheautomaticannotationofoutdoorphotographs.Imagesaresegmentedinhomogeneousregionswhicharethenassignedtosixdifferentclasses:sky,vegetation,snow,water,ground,andsand.Thesevisualcategoriesallowsforcontent-awareprocessingstrategies.Forinstance,theknowledgeaboutthepresenceofuniformlycoloredregions(suchasthesky)canbeusedtodrivecolorbalancingalgorithmswhichshouldignorethoseregions.Anotherexampleisprovidedbyedgesharpeningalgorithmswhichshouldavoidboostingtheedgesinhighfrequencyregions,suchasthosetypicallyfoundinthevegetationclass.Ourstrategydescriberegions,obtainedusinganormalizedcutsegmentationstrategy,usingajointhistogramofcolorandtextureinformation.Theclassificationisperformedbyamulti-classSupportVectorMachine.ThestrategyhasbeenevaluatedonimagestakenfromtheLabelMedataset.

7876-14, Session 5

How many pixels does it take to make a good 4”x6” print? Pixel count wars revisitedM.A.Kriss,Consultant(UnitedStates)

Thefallacyofimplyingthatmore,smallpixels,producesbetterimagesthanfewer,largerpixelsforagivensensorsizeisexploredindetailusingphotographicmodelsdevelopedinthe1970’sandmodifiedfordigitalimagesandbyexperimentsusingaconsistentsetofdigitalcamerasrangingfrom6millionpixelsto14millionpixelsincompactcamerasandusing6millionand12millionpixelsinSLRformatdigitalcameras.Boththemodelmetricsandexperimentalresultsclearlydemonstratethatsmallerpixels,assmallas1.4microns,producelowerqualityimagesthantheirlargercounterpartsevenifthepixelcountislowerforthelargersensors.Thesmallerpixelsalsointroducegreaternoise,lowertrueISOspeed(nocameragain)andsignificantlossinexposurelatitude.Thehighpixelcount,smallpixelcamerasalsoshowamuchlargertendencyforJPEGartifacts.Theresultsindicatethatdigitalphotographersshouldpickcamerasthatmeettheirneeds,beitstudioworkorsportsphotography,bypurchasingtherightcombinationofpixelcountandpixelsize.

7876-15, Session 5

A prototype high-speed CMOS image sensor with 10,000,000 burst-frame rate and 10,000 continuous-frame rateY.Tochigi,K.Hanzawa,Y.Kato,N.Akahane,R.Kuroda,S.Sugawa,TohokuUniv.(Japan)

Inthispaper,ahigh-speedCMOSimagesensorhavinganewarchitectureandanewoperatingprinciplehasbeendeveloped.Theimagesensorachievesboththecontinuouscapturingandtheburstcapturingbyasinglechip,andthisimagesensorhaslowpowerconsumption,lowheatgeneration,highsensitivityandhighS/Nratio.Thisimagesensorconsistofmainlyfourblocks,twodimensionalpixelarrayof4-trangisterCMOSactivepixel,analogmemoryarraysconnectedindependentlytothepixelarraybyeachpixeloutputline,scanningcircuitsandmultiplenumberofoutputamplifiers.Aprototypeimagesensorwasfabricatedusinga0.18um2-Poly3-MetalCMOStechnologywiththediesizeof5550um(H)x4575um(V),thepixelsizeof48um(H)x48um(V),thenumberofpixelsof72(H)x32(V),thenumberofanalogmemoriesof104memoriesperpixelandthe6parallelhorizontaloutputcircuitsandoutputamplifiers.Theapertureratiois35%andtheconversiongainis60uV/e-(inputreferred).Ithasbeenconfirmedthatthisimagesensorachieves10,000,000fpsduringburstcapturingmodeand10,000fpsduringthecontinuous

Conference 7876: Digital Photography VII


IS&T /

ReturntoContents

capturingmodethroughtheimagecaptureexperimentsofhighspeedphenomenasuchasrotatingobjectanddischargephenomenon.

7876-16, Session 5

Two-dimensional measurement of the lens optical transfer function from a digital imageD.P.Morgan-Mar,M.R.Arnison,C.A.Deller,P.A.Fletcher,K.G.Larkin,CanonInformationSystemsResearchAustraliaPty.Ltd.(Australia)

Thelensopticaltransferfunction(OTF)describestheresolutionandsharpnessofimagesformedthroughalens.WepresentanovelmethodforaccuratelymeasuringtheOTFofacameralensbydigitallyimagingatartantestpatterncontainingsinusoidalfunctionswithmultiplefrequenciesandorientations.Thetartanpatterncanbetunedtooptimisethemeasurementaccuracyforanadjustablesetofsparsespatialfrequencies.Themeasurementmethodisdesignedtobeaccurate,reliable,andfastinawiderangeofmeasurementconditions,includinguncontrolledlighting.WedescribethedesignofthetartanpatternandthealgorithmforestimatingtheOTFaccuratelyfromacaptureddigitalimage.WepresentsimulationresultswhichshowthatthetartanmethodhassignificantlybetteraccuracyformeasuringthemodulusoftheOTF(themodulationtransferfunction,orMTF)thantheISO12233standardslanted-edgemethod,especiallyathighspatialfrequencies.With1%simulatedimagingnoise,therootmeansquare(RMS)errorofthetartanmethodisonaverage5timessmallerthantheRMSerroroftheslanted-edgemethod.

7876-17, Session 6

Efficient defect pixel cluster detection and correction for Bayer CFA image sequencesT.Tajbakhsh,TechnischeUniv.Hamburg-Harburg(Germany)


7876-20, Session 7

Comparison of objective metrics for image sensor crosstalk characterizationA.Dokoutchaev,AptinaImagingCorp.(UnitedStates);H.Eliasson,SonyEricssonMobileCommunicationsAB(Sweden);F.Li,AptinaImagingCorp.(UnitedStates)

Imagesensorcrosstalkcanbedividedintospectralcrosstalkandpixelcrosstalk.Thispaperfocusesonthepixelcrosstalkanditseffectonsignaltonoiseratio(SNR).Pixelcrosstalkoccursinthespatialdomainandisduetothesignalleakagebetweenadjacentpixelseitherbyimperfectopticalisolationordiffusionofelectrons.Thiswillhaveanegativeimpactonimagequalitymainlyintwoways:spatialblurringanddecreasedSNRduetomoreaggressivecolorcorrectionrequired.Amethodformodelingthespectralbroadeningduetothepixelcrosstalkispresentedwhereamatrixiscalculatedfromcrosstalkkernelsrepresentingthespatialleakagebetweenneighboringpixels.Inordertoquantifytheamountofcrosstalkwepresentamethodinwhichratiosofintegralsofthesamecolorchannelbutwithindifferentwavelengthintervalsarecalculated.Thisprovidesametricthatismorerobustwithrespecttocolorchannelscaling.TostudytheimpactonSNRduetopixelcrosstalk,anumberofSNRmetricsarecomparedtoresultsfromalimitedpsychophysicalstudy.ThestudiedSNRmetricsarethemetricusedforcalculatingtheSNR10valueinmobileimaging,theISO12232noisemetricandametricwherethesignalistransformedintoorthogonalcoloropponentchannels,therebyenablingtheanalysisoftheluminancenoiseseparatefromthechrominancenoises.TheresultsindicatethattheISOtotalnoiseandSNR10metricyieldverysimilarresultsandthatthegreenchannelhasthelargestindividualimpactonthecrosstalk.

7876-21, Session 7

An image quality evaluation tool simulating image sensors including quantum efficiency off-axis effectC.Mornet,J.M.Vaillant,T.Decroux,N.Virollet,D.Herault,STMicroelectronics(France);I.Schanen,InstitutdeMicroélectroniqueÉlectromagnétismeetPhotonique(France)

TheimagequalityevaluationofCMOSsensorsisabigchallengeforcameraphonemanufacturers.Inthispaper,wepresentanupdateoftheImageQualityEvaluationTool,agraphicsuserinterfacesimulatingimagesensorstoassesstheperformanceofapixel.Thesimulatedimagesarecomputedfromoperatingconditionsandsensor’scharacteristicdatalikeQuantumEfficiencyincludingoff-axiseffect.SimulationofQEoff-axisimpacthasbeenbasedoncharacterizationdata.Themethoddoesnotrequireoptics,makingitsuitableforearlydesignphasesasforoptimizationsandinvestigations.Bothmeasurementandimplementationinthetoolwillbeexplained.TheQEdegradationwithangleeffect,especiallythenoiseincorners,willbehighlightedonsimulatedimages.AuniformgraysceneorcoloredimagesimulationfromQEoff-axismeasurementwillhelpengineerstocalculatepost-processingdigitalcorrectionlikecolorshadingcorrectionorcolorcorrectionmatrixversuspixelposition

7876-22, Session 7

Image quality assessment based on edgeX.Mou,M.Zhang,W.Xue,Xi’anJiaotongUniv.(China);L.Zhang,TheHongKongPolytechnicUniv.(China)

Theresearchonimagequalityassessment(IQA)hasbeenbecomeahottopicinmostareaconcerningimageprocessing.SeekingfortheefficientIQAmodelwiththeneurophysiologysupportisnaturallythegoalpeopleputtheeffortstopursue.Inthispaper,wearguethatcomparingtheedgespositionofreferenceanddistortedimagecanwellmeasuretheimagestructuraldistortionandbecomeanefficientIQAmetric,whiletheedgeisdetectedfromtheprimitivestructuresofimageconvolvingwithLOGfilters.Theso-calledNSERmetricisdesignedfollowingasimplelogicbasedonthecosinedistanceoftheprimitivestructuresandtwoaccessibleimprovements.Validationistakenbycomparisonofthewellknownstate-of-the-artIQAmetrics:VIF,MS-SSIM,VSNRoverthesixIQAdatabases:LIVE,TID2008,MICT,IVC,A57,andCSIQ.ExperimentsshowthatNSERworksstablyacrossallthesixdatabasesandachievesthegoodperformance.

7876-23, Session 8

Method for evaluating tone mapping operators for natural high dynamic range imagesM.Kuhna,M.Nuutinen,P.Oittinen,AaltoUniv.SchoolofScienceandTechnology(Finland)

Thedynamicrangeofdigitalcamerashasbeenincreasinginrecentyears.HighDynamicRange(HDR)andespeciallytonemappinghasbeenanactivefieldofresearchforyears.WithcurrentimagesensortechnologiesthereishighfeasibilityforconsumerlevelsingleshotHDRcameras.TonemappingoperatorsareneededforrenderingHDRimagesonconsumerdisplays.Thisstudyfocusesonestablishingamethodforevaluatingtonemappingoperatorsintermsofimagequality.Thestudyisbasedontheobservationthatthetestimagesusedinsimilarstudiesinthepastlackimportantfeaturessuchashumanskintoneforevaluatingimagequalityusingsubjectivemethods.Themethodconsistsofimagecaptureandprocessingaswellassubjectiveevaluationincontrolledconditions.ThecurrentlevelofobjectivequalitymetricsforHDRimageswasalsobenchmarked.Objectivequalitymetricshavebeendevelopedandpublishedwidelytoeaseimagequalityevaluation,whichisoftenperformedwithextremelytimeconsumingsubjectivetests.



IS&T /

ReturntoContents

7876-24, Session 8

High dynamic range imaging of non-static scenesI.Hossain,B.Gunturk,LouisianaStateUniv.(UnitedStates)



Toward a quantitative visual noise evaluation of sensors and image processing pipes to improve color reconstructionC.Mornet,STMicroelectronics(France);D.J.Baxter,STMicroelectronics(R&D)Ltd.(UnitedKingdom);J.M.Vaillant,T.Decroux,D.Herault,STMicroelectronics(France);I.Schanen,InstitutdeMicroélectroniqueÉlectromagnétismeetPhotonique(France)

Theevaluationofsensor’sperformanceintermsofsignal-to-noiseratio(SNR)isabigchallengeforbothcameraphonemanufacturersandcustomers.Thefirstoneswanttopredictandassesstheperformanceoftheirpixelwhilethesecondsrequiresbeingabletobenchmarkrawsensorsandprocessingpipes.TheSNR10metricisverysensitivetocrosstalkwhereasforlow-lightissue,theweightofsensitivityshouldbeincreased.Toevaluatenoiseonfinalimage,theanalyticalcalculationofSNRonluminancechannelhasbeenperformedbytakingintoaccountnoisecorrelationduetotheprocessingpipe.However,thisluminancenoisedoesnotmatchtheperceptionofhumaneyewhichisalsosensitivetochromaticnoise.Alternativemetricshavebeeninvestigatedtofindavisualnoisemetricclosertothehumanvisualsystem.Theyhavebeenusedtoimprovecolorreconstructionbyoptimizingthecolorcorrectionmatrix:trade-offbetweenthesemetrics,coloraccuracyandsaturationhasbeenexplored.


Fidelity tolerance analysis for computational imaging systemC.Chang,Y.Chen,K.Chen,H.Tsao,H.Sung,C.Chang,P.Chen,H.Chang,IndustrialTechnologyResearchInstitute(Taiwan)

Inthepaper,wepresentananalysismethodusingforcomputationalimagingsystemwhichincludingaffectsofopticalaberrationsandfinitesamplingformimagesensor,andfidelitytoleranceanalysiswillbearchivedbysuitableimagemetric(peaksignaltonoiseratio,PSNR).ToleranceanalysisforcomputationalimagingsystemwhichconsideringsurfaceerrorofcubicphasemaskisillustratedandbehaviorofPSNRandpointspreadfunction(PSF)similarityinsuchkindofsystemisdiscussed.Finally,byusingPSNR,thecapabilityofextensiondepthoffocusandsurfacetoleranceofphasemaskcanbedetermined.


Noise-robust image deblurring by blending regular- and short-exposure imagesY.Tsuda,H.Hatanaka,S.Fukumoto,M.Ueda,SANYOElectricCo.,Ltd.(Japan);K.Chihara,NaraInstituteofScienceandTechnology(Japan)



Improving the sensitometric and OECF standards: recognizing the photosensitive exposure rangeM.G.Prais,Consultant(UnitedStates)

Thisarticledemonstratesthatandhowsensitometricandopto-electroniccharacteristicfunction(OECF)standardsshouldbechanged:ThesensitivitySofallphotosensitivearraysis--and,instandards,shouldbe--determinedbythemidtonephotosensitiveexposureofthearrayHmid,thebinarylogarithmofwhichis[log2(Hmax)+log2(Hmin)]/2.Thesequantitiesaredependentonthewidthofthephotosensitiveexposurerange∆,whichisdeterminedbythemeasuredminimumandmaximumusablephotosensitiveexposures,HmaxandHmin.ThereferenceexposureHo=S/Hspofaphotosensitivearrayis(and,instandards,shouldbe)determinedby∆ofthearray.Nevertheless,Ho,thespeedpointexposureHspandthesafetyfactor　arenolongerneededinthefaceofknowledgeofHmaxandHminorHmidand∆andshouldbeeliminated.Thesensitometricstandardforsolid-statearrays,ISO12232-2006,andtheOECFstandard,ISO14524-1999,shouldbechangedbecausetheyuseaphotosensitiveexposurerangewhichisinappropriateforsolid-statearrays.Finally,itshowsthatallcurrentstandardsestablishmidtonereflectancesRmidforstandardphotosensitivearraysthataremuchlessthanoft-touted18%makingmostreferencesto18%inappropriate.


Image enhancement technique using color and edge features for mobile imaging systemsW.Cho,T.Kim,SAMSUNGElectronicsCo.,Ltd.(Korea,Republicof)

Thepaperprovidesamethodofselectivelycontrollingthestrengthofimagenoisereduction(NR)andsharpeninginregionsassociatedwithspecificcolorsthatarevisuallyimpactonthehumanvisualsystem.Thesubjectivecolorqualityisoftenjudgedbyhowregionsclassifiedbyspecificcolorslookintheimagesthroughcolorcharacterizationbytheviewer.OurmethodcontrolsthesubjectivequalityofaspecificcolorregionbydeterminingthestrengthofsharpeningandNRoperation.Inaddition,theproposedalgorithmcarriesouttheregionalsegmentationsothatthealgorithmselectivelycancontrolthestrengthofNRandsharpeningforeachregion.However,sinceCISSoC(CMOSimagesensorSystemonChip)productscannotuseaffordablememoryduetothecostissue,theproposedalgorithmsuggestshowtheregionalsegmentationcanbedonewithoutanylinememory.Intheproposedmethod,pixelsarelabeledandthen,clusteredbyusingtheregionalinformationandthecolorproximity.TheedgeinformationalongwiththecolorcodedpixelsisusedforeffectiveNRandsharpeningofimages.Themaincontributionoftheproposedmethodincludes(i)amemorycolorclassifierinnormalizedcolorspaceforefficienthardwareoptimization,(ii)anedge-directedrun-lengthfilter(iii)fornoisereductionandsharpeningforvisualappearance.


Rectangular pixels for efficient color image samplingT.Singh,M.Singh,Consultant(UnitedStates)

WepresentCFAdesignsthatfaithfullycaptureimageswithspecifiedluminanceandchrominancebandwidths.PreviousacademicresearchhasmostlybeenconcernedwithmaximizingPSNRofreconstructedimageswithoutregardtochrominancebandwidthandcross-talk.Commercialsystems,ontheotherhand,paycloseattentiontoboththeseparametersaswellastothevisualqualityofreconstructedimages.Theycommonlysacrificeresolutionbyusingasufficiently



IS&T /

ReturntoContents

aggressiveOLPFtoachievelowcross-talkandartifactfreeimages.

WeintroducetheChrominanceBandwidthRatiomodelthatcapturesboththechrominancebandwidthandthecross-talkbetweenthevarioussignals.Next,weexaminetheeffectoftuningphotositeaspectratio,ahithertoneglecteddesignparameter.WederivepanchromaticCFApatternswithprovablyminimumphoto-sitecountforallvaluesoftheChrominanceBandwidthRatio.

AninterestingoutcomeisaCFAdesignthatcapturesfullchrominancebandwidth,yetusesfewerphoto-sitesthanthevenerablecolor-stripedesign.AnotherinterestingoutcomeisapracticalCFAdesignthatcaptureschrominanceathalftheresolutionofluminanceusingonly4uniquefiltercolors,thatlendsitselftoefficientlineardemosaicking,andyetvastlyoutperformstheBayerCFAwiththesamephotositecount,demosaickedwithstateoftheartnonlinearalgorithms.


A robust color signal processing with wide dynamic range WRGB CMOS image sensorS.Kawada,R.Kuroda,S.Sugawa,TohokuUniv.(Japan)

WehavedevelopedahighlyaccuracyandrobustcolorreproductionbyasimplecalculationwithanewcolorlinearmatrixusingtheformerlydevelopedwidedynamicrangeWRGBLOFICCMOSimagesensor.

Theimagesensorwasfabricatedthrougha0.18um2-poly3-MetalCMOStechnologyandhasa45degreesobliquepixelarray,the4.2umeffectivepixelpitchandtheWpixels.AWpixelwasformedbyreplacingoneofthetwoGpixelsintheBayerRGBcolorfilter.TheWpixelhasahighsensitivitythroughthevisiblelightwaveband.Anemerald-greenandyellow(EGY)signalsaregeneratedfromthedifferencebetweentheWsignalandthesumofRGBsignals.ThisEGYsignalsmainlyincludeemerald-greenandyellowingredients.ThesecolorscouldnotbereproducedaccuratelybytheconventionallinearmatrixbecausetheirwavelengthsareinthevalleyofthespectralsensitivitycharacteristicsofRGBpixels.AnewlinearmatrixbasedontheEGY-RGBsignalwasdeveloped.Usingthislinearmatrix,ahighlyaccuratecolorprocessingwithalargemargintothesensitivityfluctuationandnoisehasbeenachieved.


Adaptive contrast enhancement for underexposed imagesS.Corchs,F.Gasparini,R.Schettini,Univ.degliStudidiMilano-Bicocca(Italy)

Inthepresentarticlewefocusonenhancingthecontrastofimageswithlowilluminationthatpresentlargeunderexposedregions.Fortheseparticularimages,whenapplyingthecontrastenhancementtechniques,wealsointroducenoiseover-enhancementwithinthedarkerregions.Evenifboththecontrastenhancementanddenoisingproblemhavebeenwidelyaddressedwithintheliterature,thesetwoprocessingstepsare,ingeneral,independentlyconsideredintheprocessingpipeline.Therefore,thegoalofthisworkistointegratecontrastenhancementanddenoisealgorithmstoproperenhancetheabovedescribedtypeofimages(forexamplenightimagesorindoorimagesacquiredwithashortexposuretimeand/orhighISOsetting).Afterapplyingthecontrastenhancedmethod,andinordertoselectivelyenhancethedifferentregionsoftheunderexposedimages,weevaluatethesaliencymapoftheimage.Atthispoint,thelocalincreaseofnoiseisestimatedapplyingapropernoisemeasure.Inasubsequentmodule,thedenoiseandfinalcontrastcorrectionaretunedwithrespectthestrengthofthecontrastandconsequentnoiseincreaseandthelocalsalienceaswell.Anedgeenhancementmoduleisalsoincludedattheendofthepipelinetoobtainthefinalenhancedimage.ThemethodhasbeenappliedtoaproperdatabaseofunderexposedimagesandcomparedwithRetinexresults.


Moving refractive optical low pass filter for digital camerasM.Schöberl,Friedrich-Alexander-Univ.Erlangen-Nürnberg(Germany);J.D.Ernst,W.Schnurrer,S.Fößel,Fraunhofer-InstitutfürIntegrierteSchaltungen(Germany);A.Kaup,Friedrich-Alexander-Univ.Erlangen-Nürnberg(Germany)



A JPEG-like algorithm for compression of camera sensors imagesO.BenahmedDaho,XLIM-SIC(France);M.Larabi,Univ.dePoitiers(France)

Toreducecosts,digitalcamerasuseasinglesensorperpixel.ABayerCFAfilter(ColorFilterArray)isgenerallyusedtorecoveronlyonecolorcomponentperpixel.Subsequently,theimagesarefirstinterpolatedwithademosaicingprocesstoreconstructthefullcolorpicturepriortothecompressionstageforstorage.Thisschemeiscalledtheinterpolation-firstscheme.

Inthiswork,weintroducetheproblemofCFAdatacompression.Weproposetoadaptthecompressionschemetothedemosaicingprocess,wherethedecodeddataaredirectlyusedtoreconstructthefullresolutioncolorimage.


Reduced reference image quality assessment based on statistics of edgeM.Zhang,W.Xue,X.Mou,Xi’anJiaotongUniv.(China)

ObjectiveImageQualityAssessment(IQA)modelinvestigationisahottopicinrecenttimes.ThispaperproposedanovelandefficientuniversalReducedReference(RR)imagequalityassessmentmethodbaseduponthestatisticsofedgediscrimination.Firstly,binaryedgemapscreatedfromthemulti-scalewavelettransformmodulusmaximawereusedasthelowlevelfeaturetodiscriminatethedifferencebetweenthereferenceanddistortedimageforIQApurpose.Thenthegradientoperatorwasappliedonthebinarymaptoproducethesocallededgepatternmap.Thehistogramofedgepatternmapwasusedtoverifythepatternoftheedgesofreferenceanddistortedimage,respectively.TheRRfeaturesextractedfromthehistogramwasusedtodiscriminatethedifferenceofedgepatternmaps,andthenformanewRRIQAmodel.ComparingtothetypicalRRmodel(ZhouWang’smethod,2005),only12features(96bits)areneededinsteadof18features(162bits)inZhouWangetal.’smethodwithbetteroverallperformance.


Evaluation of LED flash performance for camera phonesJ.Pincenti,C.Sheldon,B.Richards,G.John,Motorola,Inc.(UnitedStates)

Inthiswork,LEDbasedflashsolutionsareevaluatedforuseinacameraphoneapplication.Theperformanceofagivenflashsolutionismeasuredintermsofcoloraccuracyandsignaltonoiseratio(SNR),bothofwhicharestandardtestmethodsusedinindustry.Earlyinacameraphonedesignbeforecompletedcameramodulesareavailable,coloraccuracyandSNRareevaluatedthroughamodelwhichisbasedonknowledgeofagivenimagesensor’scolorresponseaswellasthepowerspectraldistributionoftheflash.Laterinthedesign,



IS&T /

ReturntoContents

whenworkingcameramodulesbecomeavailable,theevaluationisperformedthroughdirectmeasurement.Thesedirectmeasurementsarealsousedtoverifytheresultsoftheaforementionedmodel.Coloraccuracy,SNR,howtheyarerelated,andthecompromisebetweenthetwoarediscussedaswellastheefficiencyatwhichelectricalpowerisconvertedtolightthatisdetectablebytheimagesensor.Thoughmanyissuesremaintobeinvestigated,measuringcoloraccuracyandSNRprovidesanevaluationmethodthatbuildsondevelopedtechniquesandprovidesapracticalfoundationforflashevaluationasitappliestothecameraphoneindustry.


Characterization of pixel crosstalk and impact of Bayer patterning by quantum efficiency measurementJ.M.Vaillant,STMicroelectronics(France);C.Mornet,STMicroelectronics(France)andIMEP(France);T.Decroux,D.Herault,STMicroelectronics(France);I.Schanen,IMEP(France)

Developmentofsmallpixelsforhighresolutionsensorsimpliesalotofchallenges.Ahighlevelofperformanceshouldbeguaranteedwhereastheoverallsizemustbereducedandsothedegreeoffreedomindesignandprocess.Onekeyparameterofthisconstantimprovementistheknowledgeandthecontrolofthecrosstalkbetweenpixels.Inthispaper,wepresentanadvanceincrosstalkcharacterizationmethodbasedonthedesignofspecificcolorpatternsandthemeasurementofquantumefficiency.Inafirstpart,wedescribethecolorpatternsdesignedtoisolateonepixelortosimulateun-patternedcoloredpixels.Thesepatternshavebeenimplementedontest-chipandcharacterized.Thesecondpartdealswiththecharacterizationsetupforquantumefficiency.Indeed,theuseofspectralmeasurementsallowsustodiscriminatepixelsbasedonthecolorfilterplacedontopofthemandtoprobethecrosstalkasafunctionofthedepthinSilicon,thankstothephotonabsorptionlengthvariationwiththewavelength.Inthelastpart,resultsarepresentedshowingtheimpactofcolorfilterspatterning,i.e.pixelsinaBayerpatternversusun-patternedpixels.Thecrosstalkdirectionsandamplitudesarealsoanalyzedinrelationtopixellayout.



IS&T /

ReturntoContents

Conference 7877: Image Processing: Machine Vision Applications IVTuesday-Thursday25-27January2011PartofProceedingsofSPIEVol.7877ImageProcessing:MachineVisionApplicationsIV


Vehicle detection using new AdaBoost featuresH.Park,J.Kim,C.Lee,J.Jang,LED-ITFusionTechnologyResearchCtr.(Korea,Republicof)

ThispaperpresentanimprovementtotheobjectdetectionmethodofViolaandJones,usingtheexampleofvehicledetection.OurmethodfortrainingdetectorsisAdaBoost(adaptiveboosting)usinganewtypeofvisualfeatureswhichisbasedonmaximallyextremalpoint.Ourfeaturesarefasterandrobusttopartialvisibilityandclutter.Theproposedmethodisexpectedtohavevariousapplications,suchasvehicledetection,Roadenvironmentrecognition,real-timeimageprocessingandsoon.


Vehicle detection using DOM-FAST and support vector machineJ.Kim,LED-ITFusionTechnologyResearchCtr.(Korea,Republicof);C.Lee,J.Jang,YeungnamUniv.(Korea,Republicof);H.Park,LED-ITFusionTechnologyResearchCtr.(Korea,Republicof)

Inthispaper,wepresentanovelvehicledetectionalgorithmusingthedifferenceofmean-featuresfromacceleratedsegmenttest(DOM-FAST)andthesupportvectormachine(SVM).Ingiventestimages,wedetectfirstlytheinterestfeaturepointsbytheDOM-FASTalgorithm.Andthenthelocaldescriptoriscomputedbythecontourlettransform(CT)ateachfeaturepoint.ThelocaldescriptorbasedonthecoefficientsofCTrepresentsthemostsignificantinformationoftheimagepatcharoundthefeaturepointsefficiently.ThenthecombinationsofcoefficientsareappliedasstudysamplestotheSVMclassifiers.Finally,thecoefficientsoftestingimagesareusedtotestclassifiers,andthevehicledetectionresultsareobtained.Theexperimentsareperformedonvariousdatabaseswithourdatabase.


Failures on atmospheric visibility measurements using digital image processingA.Restrepo-Martínez,F.E.Lopez,InstitutoTecnológicoMetropolitano(Colombia)

ThispaperstudiessomeproblemsofvisibilityoftheatmosphericvisibilitymeasurementswhenartificialvisionisusedtoapplyBeer´sLawsandedgesanalysis.

DaytimedigitalimagesofMedellindowntowncitywerecapturedusingaCanonEOSRebelXSI®camera.Regionsofinterestwereextractedforeachimage,thenusingstrategiesofBeer´sLawsthevisibilityweremeasured.EdgesAnalysisSobelandCannyweredonetoo.

Differentvaluesfortheshutterspeed,sensibility(ISOnumber)andf-number,wereusedtoevaluatehowthevisibilityvaluesareaffected.Wefoundthatforthesamescene,changeintheseparametersproducesvariationsinthevisibilityvalues;thisfactisaseriousproblem.Thensomestrategiesabouthowdefinedtheseparametersarenecessarytoimplementforfuturestudies.

Theimageswithbestcontrasthavemanyedgesandhighestvaluesofvisibility.Inthesecaseslowestlevelsofmatterparticulateinthecitycouldbehope.

Inspiteofthelimitationsofmeasurementsobtained,thestrategymonitoringthevisibilitywithartificialvisionhasahighpotentialduetothelowcostandthewidepossibilitiesoftheautomationwhenmachinevisionisused.


Segmentation and visualization of anatomical structures from volumetric medical imagesJ.Park,S.Park,MokpoNationalUniv.(Korea,Republicof);W.Cho,ChonnamNationalUniv.(Korea,Republicof)

Thispaperpresentsamethodthatcanextractandvisualizeanatomicalstructuresfromvolumetricmedicalimagesbyusinga3Dlevelsetsegmentationmethodandahybridvolumerenderingtechnique.First,thesegmentationusingthelevelsetmethodwasconductedthroughasurfaceevolutionframeworkbasedonthegeometricvariationprinciple.Thisapproachaddressesthetopologicalchangesinthedeformablesurfacebyusingthegeometricintegralmeasuresandlevelsettheory.Theseintegralmeasurescontainarobustalignmentterm,anactiveregionterm,andameancurvatureterm.Byusingthelevelsetmethodwithanewhybridspeedfunctionderivedfromthegeometricintegralmeasures,theaccuratedeformablesurfacecanbeextractedfromavolumetricmedicaldataset.Second,weemployedahybridvolumerenderingapproachtovisualizetheextracteddeformablestructures.Ourmethodcombinesindirectanddirectvolumerenderingtechniques.Segmentedobjectswithinthedatasetarerenderedlocallybysurfacerenderingonanobject-by-objectbasis.Globally,alltheresultsofsubsequentobjectrenderingareobtainedbydirectvolumerendering(DVR).Thenthetworenderedresultsarefinallycombinedinamergingstep.Thisisespeciallyusefulwheninnerstructuresshouldbevisualizedtogetherwithsemi-transparentouterparts.Thismergingstepissimilartothefocus-plus-contextapproachknownfrominformationvisualization.Finally,weverifiedtheaccuracyandrobustnessoftheproposedsegmentationmethodforvariousmedicalvolumeimages.Thevolumerenderingresultsofsegmented3Dobjectsshowthatourproposedmethodcanaccuratelyextractandvisualizehumanorgansfromvariousmultimodalitymedicalvolumeimages.


Extraction and fusion of spectral parameters for face recognitionZ.Abdessalem,B.Billiot,P.Gouton,J.Y.Hardeberg,Univ.deBourgogne(France)

I.Introduction:Manymethodshavebeendevelopedinimageprocessingforfacerecognition,especiallyinrecentyearswiththeincreaseofbiometrictechnologies.However,allthesetechniquesaremainlyusedongrayscaleimagesacquiredinthevisiblerangeoftheelectromagneticspectrum.

Theaimsofourstudyaretoimproveexistingtoolsandtodevelopnewmethodsforfacerecognition.Thetechniquesusedtakesadvantagesofthedifferentspectralranges,thevisible,opticalinfraredandthermalinfrared,byeithercombiningthemoranalyzingthemseparatelyinordertoextractthemostappropriateinformationforfacerecognition.

II.Methods:Ourstudyusethreeacquisitiondevices:adigitalSLRcamera,aninfraredcamera(800-2600nm)andathermalcamera(12000nm).

Severalfacerecognitiontechniquesarebasedonalgorithmsoffacialfeatures,inordertocharacterizethem.Firstly,weapplytheSIFTalgorithm[1](Lowe,2004)(ScaleInvariantFeatureTransform)ona


IS&T /

ReturntoContents

databaseofvisible,infraredandthermalimagesoffacesacquiredwithourequipment.Suchalgorithmdeterminesthecharacteristicpointsoftheimagesanddefinesthembycharacteristicvectors.

Tovalidatetheresults,theASIFTalgorithm[2](MorelandYu,2009),aderivativeofSIFTinvarianttoaffinetransformation,isappliedtothesamedatabase.

III.Results:Byanalyzingresults,wenoticethatthenumberofcharacteristicpointsintheinfraredrangeismoreimportant(anaverageof152points/kindofperson(SIFT)and242points/kindofperson(ASIFT)),thisnumberishalfreducedinthevisiblespectrum(anaverageof75points/person(SIFT)and151points/person(ASIFT)).Inthethermalrange,thecharacteristicpointsarelowercomparetothepreviousspectrumrange(anaverageof43points/person(SIFT)and57points/person(ASIFT)).

Theotherresultswillbeobtainbyacomparisonbetween2differentfacesinsamerange.Wecannotethatintheinfra-redspectrumandthermalthenumberofcharacteristicpointsbetweentwodifferentpersonisreduced(anaverageof1point/even(SIFT)and1point/even(ASIFT))ontheotherhandthevisiblespectrumpresentsmorecharacteristicpoints(anaverageof3points/even(SIFT)and11points/even(ASIFT)).

IV.Conclusion:Theseinitialresultsshowwellthattheinfra-redspectrumismostadequatetoensureahigherrateofrecognitionwiththeextractionofcharacteristicpointmethod.Howevertheuseofthethermalbandisnotwelladaptedforthiskindofalgorithm.

Currently,severalresearchgroupsstudytheuseofmethodssuchassegmentationandfacerecognition(Gaborwavelet)arebeingimplemented.Theresultsaimtodeterminewhetherthecontributionofinformationcontainedinimagesacquiredininfraredrangecanimprovetheperformanceofmethodsused.

Thefusionofinformationcontainedineachtypeofimageisalsoplannedtocharacterizethefaceinthemostdiscriminatingpossible.


Monitoring plant growth using high resolution micro-CT imagesV.C.Paquit,S.S.Gleason,U.C.Kalluri,OakRidgeNationalLab.(UnitedStates)

AmultidisciplinaryresearchconductedattheOakRidgeNationalLaboratoryaimsatunderstandingthemolecularcontrolsofpartitioning,transportandfateofcarbonfixedbyphotosynthesisinplantsanditscorrelationwithothermeasuredplantsystemproperties.Ultimately,weintendtodevelopamodelingframeworktoassess,correlateandpredictastowhichspatiotemporalchangesinsystemdynamicsarekeytopredictingemergentpropertiesofsystem.

Withinthisresearch,thispaperrelatestothequantitativemorphologicalmeasurementsofthemainstructuresformingaplant(stem,roots,andleaves),theirinternalsub-structures,andchangesoccurringovertime.


Automating the estimation of coating thickness measurements in the ball crater techniqueJ.Huang,TheCityUniv.(UnitedKingdom)andTeerCoatingsLtd.BerryHillIndustrialEstate(UnitedKingdom);P.Liatsis,TheCityUniv.(UnitedKingdom);K.Cooke,D.Teer,TeerCoatingsLtd.(UnitedKingdom)

Noabstractavailable

7877-01, Session 1

Lipschitz exponents based signal restorationB.Jalil,O.Beya,E.Fauvet,O.Laligant,Univ.deBourgogne(France)

Inthiswork,weattempttoproposeasignalrestorationtechniquefromthenoisecorruptedsignal.Thescopeoftheworkhasalsobeenexpandedtotheidentificationofsingularitiesinsidethesignalaswell.Asthesesingularitiesinducestrongcoefficients,thereforetheadjacentsmallsingularitycannotbedetectedduringthedenoisingprocess.Themainaimoftheproposemethodistoaddresssuchtypesofproblemwithoutintroducingfurtherspuriouspeaksoroscillations.Inordertoovercomethisproblemweareproposingamultisegmentationcriteriontoseparatethenoiseelementsfromthesesignificantsingularpoints.Atthesametime,theproblemoftheinfluenceofstrongsingularitiesonadjacentweakersingularpointscanbesolvedbyusingmultisegmentationapproach.Inmultisegmentationapproach,weareproposingarecursivealgorithmwhichidentifiestheweakestrequiresingularpointinthesignal.

7877-02, Session 1

Real-time wavelet-based inline banknote-in-bundle counting for cut-and-bundle machinesV.Lohweg,Ostwestfalen-LippeUniv.ofAppliedSciences(Germany);J.Schaede,T.Türke,KBA-GIORIS.A.(Switzerland);E.Gillich,Ostwestfalen-LippeUniv.ofAppliedSciences(Germany);D.Petker,OWITAGmbH(Germany);H.Willeke,KBA-Bielefeld(Germany)

Automaticbanknotesheetcut-and-bundlemachinesarewidelyusedwithinthescopeofbanknoteproduction.Besidethecuttingandbundlingfeatureswhicharetodayamaturetechnology,image-processing-basedqualityinspectionforthistypeofmachinebecomesattractive.Wepresentinthisworkanewreal-timetouchlesscountingandcuttingbladequalityinsurancesystem,basedonaColor-CMOS-Cameraandadual-coreComputer,forcut-and-bundleapplicationsinbanknoteproduction.ThesystemwhichappliesWavelet-basedmultiscalefilteringisabletocountbanknotesinsidea100-bundlewithin200-300msdependingonthewindowsize.

7877-03, Session 1

A robust segmentation and tracking method for characterizing GNSS signals reception environmentA.Cohen,C.Meurie,Y.Ruichek,Univ.deTechnologiedeBelfort-Montbéliard(France);J.Marais,Univ.LilleNorddeFrance(France)

ThispaperisfocusedonthecharacterizationofGNSSsignalsreceptionstatebynewimageprocessingtechniques.Themainaimoftheapplicationconsiststodetectsatellitessituatedinskyregion(withdirectreceptionstate)infish-eyeimages.Thisproposedstrategyiscomposedbyfoursteps:1/anewadaptiveandautomaticsegmentationmethodcombiningcolorandtextureinformation.2/aclassificationstepbythek-meansalgorithmfordeterminingtheskyandnon-skyregions.3/acalibrationandrectificationstage;4/aregion-trackingmethodbasedonablock-matchingestimationthatreducestheexecutiontimeoftheapplicationinordertoapproachthereal-timeconstraints.Thetrackingresultsarecomparedtotheresultsoftheclassificationmethodonalargerealdatabase.Theevaluationshowsthattheproposedmethodhasaverylowerror,reachingagoodclassificationrateof90%(vs94.2%obtainedinpreviousworks)anddecreasingtheexecutiontimeoftheapplicationbytentimes.

Conference 7877: Image Processing: Machine Vision Applications IV


IS&T /

ReturntoContents

7877-04, Session 2

Accurate, fast, and robust centre localisation for images of semiconductor componentsF.Timm,Univ.zuLübeck(Germany)andPatternRecognitionCo.GmbH(Germany);E.Barth,Univ.zuLübeck(Germany)

Weproposetwonovelapproachesfortheprecisecentrelocalisationofcircularobjects,e.g.p-electrodesoflightemittingdiodes.

Thefirstapproachisbasedonimagegradientsforwhichweprovideanobjectivefunctionthatissolelybasedondotproductsandthusbeingmaximisedbygradientascend.

Thesecondapproachisinspiredbytheconceptofisophotesforwhichwederiveanobjectivefunctionthatisbasedonthedefinitionofradialsymmetry.

WecomparetheaccuracyandtheruntimetotheHoughtransformforartificialimageswithseveralkindsofnoiseandimagesofsemiconductorcomponentswithocclusionsandstrongimagenoise.

Theradialsymmetryapproachprovetobethemostrobustone,especiallyforlowcontrastimageswithstrongnoisewithameanerrorrateof0.86pixelforartificialimagesand0.98pixelforrealworldimages.Thegradientapproachyieldsmoreaccurateresultsforalmostallimages(meanerror4pixel)comparedtotheHoughtransform(meanerror8pixel).

Concerningtheruntime,thegradient-basedapproachsignificantlyoutperformstheotherapproacheswithareductionof79%comparedtotheHoughtransform(100%),whereastheradialsymmetryapproachyieldsareductionofstill12%.

7877-09, Session 3

Vision based forest smoke detection using analyzing of temporal patterns of smoke and their probability modelsS.Ham,B.Ko,J.Nam,KeimyungUniv.(Korea,Republicof)

Ingeneral,sincesmokeappearsbeforeflames,smokedetectionisparticularlyimportantforearlyfiredetectionsystems.Todetectfire-smokeusingvideocameraisadifficultworkbecausemaincharacteristicsofasmokeareuncertain,vague,constantpatternsofshapeandcolor.Thus,thispaperproposesanewfire-smokedetectionmethod,especiallyforestsmokeusinganalyzingoftemporalpatternsofsmokeandFuzzyFiniteAutomata(FFA).Toconsiderthesmokecharacteristicsovertime,thetemporalpatternsofintensityentropy,waveletenergyandmotionorientationhavebeenusedforgenerating,multivariateGaussianprobabilitydensityfunctions(PDFs)areappliedFuzzyFiniteAutomata(FFA)forsmokeverification.TheproposedFFAconsistofasetoffuzzystates(VH,H,L,VL),andatransitionmappingthatdescribeswhateventcanoccuratwhichstateandresultingnewstate.Forsmokeverification,FFAismostappropriatemethodincasevariablesaretime-dependentanduncertain.Theproposedalgorithmissuccessfullyappliedtovariousfire-smokevideosandshowsabetterdetectionperformance.

7877-10, Session 3

Estimation of fire volume by stereovisionT.Molinier,L.Rossi,A.Pieri,Univ.diCorsicaPasqualePaoli(France);M.A.Akhloufi,Ctr.ofRoboticsandVision(Canada);Y.Tison,Univ.diCorsicaPasqualePaoli(France)

Thispaperpresentstheestimationoffirefrontvolumeinthecontextoflaboratoryexperimentsoffirespreadingoninclinabletable.Themethodisbasedontheuseoftwoprecalibratedsynchronizedstereovisionsystemspositionedrespectivelyinabackpositionandinafrontpositionofthefirepropagationdirection.Thetwovisionsystemsareusedtoobtaincomplementary3Dfirepointsthatareexpressedin

asinglereferencesystemandthatwillbeusedtoobtainaglobalformofthefire.Todeterminethecoordinatetransformationsofeachsystemwithrespecttoasinglereferencesystemacalibrationprocedureiscarriedoutwithacubepattern.Thesametechniqueisusedtoperforma3dmeshwiththeseveralsetsof3dpointsandtoconnectthosesetsbetweenthem.3Dshapereconstructionandestimationofthevolumeofthefirearefinallyestimated.

7877-11, Session 3

Pavement distress detection and severity analysisE.Salari,G.Bao,TheUniv.ofToledo(UnitedStates)

Summary:

Automaticrecognitionofroaddistressesisachallengingresearchareasinceitreduceseconomiclosesbeforecracksandpotholesbecometoosevere.However,duetofactorssuchascomplextexture,unevenillumination,andnon-uniformbackground,pavementdistressdetectionturnsouttobeaverydifficultproblemratherthanasimpleedgedetectionprocess.Inthispaper,anovelautomaticpavementcrackdetectionapproachbasedonadvancedimageprocessingtechniquesisproposed.Theproposedmethodcanprovidereal-timepavementdistressdetectionanditsevaluationbasedonimagescapturedfromacamerainstalledatthefrontofatestingvehicle.Theentiredetectionprocessconsistsoftwomainphases.Thefirstphaseispavementsurfaceextractionandthesecondphaseinvolvespavementdistressdetectionandevaluation.Inpavementsurfaceextraction,anovelcolorsegmentationmethodbasedonaneuralnetworkisappliedtoseparatetheroadsurfacefrombackgroundfeatures,suchashouses,bushes,grassandtrees.Afterroadsegmentationisaccomplished,apavementdistressdetectionalgorithmbasedonprobabilisticrelaxationisexecutedtoobtaintheskeletonofthecracks.Thenanewpavementdistressclassificationalgorithmbasedonneuralnetworksisintroducedtoassignthecracksintodifferenttypesandseveritygroupsaccordingtothegeometricalandtopologicalparametersobtainedinthecrackdetectionstep.Theproposedroadinspectionsystemcanpreciselydetectvariousdistressfeaturesandestimateitsseverityfromaregularoutdoorsceneimage.Simulationresultsshowthemethodiseffectiveandrobustintheextractionofcracksinavarietyofpavementimages.

7877-28, Session 3

Line segment based structure and motion from two viewsS.Mosaddegh,A.Fazlollahi,D.Fofi,Univ.deBourgogne(France);P.Vasseur,Univ.dePicardieJulesVerne(France)

Wepresentanefficientmeasureofoverlapbetweentwoco-linearsegmentswhichconsiderablydecreasestheoverallcomputationaltimeofaSegment-basedmotionestimationandreconstructionalgorithmalreadyexistinliterature.Wealsodiscussthespecialcaseswheresparsesamplingofthemotionspaceforinitializationofthealgorithmdoesnotresultinagoodsolutionandsuggesttousedensesamplinginsteadtoovercometheproblem.Finally,wedemonstrateourworkontworealdatasets.

7877-14, Session 4

Multi-frame face recognition with a discriminant analysis and decision level fusionS.Yeom,H.Lee,DaeguUniv.(Korea,Republicof)

Currently,CCTVandDVRsystemsarewidelyinstalledforsecurityandsurveillance.However,thevideostreamtransferredfromthewidelyusedmonitoringsystemhascomparablylowresolutionandpoorqualitysinceitusuallyoperatesinharshconditions.Thesystemcovers



IS&T /

ReturntoContents

certainregionofinterestfromadistance.Theimageinformationisacquiredwithouttheconsiderationofdesirableimagingconditionssuchasilluminationandfocusing.Moreover,dramaticchangeofrealwordsoftengeneratesblurringeffectsontheimage.Therefore,facerecognitionwiththeCCTVsystemisadifficulttask.Althoughtherehavebeenresearchtoincreasethelowresolutionusingmulti-framesforfacerecognition,itusuallyrequireshighcomputationalloadandcapacity.

Inthispaper,afacerecognitionmethodbasedonphoton-countinglineardiscriminantanalysisanddecisionlevelfusionisdiscussed.Thephoton-countinglineardiscriminantanalysisrealizestheFisher’scriterionwiththePoissonprobabilitymodelwithoutsufferingfromthesingularityproblemofFisherlineardiscriminantanalysis.Avideosurveillancesystemprovidesmulti-frameimagesoflowquality.Therefore,wecanutilizeasequenceofimagesbyperformingdecisionlevelfusiontoimprovethefacerecognitionperformance.Thisfacerecognitiontechniqueisshowntoberobusttolowresolutionandnoiseenvironments.Intheexperiments,simulateddataandlowresolutionfacialimagesaretestedtoverifytheperformanceoftheproposedmethod.Theaccuracyrateandthefalsealarmrateareobtainedtocomparetheresultswithconventionaltechniques.Theproposedsystemshowspotentialsofusingphoton-countinglineardiscriminantanalysisanddecisionlevelfusioncombinedforwidelyavailablelow-endsurveillancesystems.

7877-15, Session 4

Pose-robust face recognition using shape-adapted texture featuresT.Gernoth,A.Goossen,R.Grigat,TechnischeUniv.Hamburg-Harburg(Germany)

Unconstrainedenvironmentswithvariableambientilluminationandchangesofheadposearestillchallengingformanyfacerecognitionsystems.Torecognizeapersonindependentofpose,wefirstfitanactiveappearancemodeltoagivenfacialimage.Shapeinformationisusedtotransformthefaceintoapose-normalizedrepresentation.Wedecomposethetransformedfaceintolocalregionsandextracttexturefeaturesfromthesenotnecessarilyrectangularregionsusingashape-adapteddiscretecosinetransform.Weshowthatthesefeaturescontainsufficientdiscriminativeinformationtorecognizepersonsacrosschangesinpose.Furthermore,ourexperimentalresultsshowasignificantimprovementinfacerecognitionperformanceonfaceswithposevariationswhencomparedwithablock-DCTbasedfeatureextractiontechniqueinanaccesscontrolscenario.

7877-16, Session 5

A novel framework for white blood cell segmentation based on stepwise rules and morphological featuresJ.Gim,J.Park,J.Lee,B.Ko,J.Nam,KeimyungUniv.(Korea,Republicof)

Inautomaticcellanalysisusingimageprocessing,WBCsegmentationisthemostimportantprocedure,wheretheultimategoalistoextractalltheWBCsfromacomplicatedbackgroundandthenonlysegmenttheWBCsintomorphologicalcomponents,suchasthenucleusandcytoplasm.

ThisstudyproposesanewWBCsegmentationmethodusingregionmergingschemeandGVF(GradientVectorFlow)snake.WBCsegmentationconsistsoftwoschemes;nucleisegmentationandcytoplasmsegmentation.Fornucleisegmentation,wecreateaprobabilitymapusingprobabilitydensityfunctionestimatedfromsamplesofWBC’snucleiandcropthesub-imagestoincludenucleusbyusingthefactthatnucleihavesalientcoloragainstbackgroundandredbloodcells.Then,mean-shiftclusteringisperformedforregionsegmentationandstepwiseruleisappliedtomergeparticleclusterstonucleus.Forcytoplasmsegmentation,wecreatesaturationmapwithinthesub-imagesbasedonthefactthatcytoplasmhashighersaturation

thanredbloodcell.Then,theGVFsnakeisappliedtogradientobtainedfromthesaturationmap.Finally,GVFforcestoguidesnakestodeformtocytoplasmboundaryedges.

7877-17, Session 5

Contour extraction and amendments of left ventricle short axis from heart ultrasonic image sequencesX.Yang,ChongqingUniv.ofPostsandTelecommunications(China)

Inthediagnosisforheartdisease,peopleareusuallyveryconcernedaboutthemovementsituationofventricle,atriaandvascularcavity.Thediastolicandsystolicmovementofheartandvascularcavityrepresentseachcomponent’sperformanceofheartbody,anditoffersanimportantreferencefordiseasediagnosis.Inthesectionofultrasonicimages,theconformationofheartstructure’scontourpreciselydescribestheirmovementfeatures.However,theextractionofthesecontourlinesisverydifficult.

Theventricularmovementisalsocalledcontourmovingofventricle.Thisarticlepresentsanautomaticdetectionmethodaccordingtothecharacteristicsofultrasonicimages.Themethodcanalsouseartificialinterventionalgorithmtogetimagecontourifnecessary.

Atfirst,forthefirstframefromtheselectedimages,thecharacteristicpointsOandPontheoutlinearemanuallydesignated.Andthen,thepointPistreatedasastartingposition,andthecontourissearched.Theradialgradient-basedsearchingmethodisusedtodetecttheborderline,afterfindingthediscretepointsontheoutline,then,thesmoothcontourcurvesareobtainedbythemethodofcurvefitting.

Togettheoutlineofthefollowingframe,oneneedstoconsidercontinuityofheart’smovement.Theoutline’sdifferencebetweenadjacentframeswillnotbetoogreat,sothatthefollowingframe’soutlineafterthesecondoneisextractedonthebaseoftheformerone.

Inadditiontothecardiacdiastolicandsystolicmovement,italsoincludesrotationalandtranslationalmotioncausedbybloodshockandmuscletraction.

Inrecentyears,clinicianspaymoreattentiontodetectventriculardiastolicfunctionofpatientssufferedwithcoronaryheartdisease.Inordertoprovideamoreintuitivereferencefordoctors,oneshouldamendheart’stranslation.

7877-18, Session 6

Non-parametric texture defect detection using Weibull featuresF.Timm,Univ.zuLübeck(Germany)andPatternRecognitionCo.GmbH(Germany);E.Barth,Univ.zuLübeck(Germany)

Weproposeanovel,non-parametric,localapproachfordefectdetectionintextureimageswithonlytwofeatures.WecomputethetwoparametersofaWeibullfitofthegradientmagnitudeswithinalocalwindow.Then,weperformasimplenoveltydetectionalgorithmtodetectarbitrarydeviationsofthenormaltexture.

Therefore,wecomputetheEuclideandistanceoftheparametersofthelocalwindowstoareferencepointthatislearntduringtraining.

Ourapproachisindependentofthepresenttypeoftextureandalsoindependentofthedefecttype.

ForperformanceevaluationweusethehighlychallengingdatasetproposedattheDAGM2007withdifferentclassesoftexturesanddifferenttypesofdefects.

TheWeibullparameterscandetectlocaldeviationswithindifferenttypesoftextureswithanerrorrateoflessthan5%usingdefect-freeimagesfortraining.ComparedtoexistingapproachessuchasGaborfiltersorgreylevelstatistics,ournovelapproachisnotonlypowerfulbutalsoveryefficient.



IS&T /

ReturntoContents

7877-19, Session 6

Machine vision applied to industrial quality control of artificial teeth: lighting methodology and image enhancementJ.W.Branch,Univ.NacionaldeColombiaSedeMedellín(Colombia);A.Restrepo-Martínez,InstitutoTecnológicoMetropolitano(Colombia);E.Mesa-Múnera,J.F.Ramírez-Salazar,P.Atencio,E.Franco,Univ.NacionaldeColombiaSedeMedellín(Colombia);O.Franco,R.Carmona,H.Rodriguez,NewSteticS.A.(Colombia)

Thispaperconsidersthequalitycontrolofpolymericresinartificialteeth,whicharemanufacturedfromtwodifferentlayers.Itwillbetakenintoaccountonlydarkparticlesdetectioninteethsurfaces,whicharecalledparticledefects.Aspeciallightingsystemwasdesignedwithfeaturesthatconsiderthesizeandshapeofteeth.Forthatpurpose,someLEDarrayswereassembledinvariousconfigurations:Directdiffusedlightandbacklight,whichallowustodefinetheRegionofInterest(ROI),toavoidbrightnessonthesurfaceandtogenerateahomogeneousilluminationoftheteeth.However,inordertoavoidsaturationofthesensor,itwasnecessarytoimplementanintensitycontrolsystemaccordingtothesizeofthetooth.Inthestageofimageacquisition,differentamateurcameraswereevaluatedduetoitslowcost,butalsotheperformanceoftheprocessingtoolsofaSMARTcamerafromNationalInstrumentswascompared.Additionallytolighting,anadaptivecontrastenhancementstagewasperformedinordertoimprovetheconditionofsomeimagesinwhichlightingmethodsuseddonotimprovethescene.Asfutureworkitwillbenecessarytointegratesomeoftheimplementedtechniques.

7877-25, Session 6

Quantitative measurement by artificial vision of small bubbles in flowing mercuryV.C.Paquit,M.W.Wendel,D.K.Felde,OakRidgeNationalLab.(UnitedStates)

At,theSpallationNeutronSource(SNS),anaccelerator-basedneutronsourcelocatedattheOakRidgeNationalLaboratory(Tennessee,USA),theproductionofneutronsisobtainedbyacceleratingprotonsagainstamercurytarget.Thisselfcoolingtarget,however,suffersrapidheatdepositionbythebeampulseleadingtolargepressurechangesandthustocavitationsthatmaybedamagingtothecontainer.Inordertolocallycompensateforpressureincreases,asmall-bubblepopulationisaddedtothemercuryflowusinggasbubblers.Thegeometryofthebubblersbeingunknown,wearetestingseveralbubblers’configurationsandareusingmachinevisiontechniquestocharacterizetheirefficiencybyquantitativemeasurementofthecreatedbubblepopulation.

Inthispaperwethoroughlydetailtheexperimentalsetupandtheimageprocessingtechniquesusedtoquantitativelyassessthebubblepopulation.Tosupportthisapproachwearecomparingourpreliminaryresultsfordifferentbubblersandoperatingmodes,comparetheefficiencyofourmethodtofluiddynamicstheory,anddiscusspotentialimprovements.

7877-21, Session 7

Coded source neutron imagingP.R.Bingham,OakRidgeNationalLab.(UnitedStates)

Whileneutronradiographyingeneralisnotanewtechnologywithfirstimagestakeninthe1930s,recentinnovationsinneutronimagingtechniqueshaveincreasedtheapplicabilityofneutronradiographytocomplexscienceandengineeringproblemssuchasstressmeasurements[1],magneticfieldmeasurements[2],andfuelcellresearchanddevelopment[3]tonameafew.Theseextensionshaveimprovedtheapplicabilityofneutronradiography,butsystem

resolutionisstilllimitedbythedetectionsystemstoaround50umwithanintegratingdetector.Conventionalneutronradiographyisalsolimitedtodaybythecountrateatthedetectorrestrictingtimeresolvedmeasurements.Sinceneutronbeamsaremarginally-diffractingorrefractingatmicroscopicandmacroscopicscales,neutronopticsthatcanmagnifyorde-magnify(i.e.,focus)imagedobjectsaredifficulttocreateandrequireexpensivedesignsandmaterials[4].Currentimprovementsindetectortechnologyarepushingresolutiondowntothe10-15ummark[5]withhigh-costdetectors,butthereisacleardemandforresolutionsof1umorlessthatwoulddramaticallyextendtheapplicationofneutronimagingtomicro-scalestructuressuchasmicrochannelheatexchangers,fuelcellcomponents,biologicalmicroscopyforpharmacology,drugdeliveryresearch,fuelinjectorsprayersforefficientdieselenginetechnology,andbiofuelsresearch.

Onepossibleroutetoachieveresolutionsonthisscaleisthenovelapplicationofcodedapertureimagingtoneutronradiography.Codedapertureimagingisatechniqueforimagingnon-diffractingandrefractingsourcesthathasbeenimplementedbytheAstronomycommunitysincethe1960s.Pinholecameras,suchasthosecommonlyemployedinneutronradiography,havelimitedfluxduetothesmallsizeofthepinhole.Acodedaperturesystemexhibitstheresolutionofapinhole-stylecamerabutwithcollectionefficiencyproportionaltothenumberofpinholesintheaperture.Theeffectofacodedaperturecanbedeconvolvedfromthemeasuredimageallowinghighcollectionefficiencyandhighspatialresolution[6,7]inamagnifyingimagingconfiguration.

Inthispresentation,wewillinvestigatethetheoreticallimitationsforcodedsourceneutronradiography,presentacodedsourcedesignimplementedataprototypeneutronimaginginstrumentatOakRidgeNationalLaboratory(ORNL),discussimagereconstructionmethodsforthistypeofimager,andshowinitialresolutionmeasurementresultsfromtheORNLinstrument.

1.Penumadu,D.,“MaterialScienceandEngineeringwithNeutronImaging,”NeutronImagingandApplications,Springer,NewYork,2009.

2.Manke,I.,Kardjilov,N.,Hilger,A.,Stroble,M.,Dawson,M.,andBanhart,J.,“PolarizedneutronimagingattheCONRADinstrumentatHelmholtzCentreBerlin,”NuclearInstrumentsandMethodsinPhysicsResearchA,605(2009)26-29.

3.Arif,M.,Hussey,D.S.,andJacobson,D.L.,“NeutronImagingfortheHydrogenEconomy,”NeutronImagingandApplications,Springer,NewYork,2009.

4.Beguiristain,H.R.,Anderson,I.S,etal.“Asimpleneutronmicroscopeusingacompoundrefractivelens,”AppliedPhysicsLetters81(22)(2002)4290-4292.

5.Tremsin,A.S.,Vallerga,J.V.,McPhate,J.B.,Siegmund,O.H.W.,Feller,W.B.,Crow,L.,andCooper,R.G.,“Onthepossibilitytoimagethermalandcoldneutronswithsub-15　mspatialresolution,”NuclearInstrumentsandMethodsinPhysicsResearchA,592(2008)374-384.

6.Damato,A.L.,Binns,P.,andLanza,R.C.,“ProgressReportonaNeutronCodedSourcePhaseContrastImagingSystemattheMITReactor,”IEEENuclearScienceSymposiumConferenceRecord,(2007)1725-28.

7.Coakley,K.J.andHussey,D.S.,“Feasibilityofsingle-viewcodedsourceneutrontransmissiontomography,”Meas.Sci.Technol.18(2007)3391-3398.

7877-22, Session 7

Toward autonomic computing in machine vision applications: techniques and strategies for in-line 3D reconstruction in harsh industrial environmentsJ.Molleda,R.Usamentiaga,D.F.García,F.G.Bulnes,Univ.deOviedo(Spain)

Nowadaysmachinevisionapplicationsrequireskilleduserstoconfigure,tune,andmaintain.Becausesuchusersarerarelyfound,thisusuallymeansthatrobustnessandreliabilityofapplicationsissignificantlyaffected.Autonomiccomputingoffersasetofprincipleswhichcanbeusedtopartiallyovercometheseproblems,suchasself-



IS&T /

ReturntoContents

monitoring,self-regulation,andself-repair.

Systemswhichincludeself-monitoringpropertyobserveitsinternalstate,andextractfeaturesaboutit.Systemswithself-regulationarecapableofregulatingitsinternalparameterstoprovidethebestqualityofservicedependingontheoperationalconditionsandenvironment.Finally,self-repairingsystemsareabletodetectanomalousworkingbehaviorandtoprovidestrategiestodealwithsuchconditions.

Machinevisionapplicationsaretheperfectfieldtoapplytheautonomiccomputingtechniques.Thistypeofapplicationshasstrongconstraintsonreliabilityandrobustness,especiallywhenworkinginindustrialenvironments,andmustprovideaccurateresultsevenunderchangingconditionssuchasvariableluminance,ornoise.

Inordertoexploittheautonomicapproachofamachinevisionapplication,webelievethearchitectureofthesystemmustbedesignedusingasetororthogonalmodules.

Inthispaper,wedescribehowautonomiccomputingtechniquescanbeappliedtomachinevisionsystems,usingasexamplearealapplication:3Dreconstructioninharshindustrialenvironmentsbasedonlaserrangefinding.

Theapplicationisdesignedbasedonmoduleswithdifferentresponsibilitiesatthreelayers:imageprocessing(lowlevel),monitoring(middlelevel)andsupervision(highlevel).

Highlevelmodulessupervisetheexecutionoflowerlevelmodulesand,basedontheinformationgatheredbymiddlelevelmodules,regulatelowerlevelmodulesinordertooptimizetheglobalqualityofservice,andtunethemoduleparametersbasedonoperationalconditionsandtheenvironment.

Regulationactionsinvolvemodifyingtheexposuretimeoftheimagesensorduetochangesinlightingconditions,ormodifyingthelaserextractionmethodtomeetcontinuousdeadlinesduetochangesinspeedmanufacturing.

7877-23, Session 7

Evaluating distances using a coded lens camera and blur metricsL.Angot,C.Chang,Y.Chen,IndustrialTechnologyResearchInstitute(Taiwan)

Amethodandasystemareproposedtomeasuredistancesfromareferencepointtoatargetandatthesametimeobtaininganimageofthetarget.Thesystemisbasedonawavefrontcodedlensandanimageprocessingunit.Themethodconsistsincapturingaseriesofimagesofthetarget,computingablurmetricoftheimagesinordertoobtaincalibrationdata,andobtainingthedistancetothetargetforanypositionofthelater.Themethodandsystemareeasytomanufactureandprovideanalternativetootherdistancemeasuringmethodsanddevices,whilealsoproducinganimageofthescene.Thetargetisaprintedimageofpseudorandomblackandwhiteelementswhichcanbestickedtoobjectsfordistanceevaluation.Furtherinvestigationsareunderwaytoevaluatedistancetoparticularobjects.Thedistanceprecisionislessthan3cmovera16cmto120cmrange.Otherrangescanbeselectedbychangingthefocaldistanceofthelens.

7877-24, Session 7

Automatic firearm class identification from cartridge casesS.Kamalakannan,TexasTechUniv.(UnitedStates);C.J.Mann,P.R.Bingham,T.P.Karnowski,S.S.Gleason,OakRidgeNationalLab.(UnitedStates);H.Sari-Sarraf,TexasTechUniv.(UnitedStates)

Wepresentamachinevisionsystemforautomaticidentificationoftheclassoffirearmsbyextractingandanalyzingtwosignificantpropertiesfromspentcartridgecases,namelytheFiringPinImpression(FPI)andtheFiringPinApertureOutline(FPAO).Withintheframeworkoftheproposedmachinevisionsystem,awhitelightinterferometerisemployedtoimagetheheadofthespentcartridgecases.Asafirst

stepofthealgorithmicprocedure,thePrimerSurfaceArea(PSA)isdetectedusingacircularHoughtransform.OncethePSAisdetected,acustomizedstatisticalregion-basedparametricactivecontourmodelisinitializedaroundthecentreofthePSAandevolvedtosegmenttheFPI.Subsequently,thescaledversionofthesegmentedFPIisusedtoinitializeacustomizedMumford-ShahbasedlevelsetmodelinordertosegmenttheFPAO.OncetheshapesofFPIandFPAOareextracted,ashape-basedlevelsetmethodisusedinordertocomparetheseextractedshapestoanannotateddatasetofFPIsandFPAOsfromvariedfirearmtypes.Atotalof74cartridgecaseimagesnon-uniformlydistributedoverfivedifferentfirearmsareprocessedusingtheaforementionedschemeandthepromisingnatureoftheresults(95%classificationaccuracy)demonstratetheefficacyoftheproposedapproach.

7877-34, Session 7

Generation of biologically motivated artificial retina tessellations log (z) and log (z+a) and Point based matching performance evaluation on backprojected response (V1) on retina domainI.Ram,P.Siebert,Univ.ofGlasgow(UnitedKingdom)

WepresenttheresultsofaninvestigationthatcomparesmatchinglocalSIFT-likeimagefeatures;extractedfromasoftware-basedretinamodel,tomatchingstandardSIFTimagefeatures.Ourretinas,conformalandnon-conformalissampledbyreceptivefields(RF)whichareorganisedatahighdensityinthecentralfovealregionoftheretinaandatasparseresolutioninthesurroundingperipherysimilartothatfoundinbiologicalvision.WehavealsoshownthepointbasedandvariableGaussiankernelbasedoverlappingsamplingresponsequality.Multi-resolution,space-variantvisualinformationisextractedonascale-spacecontinuumandSIFT-likeinterestpointdescriptorsareextractedthatrepresentthevisualappearanceoflocalregions.Thispaperalsodescribesthedesign,implementationandinitialevaluationofspacevariantartificiallog(z)andlog(z+alpha)retinatessellationscomprisingcircularoverlappingRFmodel.WecomparethematchingperformanceofthebackprojectedresponseonstandardSIFTbyplottingreceiveroperatingcharacter(ROC)curve.WhiletheprimaryobjectiveofretinaSIFTistoreducefeaturedatarateswhilefocusingattentioninthecontextofvisualsearch,ourpreliminarymatchingresultsindicatethatstandardSIFTinfactoutperformedonlog(z+alpha)responseby5%and4%atfalsealarmratessetto10%and20%respectively.



IS&T /

ReturntoContents

Conference 7878: Intelligent Robots and Computer Vision XXVIII: Algorithms and TechniquesMonday-Tuesday24-25January2011PartofProceedingsofSPIEVol.7878IntelligentRobotsandComputerVisionXXVIII:AlgorithmsandTechniques

7878-01, Session 1

Software framework for nano and micro scale measurement applicationsJ.Röning,V.Tuhkanen,R.Sipola,T.J.Vallius,Univ.ofOulu(Finland)

Developmentofnewinstrumentsandmeasurementmethodshasadvancedresearchinfieldofnanotechnology.Developmentofmeasurementsystemsusedinresearchrequiressupportfromreconfigurablesoftware.

Applicationframeworkscanbeusedtodevelopdomain-specificapplicationskeletons.Newapplicationsarespecializedfromframeworkbyfillingitsextensionpoints.

Thispaperpresentsapplicationframeworkfornanoandmicroscaleapplications.Frameworkconsistsofimplementationofroboticcontrolarchitectureandcomponentsthatimplementfeaturesavailableinmeasurementapplications.Toeasethedevelopmentofuserinterfacesformeasurementsystems,frameworkalsocontainsready-to-useuserinterfacecomponents.

Thegoaloftheframeworkwastoeasethedevelopmentofnewapplicationsformeasurementsystems.Featuresofimplementedframeworkwereexaminedthroughtwotestcases.Benefitsgainedbyusingtheframeworkwereanalyzedbydeterminingworkneededtospecializenewapplicationsfromtheframework.Alsodegreeofreusabilityofspecializedapplicationswasexamined.

Theworkshowsthatdevelopedframeworkcanbeusedtoimplementsoftwareformeasurementsystemsandthatmajorpartofthesoftwarecanbeimplementedbyusingreusablecomponentsoftheframework.Whendevelopingnewsoftware,developeronlyneedstodevelopcomponentsrelatedtohardwareusedandperformingmeasurementtask.Usingtheframeworkdevelopingnewsoftwaretakeslesstime.Theframeworkalsounifiesstructureofdevelopedsoftware.

7878-02, Session 1

A traffic situation analysis systemO.Sidla,SLREngineeringOG(Austria);M.Ulm,AustrianInstituteofTechnology(Austria);M.Rosner,SLREngineeringOG(Austria);N.Braendle,AustrianInstituteofTechnology(Austria)

Theobservationandmonitoringoftrafficwithsmartvisionssystemsforthepurposeofimprovingtrafficsafetyhasabigpotential.Forexampleembeddedvisionsystemsbuiltintovehiclescanbeusedasearlywarningsystems,orstationarycamerasystemscanmodifytheswitchingfrequencyofsignalsatcrossingsorwarnvehiclesofpedestriantrafficatintersections.

Theautomaticanalysisoftrafficpatternsisstillinitsinfancy-thecomplexityofvehiclemotionandpedestrianflowinacomplexenvironmentistoocomplextobefullyunderstoodbyavisionsystem.

Wepresentstepstowardssuchatrafficmonitoringasystemwhichisdesignedtodetectpotentiallydangeroustrafficsituations,especiallyincidentsinwhichtheinteractionofpedestriansandvehiclesmightleadtodangerousorevencriticalencounters.

Theproposedsystemconsistsofaclusterof3smartcameraswhicharebasedonverycompactPChardwarerunningaLinuxoperatingsystem.Twocamerasrunvehicledetectionsoftwareincludinglicenseplatedetectionandrecognition,onecamerarunsacomplexpedestriandetectionandtrackingmodule:

Cameras1,2:Real-time(25Hz)licenseplatedetectionwithfastsegmentationandcharacterrecognitionbasedontheHOGprinciple.

Wedescribetheoutlineofourveryfastlicenseplatedetectorincludingstatisticalresultsbasedonalargedatasetofcharactersandtesting

images.

Camera3:Pedestriantrackingwithatracking-by-detectionapproachbasedonacascadedHOGdetector.Trainingofwellsuitedfeaturesetsiscrucialinordertoachievegooddetectionratesandfastrun-timeperformance,butalsoanoptimizedimplementationcontributestoaneffectiveuseofavailablehardwareresources.Wedescribetheprocessingandtrainingpipelineofthepedestriantrackingsystemwhichisreal-timecapableonstandardPChardware.

NoneedofGPUorFPGAprocessingisrequiredbyourapproachinordertoachieveusabletrackingframeratesonVGAResolutionevenonlowprofileIntelAtomprocessors.

TheremainingpaperconcentratesonthesystemarchitectureanddescribesresultsofourexperimentsduringextensivetrialsandtestsinanoutdoorenvironmentintheCityofVienna,Austria:thetrafficmonitoringsystemisinstalledatadoublestreetintersectionlocationwhichisespeciallydangerousforpedestrians.Theoperationofthesystemisassessedinextensivegroundtruthevaluationcampaignwhichusesthesamevideodataasthesystemitself.

7878-03, Session 1

The 18TH Annual Intelligent Ground Vehicle Competition: trends and influences for intelligent ground vehicle controlB.L.Theisen,W.Smuda,P.A.Frederick,U.S.ArmyTankAutomotiveResearch,DevelopmentandEngineeringCtr.(UnitedStates)

TheIntelligentGroundVehicleCompetition(IGVC)isoneoffour,unmannedsystems,studentcompetitionsthatwerefoundedbytheAssociationforUnmannedVehicleSystemsInternational(AUVSI).TheIGVCisamultidisciplinaryexerciseinproductrealizationthatchallengescollegeengineeringstudentteamstointegrateadvancedcontroltheory,machinevision,vehicularelectronicsandmobileplatformfundamentalstodesignandbuildanunmannedsystem.Teamsfromaroundtheworldfocusondevelopingasuiteofdual-usetechnologiestoequipgroundvehiclesofthefuturewithintelligentdrivingcapabilities.Overthepast18years,thecompetitionhaschallengedundergraduate,graduateandPh.D.studentswithrealworldapplicationsinintelligenttransportationsystems,themilitaryandmanufacturingautomation.Todate,teamsfromover75universitiesandcollegeshaveparticipated.Thispaperdescribessomeoftheapplicationsofthetechnologiesrequiredbythiscompetitionanddiscussestheeducationalbenefits.TheprimarygoaloftheIGVCistoadvanceengineeringeducationinintelligentvehiclesandrelatedtechnologies.Theemploymentandprofessionalnetworkingopportunitiescreatedforstudentsandindustrialsponsorsthroughaseriesoftechnicaleventsoverthefour-daycompetitionarehighlighted.Finally,anassessmentofthecompetitionbasedonparticipationispresented.

7878-04, Session 2

Stereo matching based on two cameras and one 3D image sensorL.Yang,X.Shao,R.Shibasaki,TheUniv.ofTokyo(Japan);R.Wang,JilinUniv.(China)

Duetotheproblemsofnoise,texturelessregionanddepthdiscontinuityinstereomatching,anewmatchingmethodbasedontwocamerasandone3Dimagesensorisproposedinthispaper.The3Dimagesensorcanofferanintensityimageandadepthmap.Theintensityimageisusedincalibrationofimagepairsanddepthmap.


IS&T /

ReturntoContents

Aftercalibration,thedepthmapistransformedtoaninitialdisparitymap.Withtheconstraintofthisinitialdisparitymap,theleftandrightimagesarematchedbyusingthenormalizedcovarianceoperator.Thedisparitysearchingrangecanbereducedfrom80pixelsto10pixels.Itcanlargelyimprovethematchingaccuracyanddecreasetherunningtime.Furthermore,thedisparitymapswithleftandrightimageviewsarecheckedbyleft-and-rightconsistency.Theexperimentresultsindicatethattheproposedalgorithmperformswellandthedisparitymaphasmoreaccuracycomparingwithexistingmethods.Theresearchachievementhasagoodprospectinapplication.

7878-06, Session 2

Linear stereo vision based objects detection and tracking using spectral clusteringS.Moqqaddem,Y.Ruichek,Univ.ofTechnologyofBelfort-Montbéliard(France);R.Touahni,A.Sbihi,IbnTofailUniv.ofKénitra(Morocco)

Objectsdetectionandtrackingisakeyfunctionformanyapplicationslikevideosurveillance,robotic,intelligenttransportationsystems,etc.Thisproblemiswidelytreatedintheliteratureintermsofsensors(videocameras,laserrangefinder,Radar)andmethodologies.Thispaperproposesanewapproachfordetectingandtrackingobjectsusingstereovisionwithlinearcameras.Afterthematchingprocessappliedtoedgepointsextractedfromtheimages,thereconstructedpointsinthesceneareclusteredusingspectralanalysis.TheobtainedclustersarethentrackedthroughouttheircenterofgravityusingaKalmanfilterandaNNbaseddataassociationalgorithm.Theapproachistestedandevaluatedonrealdatatodemonstrateitseffectivenessforobstacledetectionandtrackinginfrontofavehicle.Thisworkisapartofaprojectthataimstodevelopadvanceddrivingaidsystems,supportedbytheCPER,STICandVolubilisprograms.

7878-07, Session 2

Implementation of stereo vision on GPU for intelligent ground vehicle navigation in the presence of obstaclesC.Gamache,T.Padir,WorcesterPolytechnicInstitute(UnitedStates)

Thispaperdiscussesanimagesegmentationalgorithmthatusesaselforganizingmap(SOM)basedcolorreductionwithsimulatedannealing(SA)basedcolorclusteringimplementedonaGraphicsProcessingUnit(GPU)forintelligentgroundvehiclenavigationinthepresenceofobstacles.Thealgorithmusesaneuralnetworkvariantcalledaself-organizingmapwhichistrainedtoperformanonlinearcolorreduction.ThesimulatedannealingisthenusedtogrouptheSOMintoclusterstoproducethesegmentedimage.ThepaperspecificallydiscussesthecomputationalmethodusedtomodifytheSOM-SAalgorithmtoberunontheGPU.TheoriginalalgorithmrequiresanewSOMtobegeneratedforeachimageincreasingthecomputationalburden.Toavoidthisserialoperationatruntime,onemasterSOMondifferentimagesundervariouslightingconditionscanbetrainedandcolorclustered.TheexperimentalresultsshowthatthismodificationimprovedtheruntimeofthealgorithmontheGPUfromapproximately3minutesto8ms.TheimplementationofthecomputationallyheavysegmentationalgorithmontheGPUimprovestheautonomousnavigationfortheintelligentgroundvehiclebeingdevelopedatWorcesterPolytechnicInstituteinthepresenceofobstacles.

7878-08, Session 2

Probabilistic recognition of person reoccurrence for visual surveillance of pedestrian flowsL.Paletta,G.Fritz,JOANNEUMRESEARCHForschungsgesellschaftmbH(Austria)

Surveillancetasksareubiquitousinsecurityservicesandinpublictransportation.However,thecoverageofthesensornetworkisoftenfarfrombeingcompleteduetounobservableregions.Recognitionofpersonreoccurrenceismandatorytoenablewiderareaswithcontinuouscoveragefortracking.Theproposedworkcombinesthealreadyproveduseofcolorinformation,makesuseofsegmentationofpersoninformationtoweighttheambiguityinthecontributionofdifferentbodyparts,andappliesBayesianinformationfusionwithweightedcontributionsforoverallidentityhypotheses.Thisinnovativecombinationofsuccessfullyappliedcomponentsclearlyenablesahigherdegreeofdiscriminationandrobustnessandthereforebetterrecognitionrates.

WeperformedtwoexperimentstoevaluatetheperformanceinanextensivestudyatabusstationinAustria.Firstly,thereoccurrenceof29personswasevaluatedbyarecognitionrateof82%,withimagesshowingsignificantchangesinpersonorientation,pose,andilluminationconditions.Thesecondexperimentevaluatedanautomatedpersondetectorthathasbeenpost-processedbyourmethodresultinginarecognitionrateof72%.Weconcludethatthismethoddemonstratesrobustperformanceundernormalconditionsandisthereforehighlyusefulforconnectingdistributedcameranetworksintowidesurveillanceareas.

7878-09, Session 3

A multimodal eye tracking system for studies of embodied attentionL.Paletta,A.Almer,G.Fritz,K.Amlacher,P.Luley,S.Ladstätter,JOANNEUMRESEARCHForschungsgesellschaftmbH(Austria)

Measurementsofeyegazeontheenvironmentareoneofthemostimportantindicatorsofvisualattention.Inmobileeyetrackingtestpersonsinteractwithinataskspecificenvironment.Untilrecently,resultshavemostlybeengeneratedbymanualvideoannotation.Attentionhastobeconsideredinaframeworkofembodiment:bodiesselectlocationandorientation,eyesfocusonobjectsofinterest,andinteractionsdecideaboutenvironmentchanges.

Thisworkpresentsaninnovative,multimodalsystemfortheoutdoorstudyofembodiedattention.Visualattentioniscapturedfromthemobileeyetrackingvideosandsemanticallyinterpretedviaobjectdetectiontools.Thelocationofthetestperson’sbodyisestimatedfromtheanalysisofmultisensorydatafromGPS,accelerometersandadigitalcompass.Imagebasedlocalisationsupportsmultisensoryprocessingfortheestimationofageo-referencedhumangaze.Thesystemisabletomapthehumangazetogetherwiththeextractedvisualsemanticsusingworldcoordinatesintoa3Dmodeloftheenvironment.

Inareconstructionofhumangazeweachievedanaccuracyof20-80cmtargetlocalisationand95%inlogorecognitioninworldcoordinates.Positiontrajectories,views,gazeandtargetscanbevisualizedina3Dmodeloftheurbanshoppingzone.

7878-10, Session 3

Real-time car detection systemM.Rosner,SLREngineeringOG(Austria)

Thispaperpresentsacardetectionsystemthatisabletoworkinclosetoreal-timeonasmartcamera.Acascadeofhistogramsoforientedgradientswasusedasadetector.Thealgorithmandcodewereoptimizedforspeedtomeetthereal-timeconstraints,withoutloosing

Conference 7878: Intelligent Robots and Computer Vision XXVIII: Algorithms and Techniques


IS&T /

ReturntoContents

toomuchondetectionquality.Thesystemisnowabletoprocess10framespersecondonanAtomZ530(1.6GHz)processorusedinthesmartcamera.Onvideosusedforbenchmarkingonly1falsepositiveper5framesanddetectionrateof80%wasobserved.

Becausethereisnoadequatecardatasetknowntotheauthor,anewcardatasetwasintroduced(SLRCarDataset).Itconsistsofcarimagesscaledtoacertainsize,imageswithcarsandotherobject,imageswithoutcarsandvideoswithcars.Theapplicationonwhichthepaperisbasedisreadytodetectcarsinrealworldscenarios.Itisplannedtoextendittoalsotrackandanalysethedriverbehaviourpatterns.

7878-11, Session 3

Real-time people and vehicle detection from UAV imageryA.Gaszczak,T.Breckon,J.Han,CranfieldUniv.(UnitedKingdom)

Agenericandrobustapproachforthereal-timedetectionofpeopleandvehiclesfromanUnmannedAerialVehicle(UAV)isanimportantgoalwithintheframeworkoffullyautonomousUAVdeploymentforaerialreconnaissanceandsurveillance.HerewepresentanapproachtotheautomaticdetectionofvehiclesbasedonusingmultipletrainedcascadedHaarclassifiers(adisjunctivesetofcascades)withsecondaryconfirmationinthermalimageryaswellasapproachforpeopledetectioninthermalimageryusingalsomultipletrainedcascadedHaarclassifierswithmulti-variantGaussianshapematching.Theresultspresentedshowthesuccessfuldetectionofvehicleandpeopleundervaryingconditionsinbothisolatedruralandclutteredurbanenvironmentswithminimalfalsepositivedetection.Performanceofthedetectorisoptimizedtoreducetheoverallfalsepositiveratebyaimingatthedetectionofeachobjectofinterest(vehicle/person)atleastonceintheenvironment(i.e.perflight)ratherthaneveryobjectineachframe.Currentlythedetectionrateforpeopleis~70%andcars~80%althoughtheoverallepisodicobjectdetectionrateforeachflightpatternexceeds90%.

7878-12, Session 3

Real-time pose invariant logo and pattern detectionO.Sidla,SLREngineeringOG(Austria)

Thedetectionofposeinvariantplanarpatternshasmanypracticalapplicationsincomputervisionandsurveillancesystems.Therecognitionofcompanylogosisusedinmarketstudyanalysistoexaminethevisibilityandfrequencyoflogosinadvertisementordangersignsonvehiclescouldbedetectedtotriggerwarningsystemsintunnels.

WepresenttheresultsofastudyonlogodetectionwhichisbasedonthedetectionofNinvariant2dfeaturesandsubsequentmatchingandclustering.

Specificallywelookatthefollowingfeaturetypes:

-SURF

-Compactsignatures+randomferns

-Onewaydescriptor

whicharecombinedwiththefollowingpointdetectors:

-LoweDoGasfromtheSURFimplementation

-HarrisCornerDetector

-FASTCornerDetector

Forapplicationorientedtestswefirstgenerateasetoftestingimageswhichareusedtoexaminethelimitsofthe2dfeaturedescriptorsunderpose,perspective,andresolutionvariations.

Areal-worldtesttriestodetectvehicleswithadistinctivelogoinanoutdoorenvironmentunderdifferentlightingandweatherconditions:acameraismountedtoobserveagatesothatincomingtruckscanbe

monitored-sequencesofincomingvehicleswithaspecificbrandlogoaredetected,loggedandstoredformanualevaluation.

7878-13, Session 3

FirstAidAssistanceSystem: improvement of first aid measures by using Car2Car-communicationS.Tuchscheerer,T.Hoppe,C.Kraetzer,J.Dittmann,Otto-von-Guericke-Univ.Magdeburg(Germany)

Thiswork’sgoalistheenhancementoffirstaidmeasuresdirectlyaftercaraccidentsbydeterminingsuitedfirstaidersviaC2Ccommunicationandtoprovidedetailedsupportinstructions.Theconceptcombinesupcomingcar2car(C2C)communicationwithestablishedtechnologyasGPSandGSM.Afteracrash,theproposedFirstAidAssistanceSystem(FAAS)sendsabroadcastmessageviathe802.11pC2Cstandard.Allnearbycarsaspotentialfirstaidersarelocatedandatleastonenearestcandidate(wesuggest3-5asdiscussedinfinalpaper)drivingtowardstheaccidentischosenandnotifiedasfirstaider.Asupportguideonhismultipurposedisplay(e.g.thenavigationsystem)providesthehelperwithdetailedinstructionsandillustrativetutorials.Thepaperpresentstheconceptindetailwithapracticalevaluationusingafirstimplementation.

7878-14, Session 4

The report of estimating the egomotion of the moving stereo cameras in the environment including moving objects and reconstructing the observed space in 3DN.Tatematsu,J.Ohya,WasedaUniv.(Japan)

ThispaperproposesatemporalmodifiedRANSACbasedmethodthatcandiscriminateeachmovingobjectfromthestillbackground,cancomputethestereocameras’egomotion,andcanreconstruct3Dstructureofeachmovingobject.Wecomputed3Dopticalflowsfromthedepthmapandthetrackingfeaturepoint.Ingeneral,flowsfromdifferentobjectshavedifferentorientationsandlengths,whileflowsfromasameobjecthaveuniformorientationandlength.Wedefine“flowregion”asasetofconnectedpixelswhose3Dopticalflowshaveuniformorientationandlength.OurtemporalmodifiedRANSACsegmentsthedetected3Dopticalflowsintoflowregionsandcomputestherotationandtranslationmatrixforeachflowregion.ThemodifiedRANSACestimatesmultiplemodelsfromthedataandcanfindclusters,ThetemporalmodifiedRANSACperformsthemodifiedRANSACtoeachoftheflowregion.Finally,the3Dpointscomputedfromthedepthmapinalltheframesareregisteredusingeachflowregion’smatrixtotheinitialpositionsintheinitialframesothatthe3Dstructuresofthemovingobjectsandstillbackgroundarereconstructed.Experimentsusingmultiplemovingobjectsandrealstereosequencesdemonstratetheeffectivenessofourproposedmethod.

7878-15, Session 4

A multiple feature based particle filter using mutual information maximizationK.Hong,K.Han,PurdueUniv.(UnitedStates)

Indesigningatrackingalgorithm,utilizingseveraldifferentfeatures,e.g.,colorhistogram,gradienthistogramandotherobjectdescriptors,ispreferabletoincreaserobustnessoftrackingperformance.Inthispaper,weproposeamultiplefeaturefusionframeworktoimprovethetrackingbyassigningappropriateweightstoindividualfeatures.Thefeatureweightsareoptimallyobtainedbyawaterfillingprocedurethatmaximizesmutualinformationbetweentargetobjectfeaturesandqueryfeatures.Especially,inthispaper,wefocusonaparticle



IS&T /

ReturntoContents

filtertrackingimplementationofthemultiplefeaturefusionframework.Ourexperimentsshowthatobjecttrackingwithmultiplefeaturesoutperformssinglefeaturebasedtrackingmethodsandillustratesthattheproposedoptimalfeatureweightingincreasesrobustnessofmultiple-featurebasedtrackingperformance.

7878-16, Session 4

High precision object segmentation and tracking for use in super-resolution video reconstructionT.N.Mundhenk,R.N.Sundareswara,Y.Chen,HRLLabs.,LLC(UnitedStates)

Superresolutionimagereconstructionallowsfortheenhancementofimagesinavideosequencethatisbetterthantheoriginalpixelresolutionoftheimager.Difficultyariseswhenthereareforegroundobjectsthatmovedifferentlythanthebackground.Acommonexampleofthisisacarinmotioninavideo.Giventhecommonoccurrenceofthis,superresolutionreconstructionbecomesnon-trivial.Onemethodfordealingwiththisistosegmentoutforegroundobjectsandquantifytheirpixelmotiondifferently.FirstweestimatelocalpixelmotionusingastandardblockmotionalgorithmcommontoMPEGencoding.Thisisthencombinedwiththeimageitselfintoasixdimensionalmean-shiftkerneldensityestimationbasedimagesegmentationwithmixedmotionandcolorimagefeatureinformation.Thisresultsinatightsegmentationofobjectsintermsofbothmotionandvisibleimagefeatures.Thenextstepistocombinesegmentsintoasinglemasterobject.Statisticallycommonmotionandproximityareusedtomergesegmentsintomasterobjects.Toaccountforinconsistenciesthatcanarisewhentrackingobjects,wecomputestatisticsovertheobjectandfititwithageneralizedlinearmodel.UsingtheKullback-Leiblerdivergence,wehaveametricforthegoodnessofthetrackforanobjectbetweenframes.

7878-17, Session 4

Robust pedestrian detection and tracking from a moving vehicleN.X.Tuong,NanyangTechnologicalUniv.(Singapore);T.Müller,A.Knoll,TechnischeUniv.München(Germany)

Inthispaper,weaddresstheproblemofmulti-persondetection,trackinganddistanceestimationinacomplexscenariousingmulti-cameras.Specifically,weareinterestedinavisionsystemforsupportingthedriverinavoidinganyunwantedcollisionwiththepedestrian.

WeproposeanapproachusingHistogramsofOrientedGradients(HOG)todetectpedestriansonstaticimagesandaparticlefilterasarobusttrackingtechniquetofollowtargetsfromframetoframe.

Becausethedepthmaprequiresexpensivecomputation,weextractdepthinformationoftargetsusingDirectLinearTransformation(DLT)toreconstruct3D-coordinatesofcorrespondentpointsfoundbyrunningSpeededUpRobustFeatures(SURF)ontwoinputimages.Usingtheparticlefiltertheproposedtrackercanefficientlyhandletargetocclusionsinasimplebackgroundenvironment.However,toachievereliableperformanceincomplexscenarioswithfrequenttargetocclusionsandcomplexclutteredbackground,resultsfromthedetectionmoduleareintegratedtocreatefeedbackandrecoverthetrackerfromtrackingfailuresduetothecomplexityoftheenvironmentandtargetappearancemodelvariability.

Theproposedapproachisevaluatedondifferentdatasetsbothinasimplebackgroundscenarioandaclutteredbackgroundenvironment.Theresultshowsthat,byintegratingdetectorandtracker,areliableandstableperformanceispossibleevenifocclusionoccursfrequentlyinhighlycomplexenvironment.

Avision-basedcollisionavoidancesystemforanintelligentcar,asaresult,canbeachieved.

7878-18, Session 5

Design and evaluation of security multimedia warnings for interaction between human and industrial robotsJ.Fruth,J.Dittmann,C.Krätzer,Otto-von-Guericke-Univ.Magdeburg(Germany)

Inthisarticleasecurity-warningdesignconceptforproductionscenarioswithdirecthuman-machineinteractionisintroduced.TheuseofstandarddesktopITnetworktechnologiesforproductionnetworkscouldintroducethesamevulnerabilitiesknownfromdesktopIT,e.g.theinfectionwithmaliciouscode.Amalwareinfectionofacontrolcomputerofindustrialrobotscouldleadtopotentialindirectimpactstosafety,e.g.humanscouldbeharmedand/orworkpiecescouldbedamaged.Sofaronlysafetywarningsintheproductiondomainaredesignedandused.Inouropinionsecuritywarningsareneededtowarntheoperatorforspecialthreatscomefrommalwareinfectionsofrobotcontrolsystems,becauseoftheirpotentialsafetyimpacts.Thereforewedesignedsecuritywarningsbasedonstandarddesignconceptsofsafetywarnings,commonantivirus-warningsandownideas.Thewarningincludestwoclassesofwarningicons,thepropertyofthemaliciouscodeandinstructionstotheoperatorandaconfirmationbutton.Thecriticalityoftheeventiscommunicatedviasignallightandacousticalwarnings.Inthisarticleanevaluationapproach,operationalisationandtestresultsofthesecuritywarnings,usingusabilitytestingtechniques,advancementsofthedesignapproachandfutureworkarepresented.

7878-19, Session 5

Fingertip guiding manipulator for blind persons to create mental images by switch passive/active hybrid line-drawing explorationsS.N.SyedYusoh,MieUniv.(Japan)

Blindpeopleencountermanyobstaclesdaily.Onesuchobstacleistovisualize,mentally,graphicsthatarepresentedtothem.Therefore,supportsystemshavebeendevelopedtohelpimprovecommunicationandqualityoflifeforthevisuallyimpairedpeople.Afingertipguidingmanipulatorhasbeendevelopedasahapticgraphicdisplaytohelpblindpersonscreatementalimagesoflinedrawings.Themanipulatorhastwocharacteristicfunctions:(1)anindicatingfunctionofslider-moving-directionbyastepping-motorattachedatthearmend,and(2)afingertiptractionfunctionbyamanipulator.Theformertwofunctionsareutilizedtorepresentplaner2Dfigures.Somepreliminaryexperimentswerecarriedout,andthehumanperceptualthresholdswiththefingertiprotation,andthehumankinestheticfollowing-performancewereclarified.Thesethresholdsshallbeusedasthebackdatafordesigningmechanicalspecificationsofthefingertipguidingmanipulator.Thispaperdescribesourinvestigationonthedualmodefingertip-guidingfunctionallowingeitherpassiveoractiveexploration.Whenusingthismanipulator,thepersonisassumedtotouchasliderbyhis/herfingertip:thesliderisattachedattheendofthemanipulator.Intheactivemode,thefingertipguidingmanipulatorpullshis/herfingertipalonglinedrawings.Inthepassivemode,itprovidesakindofselectivecompliance,andallowsthepersonstomovehis/herfingertipfreelyjustonlyinthedirectionofthelinedrawings.Forthesakeofthedualmodefunction,itisexpectedthattheefficiencywouldbeimprovedalotcomparingtotheformermodelbeingequippedwiththepassivemodealone.Thus,ahapticgraphicdisplaybasedonthenovelconcept,i.e.,adualmodefingertipguidingmanipulatorwaspresented,andsomebasiccharacteristicswereconfirmed.Ithelpsblindpersonscreatementalimagesoflinedrawings.Thepreliminaryresultsaresummarizedasfollows:(1)Apassivemodefingertipguidingmanipulatorwasdeveloped.Itwasabletoteachpersonsthelinedrawingsofroutemaps.Inthepassivemode,thepositionoftheslidertracesthestrokesofthetargetlinedrawing.Sinceapersonpinchestheslider,theperson’sfingertipswerepulledalongthetargetlinedrawings.Duringthisprocess,thepersonisassumed



IS&T /

ReturntoContents

toperceivethelinedrawingsthroughhis/herkinestheticpositionalsense.(2)Aprototypeofanactivemodefingertipguidingmanipulatorwasexamined.Itisalsoabletoteachgraphics.Intheactivemode,thepositionoftheslidershouldbemovedwithfreeofforceinanappropriatedirectionalone.Therefore,thepersoncanmovehis/herfingertipsalongthetargettrajectorybyutilizinghis/herkinestheticforcesensewithmuchhighervelocity.Duringthisprocess,thepersonisexpectedtoperceivethelinedrawingthroughhis/herkinestheticpositionalsense.Inthefuture,theauthorsareplanningtoimplementthedualmodefingertipguidingmanipulatorinamoreconcreteform,andaregoingtocarryoutsomeexperimentswithblindpeople.

7878-20, Session 5

Augmented reality user interface for mobile ground robots with manipulator armsS.Vozar,D.Tilbury,Univ.ofMichigan(UnitedStates)

AugmentedReality(AR)isatechnologyinwhichreal-worldvisualdataiscombinedwithanoverlayofcomputergraphics,enhancingtheoriginalfeed.ARisanattractivetoolforteleoperatedUGVUIsasitcanimprovecommunicationbetweenrobotsandusersviaanintuitivespatialandvisualdialogue,therebyincreasingoperatorsituationalawareness.ThesuccessfuloperationofUGVsoftenreliesuponbothchassisnavigationandmanipulatorarmcontrol,andsinceexistingliteratureusuallyfocusesononetaskortheother,thereisagapinmobilerobotUIsthattakeadvantageofARforteleoperatingtherobotmanipulatorandbasetogether.

ThisworkdescribestheimplementationofanARUIsystemforaUGVwithanattachedmanipulatorarm,alongwiththeresultsofpreliminaryusertests.Thesystemsupplementsavideofeedshowntoanoperatorwithinformationaboutgeometricrelationshipswithintherobottaskspacetoimprovetheoperator’ssituationalawareness.

PreviousstudiesonARsystemsandpreliminaryanalysesindicatethatsuchanimplementationofARforamobilerobotwithamanipulatorarmisanticipatedtoimproveoperatorperformance.Afulluser-studycandetermineifthishypothesisissupportedbyperformingananalysisofvarianceoncommontestmetricsassociatedwithUGVteleoperation.

7878-21, Session 5

An embedded omnidirectional vision navigator for automatic guided vehiclesW.Feng,TianjinUniv.(China);B.Zhang,Z.Cao,X.Zong,TianjinUniv.ofTechnology(China);J.Röning,Univ.ofOulu(Finland)

Omnidirectionalvisionappearsthedefinitesignificancesinceitsadvantageofacquiringfull360°horizontalfieldofvisioninformationsimultaneously.Inthispaper,anembeddedoriginalomnidirectionalvisionnavigator(EOVN)basedonfish-eyelensandembeddedtechnologyhasbeenimplemented.Fish-eyelensisoneofthespecialwaystoestablishomnidirectionalvision,however,itappearswithanunavoidableinherentandenormousdistortion.Auniqueintegratednavigationmethodwhichisconductedonthebasisoftargetstrackinghasbeenresearched.Itiscomposedoftargetsrecognition,multi-targettracking,distortionrectification,spatiallocationandnavigationcontrol.Inordertoadapttothedifferentindoorandoutdoornavigationenvironments,weimplantmeanshiftanddynamicthresholdadjustmentintotheparticlefilteralgorithmtoimprovetheefficiencyandrobustnessoftrackingcapability.RTRLNhasbeenimplantedinanindependentresearchembeddedplatformwhichiscomposedofCOMS+FPGA+DSP.Itislikeasmartcrammertoguidevariousvehiclesindifferentenvironmentsbytrackingthediverselandmarkshangingintheairorontheroof.TheexperimentsprovethattheEOVNisparticularlysuitablefortheguidanceapplicationswhichhavehighrequirementsonprecision,repeatabilityandlongdistance.Theresearchachievementhasagoodprospectinapplication.

7878-23, Session 6

Detecting stationary human targets in FLIR imageryA.L.Chan,U.S.ArmyResearchLab.(UnitedStates)

Inthemilitaryarena,intelligentunmannedgroundvehicles(UGVs),weighing10tonsormore,maybedesignedandusedfortransportationorcombatpurposes.Toensuresafeoperationsamongciviliansandfriendlycombatants,itiscrucialfortheseUGVstodetectandavoidhumanswhomightbeinjuredunintentionally.Inthispaper,amulti-stagedetectionalgorithmforstationaryhumansinforward-lookinginfrared(FLIR)imageryisproposed.Thisalgorithmfirstappliesanefficientfeature-basedanomaliesdetectionalgorithmtosearchtheentireinputimage,whichisfollowedbyaneigen-neural-basedclutterrejecterthatexaminesonlytheportionsoftheinputimageidentifiedbythefirstalgorithm,andculminateswithasimpleevidenceintegratorthatcombinestheresultsfromthetwopreviousstages.TheproposedalgorithmwasevaluatedusingalargesetofchallengingFLIRimagesandtheresultssupporttheusefulnessofthismulti-stagearchitecture.

7878-24, Session 6

Spectrally queued feature selection for robotic visual odometeryP.A.Frederick,U.S.ArmyTankAutomotiveResearch,DevelopmentandEngineeringCtr.(UnitedStates);D.Pirozzo,BoozAllenHamilton(UnitedStates);M.S.DelRose,U.S.ArmyTankAutomotiveResearch,DevelopmentandEngineeringCtr.(UnitedStates)

Overthelasttwodecades,researchinUnmannedVehicles(UV)hasrapidlyprogressedandbecomemoreinfluencedbythefieldofbiologicalsciences.ResearchershavebeeninvestigatingmechanicalaspectsofvaryingspeciestoimproveUVairandgroundintrinsicmobility,theyhavebeenexploringthecomputationalaspectsofthebrainforthedevelopmentofpatternrecognitionanddecisionalgorithms,andtheyhavebeenexploringperceptioncapabilitiesofnumerousanimalsandinsects.Thispaperdescribesa3monthexploratoryappliedresearcheffortperformedattheUSARMYResearch,DevelopmentandEngineeringCommand’s(RDECOM)TankAutomotiveResearch,DevelopmentandEngineeringCenter(TARDEC)intheareaofbiologicallyinspiredspectrallyaugmentedfeatureselectionforroboticvisualodometery.Themotivationforthisappliedresearchwastodevelopafeasibilityanalysisonmulti-spectrallyqueuedfeatureselection,withimprovedtemporalstability,forthepurposesofvisualodometery.Theintendedapplicationisfuturesemi-autonomousUGVcontrolastherichnessofdatasetsrequiredtoenablehumanlikebehaviorinthesesystemshasyettobedefined.

7878-25, Session 6

Intuitive control of robotic manipulatorsD.Rusbarsky,RE2,Inc.(UnitedStates);J.P.Gray,U.S.ArmyTankAutomotiveResearch,DevelopmentandEngineeringCtr.(UnitedStates);D.J.Peters,RE2,Inc.(UnitedStates)

Inthefieldofunmannedgroundvehicleswithdexterousmanipulators,currentcontrolsystemsrequireahighcognitiveloadandtrainingtoproperlypositionthemanipulatorandhaveiteffectivelyinteractwithitsenvironment.AsroboticmanipulatorsgrowmorecapablethroughadditionaldegreesoffreedomandasExplosiveOrdinanceDisposal(EOD)robotsaredevelopedthattakeadvantageofmultiplemanipulatorsonthesameplatform,thedemandformoreintuitivecontrolandenhancedsituationalawarenesswillalsoincrease.AspartoftheModularIntelligentManipulationsystemwithIntuitiveControl(MIMIC)program,industryisworkingwiththeU.S.Armytoexploretechnologiesthatwillallowausertointuitivelycontrolmultipledegreeoffreedomroboticarmsandmaintainbetterawarenessoftheoperatingenvironmentthroughhapticfeedback.Inadditiontoreporting



IS&T /

ReturntoContents

resistance,hapticfeedbackcanhelpmakeoperatorsfeelliketheyareactuallytherewiththerobot.Coupledwithintuitivecontrolsandadvancedvideofeedback,thegoalofthisprogramistoprovideuserswiththesensationthatrobotsareanextensionoftheirbodies.Thispaperpresentson-goingresearchintheareaofintuitivecontrolandhapticfeedbackalongwiththeresultsofsomeofourearlydesignandprototypingwork.

7878-26, Session 6

Vision based low cost, precise, and robust localization method in GPS denied environmentsJ.Walter,U.S.ArmyTankAutomotiveResearch,DevelopmentandEngineeringCtr.(UnitedStates);D.C.Bentivegna,SeegridCorp.(UnitedStates)

Alowcost,preciseandrobustvision-basedlocalizationsystemexistsinthecommercialwarehouseindustry.Thesystemusesaseriesoflow-coststereocamerastocollectimagedataalonganautonomouspallet-jack’sroute.Thisimagedataiscompiledinto3Dmaps,knownasevidencegrids,whichcorrelateimagestopositionsinthevehicle’sroute.Duringsubsequenttraversesofthisroute,visualinformationischeckedagainsttheevidencegridtopinpointthevehicle’slocation.Theindoorlocalizationsystemhasshownanaccuracyoflessthanonecentimeter.Thispapersharespromisingresearchresultswhenthisindoorsystemwasfieldedwithlittlemodificationtoanoutside,GPS-deniedurbancanyonlikeenvironment.

7878-27, Session 7

Curved solid and dotted line characters segmentation and classificationK.Mohammad,S.S.Agaian,H.Saleh,TheUniv.ofTexasatSanAntonio(UnitedStates)

Segmentationisacrucialstepinavisionbasedrecognitionsystemsbecauseitextractsmeaningfulregionsforanalysis.Thesegmentationprocessattemptstodecomposethetextimageintoclassifiableimagescalledcharacters.Apoorsegmentationprocessproduceslessaccuraterecognition.Thepotentialapplicationforvisionbasedcharacterrecognitionishuge,andtherearemanychallengestodesignasinglesystemthatiscapableofperformingautomaticrecognitionforcurvedtext.Issueslikecurvature,characterconnectivity,varyingtextformats,androtationallhavestrongnegativeimpactontheaccuracyofthesystem.Theseeffectsreducetheaccuracyofsegmentationstep.

Thispaperpresentsandimplementstwonewalgorithmsforhandlingtherotationissues.ThefirstisbasedontheHoughtransformandisusedtodrawalinethroughthetext,mimickingtherotationangle.Thisalgorithmshowsexcellentresultsfordottedlinecharactersinanimage.Thesecondalgorithmisbasedonlocatingthecornersofthetextboxandextractingthemtodeterminerotation.Thisalgorithmshowexcellentresultsforsolidlinecharacters.Afterthetextislocalized,therotationisdeterminedandorientationiscorrected.Thetextisthensegmentedandthecharactersintheimageareidentified.Aseparatefillingalgorithmisusedtodealwiththedottedcharacter.

Featurevectorsbasedoncharactershapeareproposedforclassification.Thesevectorsaredividedintofourgroups:1)usesthe4levelsoftheHaartransform.2)Usesthedensityof‘ones’intheskeletonimageofthecharacter.3)Usesthedensityof‘ones’intheareaofthebinaryimage,and4)groupingofthe‘ones’withinthesameset.Differentweightsareappliedbasedontheirabilitytodistinguishthecharacters.The(SVM)classifierestimatessimilaritiesbetweentheexaminedtargetcharactersandthetrainingsetofcharacters.

ImagesfromOzarkaandDesaniwaterbottlesweretestedusingthenewalgorithmsandshowedexcellentresultsandimprovedruntimescomparedtoconventionalsegmentationmethods.

Therotationhandling,segmentationandclassificationusingthenewsetoffeaturevectorswerecombinedandtestedusingadifferent

setofimagesforwaterbottles.Experimentalresultsforthesesetsofmorethan300differentcharacterimageswithdifferenttextshowanaverageof93%successfulusingthenewsetoffeaturevectorsforclassificationandanewalgorithmofrotation,fillingandsegmentationofthetext.Theproposedsolutionimprovesruntime(lessthan5second),accuracy(99%)andbetterforHardwareimplementationcomparingtothecurrentstateofart.

7878-28, Session 7

Accelerating robust 3D pose estimation utilizing a graphics processing unitA.R.Gerlach,B.K.Walker,Univ.ofCincinnati(UnitedStates)

Thespin-imageposeestimationalgorithmisanaccuratemethodforestimatingposeofthree-dimensionalobjectswhilebeingbothrobusttoclutterandsensornoise.Unfortunately,thealgorithmhasahighcomputationalcomplexity,thuspreventingitsuseinapplicationsthatrequirearoboticsystemtointeractwithadynamicenvironment.Uponinspection,thespin-imagealgorithmcanbebrokendownintofiveportionswhereasingleportioncalledspin-imagematchingcommands96%ofthecomputationtimeinestimatingpose.Because,thematchingofindividualspin-imagescanbeperformedindependentlyregardlessoforder,thisportionofthealgorithmisidealforthemassivelyparallelarchitectureofthegraphicsprocessingunit(GPU).

ThispaperintroducesaGPUimplementationofthespin-imagematchingportionofthespin-imagealgorithmwhichmakesnomodificationstothespin-imagealgorithm,thusnotcompromisingitsrobustnessandaccuracy.Thisimplementationresultsinaspeed-upinspin-imagematchingof515xandatotalalgorithmicspeed-upof24.6xoutofatheoreticalmaximumof26.0xoveraMATLABimplementation.ThisGPUimplementationextendstheuseofthespin-imagealgorithmtowardspracticalreal-timeroboticapplications.

7878-29, Session 7

Calibration and rectification research for fish-eye lens applicationW.Feng,TianjinUniv.(China);B.Zhang,Z.Cao,X.Zong,TianjinUniv.ofTechnology(China);J.Röning,Univ.ofOulu(Finland)

Accurateparameterscalibrationandeffectivedistortionrectificationofanimagingdeviceisofutmostimportanceincomputervision.Fish-eyelensproducesahemisphericalfieldofviewofanenvironment,whichappearsdefinitesignificantsinceitsadvantageofpanoramicsightwithasinglecompactvisualscene.Buttheimagestakenbyfish-eyelenshaveanunavoidableinherentseveredistortion.Anovelcalibrationmethodisproposedtoestimatingtheinternalandintrinsicparametersofthevisionsystem,whichemploysatransformationbetweenpointsintheworldcoordinatesystemandtheircorrespondinglocationontheimageusingthespecialcalibrationpatterns.Thenthecalibrationresultsareemployedtorectifytheimagedistortion.SupportVectorMachine(SVM)andSphericalEquidistanceProjectionAlgorithm(SEPA)areintegratedtoreplaceordinaryrectificationmodel.SVMisamachinelearningmethodbasedonthetheoryofstatistics,whichhavegoodcapabilitiesofimitating,regressionandclassification.TheapproachusingSVMprovidesamappingbetweenthefisheyeimageandthestandardimageforhumaneyes,andutilizingSEPAreproducestheinformationabouttheedgeofthefish-eyelensimage.Thevalidityandeffectivenessofourcalibrationandrectificationproceduresaredemonstratedbyprocessingtherealimages.



IS&T /

ReturntoContents

7878-30, Session 7

A hardware-software co-design approach to a JPEG encoder design for a planetary micro-rover applicationS.Sarma,K.Parameswaran,S.Udupa,K.M.Bharadwaj,IndianSpaceResearchOrganisation(India)

Thereisagreatinterestamongstthescientificcommunitiestoexplorevariousplanetsforscientificstudiesusingroboticmissionswiththeleveragetohostileenvironmentsatreducedrisk,cost,andgreatermobilitythanthatispossiblewithmannedexplorations.Admittedly,theprimaryobjectiveofmanyofsuchroboticmissionsaretocollectsciencedata,forinstance,siteexplorationusingvisualimagesorvideos,chemicalanalysisofsurfaceterrain,andcollectionofsamplestomentionafew.However,suchmissionsarecharacterizedbyseverpowerandbandwidthconstraintsduetolongdistanceandlackofabundantpowersources.Largesetofvisualimagescollectedbyvariouscamerason-boardaroveristobeprocessedandtransmittedtoEarthforscientificstudieswhicharemostlyachievedbyusingvariousdatacompressiontechniques.Inparticular,theJPEGimagecompressionstandardthatisdevelopedbytheJointPhotographicExpertsGroupcommitteeforuseincompressingdigitalimagesandfullcolorphotographicimagesisverypopular.ItisoneoftheprimaryformatsusedforexchangingpicturesontheWorldWideWeb,anditiscommonlyusedindigitalcamerasasthestorageformat.Inthispaper,ahardware-softwarebasedco-designapproachpresentedwiththeaimtoimplementaJPEGencoderforamicro-roverfortwoprocessorsystemsnamely,MIL-STD-1750andPowerPCunderseriouspowerandbandwidthconstraints.Twomethods,onemostlyusingsoftwareimplementationandtheother-aFPGAbasedpipelinedhardwarearchitecture,arecomparedfortheirperformanceandresourceutilizationusingplanetaryterrainimagesofvarioussizesandqualitysettingsforboththeseprocessorarchitecture.Infact,theresultspresentedareextensivelysubstantiatedbysimulationandpracticalimplementationinFPGA.Basedonthesestudies,suitableguidelinesareelucidatedtoarriveataneffectualarchitectureforaplanetarymicro-roverforfutureexplorationbyanIndianMoonmission.

7878-31, Session 8

Phobetor: Princeton University’s entry in the 2010 Intelligent Ground Vehicle CompetitionJ.Newman,S.O.Abiola,R.M.Corey,S.A.Suresh,L.J.Szocs,B.A.Partridge,D.D.Yu,H.Zhu,PrincetonUniv.(UnitedStates)

InthispaperwepresentPhobetor,ourentryinthe2010IntelligentGroundVehicleCompetition(IGVC).Wedescriberevisedvisionandnavigationsoftwarethatimprovetherobustnessandspeedofourrobot,andournewly-constructedplatformdesignedtoaddresspreviousyears’concernssuchaswatertightnessandserviceability.

Ourvisionsoftwareusescolorstereoimages.WeuseRANSACtolocallyestimatethegroundplaneandclassifypointsthatlieabovethatplaneasobstacles.Thisrobustmethoddetectsavarietyofobstaclesonuneventerrain.Todetectlanes,weprocesstheimagewithanedgedetectionfilterandrejectedgesifthecorrespondingimagepixelsarenotlane-colored,greatlyreducingfalsepositivesandnoise.WeuseRANSACtoidentifyseparatelanesandmodeleachwitharotatedparabola.

ForpathplanningPhobetorusestheAnytimeDynamicA*algorithm.Thealgorithmallowsincrementalre-planning,forefficiency.Itgeneratesoptimalpathsgivenenoughtime,butfirstquicklygeneratesavalid,sub-optimalpathsotherobotneverneedstowait.Weaugmentthecostmapoftheenvironmentwithapotentialfieldwhichaddressestheproblemof“wall-hugging”andsmoothesthepath.

7878-32, Session 8

Application of parallelized software architecture to an autonomous ground vehicleR.Shakya,A.Wright,Y.H.Shin,O.Momin,S.Petkovsek,P.Wortman,P.Gautam,A.Norton,TrinityCollege(UnitedStates)

ThispaperpresentsimprovementsmadetoQ,anautonomousgroundvehicledesignedtoparticipateintheIntelligentGroundVehicleCompetition(www.igvc.org).TheIGVChastwomainchallenges,calledtheautonomouschallengeandthenavigationchallenge.Intheautonomouschallengethevehicleisrequiredtofollowacoursewhileavoidingobstaclesandstayingwithinthecourseboundaries,whicharemarkedbywhitelines.Forthenavigationchallenge,thevehicleisrequiredtoreachasetoftargetdestinations,knownaswaypoints,withgivenGPScoordinatesandavoidobstaclesthatitencountersintheprocess.Forthe2010IGVC,Qwasupgradedwithanewparallelizedsoftwarearchitectureandanewdual-corevisionprocessor.Thenewsoftwarearchitecturemodularizesallthenecessarytaskssuchasmotorcontrol,navigationandsensordatacollectionandexecutestheminparallel,providingconsiderableflexibilityandfacilitatingefficientuseofprocessingpower.ThevisionprocessorincreasedspeedandreliabilityoftheimageprocessingalgorithmonQ.WithalltheseimprovementsQwasabletonavigatethroughtwoS-curvesandtravelalmost300feet,whichisjustpastthehalfwaymarkoftheautonomouschallengecourse.AsaresultQplaced2ndintheautonomouschallengeand3rdoverallamong57participatingentries.

7878-33, Session 8

WOAH: an obstacle avoidance technique for high speed path followingN.Tuck,M.McGuinness,F.W.Martin,Univ.ofMassachusettsLowell(UnitedStates)

ThispaperpresentsWOAH,aWorkingObstacleAvoidanceHeuristic.WOAHisareal-timereactiveobstacleavoidancetechniqueformobilerobotsdesignedtoleveragepolarrangingdatafromasinglelaserrangingdevice.Unlikemostcurrenttechniques,thismethodallowsarobottotravelquicklypastobstacleswithoutslowingdown,resultinginconsistentfastprogresstowardsaspecifiedgoal.

IntestingobstacleavoidancetechniquesthatworkwithasingleLIDAR,wediscoveredthatexistingmethodsdidnottraversepathsasquicklyastheycould.AlgorithmswetestednotablyincludedSmoothNearnessDiagramandVectorFieldHistogram.Essentially,wefoundthesetechniquestobeexcessivelycautiousnearobstaclesevenwhentheobstacleswerenotintherobot’spath.WOAHwasdesignedspecificallytoavoidthisproblem.

Preliminaryresultshavebeenpromising.ArobotusingaprototypeversionofWOAHachieved3rdplaceinthe2010IGVCNavigationChallenge,visitingsevenofninewaypointsanddoingsoatthefastestpaceofallcompetitorsthatvisitedatleasttwo.InthepaperwewillpresentresultscomparingWOAHtoprevioustechniquesbothinsimulationandinlivetestingonouroutdoortestcourse.


Continuous target tracking based on multiple viewsY.Liu,BeijingUniv.ofPostsandTelecommunications(China)

Inordertosolvetheproblemoftargethandoffinhelpingservicerobottotracktargetsbasedonceilingcamerasinsmartspace,thepaperputsforwardanalgorithmofcombiningprojectiveinvariantwithhistogram,whichcanbeusedinnon-overlappingoroverlappingconditions.Meanwhile,itanalyzesmulti-targetmotiontrajectories.Theexperimentindicatesthatthemethodmaytrackobjectseffectivelyanditisrobusttoocclusion,whichcanmeetactualneeds.



IS&T /

ReturntoContents


A target detection method in multimodal images with complex backgrounds and different viewsZ.He,X.Ding,TsinghuaUniv.(China)

Targetdetectioninmultimodal(multisensor)imagesisadifficultproblemespeciallywithdifferentviewsandcomplexbackgrounds.Inthispaper,weproposeatargetdetectionmethodbasedongroundobjectregionextractionandgraphmodelmatchingtosolveit.First,theextrinsicparametersofcameraareusedtotransformtheimagestoreducetheimpactofviewpointsdifferences.ThenthestableobjectregionsareextractedtodescribetheobjectshapesinmultimodalimagesbyMSERwhichcaneffectivelyreducetheimpactofnoiseandmultimodal.Thoseregionsofgroundobjectswhoseshapesarelessaffectedbytheviewtransformationsareusedtobuildagraphmodeltodescribethetargetinthereferenceimagewithspatialconstraintstoreducetheimpactofcomplexbackgrounds.Atlast,agraphmodelregistrationalgorithmisdeveloppedusingroundregionmatchingandspatialconstraintstofindthetargetinthesensedimages.ThealgorithmisbasedontheideaofRANSACandobtainedasatisfiedexperimentresultinourdatasetoftwovisiblereferenceimagesintopviewandfourgropsofinfraredsensedimagesinsideview.Thefinaldetectionratesofgroup1,2and4areallabove95%whilegroup3is83.61%.


Gender classification robust to face pose variation and partial occlusionP.Zeng,NanchangUniv.(China);Y.Zhang,TsinghuaUniv.(China)

Anewapproachforautomaticgenderclassificationisproposedtobalancethealgorithmcomplexityandaccuracyrates.Threekeycontributionsinthepaperarepresented:

Firstly,aconcepttomendapartlyoccludedfacewiththesymmetricpixelsorastandardneuralfaceisputforwardaccordingtotheoccludedposition,whichmakesallimagesacompleteuniformfacegothroughthesameclassifier.Inthisway,therequirementsformanyclassifiersandtrainingdatumareavoidedfordifferentoccludedparts.

Second,arelativesimplecharactertrianglestructuredbyeyes’andnose’spositionsistappedtorotatethein-depthheadrotation,inordertocompensateposevariation.

Finally,anunevenlysampledGaborWaveletMask(GWM),accordingtoeachpartoffacedevotingtogenderclassification,isapplied.Thisprovidesaconsiderablywiderrangeoftoleranceagainstheadrotation.

Thecomparativeexperimentsareconductingonseveralpubliclyavailabledatabases,suchasFERET,ARfacedatabasesandUTechcompanyfaceimages.Theresultsdemonstratethattheproposedapproachhasaconsiderablytoleranceagainstbothin-depthheadrotationandpartialocclusion.


Lane marking detection by extracting white regions with predefined width from bird’s-eye road imagesS.Abe,UtsunomiyaUniv.(Japan)

Detectinglanemarkingsonroadsfromin-vehiclecameraimagesisveryimportantbecauseitisoneofthefundamentaltasksforautonomousrunningtechnologyandsafetydrivingsupportsystem.Thereareseverallanemarkingsdetectionmethodsusingthewidthinformation,butmostoftheseareconsideredtobeinsufficientforobliquemarkings.So,theprimaryintentofthispaperistoproposeadetectinglanemarkingsmethodrobusttoorientationofmarkings.Inthiswork,wefocusonthewidthoflanemarkingsstandardizedbyroadactinJapan,andproposeamethodfordetectingwhitelanemarkingsbyextractingwhiteregionswithconstantpredefinedwidthfrombird’s-eyeroadimagesaftersegmentationsuchascategoricalcolorareaone.TheproposedmethodisbasedontheconstrainedDelaunaytriangulation.Theproposedmethodhasameritthatcanbemeasureanexactwidthforobliquemarkingsonthebird’s-eyeimagesbecauseitcanbeobtainedperpendicularwidthforedge.Theeffectivenessoftheproposedmethodwasshownbyexperimentalresultsfor187actualroadimagestakenfromanin-vehiclecamera.


Selective locality preserving projections for face recognitionF.Dornaika,Univ.oftheBasqueCountry(Spain);A.Assoum,LebaneseUniv.(Lebanon)

Recently,agraph-basedmethodwasproposedforLinearDimensionalityReduction(LDR).ItisbasedonLocalityPreservingProjections(LPP).LPPisatypicallineargraph-baseddimensionalityreduction(DR)methodthathasbeensuccessfullyappliedinmanypracticalproblemssuchasfacerecognition.LPPisessentiallyalinearextensionofLaplacianEigenmaps.Whendealingwithfacerecognitionproblems,LPPisprecededbyaPrincipalComponentAnalysis(PCA)stepinordertoavoidpossiblesingularities.BothPCAandLPParecomputedbysolvinganeigendecompositionproblem.Inthispaper,weproposeanovelapproachcalled“SelectiveLocalityPreservingProjections”thatgoesbeyondthecombinationoftheprinciplesofLPPwithFeatureSelectionparadigm.ItperformsasimultaneouseigenvectorselectionandconstructionassociatedwithPCAandLPP.Wehavetestedourproposedapproachonseveralpublicfacedatasets.ExperimentsonORL,UMIST,andYALEFaceDatabasesshowsignificantperformanceimprovementsinrecognitionovertheclassicalLPP.Theproposedapproachlendsitselfnicelytomanybiometricapplications.



IS&T /

ReturntoContents

Conference 7879: Imaging and Printing in a Web 2.0 World IIWednesday-Thursday26-27January2011PartofProceedingsofSPIEVol.7879ImagingandPrintinginaWeb2.0WorldII

7879-01, Session 1

Web-based magazine design for self publishersA.A.Hunter,D.N.Slatter,Hewlett-PackardLabs.(UnitedKingdom)

ShortrunprintingtechnologyandwebservicessuchasMagCloudprovidenewopportunitiesforlong-tailmagazinepublishing.Theyenableselfpublisherstosupplymagazinestoawiderangeofcommunities,includinggroupsthataretoosmalltobeviableastargetcommunitiesforconventionalpublishers.

InaWeb2.0worldwhereusersconstantlydiscovernewservicesandwheretheymaybeinfrequentpatronsofanysingleservice,itisunreasonabletoexpectuserstolearnthecomplexservicebehaviors.Furthermore,wewanttoopenuppublishingopportunitiestonoviceswhoareunlikelytohavepriorexperienceofpublishingandwholackdesignexpertise.

Magazinedesignautomationisanambitiousgoal,butrecentprogresswithanotherwebservice,Autophotobook,provesthatsomelevelofautomationofpublicationdesignisfeasible.ThispaperdescribesourcurrentresearchefforttoextendtheautomationcapabilitiesofAutophotobooktoaddresstheissuesofmagazinedesignsothatwecanprovideaservicetosupportprofessional-qualityselfpublishingbynoviceusersforawiderangeofcommunitytypesandsizes.

7879-02, Session 1

Improve artwork design through data tracking systemW.H.Wang,R.Muzzolini,Shutterfly,Inc.(UnitedStates)

Inpersonalizeddigitalprinting,suchasgreetingcards,calendarsandphotobooks,peopleselectartworkstomatchtheirphotosattheirpreference.Artworkdesignelementsareoftencategorizedbyoccasions,styles,andproduct.Themountofdesigngrowssignificantly,ascustomersdemandmorechoicesandthetrendsofpopulardesignsriseandfadeseasonbyseason.Itiscrucialtomanageandunderstandhowdesignelementsareusedinordertocreatemostdesirableproductions.Inthispaper,weanalyzeandcomparedifferentdesigntrackingsystems.Artworkdesignsarelabeled,ranked,andcrossreferenced.Foreachsystem,wedemonstratethescaleofapplications,datacollectiontechniquesanditsadvantagesanddisadvantages.

7879-03, Session 1

DOM-based print-link detection for web article extractionS.J.Liu,S.H.Lim,J.Liu,Hewlett-PackardLabs.(UnitedStates)

WebpagesfromsomeWebsitesprovideahyperlink(orlink)thatleadstoaprint-friendlywebpagethatcontainsmainlythearticleitself.Contentextractionusingtheseprint-friendlypagesisgenerallyeasierandmorereliable.Buttherearemanyvariationsoftheprint-linkrepresentationsthatmadetheprintlinkdetectionmoredifficultthanitfirstappeared.First,thelinkcanbetext-based,image-based,orboth.Forexample,thereisalexiconofphrasesusedtorepresenttheprintlinks,suchas“print”,“printarticle”,“print-friendlyversion”,etc.Inaddition,somepagesuseprinter-resemblingimageiconswithorwithoutaprintphrasepresent.Tocomplicatethematterfurther,notallthelinkscontainavalidURL,butinsteadthepagesaredynamicallygeneratedeitherbytheclientJavascriptorbytheserver,whicharenot

retrievableusingtheDOM-basedextractiontechniquesincenovalidURLisavailableintheDOM.Oursolutiontotheprint-linkextractionproblemtakesontwostages:(1)thedetectionoftheprint-link,(2)theretrievaloftheprint-friendlyURLfromthelinkattributes,includingthetestforitsvalidityandtheconversionofrelativetoabsoluteURL.Experimentalresultsbasedonroughly2000webarticlepagessuggestoursolutioniscapableofachievingover99%precisionandover97%recall.

7879-05, Session 2

A web-based troubleshooting tool to help customers self-solve color issues with a digital printing workflowH.J.Santos-Villalobos,PurdueUniv.(UnitedStates);V.Loewen,Hewlett-PackardCo.(UnitedStates);M.R.Letho,J.P.Allebach,PurdueUniv.(UnitedStates)

Currentprintingtechnologiesenablecustomerstoreproducehighquality,realistic,andcolorfulhardcopiesoftheirdigitaldocuments.Althoughtheactivityofprintingistransparenttothecustomers,theprogressionofacustomer’sdocumentthroughthedigitalprintingworkflow(DPW)isacomplexprocessthatmayalterthecolorsintheprintjob.GiventhecomplexityoftheDPW,itisadifficultproblemtodiagnosethesourceofthecolorissue.Noveltoolsandmethodsthataddressthischallengearebeneficialforboththemanufactureranditscustomers.Weproposeaweb-basedtroubleshootingtoolthathelpscustomerstoself-solvecolorissueswithelectrophotographiclaserprinterswhenprintingsolidcolorsingraphicsandtext,Thetoolhelpsthecustomertoreconfigurehis/herDPWfollowingprintingbestpractices.Iftheissueisstillunresolved,thetoolguidestheusertosearchthegamutoftheprinterforhis/hercolorpreference.Theusabilityofthetoolwascarefullyevaluatedwithhumansubjectexperiments.Also,thedescriptionandorganizationofthetroubleshootingtaskswerecontinuouslyreviewedandimprovedinregularmeetingsofthedevelopmentteam.Inthispaper,wedescribethetroubleshootingstrategy,thecolorpreferencesearchalgorithm,andtheresultsoftheusabilityexperiments.

7879-06, Session 2

Language-based color editing for mobile deviceY.Zhao,R.Bala,K.M.Braun,Z.Langford,R.J.Rolleston,XeroxCorp.(UnitedStates)

NaturallanguagecolorwasinitiallydevelopedasadesktopapplicationandthendeployedinoneXeroxprintdriver.NLCchangestheimage-editingparadigmfromtheuseofcurves,sliders,andknobs,totheuseofverbaltext-basedcommandssuchas“Makelightgreensmuchlessyellowish”.Thetechnologyappealstoacommonuserwhohasnoexpertknowledgeincolorscience,andthisnaturallyleadsonetothinkaboutitsuseinmobiledevices.AprototypeGUIdesignforalanguagetext-basedcoloreditingiPhoneapplicationwillbepresentedthatusesseveralofitshapticinterfaces(e.g.“slot-machine”,shaking,swiping,etc.).Atextualinterfaceisprovidedtoselectacolortobemodifiedwithintheimageandadirectionofchangeforthemodification.Aswipeinterfaceisprovidedtoselectamagnitudeandpolarityforthemodification.Actionsonthetextualandswipeinterfaceareconvertedtonaturallanguagecommandsthatareinturnusedtoderiveacolortransformationthatisappliedtorelevantportionsoftheimagetoyieldamodifiedimage.Themodificationsaredisplayedinrealtimeforausertoobserveastheyareinputted.


IS&T /

ReturntoContents

7879-07, Session 2

Personalized imaging: moving closer to realityH.Ding,PurdueUniv.(UnitedStates);R.Bala,Z.Fan,XeroxCorp.(UnitedStates);C.A.Bouman,J.P.Allebach,PurdueUniv.(UnitedStates)

Noabstractavailable

7879-16, Session 2

How Web 2 0 technologies lead to more tangible printed outputR.Fageth,CeWeColorAG&Co.OHG(Germany)

Noabstractavailable

7879-08, Session 3

Document distance measures and document browsingI.Ahmadullin,PurdueUniv.(UnitedStates);J.Fan,N.Damera-Venkata,S.H.Lim,Q.Lin,J.Liu,S.J.Liu,E.O’Brien-Strain,Hewlett-PackardLabs.(UnitedStates);J.P.Allebach,PurdueUniv.(UnitedStates)

Managinglargedocumentdatabasesisanimportanttasktoday.Beingabletoautomaticallycomparethedocumentlayouts,andclassifyandsearchdocumentswithrespecttotheirlayoutsprovestobedesirableinmanyapplications.Wemeasuredocumentsimilaritywithrespecttothedocumentlayoutcomponents.Documentsareinitiallysegmentedtoidentifyregionsoffourclasses:text,header,imageandbackground.Tosimplifythedocumentlayoutcomparisontaskwerepresenttheregionsasboundingblocksthatenclosedocumentcomponentsofdifferentclasses.Thedocumentsimilaritymeasureisthencalculatedwithrespecttoposition,size,andcolorhistogramoftheregions.Usingthisdocumentsimilaritymeasureweproposeabrowsingmechanismoperatingonadocumentdataset.Forthesepurposes,weuseahierarchicalbrowsingenvironmentwhichwecallthedocumentsimilaritypyramid.Itallowstheusertobrowselargedocumentdatasetandsearchfordocumentsinthedatasetthataresimilartothequerydocument.Documentsclosetoeachotherinthedatasetaregroupedtogethertorepresentdocumentclustersthataresequentiallymergedinaformofaquad-tree.Eachclusterisrepresentedbyoneofthedocumentsitcontains,whichallowsustocreateasimilaritypyramidofthedocumentdataset.Theuserisallowedtobrowsethedatasetondifferentlevelsofthepyramidandreconstructthepyramidwithrespecttothedocumentsthatareofinterest.Oneapplicationofthealgorithmisbrowsingdesigntemplates.Thesoftwareperformanceistestedonadocumentdesigntemplatesdataset.

7879-09, Session 3

Adaptive removal of background and white space from document images using seam categorizationC.S.Fillion,Z.Fan,XeroxCorp.(UnitedStates)

Documentimagesareobtainedregularlybyrasterizationofdocumentcontentandasscansofprinteddocuments.Resizingviabackgroundandwhitespaceremovalisoftendesiredforbetterconsumptionoftheseimageswhetherondisplaysorinprint.Whilewhitespaceandbackgroundareeasytoidentifyinimages,existingmethodssuchasnaïveremovalandcontentawareresizing(seamcarving)eachhavelimitationsthatcanleadtoundesirableartifacts,suchasunevenspacingbetweenlinesoftextorpoorarrangementofcontent.An

adaptivemethodbasedonimagecontentishenceneeded.

Inthispaperweproposeanadaptivemethodtointelligentlyremovewhitespaceandbackgroundcontentfromdocumentimages.Documentimagesaredifferentfrompictorialimagesinstructure.Theytypicallycontainobjects(textletters,picturesandgraphics)separatedbyuniformbackground,whichincludebothwhitepaperspaceandotheruniformcolorbackground.Pixelsinuniformbackgroundregionsareexcellentcandidatesfordeletionifresizingisrequired,astheyintroducelesschangeindocumentcontentandstyle,comparedwithdeletionofobjectpixels.

Weproposeabackgrounddeletionmethodthatexploitsbothlocalandglobalcontext.Themethodaimstoretainthedocumentstructuralinformationandimagequality.Thealgorithmwillbeillustratedwithexperimentalexamples.

7879-10, Session 3

Aesthetic role of transparency and layering in the creation of a photo layoutM.V.OrtizSegovia,PurdueUniv.(UnitedStates);N.Damera-Venkata,E.O’Brien-Strain,J.Fan,S.H.Lim,S.J.Liu,J.Liu,Q.Lin,Hewlett-PackardLabs.(UnitedStates);J.P.Allebach,PurdueUniv.(UnitedStates)

Eventhoughtechnologyhasallowedustomeasuremanydifferentaspectsofimages,itisstillachallengetoobjectivelymeasuretheiraestheticappeal.Amorecomplexchallengeispresentedwhenanarrangementofimagesistobeanalyzed,suchasinaphoto-bookpage.Severalapproacheshavebeenproposedtomeasuretheappealofadocumentlayoutthat,ingeneral,makeuseofgeometricfeaturessuchthepositionandsizeofasingleobjectrelativetotheoveralllayout.Evenfewereffortshavebeenmadetoincludeinametrictheinfluenceofthecontentandcompositionofimagesinthelayout.Manyoftheaestheticcharacteristicsthatgraphicdesignersandartistsuseintheirdailyworkhavebeeneitherleftoutoftheanalysisorpartiallyquantizedintheefforttomaterializetheconcepts.

Moreover,graphicdesigntoolssuchastransparencyandlayeringplayanimportantroleintheprofessionalcreationoflayoutsfordocumentssuchaspostersandflyers.Themaingoalsofourstudyaretoapplysimilartechniqueswithinanautomatedphoto-bookgenerationtool,andtofurtherevaluatetheaestheticcharacteristicsoftheresultinglayouts.Amongotherdesigntechniques,thetoolencouragestheuseoflayeringandtransparencyinthelayouttoproduceaprofessionallookingarrangementofthepictures.Twoseriesofexperimentswithpeoplefromdifferentlevelsofexpertisewithgraphicdesignprovideduswiththetoolstomaketheresultsofoursystemmoreappealing.Forthefirstexperiment,weentrusted12graphicdesigners,professionalsandseniorstudents,thetaskofcreatingasingle-pagelayoutwithagivenphotocollectionundersomespecificconstraints.Theconstraintsimposedtothedesignersweremeanttoguidetheirwork.Inthesecondexperiment,weaskedpeoplewithlowornolevelofexpertiseingraphicdesigntouseourcomputer-basedtooltogenerateaphoto-layout.Bothgroupswererequiredtoreportadetaileddescriptionabouttheircreativeprocess.Inthispaper,wediscusstheresultsofourexperimentsandexaminetheunderlyingaestheticsoftheresultingphotolayoutsinthecontextofdistinctgraphicdesignconcepts.

7879-11, Session 3

Automatic picture orientation detection based on classifier combinationC.Liu,Y.Sun,X.Ding,TsinghuaUniv.(China)

Automaticpictureorientationrecognitionisofgreatsignificanceinmanyapplicationssuchasconsumergallerymanagement,webpagebrowsing,content-basedsearchingorwebprinting.Wetrytosolvethishigh-levelclassificationproblembyrelativelylow-levelfeaturesincludingSpacialColorMoment(CM)andEdgeDirectionHistogram(EDH).Animproveddistance-basedclassificationschemeisadoptedasourclassifier.Weproposeaninput-vector-rotatingstrategy

Conference 7879: Imaging and Printing in a Web 2.0 World II


IS&T /

ReturntoContents

insteadofcollectingandtrainingsamplesforallfourclasses,whichiscomputationallymoreefficientthanseveralconventionalschemes.Thenweresearchontheclassifiercombinationalgorithmtomakefulluseofthecomplementaritybetweendifferentfeaturesandclassifiers.Ourclassifierscombinationmethodsincludetwolevels:feature-levelandmeasurement-level.Andwepresenttwoclassifiercombinationstructures(parallelandcascaded)atmeasurement-levelwitharejectionoption.Asthepreconditionofmeasurement-levelmethods,thetheoryofClassifier’sConfidenceAnalysis(CCA)isintroducedwiththedefinitionofconceptssuchasclassifier’sconfidenceandgeneralizedconfidence.Theclassificationsystemfinallyapproached90%recognitionaccuracyonawideunconstrainedconsumerpictureset.

7879-12, Session 4

Whiteboard sharing: capture, process, and print or emailM.J.Gormish,B.Erol,D.G.VanOlst,T.Li,A.Mariotti,RicohInnovations,Inc.(UnitedStates)

Whiteboardscontinuetobeusedtosupportmeetingsbyfacilitatingthesharingofideas,focusingattention,andsummarizing.However,attheendofthemeetingparticipantsareleftwithoutatangible,orevenelectronicsummary.Weconsiderthecaptureoftheinformationonawhiteboard,improvingtheimagequality,andsharingtheresults.Thispaperdescribestheinitialalgorithmforimprovingwhiteboardimagequalityandchangesmadetoreducecomputation.ThealgorithmhasbeenprovidedfreelyasawebwidgetandasanapplicationontheiPhoneandAndroidphones.Userfeedbackandanalyticsonusagehasledtofurtherchangesintheimageprocessingandtheuserinterface.

7879-13, Session 4

Building a print on demand web serviceP.D.Reddy,Hewlett-PackardLabs.(UnitedStates);B.Rozario,V.AnilDev,S.Dudekula,Hewlett-PackardLabs.India(India)

Morethan91millionbookshavebeenprintedinmultiplelanguagesusingavarietyofprintingtechnologiesandstyles.Therearecurrentlyabout32MbooksthatareoutofprintintheUnitedStates.Thereisconsiderableeffortunderwaytodigitizeallbooksthathaveeverbeenprinted.ThereisneedforaservicethatcantakerawbookscansandconvertthemintoPrintonDemand(POD)books.Suchaservicedefinitelyaugmentsthedigitizationeffortandenablesbroaderaccesstoawideraudience.Tomakethisservicepracticalwehaveidentifiedthreekeychallengesthatneededtobeaddressed.Theseare:a)producehighqualityimageimagesbyeliminatingartifactsthatexistduetotheageofthedocumentorthosethatareintroducedduringthescanningprocessb)developanefficientautomatedsystemtoprocessbookscanswithminimumhumanintervention;andc)buildanecosystemwhichallowsusthetargetaudiencetodiscoverthesebooks.

Thethrustofthispaperistodiscussourapproachandtheprogresswehavemadeinaddressingeachofthechallengesoutlinedabove.

7879-14, Session 4

An unsupervised fusion method for large scale cross-media meta-search engine with clickthrough dataY.Cao,Y.Tian,T.Huang,W.Gao,PekingUniv.(China)

SupportedbytheChina-USMillionBookProject(MBP),wehavedevelopedandopenedthefirstUniversalDigitalLibraryinNov.2007.Thedigitallibrary,whichoffersfoursearchingenginesconcerningdifferenttypeofqueries,providesgreatfacilityforusersviatheInternet.Nevertheless,itslargeamongandvariousformofdataraisenewchallengeforcurrentmeta-searchengineresultfusionmethodslikeBordaCountModelandCombModel,sincethesemethodsare

mainlydesignedfortext-basedretrievalsystemsonly.Thus,weproposeanunsupervisedfusionmethodbasedonuserfeedback(i.e.clickthrough)tosolvetheproblem.Theproposedmethodtakesintoaccountcross-mediaconditions.Itprovidesaglobalandlocalfusionweightbasedresultrerankingschemetofuseresultsfromdifferentmembersearchsystems.Aslidingwindowisalsoemployedtosolvethelackofuniformitywithinclickthroughandoverlappedresultsinordertoprovidemorereasonableresultsrank.ExperimentscarriedontheWikipediaMMdatabasedemonstratedthattheproposedmethodoutperformstraditionalfusionmethodsintermsofMeanAveragePrecision,B-Pre,R-Preandotherevaluationmeasurements.

7879-15, Session 4

iULib: where UDL and Wikipedia could meetY.Tian,T.Huang,W.Gao,PekingUniv.(China)

Empoweringthegroupcollaborationandknowledge-sharingcapabilitiesfortheUniversalDigitalLibrary(UDL)isdefinitelyanimportantworkaftermorethan1.5milliondigitalizedbookswereopentoaccessonline.OneofthekeymotivationsofthedevelopmentofsuchaplatformistheemergenceofWeb2.0inrecentyears,especiallywiththerapidlyincreasedpopularityofWikipedia,anincentiveapplicationofWeb2.0forusers’stronginvolvementandknowledgesharing.Thispaperpresentsourvision,whichwecalliUlib,aboutwhereandhowUDLandWikipediacouldmeet.Inthefirstphase,wedirectlyapplytheWikiarchitectureandsoftwareinUDLtoupgradethedigitallibraryasaninteractiveplatformthatfacilitatescommunityandcollaboration.Preliminaryimplementationshowsthefeasibilityandreliabilityofourdesign.Furthermore,asafreeencyclopediathatassemblescontributionsfromdifferentusers,WikipediamayalsobeusedasaknowledgebaseforUDL.Asaresult,UDLcanbeupgradedasanintelligentplatformforinformationretrievalandknowledgesharing.OurpracticeattheWikipediaMMtaskintheImgeCLEF2008showsthattheknowledgenetworkconstructedfromWikipediacanbeusedtoeffectivelyexpandthequerysemanticsofimageretrieval.ItisexpectedthatWikipediaanddigitallibrarycanintegrateeachother’svaluableresultsandbestpracticestobenefiteachother.

7879-04, Session 5

Book Widget: embedding automated photo-document publication on the web and in mobile devicesE.O’Brien-Strain,Hewlett-PackardLabs.(UnitedStates);A.A.Hunter,Hewlett-PackardLabs.(UnitedKingdom);J.Liu,Q.Lin,D.Tretter,Hewlett-PackardLabs.(UnitedStates);J.Wang,Hewlett-PackardLabs.China(China);X.Zhang,Hewlett-PackardLabs.(UnitedStates)

Wedescribeacloud-basedautomated-publishingplatformthatallowsthirdpartydeveloperstoembedoursoftwarecomponentsintotheirapplications,enablingtheiruserstorapidlycreatedocumentsforinteractiveviewing,orfulfillmentviamailorretailprinting.Wealsodescribehowapplicationsbuiltonthisplatformcanintegratewithavarietyofdifferentconsumerdigitalecosystems,andhowwewilladdressthequalityandscalingchallenges.

Theplatformwillprovidecontenttransformationalgorithmssuchasphototriage,clustering,andphoto-booklayout.Theplatformwillprovidetemporarytransactionalstorageofcontent(photosandtext),editabledocuments(photo-books),andgeneratedartifacts(PDFs).Itwillbeeasilyembeddableinanapplication,accessedviasecureRESTfulURLresources.Itwillprovideanelegantlypowerfuldocument-orienteddatamodelwithnoconceptofa“user”thatwouldcauseproblemsintegrating.Itwillhaveanembeddablewidget/iframeuserinterfaceforinteractivedocumentediting.

Onedifferentiationofthisplatformisthatitfacilitatesseparatingthreepropertiesofapplications:theuserinterfacecontext,thesourcecontent,andthegeneratedartifacts.Thisallowsforthereadybuildingofawidevarietyofapplications.



IS&T /

ReturntoContents

7879-17, Session 5

Semantic photo books: leveraging blogs and social media for photo book creationM.Rabbath,P.Sandhaus,OFFISe.V.(Germany);S.C.J.Boll,CarlvonOssietzkyUniv.Oldenburg(Germany)

InthisworkweintroduceanapproachforcreatingphotobooksfromWeb2.0resources.Weconcentrateontwokindsofonlinesharedmedia:(a)Blogsespeciallytravelblogs(b)SocialcommunitywebsiteslikeFacebook.Weintroduceanapproachtoselectmediaelementsincludingphotos,geographicalmapsandtexts,andthenusetheseelementstocreateaprintablephotobook.Becausetheselectedmediaelementscanbetoomany,wechoosethemostproper.Additionallyweaddexternalmediaelementssuchasgeographicalmaps,textsandexternallyhostedphotosfromlinkedresources.Importantmediaarechosenaccordingtoseveralcriteriaincludingthesocialimportanceofthepersonsinthephotostotheuserandthelevelofuser-mediainteractivity.Havingselectedtheimportantmedia,ourapproachintroducesageneticalgorithmtocreateanappealinglayoutusingaestheticrules,wheresomeofthephotosarechosenasbackgroundphotos,andothermediaarepositionedintheforeground.WeimplementedourapproachaswebservicesconnectedtoaFacebookapplication,andatooltochooseentriesfrompersonalblogs.Asaresult,theoutputofourimplementedapplicationisaphotobookinCeweMCFformat.

7879-18, Session 5

Automatic image selection scheme utilizing comments for insertion of images into weblogsT.Konno,E.Myodo,K.Takagi,R.Kawada,KDDIR&DLabs.,Inc.(Japan)

Thispaperproposesaschemewhichutilizescommentsgiventoimagesonanimagesharingsiteinordertofindanappropriateimageforinsertionintopoem-likeweblogs(blogs)asawaytorepresenttheiratmosphere(impression).Theresultshowsthatutilizingcommentsiseffective.Toachievethispurpose,therearetwoissues:howimpressionwordsareextractedfromblogsandhowimagesrepresentingtheimpressionwordsareobtained.Assumingthatitisimportanttoobtainimagesrepresentingtheimpressionwords,thispaperfocusesonthelatterissueonly.Wehypothesizethatcommentsandtagsextractedfromanimagesharingsitecanbeadequateforobtainingimagescorrespondingtoimpressionwordsatlowcost.Inparticular,utilizingcommentscanbemoreadequatefortheimagesearchwithimpressionwordsthanutilizingtagsbecausetheimpressionwordsareoftenusedincomments.Therefore,weproposeaschemewhichutilizescommentstofindappropriateimages.Inordertoinvestigatetheeffectivenessofutilizingcomments,conformancebetweenimpressionwordsandtheimageswasevaluated.Theratingforconformanceis3.5onascaleof1to5whenutilizingcomments,whichis0.6higherthanwhenutilizingtags.

7879-19, Session 5

Title identification of web article pages using html and visual featuresJ.Fan,Hewlett-PackardLabs.(UnitedStates);P.Luo,Hewlett-PackardLabs.China(China);P.Joshi,Hewlett-PackardLabs.(UnitedStates)

ExtractinginformativecontentfromWebarticlepageshasmanyapplicationssuchasprintingandcontentreuse.Titleisasignificantanduniquecomponentofanarticle.However,identifyingthetruetitleisanon-trivialproblemevenforhumanreaders.Inthispaper,wepresentatitleidentificationmethodthattakingintoaccountofthetitlefieldofthehtmlpageandhtmltagofaDOMnodeaswellasfontsize

andhorizontalalignment.Wetestourmethodonagroundtruthdatasetconsistingof2000pagesfrom100websitesandachieved97.2%precisionand96.9%recall.

7879-20, Session 5

Creating 3D realistic head: from two orthogonal photos to multiview face contentsY.Lin,TsinghuaUniv.(China);Q.Lin,F.Tang,Hewlett-PackardLabs.(UnitedStates);L.Tang,Hewlett-PackardLabs.China(China);S.H.Lim,Hewlett-PackardLabs.(UnitedStates);S.Wang,TsinghuaUniv.(China)

3DHeadmodelshavemanyapplications,suchasvirtualconference,3Dwebgame,Biometrics,andsoon.Thereareseveralweb-basedfacemodelingsolutionsthatcancreatea3Dfacemodelfromoneortwouseruploadedfaceimages.Withthecreated3Dmodel,realisticanimationscanbegenerated.Theexistingapproachesarelimitedtogeneratingthe3Dmodelofthefaceregion.Theaccuracyofsuchreconstructionisverylimitedforsideviews,aswellashairregions.Thegoalofourresearchistodevelopaframeworkforreconstructingtherealistic3Dhumanheadbasedontwoapproximateorthogonalimages.

Ourframeworktakesafrontalheadimageandaside-viewheadimage,andgoesthroughsegmentation,featurepointsdetection,featurepointsmatching,andtexturemappingtocreatea3Dheadmodel.Themaincontributionofthepaperisthattheprocessingstepsareappliestoboththefaceregionaswellasthehairregion.

Wewillshowexamplesofthereconstructioninthepaper.Wewillalsocomparethereconstructionwiththe3Dmodelofheadobtainedusingacommercial3Dscanner.Finally,wewilldiscusspotentialapplications.

7879-21, Session 6

Mobile multimedia understanding applications: an overviewX.Lin,Vobile,Inc.(UnitedStates)

Inrecentyears,mobiledevicesarequicklyreachingalmosteverycornerofourdailylifeinavarietyofforms:personalmediaplayers,smartphones,netbooks,andtablets.Besidesthemorepowerful,smaller,andmoreversatilehardware,anotherdrivingforceisthevastnumberofsoftwareapplications(“apps”)onthosemobiledevices.Anumberofmobileappsemployintelligentmultimediaunderstanding(MU)technologies.Thispapergivesanoverviewofsuchapps.ThefocusisnotontheunderlyingMUtechniques,whicharealreadycoveredinhugeamountofliterature.Instead,itattemptstoshedsomelightonthejunctionofmobileappsandMU.Forthispurpose,itaddressesanumberofimportantaspects:uniquerequirementsandcharacteristicsofMU-relatedapps,valuesbroughtinbyMU,typicalMUtechnologiesinvolved,comparisonofalternativesystemarchitectures,andavailabledevelopmenttools.

7879-22, Session 6

Learning object detectors from web image searchF.Tang,D.Tretter,Hewlett-PackardLabs.(UnitedStates)

Beingabletodetectdistinguishableobjectsisveryusefulinmanyhighlevelcomputervisionapplications.Traditionalmethodsforbuildingsuchadetectorrequirealargeamountofcarefullycollectedandcleaneddata.Forexampletobuildafacedetector,alargenumberoffaceimagesneedtobecollectedandfacesineachimageneedtobecroppedandalignedasthedatafortraining.Thisprocessistediousanderror-proning.Recentlymoreandmorepeoplearesharingtheirphotosontheinternet,ifwecouldleveragethesedataforbuildingadetector,itwillsavetremendousamountofefforttocollecttraining



IS&T /

ReturntoContents

data.PopularinternetsearchenginesandcommunityphotowebsiteslikeGoogleimagesearch,Picassa,Flickrmakeitpossibletoharvestingonlineimagesforimageunderstandingtasks.Inthispaper,wedevelopamethodleveragingimagesobtainedfromGoogleimagesearchtobuildanobjectdetector.Theproposedmethodcanautomaticallyidentifythemostdistinguishablefeaturesacrossthedownloadedimages.Usingtheselearnedfeatures,thealgorithmcandetecttheobjectinanewimage.Experimentsshowpromisingresultsofourapproach.

7879-23, Session 6

Image categorization for marketing purposesM.I.Almishari,H.Lee,N.Gnanasambandam,XeroxCorp.(UnitedStates)

Imagesmeantformarketingandpromotionalpurposes(e.g.coupons)representabasiccomponentinincentivizingcustomerstovisitshoppingoutletsandpurchasediscountedcommodities.Theyalsohelpdepartmentstoresinattractingmorecustomersandpotentially,speedinguptheircashflow.Whilecouponsareavailablefromvarioussources-print,web,etc.thereisstillagapintermsofancentralizedaggregatoroftheseimages.Aggregationoftheseimageshelpsdispensingthesecouponsinanon-demandfashionfromacentralizedrepository.Butaggregatingandgatheringofmeta-datarelatingtothepromotionalmaterialisnoteasilyachieved.Firstlythecreationofsuchmarketingaidsisadistributedandartisticendeavor.Couponscontainbothimagesandtext.Designsarearbitrary(structureishidden)andmostoftendon’tconformtoanyspecification.Onthecontrarytextadsinthewebdomainarewellstructuredandfollowstrictwordlimitations.Furtherrespondingtochangingconditionsinthemarketorinventoriesareharderwithcouponsthanwith,say,textadsaswithcouponstheartistsorcreatorsofthemarketingmaterialhavetobeinvolvedinredesign.

Inourwork,weaimforamechanismthatdecouplesthedesignprocessandpromotionactivity.

7879-24, Session 6

Text extraction from web imagesC.Liu,C.Yang,X.Ding,TsinghuaUniv.(China)

Imagesplayakey-roleinwebpagesandtheyareanimportantpartofwebdocumentanalysisandunderstanding.Statisticsshowthatmanywebimagescontainingtextcarryimportantinformationaboutboththelayoutandthecontentofthepagedocument.Detectingandrecognizingtextembeddedinwebimagesbecomespotentiallyessentialforapplicationssuchaseffectiveindexingandsearching.Sincewebimageshavespecialcharacteristicsthatdistinguishthemfromconventionalcomplexbackgroundimages,mosttextsegmentationandrecognitionalgorithmswithgoodperformanceinotherfieldsfailtorecognizewebimagestext.Inthispaperwepresentasurveyofthemethodsandprinciplesthathavebeenproposedtohandlesegmentationandrecognitionoftextinwebimages.Andthepurposeofthispaperistoclassifyandreviewthesealgorithms,discussbenchmarkdataandperformanceevaluation,andtopointoutpromisingdirectionsforfutureresearch.

7879-25, Session 6

Web image annotation using two-step filtering on social tagsS.Cho,J.Cha,H.Byun,YonseiUniv.(Korea,Republicof)

Webimageannotationhasbecomeanimportantissuewithexplodingwebimagesandthenecessityofeffectiveimagesearch.Thesocialtagshaverecentlyutilizedatimageannotationbecausetheycanreflecttheuser’staggingtendency,andreducethesemanticgap.However,aneffectivefilteringprocedureisrequiredtoextracttherelevanttagssincetheuser’ssubjectivityandnoiseoftags.Inthispaper,weproposeatwo-stepfilteringonsocialtagsforimageannotation.Thismethodconductsthefilteringandverificationtasksbyanalyzingthedistributionofvisualfeaturesandtherelationbetweentagsonvisualneighborimages.Ourmethodconsistsofthefollowingthreesteps:1)thetagcandidatesetisfoundedbysearchingthevisualneighborimages,2)fromagiventagcandidateset,coarsefilteringisconductedbytaggroupingandvotingtechnique,3)thedensefilteringisconductedbyusingsimilarityverificationforcoarsefilteredcandidatetagset.Toevaluatetheperformanceofourapproach,weconducttheexperimentsonasocial-taggedimagedatabaseobtainedfromFlickr.Wecomparetheaccuracybetweenthevotingtechniqueandourproposedtechnique.Ourexperimentalresultsshowthatourmethodhasasignificantimprovementinimageannotation.



IS&T /

ReturntoContents

Conference 7880: Media Watermarking, Security, and Forensics XIIIMonday-Wednesday24-26January2011PartofProceedingsofSPIEVol.7880MediaWatermarking,Security,andForensicsIII

7880-01, Session 1

Signal rich art: enabling the vision of ubiquitous computingB.Davis,DigimarcCorp.(UnitedStates)

AdvancesinnetworkingandmobilecomputingareconvergingwithdigitalwatermarkingtechnologytorealizethevisionofUbiquitousComputing,whereinmobiledevicescansense,understand,andinteractwiththeirenvironments.Watermarkingistheprimarytechnologyforembeddingsignalsinthemedia,objects,andartconstitutingoureverydaysurroundings,andsoitisakeycomponentinachievingSignalRichArt:artthatcommunicatesitsidentitytocontext-awaredevices.However,significantobstaclestointegratingwatermarkingandartremain,specificallyquestionsofincorporatingwatermarkingintotheprocessofcreatingart.Thispaperidentifiesnumerouspossibilitiesforresearchinthisarena.

7880-02, Session 2

Comparison of three solutions to correct erroneous blocks to extract an image of a multiplicative homomorphic cryptosystemN.Islam,W.Puech,R.Brouzet,LIRMM(France)

Multiplicativehomomorphicpropertiesofacryptosystemcanbeusedinvariousapplicationsrequiringsecurity,protectionandauthenticatione.g.digitalngerprinting,electronicvoting,onlinebettingetc.SecretsharingbetweentwoormorepartiesexploitingmultiplicativehomomorphicpropertyofRSAresultsintoerroneousblockswhileextractingthemessage.ThegenerationoftheseerroneousblockslimitsthecapabilitiesofhomomorphicpropertiesofRSAtobeusedinitsfullextend.Thispaperprovidesthreedierentapproachesassolutionstotheproblemoferroneousblocksinimage.Thesesolutionsare,meanvalueapproach,shortestdistanceapproachandimagepreprocessingapproach.IthasbeenobservedthatshortestdistanceapproachresultsintogoodPSNRbutitiscomputationallyexpensive.ThebestapproachwithhighPSNRisimagepreprocessingapproachbeforesharingprocess,whichresultsintonoerroneousblocksintheextractedimage,thusnoextraextractiontechniquesarerequired.

7880-08, Session 4

Using feature point-based extraction for STDM 3D-mesh watermarking that withstands the cropping attackM.MontañolaSales,I.R.M.Darazi,J.Giard,Univ.CatholiquedeLouvain(Belgium);P.RondaoAlface,Alcatel-LucentBellLabs.(Belgium);B.M.Macq,Univ.CatholiquedeLouvain(Belgium)

State-ofthe-artblindandrobust3Dwatermarkingschemesalreadywithstandcombinationsofawidevarietyofattacks(e.g.Noiseaddition,simplification,smoothing,etc)exceptcropping.

SpreadSpectrumDitheringModulation(STDM)methodisanextensionofQuantizationIndexModulation(QIM).Besidesthesimplicityandthetrade-offbetweenhighcapacityandrobustnessprovidedbyQIMmethods,itisalsoresistantagainstre-quantization.

Thispaperfocusesontwostate-of-the-arttechniqueswhichofferdifferentandcomplementaryadvantages,respectivelyQIM-based3Dwatermarkingandfeaturepoint-basedwatermarkingsynchronization.Theideaistocombinebothinsuchawaythatthenewschemewouldbenefitfromtheadvantagesofbothtechniquesandcompensatefortheirrespectivefragilities.

Weshowthatrobustnessagainstcroppingandothercommonattacksisachievedprovidedthatatleastonefeaturepointaswellasitscorrespondinglocalneighborhoodisretrieved.

7880-12, Session 5

A curiosity regarding steganographic capacity of pathologically nonstationary sourcesA.D.Ker,Univ.ofOxford(UnitedKingdom)

Noabstractavailable

7880-13, Session 5

Design of adaptive steganographic schemes for digital images in spatial domainT.Filler,J.Fridrich,BinghamtonUniv.(UnitedStates)

Moststeganographicschemesforrealdigitalmediaembedmessagesbyminimizingasuitablydefineddistortionfunction.Inpractice,thisisoftenrealizedbysyndromecodeswhichoffernear-optimalrate--distortonperformance.However,thedistortionfunctionsaredesignedheuristicallyandtheresultingsteganographicalgorithmsarethussuboptimal.Inthispaper,wepresentapracticalframeworkforoptimizingtheparametersofadditivedistortionfunctionstominimizethestatisticaldetectability.Wefirstintroducearichparametricmodelwhichassignsacostofmakingachangeateverypixelbasedonitsneighborhood.Then,wepresentapracticalmethodforoptimizingtheparameterswithrespecttoachosendetectionmetricandfeaturespaceusedinblindsteganalysis.Acomputationallyappealingchoiceforadetectionmeasureistheso-calledMaximumMeanDiscrepancy(MMD)alsousedinsteganalysis.Unfortunately,theparametersoptimizedw.r.t.MMDdonotleadtomoresecurestegosystems.WeexplainthisbehaviorbyrecallingthedirectconnectionbetweenMMDandBayesriskofParzenwindowclassifiers.Atighterconnectiontopracticeisobtainedusingamoretheoreticallyfoundeddetectionmetric--thesizeofthemarginbetweensupportvectorsinsoft-marginSVMs.Weshowthatmodelparameterswithsmallermarginleadtomoresecurestegosystems.OptimalparametersobtainedbytheNelder-Meadsimplexalgorithmarepresentedandembeddingmethodsaretestedbyblindsteganalyzersutilizingvariousfeaturesets.Experimentalresultsshowthatasfewas50imagesaresufficientforobtainingoptimalparametersofthecostmodelallowingustospeeduptheparametersearch.

7880-14, Session 5

Feature restoration and distortion metricsV.Chonev,A.D.Ker,Univ.ofOxford(UnitedKingdom)


7880-15, Session 6

Image and video manipulation: Past, present, and futureNoabstractavailable


IS&T /

ReturntoContents

7880-16, Session 7

Lossless image data embedding in plain areasM.Fallahpour,D.Megias,Univ.ObertadeCatalunya(Spain);Y.Q.Shi,NewJerseyInstituteofTechnology(UnitedStates)

Thisletterpresentsalosslessdatahidingschemefordigitalimageswhichusesanedgedetectortolocateplainareasforembedding.Theproposedmethodtakesadvantageofthewell-knowngradientadjacentpredictionutilizedinimagecoding.Inthesuggestedscheme,predictionerrorsandedgevaluesarefirstcomputedandthen,excludingtheedgepixels,predictionerrorvaluesareslightlymodifiedthroughshiftingthepredictionerrorstoembeddata.Theaimofproposedschemeistodecreasetheamountofmodifiedpixelstoimprovetransparencybykeepingedgepixelvaluesoftheimage.TheexperimentalresultshavedemonstratedthattheproposedmethodiscapableofhidingmoresecretdatathantheknowntechniquesatthesamePSNR,thusprovingthatusingedgedetectortolocateplainareasforlosslessdataembeddingcanenhancetheperformanceintermsofdataembeddingrateversusthePSNRofmarkedimageswithrespecttooriginalimage.

7880-17, Session 7

Re-synchronizing audio watermarking after nonlinear time stretchingM.Steinebach,S.Zmudzinski,Fraunhofer-InstitutfürSichereInformations-Technologie(Germany)


7880-19, Session 8

On locating steganographic payload using residualsT.Quach,SandiaNationalLabs.(UnitedStates)

LocatingsteganographicpayloadusingWeightedStego-image(WS)residualshasbeenprovensuccessfulprovidedalargenumberofstegoimagesareavailable.Inthispaper,werevisitthistopicwithtwogoals.First,weargueitisapromisingapproachtolocatepayloadbyshowingintheidealscenariowherethecoverimagesareavailable,theexpectednumberofstegoimagesneededtoperfectlylocateallload-carryingpixelsisthelogarithmofthepayloadsize.Second,wegeneralizecoverestimationtoamaximumlikelihooddecodingproblemanddemonstratethatasecondorderstatisticalcovermodelcanbeusedtocomputeresidualstolocatepayload.

7880-20, Session 8

Steganalysis using logistic regressionI.Lubenko,A.D.Ker,Univ.ofOxford(UnitedKingdom)


7880-21, Session 8

Steganalysis in high dimensions: fusing classifiers built on random subspacesJ.Kodovsky,J.Fridrich,BinghamtonUniv.(UnitedStates)

Modernsteganographicmethodsachievesteganographicsecuritybyminimizinganappropriatelydefineddistortionfunctioninafeaturespaceofaveryhighdimension.Ontheotherhand,steganalysis,asimplementedtodayusingclassifiers,doesnotscaleaseasily--

workingwithveryhighdimensionalfeaturesleadstoproblemswiththelackoftrainingdata,infeasiblecomplexityoftraining,degradationofgeneralizationabilities,lackofrobustnesstocoversource,andsaturationofperformancebelowitspotential.Toaddresstheseproblemscollectivelyknownasthecurseofdimensionality,weproposeanewcleanapproachinwhichwestrivetominimizetheroleofhumandesignandputemphasisonautomatizationandgeneralityoftheentireprocess.Basedonthecharacterofthemediabeinganalyzed,thesteganalystfirstputstogetherahigh-dimensionalsetof“prefeatures”selectedtocapturedependenciesamongindividualcoverelements.Then,afamilyofweakclassifiersisbuiltonrandomsubspacesoftheprefeaturespace.Thefinalclassifierisconstructedbyfusingthedecisionsofindividualclassifiers.Theadvantageofthisapproachisitsuniversality,lowcomplexity,simplicity,andimprovedperformancewhencomparedtoclassifierstrainedontheentireprefeatureset.ExperimentswiththesteganographicalgorithmnsF5demonstratetheusefulnessofthisapproachincomparisonwithfeaturesetsbuiltusingheuristic“byhand.”

7880-22, Session 9

Private content identification based on soft fingerprintingS.V.Voloshynovskiy,T.S.Holotyak,O.J.Koval,F.P.Beekhof,Univ.ofGeneva(Switzerland)

Contentidentificationsystemsarewidelyusedinvariousemergingapplicationsrangingfromidentificationofphysicalobjectsandhumanstomultimediamanagement(contentfiltering,contenttagging)andsecurity(copyrightprotection,broadcastmonitoring,etc.).Mostidentificationtechniquesarebasedonbinarydigitalfingerprinting.Adigitalfingerprintrepresentsashort,robustanddistinctivecontentdescriptionallowingfastandprivacy-preservingoperations.Inthiscase,alloperationsareperformedonthefingerprintinsteadofontheoriginallargeandprivacy-sensitivedata.

Digitalcontentidentificationbasedondigitalfingerprintingbecomeadefactostandardinbiometricsapplications,physicalobjectidentificationbasedonphysicalunclonablefunctions(PUFs)aswellasvariousmultimediaandsecurityandmanagementapplications.Duringlastyears,certainimportantpracticalandtheoreticalachievementswerereported.Themaineffortsonthesideofpracticalalgorithmshavebeenconcentratedonrobustfeatureselectionandfastindexingtechniquesmostlyborrowedfromcontent-basedretrievalapplications[1,2].Theinformation-theoreticlimitsofcontentidentificationunderinfinitelengthandergodicassumptionshavebeeninvestigatedbyWillemset.al.[3]usingthejointlytypicaldecoder.Thedetection-theoreticlimitshavebeenfirststudiedin[4]undergeometricaldesynchronizationdistortionsandafurtherextensionofthisframeworkwasproposedin[5]forthecaseoffinite-lengthfingerprintingandnullhypothesis.TheuseddecisionruleisbasedonminimumHammingdistancedecoderwithafidelityconstraintunderbinarysymmetricchannelmodel.Sincethisdecisionrulerequiresthecomputationoflikelihoods/distancesbetweenthequeryandalldatabaseentries,thecomplexityoftheconsideredidentificationisexponentialwiththeinputlength.Duetotheadditionalfactthatidentificationservicesareoftenoutsourcedtothirdpartiesandstateauthorities,theprivacyofdataownersisanimportantissueandremainslargelyunexplored.

PrivacyissueshavebeenmainlystudiedintheauthenticationapplicationsduetothepublicsharingofhelperdataandextendedtovariouspracticalimplementationsbasedonSlepian-WolfandWyner-Zivdistributedcoding.Inourpreviouswork[6]wehaveconsideredtherate-privacy-complexitytrade-offforidentificationapplicationsbasedonaframeworkpresentedin[7].Thisapproachisbasedonaglobalprivacyamplificationwhereallbitsofstoredfingerprintarerandomizedwiththesameprobabilitydisregardingtheirreliabilities.Themainbenefitfromthepresentedframeworkofbitreliability(a.k.a.softfingerprinting)wasdemonstratedforthereductionofidentificationcomplexitybasedonboundeddistancedecoder.Obviouslysuchaconstructiondoesnotfullybenefitfromthefactthattheinformationaboutthereliablebitscanbepresentattheencoderanddecoderthatcanbeusednotonlyfortheefficientdecodingbutalsofortheenhancedprivacyamplification.

Conference 7880: Media Watermarking, Security, and Forensics XIII


IS&T /

ReturntoContents

Therefore,inthispaperweintroduceaninformation-theoreticframeworkfortheanalysisofprivatecontentidentificationbasedonfinitelengthfingerprintingwithbitreliabilitysideinformation.Contrarytopreviousworks,weproposeaprivacyamplificationmechanism,whichisadaptivetothebitreliabilityanddemonstrateitsadvantagesoverthestate-of-the-artprivacyamplification.Wepresentandanalyzeaprivacy-preservingtechniquewhichasymptoticallyachievesthetheoreticalperformancelimitsintermsofidentificationrate.TheproposedtechniqueisbasedonForney’stypeoferasure/listdecoding[8]implementedintheformofboundeddistancedecoder.Theanalysisisperformedforthecaseofperfectmatchbetweenthesideinformationsharedbetweentheencoderanddecoderaswellasforthecaseofimperfectsideinformation.Weanalyzetheoptimaltrade-offbetweentheachievablerateandprivacyasthesolutiontoaconstraintoptimizationproblem.Finally,wewillshowthatcontentidentificationiscloselyrelatedtotheproblemoferasureandlistdecoding[8]andfurtherinvestigationofthisconnectionmightrevealmanyinterestinginsightstotheanalysisanddesignoffutureidentificationsystemsbasedonsoftfingerprinting.

[1]J.Haitsma,T.Kalker,andJ.Oostveen,“Robustaudiohashingforcontentidentification,”inInternationalWorkshoponContent-BasedMultimediaIndexing,Brescia,Italy,September2001,pp.117-125.

[2]F.LefebvreandB.Macq,“Rash:RAdonSoftHashalgorithm,”inProceedingsofEUSIPCO-EuropeanSignalProcessingConference,Toulouse,France,2002.

[3]F.Willems,T.Kalker,J.Goseling,andJ.-P.Linnartz,“Onthecapacityofabiometricalidentificationsystem,”inProc.2003IEEEInt.Symp.Inform.Theory,Yokohama,Japan,June29-July42003,p.82.

[4]S.Voloshynovskiy,O.Koval,F.Beekhof,andT.Pun,“Robustperceptualhashingasclassificationproblem:decision-theoreticandpracticalconsiderations,”inProceedingsoftheIEEE2007InternationalWorkshoponMultimediaSignalProcessing,Chania,Crete,Greece,October1-32007.

[5]A.L.Varna,A.Swaminathan,andM.Wu,“Adecisiontheoreticframeworkforanalyzinghash-basedcontentidentificationsystems,”inACMDigitalRightsManagementWorkshop,Oct.2008,pp.67-76.

[6]S.Voloshynovskiy,O.Koval,F.Beekhof,F.Farhadzadeh,T.Holotyak,Information-TheoreticalAnalysisofPrivateContentIdentification,IEEEInformationTheoryWorkshop,ITW2010,Dublin,Ireland,August30-Spetember3,2010.

[7]S.Voloshynovskiy,F.Beekhof,O.Koval,andT.Holotyak,“Onprivacypreservingsearchinlargescaledistributedsystems:asignalprocessingviewonsearchableencryption,”inProceedingsoftheInternationalWorkshoponSignalProcessingintheEncryptEdDomain,Lausanne,Switzerland,2009.

[8]G.D.Forney,“Exponentialerrorboundsforerasure,list,anddecisionfeedbackschemes,”IEEETrans.Inf.Theory,vol.14,pp.206-220,March1968.

7880-23, Session 9

Geometrically robust perceptual fingerprinting: an asymmetric caseO.J.Koval,S.V.Voloshynovskiy,Univ.ofGeneva(Switzerland)

Inthispaperweconsidertheproblemofgeometricallyresilientmultimediaobjectidentification.Weanalyzeperformancelimitsattainableinthisapplicationunderacertainparametricclassofgeometricaldistortions.Intheanalysisweassumethatthequeryanddatabaseentriesaredistortedanddesynchronizedversionsofthesameideal/originalmultimediadata.Wepresentconditionsonthegeometricaldesynchronizationparametersetcardinalitytoensurereliablecommunications.

7880-24, Session 9

Trade-offing privacy-complexity of identification problemT.S.Holotyak,S.V.Voloshynovskiy,O.J.Koval,F.P.Beekhof,Univ.ofGeneva(Switzerland)

Inthispaperweadvocatetheextensionoftechniquesforthefastidentificationofmultimediacontent.Solvingperformance-complexity-privacypreservingoptimizationproblemweproposeanapproach,whereperformance-complexitytrade-offisanalyzedforthepredefinedlevelsoftheprivateinformationdisclosure.Theproposedidentificationmethodisbasedonasoftfingerprintingandconsistsoftwostages:atthefirststage,thelistofpossiblecandidatesisestimatedbasedonthemostreliablebitsofsoftfingerprint,and,atthesecondstage,thetraditionalmaximumlikelihooddecodingisappliedtotheobtainedlisttofindasinglethebestmatch.Thecomplexity-performancetrade-offisinvestigatedbyconsideringdifferentdistortionsintroducedduringimageacquisition.Thesoftfingerprintiscomputedbasedonrandomprojectionswithsign-magnitudedecompositionofprojectedcoefficients.Theestimateofbitreliabilityisdeduceddirectlyfromtheobservedcoefficients.Weinvestigatedifferentdecodingstrategiestoestimatethelistofcandidates,whichminimizetheprobabilityofmissingtherightindexonthelist.Theattainedcomplexity-performancetrade-offdemonstratessuperiorityoftheproposedtechniqueovercertainstate-of-the-artmethodsincludinglocalsensitivityhashing.

7880-25, Session 10

A context model for microphone forensics and its application in evaluationsC.Krätzer,K.Qian,M.Schott,J.Dittmann,Otto-von-Guericke-Univ.Magdeburg(Germany)

Inthispaperwefirstdesignasuitablecontextmodelformicrophonerecordings,formalizinganddescribingtheinvolvedsignalprocessingpipelineandthecorrespondinginfluencefactors.Asasecondcontributionweapplythecontextmodeltodeviseempiricalinvestigationsabout:a)theidentificationofsuitableclassificationalgorithmsforstatisticalpatternrecognitionbasedmicrophoneforensics,evaluating74supervisedclassificationtechniquesand8clusterers;b)thedeterminationofsuitablefeaturesforthepatternrecognition(withverygoodresultsforsecondorderderivativeMFCCbasedfeatures),showingthatareductiontothe20bestfeatureshasnonegativeinfluencetotheclassificationaccuracy,butincreasestheprocessingspeedbyfactor30;c)thedeterminationoftheinfluenceofchangesinthemicrophoneorientationandmountingontheclassificationperformance,showingthatthefirsthasnodetectableinfluence,whilethelattershowsastrongimpactundercertaincircumstances;d)theperformanceachievedinusingthestatisticalpatternrecognitionbasedmicrophoneforensicsapproachforthedetectionofaudiosignalcompositions.

7880-26, Session 10

Double H 264/AVC compression detection using quantized nonzero AC coefficientsD.Liao,R.Yang,H.Liu,J.Li,J.Huang,SunYat-SenUniv.(China)

Developmentsofvideoprocessingtechnologymakeitmucheasiertotamperwithvideo.Insomesituation,suchasinalawsuit,itisnecessarytoprovevideosarenottampered.Thiscontradictionposeschallengestoascertainintegrityofdigitalvideos.Mostoftamperingsoccurinpixeldomain.However,nowadaysvideosareusuallystoredincompressedformat,suchasH.264/AVC.Forattackersitisnecessarytodecompressoriginalvideobitstreamsandrecompressitintocompresseddomain.Asaresult,bydetectingdoublecompression,



IS&T /

ReturntoContents

wecanauthenticateintegrityofdigitalvideo.Inthispaper,weproposeanefficientmethodtodetectwhetherornotadigitalvideohasbeendoublecompressed.Specifically,weuseprobabilitydistributionofquantizednonzeroACcoefficientsasfeaturestodistinguishdoublecompressedvideofromthoseoriginalonecompressedvideo.IfasmallerQPisusedinthesecondcompression,theoriginaldistributionlawwillbeviolated,whichcanbeusedastheevidenceoftampering.

7880-27, Session 10

Forensic printer detection using intrinsic signaturesA.K.Mikkilineni,N.Khanna,E.J.DelpIII,PurdueUniv.(UnitedStates)

Theabilitytointrinsicallycharacterizeaprinterleadstoquestionsaboutprivacyandanonymity.Therearemanyinstanceswhereexistanceoftheintrinsicsignatureintheprinteddocumentisundesireable.Thisisuseful,forexample,inprotectingtheanonymityofpeopledistributingprinteddocumentsduringpeacefulprotest.Ontheotherhand,somegroupsmaywanttohidetheintrinsicsignatureforillegalpurposes,suchasdistributionofcounterfeitcurrency.Wehaveshownresultsthatindicatetheadditionofmaskingsignalsand/ornoisetothedocumentbeforeprintingdoesnotpreventestimationoftheintrinsicsignature.Theintrinsicsignaturecouldstillbeobtainedsimplybyextendingthefeaturesetinordertomaintainthesameperformanceoverthoseattacks.

Inthisworkwefollowuponthoseresultsanddesignadistancebasedmetricforuseinprinteridentificationbasedontheintrinsicsignature.Thiswillprovideasolutiontotheprinterdetectionproblemforprinteddocuments.

Aswefoundinourearlierwork,usingbothtextureandbandingfeaturestocharacterizeprintedregionsofthepagecapturestheintrinsicsignatureirrespectiveofanymodificationsperformedonthedocumentbeforeprinting.

Wewillpresentnewresultsshowingthatthissetoffeaturescanbeusedforforensicdetectionofprinters.

7880-28, Session 11

Non-destructive forensic latent fingerprint acquisition with chromatic white light sensorsM.Leich,S.Kiltz,J.Dittmann,C.Vielhauer,Otto-von-Guericke-Univ.Magdeburg(Germany)

Latentfingerprintsareofvitalimportanceformoderncrimesceneinvestigation.Themostfrequentlyusedmethodstosecurethesefingerprints(i.e.dustingwithpowder)destroytheoriginalevidenceirreversiblyandthusmakeitunavailableforadditionalverificationorfurtheranalysisliketestsforsubstanceabuseandageestimation.

Inthispaperaseriesoftestsisperformedtoinvestigatetheoverallsuitabilityofahighresolutionoff-the-shelfchromaticwhitelightsensorforthecontact-lessandnon-destructiveacquisition.Inparticular3Dheightfieldandreflectiondataof10differentlatentfingerprintsonfourdifferenttypesofsurfaces(harddiskplatter,paintedcarbody,brushedaluminum,veneeredplywood)areexperimentallystudied.Standardalgorithmsaswellascustomizedmethodsforthevisualenhancementoftheacquiredfingerprintsareassessed.

Whilethequalityoftheacquireddataishighlydependentonsurfacestructure,thequalityofthefingerprintandtheprocessing,preliminaryresultsforscansonidealsurfacesareverydetailedandenableeventheuseoflevelthreefingerprintfeatures(pores)formatching.Underthesecircumstanceserrorratesarecurrentlyexpectedtobebelow0.005(FalseAcceptanceRate)and0.05(FalseRejectionRate).

7880-29, Session 11

Detecting messages of unknown lengthT.Pevny,CzechTechnicalUniv.inPrague(CzechRepublic)


7880-30, Session 11

A new paradigm for steganalysis via clusteringA.D.Ker,Univ.ofOxford(UnitedKingdom);T.Pevny,CzechTechnicalUniv.inPrague(CzechRepublic)


7880-31, Session 12

Collusion-secure patchwork embedding for transaction watermarkingW.Berchtold,S.Zmudzinski,M.Schäfer,M.Steinebach,Fraunhofer-InstitutfürSichereInformations-Technologie(Germany)


7880-32, Session 12

Probabilistic fingerprinting codes used to detect traitor zero-bit watermarkM.Desoubeaux,G.LeGuelvouit,OrangeLabs.(France);W.Puech,LIRMM(France)

Traitortracingaimsatpreventingunauthorizedredistributionofmultimediacontentbyembeddingeachauthorizedcopywithindividualsequencesofbitshavingrobustnessagainstcollusionattacks.Collusionistheprocessusedbydishonestuserstoforgeanuntraceablecontentwiththeircopies.Inthispaperwepresentanewmethoddedicatedtovideocontentdistribution.Itisbasedonaprobabilistictraitortracingcodeandanorthogonalzero-bitinformedwatermark.WeusethewellknownTardosfingerprintingtracingfunctiontoreducethesearchspaceofsuspicioususers.Theirguiltinessisthenconfirmedbydetectingthepresenceoftheirpersonalwatermarkembeddedwithapersonalkey.Topreventwatermarkingkeysstorageforthedistributorweuseapartoftheuserfingerprintingsequenceasapersonalembeddingkey.Thismethodpermitstoreducethecodelengthinfunctionofthefalsealarmprobabilityofthewatermarkingmethod.Thereforeitensuresaglobalsmallestfalsealarmprobabilitycomparedtooriginalprobabilisticcodes.Howeverefficiencyofthisprocessdependsstronglyonthenumberofcolludersatthewatermarkingside.Indeedweassumethatcolludersattempttoerasetheirpersonalwatermarkwithaverageattacks.Toincreaserobustnessagainstsuchattacksweproposeanadditivecorrelationmethodbasedonsuccessivewatermarkedimages.Wepresenttherobustnessofsuchamethodfordifferentsizesofcollusion.WefinallystudythefalsealarmprobabilityofthisadditivecorrelationmethodandpresenttracingresultsforshortlengthofTardoscode.



IS&T /

ReturntoContents

7880-33, Session 12

Rihamark: perceptual image hash benchmarkingM.Steinebach,Fraunhofer-InstitutfürSichereInformations-Technologie(Germany);H.Eckehard,C.Zauner,FHOÖStudienbetriebsGmbH(Austria)


7880-34, Session 13

A spatio-temporal framework based on eigenvectors for improved face recognitionM.Ouaret,J.E.Dugelay,EURECOM(France)

Mostofstate-of-the-artaccuratefacerecognitionsystemsrequireheavyprocessingandfacenormalization.Inthispaper,weintroducearealtimehybridfacerecognitionsystem,combiningspatialandtemporalvideoinformationwhilemaintainingafairprocessingcomplexity.Theintroducedhybridsolutioncombinesspatial(eigenfaces)withtemporal(tomofaces)eigenvectorsinatwolayersfusionschemeforimprovedfacerecognition.Initially,eigenfacesandtomofacesaresimultaneouslyappliedtoanon-normalizedinputvideosequence.Then,severalfusionmethods(firstfusionlayer)areappliedtotheresultingscoresfromboth,eigenfacesandtomofaces.Theproposedsystemgeneratesthefinalresultbymajorityvotingofallthefusionmodulesfromthefirstlayerofbothbiometrictraits.Theproposedtechniqueshowsimprovementsofaround13%and8%incorrectidentificationrateoverstandalonetomofacesandeigenfaces,respectively.Thus,theproposedschemeachievesagoodperformanceunderrealisticconditionswithlowcomplexity(PCAandCannyedgedetector)andwithoutheavypre-processing(imagenormalization).



IS&T /

ReturntoContents

Conference 7881A: Multimedia on Mobile Devices 2011Tuesday-Wednesday25-26January2011PartofProceedingsofSPIEVol.7881AMultimediaonMobileDevices2011

7881A-01, Session 1

Towards a multimedia remote viewer for mobile thin clientsM.P.Mitrea,B.Joveski,L.Gardenghi,TELECOM&ManagementSudParis(France);P.Simoens,IBBT(Belgium);J.Marshall,Prologue(France);B.Vankeirsbilck,IBBT(Belgium);F.J.Prêteux,TELECOM&ManagementSudParis(France);B.Dhoedt,IBBT(Belgium)

Bethereamobileuserwantingtoconnecttoamultimediaserver.Inordertoallowhim/hertoenjoythesameuserexperience(play,interact,edit,storeandsharecapabilities)asinafixedLANenvironment,severaldead-locksaretobedealtwith:(1)aheavyandheterogeneouscontentshouldbesentthroughabandwidthconstraintnetwork;(2)thedisplayedcontentshouldbeofgoodquality;(3)userinteractionshouldbeprocessedinreal-timeand(4)thecomplexityofthepracticalsolutionshouldnotexceedthefeaturesofthemobileclientintermsofCPU,memoryandbattery.

ThepresentpaperdemonstratesthattheMPEG-4scenetechnologies(BiFSandLASeR)canprovideforalltheneedsofaremotemobilethinviewer.

First,aBiFS/LASeR-basedarchitectureforremoteviewerisadvanced.Inordertoensurebackwardcompatibilitywiththelegacyapplications,thisarchitecturetakesasinputthetraditionalX11graphicalcontentanditconvertsitintoMPEG-4BiFS/LASeR.Onceconverted,thecontentisstreamedlivetoathinclientdevice,wheretheusercanwatchandinteractwithit.

Thesecondpartofthepaperisdevotedtoanobjectiveassessmentofthisarchitecture.Byconsideringthreetypesofcontent(simplegraphics,atexteditorandwwwbrowsing)thetwoMPEGtechnologies(BiFSandLASeR)arecomparedtotheircompetitors.TheoverallresultsdemonstratethatMPEGtechnologiesaremoreefficientformobilethinclientsthenthedirectextensionofthetraditionalsolutions.

Thefinalpartofthepaperpresentstheperspectivesopenedbythisproofofconcept.

7881A-02, Session 1

Multimodal sensing-based camera applicationsM.BordalloLopez,J.Hannuksela,O.J.Silvén,Univ.ofOulu(Finland);M.Vehviläinen,NokiaResearchCtr.(Finland)

Theincreasedsensingandcomputingcapabilitiesofmobiledevicescanprovideenhancedmobileuserexperience.Integratingthedatafromdifferentsensorsoffersawaytoimproveapplicationperformanceincamerabasedapplications.

Akeyadvantageofusingcamerasasaninputmodalityisthatitenablesrecognizingthecontext.Therefore,computervisionhasbeentraditionallyutilizedinuserinterfacestolookatpeopleandautomaticallydetectingtheuseractions.Theimagingapplicationscanalsomakeuseofvarioussensorsforimprovinguserinteractionandrobustnessofthesystem.

Inthiscontext,twoapplicationsfusingthesensordatawiththeresultsobtainedfromvideoanalysishavebeenimplementedonaNokiaNseriesmobiledevice.

Thefirstapplicationisareal-timepanoramabuilderthatusesthemobiledevice’saccelerometerstoimprovetheoverallquality,providingalsoinstructionsduringthecapture.Thesecondsolutionisareal-timeuserinterfacethatcanbeusedforbrowsinglargeimages.Thesolutionenablesthedisplaytobecontrolledbythemotionoftheuser’shandusingthebuilt-insensorsascomplementaryinformation.

Theexperimentsshowthatfusingthesensordatagreatlyimprovescamerabasedapplicationsespeciallywhentheconditionsarenotoptimalforapproachesusingonlycameras.

7881A-03, Session 1

Mobile text messaging solutions for obesity preventionD.Akopian,V.Jayaram,L.Aaleswara,M.Esfahanian,TheUniv.ofTexasatSanAntonio(UnitedStates);C.Mojica,D.Parra-Medina,TheUniv.ofTexasHealthScienceCtr.atSanAntonio(UnitedStates);S.Kaghyan,YerevanStateUniv.(Armenia)

Thispaperprovidesanoverviewofthestate-of-the-artinmobilephonetechnologieswhichcanbeusedforhealthpromotioninterventionsandrelateddatacollection.Italsodescribesaproposedsystemarchitecturecustomizedforanobesitypreventionprogram.

Recentlyseveralhealthcareprojectshavebeenreportedthatintegratecellphonesintothedatacommunicationchain.Theyusecell-phonetechnologiestovariousextents.Whileitistemptingtoincorporatephonesasahealthcareinstrumentbroadly,thecellphonemarketisverydiversewithmanytechnologiesavailableforapplicationdevelopment.Thispapersummarizesmarketdataforgeneralandsmartphonesales,systematizessoftwaredevelopmentlayersfromaportabilitypointofview,andanalyzesexistingdevelopmenttools.

Asacasestudy,amessagingsystemisproposedforahealth-promotionresearchstudytopreventobesityandobesity-relatedhealthdisparitiesamonglow-incomeLatinoadolescentgirls.Messagingandpollingmechanismsareusedtocommunicateandautomaticallyprocessresponsedatafromthetargetconstituency.Theaimoftheprojectistoincorporatelow-cost,mobiletechnologytopromotehealthandconnectyouthtocommunityresources,designaninterventiontoincreasemoderatetovigorousphysicalactivity,andtoascertainthefeasibilityoftheapproachandeffectsamongLatinoadolescentgirls.

7881A-04, Session 2

Quality and noise measurements in mobile phone video captureD.Petrescu,J.Pincenti,Motorola,Inc.(UnitedStates)

Thequalityofvideoscapturedwithmobilephoneshasbecomeincreasinglyimportantparticularlysinceresolutionsandformatshavereachedalevelthatrivalsthecapabilitiesavailableinthedigitalcamcordermarket,andsincemanymobilephonesnowallowdirectplaybackonlargeHDTVs.Thevideoqualityisdeterminedbythecombinedqualityoftheindividualpartsoftheimagingsystemincludingtheimagesensor,thedigitalcolorprocessingandthevideocompression,eachofwhichhasbeenstudiedindependently.Inthiswork,westudythecombinedeffectoftheseelementsontheoverallvideoquality.Wedothisbyevaluatingthecaptureundervariouslighting,colorprocessing,andvideocompressionconditions.First,wemeasurefullreferencequalitymetricsbetweenencoderinputandthereconstructedsequence,wheretheencoderinputchangeswithlightandcolorprocessingmodifications.Second,weintroduceasystemmodelwhichincludesallelementsthataffectvideoquality,includingalowlightadditivenoisemodel,ISPcolorprocessing,aswellasthevideoencoder.Ourexperimentsshowthatinlowlightconditionsandforcertainchoicesofcolorprocessingthesystemlevelvisualqualitymaynotimprovewhentheencoderbecomesmorecapableorthecompressionratioisreduced.

7881A-05, Session 2

3D scene reconstruction based on multiview distributed video coding in the Zernike domain for mobile applicationsV.Palma,M.Carli,A.Neri,Univ.degliStudidiRomaTre(Italy)


IS&T /

ReturntoContents

InthispaperaMulti-viewDistributedVideoCoding(DVC)schemeformobileapplicationsispresented.SpecificallyanewfusiontechniquebetweentemporalandspatialsideinformationinZernikeMoments(ZM)spaceisproposed.

AswellknownDVCintroducesaflexiblearchitecturethatenablesthedesignofverylowcomplexvideoencoderscomparedtoitstraditionalcounterparts.Themaingoalofourworkistogenerateatthedecoderthesideinformationthatoptimallyblendstemporalandinterviewdata.Multi-viewDVCperformancestronglydependsonthesideinformationqualitybuiltatthedecoder.Atthisaimtoimproveitsqualityaspatialviewcompensation/predictioninZernikemoments’domainisapplied.Moreindetail,wefirstapplystateoftheartkeypointextractionandmatchingalgorithmstoestimatetheparameterscharacterizingtheeffectsofthegeometricaltransformationsamongdifferentviewsintheimageplanes.Then,tohandlerotations,wepartitioneachviewinblocksandforeachofthemwecomputetheZernikemomentsasaprojectionofthefunctiondefiningtheRegionOfInterestontoasetoforthonormalfunctionswithincircleswhoseradiiareselectedaccordingtothepreviouslyestimatedzoomfactors.Spatialandtemporalmotionactivitywillbefusedtogethertoobtaintheoverallside-information.Theproposedmethodwillbeevaluatedbyrate-distortionperformancesfordifferentinter-viewandtemporalestimationqualityconditions.

7881A-06, Session 2

Psycho-physiological effects of head-mounted displays in ubiquitous use: a comparison of see-through and non-see-through, binocular, and monocular conditionsT.Kawai,WasedaUniv.(Japan);J.P.Häkkinen,UniversityofHelsinki(Finland)andNokiaResearchCenter(Finland)andAaltoUniversity(Finland);K.Oshima,WasedaUniversity(Japan);H.Saito,T.Yamazoe,WasedaUniv.(Japan);H.Morikawa,WasedaUniversity(Japan);G.S.Nyman,UniversityofHelsinki(Finland)

Multimediadevicesarenowfoundeverywhereinmanydifferentforms,includingdevicesthatcanbewornlikeclothing.Theheadmounted-displays(HMDs)arewearabledevicestopresentinformationastheuserviewsthesurroundingenvironment.HMDscanbeclassifiedintofollowingfourtypes:binocularsee-through,binocularnon-see-through,monocularsee-through,monocularnon-see-through.Thesetypesmaybedifferentintermsofuser’svisualinformationprocessing,andconsequentlytheuserexperience.Inpracticaluse,theworkloadmayalsodifferbythetypes.

Inthispaper,theauthorscarriedoutanexperimenttoexaminetheworkloadbytheuseofthreeofabove-mentionedfourtypesHMDs:binocularandmonocularsee-through,andmonocularnon-see-through.Asthetaskstobeperformed,thetensubjectswereaskedshortwalkingthroughaUniversitybuildingwhilewearingthreetypesHMDsandperceivingvisualstimulation.Thetotaldistancewalkedwasapproximately600meters.Newsvideowasprovidedtothesubjectsastheaudio-visualstimulation.Thesubjectiveindexesweremeasuredusingataskloadindex(NASA-TLX)aftereachtrial.Asobjectiveindexes,theheartratewasmeasuredduringeachtrial.Theresultsshowedcommontendenciesinbothsubjectiveandobjectiveindexes.

7881A-07, Session 2

Progressive imagery with scalable vector graphicsG.A.Fuchs,H.Schumann,Univ.Rostock(Germany);R.U.Rosenbaum,Sr.,Univ.ofCalifornia,Davis(UnitedStates)

Vectorgraphicscanbescaledwithoutlossofquality,makingthemsuitableformobileimagecommunicationwhereagivengraphicsmustbetypicallyrepresentedinhighqualityforawiderangeofscreenresolutions.Oneproblemisthatfilesizeincreasesrapidlyascontentbecomesmoredetailed,whichcandegraderesponsetimesand

efficiencyinmobilesettings.Similarchallengeshavebeenaddressedforrasterimagesusingprogressiverefinementschemes,howevertakingadvantageofcompliantprogressivevectorgraphicsincommonimagecommunicationisstillanopenresearchquestion.

ThereforeinthispublicationweshowhowtoprovideprogressiverefinementschemesbasedontheextensibleScalableVectorGraphics(SVG)standard.Weproposetwostrategies:decompositionoftheoriginalSVGandincrementaltransmissionusing(1)severallinkedfilesand(2)element-wisestreamingofasinglefile.ThefirststrategyexploitsSVG’sabilitytoreferenceexternalresources,thesecondusesatranscodertosequentiallystreamindividualgeometricprimitivesandforclient-sidereassemblyoftheSVGfile.

ThepublicationfurtherdiscusseshowbothstrategiesareemployedinmobileimagecommunicationscenarioswheretheusercaninteractivelydefineRoIsforprioritizedimagecommunication.Ourcontributioncloseswithresultsweobtainedfromaprototypicallyimplementedclient/serversetup.

7881A-08, Session 3

Mobile 3D quality of experience evaluation: a hybrid data collection and analysis approachA.P.Gotchev,S.Jumisko-Pyykkö,A.R.Boev,T.Utriainen,J.Häyrynen,M.Mikkola,TampereUniv.ofTechnology(Finland);M.Hannuksela,NokiaResearchCtr.(Finland)

Thepaperpresentsahybridapproachtostudytheuser’sexperiencedqualityof3Dvisualcontentonmobileauto-stereoscopicdisplays.Itcombinesextensivesubjectivetestswithcollectionandobjectiveanalysisofeye-trackeddata.3Dcueswhicharesignificantformobilesaresimulatedinthegenerated3Dtestcontent.Themethodologyforconductingsubjectivequalityevaluationincludeshybriddata-collectionofquantitativequalitypreferences,qualitativeimpressions,andbinoculareye-tracking.Wepresentearlyresultsofthesubjectivetestsalongwithgazefixationmapsobtainedfromraweye-trackeddataafterstatisticalanalysis.Thestudycontributestothequestionwhatisimportanttobevisualizedonportableauto-stereoscopicdisplaysandhowtomaintainandvisuallyenhancethequalityof3Dcontentforsuchdisplays.

7881A-09, Session 3

Overcome the shortcoming in mobile stereoscopyK.Lee,S.Kim,KoreaInstituteofScienceandTechnology(Korea,Republicof)

Instereoscopiccamerasystem,representativetwotypessuchasaparallelandaconvergencehavebeenreported.RecentlyadivergingtypeisintroducedbyDr.Son.Ithasasimilarstereoscopicpropertytotheconvergingtype.Divergingstereocameraalignmentmaytobeasuitabletoamobilestereoscopybecausethemobiledevicehasthelackedspaceforconfiguringcamerastomakeeitherortho-orhyperstereoscopicconditionwithasmallsizeofdisplay,thereisonlyahypostereoscopicconditiongivingacard-boardeffect.Thismattermeansthatmobilestereoscopycannotprovideapresencewithagooddepthsensetoanobserver.Forthisreason,wefocusedonthedepthsensecontrolmethodwithaswitchablestereocameraalignment.Inconvergingtype,thefusiblestereoareabecomeswidercomparedtoaparalleltypewhenthesamefocallengthwasusedinbothtypes.Thismattermeansthatthestereofusibleareaformedbyconvergingtypetobeequaltotheparalleltypewithashortenfocallength.Thereforethereisakindofthezoom-outeffectatthereconstructeddepthsense,becausethedisparityobtainedbytheconvergingtypetobeequaltothedisparitybytheparalleltypehavingashortenfocallengththanbeforethecomparison.Indivergingtype,thefusiblestereoareabecomesnarrowerthantheparallel.Asthesameway,thedivergingtypeguaranteesthesamecharacteristicofthatanincreasedfocallengthisconsideredinparalleltype.Thereforethereisazoom-ineffectexisting.Weconsideredthepermitteddisparity

Conference 7881A: Multimedia on Mobile Devices 2011


IS&T /

ReturntoContents

about2.5mmatthemobiledisplayfortakingsuitablestereofusion.Astheresult,Thesatisfiedbothconverginganddiverginganglesaretakenbythetheoreticalconsiderationsuchas4and1.5degreesundertheconditionofthenarrowedintercameradistance(5mm)withawideFOV(commonmobilephonecamera).Additionally,thezoom-ineffectbecomesrapidlychangedbytheincreasedanglebutzoom-outbecomesretardedrelatively.

7881A-10, Session 3

Comparative study of autostereoscopic displays for mobile devicesA.R.Boev,J.Häyrynen,A.P.Gotchev,TampereUniv.ofTechnology(Finland)

Wepresentacomparativestudyofseveralportableauto-stereoscopicdisplays.Weoverviewthetechnologies,theyarebasedonandpresentparameterswhichareusedfortheirevaluation,suchascrosstalk,3Dcontrast,optimalviewingzone,optimaldisparityrange,andfrequencydomainthroughput.Wepresentasimplyyetprecisemethodologyfortheirmeasurementandsummarizeanddiscusstheresultsofthemeasurements.

7881A-11, Session 3

Subjective evaluation of mobile 3D content: depth range versus compression artifactsT.Haustola,S.Jumisko-Pyykkö,A.R.Boev,A.P.Gotchev,TampereUniv.ofTechnology(Finland)

Thetheoriesabouthumanvisualperceptionstatethattheviewsformedbytheeyesareusedtoformafusedcentral,so-called‘cyclopean’,viewandtoformstereopsis.Correspondingly,theperceivedqualityofa3Dsceneisacombinationoftwocomponents-“2D”quality(imagedetails),andits“3D”quality(qualityofthebinoculardepthcues).However,theverypresenceofstereoscopicdepthchangesthewayimagedetailsareperceivedandequivalentlossofimagefidelitymightcausedifferentqualityexperienceof2Dand3Dvideo.

Inthisstudy,weaimatquantifyingtheimpactofvaryingbinoculardepthin3Dvideoontheoverallperceptualqualityinthepresenceofvaryingcompressionartefacts.Wehavedesignedandconductedasetofsubjectiveexperiments,wherethebinoculardepthandtheimagequalityofasceneareindependentlyvariedondensergrids.Fourreal-worldmultiviewvideoswithdifferentcharacteristicsofscenedynamicsareusedinthetests.Fromeachmultiviewsequence,anumberofstereoscopicvideoswerecreatedusingdifferentcamerapairs,thusachievingvaryingcamerabaselineforthesamescene.Morespecifically,thedepthwasvariedbetween2D,HD-opticalbaselineandmobile-optimalbaseline.Eachstereoscopicvideowascompressedwithvaryingquality,i.e.withfivedifferentQPs.Intheconductedtests,theparticipantswereaskedtogradethequalityofeachcompressedvideoinasinglestimulussetting.Theresultsobtainedforasolidbasefordesigninganobjectivemetricfor3Dvideoqualityevaluation.

7881A-12, Session 3

Development of a 3D mobile receiver for stereoscopic video and data service in T-DMBG.Lee,H.Lee,K.Jung,N.Hur,S.Lee,ElectronicsandTelecommunicationsResearchInstitute(Korea,Republicof)

3DMobilebroadcastingthatdelivers3Dcontentsviamobilebroadcastingnetworkisbelievedtobeaveryattractiveservicebecausesingle-userenvironmentattheterminalsideissuitableforglasses-free3Dviewingandavarietyofmultimediaservicesareapplicableon3Dmobiledevices.Withthehelpofahigh-qualityauto-stereoscopictechnologies,a3DTVserviceoverT-DMB(Terrestrial-

digitalMultimediaBroadcasting),whatwecalla3DDMBservice,hasrecentlybeenintroducedtoprovidefurtherrealisticmobilebroadcastingservices.Asakillerapplicationofthe3DDMB,3DdataservicehasbeenalsodevelopedbasedonMPEG-4systemtechnologies,wherestereoscopicdatacontentswithformatsofJPEG,PNGandMNGaredeliveredandrenderedonthe3DmobileterminalsupportingT-DMB.Itisbelievedthatsuch3Dservicetechnologywillbeusefulformany3Dbroadcastingapplicationssuchasadvertisement,education,sports,movie,drama,andsoon.Generally,backwardcompatibilityisacrucialrequirementforsuccessfullaunchofnewservicesinthedigitalbroadcasting.

7881A-13, Session 3

A right scaled depth sense formed by using a distorted objective space based on CG stereoscopyK.Lee,KoreaInstituteofScienceandTechnology(Korea,Republicof)andKonkukUniv.(Korea,Republicof);D.Kim,KoreaInstituteofScienceandTechnology(Korea,Republicof);G.Um,E.Chang,G.Bang,N.Hur,ElectronicsandTelecommunicationsResearchInstitute(Korea,Republicof);S.Kim,KoreaInstituteofScienceandTechnology(Korea,Republicof)

Instereoscopy,depthdistortionisaseriousproblemtoprovidethecorrectdepthsenseoftheobjectwhenthereconstructeddepthimagewasdisplayedandperceived.Uncorrecteddisparityisamaincauseinducingtheproblemanditisdependedonthestereoscopiccircumstancessuchasastereoscopicsystemconfiguredastereocameraanddisplayandanobservationrelatedaviewingcondition.Thenumerousstudieshavebeenreportedtosolvetheproblembuttheydidnotgiveusageneralsolutionbecausethecausesinducingproblemarecrossedlinkingbetweenthestereoscopicsystemandobservation.Inthispaper,wesuggestedanewwaybasedoncomputergraphics(CG)toovercometheaforementionedshortcomingofacommonstereoscopy.Intermsoftheway,lettheobjectivespacetransformasthedistortedspacetomakeacorrectperceiveddepthsenseasifweareseeingthescaledobjectvolumewhichiswelladjustedtouser’sstereoscopiccircumstance.Sincethedistorteddepthsensestrongrelatestothefixedobjectivespace.Indetail,allparameterswhichrelatedthedepthdistortionsuchasafocallength,aninter-cameradistance,aninneranglebetweencamera’saxes,asizeofdisplay,aviewingdistanceandaneyedistancecanbealteredtotheamountofinverseddistortioninthetransformedobjectivespacebythelinearrelationshipbetweenthereconstructedimagespaceandtheobjectivespace.Therefore,thedepthdistortionwillberemovedafterimagereconstructionprocesswithadistortedobjectivespace.Astheresult,wepreparedastereoimagehavingarightscaleddepthfrom-50mmto+200mmwithanintervalas50mminanofficialstereoscopiccircumstanceandshoweditto5subjects.Allsubjectsrecognizedandindicatedthedesigneddepths.Consequently,theadoptionofdistortedobjectivespaceismorethepowerfulwaytopresentarightscaleddepthsensewithoutthedepthdistortionthantheexistingwaysinCGbasedstereoscopy.

7881A-14, Session 4

Smart travel guide: from internet image database to intelligent systemG.Chareyron,J.DaRugna,PôleUniv.LéonarddeVinci(France)

Frommanyyears,tourismisaprimordialmattertoregioneconomy.Tohelpthetouristtodiscoveracity,aregionorapark,manyoptionsareprovidedbypublictourismtravelcenters,byfreeonlineguidesorbydedicatedbookguides.Nonetheless,theseguidesprovideonlymainstreaminformationwhicharenotconformtoaparticulartouristbehavior.Ontheotherhand,wemayfindseveralonlineimagedatabasesallowinguserstouploadtheirimagesandtolocalizeeachimageonamap.Arecentworkhasdemonstratedthatthesewebsites



IS&T /

ReturntoContents

arerepresentativeoftourismpracticesandconstituteaproxytoanalyzetourismflows.Thisworkintendstoanswerthisquestion:knowingwhatIhavevisitedandwhatotherpeoplehavevisited,whereIshouldgonow?Thisprocessneedstoprofileusers,sitesandphotos.ourpaperpresentstheacquireddataandrelationshipbetweenphotographers,sitesandphotosandintroducestheBayesianmodeldesignedtocorrectlyestimatethesiteinterestofeachtourismpoint.Thethirdpartshowsanapplicationofourschema:asmarttravelguideongeolocatedmobiledevices.Thisapplicationpermitsthetravelguidetomatchtheuserwishes

7881A-15, Session 4

Revised benchmarking of contact-less fingerprint scanners for forensic fingerprint detection: challenges and results for chromatic white light scanners (CWL)S.Kiltz,C.Kraetzer,J.Dittmann,C.Vielhauer,Otto-von-Guericke-Univ.Magdeburg(Germany)

Mobilecontact-lessfingerprintscannerscanbeveryimportanttoolsfortheforensicinvestigationofcrimescenes.Tobeadmissibleincourt,dataandthecollectionprocessmustadheretorulesw.r.t.technologyandproceduresofacquisition,processingandtheconclusionsdrawnfromthatevidence.Currently,nooverallacceptedbenchmarkingmethodologyisusedtosupportsomeoftherulesregardingthelocalisation,acquisitionandpre-processingusingcontact-lessfingerprintscanners.Benchmarkingisseenessentialtoratethosedevicesaccordingtotheirusefulnessforinvestigatingcrimescenes.

Ourmaincontributionisarevisedversionofourextensibleframeworkformethodologicalbenchmarkingofcontact-lessfingerprintscannersusingacollectionofextensiblecategoriesanditems.Thesuggestedmaincategoriesdescribingacontact-lessfingerprintscannerarepropertiesofforensiccountry-specificlegalrequirements,technicalproperties,application-relatedaspects,inputsensorytechnology,pre-processingalgorithm,testedobjectandmaterials.Usingthoseitispossibletobenchmarkfingerprintscannersanddescribethesetupandtheresultingdata.Additionally,benchmarkingprofilesfordifferentusagescenariosaredefined.Firstresultsforallsuggestedbenchmarkingproperties,whichwillbepresentedindetailinthefinalpaper,weregainedusinganindustrialdevice(FRTMicroprof200)andconducting18testson10differentmaterials.

7881A-16, Poster Session

A new mobile service: automatic lottery winning identification systemF.Tan,TheHongKongPolytechnicUniv.(HongKong,China);Q.Huang,Univ.ofMissouri-KansasCity(UnitedStates)

Thenumber-basedlotteryticketsareincreasinglypopularallaroundtheworldandmostofpeoplewillbuymorethanonecombinationforonetime.Whenthecountofticketsincreases,thewinningidentificationwillbecomeatroubleandtimeconsuming.Inthispaper,anewservicewillbepresentedasatotalsolutionfortheautomaticlotterywinningidentification.Thisservicebasedonmobiledeviceandusesimageprocessing,specialopticalcharacterrecognition(OCR)initsanalysis.Initially,thecell-phonecamerawillbeusedtocaptureaphotoofthelotterytickets.Thenthetargetnumberscombinationswillbeextractedautomatically.Furthermore,theapplicationwillautomaticallyseekthelatestwinningnumberscombinationfromthelotteryagency’sonlineserverbythecell-phone.Lastly,itwillshowtheidentificationresultofwhetherwinningornot,ifwinning,whichgradewillbe.Theapplicationwasdevelopedonanandroid-basedmobiledeviceandusedtheHongKongMarkSixticketsasthetrainingandtesttargets,goodperformanceandusabilitywereobtained.


Development of testbeds for AGPS mobile applicationsD.Akopian,G.K.Ramachandran,A.Soghoyan,G.V.S.Raju,TheUniv.ofTexasatSanAntonio(UnitedStates)

Duringrecentyearslocationtechnologieshaveemergedasaresearcharea.ThisisessentiallydrivenbythesuccessofUSGlobalPositioningSystem(GPS)andthedevelopmentofotherGlobalNavigationSatelliteSystems(GNSS).

ThispaperstudiestestbeddesignoptionsbasedonGNU-RadioopensourceandLabviewdevelopmentplatforms.GNUradionativeenvironmentallowseasyincorporationofcustomC/C++unitswhichacceleratesthesystemfasterandmakeitsuitableforprocessingrealGPSsignalsinrealtime.ThebenefitsofusingGNUradioSDKisthatitprovidesalreadyaninterfacebetweenthesoftwareandUSRPhardware.ThepaperprovidesdetailsontheenhancementsoftheconventionalAGPSreceiversimplementedonGNUradioplatformincludingadvancednovelacquisitionandtrackingunits.

Labviewisdataflowsoftwarewhichprovidesflexibleandfacilitateddesignenvironment.DifferentfromothersimilarsoftwaretheLabviewhasbeenbuilttoconvenientlyinterfacewithdataacquisitionhardwareforreal-timesignalprocessingincludingRFsignals.Inaddition,LabviewnativeenvironmentalsoallowseasyincorporationofcustomC/C++unitswhichisimportantforreal-timeperformanceevaluations.AlsotheavailabilityofNIGPSsimulationtoolkitprovidesfacilitatedoptionstosimulatescenariosandtestreceiveroperations.

TheimplementationandanalysisofsoftwareGPSreceiverinGNURadioplatformandusingLabviewsimulatortoolkitempoweredwithseveralalgorithmicimprovementsintheGPSreceiverarchitectureitselfisachieved.


Integrity monitoring and mobile platform implementation for WLAN positioningD.Akopian,S.Yalamanchili,A.Melkonyan,TheUniv.ofTexasatSanAntonio(UnitedStates)

GlobalPositioningSystem(GPS)productshelptonavigatewhiledriving,hiking,boating,andflying.GPSusesacombinationoforbitingsatellitestodeterminepositioncoordinates.Thisworksgreatinmostoutdoorareas,butthesatellitesignalsarenotstrongenoughtopenetrateinsidemostindoorenvironments.Asaresult,anewstrainofindoorpositioningtechnologiesthatmakeuseof802.11wirelessLANs(WLAN)isbeginningtoappearonthemarket.InWLANpositioningthesystemeithermonitorspropagationdelaysbetweenwirelessaccesspointsandwirelessdeviceuserstoapplytrilaterationtechniquesoritmaintainsthedatabaseoflocation-specificsignalfingerprintswhichisusedtoidentifythemostlikelymatchofincomingsignaldatawiththosepreliminarysurveyedandsavedinthedatabase.InthispaperweinvestigatetheissueofdeployingWLANpositioningsoftwareonmobileplatformswithtypicallylimitedcomputationalresources.Wesuggestanovelreceivedsignalstrengthrankorderbasedlocationestimationsystemtoreducecomputationalloadswitharobustperformance.Theproposedsystemperformanceiscomparedtoconventionalapproaches.


Optimizing bandwidth and storage requirements for mobile images using perceptual-based JPEG recompressionD.Gill,T.Shoham,S.Carmel,ICVTLtd.(Israel)

Theincreasingqualityandresolutionofcellularphonecamerasiscreatingasignificantburdenonthedevicestorageandthecellularnetworkbandwidth.Inthispaperweproposeanovelmethodfor



IS&T /

ReturntoContents

recompressingdigitalphotos,whichsignificantlyreducestheirfilesize,withoutaffectingtheirperceptualquality.Themethodisappliedbyiterativelyrecompressingtheinputimagebydifferentamounts,andcomputingthevalueofarobust,perceptualimagequalitymeasure,whichconsistsofapixelvaluedifference,ablockinessdetectorandatexturedistortiondetector.Theiterativeprocessensuresthatthemaximumamountofcompression,whichstillyieldsaperceptuallyidenticalimage,isappliedtoeachinputimage.

Insubjectivetestingwehavefoundthatusingourproposedmethod,thefilesizeofhighresolutionphotosmaybereducedbyafactorof3Xto4X(66%-75%reduction)onaveragewithoutaffectingtheirperceptualvisualquality.WehaveimplementedthealgorithminamobileapplicationfortheiPhone3Gsdevice,whichrecompressesthephotoscapturedbythedevice’s3Megapixelcameraby2.5X(60%)onaverage.Thealgorithmhasalsobeenimplementedasacommand-lineapplicationinWindows,LinuxandMacOS.


mQIM principles for MPEG-4 AVC watermarkingM.P.Mitrea,M.Hasnaoui,M.Belhaj,F.J.Prêteux,TELECOM&ManagementSudParis(France)

Watermarkingimposeditselfasanefficientyetflexiblesolutiontodigitalvideoprotection:bypersistently(robustly)andimperceptibly(transparently)insertingsomeextradata(amark)intoavideoexcerpt,illegalcopiescanbetrackeddowntothelastlegaldistribution.

Thepresentpapertakesthechallengeofvideowatermarkingformobile(thin)terminalswherevideoissubjecttohighlyperformingcompressionschemes.Oneoftheauthorpreviouspapersestablishedtheproofofconceptsforcompresseddomainwatermarking:thefirsttransparentmethod,robustagainsttranscodingandgeometricattackshasbeendesigned.However,themainlimitationwasconnectedtothedatapayload.

Inordertosolvethisproblem,anewwatermarkingschemeinsertingthemarkintheMPEG-4AVCstreamisproposed.ThemainnoveltyconsistsinconsideringmultiplesymbolQuantisationIndexModulation(mQIM)watermarkingtechniquesinsteadofbinaryQIM.Inthisrespect,theembedding/detectionrulesarefirstmathematicallydemonstratedandthenobjectivelyassedinindustrialpartnership(undertheframeworkoftheMEDIEVALSFrenchnationalproject);withrespecttopreviousstudies,thismethodallowsthesizeoftheinsertedmark(thedatapayload)tobeincreasedbyanfactorlog2(m),whilekeepingthesamegoodtransparency(objectivelyandsubjectivelyevaluated)androbustness(againsttranscodingandgeometricattacks).


Generalized Phi number system and its application for image decomposition and encryptionS.Agaian,StanfordUni.(UnitedStates);Y.Zhou,TuftsUniv.(UnitedStates)

Inthispaper,weintroduceanewgeneralizedPhinumbersystem(GPNS).Byselectingappropriateparameters,TheGPNScanbespecifiedtothetraditionalPNS,thebinarynumbersystem(base-2),andotherintegerbasenumbersystems.WeinvestigatetheapplicationsofthenewGPNSinimageprocessing.Weintroduceanewparameterbit-planedecompositionmethodusingthenewGPNS.Integratingthisnewdecompositionmethodwiththechaoticlogisticmap,anewimageencryptionalgorithmisintroduced.ExperimentalresultsaregiventodemonstratethattheGPNSshowsexcellentperformanceinimagedecompositionandencryption.


Local polynomial approximation and local binary pattern based face classificationR.Mehta,J.Yuan,K.Egiazarian,TampereUniv.ofTechnology(Finland)

FaceClassificationiswidelystudiedtopicinfieldofComputerVisionandMultimediaInformationProcessing.Oneofthemostfundamentalpartsofthisprocessisanefficientfacerepresentation.Thefaceshouldberepresentedinsuchawaythatthefeaturevectorisrobusttoilluminationchangesandtheposevariationofthesubject.Inliteratureoffaceclassificationmanymethodshavebeenproposedwherefeaturesareextractedatmultiplescalesforrobustclassification.Thesemethodshoweverarenotabletocompletelycapturetheinformationfromdifferentdirectionsofthefaceimage.Facefeaturesarealignedinspecificdirections,e.g.eyes,eyebrowsandlipsarealignedinhorizontaldirectionwhilenoseandfacecontourarealignedinverticaldirection.Bycapturingtheinformationinspecificdirectionsatdifferentscaleswecanrepresentthefaceimageinawaywhichisbettersuitedfortheclassificationpurpose.InthispaperwehaveproposedanovelmethodwhichutilizesLocalPolynomialApproximation(LPA)techniquestocapturethedirectionalinformationofthefaceimageatdifferentscales.Sinceafaceimageisspatiallyvariedandclassificationworksbetterwhenlocaldescriptorsareused,wehaveincorporatedLocalBinaryPattern(LBP)operatorinordertoobtainLPA-LBPmap.AblockwiseoperationisperformedontheLPA-LBPmaptoextractthefaceimagedescriptor.Thedimensionalityofthefinalfeaturevectorisquitehighduetotheblockwiseoperation.Inordertofurtherenhancetheclassificationaccuracyandtoreducethecomplexity,thedimensionalityofthefeaturevectorisreducedbytakingintoaccountthosefeatureswhichvaryacrosstheclasseswhileremainingrelativelyconstantwithinaclass.

Inthismethodfirstofallthedirectionalestimatesoffaceimages(calledLPADirectionalFaces)areobtainedusingLPAfiltersforaspecificnumberofdirectionsandscales.AfterextractingthedirectionalinformationLBPoperatorisappliedonLPADirectionalFacestoobtainLPA-LBPmapwhichcompletelycapturethetextureinformationfromthem.TheLPA-LBPmapisaholisticrepresentationoftheface.InordertohaveafinallocaldescriptorLPA-LBPmapsaredividedintoblocksandhistogramisevaluatedforeachblock.ThenallthehistogramsareconcatenatedtoformtheLPA-LBPHistogramSequence(LPA-LBPHS).Sincethedimensionalityofthisfeaturevectorishigh,LinearDiscriminantAnalysis(LDA)isusedtoreducethedimensionalityofLPA-LBPHS.ThisreducedLPA-LBPHSfeaturevectorisusedforfacerepresentation.FinallySupportVectorMachine(SVM)classifierislearnedinthereducedLPA-LBPHSfeaturespaceforfaceclassification.Experimentsdoneonstandarddatasetsdemonstratethattheproposedmethodhashigherclassificationaccuracythanpreviouslyproposedstate-of-the-artmethods.


Anisotropic multiscale Lucas Kanade pyramidJ.Yuan,K.Egiazarian,TampereUniv.ofTechnology(Finland)

TheLucasKanade(LK)algorithmprovidesasmartiterativeparameter-updateruleforefficientimagealignment,andithasbecomeoneofthemostwidelyusedtechniquesincomputervision.Applicationsrangefromopticalflowandtrackingtolayeredmotion,mosaicconstruction,andfacecoding.TheLKalgorithmhasbeenprovedtobeeffectiveundersmall-noiseconditions.Butinrealworldapplications,especiallyforfacerecognition,objecttrackingandvideorecognition,theimagestobealignedarealwayscapturedbysurveillancesmeaningthattheycouldbequitenoisyandmightbetakenfromvariousangles.Insuchcases,theLKalgorithmcouldnothandle:theaccuracyseverelydegradeswhentheimagequalityispoor;andtheLKalgorithmmaynotevenconvergewhentheanglebetweentemplateandthecapturedimageistoolarge.Atthispoint,anovelconceptofLucasKanadepyramidemergesin2009.ByextractingimagepyramidsfromtheoriginalimagesanditerativelyimplementingLKalgorithmateachlevel,



IS&T /

ReturntoContents

theLKpyramidgainedbetterrobustnessandaccuracy.

YettheresultofLKpyramidstillsuffersfromheavynoiseconditionsandsevereimagedistortions,thiscanbemainlyattributedtothedisabilityofcorrectimagegradientcalculationfromthedistortedimage.Thus,onthebasisofLKpyramid,thispaperproposesanovelAnisotropicMulti-ScaleLucasKanadePyramid(AMSLKP)method.Insteadofcalculatinggradientsinsingledirectionwithfixedscalesizes,thispaperintroducesanisotropiclocalpolynomialapproximation(LPA)andintersectionofconferenceintervals(ICI)methodtotheLKpyramid.TheproposedAMSLKPmethodfirstcalculatesthedirectionalestimatesandgradientswithmultiplescales;thenforeachdirection,itadaptivelyselectstheoptimalscaleforeachpixelintheimageusingICIrule;atlast,theestimateandgradientsofthedistortedimageiscomputedbyfusingthedirectionalresultstogether.

Theproposedmethodisevaluatedindifferentnoiseconditionswithvariousdistortionlevels.ExperimentresultsshowthattheAMSLKPmethodimprovestheaccuracybymorethantenpercentcomparedtoLKpyramid;moreover,theconvergenceprocessisaccelerated.


iPhone forensics with Mac OS X based open source toolsR.Creutzburg,T.Höne,FachhochschuleBrandenburg(Germany)

TheaimofthisarticleistoshowtheusefulnessofMacOSXbasedOpenSourceToolsforforensicinvestigationofmoderniPhones.

ItisdemonstratedhowimportantdatastoredintheiPhoneareinvestigated.

Threedifferentscenariosofinvestigationsarepresentedthatarewell-suitedforaforensicslabwork.


Forensic investigation of certain types of mobile devicesR.Creutzburg,S.Luttenberger,FachhochschuleBrandenburg(Germany)

TheaimofthisarticleistoshowtheusefulnessofWindowsbasedOpenSourceToolsforforensicinvestigationofmodernmobiledevices.

Itisdemonstratedhowimportantdatastoredinthemobileldeviceareinvestigated.

Differentscenariosofinvestigationsarepresentedthatarewell-suitedforaforensicslabwork.



IS&T /

ReturntoContents

Conference 7881B: Multimedia Content Access: Algorithms and Systems VTuesday-Wednesday25-26January2011PartofProceedingsofSPIEVol.7881BMultimediaContentAccess:AlgorithmsandSystemsV

7881B-52, Poster Session

No-reference blur estimation based on the average cone ratio in the wavelet domainL.Platisa,A.Pizurica,E.Vansteenkiste,W.R.Philips,Univ.Gent(Belgium)

Withextensivetechnologicaladvancementsinelectronicimagingtoday,highimagequalityisbecominganimperativenecessityinthemodernimagingsystems.Animportantpartofqualityassurancearetechniquesformeasuringthelevelofimagedistortion.Recently,weproposedawaveletbasedmetricofblurrinessinthedigitalimagesnamedCogACR.Themetricishighlyrobusttonoiseandabletodistinguishbetweenagreatrangeofblurriness.Also,itcanbeusedeitherwhenthereferencedegradation-freeimageisavailableorwhenitisunknown.However,themetriciscontentsensitiveandthusinano-referencescenarioitwasnotfullyautomated.Inthispaper,wefurtherinvestigatethisproblem.First,weproposeamethodtoclassifyimagesbasedonedgecontentsimilarity.Next,weusethismethodtoautomatetheCogACRestimationofblurinano-referencescenario.Ourresultsindicatehighaccuracyofthemethodforarangeofnaturalsceneimagesdistortedwiththeout-of-focusblur.Withintheconsideredrangeofblurradiusof0to10pixels,variedinstepsof0.25pixels,theproposedmethodestimatestheblurradiuswithanabsoluteerrorofupto1pixelin80to90%oftheimages.


Texture based Markovian modelling for image retrievalD.Benboudjema,EcoleNationaleSupérieuredel’ElectroniqueetdesesApplications(France)

Textureisoneofthemainfeatureswithcolor,shape,edges...bywhichhumanbeingperceivesimage.Itcanbeviewedasasetofpixelswithinanimagewhoselocalstatisticsorlocalproperties(e.g.periodicity,frequency)areconstantorslightlyvarying.Inthispaperweaddressfromthestatisticalstandpoint,theimageindexingproblemforimageretrieval.TwonewMarkovmodelbasedapproachesallowingtexturefeatureextractionwillbeproposedandacomparisontothetexturefeaturesbasedonGaborfilterswillbepresented.ThethreemethodshavebeentestedforimageretrievaltaskusingSVMclassifierwithaGaussiankernelontexture-orienteddatabase,Brodatz,andonanothertexturedatabase.Theexperimentalresultsshowfortheproposedschemepromisingresults.


Non-supervised macro segmentation of the large-scale TV videosH.Bai,Y.Dong,FranceTelecomR&DBeijing(China)

Inthispaper,anovelnon-supervisedmacrosegmentationalgorithmispresentedbydetectingduplicatesequencesoflarge-scaleTVvideos.Motivatedbythefactthat``Inter-Programs’’arerepeatedlyinsertedintotheTVvideos,sothemacrostructureofthevideoscanbeeffectivelyandautomaticallygeneratedbyidentifyingthespecialsequences.Therearefoursectionsinthealgorithm,namely,keyframeextraction,discretecosinetransform-basedfeaturegeneration(afixed-size$64D$signature),Locality-SensitiveHashing(LSH)-basedframeretrievalandmacrosegmentationthroughtheduplicatedsequencedetectionandthedynamicprogramming.Themaincontributionsare:(1)supplyoneeffectiveandefficientalgorithmforthemacrosegmentationinthelarge-scaleTVvideos,(2)LSHcanquicklyquerythesimilarframes,and

(3)thenon-supervisedlearnedduplicatesequencemodelsareusedtofindthelostduplicatesequencesbythedynamicprogramming.Thealgorithmhasbeentestedin15-daydifferent-typeTVstreams.TheF-measureofthesystemisgreaterthan96%.Theresultshowsthatthealgorithmisefficientandeffectiveforthemacrosegmentation.

7881B-39, Session 5

Material classification and automatic content enrichment of images using supervised learning and knowledge basesG.Knapp,S.A.Mallepudi,R.A.Calix,LouisianaStateUniv.(UnitedStates)

Inrecentyearstherehasbeenarapidincreaseinthesizeofvideoandimagedatabases.Effectivesearchingandretrievingofimagesfromthesedatabasesisasignificantcurrentresearcharea.Inparticular,thereisagrowinginterestinquerycapabilitiesbasedonsemanticimagefeaturessuchasobjects,locations,andmaterials,knownascontent-basedimageretrieval.Thisstudyinvestigatedmechanismsforidentifyingmaterialspresentinanimage.Thesecapabilitiesprovideadditionalinformationimpactingconditionalprobabilitiesaboutimages(e.g.objectsmadeofsteelaremorelikelytobebuildings).ThesecapabilitiesareusefulinBuildingInformationModeling(BIM)andinautomaticenrichmentofimages.I2Tmethodologiesareawaytoenrichanimagebygeneratingtextdescriptionsbasedonimageanalysis.Inthiswork,alearningmodelistrainedtodetectcertainmaterialsinimages.Totrainthemodel,animagedatasetwasconstructedcontainingsinglematerialimagesofbricks,cloth,grass,sand,stones,andwood.Forgeneralizationpurposes,anadditionalsetof50imagescontainingmultiplematerials(somenotusedintraining)wasconstructed.Twodifferentsupervisedlearningclassificationmodelswereinvestigated:asinglemulti-classSVMclassifier,andmultiplebinarySVMclassifiers(onepermaterial).ImagefeaturesincludedGaborfilterparametersfortexture,andcolorhistogramdataforRGBcomponents.AllclassificationaccuracyscoresusingtheSVM-basedmethodwereabove85%.Thesecondmodelhelpedingatheringmoreinformationfromtheimagessinceitassignedmultipleclassestotheimages.AframeworkfortheI2Tmethodologyispresented.

7881B-40, Session 5

Personal photo album summarization for global and local photo annotationM.Broilo,F.G.B.DeNatale,Univ.degliStudidiTrento(Italy)

Althoughcontent-basedmediaretrievaltoolsarecontinuouslyimproving,personalizedimageannotationisstilloneofthemostreliablewaystoindexlargeimagearchives.Unfortunately,itisalsoatimeconsumingandrepetitiveoperation.Usingcontenttofacilitatetheuserinmediaannotationmayleadtoreducedeffortandmoreaccurateresults.Inthispaperweproposeacontent-basedinteractivetoolthatsupportsauserinannotatinghispersonalphotoalbums.Thesystemprovidestwomainfunctionalities:tosummarizeaphotocollectioninsalientmoments,andtoannotatepicturesinasemi-supervisedwaybasedontheirglobalandlocalcontent.Thesummarizationisbasedonabottom-upunsupervisedhierarchicalclusteringthatexploitstwodifferentmatricesofvisualdistances,whilethelocaltaggingusesanobjectretrievalmethodbasedonlocalimagefeatures.Experimentsonpersonalphotocollectionsshowthattheproposedtechniqueproducesgoodresultsintermsoforganizationandaccesstodata.


IS&T /

ReturntoContents

7881B-41, Session 5

Event-driven people re-identification across photo collectionsL.LoPresti,M.Morana,M.LaCascia,Univ.degliStudidiPalermo(Italy)

Personre-identificationacrosspersonalphotoalbumsenablesthedevelopmentofnewautomatedtoolstosupporttheuserinbrowsingandmanaginghisowncollection.

Inthispaper,anewsystemispresentedtosupporttheuserduringtheannotationtask.Thesystemautomaticallydetectspersonsineachphotoandtriestoinferhypotheticcorrespondencesamongpersonacrossthephotoalbum.Inthisway,thesystemisabletofindthegroupsofphotoswhereapersonwasdetectedtakingadvantagefromthefactthateachpersoncanappearatmostonceineachphoto.

Weproposetomodeltheproblemofpeoplere-identificationinphotosasadataassociationproblemdrivenbytemporalevents.

Notingthatappearanceinformationsuchasclothingdescriptorsaremorereliablewithinshorttemporalwindowwhilefacialfeaturesaremorereliableacrosswidertemporalwindow,weproposetoperformthere-identificationprocessintwosteps:firstidentifyingpersonswithinthesameeventusingbothfacialandclothingdescriptors,theninferringassociationamongpersonsidentifiedacrosstemporaleventsusingfacialinformation.Ourmethodisfullyautomatedanddoesnotrequireanyinitializationneitheraprioriknowledgeofthenumberofpersonsthatareinthephotocollection.

Experimentswereperformedonapubliclyavailabledatasetandresultsarecomparedwiththoseobtainedbyusingstandardclusteringmethods.

7881B-42, Session 6

Web-scale multimedia processingM.Slaney,Yahoo!Inc.(UnitedStates)

Noabstractavailable

7881B-43, Session 7

Spatially organized visualization of image query resultsG.Ciocca,C.Cusano,Univ.degliStudidiMilano-Bicocca(Italy);S.Santini,Univ.AutónomadeMadrid(Spain);R.Schettini,Univ.degliStudidiMilano-Bicocca(Italy)

Inthisworkwepresentasystemwhichvisualizestheresultsobtainedfromimagesearchenginesinsuchawaythatuserscanconvenientlybrowsetheretrievedimages.Thewayinwhichsearchresultsarepresentedallowstheusertograspthecompositionofthesetofimages“ataglance”.Todoso,imagesaregroupedandpositionedaccordingtotheirdistributioninaprosemanticfeaturespacewhichencodesinformationabouttheircontentatanabstractionlevelthatcanbeplacedbetweenvisualandsemanticinformation.Thecompactnessofthefeaturespaceallowsafastanalysisoftheimagedistributionsothatallthecomputationcanbeperformedinrealtime.

7881B-45, Session 7

Image retrieval considering people co-occurrence relations using relevance feedbackK.Shimizu,N.Nitta,N.Babaguchi,OsakaUniv.(Japan)

Therecentpopularityofdigitalcamerasallowsustotakeimageseasily.Thelargenumbersofimagestakenbyconsumersarestoredin

personalcomputers,onwebservers,etc.,andthereisanincreasingneedforefficientlyandaccuratelyretrievingimagescontainingspecificobjectsfromsuchimagecollections.Oneofthepopularapproachesforrealizingsuchimageretrievalisexample-basedretrieval,wheretheuserpresentsanexampleimagecontainingaspecificobjectasaqueryimagetoretrieveotherimagescontainingthesameobjectfromimagecollections.Visualfeaturesareextractedfromobjectregionsinthequeryimageandineveryimageintheimagecollection,andtheimagesarepresentedtotheuserintheorderofsimilaritiesbetweenthesefeatures.Especiallyforretrievingimagescontainingspecificpersons,thefeaturesareusuallyextractedfromtheirfaceregionsinimages.Furthermore,thefeaturesextractedfromotherregionsinimagesareusedtoimprovetheretrievalorrecognitionperformancerecently.Wefocusonthefactthatsomepeoplesuchasfamilyorfriendsaremorelikelytoappearinthesameimagesthanothersandusevisualfeaturesofnotonlythequeriedpersonbutalsopeoplewhohavestrongco-occurrencerelationswiththequeriedpersontoimprovetheretrievalperformance.

Inthispaper,peoplewhoappearwiththequeriedpersoninthesameimagesarecalledco-occurringpersons.Ourproposedsystemretrievesimagesofspecificpersonsbyconsideringtheco-occurrencerelationsbetweenthequeriedpersonsandco-occurringpersons.Givenaqueryimage,thesystemcalculatestheexpectationdegreeswhichindicatehowhighthequeriedpersonisexpectedtobeinimagesandpresentstheimageswithhighexpectationdegreestotheuser.Therelevancefeedbackisusedtolearntheco-occurringrelationsbetweenthequeriedandco-occurringpersons.Aftertheuserspecifiesifeachpresentedimagecontainsthequeriedperson,thesystemlearnsvariousco-occurringpersonswhoarenotinthequeryimage,variousfacialappearancesofthequeriedandco-occurringpersons,andthestrengthoftheco-occurrencerelationsbetweeneachco-occurringpersonandthequeriedperson.Then,theretrievalresultscanbeimprovedbyconsideringthelearnedpeopleco-occurrencerelations.

Inordertoevaluatetheproposedsystem,wecollectedimagestakenbyconsumersfrom6persons,allofwhicharehis/herimageswithfamilyorfriends.Thenumberofpersonsappearinginanimagerangesfrom1to6butmostfrequently3.Fromthepersonscontainedintheseimages,wemanuallyselected20personswhooftenappearwithfamilyorfriendsasthequeriedpersons.WeevaluatedtheretrievalresultswiththethreemeasuresMeanAveragePrecision(MAP),recallrate,andaveragerankofcorrectlyretrievedimages.Afterfivefeedbackiterations,therecallratewasimprovedfrom34%,whichistheresultobtainedwhenconsideringonlythequeriedperson,to53%byconsideringthepeopleco-occurrencerelationsafterlearningvariousco-occurringpersonsbyusingonlythequeriedperson.Theresultshaveverifiedtheeffectivenessoflearningthepeopleco-occurrencerelationsforretrievingimagesofaspecificperson.

7881B-47, Session 9

Face detection and recognition in FacebookR.Yan,J.Yang,Facebook(UnitedStates)

Noabstractavailable

7881B-48, Session 9

Applications of consumer photo content understandingD.Tretter,H.Chao,Hewlett-PackardLabs.(UnitedStates);Y.Gao,Hewlett-PackardCo.(UnitedStates);N.Lyons,F.Tang,C.Willis,P.Wu,J.Xiao,T.Zhang,X.Zhang,Q.Lin,Hewlett-PackardLabs.(UnitedStates)

Noabstractavailable

Conference 7881B: Multimedia Content Access: Algorithms and Systems V


IS&T /

ReturntoContents

Conference 7881B: Multimedia Content Access: Algorithms and Systems V

7881B-49, Session 9

Know your data: understanding implicit usage versus explicit action in video content classicationJ.Yew,D.A.Shamma,Yahoo(UnitedStates)

Inthispaper,wepresentamethodforvideocategoryclassicationusingonlysocialmetadatafromwebsiteslikeYouTube.Inplaceofcontentanalysis,weutilizecommunicativeandsocialcontextssurroundingvideosasameanstodetermineacategoricalgenre,e.g.Comedy,Music.Wehypothesizethatvideoclipsbelongingtodifferentgenrecategorieswouldhavedistinctsignaturesandpatternsthatareredirectedintheircollectedmetadata.Inparticular,wedeneanddescribesocialmetadataasusageoractiontoaidinclassication.WetrainedaNaiveBayesclassiertopredictcategoriesfromasampleof1,740YouTubevideosrepresentingthetopvegenrecategories.Usingjustasmallnumberoftheavailablemetadatafeatures,wecomparetheclassicationsproducedbyourNaiveBayesclassierwiththoseprovidedbytheuploaderofthatparticularvideo.ComparedtorandompredictionswiththeYouTubedata(21%accurate),ourclassierattainedamediocre33%accuracyinpredictingvideogenres.However,wefoundthattheaccuracyofourclassiersignicantlyimprovesbynominalfactoringoftheexplicitdatafeatures.Byfactoringtheratingsofthevideosinthedataset,theclassierwasabletoaccuratelypredictthegenresof75%ofthevideos.Wearguethatthepatternsofsocialactivityfoundinthemetadataarenotjustmeaningfulintheirownright,butareindicativeofthemeaningofthesharedvideocontent.Theresultspresentedbythisprojectrepresentsarststepininvestigatingthepotentialmeaningandsignificanceofsocialmetadataanditsrelationtothemediaexperience.

7881B-50, Session 9

Multimedia information retrieval at FX Palo Alto LaboratoryM.L.Cooper,J.Adcock,A.Girgensohn,J.Pickens,L.D.Wilcox,FXPaloAltoLab.(UnitedStates)

ThispaperdescribesresearchactivitiesatFXPaloAltoLaboratory(FXPAL)intheareaofmultimediabrowsing,search,andretrieval.Overviewsofrelevantsystemsarepresentedandreferenceswithadditionaldetailsareprovided.Wefirstconsiderinterfacesfororganizationandmanagementofpersonalphotocollections.Wethensurveyourworkoninteractivevideosearchandretrieval.Throughoutwediscusstheevolutionofboththeresearchchallengesintheseareasandtheproposedsolutions.

7881B-51, Session 9

Image and video content analysis: challenges of scaleJ.Yagnik,GoogleInc.(UnitedStates)

Noabstractavailable


IS&T /

ReturntoContents

Conference 7882: Visual Information Processing and Communication IITuesday-Wednesday25-26January2011PartofProceedingsofSPIEVol.7882VisualInformationProcessingandCommunicationII

7882-12, Session 1

Visual search: a tutorial overviewR.Grzeszczuk,NokiaResearchCtr.(UnitedStates)

Noabstractavailable

7882-02, Session 2

A hybrid video codec based on extended block sizes, recursive integer transforms, improved interpolation, and flexible motion representationM.Karczewicz,P.Chen,R.Joshi,X.Wang,W.Chien,R.Panchal,M.Coban,I.S.Chong,Qualcomm(UnitedStates);Y.A.Reznik,QualcommInc.(UnitedStates)

Noabstractavailable

7882-03, Session 2

Achieving H 264/AVC performance using distributed video coding combined with super-resolutionR.Klepko,D.Wang,G.Huchet,CommunicationsResearchCtr.Canada(Canada)

DistributedVideoCoding(DVC)isanemergingvideocodingparadigmforthesystemsthatrequirelowcomplexityencoderssupportedbyhighcomplexitydecodersaswouldberequiredfor,say,real-timevideostreamingfromonemobilephonetoanother.Undertheassumptionofanerror-freetransmissionchannel,thecodingefficiencyofcurrentDVCsystemsisstillbelowthatofthelatestconventionalvideocodecs,suchasH.264/AVC.ToincreasecodingefficiencyweproposeinthispaperthateithereverysecondKeyframeoreveryWyner-Zivframeisdownsampledbyafactoroftwoinbothdimensionspriortocoding.However,thisinturnwouldrequireupsamplingcoupledwithinterpolationatthedecoder.Simpleinterpolation(e.g.,bicubicorFIRfilter)wouldnotsufficesincehigh-frequency(HF)spatialimagecontentwouldbemissing.Instead,weproposetheincorporationofasuper-resolution(SR)technique,specificallybaseduponanexample-basedscene-specificmethod,toallowthisHFcontenttoberecovered.TheSRtechniquewilladdcomputationalcomplexitytothedecodersideoftheDVCsystem,whichisallowablewithintheDVCframework.Rate-distortioncurveswillshowthatthisnovelcombinationofSRwithDVCimprovesthesystemperformancebyuptoseveraldecibelsasmeasuredbythePSNR,andcanactuallyexceedtheperformanceofanH.264/AVCcodecusingGOP=IP.

7882-04, Session 2

Real-time priority-aware transfer of SVC encoded video over MIMO communications systemD.Radakovic,Y.Yao,R.Ansari,Univ.ofIllinoisatChicago(UnitedStates)

Anovelcross-layermethodisproposedforreal-timetransmissionofstandardcompliantscalablevideooverapower-limitedmultiple-input

multiple-output(MIMO)systemwithchannelstatefeedback.IntheMIMOsystem,adaptivepowerallocationandantennaselectionareutilizedforcreationofunequalbiterrorrate(BER)sub-channels.BERacrossallthesub-channelscanbeimprovedbyreducingthechannelthroughput.Intheproposedmethod,thescalablevideoisfirstdividedintomultiplevideosub-streamsofunequalimportancebycontent-basedpartitioningandsortingofvideolayers.Anoveltechniqueisutilizedtoselectthesub-streamdatatobesentovertheavailableMIMOsub-channelsastomatchtheimportanceofthevideodatatoboththechannelBERanddatatransmissiondelay.Videopacketsthataredelayedexcessivelyarediscardedatthetransmitter.Atrade-offexistsbetweenthelossesinvideopeaksignal-to-noiseratio(PSNR)resultingfromdiscardedvideopacketsatthetransmitter,andgainsinvideoPSNRduetolowerchannelBER.SimulationresultsshowthattheproposedmethodresultsinsignificantlyimprovedperformancecomparedwithvideotransmissionoverconstantBERchannelswiththroughputequaltothevideobitrate.

7882-05, Session 2

Optimal power allocation and joint source-channel coding for wireless DS-CDMA visual sensor networksK.Pandremmenou,L.P.Kondi,K.E.Parsopoulos,Univ.ofIoannina(Greece)

Noabstractavailable

7882-06, Session 3

A device and an algorithm for the separation of visible and near infrared signals in a monolithic silicon sensorG.Langfelder,PolitecnicodiMilano(Italy);T.Malzbender,Hewlett-PackardLabs.(UnitedStates);A.F.Longoni,F.Zaraga,PolitecnicodiMilano(Italy)

Noabstractavailable

7882-07, Session 3

Localization of buildings with a gable roof in very-high-resolution aerial imagesL.Hazelhoff,CycloMediaTechnologyB.V.(Netherlands)andTechnischeUniv.Eindhoven(Netherlands);P.H.N.deWith,TechnischeUniv.Eindhoven(Netherlands)andCycloMediaTechnologyB.V.(Netherlands)

Thisstudyaimsattherobustautomaticdetectionofbuildingswithagableroofinvaryingruralareasfromvery-high-resolutionaerialimages.Theoriginalityofourapproachresidesinacustom-madedesignextractingkeyfeaturesclosetomodeling,suchase.g.roofridgesandgutters.Inthisway,weallowalargefreedominroofappearances.Theproposedmethodisbasedonacombinationoftwohypotheses.First,itexploitsthephysicalpropertiesofgableroofsanddetectsstraightline-segmentswithinnon-vegetatedandnon-farmlandareas,aspossibilitiesofoccurringroof-ridges.Second,foreachofthesecandidateroof-ridges,thelikelyroof-gutterpositionsareestimatedforbothsidesofthelinesegment,resultinginasetofpossibleroofconfigurations.Thesehypothesesarevalidatedbasedontheanalysisofsize,shadow,colorandedgeinformation,whereforeachroof-ridge


IS&T /

ReturntoContents

candidatetheoptimalconfigurationisselected.Roofconfigurationswithunlikelypropertiesarerejectedandafterwardsridgeswithoverlappingconfigurationsarefused.Experimentsconductedonasetof200imagescoveringvariousruralregions,withalargevariationinbothbuildingappearanceandsurroundings,showthatthealgorithmisabletodetect75%ofthebuildingswithaprecisionof69.4%.Weconsiderthisasareasonablygoodresult,sincethecomputingisfullyunconstrained,numerousbuildingswereoccludedbytreesandbecausethereisasignificantappearancedifferencebetweentheconsideredtestimages.

7882-08, Session 3

Impact of near-lossless and lossy coding on information extraction from hyperspectral dataA.C.Miguel,SeattleUniv.(UnitedStates)

Inthispaper,weevaluatetheabilitytoextractmeaningfulinformationfromthedecompressedimagingspectrometerdata.Wecomparetheresultsofimageprocessingperformedonhyperspectraldatacompressedwithanear-losslessbitplanecoderwiththeresultsobtainedonimagescodedwithalossyJPEG2000-basedalgorithm.Ourstudyisextensive:weinvestigateawiderangeofbitrates,useallscenesoftheAVIRIS224-bandCuprite,JasperRidge,andMoffettFieldsradianceimages,andemployseveralmeasuresofrate-distortionandinformationextractionperformanceincludingPSNR,MAD,spectralanglemapper(SAM),andthetruepositiveandnegativeratesofwhole-andmixed-pixelanalysis,andanomalydetection.Ourresultsshowthatrestrictingthecompressionerrortobeuniformoverthewholeimagedoesnotimprovetheresultsofimageprocessing,whencomparedtoanefficientlossycodingtechnique.Athigherbitrates,theresultsofpost-processingperformedondatacompressedwiththenear-losslesscoderarecomparabletothoseobtainedwithanefficientlossycoder.Atlowerbitrates,thelossycoderoutperformsthenear-losslesscoder.Weconcludethatlossycompressionalgorithmsarepreferableforhyperspectraldatacompressionandprocessingapplicationsovernear-losslessmethods.

7882-09, Session 3

Motion adaptive Kalman filter for superresolutionM.Richter,F.Nasse,H.Schroeder,TechnischeUniv.Dortmund(Germany)

Superresolutionisastrategytoenhanceimagequalityofbothlowandhighresolutionvideo.Especiallyrecursivesuperresolutionalgorithmscanfulfillthesequalityaspectsbecausetheycontrolthevideooutputusingafeed-backloopandadapttheresultinthenextiteration.AverypromisingapproachistheutilizationofKalmanfiltersasproposedbyFarsiuetal.Reliablemotionestimationisessentialforsuperresolutionandthereforerobustglobalmotionmodelsarepreferredoverlocalmodels,therebylimitingtheapplicationofsuperresolution.

ThecontributionofourpaperisaninvestigationhowtheKalmanfiltercanbeextendedtoallowimprovedhandlingofsequenceswithcomplexmotion.Motionadaptivevarianceestimationandasegmentationtoapplysuperresolutiononlyincaseofglobalmotionarekeyfeaturestoreachthisgoal.Experimentsconfirmthepotentialofourproposalforidealandrealvideosequencesincomparisontostate-of-the-artmethodsliketrainablefilters.Thefinalpaperwillgiveadetailedexplanationofthealgorithmwithourproposedimprovements.Moreover,resultsofadetailedevaluationwillbepresented,e.g.investigationofrequiredmotionestimationaccuracytoreachaqualityhigherthanspatialprocessing,algorithmconvergencespeedandqualitygainreachedbyourimprovements.

7882-10, Session 3

Hyper-cube watermarking schemeM.Chaumont,D.Goudia,W.Puech,LIRMM(France)

In2007,LiandCoxshowedthattheirschemecalledPerceptual-QIM(P-QIM)wasoneofthesolutionsthemostsuccessful(eventhebest)inordertowatermarkmulti-bitsinanimagebyaquantizationapproach.Ourresearchledustotakesomeoftheirideasandbroughtnewproposals.Thus,thispaperpresentsanewschemewhichwecallHyper-Cube.Inadditiontore-expressthemechanismsofwatermarkingfromadifferentangle,weproposethreeimprovements:theuseoftheJPEGquantizationtableinsettingthesizeoflattices,thecalculationofmodifiedWatsonslacksonaneighborhood,andtheuseofacorrectingcodemoreefficientthanthesimplerepetitioncode.Giventheobtainedresults,wecanconcludethattheHyper-Cubewatermarkingschemeiscurrentlyoneofthemostsuccessfultechniquewhenonewantstowatermarkanimageusingquantization-basedapproaches.

7882-11, Session 3

A joint JPEG2000 compression and watermarking system using a TCQ-based quantization schemeD.Goudia,M.Chaumont,W.Puech,LIRMM(France);N.HadjSaid,Univ.MohamedBoudiafDesSciencesEtDeLaTechnologied’Oran(Algeria)

Inthispaper,wedescribeaTrellisCodedQuantization(TCQ)-basedquantizationandwatermarkingtechniqueintheframeworkofJPEG2000stillimagecompression.Furthermore,weinvestigatethedesignofanoveljointcompressionandwatermarkingschemebasedonahybridTCQmodulewhichcanperformatthesametimequantizationandwatermarkembedding.Thewatermarkextractionprocesscanbeachievedbothduringandafterimagedecompression.Anotheradvantageisthelowercomplexityofthesystembecausethequantizationstageisusedforbothcompressionandwatermarkingpurposes.ExperimentalresultshavedemonstratedthattheproposedjointschemesuccessfullysurvivesJPEG2000compressionwithminimaldegradationoftheimagequality.Wealsostudytherobustnessoftheschemeagainstgaussianfilteringattackandvalumetricattack.


Depth map coding based on color motion informationB.T.Oh,SamsungAdvancedInstituteofTechnology(Korea,Republicof)andTheUniv.ofSouthernCalifornia(UnitedStates);H.Wey,D.Park,SamsungAdvancedInstituteofTechnology(Korea,Republicof)

Thispaperpresentsanefficientdepthmapcodingmethodbasedoncolormotioninformationinmulti-viewplusdepth(MVD)system.Ascomparedtotheconventionaldepthmapcodinginwhichthedepthvideoisseparatelycoded,theproposedschemeinvolvesthecolorinformationfordepthcoding.Indetails,theproposedalgorithmsubsamplestheinputdepthdataalongtemporaldirectiontoreducethebit-rate,andnon-encodeddepthframesarefullyrecoveredatthedecodersideguidedbythemotioninformationextractedfromthedecodedcolorvideo.Thesimulationresultsshowsthehighcodingefficiencyoftheproposedscheme,anditalsoshowsthattherecovereddepthdataisnotmuchdifferentfromthereconstructedone,anditevenprovidesthetemporallyconsistentdepthmapwhichresultsinbettersubjectivequalityforview-interpolation.

Conference 7882: Visual Information Processing and Communication II


IS&T /

ReturntoContents

7882-25, Session 4

Joint design of optics and image processing for application-specific sensors: overthrowing old optical design principles in the new era of electro-opticsD.G.Stork,RicohInnovations,Inc.(UnitedStates)

Noabstractavailable

7882-13, Session 5

Robust HOSVD-based multi-camera motion trajectory indexing and retrievalQ.Li,X.Shi,D.Schonfeld,Univ.ofIllinoisatChicago(UnitedStates)

Wepresentanovelmethodforrobustindexingandretrievalofmultiplemotiontrajectoriesobtainedfromamulti-camerasystem.Motiontrajectoriesdescribethemotioninformationbyrecordingtheobjects’coordinatesinthevideosequence.Wegenerateafour-dimensionaltensorrepresentationofmultiplemotiontrajectoriesfrommultiplecameras.Wesubsequentlyrelyonhigh-ordersingularvaluedecomposition(HOSVD)forcompactrepresentationanddimensionalityreductionofthetensor.WeshowthatHOSVD-basedrepresentationprovidesarobustframeworkthatcanbeusedforaunifiedrepresentationoftheHOSVDofallsubtensors.WethusdemonstrateanalyticallyandexperimentallythattheproposedHOSVD-basedrepresentationcanhandleflexiblequerystructureconsistingofanarbitrarynumberofobjectsandcameras.Simulationresultsarefinallyusedtoillustratethesuperiorperformanceoftheproposedapproachtomultipletrajectoryindexingandretrievalfrommulti-camerasystemscomparedtotheuseofasinglecamera.

7882-14, Session 5

Particle filtering with missing frames and its application to video tracking over lossy networksJ.Huang,D.Schonfeld,Univ.ofIllinoisatChicago(UnitedStates)

Manypracticalscenariossuchasvideotrackinginlossyenvironmentrequirearobustaccuratetrackingalgorithmwithdroppedframes.AnovelrobustapproachisproposedforvisualtrackinginthefirstpartofthispaperinthepresenceofframelosswiththeBayesianImportanceSamplingframeworkbasedonfirst-orderhiddenMarkovmodel(HMM).Thegraphicalmethodsarefirstlyusedtoprovideanexactsolutionforestimationusingfirst-orderhiddenMarkovmodel(HMM)withdroppedframes.WesubsequentlyrelyonSequentialImportanceSamplingtoderivethefirst-orderparticlefilteringalgorithmwithmissingframes.Inthesecondpartofthepaper,wepromotethisresultandpresentthatgraphicalmethodscanalsobeusedtoprovideanexactsolutiontoparticlefilteringwithmissingframesforanmth-orderhiddenMarkovmodel(HMM)andcycle-freegraphs.Theresultingalgorithmrequiresasmallnumberofparticlesforefficienttracking.Experimentalresultsdemonstratethesuperiorityandrobustnessoftheproposedapproachtothestandardmethods,yettheadditionalcomputationaltimerequiredisnegligible.

7882-15, Session 5

Affine image registration with curve mappingY.Li,R.L.Stevenson,Univ.ofNotreDame(UnitedStates)

Noabstractavailable

7882-16, Session 5

Optimal optical flow based disparity map estimation for lossless stereo image codingA.KumarK.C.,I.R.M.Darazi,B.M.Macq,Univ.CatholiquedeLouvain(Belgium)

Independentstereoimagecompression,theaimistominimizethebitrateofdisparitymapandthatofresidualimage.Traditionally,focushasbeenpaidoneitherdisparitymaporresidualimage.Inthispaper,wecomputeanoptimaldisparitymap(intermsofbitrates)byjointlyexploitingthetrade-offbetweendisparitymapandresidualimage.Firstly,thedensedisparitymapisobtainedusingexistingopticalflowtechnique.Secondly,thedensedisparitymapisquantized.Consequently,thebitratefordisparitymapdecreasessignificantlyatthecostofslightincreaseinresidualimage.Asaresult,theoverallbitrateattainsminimumvalue.TheproposedschemeiscompatibleandcanbeintegratedinJPEG2000framework.

7882-17, Session 6

Background subtraction using pixel-wise adaptive learning rate for object tracking initializationK.K.Ng,E.J.DelpIII,PurdueUniv.(UnitedStates)

Inthispaperwepresentanewmethodforobjecttrackinginitializationusingbackgroundsubtraction.

Weproposeaneffectiveschemeforupdatingabackgroundmodeladaptivelyindynamicscenes.Unlikethetraditionalmethodthatusesthesame``learningrate’’fortheentireframeorsequence,ourmethodassignsalearningrateforeachpixelaccordingtotwoparameters.Thefirstparameterdependsonthedifferencebetweenthepixelintensitiesofthebackgroundmodelandthecurrentframe.Thesecondparameterdependsonthedurationofthepixelbeingclassifiedasabackgroundpixel.

Wealsointroduceamethodtodetectandcompensateforsuddenilluminationchange.

Experimentalresultsshowsignificantimprovementsinmovingobjectdetectionindynamicscenessuchaswavingtreeleavesandsuddenilluminationchange,andithasmuchlowercomputationalcostcomparedtoGaussianmixturemodel.

7882-18, Session 6

Background estimation and update in cluttered surveillance video via the radon transformN.Conci,Univ.degliStudidiTrento(Italy);E.Izquierdo,QueenMary,Univ.ofLondon(UnitedKingdom)

Inthispaperweproposeabackgroundestimationandupdatealgorithmforclutteredvideosurveillancesequencesinindoorscenarios.TheimplementationreliesontheintegrationoftheRadontransformintheprocessingchain,appliedonablock-by-blockbasis.TheRadontransformisappliedinthiscontexttoextractthemeaningfulinformationintermsofedgesandtexture,providingasignatureforeachportionoftheimageplane.ThealgorithmisvalidatedintypicalsurveillancecontextsandpresentedinthispaperusingaPETSvideosequence.



IS&T /

ReturntoContents

7882-19, Session 6

People re-identification in camera networks based on probabilistic color histogramsA.D’Angelo,J.E.Dugelay,EURECOM(France)

Noabstractavailable

7882-20, Session 6

Estimating the number of people in crowded scenesM.Kim,W.Kim,C.Kim,KAIST(Korea,Republicof)

Inthispaper,weproposeanovelmethodforestimatingthenumberofpeopleinanindirectmanner.Tothisend,webasicallyemploystatisticalinformationofspace-timeinterestpointstoanalyzecrowdbehavior.Althoughcrowdmotioncanbeeasilyobtainedbysalientcornersinthespace-timedomain,thenumberofthosepointstendstobehighlyvariableduetodifferenceofcrowdmotions.Tocopewithsuchvariationsbetweenconsecutiveframes,wecombineforegroundinformationobtainedfromtheGaussianmixturemodel(GMM)withextractedsalientcorners.Basedonthiscombination,wedefineourfeatures,whicharethenumberofspace-timeinterestpoints,thenumberandsizeofforegroundregions,andfinallyfeedthemintothemultipleregressiontopreciselyestimatethenumberofpeopleincrowdscenes.Tojustifytheefficiencyandrobustnessofourapproach,theexperimentsareconductedonPETS2009datasets.

7882-21, Session 6

MD/PNC with feedback for heterogeneous video multicast in lossy networksA.K.Ramasubramonian,J.W.Woods,RensselaerPolytechnicInstitute(UnitedStates)

Weprovideheterogeneoususersinamulticastwithvideobasedontheiravailableresources.Mostexistingsolutionsrequireknowledgeofthenetworkstructure,whichcanbeimpracticalinlargenetworks.Wepresentaschemethatcombinesmultipledescriptioncodingandpracticalnetworkcoding(MD/PNC)toprovideheterogeneousvideomulticastinlossynetworks.Theparityprovidedbyrandomnetworkcodesnotonlyhelpsincounteringchannellosses,butalsoprovidesdifferentsourceratestoreceiversbasedontheirmax-flowbandwidths.Thismethodonlyrequiresknowledgeofthemax-flowbandwidthsofthereceivers,andnotthenetworkstructure.Theusers’feedbackinformationisusedtocomputetheaveragelossratetheyexperience,andthisisthenusedbythevideoservertooptimizethesourcerateallocationinthedescriptions.Simulationofmulticastofthewell-knownForemantestsequenceonarandomnetworkshowsa1.3-1.5dBimprovementintheaveragePSNRofthereceiverswhileusingMD/PNCwhencomparedtouniratemulticastusingnetworkcoding.

7882-22, Session 6

Object-adaptive depth compensated inter-prediction for depth video coding in 3D video systemM.Kang,GwangjuInstituteofScienceandTechnology(Korea,Republicof);J.Lee,I.Lim,SamsungAdvancedInstituteofTechnology(Korea,Republicof);Y.Ho,GwangjuInstituteofScienceandTechnology(Korea,Republicof)

Nowadays,the3DvideosystemincludingMVD(multi-viewvideoplusdepth)isbeingactivelystudied.Thesystemhasmanyadvantageswithrespecttovirtualviewsynthesissuchasanauto-stereoscopicfunctionality,butcompressionofhugeinputdataremainsaproblem.Therefore,efficient3Ddatacompressionisextremelyimportantinthesystem,andlowtemporalconsistencyandviewpointcorrelationproblemsshouldberesolvedforefficientdepthvideocoding.Inthispaper,weproposeanobject-adaptivedepthcompensatedinterpredictionmethodtoresolvetheproblems.Toachievethis,amean-depthdifferencebetweenacurrentblocktobecodedandareferenceblockiscompensatedduringinterprediction,anduniquepropertiesofdepthvideoareexploitedtoreducesideinformationrequiredforsignalingofthedepthdifference.Toevaluatethecodingperformance,wehaveimplementedtheproposedmethodintoMVC(multiviewvideocoding)referencesoftware,JMVM6.0.ExperimentalresultshavedemonstratedthatourproposedmethodisespeciallyefficientforthetestdepthvideosgeneratedbyDERS(depthestimationreferencesoftware)ofMPEG3DVCgroup.Thecodinggainwasupto11.5%bit-saving,andsubjectivequalityofsynthesizeviewwasnoticeablyimprovedbysupportingbetterperformanceofinterpredictionaroundobjectboundaries.