33
Business Identification: Spatial Detection Alexander Darino Week 8

Business Identification: Spatial Detection Alexander Darino Week 8

Embed Size (px)

Citation preview

Page 1: Business Identification: Spatial Detection Alexander Darino Week 8

Business Identification:Spatial Detection

Alexander DarinoWeek 8

Page 2: Business Identification: Spatial Detection Alexander Darino Week 8

2

Weaknesses to Current Approach

LatitudeLongitude

Geocoding

ReverseGeocoding

Nearby Businesses

Image OCR Detected Text

Business Name

Matching

BusinessIdentification

Business Spatial

Detection

Page 3: Business Identification: Spatial Detection Alexander Darino Week 8

STR Implementation

• STR Implementation: “Automatic Detection and Recognition of Signs From Natural Scenes”

Multiresolution-based potential

characters detection

Character/layout geometry and color properties analysis

Local affine rectification

Refined Detection

Page 4: Business Identification: Spatial Detection Alexander Darino Week 8

Multiresolution-based potential characters detection

Page 5: Business Identification: Spatial Detection Alexander Darino Week 8

Multiresolution-based potential characters detection

Page 6: Business Identification: Spatial Detection Alexander Darino Week 8

Multiresolution-based potential characters detection

Page 7: Business Identification: Spatial Detection Alexander Darino Week 8

STR Implementation

• Original Next Step: Replace with readily available text detector

• Text detectors are not readily available

(Will revisit later)

Page 8: Business Identification: Spatial Detection Alexander Darino Week 8

TEMPLATE-IMAGE SIFT MATCHINGAfter many technical difficulties…

Page 9: Business Identification: Spatial Detection Alexander Darino Week 8

Template

Name George

Font Trebuchet MS

# Levels 3

Peak Threshold 0

Edge Threshold 10

Scale Cutoff 20-24.1(95%)

Image

Name George …

# Levels 3

Peak Threshold 0

Edge Threshold 10

Statistics

Good 1

Bad 0

Total (% G) 1 (100%)

Page 10: Business Identification: Spatial Detection Alexander Darino Week 8

Template

Name George

Font Trebuchet MS

# Levels 5

Peak Threshold 0

Edge Threshold 10

Scale Cutoff 25.60547(95%)

Image

Name George …

# Levels 3

Peak Threshold 0

Edge Threshold 10

Statistics

Good 1

Bad 0

Total (% G) 1 (100%)

Page 11: Business Identification: Spatial Detection Alexander Darino Week 8

Template

Name George

Font Trebuchet MS

# Levels 5

Peak Threshold 0

Edge Threshold 10

Scale Cutoff 25.60547(95%)

Image

Name George …

# Levels 1

Peak Threshold 0

Edge Threshold 10

Statistics

Good 2

Bad 0

Total (% G) 2 (100%)

Page 12: Business Identification: Spatial Detection Alexander Darino Week 8

Template

Name George

Font Trebuchet MS

# Levels 5

Peak Threshold 0

Edge Threshold 10

Scale Cutoff 25.60547(95%)

Image

Name George …

# Levels 1

Peak Threshold 20

Edge Threshold 10

Statistics

Good 2

Bad 0

Total (% G) 2 (100%)

Page 13: Business Identification: Spatial Detection Alexander Darino Week 8

Template

Name George

Font Trebuchet MS

# Levels 5

Peak Threshold 0

Edge Threshold 10

Scale Cutoff 25.60547(95%)

Image

Name George …

# Levels 1

Peak Threshold 20

Edge Threshold 6

Statistics

Good 2

Bad 0

Total (% G) 2 (100%)

Page 14: Business Identification: Spatial Detection Alexander Darino Week 8

Template

Name Aiken

Font Trebuchet MS

# Levels 5

Peak Threshold 0

Edge Threshold 10

Scale Cutoff 24.439163(95%)

Image

Name George …

# Levels 1

Peak Threshold 20

Edge Threshold 6

Statistics

Good 3

Bad 0

Total (% G) 3 (100%)

Page 15: Business Identification: Spatial Detection Alexander Darino Week 8

Template

Name Delicious

Font Trebuchet MS

# Levels 5

Peak Threshold 0

Edge Threshold 10

Scale Cutoff 26.656116

Image

Name George …

# Levels 1

Peak Threshold 20

Edge Threshold 6

Statistics

Good 0

Bad 0

Total (% G) 0 (0%)

Page 16: Business Identification: Spatial Detection Alexander Darino Week 8

Template

Name Prepared

Font Trebuchet MS

# Levels 5

Peak Threshold 0

Edge Threshold 10

Scale Cutoff 26.656116

Image

Name George …

# Levels 1

Peak Threshold 20

Edge Threshold 6

Statistics

Good 0

Bad 0

Total (% G) 0 (0%)

Page 17: Business Identification: Spatial Detection Alexander Darino Week 8

Template

Name Foods

Font Trebuchet MS

# Levels 5

Peak Threshold 0

Edge Threshold 10

Scale Cutoff 26.656116

Image

Name George …

# Levels 1

Peak Threshold 20

Edge Threshold 6

Statistics

Good 0

Bad 0

Total (% G) 0 (0%)

Page 18: Business Identification: Spatial Detection Alexander Darino Week 8

Template

Name Bruegger’s

Font Arial Rounded MT Bold

# Levels 5

Peak Threshold 0

Edge Threshold 10

Scale Cutoff 32.288651

Image

Name Bruegger’s …

# Levels 1

Peak Threshold 20

Edge Threshold 6

Statistics

Good 0

Bad 0

Total (% G) 0 (0%)

Page 19: Business Identification: Spatial Detection Alexander Darino Week 8

Template

Name Bakery

Font Arial Rounded MT Bold

# Levels 5

Peak Threshold 0

Edge Threshold 10

Scale Cutoff 29.145470

Image

Name Bruegger’s …

# Levels 1

Peak Threshold 20

Edge Threshold 6

Statistics

Good 0

Bad 0

Total (% G) 0 (0%)

Page 20: Business Identification: Spatial Detection Alexander Darino Week 8

Template

Name Bakery

Font Arial Rounded MT Bold

# Levels 5

Peak Threshold 0

Edge Threshold 10

Scale Cutoff 29.145470

Image

Name Bruegger’s …

# Levels 1

Peak Threshold 20

Edge Threshold 6

Statistics

Good 0

Bad 0

Total (% G) 0 (0%)

Page 21: Business Identification: Spatial Detection Alexander Darino Week 8

SCENE TEXT RECOGNITIONMoving away from SIFT and revisiting

Page 22: Business Identification: Spatial Detection Alexander Darino Week 8

Scene Text Recognition

• Did not hear back from individuals contacted for STR implementation

• Returning to STR Implementation– Further reading indicates that

patches are necessary for subsequent algorithms

– Text detection is not enough: need to implement specified text detector

Multiresolution-based potential

characters detection

Character/layout geometry and color properties analysis

Local affine rectification

Refined Detection

Page 23: Business Identification: Spatial Detection Alexander Darino Week 8

Color Properties Analysis

• Implemented Gaussian Mixture Model (GMM) to obtain μ and σ of foreground/background for: R/G/B/H/I

• Calculated Confidences that component (RGBHI) can be used to recognize characters

Multiresolution-based potential

characters detection

Character/layout geometry and color properties analysis

Local affine rectification

Refined Detection

Page 24: Business Identification: Spatial Detection Alexander Darino Week 8

Original

Page 25: Business Identification: Spatial Detection Alexander Darino Week 8

Redμ1=141.965609756098 σ1=9.9487

μ2=255

σ2=0.2000

𝐶𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒=0.011361775502324

Page 26: Business Identification: Spatial Detection Alexander Darino Week 8

Greenμ1=172.337447154472 μ2=255 𝐶𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒=0.017056947503074  σ1=4.8463 σ2=0.2000

Page 27: Business Identification: Spatial Detection Alexander Darino Week 8

Blueμ1=122.673512195122 μ2=255 𝐶𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒=0.021524159560500  σ1=6.1478 σ2=0.2000

Page 28: Business Identification: Spatial Detection Alexander Darino Week 8

Hueμ1=106.601736628811 μ2=0 𝐶𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒=0.017897920959170  σ1=5.9561 σ2=0.2000

Page 29: Business Identification: Spatial Detection Alexander Darino Week 8

Intensityμ1=145.658856368567 μ2=255 𝐶𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒=0.051403296762968  σ1=2.1271 σ2=0.2000

Page 30: Business Identification: Spatial Detection Alexander Darino Week 8

Evaluation

• The highest confidence was found in Intensity even though most letters vanish, vs Hue where letters are easily distinguisible

• This suggests text recognition should occur individually per character

• The paper further suggests it needs the patches around the individual characters

Page 31: Business Identification: Spatial Detection Alexander Darino Week 8

Next Step

Page 32: Business Identification: Spatial Detection Alexander Darino Week 8

Next Step

Page 33: Business Identification: Spatial Detection Alexander Darino Week 8

Thank You