20
Mini Jarvis LIVE ANNOTATIONS WITH GOOGLE GLASS August 14, 2014

Mini Jarvis Final Presentation

Embed Size (px)

Citation preview

Page 1: Mini Jarvis Final Presentation

Mini JarvisLIVE ANNOTATIONS WITH GOOGLE GLASS

August 14, 2014

Page 2: Mini Jarvis Final Presentation

Saumya Soman Jin Bing Lin Anthony Yang Yash Sanghavi

Page 3: Mini Jarvis Final Presentation

Outline

• Background• Architecture Overview• Hardware• Software• Challenges Faced• Demonstration• Conclusion• Future Work• Acknowledgements

Page 4: Mini Jarvis Final Presentation

Background

• Object and Facial Detection technology have existed for a number of years• Glass presents an exciting new platform for object and facial detection• Mini Jarvis performs searches using facial and object recognition to learn more

information about those people or objects

Page 5: Mini Jarvis Final Presentation

Architecture Overview

Page 6: Mini Jarvis Final Presentation

Architecture Overview

Page 7: Mini Jarvis Final Presentation

Hardware

Page 8: Mini Jarvis Final Presentation

Software

● Google GDK (Glass Development Kit)● OpenCV Library (Open Source Computer Vision)● Apache Tomcat 7● Eclipse Juno IDE● RESTful webservice● Google Custom Search API

Page 9: Mini Jarvis Final Presentation

OpenCV Library

● OpenCV is a software library for computer vision● Used to train the server to detect an object or face● Cross-platform i.e. it runs on Windows, Android, Linux, etc.● It is written mainly in C++● Also has interfaces for Java anad Python

Page 10: Mini Jarvis Final Presentation

Apache Tomcat 7

● Tomcat is a Servlet container (Web server) licensed under the Apache Software License

● Used for deploying and running web applications written in Java● Open source non-commercial project● Lightweight and easy for server management

Page 11: Mini Jarvis Final Presentation

RESTful webservices

• A web service is a web page meant for a computer to request and process.

• REST (Representational State Transfer) is an architectural style which is based

on web-standards and the HTTP protocol.

• REST based architecture everything is a resource.

• Every resource should support the HTTP common operations (GET, POST, PUT

and DELETE)

Page 12: Mini Jarvis Final Presentation

Google Custom Search API

• Google offers Custom Search API

• Use color and name of entity as search terms

• Returns JSON response of search results

• Allows 100 requests per day

Page 13: Mini Jarvis Final Presentation

Training

• Requires positive and negative images• Positive images contain the object

• Negative images do not contain the object

• OpenCV

• create_samples generates samples

• train_cascade uses samples to make classifier XML

• Successfully trained for one object (banana) and one person (Anthony)

Page 14: Mini Jarvis Final Presentation

Color Extraction

• Extracts color of object recognized using OpenCV libraries

• Detected image is enclosed in a rectangle

• Rescaled to 32 x 32

• Returns dominant color within pixels

Page 15: Mini Jarvis Final Presentation

• Accuracy of detection

• Long training time

• Accuracy of color extraction

• Frequent overheating of the Glass

• Delay to server

• Frequent updates to Glass

Challenges Faced

Page 16: Mini Jarvis Final Presentation

Demonstration

Page 17: Mini Jarvis Final Presentation

Conclusion• Mini Jarvis is proof-of-concept of object and facial detection on

Google Glass• Takes advantage of the built in hardware of Glass• Privacy Issues?

Page 18: Mini Jarvis Final Presentation

Future Work• Add more objects and faces• Develop methodology to train with better accuracy• Optimize the request-response time from server• Reverse Image Search• Multiple Simultenous Annotations

Page 19: Mini Jarvis Final Presentation

Acknowledgements• Ivan Seskar• Roy Yates• Shridatt (James) Sugrim

Page 20: Mini Jarvis Final Presentation

Questions?