Crowdsourcing for Business: An Emerging Paradigm Shourya Roy Area Manager, Human Computation Xerox Research Centre India, Bangalore [email protected]

  • Slide 1
  • Crowdsourcing for Business: An Emerging Paradigm. Shourya Roy, Area Manager, Human Computation, Xerox Research Centre India, Bangalore. [email protected]. Workshop on Social Computing, IIT Kharagpur, 5th Oct, 2012
  • Slide 2
  • Crowdsourcing: What is it? The act of taking a task traditionally performed by an employee or contractor, and outsourcing it to an undefined, generally large group of people, in the form of an open call. Examples: digitization, image labeling, user studies, machine translation evaluation, logo design, EDA simulation, innovation contests, ...
  • Slide 3
  • Services Thrust. Xerox Confidential. Handwriting Recognition Problem*: Many tasks are easy/feasible/doable for humans, but difficult/challenging/impossible for computer programs. Instructions to Crowd: 1. Make progress towards deciphering this handwriting. 2. Put words which you are unsure about in parentheses.
  • Slide 4
  • Examples (1/5)
  • Slide 5
  • Examples (2/5)
  • Slide 6
  • Examples (3/5)
  • Slide 7
  • Examples (4/5)
  • Slide 8
  • It Has Been Existing: Humans were the first "computers", used for math computations. 9 examples of crowdsourcing, before "crowdsourcing" existed: http://bit.ly/mXFdRp
  • Slide 9
  • Internet and Mobile Have Made it More Common and Promising
  • Slide 10
  • Increasing Activities and Popularity: "2M contributors who do more than 4PY of work on an average day!" (attributed to a CEO). Figures: increasing popularity as depicted by Google Trends; crowdsourcing on Google Scholar over the last few years.
  • Slide 11
  • Changing Demographics
  • Slide 12
  • What is the Problem? Given a computational problem, design a solution using human computers and automated computers.
  • Slide 13
  • Why is it Different? Human in the loop (and not guinea pigs). Main doer is human (and not machines, as in assembly lines). Humans are actively computing (not merely carriers of sensors). The outcome is determined by an algorithm (and not the natural dynamics of the crowd).
  • Slide 14
  • Where is Research? Quality Estimation and Assurance (redundancy and voting; gold data; joint estimation of worker quality and task difficulty; symbiosis with machine learning). Complex Tasks (no discrete answer; exploration and exploitation; crowd workflows). Task Design (optimize cost, quality and time; infinite completion time; real time). Incentive and Motivation (payment vs. non-payment; optimal payment; payment and quality). Market Design (reputation mechanism; monitoring and feedback; task discovery). Behavioral Aspects (noisy behaviour; non-reproducible).
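The "redundancy and voting" and "gold data" items under Quality Estimation and Assurance can be illustrated with a small sketch. This is not taken from the talk: the function names, the data layout (a dict keyed by hypothetical worker and task ids), and the choice to weight each vote by accuracy on gold tasks are all assumptions made for illustration. Joint estimation of worker quality and task difficulty, also listed on the slide, is a more involved technique and is not shown here.

```python
from collections import Counter, defaultdict


def worker_accuracy(answers, gold):
    """Estimate each worker's accuracy from tasks with known (gold) answers."""
    correct = defaultdict(int)
    seen = defaultdict(int)
    for (worker, task), label in answers.items():
        if task in gold:
            seen[worker] += 1
            correct[worker] += int(label == gold[task])
    return {w: correct[w] / seen[w] for w in seen}


def majority_vote(answers, accuracy=None, default_weight=0.5):
    """Aggregate redundant labels per task.

    With accuracy=None every vote counts equally; otherwise each vote is
    weighted by the worker's estimated accuracy (an illustrative choice).
    """
    tallies = defaultdict(Counter)
    for (worker, task), label in answers.items():
        weight = 1.0 if accuracy is None else accuracy.get(worker, default_weight)
        tallies[task][label] += weight
    return {task: counts.most_common(1)[0][0] for task, counts in tallies.items()}


# Hypothetical data: three workers label two gold tasks (g1, g2) and one real task (t1).
answers = {
    ("w1", "g1"): "cat", ("w2", "g1"): "dog", ("w3", "g1"): "dog",
    ("w1", "g2"): "dog", ("w2", "g2"): "cat", ("w3", "g2"): "cat",
    ("w1", "t1"): "cat", ("w2", "t1"): "dog", ("w3", "t1"): "dog",
}
gold = {"g1": "cat", "g2": "dog"}

accuracy = worker_accuracy(answers, gold)
plain = majority_vote(answers)               # every vote counts equally
weighted = majority_vote(answers, accuracy)  # votes weighted by gold accuracy
```

In this toy data the two workers who fail both gold tasks outvote the reliable worker on `t1` under plain voting, while the gold-weighted vote follows the reliable worker. That is the motivation for mixing gold tasks into redundant assignments.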
  • Slide 15
  • An Emerging Research Field
  • Slide 16
  • An Interdisciplinary Research Field
  • Slide 17
  • That's Alright, but Xerox!!?
  • Slide 18
  • We have transformed into the world's leading enterprise for Business Process and Document Management. 2009: technology-led, ~25% services; revenue $15.2 billion; market opportunity $132 billion; services leadership in Document Outsourcing. 2011: services-led, ~50% services; revenue ~$23 billion; market opportunity $500 billion+; services leadership in Document Outsourcing, Business Process Outsourcing, and Information Tech Outsourcing.
  • Slide 19
  • Xerox Revenue by Business Segment* * http://www.fastcompany.com/magazine/161/ursula-burns-xerox
  • Slide 20
  • Is Crowdsourcing a Viable Alternative to Outsourcing? Outsourcing is: Focus on the core business while partnering with 3rd-party vendors to tackle the non-core operations. Tasks requiring human intelligence and skills. Data and process migration by smart use of technology. Heavily human-intensive, typically with the help of computing technologies. Large distributed workforce enabled by technology executing tiny pieces of work requiring human intelligence.
  • Slide 21
  • Data Entry by Crowd: We started by considering a typical outsourced process (data entry). The objective is to understand a process in detail and identify implications for crowdsourcing. Digitisation of insurance forms and medical records for US-based insurance companies. Typing in and validation/correction of information from scanned forms. Outsourced, distributed process.
  • Slide 22
  • Features that make the form digitization process amenable to crowdsourcing: Relatively low-skill data entry work, known as "key what you see". Already an outsourced process requiring a low level of interactivity between sequential steps. Strong workflow tool to manage work, which flows through a series of system and human steps: between sites, between sequential tasks, and between agents (given their known skill set).
  • Slide 23
  • Findings from Work-Practice Study (1/2)
  • Slide 24
  • Findings from Work-Practice Study (2/2). Workplace Ecology: Data security is physical, technical, and social. Crowdsourcing: lose physical and social enforcement, reduced control of workforce; need technical solutions. Skills and Knowledge: 1) "Key what you see" data entry actually involves an extensive rule set. 2) Form difficulty is situational. 3) Non-standard means non-standard. Crowdsourcing: situation-based incentives and supporting learning. Being a Corporate Employee: Pay alone is not enough to achieve SLA; agents are made accountable. Crowdsourcing: reduced accountability could increase rejections of difficult work. Making the Workflow Work: Push model of work. Crowdsourcing: pull model of work raises coordination and completion issues. Collaborative Working: Work is not collaborative at workflow level, but it is at claim level (floorwalkers and colleagues). Crowdsourcing: building collaboration in? Pull models of supervision?
  • Slide 25
  • Conclusion: Crowdsourcing is an emerging research area. It requires expertise and research competencies from a number of disciplines. Crowdsourcing can be applied in various domains to solve problems in a more effective manner. Finally, a large fraction of the crowd comes from India; focused research and technologies will be highly relevant.
  • Slide 26
  • References: TurKit: Tools for Iterative Tasks on Mechanical Turk; Greg Little, Lydia B. Chilton, Robert C. Miller, and Max Goldman. Matt Lease, Tutorial. Soylent A cr. Fold.it; S. Cooper et al.