27
BIG DATA IN FINANCE Tomasz Bednarz 1

Big Data in Finance, 2012

Embed Size (px)

DESCRIPTION

Presentation for CSIRO Big Data Workshop 2012. October 2013, North Ryde, Australia.

Citation preview

Page 1: Big Data in Finance, 2012

BIG DATA IN FINANCETomasz Bednarz

���1

Page 2: Big Data in Finance, 2012

OVERVIEW OF THIS TALK

���2

• Flavors of big data in finance • Technology map for big data in finance • Wheel of reincarnation • A touch of history • Role of visualisation • Frameworks • Real Time HFT

Page 3: Big Data in Finance, 2012

BIG DATA AGAIN• Big data is not a “crystal ball”

• The value from big data can only be extracted when there is precise business problem to be addressed

• Business problem has to be understood and well defined at the first place

• Business to data mapping • Better business predictions = difficult process

• Analysing big data • Data scientists to examine the data and extract critical information such as customers buying habits - need for experts

• Catalogue data assets (10% of data may actually mean something)

• Privacy • No clear access controls to access data?

Volume

VelocityVarietyBig Data

Page 4: Big Data in Finance, 2012

SURVEY

http://strata.oreilly.com/2012/01/enterprise-big-data-survey-results.html

Page 5: Big Data in Finance, 2012

FLAVORS OF BIG DATA IN FINANCE

• Structured • Market data feeds

• Big and fast • Order flows • Execution info

• Unstructured • News & • Research

Dad Bailey’s news stand in 1939. Comics are on the top left.

���5

Page 6: Big Data in Finance, 2012

TECHNOLOGY MAP FOR DATA

Information Retrieval Analytics

Loosely Structured Information

Highly Structured Information

Human Computer

Market Interface

= high frequency ticks/trading & order flow

= text and relationships

���6

Page 7: Big Data in Finance, 2012

TECHNOLOGY MAP FOR DATA

Information Retrieval Analytics

Loosely Structured Information

Highly Structured Information

Human Computer

Market Interface

= text and relationships

= high frequency ticks/trading & order flow

fast: trading

���7

Page 8: Big Data in Finance, 2012

TECHNOLOGY MAP FOR DATA

Information Retrieval Analytics

Human Computer

Market Interface

= text and relationshipsLoosely Structured Information

Highly Structured Information = high frequency ticks/trading & order flow

slow: investment research, portfolio management

���8

Page 9: Big Data in Finance, 2012

SKILL ZONES - HUMAN VERSUS MACHINE

���9

Time to exploit

information

Data Subtlety & Complexity. Language & Concept Importance.

ms

months

low highpure HPC (HFT)

collaborative hybrid generation

smart market

monitoringalgo

gurus

Visualizing Marathon 2011CSIRO GPU Cluster

Reinvent tech/self every ~three years!!!

Page 10: Big Data in Finance, 2012

WHEEL OF REINCARNATION

���10

T. H. Myer and I. E. Sutherland. On the design of display processors. Communications of ACM, June 1968, 410-414.

Page 11: Big Data in Finance, 2012

STRUCTURED DATA

���11

Page 12: Big Data in Finance, 2012

���12

NYSENew York Stock Exchange started as very low-tech place. In 1779, the NYSE was a bunch of guys standing around a buttonwood tree at 68 Wall Street shouting at each other on days when it didn’t rain or snow.

In 1794 we see the first big technological solution: the roof.

Page 13: Big Data in Finance, 2012

NYSEEverybody moved inside, to the Tontine Coffee House at the corner of Water and Wall streets. Hands, roofs, chalk (oh technology)!

1n 1823, the Difference Engine was invented by Charles Babbage “I wish to

God these calculations had been executed by steam.”

Page 14: Big Data in Finance, 2012

TECHNOLOGICAL INVASION - SIMPLER ERA

���14

Before telegraphy, in 1850s, the sky over Wall Street was open and clear

It took only a short time for telegraphy’s compression of time and space to transform

the scenery. Everybody had to have it!

BUT you had to know Morse Code to participate in the market! (Morse, 1837)

Page 15: Big Data in Finance, 2012

TICKER TAPEA huge success (as important as the roof, hand signals, and the telegraph). People use jumbo magnifying lenses - people traded faster than the machines, so delay meters were installed.

All that ticker tape also made for nice parades. Here group of specialists

celebrating the one-millionth bagging of a buy-side trader.

Page 16: Big Data in Finance, 2012

AND MORE TECHNOLOGYTicker tapes (an enduring market visualisation) are still with us today.

Type 80 Card Sorter IBM 1925

Our CSIRAC 1 operation per second - 1949

Page 17: Big Data in Finance, 2012

ROLE OF VISUALISATION

Dynamic market view: Oculus Info’s Visible Marketplace - portfolio visualisation. !Not very dynamic on paper. !Another reason to be glad we have web browsers.

Page 18: Big Data in Finance, 2012

MAP OF THE MARKET

Map of the Market , the classic big picture of whole market visualization. It was invented by Marten Wattenberg, now at IBM Many Eyes.

Page 19: Big Data in Finance, 2012

Cultural Finance

Finance Cover Letter

Page 20: Big Data in Finance, 2012

UNSTRUCTURED DATA

���20

Page 21: Big Data in Finance, 2012

TECHNOLOGY MAP FOR DATA

Information Retrieval Analytics

Human Computer

Market Interface

= text and relationshipsLoosely Structured Information

Highly Structured Information = high frequency ticks/trading & order flow

slow: investment research, portfolio management

���21

Page 22: Big Data in Finance, 2012

INFORMATION SOURCES• Specialised Industry Media • Local and International Media • Direct Corporate Communications • Research Labs • Government Agencies • Social Media • Crowd Sourcing !

• HTML / Text Feeds / XML

Page 23: Big Data in Finance, 2012

BIG DATA AT NICTA• Developed to deliver customized business intelligence and advanced data mining to the Enterprise marketplace

• Machine Learning to obtain deep insights in real time

• Build upon a scalable open source software platforms

• See http://www.youtube.com/watch?v=TxQPdUt_x3c • Scoobi - a Scala productivity framework for Hadoop (allows you to write

what you want rather than how to do it). • Source code: https://github.com/nicta/scoobi

• Demo of Scoobi tomorrow at the hands-on afternoon session (by Piotr)

http://www.ambiata.com/index.html

Page 24: Big Data in Finance, 2012

���24

http://www.zdnet.com/how-is-big-data-faring-in-the-enterprise-7000002404/

Page 25: Big Data in Finance, 2012

REAL-TIME REAL-TIME REAL-TIME

• Everything is moving to real time

• Everything is moving towards continuous time

• Everything is moving towards mobility (anywhere, anytime)

• Also GPU for high-frequency trading (HFT)

• GPU alongside FPGAs receiving stream of market data to increase the accuracy of the strategy, or even to suggest change in the algorithm used as market conditions evolve

• GPU Direct enables a GPU to talk directly to the FPGA, receiving data from it into a kernel, which may be preloaded to perform its magic once the data is available

• Hadoop + GPUs

Page 26: Big Data in Finance, 2012

ACKNOWLEDGEMENTS

• Andrew Sheppard, Fontainhead

• David Leinweber, The Lawrance Berkley National Lab, author of Nerds on Wall Street NOWS

• Colleagues from Westac and Commonwealth banks

• Craig Mudge for organising Big Data workshop, and always great discussions

Page 27: Big Data in Finance, 2012

���27

THANK YOU