11
texasmulticore.com Sample Customer Use Cases & Results February 2017 Customer quote: “SequenceL is Matlab on Steroids”

TMT SequenceL customer use cases and results

Embed Size (px)

Citation preview

Page 1: TMT SequenceL customer use cases and results

texasmulticore.com

Sample Customer Use Cases & Results

February 2017

Customer quote:

“SequenceL is Matlab on Steroids”

Page 2: TMT SequenceL customer use cases and results

Customer Example: Predict Financial Market Using Neural Network Users Learn SequenceL and Implement Significant AI Application in <2 weeks

Challenge:

─ Predict financial market with retrospective data using artificial neural network (AI) algorithms

─ Speed is key for big data

─ Goal: Limit time for analysis to 1 minute

Solution

─ Implement neural network algorithms in SequenceL to utilize full potential of user’s computer

Results

─ Customer went from zero-to-proficient in <2 weeks, with no outside assistance needed

─ Downloaded SequenceL, learned it via online tutorial, and wrote significant application

─ Complex algorithms moved to SequenceL easily

─ Faster analysis of the data for end users

─ More effective use of hardware; uses all CPU cores

─ Scalability/portability on any hardware platform

2

Retrospective financial data from 08/18/2006 to 11/29/2016

© 2017 Texas Multicore Technologies, Inc.

All Rights Reserved

0

20

40

60

80

100

120

140

1 2 3 4

Tim

e (

sec)

Threads

Page 3: TMT SequenceL customer use cases and results

Customer Example: Seismic RTM for Oil & Gas Achieves >2x Speedup on Existing Platform with <50% Code + Portability

Challenge:

─ ION Geophysical has one of the best and most optimized RTM codes in the oil & gas industry

─ ION RTM C/C++ and MPI was given to Acceleware to optimize with CUDA for GPUs and it ran slower

─ Goal: “Our customers don’t pay us to make the code run faster, they pay us for the geoscience. Our team

should be focused on the geoscience, not low-level SIMD instructions.”

─ Goal: Achieve platform portability + optimization to run our RTM on other platforms

Solution

─ TMT refactored computationally-intensive

portions of existing C/C++ and MPI code

into SequenceL

~10,000 lines of 40,000 LOC program

Results

─ >2x speedup on existing platforms

─ Platform dependencies removed

─ 5x speedup on POWER8 platform

─ <50% as many lines of code, much more

readable and supportable

3 © 2017 Texas Multicore Technologies, Inc.

All Rights Reserved

“Matlab on steroids”

Page 4: TMT SequenceL customer use cases and results

Customer Example: CFD (Computational Fluid Dynamics) SwRI Achieves 17.8x 26x Speedup with 25% Less Code

Challenge:

─ Researchers already often have to wait weeks for these fluid flow simulation runs

─ This slows the pace of innovation and makes it impractical to further enhance models for higher

accuracy (e.g.- irregular shaped particles)

─ Goal: reduce the runtime of in-house Lattice Boltzmann method-based CFD simulation by a factor of 10

Solution

─ SwRI reformulated this existing Fortran+OpenMP code into SequenceL

Results

─ “SequenceL implementation was 26x faster for most relevant benchmark“ (70 particles)

─ “Runs that previously took 2 weeks now completed overnight”

─ “The SequenceL compiler created a parallel executable with no burden on the programmer and is

provably race-free”

─ “SequenceL version was 25% shorter (LOC) and closely resembles the mathematical equations”

4 © 2017 Texas Multicore Technologies, Inc.

All Rights Reserved

2 weeks

Monday

Tuesday

Wed

Thurs

Friday

Sat

Sunday

Monday

Tuesday

Wed

Thurs

Friday

Sat

Sunday

Dramatically reduced time to discovery; now practical to enhance model 12 hours!

Page 5: TMT SequenceL customer use cases and results

Customer Example: Industrial Control Networking (WirelessHART, IEC 62591, IEEE 802.15.4)

Challenge

─ WirelessHART was too complex and slow to process at scale

Generating Downlink graphs for 200 nodes would take 5 hours and 20 minutes

Generating Uplink, Downlink, and Broadcast network graphs for a plant

or oil field with a 1,000 nodes would take a solid month to process

─ Goal: Generate graphs for a 200 node network in <1 minute

Solution

─ Emerson Research proposed a new Downlink algorithm (SRDR) in a whitepaper

─ Had TMT implement in SequenceL to achieve multicore acceleration

Results

─ SequenceL finished by TMT in 3 weeks, ran 10x faster

10X faster performance and right the first time

Generated graphs in 9.3 seconds

─ Inventors took 5 months in Java, never got performance

Initially had errors and 10x slower

Used SequenceL to debug Java, get perf. insights (still 3x slower)

5

Fast & robust code, faster time to market

© 2017 Texas Multicore Technologies, Inc.

All Rights Reserved

Page 6: TMT SequenceL customer use cases and results

Customer Example: Video Processing Using SequenceL

Proprietary algorithms remove air turbulence, radiated heat, etc.

Goal: 30Hz (fps) to keep up with real time input video feed

Best performance (8 core x86 platform) ─ 58 Hz: SequenceL

─ 21 Hz: Matlab (Interpreter)

─ 1.2 Hz: Matlab (Coder/C-out)

Input video feed Processed video output

© 2017 Texas Multicore Technologies, Inc.

All Rights Reserved 6

Page 7: TMT SequenceL customer use cases and results

Customer Example: Video Processing Using SequenceL

Proprietary algorithms remove air turbulence, radiated heat, etc.

Goal: 30Hz (fps) to keep up with real time input video feed

Best performance (8 core x86 platform) ─ 58 Hz: SequenceL

─ 21 Hz: Matlab (Interpreter)

─ 1.2 Hz: Matlab (Coder/C-out)

Input video feed Processed video output

© 2017 Texas Multicore Technologies, Inc.

All Rights Reserved 7

Page 8: TMT SequenceL customer use cases and results

Customer Example: Video Processing Using SequenceL

Proprietary algorithms remove air turbulence, radiated heat, etc.

Goal: 30Hz (fps) to keep up with real time input video feed

Best performance (8 core x86 platform) ─ 58 Hz: SequenceL

─ 21 Hz: Matlab (Interpreter)

─ 1.2 Hz: Matlab (Coder/C-out)

Input video feed Processed video output

© 2017 Texas Multicore Technologies, Inc.

All Rights Reserved 8

Page 9: TMT SequenceL customer use cases and results

Customer Example: Video Processing Using SequenceL

Proprietary algorithms remove air turbulence, radiated heat, etc.

Goal: 30Hz (fps) to keep up with real time input video feed

Best performance (8 core x86 platform) ─ 58 Hz: SequenceL

─ 21 Hz: Matlab (Interpreter)

─ 1.2 Hz: Matlab (Coder/C-out)

Input video feed Processed video output

© 2017 Texas Multicore Technologies, Inc.

All Rights Reserved 9

SequenceL also won on code readability

Page 10: TMT SequenceL customer use cases and results

SequenceL Changes the Game

© 2017 Texas Multicore Technologies, Inc.

All Rights Reserved 10

Faster Performance Uses all cores, GPUs

10X Faster Time to Market

Prototyping tool = Production tool

Get it Right the First Time Far fewer lines of code; Algorithms in

SequenceL often match their definition

Portability + Optimization True application portability with

high performance

Built on Open Industry Standards Integrates with existing tools,

languages, & methodologies

Fast ROI No need for expensive “parallel ninjas”

nor the time they add to the schedule

Accelerates time to innovation/discovery

Page 11: TMT SequenceL customer use cases and results

SequenceL Changes the Game

© 2017 Texas Multicore Technologies, Inc.

All Rights Reserved 11

Faster Performance Uses all cores, GPUs

10X Faster Time to Market

Prototyping tool = Production tool

Get it Right the First Time Far fewer lines of code; Algorithms in

SequenceL often match their definition

Portability + Optimization True application portability with

high performance

Built on Open Industry Standards Integrates with existing tools,

languages, & methodologies

Fast ROI No need for expensive “parallel ninjas”

nor the time they add to the schedule

Business Agility: TTM, new platforms, Cloud