17
Chat Bot topic summary Team FICC Tech macro

Data sience coffe

Embed Size (px)

Citation preview

Page 1: Data sience coffe

Chat Bot topic summary

TeamFICC Tech macro

Page 2: Data sience coffe

Self introduction

• Noboru Kano– 2016 new grad

• Interesting topicsNLP(Natural language processing), Statistics, ML(Machine learning)

• Experience1year part time on a start up company as a ML engineer(Did 3 NLP projects)

• HobbyHandball, Chinese food, drink party(sometimes), 2ch

Page 3: Data sience coffe

Agenda

• What is Bot ?• Chat bot history • type of chat bot

algorithm and demo• case study

Page 4: Data sience coffe

What is Bot ?

• A computer program that simulates human conversation, or chat, through artificial intelligence.(From wiki)

Page 5: Data sience coffe

Why Chat Bot ?

• This year, chat bot has attracted a great deal of public attention.

Page 6: Data sience coffe

History of Chat bot

• ELIZA “doctor”(1966)An early example of primitive NLP chat bot

• A simulation of a psychotherapist. On your Emacs

Page 7: Data sience coffe

• you can find free source about ELIZA in internet

Page 8: Data sience coffe

Type of Chat bot

I bought a computer

I like the computer

Hello

Hello I’m kanono !

1. If-then-eles Type

• If the words in dictionaries, bot can respond to you.• Accuracy depends on the volume of the dictionary

example : ELIZA

Page 9: Data sience coffe

Type of Chat bot2. Use dialogue dataStore dialogue data into DB and response a similar message.

• The response would be a human-like message.• Accuracy depends on the volume of the dialogue

Are you free now?

sorry I’m really busy

That too bad...

What happened?

Are:1, you:1, free:1, now:1

dialogue DB

ex : パン太一郎

Page 10: Data sience coffe

Type of Chat bot3. Generate model• generate sentence with statistical method.• calculate the next words appearance ratio.

which dessert do you like the best ?

Yogurt is dessert

I really like frozen yogurt

I watched “Frozen” last night

• Can use variety of phrase and expression• Difficult to control context in message

Dialogue corpus

Generate Model

Page 11: Data sience coffe

Algorithm for Generate model

1. Markov chainI am John.I am kanono.I do not like English

I

do

am

not

0.66

0.33

John

kanono

0.5

0.5

・・・

Strong in make a short sentence.

not good at generating long story.with large scale corpus data

Page 12: Data sience coffe

Demo

Page 13: Data sience coffe

What I made?

Input: ImageOut put: generate a sentence related to the image

Hi ! I’m going to take a flight to NY training, see you soon!

Page 14: Data sience coffe

技術解説(画像分類部分)

アルゴリズム1 位 aircraft( 飛行機 )2 位 plane ...

上位 10 クラスのスコアを出力→ 日本語に翻訳

ラベル

Page 15: Data sience coffe

技術解説(ツイート生成部分)1 位 aircraft (飛行機)

ラベル

ラベルに該当するツイートをたくさん取得♡

・・・

テキストを自動生成(自作)今日は飛行機にのるよー!名古屋みんな待っててね!

ヒミツの自作アルゴリズム♥

Page 16: Data sience coffe

Algorithm for Generate model

2. RNN(Reccurent Neural Network) ex.) Allo, りんな

RNN for semantic analysis RNN for generate response

Page 17: Data sience coffe

Case study

• Check my Qiita page• http://qiita.com/kanottyan/items/

2783bf91c8ea6a8a4ce8