
Deep Learning intro.


2016.01.02.


Outline

Natural Language Processing (NLP)

Representation and Processing

Deep Learning Models


Natural Language Processing


Natural Language Processing (NLP)

[Diagram: NLP overview — language understanding, language generation, and applications]

• Language understanding: word understanding, semantic understanding, intent recognition

• Language generation: question, answer, search, inference, dialogue

• Applications: intelligent robots, information retrieval, machine translation, document summarization

Representation and Processing


Representation in mathematics

[Figure: real-world objects (images) mapped to points in a vector space, e.g. <0.156, 0.421, 0.954, …>]

Image source: https://www.google.com/imghp?hl=ko

Duck vs. rabbit (one image, two interpretations)


Camouflage


Neural Networks in Humans

https://uncyclopedia.kr/wiki/%EB%87%8C

• The brain is a neural network that performs pattern recognition

• It processes input through multiple layers (roughly 10 layers in humans)

• [Figure caption: "I see a lion"]


Neural Network

Vector representation

Pattern of layers

+ Learning


Pattern of layers

Deep learning combines patterns across layers automatically.

Why do we say "deep"?

[Figure: m layers of n units each, fully connected; number of connection links: (n × n) × (m − 1). A quick check follows below.]
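As a sanity check on that count, a minimal Python sketch (the function name and the example sizes are mine, for illustration only):

```python
# Count connection links in a fully connected network with
# m layers of n units each, as on the slide.
def connection_links(n: int, m: int) -> int:
    # Each adjacent pair of layers contributes n x n links,
    # and m layers have m - 1 adjacent pairs.
    return (n * n) * (m - 1)

print(connection_links(n=100, m=10))  # 90000 links
```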


How to use layers?

• Input: a vector

• Output: a real number or a class (vector)

• Vector representation: "one-hot"


Vector representation

[Symbol] Lion

[Text representation] "Lion"

[One-hot representation] <0, 0, 0, 0, 0, 1, 0, 0, 0, 0, …>

[Symbolic representation] <1.45, 75.12, 0.425, 0.953, …>
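A minimal sketch of the one-hot mapping; the vocabulary is hypothetical, with "lion" placed at index 5 to mirror the slide's vector:

```python
import numpy as np

# Hypothetical 10-word vocabulary; "lion" sits at index 5,
# mirroring <0, 0, 0, 0, 0, 1, 0, 0, 0, 0, ...> on the slide.
vocab = ["a", "the", "cat", "dog", "big", "lion", "tiger", "wolf", "mouse", "run"]

def one_hot(word: str) -> np.ndarray:
    vec = np.zeros(len(vocab))
    vec[vocab.index(word)] = 1.0   # a single 1 at the word's index
    return vec

print(one_hot("lion"))  # [0. 0. 0. 0. 0. 1. 0. 0. 0. 0.]
```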


Jung, Deep Learning for Korean NLP


How to map symbols to one-hot vectors

[Symbolic words]  [One-hot]

Lion              <0, 0, 1, 0, 0>

Big cat           <0, 1, 0, 0, 1>

With an AND operation, the two words never match: their vectors share no non-zero dimension (see the sketch below).

∴ We need a symbolic vector representation.
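A minimal sketch of that mismatch, using the slide's two vectors:

```python
import numpy as np

# One-hot (bag-of-words) vectors from the slide.
lion    = np.array([0, 0, 1, 0, 0])
big_cat = np.array([0, 1, 0, 0, 1])

# Element-wise AND (for 0/1 vectors, the dot product does the same):
# no dimension is shared, so the two words never match.
print(np.logical_and(lion, big_cat).any())  # False
print(lion @ big_cat)                       # 0
```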


How to map symbols to one-hot vectors

Lion, Big cat, Tiger, Dog, Wolf, Mouse

[One-hot]

<0, 0, 1, 0, 0>

<0, 1, 0, 0, 1>

∴ [Symbolic representation]

<1.45, 75.12, 0.425, 0.953, …>

<1.78, 61.11, 0.611, 2.011, …>

[Symbolic vectors] (from an NNLM) — compared with cosine similarity, as sketched below
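A minimal sketch of the comparison; the dense vectors are the slide's, truncated to four dimensions (which words they belong to is my assumption):

```python
import numpy as np

def cosine(a: np.ndarray, b: np.ndarray) -> float:
    # 1.0 for identical directions, 0.0 for orthogonal vectors.
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Dense (symbolic) vectors from the slide, e.g. for Lion and Tiger.
lion  = np.array([1.45, 75.12, 0.425, 0.953])
tiger = np.array([1.78, 61.11, 0.611, 2.011])

print(cosine(lion, tiger))  # ~0.999: similar words end up close together

# One-hot vectors of different words always score 0:
print(cosine(np.array([0., 0., 1., 0., 0.]),
             np.array([0., 1., 0., 0., 0.])))  # 0.0
```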


Neural Network Language Model

• Feed-forward NN: a parametric estimator with overall parameter set θ = (C, w)

• One-hot representation: [0 1 0 0 0 0 0 0 0 0]

• Lookup table: word embedding (a row selection, as sketched below)

• Non-linear projection: activation function

• Weight normalization: softmax (length: n)
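The lookup-table step is just a row selection; a minimal sketch (the sizes are toy values I chose):

```python
import numpy as np

rng = np.random.default_rng(0)
V, m = 10, 4                  # vocabulary size, embedding width (toy values)
C = rng.normal(size=(V, m))   # lookup table: one m-dim embedding per word

w = 1                         # word index, as in [0 1 0 0 0 0 0 0 0 0]
one_hot = np.zeros(V)
one_hot[w] = 1.0

# Multiplying the one-hot vector by C selects row w of the table.
assert np.allclose(one_hot @ C, C[w])
```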


Neural Network Language Model

Maximize the log-likelihood:

$L = \max_{\theta} \frac{1}{T} \sum_{t} \log f(w_t, w_{t-1}, \ldots, w_{t-n+1}; \theta)$

Parameters (a forward-pass sketch follows below):

• h: the number of hidden units

• m: the number of features associated with each word

• b: the output biases

• d: the hidden layer biases

• U: the hidden-to-output weights

• W: the input-to-output weights

• H: the input-to-hidden weights

• C: the word features (lookup table)

• θ = (b, d, W, U, H, C)
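A minimal NumPy sketch of the forward pass with these parameters. This parameter list follows Bengio et al.'s (2003) NNLM, whose output is y = b + Wx + U tanh(d + Hx) followed by a softmax; all sizes here are toy values:

```python
import numpy as np

rng = np.random.default_rng(0)
V, m, h, n = 10, 4, 8, 3               # vocab size, features/word, hidden units, n-gram order

C = rng.normal(size=(V, m))            # word features (lookup table)
H = rng.normal(size=(h, (n - 1) * m))  # input-to-hidden weights
d = np.zeros(h)                        # hidden layer biases
U = rng.normal(size=(V, h))            # hidden-to-output weights
W = rng.normal(size=(V, (n - 1) * m))  # input-to-output weights
b = np.zeros(V)                        # output biases

def nnlm_probs(context: list) -> np.ndarray:
    """P(w_t | w_{t-1}, ..., w_{t-n+1}) over the whole vocabulary."""
    x = np.concatenate([C[w] for w in context])  # lookup + concatenate
    y = b + W @ x + U @ np.tanh(d + H @ x)       # non-linear projection
    e = np.exp(y - y.max())
    return e / e.sum()                           # softmax normalization

p = nnlm_probs([3, 7])        # indices of the n - 1 = 2 preceding words
print(p.sum(), p.argmax())    # 1.0 and the most probable next word
```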


NNLM for Korean

Leeck, 딥러닝을 이용한 한국어 의존 구문 분석 (Korean Dependency Parsing Using Deep Learning)


Deep Learning Models


Deep Learning Models

"강대 주변에 스타벅스 위치가 어디야?" ("Where is the Starbucks near Kangwon University?")

• Morphological analysis: 강대/NNG 주변/NNG 에/JX 스타벅스/NNG …

Feed-forward Neural Network (FFNN)

[Figure: an FFNN predicts the label Y for word W_t from a window of surrounding morphemes and POS tags (강대/NNG, 주변/NNG, 에/JX, …); the variants 1-FFNN, 2-FFNN, 3-FFNN differ in depth. A sketch follows below.]
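A minimal sketch of such a window-based FFNN tagger; the sizes and the choice of a 3-word window are my toy assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
m, h, k, win = 4, 8, 5, 3     # embedding width, hidden units, labels, window size

W1 = rng.normal(size=(h, win * m)); b1 = np.zeros(h)   # input-to-hidden
W2 = rng.normal(size=(k, h));       b2 = np.zeros(k)   # hidden-to-output

def ffnn_tag(window_embs: np.ndarray) -> int:
    """Predict a label for the centre word from a window of embeddings."""
    x = window_embs.reshape(-1)     # concatenate the window
    hid = np.tanh(W1 @ x + b1)      # one hidden layer (a "1-FFNN")
    return int((W2 @ hid + b2).argmax())

# e.g. embeddings for the window (강대, 주변, 에):
print(ffnn_tag(rng.normal(size=(win, m))))
```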


Deep Learning Models

"강대 주변에 스타벅스 위치가 어디야?"

• Y_text: [강대 주변에 스타벅스 위치], [어디]

• Y_tags: [B I I I I], [B]

Recurrent Neural Network (RNN)

[Figure: the RNN unfolded over the input words 강대, 주변, …, 스타벅스, 위치, emitting one B/I tag per word. A sketch of the unfolding follows below.]
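A minimal sketch of the unfolded recurrence; the sizes and random weights are toy assumptions (a real tagger would be trained):

```python
import numpy as np

rng = np.random.default_rng(0)
m, h, k = 4, 8, 2                  # input width, hidden units, tags {B, I}
Wx = rng.normal(size=(h, m))       # input-to-hidden
Wh = rng.normal(size=(h, h))       # hidden-to-hidden (the recurrent link)
Wy = rng.normal(size=(k, h))       # hidden-to-output

def rnn_tag(xs: np.ndarray) -> list:
    """Unfold h_t = tanh(Wx x_t + Wh h_{t-1}) over a sentence, tag each word."""
    state, tags = np.zeros(h), []
    for x in xs:                   # one embedding per word: 강대, 주변, ...
        state = np.tanh(Wx @ x + Wh @ state)
        tags.append("BI"[int((Wy @ state).argmax())])
    return tags

print(rnn_tag(rng.normal(size=(5, m))))  # one B/I tag per input word
```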


Deep Learning Models

"강대 주변에 스타벅스 위치가 어디야?"

• Y_text: [강대 주변에 스타벅스 위치], [어디]

• Y_tags: [B I I I I], [B]

Long Short-Term Memory RNN (LSTM-RNN)

• Uses gate matrices (LSTM or GRU); one gated step is sketched below

[Figure: the LSTM-RNN unfolded over 강대, 주변, …, 위치, emitting one B/I tag per word]
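A minimal sketch of one LSTM step with its gates, in the standard formulation; the sizes and weights are toy assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
m, h = 4, 8                           # input width, hidden units
Wg = rng.normal(size=(4 * h, m + h))  # all four gate matrices, stacked
bg = np.zeros(4 * h)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x, hprev, cprev):
    """Input, forget, and output gates control what the cell state keeps."""
    z = Wg @ np.concatenate([x, hprev]) + bg
    i, f, o, g = np.split(z, 4)                        # gate pre-activations
    c = sigmoid(f) * cprev + sigmoid(i) * np.tanh(g)   # gated cell update
    return sigmoid(o) * np.tanh(c), c

hstate, cstate = np.zeros(h), np.zeros(h)
for x in rng.normal(size=(5, m)):     # unfold over a 5-word sentence
    hstate, cstate = lstm_step(x, hstate, cstate)
print(hstate.shape)                   # (8,)
```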


Deep Learning Models

"강대 주변에 스타벅스 위치가 어디야?"

• Y_text: [강대 주변에 스타벅스 위치], [어디]

• Y_tags: [B I I I I], [B]

LSTM-RNN CRF

• Uses gate matrices (LSTM or GRU)

• Decoding with Viterbi or beam search (a Viterbi sketch follows below)

[Figure: LSTM-RNN scores feed a CRF output layer; the best B/I tag sequence is found by Viterbi or beam search]
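A minimal Viterbi sketch over per-word tag scores. In the model above the scores would come from the LSTM-RNN and the CRF transition matrix; here they are random toy values:

```python
import numpy as np

def viterbi(emissions: np.ndarray, transitions: np.ndarray) -> list:
    """Best tag path given per-word scores (T x K) and tag-to-tag scores (K x K)."""
    T, K = emissions.shape
    score, back = emissions[0].copy(), np.zeros((T, K), dtype=int)
    for t in range(1, T):
        # cand[prev, cur]: score of taking tag `cur` coming from tag `prev`
        cand = score[:, None] + transitions + emissions[t]
        back[t] = cand.argmax(axis=0)
        score = cand.max(axis=0)
    path = [int(score.argmax())]
    for t in range(T - 1, 0, -1):      # follow the backpointers
        path.append(int(back[t][path[-1]]))
    return path[::-1]

rng = np.random.default_rng(0)
print(viterbi(rng.normal(size=(5, 2)),   # scores for 5 words, tags {B, I}
              rng.normal(size=(2, 2))))  # CRF transition scores
```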


Deep Learning Models

"강대 주변에 스타벅스 위치가 어디야?"

• Y_text: [강대 주변에 스타벅스 위치], [어디]

• Y_tags: [B I I I I], [B]

Bidirectional LSTM-RNN CRF (Bi-LSTM-RNN CRF)

• Uses gate matrices (LSTM or GRU)

• Decoding with Viterbi or beam search

[Figure: a forward pass and a backward pass over 강대, 주변, …, 위치 are combined per word, then decoded into B/I tags. The combination is sketched below.]
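A minimal sketch of the bidirectional combination; plain tanh recurrences stand in for the LSTM cells sketched earlier, and sizes and weights are toy assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
m, h = 4, 8
Wf, Uf = rng.normal(size=(h, m)), rng.normal(size=(h, h))  # forward weights
Wb, Ub = rng.normal(size=(h, m)), rng.normal(size=(h, h))  # backward weights

def run(xs, W, U):
    state, states = np.zeros(h), []
    for x in xs:
        state = np.tanh(W @ x + U @ state)
        states.append(state)
    return states

xs = rng.normal(size=(5, m))             # embeddings: 강대, 주변, ...
fwd = run(xs, Wf, Uf)                    # left-to-right pass
bwd = run(xs[::-1], Wb, Ub)[::-1]        # right-to-left pass, re-aligned
feats = [np.concatenate(p) for p in zip(fwd, bwd)]
print(feats[0].shape)                    # (16,): left and right context per word
```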


Deep Learning Models

Sequence-to-sequence model (a sketch follows below)

• Two different LSTMs: one for the input sentence, one for the output sentence

• Uses a shallow LSTM

• Reverses the input sentence

• Training: decoding & rescoring
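A minimal encoder-decoder sketch with the reversed input. Plain tanh recurrences stand in for the two LSTMs, and the vocabulary, sizes, and greedy decoding (rather than the slide's rescoring) are my toy assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
m, h, V = 4, 8, 10                                         # embedding, hidden, vocab sizes
We, Ue = rng.normal(size=(h, m)), rng.normal(size=(h, h))  # encoder weights
Wd, Ud = rng.normal(size=(h, m)), rng.normal(size=(h, h))  # separate decoder weights
Wy = rng.normal(size=(V, h))                               # hidden-to-vocabulary
E = rng.normal(size=(V, m))                                # output-side embeddings

def encode(xs):
    state = np.zeros(h)
    for x in xs[::-1]:            # reverse the input sentence, as on the slide
        state = np.tanh(We @ x + Ue @ state)
    return state                  # a single vector summarizing the input

def greedy_decode(state, bos=0, eos=9, max_len=10):
    out, w = [], bos
    for _ in range(max_len):
        state = np.tanh(Wd @ E[w] + Ud @ state)  # feed back the previous word
        w = int((Wy @ state).argmax())
        if w == eos:
            break
        out.append(w)
    return out

print(greedy_decode(encode(rng.normal(size=(5, m)))))  # output word indices
```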


Deep Learning Models

Encoder-Decoder Architecture


Deep Learning Models

Pointer Networks

• A deep learning model based on seq2seq and the attention mechanism

• Its outputs are positions (indices) in the input sequence (the pointing step is sketched below)

• X = {A:0, B:1, C:2, D:3, <EOS>:4}

• Y = {3, 2, 0, 4}

[Figure: the encoder reads A B C D <EOS>; the decoder outputs D C A <EOS> by pointing back at input positions]
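A minimal sketch of the pointing step, following the attention scoring u_j = vᵀ tanh(W1 e_j + W2 d) from Vinyals et al.'s pointer networks; the weights and states are toy random values:

```python
import numpy as np

rng = np.random.default_rng(0)
h = 8
W1, W2 = rng.normal(size=(h, h)), rng.normal(size=(h, h))
v = rng.normal(size=h)

def point(enc_states: np.ndarray, dec_state: np.ndarray) -> int:
    """Attention scores over input positions become the output itself."""
    scores = np.array([v @ np.tanh(W1 @ e + W2 @ dec_state) for e in enc_states])
    return int(scores.argmax())       # an index into the input sequence

enc = rng.normal(size=(5, h))         # encoder states for A, B, C, D, <EOS>
print(point(enc, rng.normal(size=h))) # e.g. 3, i.e. the decoder points at "D"
```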

Deep Learning Models

Siamese Neural Network


References

Jung, Deep Learning for Korean NLP

Lee, 딥러닝을 이용한 한국어 의존 구문 분석 (Korean Dependency Parsing Using Deep Learning)

Park, Pointer Networks for Coreference Resolution

Park, Bi-LSTM-RNN CRF for Mention Detection


QA

Thank you.

박천음, 최수길, 박찬민, 최재혁, 홍다솔

sigma α, Kangwon National University

Email: parkce3@gmail.ac.kr
