17
APPLYING INFORMATION RETRIEVAL TO TEXT MINING 2011.10.26. Data mining Lab 이이이

APPLYING INFORMATION RETRIEVAL TO TEXT MINING 2011.10.26. Data mining Lab 이아람

Embed Size (px)

Citation preview

Page 1: APPLYING INFORMATION RETRIEVAL TO TEXT MINING 2011.10.26. Data mining Lab 이아람

APPLYING INFOR-MATION RE-TRIEVAL TO TEXT MINING2011.10.26.Data mining Lab이아람

Page 2: APPLYING INFORMATION RETRIEVAL TO TEXT MINING 2011.10.26. Data mining Lab 이아람

IR ( Information retrieval )

Returning relevant texts for query

A measure of similarity is computed be-

tween the query and each document

The similarity scores

The vector space model

Page 3: APPLYING INFORMATION RETRIEVAL TO TEXT MINING 2011.10.26. Data mining Lab 이아람

Counting Letters

Page 4: APPLYING INFORMATION RETRIEVAL TO TEXT MINING 2011.10.26. Data mining Lab 이아람

Counting Letters

Page 5: APPLYING INFORMATION RETRIEVAL TO TEXT MINING 2011.10.26. Data mining Lab 이아람

Counting Letters

Page 6: APPLYING INFORMATION RETRIEVAL TO TEXT MINING 2011.10.26. Data mining Lab 이아람

Counting words

Page 7: APPLYING INFORMATION RETRIEVAL TO TEXT MINING 2011.10.26. Data mining Lab 이아람

Counting words

Page 8: APPLYING INFORMATION RETRIEVAL TO TEXT MINING 2011.10.26. Data mining Lab 이아람

Counting Pronouns Occurring

Page 9: APPLYING INFORMATION RETRIEVAL TO TEXT MINING 2011.10.26. Data mining Lab 이아람

Counting Pronouns Occurring

he shehim herhis herhis hershimselfherself

Page 10: APPLYING INFORMATION RETRIEVAL TO TEXT MINING 2011.10.26. Data mining Lab 이아람

TEXT COUNT AND VECTOR

Page 11: APPLYING INFORMATION RETRIEVAL TO TEXT MINING 2011.10.26. Data mining Lab 이아람

Vectors and Angles

두 Text 를 비교하기 위해 Angle 이용 Vector 를 이용하여 Angle 을 구한다 . Angle 값이 0 에 가까울 수록 두 Text 는

유사함

Page 12: APPLYING INFORMATION RETRIEVAL TO TEXT MINING 2011.10.26. Data mining Lab 이아람

Vectors and Angles

Inner product Dot product

Page 13: APPLYING INFORMATION RETRIEVAL TO TEXT MINING 2011.10.26. Data mining Lab 이아람

Vectors and Angles

Vector length =

Page 14: APPLYING INFORMATION RETRIEVAL TO TEXT MINING 2011.10.26. Data mining Lab 이아람

Computing Angles

Page 15: APPLYING INFORMATION RETRIEVAL TO TEXT MINING 2011.10.26. Data mining Lab 이아람

Computing Angles

Page 16: APPLYING INFORMATION RETRIEVAL TO TEXT MINING 2011.10.26. Data mining Lab 이아람

Computing Angles

Page 17: APPLYING INFORMATION RETRIEVAL TO TEXT MINING 2011.10.26. Data mining Lab 이아람

Computing Angles

cosθ = 0.89503

Angle of 0.46230 radians, which about 26.5º